DiscoveryBench Evaluation Utils
- eval_w_subhypo_gen.py: Implements the DiscoveryBench logic for evaluating agent-generated hypotheses.
- lm_utils.py: Provides utility functions used by the evaluation process.
- openai_helpers.py: Includes helper functions for OpenAI-related tasks.
- openai_semantic_gen_prompts.py: Contains prompts used for semantic generation.
- response_parser.py: Parses agent-generated hypotheses.