DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
Paper • 2401.02954 • Published • 52

Perspectives on the State and Future of Deep Learning - 2023
Paper • 2312.09323 • Published • 8

Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization
Paper • 2405.15071 • Published • 42

Sibyl: Simple yet Effective Agent Framework for Complex Real-world Reasoning
Paper • 2407.10718 • Published • 19

LAB-Bench: Measuring Capabilities of Language Models for Biology Research
Paper • 2407.10362 • Published • 6

SciCode: A Research Coding Benchmark Curated by Scientists
Paper • 2407.13168 • Published • 17

MindSearch: Mimicking Human Minds Elicits Deep AI Searcher
Paper • 2407.20183 • Published • 43

Building and better understanding vision-language models: insights and future directions
Paper • 2408.12637 • Published • 133

Promptriever: Instruction-Trained Retrievers Can Be Prompted Like Language Models
Paper • 2409.11136 • Published • 22

Recursive Language Models
Paper • 2512.24601 • Published • 79

DeepResearchEval: An Automated Framework for Deep Research Task Construction and Agentic Evaluation
Paper • 2601.09688 • Published • 126

MAXS: Meta-Adaptive Exploration with LLM Agents
Paper • 2601.09259 • Published • 95