Safetensors
qwen2
pat-jj commited on
Commit
951599a
·
verified ·
1 Parent(s): 0d1c0d7

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +25 -0
README.md ADDED
@@ -0,0 +1,25 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ ---
4
+ # DeepRetrieval
5
+ ## Overview
6
+
7
+ DeepRetrieval is a novel approach that uses reinforcement learning (RL) to train Large Language Models (LLMs) for query generation without requiring supervised data. Instead of relying on expensive human-annotated or distilled reference queries, DeepRetrieval enables LLMs to learn through direct trial and error, using retrieval metrics as rewards.
8
+ ## Key Features
9
+
10
+ - **No Supervision Required**: Eliminates the need for expensive human-annotated or distilled reference queries
11
+ - **RL-Based Framework**: Uses reinforcement learning to optimize query generation directly for retrieval performance
12
+ - **State-of-the-Art Performance**: Achieves remarkable results across diverse retrieval tasks
13
+
14
+ Please view our [GitHub page](https://github.com/pat-jj/DeepRetrieval) for instructions.
15
+
16
+ [DeepRetrieval Paper](arxiv.org/abs/2503.00223)
17
+ ```
18
+ @article{jiang2025deepretrievalhackingrealsearch,
19
+ title={DeepRetrieval: Hacking Real Search Engines and Retrievers with Large Language Models via Reinforcement Learning},
20
+ author={Pengcheng Jiang and Jiacheng Lin and Lang Cao and Runchu Tian and SeongKu Kang and Zifeng Wang and Jimeng Sun and Jiawei Han},
21
+ year={2025},
22
+ journal = {arXiv preprint arXiv: 2503.00223},
23
+ url={https://arxiv.org/abs/2503.00223}
24
+ }
25
+ ```