BytedTsinghua-SIA
/

RL-MemoryAgent-7B

Model card Files Files and versions

RL-MemoryAgent-7B / README.md

huiyeruzhou's picture

Update README.md

e7dc7cb verified 11 months ago

|

history blame contribute delete

1.48 kB

	---
	license: apache-2.0
	base_model:
	- Qwen/Qwen2.5-7B-Instruct
	---

	## Model Description

	The RL-MemAgent-7B is a part of the MemAgent framework, which enables Large Language Models (LLMs) to process arbitrarily long texts through end-to-end Reinforcement Learning without altering their core architecture.



	## Usage

	This model is ideal for tasks requiring the understanding and processing of very long documents, such as comprehensive question answering, summarizing extensive reports, or analyzing large codebases.

	For detailed instructions on how to use, evaluate, and train models within the MemAgent framework, please refer to the main [MemAgent GitHub repository](https://github.com/BytedTsinghua-SIA/MemAgent).


	## Links

	* Paper: [https://arxiv.org/abs/2507.02259](https://arxiv.org/abs/2507.02259)
	* Blog: [https://memagent-sialab.github.io/](https://memagent-sialab.github.io/)
	* GitHub: [https://github.com/BytedTsinghua-SIA/MemAgent](https://github.com/BytedTsinghua-SIA/MemAgent)

	## Citation

	If you find this work useful, please consider citing our paper:

	```bibtex
	@article{yu2025memagent,
	title={MemAgent: Reshaping Long-Context LLM with Multi-Conv RL-based Memory Agent},
	author={Yu, Hongli and Chen, Tinghong and Feng, Jiangtao and Chen, Jiangjie and Dai, Weinan and Yu, Qiying and Zhang, Ya-Qin and Ma, Wei-Ying and Liu, Jingjing and Wang, Mingxuan and others},
	journal={arXiv preprint arXiv:2507.02259},
	year={2025}
	}
	```