-
How to Train Data-Efficient LLMs
Paper • 2402.09668 • Published • 43 -
Adapting Large Language Models via Reading Comprehension
Paper • 2309.09530 • Published • 80 -
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Paper • 2403.03507 • Published • 189 -
MathScale: Scaling Instruction Tuning for Mathematical Reasoning
Paper • 2403.02884 • Published • 17
peng
superpeng
·
AI & ML interests
None yet
Recent Activity
liked
a dataset
about 12 hours ago
Intelligent-Internet/II-Medical-Reasoning-SFT
upvoted
a
collection
about 12 hours ago
II-Medical
liked
a dataset
about 12 hours ago
lavita/medical-eval-sphere
Organizations
None yet