Fuli Luo

luofuli

·

AI & ML interests

None yet

Recent Activity

authored a paper 7 days ago

MOPD: Multi-Teacher On-Policy Distillation for Capability Integration in LLM Post-Training

authored a paper 6 months ago

MiMo-V2-Flash Technical Report

authored a paper 8 months ago

Stabilizing MoE Reinforcement Learning by Aligning Training and Inference Routers

View all activity

Organizations

None yet

authored a paper 7 days ago

MOPD: Multi-Teacher On-Policy Distillation for Capability Integration in LLM Post-Training

Paper • 2606.30406 • Published 10 days ago • 15

authored a paper 6 months ago

MiMo-V2-Flash Technical Report

Paper • 2601.02780 • Published Jan 6 • 40

authored a paper 8 months ago

Stabilizing MoE Reinforcement Learning by Aligning Training and Inference Routers

Paper • 2510.11370 • Published Oct 13, 2025 • 4

authored a paper over 1 year ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22, 2025 • 455

liked 2 models over 1 year ago

deepseek-ai/DeepSeek-V3-Base

685B • Updated Mar 27, 2025 • 14.7k • 1.7k

mistralai/Mamba-Codestral-7B-v0.1

7B • Updated Jul 24, 2025 • 43.5k • 615

updated a collection almost 2 years ago

DeepSeek-V2.5

2 items • Updated Nov 27, 2025 • 48

upvoted a paper almost 2 years ago

DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

Paper • 2406.11931 • Published Jun 17, 2024 • 71

updated a collection almost 2 years ago

DeepSeek-Prover

DeepSeek-Prover-Series • 10 items • Updated Nov 27, 2025 • 66