Nicolas Chauville's picture

2 2

Nicolas Chauville

chocho

·

AI & ML interests

None yet

Organizations

upvoted an article 6 months ago

Article

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

By

•

Feb 11

• 58

upvoted an article 7 months ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

By

and 2 others •

Jan 28

• 877