Edward Beeching PRO

edbeeching

HuggingFaceH4

https://edbeeching.github.io/

Recent Activity

updated a model about 3 hours ago

edbeeching/Qwen3-4B-GKD

published a model about 4 hours ago

edbeeching/Qwen3-4B-GKD

updated a model about 5 hours ago

edbeeching/Qwen3-4B-GKD-push

upvoted an article 16 days ago

Mixture of Experts (MoEs) in Transformers

Article • 20 days ago • 135

upvoted an article 8 months ago

SmolLM3: smol, multilingual, long-context reasoner

Article • Published Jul 8, 2025 • 765

upvoted a paper 12 months ago

Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning

Paper • 2503.07572 • Published Mar 10, 2025 • 48

upvoted an article about 1 year ago

Open R1: Update #3

Article • Published Mar 11, 2025 • 297

upvoted an article over 1 year ago

How NuminaMath Won the 1st AIMO Progress Prize

Article • Published Jul 11, 2024 • 127

upvoted a paper about 2 years ago

A General Theoretical Paradigm to Understand Learning from Human Preferences

Paper • 2310.12036 • Published Oct 18, 2023 • 19

upvoted a collection over 2 years ago

Reward models on the hub

UNMAINTAINED: See RewardBench... A place to collect reward models, an often not released artifact of RLHF. • 18 items • Updated Apr 13, 2024 • 25

upvoted 2 papers over 2 years ago

Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2

Paper • 2311.10702 • Published Nov 17, 2023 • 19

Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 123

upvoted a paper almost 3 years ago

The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only

Paper • 2306.01116 • Published Jun 1, 2023 • 43