
Edward Beeching

edbeeching
https://edbeeching.github.io/
  • edbeeching

AI & ML interests

None yet

Organizations

  • Hugging Face
  • HuggingFaceBR4
  • trl internal testing
  • Jack of All Trades project
  • HuggingFaceM4
  • Simulation Environments Tests and Builds
  • TRL
  • BigCode
  • Hugging Face H4
  • ShapeNet
  • 🤗 H4 Community
  • Explorer of Simulate alpha
  • BigCode Data
  • Hugging Face H4 Community
  • Hugging Face Smol Cluster
  • Open LLM Leaderboard
  • H4-colab
  • HuggingFaceH4-colab
  • H4 Alignment Handbook
  • Project-Numina
  • Godot RL Agents
  • Data Agents
  • nltpt
  • Reliable Agents
  • Hugging Face Science
  • HF CMU Collab
  • Open R1

edbeeching's activity

upvoted a paper about 2 months ago

Optimizing Test-Time Compute via Meta Reinforcement Fine-Tuning

Paper • 2503.07572 • Published Mar 10 • 44
upvoted an article 10 months ago
Article

How NuminaMath Won the 1st AIMO Progress Prize

Jul 11, 2024
• 120
upvoted a paper over 1 year ago

A General Theoretical Paradigm to Understand Learning from Human Preferences

Paper • 2310.12036 • Published Oct 18, 2023 • 14
upvoted a collection over 1 year ago

Reward models on the hub

Collection
UNMAINTAINED: See RewardBench... A place to collect reward models, an often not released artifact of RLHF. • 18 items • Updated Apr 13, 2024 • 25
upvoted 2 papers over 1 year ago

Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2

Paper • 2311.10702 • Published Nov 17, 2023 • 20

Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 122
upvoted a paper almost 2 years ago

The RefinedWeb Dataset for Falcon LLM: Outperforming Curated Corpora with Web Data, and Web Data Only

Paper • 2306.01116 • Published Jun 1, 2023 • 34