Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
rakeshchow202 's Collections
Financial
RL-Llm

RL-Llm

updated Feb 27
Upvote
-

  • Kimi k1.5: Scaling Reinforcement Learning with LLMs

    Paper • 2501.12599 • Published Jan 22 • 123

  • Teaching Language Models to Critique via Reinforcement Learning

    Paper • 2502.03492 • Published Feb 5 • 24

  • NatureLM: Deciphering the Language of Nature for Scientific Discovery

    Paper • 2502.07527 • Published Feb 11 • 20

  • MetaChain: A Fully-Automated and Zero-Code Framework for LLM Agents

    Paper • 2502.05957 • Published Feb 9 • 16

  • DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

    Paper • 2501.12948 • Published Jan 22 • 416

  • openai/gsm8k

    Viewer • Updated Jan 4, 2024 • 17.6k • 344k • 831
Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs