Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Posts
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
alignment-handbook 's Collections
Handbook v0.1 models and datasets
DPO vs KTO vs IPO
Constitutional AI

Handbook v0.1 models and datasets

updated Nov 10, 2023

Models and datasets for v0.1 of the alignment handbook

Upvote
24

  • alignment-handbook/zephyr-7b-sft-full

    Text Generation • Updated Jan 10, 2024 • 8.37k • • 25

  • alignment-handbook/zephyr-7b-sft-qlora

    Updated Jan 9, 2024 • 567 • 8

  • alignment-handbook/zephyr-7b-dpo-full

    Text Generation • Updated Jan 10, 2024 • 87 • 3

  • alignment-handbook/zephyr-7b-dpo-qlora

    Updated Jan 9, 2024 • 49 • 9

  • HuggingFaceH4/ultrachat_200k

    Viewer • Updated Oct 16, 2024 • 515k • 15.2k • 533

  • HuggingFaceH4/ultrafeedback_binarized

    Viewer • Updated Oct 16, 2024 • 187k • 8.09k • 292
Upvote
24
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs