Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
Nebius AI Studio
Novita
Fireworks
Replicate
Hyperbolic
Cohere
Together AI
fal
SambaNova
Cerebras
Nscale
HF Inference API
Misc
Reset Misc
alignment-handbook
Inference Endpoints
text-generation-inference
4-bit precision
custom_code
8-bit precision
Eval Results
Merge
Misc with no match
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
4,514
Full-text search
Edit filters
Sort: Trending
Active filters:
alignment-handbook
Clear all
DUAL-GPO/phi-2-gpo-test-longest-iter-random-1
Updated
Mar 27, 2024
•
4
DUAL-GPO/phi-2-gpo-test-longest-iter-random1-0
Updated
Mar 27, 2024
DUAL-GPO/phi-2-gpo-test-longest-iter-random1-1
Updated
Mar 27, 2024
•
1
DUAL-GPO/phi-2-gpo-test-longest-iter-random2-0
Updated
Mar 27, 2024
DUAL-GPO/phi-2-gpo-test-longest-iter-random2-1
Updated
Mar 27, 2024
•
1
DUAL-GPO/phi-2-gpo-test-longest-iter-random2-2
Updated
Mar 27, 2024
DUAL-GPO/phi-2-gpo-test-longest-iter-random2-3
Updated
Mar 27, 2024
alvarobartt/mistral-7b-orpo-alignment-handbook
Text Generation
•
Updated
Mar 27, 2024
•
14
DUAL-GPO/phi-2-gpo-test-longest-iter-random2-4
Updated
Mar 27, 2024
•
2
DUAL-GPO/phi-2-dpo-test-iter-0
Updated
Mar 28, 2024
kykim0/gemma-2b-ultrachat-sft
Text Generation
•
Updated
Mar 28, 2024
•
12
•
1
alvarobartt/mistral-7b-orpo-airoboros-pref-10k
Text Generation
•
Updated
Mar 28, 2024
•
12
kykim0/gemma-7b-ultrachat-sft
Text Generation
•
Updated
Mar 29, 2024
•
17
shineil/zephyr-7b-gemma-dpo
Text Generation
•
Updated
Mar 29, 2024
•
11
mradermacher/mistral-7b-orpo-capybara-reproduction-GGUF
Updated
May 6, 2024
•
236
EllieS/zephyr-7b-sft-lora-timedial
Updated
Mar 29, 2024
EllieS/zephyr-7b-dpo-lora-timedial
Updated
Mar 29, 2024
Minbyul/selfbiorag-7b-dpo-full-wo-healthsearch_qa-ep3
Text Generation
•
Updated
Apr 9, 2024
•
15
DUAL-GPO/zephyr-7b-gpo-iter1
Updated
Mar 29, 2024
DUAL-GPO-2/phi-2-ipo-test-iter-0
Updated
Mar 30, 2024
jetmoe/jetmoe-8b-sft
Text Generation
•
Updated
Apr 15, 2024
•
19
•
6
jetmoe/jetmoe-8b-chat
Text Generation
•
Updated
May 11, 2024
•
50
•
29
pkarypis/gpt2-sft-port
Text Generation
•
Updated
Apr 25, 2024
•
18
DUAL-GPO/zephyr-7b-gpo-iter2
Updated
Apr 1, 2024
nthakur/mistral-7b-instruct-v0.2-dpo-multilingual-mix-1st-apr-final
Updated
Apr 2, 2024
•
2
Shamane/mistral-instruct-v2-sec-cpt-qlora
Updated
Apr 2, 2024
•
4
Serega6678/My_script_50pct_LLM_pretraining
Updated
Apr 4, 2024
•
2
objects76/zephyr-7b-dpo-qlora
Updated
Apr 5, 2024
•
1
Serega6678/prototype_joint_trained
Updated
Apr 4, 2024
•
2
DUAL-GPO/zephyr-7b-ipo-qlora-v0
Updated
Apr 6, 2024
•
1
Previous
1
...
8
9
10
11
12
...
100
Next