Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
Replicate
Novita
Nebius AI Studio
Featherless AI
Nscale
Fireworks
Hyperbolic
fal
Cohere
Together AI
Cerebras
SambaNova
HF Inference API
Misc
Reset Misc
alignment-handbook
Inference Endpoints
text-generation-inference
4-bit precision
custom_code
8-bit precision
Eval Results
Merge
Misc with no match
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
4,553
Full-text search
Edit filters
Sort: Trending
Active filters:
alignment-handbook
Clear all
ShenaoZ/0.001_ablation_4iters_bs256_nodpo_iter_2
Text Generation
•
Updated
Apr 23, 2024
•
16
ShenaoZ/0.01_ablation_4iters_bs256_iter_3
Text Generation
•
Updated
Apr 23, 2024
•
16
ShenaoZ/0.001_ablation_4iters_bs256_decalpha_iter_3
Text Generation
•
Updated
Apr 23, 2024
•
16
ShenaoZ/0.001_ablation_4iters_bs256_declr_iter_3
Text Generation
•
Updated
Apr 23, 2024
•
15
ShenaoZ/0.01_ablation_4iters_bs256_nodpo_iter_1
Text Generation
•
Updated
Apr 23, 2024
•
16
ShenaoZ/0.001_ablation_4iters_bs256_nodpo_iter_3
Text Generation
•
Updated
Apr 23, 2024
•
18
ShenaoZ/0.01_ablation_4iters_bs256_iter_4
Text Generation
•
Updated
Apr 23, 2024
•
16
ShenaoZ/0.001_ablation_4iters_bs256_decalpha_iter_4
Text Generation
•
Updated
Apr 23, 2024
•
16
ShenaoZ/0.001_ablation_4iters_bs256_declr_iter_4
Text Generation
•
Updated
Apr 23, 2024
•
21
ShenaoZ/0.01_ablation_4iters_bs256_nodpo_iter_2
Text Generation
•
Updated
Apr 23, 2024
•
19
ShenaoZ/0.001_ablation_4iters_bs256_nodpo_iter_4
Text Generation
•
Updated
Apr 24, 2024
•
24
AmberYifan/safe-spin-iter1
Text Generation
•
Updated
Apr 23, 2024
•
16
ShenaoZ/0.001_ablation_4iters_bs256_sample2_iter_1
Text Generation
•
Updated
Apr 23, 2024
•
18
martimfasantos/tinyllama-1.1b-chat-dpo-qlora
Updated
Apr 24, 2024
•
3
ShenaoZ/0.01_ablation_5iters_bs256_nodpo_iter_1
Text Generation
•
Updated
Apr 23, 2024
•
20
ShenaoZ/0.001_ablation_5iters_bs256_nodpo_iter_1
Text Generation
•
Updated
Apr 23, 2024
•
20
chansung/coding_llamaduo_60k
Updated
Apr 24, 2024
•
9
ShenaoZ/0.01_ablation_4iters_bs256_nodpo_iter_3
Text Generation
•
Updated
Apr 24, 2024
•
16
ShenaoZ/0.001_ablation_5iters_bs256_nodpo_iter_2
Text Generation
•
Updated
Apr 24, 2024
•
22
ShenaoZ/0.01_ablation_5iters_bs256_nodpo_iter_2
Text Generation
•
Updated
Apr 24, 2024
•
24
chansung/coding_llamaduo_60k_v0.2
Updated
Apr 24, 2024
•
20
ShenaoZ/0.001_ablation_4iters_bs256_sample2_iter_2
Text Generation
•
Updated
Apr 24, 2024
•
21
ShenaoZ/0.001_ablation_5iters_bs256_nodpo_iter_3
Text Generation
•
Updated
Apr 24, 2024
•
14
ShenaoZ/0.01_ablation_5iters_bs256_nodpo_iter_3
Text Generation
•
Updated
Apr 24, 2024
•
16
ShenaoZ/0.001_ablation_4iters_bs256_nodpo_sample2_iter_1
Text Generation
•
Updated
Apr 24, 2024
•
16
DUAL-GPO-2/phi-2-gpo-renew2-b0.001-extra-v2-i1
Updated
Apr 24, 2024
•
54
ShenaoZ/0.001_ablation_3iters_bs256_nodpo_iter_1
Text Generation
•
Updated
Apr 24, 2024
•
16
ShenaoZ/0.01_ablation_3iters_bs256_nodpo_iter_1
Text Generation
•
Updated
Apr 24, 2024
•
20
ShenaoZ/0.001_ablation_5iters_bs256_nodpo_iter_4
Text Generation
•
Updated
Apr 24, 2024
•
17
ShenaoZ/0.001_ablation_4iters_bs256_sample2_iter_3
Text Generation
•
Updated
Apr 24, 2024
•
23
Previous
1
...
17
18
19
20
21
...
100
Next