luckeciano/Qwen-2.5-7B-RL-LACPO-NoBaselineNoKLNoEntropyNoSmooth Text Generation • 8B • Updated May 9 • 3
luckeciano/Qwen-2.5-7B-RL-LACPO-NoBaselineNoKLNoEntropy0.5NoSmooth Text Generation • 8B • Updated Apr 30 • 3
luckeciano/Qwen-2.5-7B-RL-LACPO-NoBaselineNoKLNoEntropy0.5Smooth10 Text Generation • 8B • Updated May 22 • 3
luckeciano/Qwen-2.5-7B-RL-LACPO-BaselineNoKLNoEntropy0.1Smooth10 Text Generation • 8B • Updated May 2 • 3
s-a-malik/Qwen-2.5-7B-Embedding-Entropy-0.45-Missing-Response Text Generation • 8B • Updated May 10 • 3
luckeciano/Qwen-2.5-7B-RL-LACPO-BaselineNoKLNoEntropyNoSmoothSoftLabel Text Generation • 8B • Updated May 14 • 103
luckeciano/Qwen-2.5-7B-RL-LACPO-BaselineNoKLNoEntropyNoSmoothVF0.1 Text Generation • 8B • Updated May 15 • 3