File size: 976 Bytes
9e275d2 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 |
---
pipeline_tag: text-generation
inference: true
license: apache-2.0
datasets:
- simplescaling/s1K-1.1
base_model:
- Qwen/Qwen2.5-0.5B-Instruct
library_name: transformers
language:
- ar
- de
- en
- es
- fr
- it
- ja
- ko
- pt
- ru
- th
- vi
- zh
---
# Model Summary
> s1.1-0.5B is a sucessor of [s1](https://huggingface.co/2stacks/s1-0.5B) with better reasoning performance by leveraging reasoning traces from r1 instead of Gemini. This model was created simply to test the process used to train the original s1.1 cited below using consumer grade GPUs.
- **Logs:** https://wandb.ai/2stacks-sms/s1/runs/ishervdt?nw=nwuser2stacks
- **Repository:** [simplescaling/s1](https://github.com/simplescaling/s1)
- **Paper:** https://arxiv.org/abs/2501.19393
Thanks to [Ryan Marten](https://huggingface.co/ryanmarten) for helping generate r1 traces for s1K.
# Use
The model usage is documented [here](https://github.com/simplescaling/s1?tab=readme-ov-file#inference). |