diffullama-gsm / README.md
Sansa's picture
Update README.md
86aca7b verified
---
base_model:
- diffusionfamily/diffullama
library_name: peft
---
# Model Card for Model ID
<!-- Provide a quick summary of what the model is/does. -->
DiffuLLaMA LoRA tuned on GSM8K-symbolic dataset.
## Model description
Details and model loading can be seen [https://github.com/HKUNLP/DiffuLLaMA](https://github.com/HKUNLP/DiffuLLaMA).
### Framework versions
- Transformers 4.44.2
- Pytorch 2.1.1+cu121
- Datasets 2.21.0
- Tokenizers 0.19.1
- PEFT 0.12.0
```
@misc{gong2024scalingdiffusionlanguagemodels,
title={Scaling Diffusion Language Models via Adaptation from Autoregressive Models},
author={Shansan Gong and Shivam Agarwal and Yizhe Zhang and Jiacheng Ye and Lin Zheng and Mukai Li and Chenxin An and Peilin Zhao and Wei Bi and Jiawei Han and Hao Peng and Lingpeng Kong},
year={2024},
eprint={2410.17891},
archivePrefix={arXiv},
primaryClass={cs.CL},
url={https://arxiv.org/abs/2410.17891},
}
```