diffusionfamily/diffullama-gsm

	---
	base_model:
	- diffusionfamily/diffullama
	library_name: peft
	---

	# Model Card for diffullama-gsm

	A LoRA adapter for DiffuLLaMA, fine-tuned on the GSM8K-symbolic dataset.

	## Model description

	Further details and model-loading instructions are available at [https://github.com/HKUNLP/DiffuLLaMA](https://github.com/HKUNLP/DiffuLLaMA).
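For orientation, the snippet below is a minimal sketch of the generic PEFT pattern for attaching a LoRA adapter to its base checkpoint. It is not taken from the DiffuLLaMA repository: the helper name `load_with_adapter` is hypothetical, and it assumes the base checkpoint can be loaded through `AutoModelForCausalLM`. The linked repository remains the authoritative source for the model class and the diffusion sampling code, which this sketch does not cover.

```python
BASE_ID = "diffusionfamily/diffullama"         # from the card's base_model field
ADAPTER_ID = "diffusionfamily/diffullama-gsm"  # this repository


def load_with_adapter(base_id: str = BASE_ID, adapter_id: str = ADAPTER_ID):
    """Attach the LoRA adapter to the base checkpoint (generic PEFT pattern)."""
    # Imported lazily so the heavy dependencies are only needed when called.
    from transformers import AutoModelForCausalLM, AutoTokenizer
    from peft import PeftModel

    tokenizer = AutoTokenizer.from_pretrained(base_id)
    # Assumption: the base checkpoint loads through AutoModelForCausalLM.
    base = AutoModelForCausalLM.from_pretrained(base_id, torch_dtype="auto")
    # Wrap the base model with the LoRA weights stored in the adapter repo.
    model = PeftModel.from_pretrained(base, adapter_id)
    return tokenizer, model.eval()
```

Calling `load_with_adapter()` downloads both repositories from the Hub, so it needs network access and enough memory for the base model.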


	### Framework versions

	- Transformers 4.44.2
	- PyTorch 2.1.1+cu121
	- Datasets 2.21.0
	- Tokenizers 0.19.1
	- PEFT 0.12.0
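To compare a local environment against the versions listed above, a small check along these lines may help. It is a sketch only: the PyPI distribution names (for example `torch` for PyTorch) are assumptions, and the helper names `installed_version` and `find_mismatches` are hypothetical. A different build of PyTorch (CPU-only, another CUDA version) will show up as a mismatch even when it works fine.

```python
from importlib.metadata import PackageNotFoundError, version

# Versions listed in this card, keyed by assumed PyPI distribution name.
EXPECTED = {
    "transformers": "4.44.2",
    "torch": "2.1.1+cu121",
    "datasets": "2.21.0",
    "tokenizers": "0.19.1",
    "peft": "0.12.0",
}


def installed_version(package: str):
    """Return the installed version string, or None if the package is missing."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


def find_mismatches(expected=EXPECTED, lookup=installed_version):
    """Map each package whose installed version differs to (expected, found)."""
    mismatches = {}
    for package, wanted in expected.items():
        found = lookup(package)
        if found != wanted:
            mismatches[package] = (wanted, found)
    return mismatches


# Report every package that differs from the card, including missing ones.
for package, (wanted, found) in find_mismatches().items():
    print(f"{package}: card lists {wanted}, found {found or 'not installed'}")
```

An exact match is not necessarily required; the check only flags differences from the environment recorded in this card.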

	## Citation

	```bibtex
	@misc{gong2024scalingdiffusionlanguagemodels,
	  title={Scaling Diffusion Language Models via Adaptation from Autoregressive Models},
	  author={Shansan Gong and Shivam Agarwal and Yizhe Zhang and Jiacheng Ye and Lin Zheng and Mukai Li and Chenxin An and Peilin Zhao and Wei Bi and Jiawei Han and Hao Peng and Lingpeng Kong},
	  year={2024},
	  eprint={2410.17891},
	  archivePrefix={arXiv},
	  primaryClass={cs.CL},
	  url={https://arxiv.org/abs/2410.17891},
	}
	```