DiffuCoder-7B-Instruct

The DiffuCoder-7B-Instruct model builds on the DiffuCoder-7B-Base checkpoint with instruction-tuning to better follow code-related prompts.

  • Training recipe: with a newly introduced pad token, we train this model with fixed length conditionally on OpenCoder-SFT data for 5 epochs.

  • Benchmarks: Demonstrates stronger instruction-following capabilities than the Base model.

More details and usage examples:

Acknowledgement

To power this HuggingFace model release, we reuse Dream's modeling architecture and generation utils.

Downloads last month
4
Safetensors
Model size
7.62B params
Tensor type
BF16
ยท
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for apple/DiffuCoder-7B-Instruct

Base model

Qwen/Qwen2.5-7B
Finetuned
(1)
this model
Finetunes
1 model