DiffuCoder-7B-Instruct
The DiffuCoder-7B-Instruct model builds on the DiffuCoder-7B-Base checkpoint with instruction-tuning to better follow code-related prompts.
Training recipe: with a newly introduced pad token, we train this model with fixed length conditionally on OpenCoder-SFT data for 5 epochs.
Benchmarks: Demonstrates stronger instruction-following capabilities than the Base model.
More details and usage examples:
Acknowledgement
To power this HuggingFace model release, we reuse Dream's modeling architecture and generation utils.
- Downloads last month
- 4
Inference Providers
NEW
This model isn't deployed by any Inference Provider.
๐
Ask for provider support