Reparameterized Absorbing Discrete Diffusion (RADD) small model, trained with the lambda-dce loss for 400k iterations.

Code: https://github.com/ML-GSAI/RADD. 

Paper: https://arxiv.org/abs/2406.03736.