view article Article TimeScope: How Long Can Your Video Large Multimodal Model Go? By orrzohar and 3 others • 16 days ago • 32
view article Article How to generate text: using different decoding methods for language generation with Transformers By patrickvonplaten • Mar 1, 2020 • 231
view post Post 1776 I was curious about the Block Diffusion hybrid model and tried retraining it on a DNA tokenizer + dataset 🧬. Too early to evaluate, but it generates sequences (AAATGG TTATTG CAAATC...) and was improving on the validation set during trainingModel: monsoon-nlp/dna-blockdiff-papayaOriginal paper: Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models (2503.09573) See translation 🔥 8 8 + Reply