Faster Video Diffusion with Trainable Sparse Attention Paper • 2505.13389 • Published May 19 • 37
Compression, Transduction, and Creation: A Unified Framework for Evaluating Natural Language Generation Paper • 2109.06379 • Published Sep 14, 2021
Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs Paper • 2406.20098 • Published Jun 28, 2024
O1 Replication Journey: A Strategic Progress Report -- Part 1 Paper • 2410.18982 • Published Oct 8, 2024 • 3
Jais and Jais-chat: Arabic-Centric Foundation and Instruction-Tuned Open Generative Large Language Models Paper • 2308.16149 • Published Aug 30, 2023 • 28
O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson? Paper • 2411.16489 • Published Nov 25, 2024 • 49
Running 116 116 TxT360: Trillion Extracted Text 📖 Create a large-scale deduplicated text dataset for LLM training