Article LeRobot Community Datasets: The “ImageNet” of Robotics — When and How? 3 days ago • 44
TorchTitan: One-stop PyTorch native solution for production ready LLM pre-training Paper • 2410.06511 • Published Oct 9, 2024 • 2
Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model Paper • 2408.11039 • Published Aug 20, 2024 • 62
Bamba Collection Collection of Bamba models, based on a hybrid Mamba2 architecture and trained on open data • 9 items • Updated 15 days ago • 23
70% Size, 100% Accuracy: Lossless LLM Compression for Efficient GPU Inference via Dynamic-Length Float Paper • 2504.11651 • Published 28 days ago • 28
Byte Latent Transformer: Patches Scale Better Than Tokens Paper • 2412.09871 • Published Dec 13, 2024 • 103
RealHarm: A Collection of Real-World Language Model Application Failures Paper • 2504.10277 • Published 29 days ago • 11