How I Trained Action Chunking Transformer (ACT) on SO-101: My Journey, Gotchas, and Lessons By sherryxychen • 14 days ago • 29
Model statistics of the 50 most downloaded entities on Hugging Face By lbourdois • about 6 hours ago • 9
Ring-flash-linear-2.0: A Highly Efficient Hybrid Architecture for Test-Time Scaling By RichardBian and 8 others • 4 days ago • 8
High-Quality Datasets for Far-Field ASR (Treble Technologies x Hugging Face) By treble-technologies and 4 others • about 6 hours ago • 8
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 233
Reactive Transformer (RxT): Fixing the Memory Problem in Conversational AI By AdamF92 • 5 days ago • 5
How I Trained Action Chunking Transformer (ACT) on SO-101: My Journey, Gotchas, and Lessons By sherryxychen • 14 days ago • 29
Model statistics of the 50 most downloaded entities on Hugging Face By lbourdois • about 6 hours ago • 9
Ring-flash-linear-2.0: A Highly Efficient Hybrid Architecture for Test-Time Scaling By RichardBian and 8 others • 4 days ago • 8
High-Quality Datasets for Far-Field ASR (Treble Technologies x Hugging Face) By treble-technologies and 4 others • about 6 hours ago • 8
DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 233
Reactive Transformer (RxT): Fixing the Memory Problem in Conversational AI By AdamF92 • 5 days ago • 5