Behavior Knowledge Merge in Reinforced Agentic Models Paper • 2601.13572 • Published 15 days ago • 23
Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting Paper • 2601.02151 • Published 29 days ago • 105
π_RL: Online RL Fine-tuning for Flow-based Vision-Language-Action Models Paper • 2510.25889 • Published Oct 29, 2025 • 66
SongBloom: Coherent Song Generation via Interleaved Autoregressive Sketching and Diffusion Refinement Paper • 2506.07634 • Published Jun 9, 2025 • 6
Reasoning Vectors: Transferring Chain-of-Thought Capabilities via Task Arithmetic Paper • 2509.01363 • Published Sep 1, 2025 • 59
view article Article Exploring Name Diversity in Modern LLMs: A Grimdark Trilogy Experiment Sep 27, 2024 • 11
Q4_0_4_8-GGUF Collection GGUF dedicated for ARM i8mm devices / Snapdragon X Elite / Snapdragon 8 Gen 1,2,3 • 27 items • Updated Sep 21, 2024 • 2