view article Article LLMGameHub: How We Won the Gradio Agents & MCP HackathonΒ 2025 By kikikita and 1 other β’ 17 days ago β’ 16
Rethinking Verification for LLM Code Generation: From Generation to Testing Paper β’ 2507.06920 β’ Published Jul 9 β’ 28
Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test Paper β’ 2506.21551 β’ Published Jun 26 β’ 28
ComfyUI-Copilot: An Intelligent Assistant for Automated Workflow Development Paper β’ 2506.05010 β’ Published Jun 5 β’ 76
ComfyUI-R1: Exploring Reasoning Models for Workflow Generation Paper β’ 2506.09790 β’ Published Jun 11 β’ 51
Leveraging Self-Attention for Input-Dependent Soft Prompting in LLMs Paper β’ 2506.05629 β’ Published Jun 5 β’ 35
General-Reasoner: Advancing LLM Reasoning Across All Domains Paper β’ 2505.14652 β’ Published May 20 β’ 23
REASONING GYM: Reasoning Environments for Reinforcement Learning with Verifiable Rewards Paper β’ 2505.24760 β’ Published May 30 β’ 69
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient Robotics Paper β’ 2506.01844 β’ Published Jun 2 β’ 127
Taming LLMs by Scaling Learning Rates with Gradient Grouping Paper β’ 2506.01049 β’ Published Jun 1 β’ 38
ARIA: Training Language Agents with Intention-Driven Reward Aggregation Paper β’ 2506.00539 β’ Published May 31 β’ 30
Satori-SWE: Evolutionary Test-Time Scaling for Sample-Efficient Software Engineering Paper β’ 2505.23604 β’ Published May 29 β’ 24
Tool-Star: Empowering LLM-Brained Multi-Tool Reasoner via Reinforcement Learning Paper β’ 2505.16410 β’ Published May 22 β’ 57
LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning Paper β’ 2505.16933 β’ Published May 22 β’ 33
view article Article How to Build an MCP Server with Gradio By abidlabs and 1 other β’ Apr 30 β’ 189