On the Mechanism and Dynamics of Modular Addition: Fourier Features, Lottery Ticket, and Grokking Paper • 2602.16849 • Published 9 days ago • 6
Sample-efficient Learning of Infinite-horizon Average-reward MDPs with General Function Approximation Paper • 2404.12648 • Published Apr 19, 2024
In-Context Linear Regression Demystified: Training Dynamics and Mechanistic Interpretability of Multi-Head Softmax Attention Paper • 2503.12734 • Published Mar 17, 2025
Llama-3.1-FoundationAI-SecurityLLM-8B-Instruct Technical Report Paper • 2508.01059 • Published Aug 1, 2025 • 34
Llama-3.1-FoundationAI-SecurityLLM-Reasoning-8B Technical Report Paper • 2601.21051 • Published about 1 month ago • 14
On the Mechanism and Dynamics of Modular Addition: Fourier Features, Lottery Ticket, and Grokking Paper • 2602.16849 • Published 9 days ago • 6
From Words to Actions: Unveiling the Theoretical Underpinnings of LLM-Driven Autonomous Systems Paper • 2405.19883 • Published May 30, 2024
Llama-3.1-FoundationAI-SecurityLLM-Reasoning-8B Technical Report Paper • 2601.21051 • Published about 1 month ago • 14
Demystifying the Slash Pattern in Attention: The Role of RoPE Paper • 2601.08297 • Published Jan 13 • 4
Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning Paper • 2305.18459 • Published May 29, 2023
Symmetric Mean-field Langevin Dynamics for Distributional Minimax Problems Paper • 2312.01127 • Published Dec 2, 2023
In-Context Linear Regression Demystified: Training Dynamics and Mechanistic Interpretability of Multi-Head Softmax Attention Paper • 2503.12734 • Published Mar 17, 2025
Taming Polysemanticity in LLMs: Provable Feature Recovery via Sparse Autoencoders Paper • 2506.14002 • Published Jun 16, 2025 • 5
On Computation and Generalization of Generative Adversarial Imitation Learning Paper • 2001.02792 • Published Jan 9, 2020 • 1
Muon Outperforms Adam in Tail-End Associative Memory Learning Paper • 2509.26030 • Published Sep 30, 2025 • 20
Unlocking Out-of-Distribution Generalization in Transformers via Recursive Latent Space Reasoning Paper • 2510.14095 • Published Oct 15, 2025 • 6
Build Your Personalized Research Group: A Multiagent Framework for Continual and Interactive Science Automation Paper • 2510.15624 • Published Oct 17, 2025 • 15