Zhuoran Yang Research Group

university

https://zhuoranyang.github.io/

Activity Feed

AI & ML interests

Reinforcement Learning, Language Models, Diffusion Models

Recent Activity

zhuoranyang updated a Space 7 days ago

y-agent/modular-addition-feature-learning

JLiangHe submitted a paper 8 days ago

On the Mechanism and Dynamics of Modular Addition: Fourier Features, Lottery Ticket, and Grokking

JLiangHe authored a paper 8 days ago

Sample-efficient Learning of Infinite-horizon Average-reward MDPs with General Function Approximation

View all activity

Papers

On the Mechanism and Dynamics of Modular Addition: Fourier Features, Lottery Ticket, and Grokking

View all Papers

zhuoranyang

updated a Space 7 days ago

Modular Addition Feature Learning

🔢

Explore modular addition learning visualizations

JLiangHe

submitted a paper to Daily Papers 8 days ago

On the Mechanism and Dynamics of Modular Addition: Fourier Features, Lottery Ticket, and Grokking

Paper • 2602.16849 • Published 9 days ago • 6

JLiangHe

authored 6 papers 8 days ago

Sample-efficient Learning of Infinite-horizon Average-reward MDPs with General Function Approximation

Paper • 2404.12648 • Published Apr 19, 2024

In-Context Linear Regression Demystified: Training Dynamics and Mechanistic Interpretability of Multi-Head Softmax Attention

Paper • 2503.12734 • Published Mar 17, 2025

Llama-3.1-FoundationAI-SecurityLLM-8B-Instruct Technical Report

Paper • 2508.01059 • Published Aug 1, 2025 • 34

Llama-3.1-FoundationAI-SecurityLLM-Reasoning-8B Technical Report

Paper • 2601.21051 • Published about 1 month ago • 14

On the Mechanism and Dynamics of Modular Addition: Fourier Features, Lottery Ticket, and Grokking

Paper • 2602.16849 • Published 9 days ago • 6

From Words to Actions: Unveiling the Theoretical Underpinnings of LLM-Driven Autonomous Systems

Paper • 2405.19883 • Published May 30, 2024

zhuoranyang

published a Space 10 days ago

Modular Addition Feature Learning

🔢

Explore modular addition learning visualizations

zhuoranyang

submitted a paper to Daily Papers 29 days ago

Llama-3.1-FoundationAI-SecurityLLM-Reasoning-8B Technical Report

Paper • 2601.21051 • Published about 1 month ago • 14

Fengzhuo

submitted a paper to Daily Papers about 1 month ago

Demystifying the Slash Pattern in Attention: The Role of RoPE

Paper • 2601.08297 • Published Jan 13 • 4

zhuoranyang

authored 9 papers 4 months ago

A Theoretical Analysis of Deep Q-Learning

Paper • 1901.00137 • Published Jan 1, 2019

Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning

Paper • 2305.18459 • Published May 29, 2023

Symmetric Mean-field Langevin Dynamics for Distributional Minimax Problems

Paper • 2312.01127 • Published Dec 2, 2023

In-Context Linear Regression Demystified: Training Dynamics and Mechanistic Interpretability of Multi-Head Softmax Attention

Paper • 2503.12734 • Published Mar 17, 2025

Taming Polysemanticity in LLMs: Provable Feature Recovery via Sparse Autoencoders

Paper • 2506.14002 • Published Jun 16, 2025 • 5

On Computation and Generalization of Generative Adversarial Imitation Learning

Paper • 2001.02792 • Published Jan 9, 2020 • 1

Muon Outperforms Adam in Tail-End Associative Memory Learning

Paper • 2509.26030 • Published Sep 30, 2025 • 20

Unlocking Out-of-Distribution Generalization in Transformers via Recursive Latent Space Reasoning

Paper • 2510.14095 • Published Oct 15, 2025 • 6

Build Your Personalized Research Group: A Multiagent Framework for Continual and Interactive Science Automation

Paper • 2510.15624 • Published Oct 17, 2025 • 15

AI & ML interests

Recent Activity

Papers

Team members 6

y-agent's activity

Modular Addition Feature Learning

Modular Addition Feature Learning