Yongliang Shen

tricktreat

tricktreat

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

upvoted a paper 9 days ago

Online Experiential Learning for Language Models

upvoted a paper 11 days ago

Code-A1: Adversarial Evolving of Code LLM and Test LLM via Reinforcement Learning

View all activity

Organizations

upvoted a paper 1 day ago

Why Does Self-Distillation (Sometimes) Degrade the Reasoning Capability of LLMs?

Paper • 2603.24472 • Published 2 days ago • 37

upvoted a paper 9 days ago

Online Experiential Learning for Language Models

Paper • 2603.16856 • Published 10 days ago • 56

upvoted a paper 11 days ago

Code-A1: Adversarial Evolving of Code LLM and Test LLM via Reinforcement Learning

Paper • 2603.15611 • Published 11 days ago • 10

upvoted a paper 18 days ago

LoGeR: Long-Context Geometric Reconstruction with Hybrid Memory

Paper • 2603.03269 • Published 24 days ago • 61

upvoted a paper 20 days ago

DreamWorld: Unified World Modeling in Video Generation

Paper • 2603.00466 • Published 28 days ago • 16

upvoted a paper 22 days ago

Heterogeneous Agent Collaborative Reinforcement Learning

Paper • 2603.02604 • Published 25 days ago • 189

upvoted a paper 29 days ago

OmniGAIA: Towards Native Omni-Modal AI Agents

Paper • 2602.22897 • Published 30 days ago • 53

liked a Space about 1 month ago

EasySteer Demo

🚗

Generate steered LLM responses with custom tone vectors

upvoted 3 papers about 1 month ago

Internalizing Meta-Experience into Memory for Guided Reinforcement Learning in Large Language Models

Paper • 2602.10224 • Published Feb 10 • 19

SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning

Paper • 2602.08234 • Published Feb 9 • 72

SCALE: Self-uncertainty Conditioned Adaptive Looking and Execution for Vision-Language-Action Models

Paper • 2602.04208 • Published Feb 4 • 19

upvoted 9 papers about 2 months ago

InftyThink+: Effective and Efficient Infinite-Horizon Reasoning via Reinforcement Learning

Paper • 2602.06960 • Published Feb 6 • 14

MemGUI-Bench: Benchmarking Memory of Mobile GUI Agents in Dynamic Environments

Paper • 2602.06075 • Published Feb 3 • 13

AOrchestra: Automating Sub-Agent Creation for Agentic Orchestration

Paper • 2602.03786 • Published Feb 3 • 89

SWE-Master: Unleashing the Potential of Software Engineering Agents via Post-Training

Paper • 2602.03411 • Published Feb 3 • 38

Learning Query-Specific Rubrics from Human Preferences for DeepResearch Report Generation

Paper • 2602.03619 • Published Feb 3 • 27

Everything in Its Place: Benchmarking Spatial Intelligence of Text-to-Image Models

Paper • 2601.20354 • Published Jan 28 • 112

Yongliang Shen

AI & ML interests

Recent Activity

Organizations

tricktreat's activity

EasySteer Demo