wangrui

varuy322

AI & ML interests

None yet

Recent Activity

liked a model about 23 hours ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B

liked a model 1 day ago

HuggingFaceTB/SmolLM2-135M-Instruct

liked a dataset 3 days ago

Jackrong/GPT-OSS-20B-Distilled-Reasoning-Mini

Organizations

None yet

upvoted a collection 9 days ago

agent

Collection

168 items • Updated about 3 hours ago • 10

upvoted 3 papers 9 days ago

Large Language Model Agent: A Survey on Methodology, Applications and Challenges

Paper • 2503.21460 • Published Mar 27 • 79

Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving

Paper • 2507.06229 • Published Jul 8 • 72

Cognitive Kernel-Pro: A Framework for Deep Research Agents and Agent Foundation Models Training

Paper • 2508.00414 • Published 13 days ago • 84

upvoted 2 papers 15 days ago

A Survey of Context Engineering for Large Language Models

Paper • 2507.13334 • Published 28 days ago • 239

Agentic Reinforced Policy Optimization

Paper • 2507.19849 • Published 19 days ago • 138

upvoted a paper about 1 month ago

Pre-Trained Policy Discriminators are General Reward Models

Paper • 2507.05197 • Published Jul 7 • 39

upvoted a collection about 2 months ago

🔥LLM 2025

Collection

6 items • Updated 17 days ago • 2

upvoted 2 papers 2 months ago

Expanding RL with Verifiable Rewards Across Diverse Domains

Paper • 2503.23829 • Published Mar 31 • 24

WebThinker: Empowering Large Reasoning Models with Deep Research Capability

Paper • 2504.21776 • Published Apr 30 • 59

upvoted an article 3 months ago

Article

Tiny Agents in Python: a MCP-powered agent in ~70 lines of code

and 3 others • May 23 • 155

upvoted a paper 3 months ago

Seed1.5-VL Technical Report

Paper • 2505.07062 • Published May 11 • 149

upvoted a collection 3 months ago

Llama Nemotron

Collection

Open, Production-ready Enterprise Models • 11 items • Updated 14 days ago • 64

upvoted 2 papers 4 months ago

Sleep-time Compute: Beyond Inference Scaling at Test-time

Paper • 2504.13171 • Published Apr 17 • 15

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 280

upvoted 4 collections 4 months ago

upvoted a paper 5 months ago

DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing

Paper • 2111.09543 • Published Nov 18, 2021 • 3
