victor-wu (victor wu)

upvoted a paper 2 months ago

RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling

Paper • 2506.08672 • Published Jun 10 • 31

upvoted 2 papers 3 months ago

Discrete Markov Bridge

Paper • 2505.19752 • Published May 26 • 17

Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space

Paper • 2505.13308 • Published May 19 • 27

upvoted an article 5 months ago

Article

Open R1: Update #3

By

and 9 others •

Mar 11

• 295

upvoted a paper 5 months ago

From Hours to Minutes: Lossless Acceleration of Ultra Long Sequence Generation up to 100K Tokens

Paper • 2502.18890 • Published Feb 26 • 30

upvoted 3 articles 6 months ago

Article

What is test-time compute and how to scale it?

By

and 1 other •

Feb 6

• 100

Article

Open-source DeepResearch – Freeing our search agents

By

and 4 others •

Feb 4

• 1.28k

Article

Open R1: Update #2

By

and 6 others •

Feb 10

• 217

upvoted an article 11 months ago

Article

A failed experiment: Infini-Attention, and why we should keep trying?

By

and 2 others •

Aug 14, 2024

• 69

victor wu

AI & ML interests

Organizations

RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling

Discrete Markov Bridge

Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space

Open R1: Update #3

From Hours to Minutes: Lossless Acceleration of Ultra Long Sequence Generation up to 100K Tokens

What is test-time compute and how to scale it?

Open-source DeepResearch – Freeing our search agents

Open R1: Update #2

A failed experiment: Infini-Attention, and why we should keep trying?

victor wu

AI & ML interests

Organizations

victor-wu's activity

Open R1: Update #3

What is test-time compute and how to scale it?

Open-source DeepResearch – Freeing our search agents

Open R1: Update #2

A failed experiment: Infini-Attention, and why we should keep trying?