Dawei Li

wjldw

https://david-li0406.github.io/

AI & ML interests

LLM, NLP, Data Mining

Recent Activity

updated a model about 3 hours ago

wjldw/ToolPRM-Base-v4

published a model about 4 hours ago

wjldw/ToolPRM-Base-v4

updated a model about 4 hours ago

wjldw/ToolPRM-CoT-v4

upvoted 2 papers 23 days ago

On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models

Paper • 2512.07783 • Published 26 days ago • 36

Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning

Paper • 2512.07461 • Published 26 days ago • 74

upvoted a paper 29 days ago

Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction

Paper • 2512.04987 • Published 30 days ago • 76

upvoted a paper about 1 month ago

VisPlay: Self-Evolving Vision-Language Models from Images

Paper • 2511.15661 • Published Nov 19, 2025 • 42

upvoted a paper 2 months ago

Generalizing Test-time Compute-optimal Scaling as an Optimizable Graph

Paper • 2511.00086 • Published Oct 29, 2025 • 41

upvoted 2 papers 3 months ago

VLA-RFT: Vision-Language-Action Reinforcement Fine-tuning with Verified Rewards in World Simulators

Paper • 2510.00406 • Published Oct 1, 2025 • 65

Who's Your Judge? On the Detectability of LLM-Generated Judgments

Paper • 2509.25154 • Published Sep 29, 2025 • 29

upvoted 2 papers 4 months ago

Self-Rewarding Vision-Language Model via Reasoning Decomposition

Paper • 2508.19652 • Published Aug 27, 2025 • 84

MMTok: Multimodal Coverage Maximization for Efficient Inference of VLMs

Paper • 2508.18264 • Published Aug 25, 2025 • 25

upvoted 11 papers 5 months ago

Speech-to-LaTeX: New Models and Datasets for Converting Spoken Equations and Sentences

Paper • 2508.03542 • Published Aug 5, 2025 • 4

When Good Sounds Go Adversarial: Jailbreaking Audio-Language Models with Benign Inputs

Paper • 2508.03365 • Published Aug 5, 2025 • 4

TextQuests: How Good are LLMs at Text-Based Video Games?

Paper • 2507.23701 • Published Jul 31, 2025 • 2

Fact2Fiction: Targeted Poisoning Attack to Agentic Fact-checking System

Paper • 2508.06059 • Published Aug 8, 2025 • 4

Deep Ignorance: Filtering Pretraining Data Builds Tamper-Resistant Safeguards into Open-Weight LLMs

Paper • 2508.06601 • Published Aug 8, 2025 • 6

Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents

Paper • 2508.05954 • Published Aug 8, 2025 • 6

GLiClass: Generalist Lightweight Model for Sequence Classification Tasks

Paper • 2508.07662 • Published Aug 11, 2025 • 9

Compressing Chain-of-Thought in LLMs via Step Entropy

Paper • 2508.03346 • Published Aug 5, 2025 • 8

Shortcut Learning in Generalist Robot Policies: The Role of Dataset Diversity and Fragmentation

Paper • 2508.06426 • Published Aug 8, 2025 • 10

VisR-Bench: An Empirical Study on Visual Retrieval-Augmented Generation for Multilingual Long Document Understanding

Paper • 2508.07493 • Published Aug 10, 2025 • 8

MoBE: Mixture-of-Basis-Experts for Compressing MoE-based LLMs

Paper • 2508.05257 • Published Aug 7, 2025 • 13