Dang Kai

dangkai-nk

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

Stabilizing Reinforcement Learning with LLMs: Formulation and Practices

Paper • 2512.01374 • Published Dec 1, 2025 • 96

upvoted a paper 7 months ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2, 2025 • 187

upvoted 2 papers 8 months ago

WorldPM: Scaling Human Preference Modeling

Paper • 2505.10527 • Published May 15, 2025 • 34

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14, 2025 • 320

liked 3 datasets 9 months ago

ByteDance/ComTQA

Viewer • Updated Oct 16, 2024 • 9.07k • 65 • 19

arc-agi-community/arc-agi-2

Viewer • Updated Apr 2, 2025 • 1.12k • 168 • 11

lmms-lab/Omni_Bench

Viewer • Updated Mar 27, 2025 • 1.14k • 30 • 1

liked a dataset 10 months ago

PhoenixZ/MM-AlignBench

Updated Mar 1, 2025 • 29 • 4

liked 12 datasets 11 months ago

czh-up/CoMT

Viewer • Updated Feb 10, 2025 • 3.85k • 2.11k • 9

jonathan-roberts1/zerobench

Viewer • Updated 13 days ago • 434 • 1.37k • 29

USC-GVL/PhysBench

Updated Mar 5, 2025 • 234 • 15

jan-hq/Maze-Reasoning

Viewer • Updated Feb 6, 2025 • 100k • 109 • 20

TIGER-Lab/AceCode-87K

Viewer • Updated Feb 8, 2025 • 87.1k • 898 • 47

simplescaling/s1K

Viewer • Updated Feb 11, 2025 • 1k • 2.12k • 233

allenai/RLVR-IFeval

Viewer • Updated Nov 21, 2024 • 15k • 734 • 25

OpenCoder-LLM/opc-sft-stage2

Viewer • Updated Nov 24, 2024 • 436k • 1.15k • 96

likaixin/APPS-verified

Viewer • Updated Nov 17, 2024 • 4.21k • 15 • 5

likaixin/TACO-verified

Viewer • Updated Apr 17, 2025 • 12.9k • 307 • 18

GTSinger/GTSinger

Viewer • Updated Feb 9, 2025 • 69 • 13.3k • 35

cais/hle

Viewer • Updated Sep 10, 2025 • 2.5k • 19.9k • 646