Prompt Cache: Modular Attention Reuse for Low-Latency Inference • arXiv:2311.04934 • Published Nov 7, 2023
Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning • arXiv:2407.15762 • Published Jul 22, 2024