Penghui Qi's picture

Penghui Qi

QPHutu

·

QPHutu

AI & ML interests

None yet

Recent Activity

authored a paper about 10 hours ago

Rethinking the Divergence Regularization in LLM RL

upvoted a paper 1 day ago

Flow-DPPO: Divergence Proximal Policy Optimization for Flow Matching Models

upvoted a paper 1 day ago

Rethinking the Divergence Regularization in LLM RL

View all activity

Organizations

Collections 4

View 4 collections

Papers 10

arxiv:2606.09821

arxiv:2602.04879

arxiv:2601.19362

arxiv:2510.26788

models 0

None public yet

datasets 0

None public yet