Kai Zhang
drogozhang
AI & ML interests
NLP
Recent Activity
authored
a paper
about 4 hours ago
LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model
upvoted
a
paper
about 6 hours ago
VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use