Jiuqi Wang
LeonardoWjq
AI & ML interests
reinforcement learning
Recent Activity
authored
a paper
2 months ago
PokeeResearch: Effective Deep Research via Reinforcement Learning from
AI Feedback and Robust Reasoning Scaffold
new activity
about 1 year ago
stable-diffusion-v1-5/stable-diffusion-v1-5:Update README.md