arxiv:2511.07332
Johan Samir Obando Ceron
johanobandoc
AI & ML interests
Reinforcement Learning, Deep Learning, LLMs/LVMs.
Recent Activity
upvoted a paper about 9 hours ago
The Reward Was in Your Data All Along: Correcting Flow Matching with Discriminator-Guided RL
upvoted a paper 4 months ago
Bigger, Better, Faster: Human-level Atari with human-level efficiency
upvoted a paper 7 months ago
Grounding Computer Use Agents on Human Demonstrations