Lewei Lu's picture

Lewei Lu

luotto

·

ottolu

AI & ML interests

None yet

Recent Activity

liked a dataset 6 days ago

miromind-ai/MiroVerse-v0.1

upvoted a paper 11 days ago

ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents

liked a dataset 21 days ago

interstellarninja/hermes_reasoning_tool_use

View all activity

Organizations

upvoted a paper 11 days ago

ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents

Paper • 2507.22827 • Published 14 days ago • 89

upvoted 4 papers about 1 month ago

A Survey on Latent Reasoning

Paper • 2507.06203 • Published Jul 8 • 87

WebSailor: Navigating Super-human Reasoning for Web Agent

Paper • 2507.02592 • Published Jul 3 • 109

PyVision: Agentic Vision with Dynamic Tooling

Paper • 2507.07998 • Published Jul 10 • 31

Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers

Paper • 2506.23918 • Published Jun 30 • 86

upvoted 4 papers about 2 months ago

TaskCraft: Automated Generation of Agentic Tasks

Paper • 2506.10055 • Published Jun 11 • 32

CoMemo: LVLMs Need Image Context with Image Memory

Paper • 2506.06279 • Published Jun 6 • 9

OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling

Paper • 2506.20512 • Published Jun 25 • 46

MMSearch-R1: Incentivizing LMMs to Search

Paper • 2506.20670 • Published Jun 25 • 62

upvoted 2 papers 2 months ago

Language-Image Alignment with Fixed Text Encoders

Paper • 2506.04209 • Published Jun 4 • 11

Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in Spaces

Paper • 2506.00123 • Published May 30 • 34

upvoted 3 papers 3 months ago

ZeroGUI: Automating Online GUI Learning at Zero Human Cost

Paper • 2505.23762 • Published May 29 • 46

Visual Planning: Let's Think Only with Images

Paper • 2505.11409 • Published May 16 • 57

Tool-Star: Empowering LLM-Brained Multi-Tool Reasoner via Reinforcement Learning

Paper • 2505.16410 • Published May 22 • 57

upvoted a collection 3 months ago

SigLIP2

36 items • Updated Jul 10 • 82

upvoted a paper 3 months ago

Seed1.5-VL Technical Report

Paper • 2505.07062 • Published May 11 • 149

upvoted a collection 4 months ago

Qwen3

84 items • Updated 7 days ago • 1.08k

upvoted 3 papers 4 months ago

VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models

Paper • 2504.15279 • Published Apr 21 • 75

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published Apr 14 • 280

Inference-Time Scaling for Generalist Reward Modeling

Paper • 2504.02495 • Published Apr 3 • 57