arxiv:2505.04620
Shengqiong Wu
ChocoWu
AI & ML interests
Large Language Model, Multimodal learning, Scene graph Generation
Recent Activity
upvoted
a
paper
4 days ago
UniVA: Universal Video Agent towards Open-Source Next-Generation Video Generalist
upvoted
a
paper
about 1 month ago
Latent Diffusion Model without Variational Autoencoder
upvoted
a
paper
about 1 month ago
Less is More: Improving LLM Reasoning with Minimal Test-Time
Intervention