24 30 9

Xilin Wei

Wiselnn

AI & ML interests

None yet

Recent Activity

upvoted a paper about 22 hours ago

SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience

upvoted a paper 4 days ago

Beyond Fixed: Variable-Length Denoising for Diffusion Large Language Models

upvoted a paper 17 days ago

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction

View all activity

Organizations

upvoted a paper about 22 hours ago

SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience

Paper • 2508.04700 • Published 1 day ago • 37

upvoted a paper 4 days ago

Beyond Fixed: Variable-Length Denoising for Diffusion Large Language Models

Paper • 2508.00819 • Published 6 days ago • 58

upvoted a paper 17 days ago

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction

Paper • 2507.15852 • Published 17 days ago • 37

upvoted a paper about 1 month ago

ScaleCap: Inference-Time Scalable Image Captioning via Dual-Modality Debiasing

Paper • 2506.19848 • Published Jun 24 • 26

upvoted a paper 2 months ago

Video World Models with Long-term Spatial Memory

Paper • 2506.05284 • Published Jun 5 • 53

upvoted a paper 3 months ago

Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLM

Paper • 2503.14478 • Published Mar 18 • 49

upvoted a collection 3 months ago

MM-IFEngine

Collection

[ICCV 2025] Official Implementation of "MM-IFEngine: Towards Multimodal Instruction Following" • 2 items • Updated 23 days ago • 5

upvoted 2 papers 4 months ago

VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning

Paper • 2504.07956 • Published Apr 10 • 47

MM-IFEngine: Towards Multimodal Instruction Following

Paper • 2504.07957 • Published Apr 10 • 34

upvoted a paper 5 months ago

Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published Mar 7 • 124

upvoted a collection 5 months ago

VideoRoPE: What Makes for Good Video Rotary Position Embeddi

Collection

A storage repo for VideoRoPE. • 6 items • Updated Jun 17 • 3

upvoted 4 papers 5 months ago

upvoted 3 papers 6 months ago

SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation

Paper • 2502.13128 • Published Feb 18 • 42

Light-A-Video: Training-free Video Relighting via Progressive Light Fusion

Paper • 2502.08590 • Published Feb 12 • 44

VideoRoPE: What Makes for Good Video Rotary Position Embedding?

Paper • 2502.05173 • Published Feb 7 • 66

upvoted 2 papers 7 months ago

InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model

Paper • 2501.12368 • Published Jan 21 • 47

OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?

Paper • 2501.05510 • Published Jan 9 • 44

Xilin Wei

AI & ML interests

Recent Activity

Organizations

Wiselnn's activity