38 63 38

Qinghong (Kevin) Lin

KevinQHLin

http://qhlin.me/

AI & ML interests

Vision-Language Model, Video Understanding, Human-AI Interaction

Recent Activity

reacted to Reality123b's post with 🤗 2 days ago

Happy birthday to me!!!

liked a dataset 5 days ago

likaixin/GroundCUA-train

reacted to Jaward's post with 🤯 7 days ago

Incredible work!! They claim this is the year of recursive language models (I hope so). As models get bigger and better managing their context windows to fit longer prompts has been a standing engineering problem. They propose an inference technique that allows the model to externally crunch down long prompts into snippets that it can recursively call itself on, instead of directly feeding the entire prompt into the transformer. This could make models cheaper and more efficient but I doubt if big tech will adopt it since they profit more with the current approach (bigger models = longer context windows = more expensive the model). Once again such work came from academia/oss community cuz I doubt big tech would have shared these findings lol. They probably have much better inference methods that we may never know of haha. Paper: https://arxiv.org/pdf/2512.24601

View all activity

Organizations

reacted to Reality123b's post with 🤗 2 days ago

Post

1918

Happy birthday to me!!!

2 replies

liked a dataset 5 days ago

likaixin/GroundCUA-train

Viewer • Updated 6 days ago • 50.6k • 30 • 3

reacted to Jaward's post with 🤯 7 days ago

Post

897

upvoted a paper 19 days ago

DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI

Paper • 2512.16676 • Published 24 days ago • 203

authored a paper 25 days ago

Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?

Paper • 2512.13281 • Published 27 days ago • 63

liked a dataset 25 days ago

kolerk/Video_Reality_Test

Viewer • Updated 5 days ago • 149 • 549 • 7

upvoted a paper 25 days ago

Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?

Paper • 2512.13281 • Published 27 days ago • 63

submitted a paper to Daily Papers 25 days ago

Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans?

Paper • 2512.13281 • Published 27 days ago • 63

upvoted a paper 25 days ago

EVOLVE-VLA: Test-Time Training from Environment Feedback for Vision-Language-Action Models

Paper • 2512.14666 • Published 25 days ago • 8

authored 2 papers 25 days ago

Show-o: One Single Transformer to Unify Multimodal Understanding and Generation

Paper • 2408.12528 • Published Aug 22, 2024 • 51

UI-Vision: A Desktop-centric GUI Benchmark for Visual Perception and Interaction

Paper • 2503.15661 • Published Mar 19, 2025 • 2

upvoted 2 papers about 1 month ago

OmniPSD: Layered PSD Generation with Diffusion Transformer

Paper • 2512.09247 • Published Dec 10, 2025 • 46

PaperDebugger: A Plugin-Based Multi-Agent System for In-Editor Academic Writing, Review, and Editing

Paper • 2512.02589 • Published Dec 2, 2025 • 68

upvoted a paper about 2 months ago

The Image as Its Own Reward: Reinforcement Learning with Adversarial Reward for Image Generation

Paper • 2511.20256 • Published Nov 25, 2025 • 27

liked a model about 2 months ago

facebook/sam3

Mask Generation • 0.9B • Updated Nov 20, 2025 • 1.41M • 1.33k

authored a paper about 2 months ago

Computer-Use Agents as Judges for Generative User Interface

Paper • 2511.15567 • Published Nov 19, 2025 • 52

upvoted a paper about 2 months ago

Computer-Use Agents as Judges for Generative User Interface

Paper • 2511.15567 • Published Nov 19, 2025 • 52

commented a paper about 2 months ago

Computer-Use Agents as Judges for Generative User Interface

Paper • 2511.15567 • Published Nov 19, 2025 • 52 •

upvoted 2 papers about 2 months ago

SAM 3: Segment Anything with Concepts

Paper • 2511.16719 • Published Nov 20, 2025 • 127

SAM 3D: 3Dfy Anything in Images

Paper • 2511.16624 • Published Nov 20, 2025 • 110

Qinghong (Kevin) Lin

AI & ML interests

Recent Activity

Organizations

KevinQHLin's activity