2 16

Haocheng Xi

Xihc20

AI & ML interests

None yet

Recent Activity

upvoted a paper 2 months ago

Sparse VideoGen2: Accelerate Video Generation with Sparse Attention via Semantic-Aware Permutation

upvoted a paper 3 months ago

GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning

upvoted a paper 3 months ago

Thinkless: LLM Learns When to Think

View all activity

Organizations

upvoted a paper 2 months ago

Sparse VideoGen2: Accelerate Video Generation with Sparse Attention via Semantic-Aware Permutation

Paper • 2505.18875 • Published May 24 • 42

upvoted 4 papers 3 months ago

GoT-R1: Unleashing Reasoning Capability of MLLM for Visual Generation with Reinforcement Learning

Paper • 2505.17022 • Published May 22 • 27

Thinkless: LLM Learns When to Think

Paper • 2505.13379 • Published May 19 • 51

Delta Attention: Fast and Accurate Sparse Attention Inference by Delta Correction

Paper • 2505.11254 • Published May 16 • 49

AdaptThink: Reasoning Models Can Learn When to Think

Paper • 2505.13417 • Published May 19 • 82

upvoted a paper 4 months ago

Learning Adaptive Parallel Reasoning with Language Models

Paper • 2504.15466 • Published Apr 21 • 43

updated a dataset 5 months ago

Efficient-Large-Model/COAT-ToolBench

Updated Mar 26 • 2

published a dataset 5 months ago

Efficient-Large-Model/COAT-ToolBench

Updated Mar 26 • 2

published a Space 5 months ago

Sparse VideoGen

📈

Demos

upvoted 2 papers 6 months ago

S*: Test Time Scaling for Code Generation

Paper • 2502.14382 • Published Feb 20 • 63

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16 • 165

upvoted 2 papers 8 months ago

SageAttention2 Technical Report: Accurate 4 Bit Attention for Plug-and-play Inference Acceleration

Paper • 2411.10958 • Published Nov 17, 2024 • 56

NVILA: Efficient Frontier Visual Language Models

Paper • 2412.04468 • Published Dec 5, 2024 • 60

updated a dataset 8 months ago

Xihc20/CropVBench

Preview • Updated Dec 3, 2024 • 3

upvoted an article 9 months ago

Article

Unbelievable! Run 70B LLM Inference on a Single 4GB GPU with This NEW Technique

•

Nov 30, 2023

• 34

upvoted a paper 9 months ago

T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation

Paper • 2407.14505 • Published Jul 19, 2024 • 27

commented a paper 9 months ago

COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training

Paper • 2410.19313 • Published Oct 25, 2024 • 19 •

upvoted a paper 9 months ago

COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training

Paper • 2410.19313 • Published Oct 25, 2024 • 19

commented a paper 9 months ago

COAT: Compressing Optimizer states and Activation for Memory-Efficient FP8 Training

Paper • 2410.19313 • Published Oct 25, 2024 • 19 •

upvoted a paper 10 months ago

PUMA: Empowering Unified MLLM with Multi-granular Visual Generation

Paper • 2410.13861 • Published Oct 17, 2024 • 57

Haocheng Xi

AI & ML interests

Recent Activity

Organizations

Xihc20's activity

Sparse VideoGen

Unbelievable! Run 70B LLM Inference on a Single 4GB GPU with This NEW Technique