RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference Paper • 2505.02922 • Published May 2025 • 23
MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention Paper • 2504.16083 • Published Apr 2025 • 8
Optimizing Large Language Model Training Using FP4 Quantization Paper • 2501.17116 • Published Jan 28, 2025 • 39
Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models Paper • 2501.13629 • Published Jan 23, 2025 • 48
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published Jan 8, 2025 • 276
SCBench: A KV Cache-Centric Analysis of Long-Context Methods Paper • 2412.10319 • Published Dec 13, 2024 • 10