Articles:
- Introducing AutoRound: Intel's Advanced Quantization for LLMs and VLMs
- Memory-efficient Diffusion Transformers with Quanto and Diffusers (Jul 30, 2024)
- NVIDIA's GTC 2025 Announcement for Physical AI Developers: New Open Models and Datasets (Mar 18)
- Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM (Mar 12)
- LLM Inference on Edge: A Fun and Easy Guide to run LLMs via React Native on your Phone! (Mar 7)

Collections:
- Gemma 3 QAT Collection (15 items): Quantization-Aware Trained (QAT) Gemma 3 checkpoints. These models preserve quality similar to half precision while using 3x less memory.

Papers:
- LoftQ: LoRA-Fine-Tuning-Aware Quantization for Large Language Models (arXiv:2310.08659, published Oct 12, 2023)
- Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations (arXiv:2405.18392, published May 28, 2024)
- BitNet: Scaling 1-bit Transformers for Large Language Models (arXiv:2310.11453, published Oct 17, 2023)