Building on HF

Marc Sun

marcsun13

huggingface

AI & ML interests

LLM, Quantization, Training, Inference

Recent Activity

upvoted an article 1 day ago

Article

Is it agentic enough? Benchmarking open models on your own tooling

+1

lysandre, SaylorTwift, pcuenq

•

1 day ago

• 14

liked a Space 2 days ago

The Ultra-Scale Playbook (nanotron/ultrascale-playbook)

The ultimate guide to training LLM on large GPU Clusters

liked 2 models 8 days ago

google/gemma-4-E4B-it-qat-mobile-transformers

Any-to-Any • 3B • Updated 14 days ago • 1.8k • 20

google/gemma-4-E2B-it-qat-mobile-transformers

Any-to-Any • 2B • Updated 14 days ago • 7.28k • 53

upvoted an article 10 days ago

Article

Arcee Becomes the First Major American AI Lab to Replace AWS S3 with Hugging Face Private Storage, in a Multi-Million Dollar Commercial Partnership

clem

•

10 days ago

• 32

liked a Space 14 days ago

Defeating the trainer-generator precision mismatch in TRL

Download research PDF (Pro access required)

updated a Space 28 days ago

vibecheck

Check Hugging Face model details, CI status, and hardware support

upvoted 2 articles 2 months ago

Article

Tiny Agents in Python: a MCP-powered agent in ~70 lines of code

+2

celinah, julien-c, Wauplin, evalstate

•

May 23, 2025

• 172

Article

The PR you would have opened yourself

pcuenq, awni

•

Apr 16

• 72

upvoted an article 3 months ago

Article

Welcome Gemma 4: Frontier multimodal intelligence on device

+5

merve, pcuenq, sergiopaniego, burtenshaw, Steveeeeeeen, alvarobartt, SaylorTwift

•

Apr 2

• 909

liked a model 3 months ago

kernels-community/flash-attn3

Updated 21 days ago • 289k • 48

upvoted an article 4 months ago

Article

GGML and llama.cpp join HF to ensure the long-term progress of Local AI

+4

ggerganov, ngxson, allozaur, lysandre, victor, julien-c

•

Feb 20

• 507

upvoted 8 articles 5 months ago

Article

Continuous batching from first principles

+1

ror, ArthurZ, mcpotato

•

Nov 25, 2025

• 408

Article

Diffusers welcomes FLUX-2

+6

YiYiXu, dg845, sayakpaul, OzzyGT, dn6, ariG23498, linoyts, multimodalart

•

Nov 25, 2025

• 190

Article

We Got Claude to Fine-Tune an Open Source LLM

burtenshaw, evalstate

•

Dec 4, 2025

• 630

Article

New in llama.cpp: Model Management

ggml-org

•

Dec 11, 2025

• 137

Article

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

+4

itazap, ariG23498, ArthurZ, sergiopaniego, merve, pcuenq

•

Dec 18, 2025

• 124

Article

NVIDIA brings agents to life with DGX Spark and Reachy Mini

+1

jeffboudier, nader-at-nvidia, alecfong

•

Jan 5

• 66

Article

Introducing Falcon-H1-Arabic: Pushing the Boundaries of Arabic Language AI with Hybrid Architecture

tiiuae

•

Jan 5

• 43

Article

NVIDIA Cosmos Reason 2 Brings Advanced Reasoning To Physical AI

nvidia

•

Jan 5

• 64