neuralink

https://phucnguyen.dev

AI & ML interests

nanotron @ hf

Recent Activity

liked a model about 2 months ago

baidu/ERNIE-4.5-0.3B-PT

Text Generation • Updated Aug 29 • 11k • 92

upvoted an article 2 months ago

Article

Arc Virtual Cell Challenge: A Primer

Jul 18

• 59

upvoted an article 5 months ago

Article

The Transformers Library: standardizing model definitions

May 15

• 119

liked a model 6 months ago

Qwen/Qwen3-235B-A22B

Text Generation • 235B • Updated Jul 26 • 166k • 1.04k

upvoted 2 articles 6 months ago

Article

You could have designed state of the art positional encoding

Nov 25, 2024

• 374

Article

Welcome Llama 4 Maverick & Scout on Hugging Face!

Apr 5

• 145

liked a dataset 6 months ago

nanotron/ultrascale-playbook-data

Updated Mar 12 • 3.84k • 7

upvoted a paper 6 months ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7 • 198

upvoted an article 7 months ago

Article

Open R1: Update #3

Mar 11

• 295

liked a Space 7 months ago

Predict Memory

🧮

Calculate memory usage for model configurations

upvoted an article 7 months ago

Article

LLM Inference on Edge: A Fun and Easy Guide to run LLMs via React Native on your Phone!

Mar 7

• 85

New activity in nanotron/ultrascale-playbook 8 months ago

Make hash section working

#89 opened 8 months ago by

mishig

upvoted an article 8 months ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

• 1.3k

liked a Space 8 months ago

670

Open Deep-Research

🏆

OpenAI's Deep Research, but open

New activity in nanotron/ultrascale-playbook 8 months ago

More ressources

#73 opened 8 months ago by

eliebak

liked a Space 8 months ago

3.3k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

New activity in nanotron/ultrascale-playbook 8 months ago

xrsrke/link_nanotron_fp8_appexdix

#21 opened 8 months ago by

neuralink

xrsrke/fix_width_height_for_fp8_graph

#46 opened 8 months ago by

neuralink

updated a Space 8 months ago

3.3k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

upvoted an article 8 months ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

• 880
