garrethlee (Garreth Lee)

liked a Space 2 months ago

The Smol Training Playbook

📚

2.8k

The secrets to building world-class LLMs

liked a dataset 4 months ago

HuggingFaceM4/FineVision

Viewer • Updated Oct 21, 2025 • 24.2M • 101k • 463

liked a model 4 months ago

google/embeddinggemma-300m

liked a dataset 5 months ago

nvidia/Granary

Viewer • Updated Aug 14, 2025 • 116M • 4.22k • 165

liked a Space 9 months ago

Dia 1.6B

👯

1.74k

Generate realistic dialogue from a script, using Dia!

liked a Space 11 months ago

The Ultra-Scale Playbook

🌌

3.62k

The ultimate guide to training LLMs on large GPU clusters

liked a model 12 months ago

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27, 2025 • 439k • 12.9k

liked a dataset about 1 year ago

HuggingFaceFW/fineweb-2

Viewer • Updated Oct 27, 2025 • 4.48B • 61.5k • 710

liked 2 Spaces about 1 year ago

Number Tokenization Blog

📈

106

Explore how tokenization affects arithmetic in LLMs

Hub LFS Analysis

📈

19

An analysis of LFS files on the Hub.

liked a model about 1 year ago

GoToCompany/gemma2-9b-cpt-sahabatai-v1-instruct

9B • Updated Nov 6, 2024 • 585 • 45

liked a Space about 1 year ago

Sahabat-AI Chatbot (Gemma2 9b)

😻

4

Chatbot

liked 2 datasets about 1 year ago

indolem/IndoMMLU

Updated Oct 11, 2023 • 168 • 19

PleIAs/common_corpus

Viewer • Updated Jun 10, 2025 • 470M • 41.5k • 321

liked 2 Spaces about 1 year ago

Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks

📝

86

Evaluate multilingual models using FineTasks

TxT360: Trillion Extracted Text

📖

131

Explore and analyze the TxT360 dataset for LLM pre-training

liked 2 Spaces over 1 year ago

Model Memory Utility

🚀

994

Calculate vRAM needed for model training and inference

FineWeb: decanting the web for the finest text data at scale

🍷

1.25k

Generate high-quality text data for LLMs using FineWeb

liked a model almost 2 years ago

mistralai/Mistral-7B-Instruct-v0.2

Text Generation • 7B • Updated Jul 24, 2025 • 2.36M • 3.05k

Garreth Lee

AI & ML interests

Organizations
