19 9 169

Nathan Cooper

ncoop57

https://nathancooper.io

AI & ML interests

The intersection of Software Engineering, Deep Learning, NLP, and Graph Networks.

Recent Activity

liked a dataset 3 days ago

answerdotai/enwiki

upvoted an article about 1 month ago

SmolLM3: smol, multilingual, long-context reasoner

liked a Space 5 months ago

Mar2Ding/SAM2Long-Demo

View all activity

Organizations

upvoted an article about 1 month ago

Article

SmolLM3: smol, multilingual, long-context reasoner

and 22 others •

Jul 8

• 625

upvoted a paper 6 months ago

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28 • 123

upvoted 2 articles 7 months ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

and 2 others •

Jan 28

• 877

Article

Synthetic Data Generation with FastData and Hugging Face

•

Jan 7

• 15

upvoted a collection 7 months ago

ModernBERT

Collection

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 149

upvoted a paper 8 months ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 153

upvoted an article 12 months ago

Article

Uncensor any LLM with abliteration

•

Jun 13, 2024

• 649

upvoted a collection about 1 year ago

Qwen2

Collection

Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated 24 days ago • 368

upvoted a paper over 1 year ago

Stable LM 2 1.6B Technical Report

Paper • 2402.17834 • Published Feb 27, 2024 • 3

Nathan Cooper

AI & ML interests

Recent Activity

Organizations

ncoop57's activity

SmolLM3: smol, multilingual, long-context reasoner

Open-R1: a fully open reproduction of DeepSeek-R1

Synthetic Data Generation with FastData and Hugging Face

Uncensor any LLM with abliteration