Colin Raffel

craffel

http://colinraffel.com

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 hours ago

Risk Under Pressure: Compute-Aware Evaluation of Adversarial Robustness in Language Models

updated a bucket 4 days ago

craffel/moto_checkpoints

published a bucket 4 days ago

craffel/moto_checkpoints

liked a model 14 days ago

ibm-granite/granite-speech-4.1-2b-plus

Automatic Speech Recognition • 2B • Updated Apr 29 • 14.8k • 70

liked a model 6 months ago

deepseek-ai/DeepSeek-V3.2

Text Generation • 685B • Updated Dec 1, 2025 • 3.45M • 1.45k

liked a model 7 months ago

ShantanuT01/BERT-tiny-RAID

Text Classification • 4.39M • Updated Sep 15, 2025 • 265 • 1

liked a dataset 10 months ago

nvidia/Nemotron-Post-Training-Dataset-v2

Viewer • Updated Aug 21, 2025 • 6.34M • 7.51k • 137

liked 2 datasets 11 months ago

nvidia/HelpSteer3

Viewer • Updated Nov 16, 2025 • 133k • 8.9k • 111

NousResearch/Hermes-3-Dataset

Viewer • Updated Jul 11, 2025 • 959k • 776 • 311

liked a model 12 months ago

nanonets/Nanonets-OCR-s

Image-Text-to-Text • 4B • Updated Jun 20, 2025 • 78.9k • 1.59k

liked 4 models about 1 year ago

OpenHands/openhands-lm-32b-v0.1

Text Generation • 33B • Updated Apr 16, 2025 • 114 • 391

teapotai/teapotllm

Text Generation • 0.8B • Updated Feb 21 • 68 • 186

Qwen/Qwen2.5-VL-32B-Instruct

Image-Text-to-Text • 33B • Updated Apr 14, 2025 • 424k • 490

mistralai/Mistral-Small-3.1-24B-Instruct-2503

24B • Updated Dec 22, 2025 • 289k • 1.37k

liked 3 models over 1 year ago

Qwen/QwQ-32B

Text Generation • 33B • Updated Mar 11, 2025 • 50k • 2.93k

bartowski/DeepSeek-R1-Distill-Qwen-32B-GGUF

Text Generation • 33B • Updated Jan 22, 2025 • 23.6k • 305

mistralai/Mistral-Small-24B-Instruct-2501

24B • Updated Jul 28, 2025 • 56.8k • 957

liked a Space over 1 year ago

The Ultra-Scale Playbook

The ultimate guide to training LLM on large GPU Clusters

liked 4 datasets over 1 year ago

nvidia/Daring-Anteater

Viewer • Updated Jun 17, 2024 • 99.5k • 3.23k • 29

microsoft/orca-agentinstruct-1M-v1

Viewer • Updated Nov 1, 2024 • 1.05M • 3.33k • 465

mlabonne/orpo-dpo-mix-40k

Viewer • Updated Oct 17, 2024 • 44.2k • 1.19k • 302

LLM360/TxT360

Updated May 26, 2025 • 49.3k • 262

liked a model over 1 year ago

Zyphra/Zamba2-2.7B-instruct

Text Generation • 3B • Updated Feb 14, 2025 • 362 • 83