In a Training Loop 🔄


Jisoo Kim PRO

kuotient

AI & ML interests

NLP

Recent Activity

liked a Space 2 days ago

engineerA314/smol-training-playbook-ko

liked a model 10 days ago

naver-hyperclovax/HyperCLOVAX-SEED-Think-32B

liked a model 10 days ago

skt/A.X-K1



upvoted an article 20 days ago

Article

How We Use Claude Code Skills to Run 1,000+ ML Experiments a Day

Dec 8, 2025


upvoted an article about 1 month ago

Article

Transformers v5: Simple model definitions powering the AI ecosystem

Dec 1, 2025 • 268

upvoted a paper about 2 months ago

Black-Box On-Policy Distillation of Large Language Models

Paper • 2511.10643 • Published Nov 13, 2025 • 49

upvoted an article about 2 months ago

Article

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

Nov 3, 2025


upvoted a paper 3 months ago

KORMo: Korean Open Reasoning Model for Everyone

Paper • 2510.09426 • Published Oct 10, 2025 • 83

upvoted 2 articles 4 months ago

Article

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

Sep 11, 2025 • 177

Article

Exploring Environments Hub: Your Language Model needs better (open) environments to learn

Sep 4, 2025


upvoted a paper 4 months ago

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL

Paper • 2508.13167 • Published Aug 6, 2025 • 129

upvoted a collection 5 months ago

Tool Use Reasoning

Collection

A collection of tool use reasoning datasets in Hermes format • 5 items • Updated Jul 23, 2025 • 9

upvoted 2 articles 5 months ago

Article

ChatML vs Harmony: Understanding the new Format from OpenAI 🔍

Aug 9, 2025


Article

Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training

Aug 8, 2025


upvoted an article 9 months ago

Article

Training Large Language Models with Interpreter Feedback using WebAssembly

Apr 3, 2025


upvoted 2 papers 10 months ago

R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

Paper • 2503.05592 • Published Mar 7, 2025 • 27

Kanana: Compute-efficient Bilingual Language Models

Paper • 2502.18934 • Published Feb 26, 2025 • 65

upvoted a paper about 1 year ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 376

upvoted an article about 1 year ago

Article

The Beginners Guide to Cleaning a Dataset

Nov 18, 2024


upvoted 2 papers over 1 year ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 140

Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution

Paper • 2409.12191 • Published Sep 18, 2024 • 78

upvoted an article over 1 year ago

Article

Efficient Deep Learning: A Comprehensive Overview of Optimization Techniques 👐 📚

Aug 26, 2024


upvoted a paper over 1 year ago

The Mamba in the Llama: Distilling and Accelerating Hybrid Models

Paper • 2408.15237 • Published Aug 27, 2024 • 42
