Igor Kuzmin

igorktech

AI & ML interests

AI, Chatbots, NLP, Reinforcement Learning in conversational assistants.

Recent Activity

upvoted a paper 1 day ago

Simple Projection Variants Improve ColBERT Performance

liked a Space 7 days ago

k-mktr/gpu-poor-llm-arena

published a model 12 days ago

unlogic-ai/steppe-1.8b-sft-v2

upvoted a paper 1 day ago

Simple Projection Variants Improve ColBERT Performance

Paper • 2510.12327 • Published Oct 14, 2025 • 7

upvoted a collection 5 months ago

mental therapy datasets

Collection • 21 items • Updated Apr 29, 2024 • 9

upvoted an article 7 months ago

Learn the Hugging Face Kernel Hub in 5 Minutes

Article • Published Jun 12, 2025 • 151

upvoted a paper 8 months ago

Quartet: Native FP4 Training Can Be Optimal for Large Language Models

Paper • 2505.14669 • Published May 20, 2025 • 78

upvoted an article 8 months ago

Exploring Hard Negative Mining with NV-Retriever in Korean Financial Text

Article • Published Jan 12, 2025

upvoted a collection 9 months ago

EmoPillars

Collection • This collection contains models and a dataset for fine-grained context-aware and context-less emotion classification. • 7 items • Updated Apr 25, 2025 • 4

upvoted 2 papers 11 months ago

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published Feb 20, 2025 • 174

Do I Know This Entity? Knowledge Awareness and Hallucinations in Language Models

Paper • 2411.14257 • Published Nov 21, 2024 • 14

upvoted 2 papers about 1 year ago

Rho-1: Not All Tokens Are What You Need

Paper • 2404.07965 • Published Apr 11, 2024 • 93

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 158

upvoted a collection about 1 year ago

Tulu 3 Datasets

Collection • All datasets released with Tulu 3 -- state of the art open post-training recipes. • 33 items • Updated Dec 23, 2025 • 96

upvoted a paper over 1 year ago

PingPong: A Benchmark for Role-Playing Language Models with User Emulation and Multi-Model Evaluation

Paper • 2409.06820 • Published Sep 10, 2024 • 68

upvoted 2 articles over 1 year ago

How NuminaMath Won the 1st AIMO Progress Prize

Article • Published Jul 11, 2024 • 125

Introduction to ggml

Article • Published Aug 13, 2024 • 262

upvoted a paper over 1 year ago

The Russian-focused embedders' exploration: ruMTEB benchmark and Russian embedding model design

Paper • 2408.12503 • Published Aug 22, 2024 • 27

upvoted an article over 1 year ago

Tool Use, Unified

Article • Published Aug 12, 2024 • 120

upvoted a paper over 1 year ago

Linear Transformers with Learnable Kernel Functions are Better In-Context Models

Paper • 2402.10644 • Published Feb 16, 2024 • 81

upvoted 2 articles over 1 year ago

Training and Finetuning Embedding Models with Sentence Transformers v3

Article • Published May 28, 2024 • 263

Uncensor any LLM with abliteration

Article • Published Jun 13, 2024 • 765

upvoted a paper over 1 year ago

Vikhr: The Family of Open-Source Instruction-Tuned Large Language Models for Russian

Paper • 2405.13929 • Published May 22, 2024 • 54
