Absolute Zero: Reinforced Self-play Reasoning with Zero Data Paper • 2505.03335 • Published 7 days ago • 124
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models Paper • 2504.10479 • Published 29 days ago • 255
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 11 items • Updated 15 days ago • 464
Rethinking RL Scaling for Vision Language Models: A Transparent, From-Scratch Framework and Comprehensive Evaluation Scheme Paper • 2504.02587 • Published Apr 3 • 30
LLaVA-o1: Let Vision Language Models Reason Step-by-Step Paper • 2411.10440 • Published Nov 15, 2024 • 125
Llama 3.2 Collection Meta's new Llama 3.2 vision and text models including 1B, 3B, 11B and 90B. Includes GGUF, 4-bit bnb and original versions. • 27 items • Updated 13 days ago • 63
DeepPerception: Advancing R1-like Cognitive Visual Perception in MLLMs for Knowledge-Intensive Visual Grounding Paper • 2503.12797 • Published Mar 17 • 30
Rewards Are Enough for Fast Photo-Realistic Text-to-Image Generation Paper • 2503.13070 • Published Mar 17 • 9
Gemma 3 Collection All versions of Google's new multimodal models including QAT in 1B, 4B, 12B, and 27B sizes. In GGUF, dynamic 4-bit and 16-bit formats. • 50 items • Updated 11 days ago • 60
Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment Paper • 2502.04328 • Published Feb 6 • 30
Article LeRobot goes to driving school: World's largest open-source self-driving dataset • Mar 11 • 79
Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM • Mar 12 • 412