Hasan Can Solakoğlu's picture

28 576

Hasan Can Solakoğlu

hcsolakoglu

·

AI & ML interests

NLP, Vision, Data Science

Recent Activity

liked a model about 12 hours ago

MiniMaxAI/MiniMax-M2.1

liked a dataset 3 days ago

nvidia/Nemotron-Pretraining-SFT-v1

liked a model 3 days ago

nvidia/NVIDIA-Nemotron-3-Nano-30B-A3B-FP8

View all activity

Organizations

upvoted a paper 8 days ago

AutoMathText: Autonomous Data Selection with Language Models for Mathematical Texts

Paper • 2402.07625 • Published Feb 12, 2024 • 17

upvoted an article 14 days ago

Article

Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance

17 days ago

•

81

upvoted a paper 4 months ago

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published Aug 7 • 180

upvoted 5 collections 4 months ago

Apertus LLM

Democratizing Open and Compliant LLMs for Global Language Environments: 8B and 70B open-data open-weights models, multilingual in >1000 languages • 4 items • Updated Oct 1 • 315

NVIDIA Nemotron V2

Open, Production-ready Enterprise Models. Nvidia Open Model license. • 9 items • Updated 3 days ago • 100

InternVL3.5

This collection includes all released checkpoints of InternVL3.5, covering different training stages (e.g., Pretraining, SFT, MPO, Cascade RL). • 54 items • Updated Sep 28 • 103

GroveMoE

GroveMoE is an open-source family of large language models developed by the AGI Center, Ant Research Institute. • 4 items • Updated 2 days ago • 7

Nemotron-Pre-Training-Datasets

Large scale pre-training datasets used in the Nemotron family of models. • 11 items • Updated 3 days ago • 81

upvoted a collection 5 months ago

Zebra-CoT-v1.0

Zebra-CoT Dataset • 6 items • Updated Jul 23 • 3

upvoted an article 5 months ago

Article

OpenReasoning-Nemotron: A Family of State-of-the-Art Distilled Reasoning Models

Jul 18

•

50

upvoted a collection 5 months ago

Seed-X

A powerful open-source multilingual translation language model series, including instruction and reasoning models. • 8 items • Updated Aug 22 • 65

upvoted an article 6 months ago

Article

FineWeb-C: A Community-Driven Dataset for Educational Quality Annotations in 122 Languages

Jul 8

•

32

upvoted 2 papers 6 months ago

SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity

Paper • 2506.16500 • Published Jun 19 • 16

Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy

Paper • 2507.01352 • Published Jul 2 • 56

upvoted 2 collections 6 months ago

Reward Models 06-2025

Nemotron reward models. For use in RLHF pipelines and LLM-as-a-Judge • 8 items • Updated 3 days ago • 23

ERNIE 4.5

collection of ERNIE 4.5 models. • 27 items • Updated Nov 11 • 180

upvoted 2 papers 6 months ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published Jun 26 • 75

LongWriter-Zero: Mastering Ultra-Long Text Generation via Reinforcement Learning

Paper • 2506.18841 • Published Jun 23 • 56

upvoted a collection 6 months ago

AceReason

Math and Code reasoning model trained through reinforcement learning (RL) • 7 items • Updated 3 days ago • 20

upvoted a collection 7 months ago

MiniCPM4

MiniCPM4: Ultra-Efficient LLMs on End Devices • 29 items • Updated Sep 8 • 82