Models: 45

Active filters: Reasoning-Course

tariktuna/Summarizer-Demo-SmolGRPO-135M

Text Generation • 0.1B • Updated Jul 1 • 8

donaldyy/SmolGRPO-135M

Text Generation • 0.1B • Updated Jul 3 • 11

dzungever/SmolLM-135M-Instruct-GRPO

Text Generation • 0.1B • Updated Jul 8 • 3

vovk/SmolGRPO-135M

Text Generation • 0.1B • Updated Jul 14 • 9

alfredcs/torchrun-medgemma-27b-grpo-merged

Image-Text-to-Text • 27B • Updated Jul 15 • 11

dzur658/SmolGRPO-135M

Text Generation • 0.1B • Updated Jul 22 • 10

Mhammad2023/SmolGRPO-135M

Text Generation • 0.1B • Updated Jul 22 • 9

amritzeon/SmolGRPO-135M

Text Generation • 0.1B • Updated Jul 25 • 8

eduliza/SmolGRPO-135M

Text Generation • 0.1B • Updated Jul 25 • 8

0xtimi/SmolGRPO-135M

Text Generation • 0.1B • Updated about 1 month ago • 9

TBao-THUer/SmolGRPO-135M

Text Generation • 0.1B • Updated 23 days ago • 14

kavanmevada/SmolGRPO-135M

Text Generation • 0.4B • Updated 11 days ago • 10

kavanmevada/SmolGRPO-135M-adapter

Updated 15 days ago

niikun/SmolGRPO-135M

Text Generation • 0.1B • Updated 14 days ago • 6

harikrushna2272/SmolGRPO-135M

Text Generation • 0.1B • Updated 12 days ago • 4