merve's picture

Building on HF

merve PRO

merve

huggingface

·

https://github.com/merveenoyan/smol-vision

AI & ML interests

I love this website VLMs, vision & co

Recent Activity

updated a dataset about 10 hours ago

uv-scripts/object-detection

published a dataset about 10 hours ago

uv-scripts/object-detection

updated a dataset about 10 hours ago

merve/license-plates-coco

View all activity

Organizations

upvoted an article 9 days ago

Article

Mixture of Experts (MoEs) in Transformers

+5

9 days ago

•

120

upvoted an article 12 days ago

Article

IBM and UC Berkeley Diagnose Why Enterprise Agents Fail Using IT-Bench and MAST

16 days ago

•

18

upvoted a collection 18 days ago

Tiny Aya

Bridging Scale and Multilingual Depth • 10 items • Updated 18 days ago • 64

upvoted an article 27 days ago

Article

From Golden Gate Bridge to Broken JSON: Why Anthropic's SAE Steering Fails for Structured Output

27 days ago

•

21

upvoted an article 29 days ago

Article

Community Evals: Because we're done trusting black-box leaderboards over the community

+5

Feb 4

•

85

upvoted a changelog 29 days ago

Hugging Face Changelog

Community Evals and Benchmark Repositories

29 days ago

• 67

upvoted 2 articles 29 days ago

Article

🚀 SyGra V2.0.0

29 days ago

•

8

Article

Introducing SyGra Studio

29 days ago

•

25

upvoted 3 articles about 1 month ago

Article

Nemotron ColEmbed V2: Raising the Bar for Multimodal Retrieval with ViDoRe V3’s Top Model

about 1 month ago

•

28

Article

Training Design for Text-to-Image Models: Lessons from Ablations

Feb 3

•

68

Article

H Company's new Holo2 model takes the lead in UI Localization

Feb 3

•

5

upvoted a paper about 1 month ago

C-RADIOv4 (Tech Report)

Paper • 2601.17237 • Published Jan 24 • 10

upvoted a collection about 1 month ago

Open Coding Agents

13 items • Updated 2 days ago • 49

upvoted an article about 1 month ago

Article

Llasa Goes RL: Training LLaSA with GRPO for Improved Prosody and Expressiveness

Nov 5, 2025

•

12

upvoted a collection about 1 month ago

Nemotron ColEmbed V2

State-of-the-Art Late Interaction Vision-Language Embedding Models • 3 items • Updated about 8 hours ago • 10

upvoted 2 articles about 1 month ago

Article

Security, Governance and Performance for Dell On-Prem AI Builders

Jan 21

•

7

Article

AssetOpsBench: Bridging the Gap Between AI Agent Benchmarks and Industrial Reality

Jan 21

•

31

upvoted an article about 2 months ago

Article

LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family

Jan 19

•

87

upvoted 2 collections about 2 months ago

LightOnOCR-2 🦉

LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family • 12 items • Updated 4 days ago • 22

Kanana-2

Open Source Kanana-2 • 29 items • Updated 5 days ago • 36