Jesse

jessepisel

AI & ML interests

computer vision, optimization, spatial

Recent Activity

reacted to merve's post with 🔥 10 days ago

liked a model about 2 months ago

nanonets/Nanonets-OCR-s

liked a model 2 months ago

thinkonward/denoizer

Organizations

upvoted a collection 4 months ago

Cogito v1 Preview

Collection

5 items • Updated Apr 8 • 116

upvoted a paper 4 months ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published Apr 7 • 197

upvoted 2 papers 5 months ago

Open Deep Search: Democratizing Search with Open-source Reasoning Agents

Paper • 2503.20201 • Published Mar 26 • 48

Beyond RAG: Task-Aware KV Cache Compression for Comprehensive Knowledge Reasoning

Paper • 2503.04973 • Published Mar 6 • 24

upvoted 2 articles 5 months ago

Open R1: Update #3

Article • and 9 others • Mar 11 • 295

Open R1: Update #2

Article • and 6 others • Feb 10 • 216

upvoted a paper 5 months ago

LLM as a Broken Telephone: Iterative Generation Distorts Information

Paper • 2502.20258 • Published Feb 27 • 27

upvoted a paper 6 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 241

upvoted a collection 6 months ago

Tulu 3 Models

Collection

All models released with Tulu 3 -- state of the art open post-training recipes. • 11 items • Updated Apr 30 • 100

upvoted 2 articles 6 months ago

Welcome to Inference Providers on the Hub 🔥

Article • and 6 others • Jan 28 • 486

Open-R1: a fully open reproduction of DeepSeek-R1

Article • and 2 others • Jan 28 • 876

upvoted a paper 8 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 373

upvoted an article 8 months ago

Use Models from the Hugging Face Hub in LM Studio

Article • Nov 28, 2024 • 140

upvoted a paper 8 months ago

CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models

Paper • 2411.18613 • Published Nov 27, 2024 • 59

upvoted 2 papers 9 months ago

TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

Paper • 2411.15124 • Published Nov 22, 2024 • 65

Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models

Paper • 2411.04996 • Published Nov 7, 2024 • 52

upvoted 2 papers 10 months ago

LLMs Know More Than They Show: On the Intrinsic Representation of LLM Hallucinations

Paper • 2410.02707 • Published Oct 3, 2024 • 49

LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness

Paper • 2409.18125 • Published Sep 26, 2024 • 35

upvoted 2 papers 11 months ago

SplatFields: Neural Gaussian Splats for Sparse 3D and 4D Reconstruction

Paper • 2409.11211 • Published Sep 17, 2024 • 9

Implicit Neural Representations with Fourier Kolmogorov-Arnold Networks

Paper • 2409.09323 • Published Sep 14, 2024 • 5
