EDDY GIUSEPE CHIRINOS ISIDRO, PhD

EddyGiusepe

AI & ML interests

Speech Processing (ASR, Speaker Identification, Text-to-Speech) NLP (text classification, Sentence similarity, Token classification, etc) Computer vision (Image classification)

Recent Activity

liked a model 12 days ago

yasserrmd/DentaInstruct-1.2B

upvoted a changelog 17 days ago

Inference Providers now fully support OpenAI-compatible API

liked a Space 28 days ago

hf-audio/open_asr_leaderboard

View all activity

Organizations

upvoted a changelog 17 days ago

Changelog

Inference Providers now fully support OpenAI-compatible API

27 days ago

• 76

upvoted a collection about 1 month ago

💬Urdu ASR Models

Collection

Collection of fine-tuned Urdu speech recognition models. • 9 items • Updated Jul 14 • 2

upvoted a collection about 2 months ago

V-JEPA 2

Collection

A frontier video understanding model developed by FAIR, Meta, which extends the pretraining objectives of https://ai.meta.com/blog/v-jepa-yann • 8 items • Updated Jun 13 • 154

upvoted 2 articles 3 months ago

Article

Vision Language Models (Better, Faster, Stronger)

and 4 others •

May 12

• 504

Article

LeMaterial: an open source initiative to accelerate materials discovery and research

and 9 others •

Dec 10, 2024

• 52

upvoted a collection 3 months ago

D-FINE

Collection

State-of-the-art real-time object detection model with Apache 2.0 licence • 15 items • Updated May 5 • 55

upvoted 2 collections 4 months ago

InternVL3

Collection

34 items • Updated Apr 20 • 80

Llama 4

Collection

Meta's new Llama 4 multimodal models, Scout & Maverick. Includes Dynamic GGUFs, 16-bit & Dynamic 4-bit uploads. Run & fine-tune them with Unsloth! • 15 items • Updated about 5 hours ago • 47

upvoted a collection 5 months ago

MoshiVis v0.1

Collection

MoshiVis is a Vision Speech Model built as a perceptually-augmented version of Moshi v0.1 for conversing about image inputs • 8 items • Updated Mar 21 • 22

upvoted an article 5 months ago

Article

Open R1: Update #4

and 3 others •

Mar 26

• 48

upvoted a collection 5 months ago

Qwen2.5-Omni

Collection

End-to-End Omni (text, audio, image, video, and natural speech interaction) model based Qwen2.5 • 7 items • Updated 24 days ago • 155

upvoted an article 5 months ago

Article

DeepSearch Using Visual RAG in Agentic Frameworks 🔎

and 1 other •

Mar 21

• 35

upvoted a collection 5 months ago

Gemma 3 Release

Collection

28 items • Updated 4 days ago • 435

upvoted 2 articles 5 months ago

Article

Open-Source Handwritten Signature Detection Model

•

Mar 14

• 117

Article

Welcome to Inference Providers on the Hub 🔥

and 6 others •

Jan 28

• 487

upvoted a collection 5 months ago

Gemma 3

Collection

All versions of Google's new multimodal models including QAT in 1B, 4B, 12B, and 27B sizes. In GGUF, dynamic 4-bit and 16-bit formats. • 55 items • Updated about 5 hours ago • 76

upvoted 2 articles 5 months ago

Article

Introducing EuroBERT: A High-Performance Multilingual Encoder Model

and 3 others •

Mar 10

• 146

Article

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

and 3 others •

Mar 4

• 75

upvoted a collection 5 months ago

Cohere Labs Aya Vision

Collection

Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. • 5 items • Updated 14 days ago • 69

upvoted an article 6 months ago

Article

ColPali: Efficient Document Retrieval with Vision Language Models 👀

•

Jul 5, 2024

• 287

EDDY GIUSEPE CHIRINOS ISIDRO, PhD

AI & ML interests

Recent Activity

Organizations

EddyGiusepe's activity

Inference Providers now fully support OpenAI-compatible API

Vision Language Models (Better, Faster, Stronger)

LeMaterial: an open source initiative to accelerate materials discovery and research

Open R1: Update #4

DeepSearch Using Visual RAG in Agentic Frameworks 🔎

Open-Source Handwritten Signature Detection Model

Welcome to Inference Providers on the Hub 🔥

Introducing EuroBERT: A High-Performance Multilingual Encoder Model

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

ColPali: Efficient Document Retrieval with Vision Language Models 👀