Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Maya: Multilingual Multimodal model

community
https://github.com/nahidalam/maya
Activity Feed Request to join this org

AI & ML interests

None defined yet.

Shayekh Islam's profile picture Genta Indra Winata's profile picture Nahid Alam's profile picture Drishti Sharma's profile picture Satya Vegesna's profile picture Ashvanth.S's profile picture Anthony Susevski's profile picture Ryan Chan's profile picture Surya Guthikonda's profile picture Abhipsha's profile picture Karthik's profile picture Roshan Santhosh's profile picture Isha Chaturvedi's profile picture
Organization Card
Community About org cards

We introduce Maya, an open-source Multilingual Multimodal model.

  1. A multilingual image-text pretraining dataset in eight languages, based on the LLaVA pretraining dataset;
  2. A novel toxicity-free version across eight languages; and
  3. A multilingual image-text 8B model supporting these languages, enhancing culture and linguistics.

Collections 1

Maya @CVPR 2025
Two papers from the Maya Project have been accepted at CVPR 2025!
  • Understanding and Mitigating Toxicity in Image-Text Pretraining Datasets: A Case Study on LLaVA

    Paper • 2505.06356 • Published May 9 • 3
  • Behind Maya: Building a Multilingual Vision Language Model

    Paper • 2505.08910 • Published May 13 • 2
Maya @CVPR 2025
Two papers from the Maya Project have been accepted at CVPR 2025!
  • Understanding and Mitigating Toxicity in Image-Text Pretraining Datasets: A Case Study on LLaVA

    Paper • 2505.06356 • Published May 9 • 3
  • Behind Maya: Building a Multilingual Vision Language Model

    Paper • 2505.08910 • Published May 13 • 2

models 1

maya-multimodal/maya

Image-Text-to-Text • 8B • Updated May 15 • 82 • 69

datasets 1

maya-multimodal/pretrain

Viewer • Updated May 19 • 4.4M • 24 • 18
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs