A Unified Multimodal Data Quality Classifier for generating quality scores for both image-text caption data and interleaved document data
Weizhi Wang
weizhiwang
AI & ML interests
None yet
Recent Activity
updated
a collection
2 days ago
UniFilter
updated
a collection
2 days ago
UniFilter
updated
a collection
2 days ago
UniFilter
Organizations
models
11
weizhiwang/UniFilter-Qwen2.5-1.5B
2B
•
Updated
•
6
weizhiwang/Open-Qwen2VL
Image-Text-to-Text
•
Updated
•
16
•
19
weizhiwang/mlm-filter-qwen2.5-1.5b-gpt4o
Text Generation
•
2B
•
Updated
•
7
•
3
weizhiwang/Open-Qwen2VL-base
Image-Text-to-Text
•
Updated
•
1
weizhiwang/unifilter_mllm_sft_checkpoints
Updated
weizhiwang/LLaVA-Video-Llama-3.1-8B
8B
•
Updated
•
64
•
5
weizhiwang/llava-video-llama-3.1-8b-siglip-so-384-aapool-144-projector
Updated
weizhiwang/mlm-filter-llava-13b-gpt4v
Text Generation
•
Updated
•
6
•
6
weizhiwang/LongMem-558M
Updated
•
4
weizhiwang/clip_datacomp_medium_itm_th_66_AND_odf_th_20_gpt4v
Updated
datasets
11
weizhiwang/OBELICS_HQ_5M_UniFilter
Viewer
•
Updated
•
5.06M
•
327
weizhiwang/unifilter_train_data
Viewer
•
Updated
•
1.56M
•
41
weizhiwang/cnsi-chatbot
Updated
•
38
weizhiwang/mlm_filter_instructions
Updated
•
17
•
5
weizhiwang/agent_eval
Viewer
•
Updated
•
851
•
6
weizhiwang/Open-Qwen2VL-Data
Viewer
•
Updated
•
13M
•
4.11k
•
22
weizhiwang/Open-Qwen2VL-Data-Interleaved
Viewer
•
Updated
•
23.3M
•
809
•
2
weizhiwang/mmc4_fewer_faces
Updated
•
7
weizhiwang/datacomp-hq
Updated
•
9
weizhiwang/llava_v15_instruction_images
Preview
•
Updated
•
146
•
6