Gemini: A Family of Highly Capable Multimodal Models Paper • 2312.11805 • Published Dec 19, 2023 • 48
SC-MIL: Supervised Contrastive Multiple Instance Learning for Imbalanced Classification in Pathology Paper • 2303.13405 • Published Mar 23, 2023 • 2
PathologyBERT -- Pre-trained Vs. A New Transformer Language Model for Pathology Domain Paper • 2205.06885 • Published May 13, 2022 • 1
MLLM4PUE: Toward Universal Embeddings in Computational Pathology through Multimodal LLMs Paper • 2502.07221 • Published Feb 11, 2025 • 1
Hibou: A Family of Foundational Vision Transformers for Pathology Paper • 2406.05074 • Published Jun 7, 2024 • 10
Pathology Image Compression with Pre-trained Autoencoders Paper • 2503.11591 • Published Mar 14, 2025 • 5
SOP: A Scalable Online Post-Training System for Vision-Language-Action Models Paper • 2601.03044 • Published 3 days ago • 26
UniCorn: Towards Self-Improving Unified Multimodal Models through Self-Generated Supervision Paper • 2601.03193 • Published 3 days ago • 39
LTX-2: Efficient Joint Audio-Visual Foundation Model Paper • 2601.03233 • Published 3 days ago • 77
HISTAI: An Open-Source, Large-Scale Whole Slide Image Dataset for Computational Pathology Paper • 2505.12120 • Published May 17, 2025 • 5
A Survey of Pathology Foundation Model: Progress and Future Directions Paper • 2504.04045 • Published Apr 5, 2025 • 1
Ensemble of Pathology Foundation Models for MIDOG 2025 Track 2: Atypical Mitosis Classification Paper • 2509.02591 • Published Aug 29, 2025 • 1
DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models Paper • 2512.24165 • Published 10 days ago • 47
Pathology-CoT: Learning Visual Chain-of-Thought Agent from Expert Whole Slide Image Diagnosis Behavior Paper • 2510.04587 • Published Oct 6, 2025 • 2
Coupling Experts and Routers in Mixture-of-Experts via an Auxiliary Loss Paper • 2512.23447 • Published 11 days ago • 93
LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation Paper • 2512.23576 • Published 11 days ago • 64
Towards a Visual-Language Foundation Model for Computational Pathology Paper • 2307.12914 • Published Jul 24, 2023 • 1