zai-org/GLM-ASR-Nano-2512 Automatic Speech Recognition β’ 2B β’ Updated Dec 24, 2025 β’ 47.8k β’ 333
zai-org/GLM-4.1V-9B-Thinking Image-Text-to-Text β’ 10B β’ Updated Oct 25, 2025 β’ 130k β’ β’ 766
CogVLM: Visual Expert for Pretrained Language Models Paper β’ 2311.03079 β’ Published Nov 6, 2023 β’ 27
CogAgent: A Visual Language Model for GUI Agents Paper β’ 2312.08914 β’ Published Dec 14, 2023 β’ 31
CogVLM2: Visual Language Models for Image and Video Understanding Paper β’ 2408.16500 β’ Published Aug 29, 2024 β’ 57
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning Paper β’ 2507.01006 β’ Published Jul 1, 2025 β’ 250
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning Paper β’ 2507.01006 β’ Published Jul 1, 2025 β’ 250