FailSense Datasets and Benchmarks
ACIDE/AHA-Calvin-1p • Updated Jul 2 • 12.3k • 17
ACIDE/AHA-Calvin-2p • Updated Jul 2 • 12.3k • 49
ACIDE/AHA-Calvin • Updated Jul 2 • 12.3k • 42
ACIDE/DROID_1p_bench • Updated Jul 12 • 138 • 43

User-VLM 360° Datasets and Benchmarks
ACIDE/user-vlm-pt • Updated Feb 14 • 132k • 61
ACIDE/user-vlm-instruct • Updated Feb 14 • 112k • 19
ACIDE/user-vlm-dpo • Updated Feb 14 • 17.2k • 14
ACIDE/user-vlm-face-bench • Updated Feb 14 • 1.2k • 21

FailSense 3B
Failure Detection for Robotic Manipulation with VLMs
ACIDE/FailSense-AHA-Calvin-1p-3b • Updated Jul 7
ACIDE/FailSense-AHA-Calvin-2p-3b • Updated Jul 14
ACIDE/FailSense-Video-Calvin-1p-3b • Updated Jul 14
ACIDE/FailSense-Video-Calvin-2p-3b • Updated Jul 14 • 2

User-VLM 360° Models
A series of Personalized Vision Language Models for Social Human-Robot Interactions
ACIDE/User-VLM-3B-base • Image-Text-to-Text • 3B • Updated about 8 hours ago • 3
ACIDE/User-VLM-10B-base • Image-Text-to-Text • 10B • Updated about 8 hours ago • 3
ACIDE/User-VLM-3B-Instruct • Visual Question Answering • Updated about 8 hours ago
ACIDE/User-VLM-10B-Instruct • Visual Question Answering • Updated about 8 hours ago