Article: Introducing HalluMix: A Task-Agnostic, Multi-Domain Benchmark for Detecting Hallucinations in Real-World Scenarios. By quotientai and 3 others • May 2
Open LLM Leaderboard 🏆: Track, rank, and evaluate open LLMs and chatbots