boatbomber's picture
Add RobloxQA_OpenEnded
bed23b0
raw
history blame
648 Bytes
# Your leaderboard name
TITLE = """<h1 align="center" id="space-title">Roblox LLM Leaderboard</h1>"""
# What does your leaderboard evaluate?
INTRODUCTION_TEXT = """
Tracking LLM capabilities regarding Roblox game development.
Benchmarks:
- [RobloxQA](https://huggingface.co/datasets/boatbomber/RobloxQA-v1.0): Multiple choice question answering about Roblox APIs and concepts.
- [RobloxQA_OpenEnded](https://huggingface.co/datasets/boatbomber/RobloxQA-v1.0): Question answering about Roblox APIs and concepts without giving the multiple choices. \
Correctness judged by an LLM by comparing the generated answer to the correct answer choice.
"""