view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) By natolambert and 3 others • Dec 9, 2022 • 269
The Vault: A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generation Paper • 2305.06156 • Published May 9, 2023 • 2
CodeMMLU: A Multi-Task Benchmark for Assessing Code Understanding Capabilities of CodeLLMs Paper • 2410.01999 • Published Oct 2, 2024 • 10
CodeMMLU Collection CodeMMLU: A Multi-Task Benchmark for Assessing Code Understanding Capabilities • 2 items • Updated Oct 15, 2024 • 1
The Vault: A Comprehensive Multilingual Dataset for Advancing Code Understanding and Generation Paper • 2305.06156 • Published May 9, 2023 • 2
CodeMMLU Collection CodeMMLU: A Multi-Task Benchmark for Assessing Code Understanding Capabilities • 2 items • Updated Oct 15, 2024 • 1
CodeMMLU Collection CodeMMLU: A Multi-Task Benchmark for Assessing Code Understanding Capabilities • 2 items • Updated Oct 15, 2024 • 1