Interesting Datasets A collection of datasets that I come across LDJnr/Capybara Viewer • Updated Jun 7, 2024 • 16k • 1.67k • 248 teknium/openhermes Viewer • Updated Sep 7, 2023 • 243k • 1.1k • 219 VMware/open-instruct Viewer • Updated Jul 12, 2023 • 143k • 129 • 44 euclaise/WritingPrompts_preferences Viewer • Updated Dec 25, 2023 • 265k • 74 • 10
Requires Filtering Datasets that HAS TO BE FILTERED. Squish42/bluemoon-fandom-1-1-rp-cleaned Updated Jul 9, 2023 • 157 • 78
Augmentable A collection of datasets that should be augmented further with gpt-4 allenai/ai2_arc Viewer • Updated Dec 21, 2023 • 7.79k • 336k • 314 codeparrot/apps Updated Oct 20, 2022 • 17.2k • 201 facebook/belebele Viewer • Updated Aug 12, 2024 • 110k • 10.7k • 122 google/boolq Viewer • Updated Jan 22, 2024 • 12.7k • 31.7k • 99
Interesting Datasets A collection of datasets that I come across LDJnr/Capybara Viewer • Updated Jun 7, 2024 • 16k • 1.67k • 248 teknium/openhermes Viewer • Updated Sep 7, 2023 • 243k • 1.1k • 219 VMware/open-instruct Viewer • Updated Jul 12, 2023 • 143k • 129 • 44 euclaise/WritingPrompts_preferences Viewer • Updated Dec 25, 2023 • 265k • 74 • 10
Augmentable A collection of datasets that should be augmented further with gpt-4 allenai/ai2_arc Viewer • Updated Dec 21, 2023 • 7.79k • 336k • 314 codeparrot/apps Updated Oct 20, 2022 • 17.2k • 201 facebook/belebele Viewer • Updated Aug 12, 2024 • 110k • 10.7k • 122 google/boolq Viewer • Updated Jan 22, 2024 • 12.7k • 31.7k • 99
Requires Filtering Datasets that HAS TO BE FILTERED. Squish42/bluemoon-fandom-1-1-rp-cleaned Updated Jul 9, 2023 • 157 • 78