Nemotron-Personas Collection A collection of multilingual, region-specific synthetic persona datasets that support sovereign AI development across many countries and regions. • 5 items • Updated 13 days ago • 21
Running Featured 1.29k FineWeb: decanting the web for the finest text data at scale 🍷 1.29k Explore the FineWeb dataset and its creation process
A Flexible Large Language Models Guardrail Development Methodology Applied to Off-Topic Prompt Detection Paper • 2411.12946 • Published Nov 20, 2024 • 22
protectai/distilroberta-base-rejection-v1 Text Classification • 82.1M • Updated Mar 11, 2024 • 4.61k • • 8