Each dataset is split into easy, medium and a difficult split using the familiarity metric. Please see our paper for details.
Jonas Golde
whoisjones
AI & ML interests
Data-efficient transfer learning
Recent Activity
updated
a model
about 10 hours ago
whoisjones/finerweb-multilabel-classifier-xlmr-4o
updated
a model
about 10 hours ago
whoisjones/finerweb-binary-classifier-xlmr-4o
updated
a model
about 10 hours ago
whoisjones/finerweb-binary-classifier-xlmr-gemma3