Pre-training Data Quality and Quantity for a Low-Resource Language: New Corpus and BERT Models for Maltese
Paper • 2205.10517 • Published
University of Malta NLP Group, with a main interest in developing tools for processing the Maltese language.