arXiv:2303.13310

SwissBERT: The Multilingual Language Model for Switzerland

Published on Mar 23, 2023
Authors: Jannis Vamvas, Johannes Graën, Rico Sennrich

Abstract

AI-generated summary

SwissBERT is a masked language model for the national languages of Switzerland. It outperforms previous models on Switzerland-related natural language understanding tasks, especially on contemporary news and Romansh Grischun, and its use of language adapters leaves room for future extension to further languages and dialects.

We present SwissBERT, a masked language model created specifically for processing Switzerland-related text. SwissBERT is a pre-trained model that we adapted to news articles written in the national languages of Switzerland -- German, French, Italian, and Romansh. We evaluate SwissBERT on natural language understanding tasks related to Switzerland and find that it tends to outperform previous models on these tasks, especially when processing contemporary news and/or Romansh Grischun. Since SwissBERT uses language adapters, it may be extended to Swiss German dialects in future work. The model and our open-source code are publicly released at https://github.com/ZurichNLP/swissbert.
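Since the abstract points to a public release, the sketch below shows how such a model could be queried for masked-token prediction with Hugging Face transformers. The checkpoint name ZurichNLP/swissbert, the X-MOD-style set_default_language call, and the adapter code "de_CH" follow the conventions of the linked repository and are assumptions here, not details stated in the abstract.

```python
# Minimal masked-token prediction with SwissBERT (a sketch, assuming the
# checkpoint at https://huggingface.co/ZurichNLP/swissbert and the X-MOD
# interface in transformers; adapter names like "de_CH" may differ).
import torch
from transformers import AutoModelForMaskedLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("ZurichNLP/swissbert")
model = AutoModelForMaskedLM.from_pretrained("ZurichNLP/swissbert")

# SwissBERT routes inputs through a language-specific adapter; select the
# Swiss German-language adapter before encoding German text.
model.set_default_language("de_CH")

text = f"Der Kanton {tokenizer.mask_token} hat die höchste Bevölkerungszahl der Schweiz."
inputs = tokenizer(text, return_tensors="pt")

with torch.no_grad():
    logits = model(**inputs).logits

# Read off the most likely tokens at the masked position.
mask_index = (inputs.input_ids == tokenizer.mask_token_id).nonzero(as_tuple=True)[1]
top_ids = logits[0, mask_index].topk(5).indices[0]
print(tokenizer.convert_ids_to_tokens(top_ids.tolist()))
```

Swapping the adapter (for example to "fr_CH", "it_CH", or "rm_CH") would reuse the same shared transformer body for another national language, which is the property the abstract highlights for a possible future extension to Swiss German dialects.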
