My pretrained LMs on FineWeb datasets - part of my TensorFlow Model Garden LMs project
Stefan Schweter PRO
stefan-it
AI & ML interests
Flair Library 💕, NER & PoS Tagging, LM Pretraining (mostly encoder-only & encoder-decoder), Historical Language Models, German Language Models, Bavarian NLP
Recent Activity
published
a dataset
2 days ago
bavarian-nlp/barwiki-dumps
updated
a dataset
2 days ago
bavarian-nlp/barwiki-dumps
published
a dataset
3 days ago
bavarian-nlp/barwiki-20250801
Organizations
⚙️ Fine-Tuned Historical NER Models (hmTEAMS)
Fined-Tuned NER Models on Historical NER Datasets (HIPE-2022) with Flair and hmTEAMS as backbone LM
⚙️ Fine-Tuned Historical NER Models (hmByT5)
Fined-Tuned NER Models on Historical NER Datasets (HIPE-2022) with Flair and hmBERT as backbone LM
-
hmbyt5-preliminary/flair-hipe-2022-ajmc-de
Token Classification • Updated -
hmbyt5-preliminary/flair-hipe-2022-ajmc-en
Token Classification • Updated -
hmbyt5-preliminary/flair-hipe-2022-ajmc-fr
Token Classification • Updated • 1 -
hmbyt5-preliminary/flair-hipe-2022-newseye-de
Token Classification • Updated
⚙️ Fine-Tuned Historical NER Models (hmBERT Tiny)
Fined-Tuned NER Models on Historical NER Datasets (HIPE-2022) with Flair and hmBERT Tiny as backbone LM
🇹🇷 Turkish Language Models
My pretrained Language Models for Turkish
🇬🇪 Georgian NER Models
My fine-tuned NER models for Georgian
-
stefan-it/autotrain-flair-georgian-ner-xlm_r_large-bs4-e10-lr5e-06-1
Token Classification • Updated • 4 -
stefan-it/autotrain-flair-georgian-ner-xlm_r_large-bs4-e10-lr5e-06-2
Token Classification • Updated -
stefan-it/autotrain-flair-georgian-ner-xlm_r_large-bs4-e10-lr5e-06-3
Token Classification • Updated -
stefan-it/autotrain-flair-georgian-ner-xlm_r_large-bs4-e10-lr5e-06-4
Token Classification • Updated • 1
🧹 Fine-Tuned CleanCoNLL Models
My fine-tuned Flair NER models on CleanCoNLL dataset (with different seeds)
📚 Historical Multilingual Language Models
A Collection of Historical Multilingual Language Models
-
dbmdz/bert-base-historic-multilingual-cased
Fill-Mask • 0.1B • Updated • 334 • • 8 -
dbmdz/bert-base-historic-multilingual-64k-td-cased
Fill-Mask • 0.1B • Updated • 6 • 1 -
hmbyt5-preliminary/byt5-small-historic-multilingual-span20-flax
Updated • 2 -
hmteams/teams-base-historic-multilingual-discriminator
0.1B • Updated • 8
⚙️ Fine-Tuned Historical NER Models (hmBERT)
Fined-Tuned NER Models on Historical NER Datasets (HIPE-2022) with Flair and hmBERT as backbone LM
⚙️ Fine-Tuned Flair Models on German MobIE Dataset
Fine-Tuned Flair Models on German MobIE Dataset using 🤗 AutoTrain SpaceRunner
-
stefan-it/autotrain-flair-mobie-gbert_base-bs16-e10-lr5e-05-2
Token Classification • Updated • 2 -
stefan-it/autotrain-flair-mobie-gbert_base-bs16-e10-lr3e-05-3
Token Classification • Updated • 2 -
stefan-it/autotrain-flair-mobie-gbert_base-bs16-e10-lr5e-05-5
Token Classification • Updated • 2 -
stefan-it/autotrain-flair-mobie-gbert_base-bs16-e10-lr5e-05-3
Token Classification • Updated • 4 • 1
⚙️ Fine-Tuned Historical NER Models (hmBERT 64k)
Fined-Tuned NER Models on Historical NER Datasets (HIPE-2022) with Flair and hmBERT 64k as backbone LM
😱 Microsoft Papers with no code/data release
Collection of Microsoft Papers with no code/data release
-
MEGA: Multilingual Evaluation of Generative AI
Paper • 2303.12528 • Published -
MEGAVERSE: Benchmarking Large Language Models Across Languages, Modalities, Models and Tasks
Paper • 2311.07463 • Published • 15 -
Kosmos-2.5: A Multimodal Literate Model
Paper • 2309.11419 • Published • 50 -
A Unified View of Masked Image Modeling
Paper • 2210.10615 • Published
💼 Fine-Tuned CO-Funer Models
My fine-tuned Flair models on CO-FUN NER Dataset
🔧 xLSTM Language Models
My trained xLSTM LMs (under development)
🏡 FineWeb-LMs
My pretrained LMs on FineWeb datasets - part of my TensorFlow Model Garden LMs project
📚 Historical Multilingual Language Models
A Collection of Historical Multilingual Language Models
-
dbmdz/bert-base-historic-multilingual-cased
Fill-Mask • 0.1B • Updated • 334 • • 8 -
dbmdz/bert-base-historic-multilingual-64k-td-cased
Fill-Mask • 0.1B • Updated • 6 • 1 -
hmbyt5-preliminary/byt5-small-historic-multilingual-span20-flax
Updated • 2 -
hmteams/teams-base-historic-multilingual-discriminator
0.1B • Updated • 8
⚙️ Fine-Tuned Historical NER Models (hmTEAMS)
Fined-Tuned NER Models on Historical NER Datasets (HIPE-2022) with Flair and hmTEAMS as backbone LM
⚙️ Fine-Tuned Historical NER Models (hmBERT)
Fined-Tuned NER Models on Historical NER Datasets (HIPE-2022) with Flair and hmBERT as backbone LM
⚙️ Fine-Tuned Historical NER Models (hmByT5)
Fined-Tuned NER Models on Historical NER Datasets (HIPE-2022) with Flair and hmBERT as backbone LM
-
hmbyt5-preliminary/flair-hipe-2022-ajmc-de
Token Classification • Updated -
hmbyt5-preliminary/flair-hipe-2022-ajmc-en
Token Classification • Updated -
hmbyt5-preliminary/flair-hipe-2022-ajmc-fr
Token Classification • Updated • 1 -
hmbyt5-preliminary/flair-hipe-2022-newseye-de
Token Classification • Updated
⚙️ Fine-Tuned Flair Models on German MobIE Dataset
Fine-Tuned Flair Models on German MobIE Dataset using 🤗 AutoTrain SpaceRunner
-
stefan-it/autotrain-flair-mobie-gbert_base-bs16-e10-lr5e-05-2
Token Classification • Updated • 2 -
stefan-it/autotrain-flair-mobie-gbert_base-bs16-e10-lr3e-05-3
Token Classification • Updated • 2 -
stefan-it/autotrain-flair-mobie-gbert_base-bs16-e10-lr5e-05-5
Token Classification • Updated • 2 -
stefan-it/autotrain-flair-mobie-gbert_base-bs16-e10-lr5e-05-3
Token Classification • Updated • 4 • 1
⚙️ Fine-Tuned Historical NER Models (hmBERT Tiny)
Fined-Tuned NER Models on Historical NER Datasets (HIPE-2022) with Flair and hmBERT Tiny as backbone LM
⚙️ Fine-Tuned Historical NER Models (hmBERT 64k)
Fined-Tuned NER Models on Historical NER Datasets (HIPE-2022) with Flair and hmBERT 64k as backbone LM
🇹🇷 Turkish Language Models
My pretrained Language Models for Turkish
😱 Microsoft Papers with no code/data release
Collection of Microsoft Papers with no code/data release
-
MEGA: Multilingual Evaluation of Generative AI
Paper • 2303.12528 • Published -
MEGAVERSE: Benchmarking Large Language Models Across Languages, Modalities, Models and Tasks
Paper • 2311.07463 • Published • 15 -
Kosmos-2.5: A Multimodal Literate Model
Paper • 2309.11419 • Published • 50 -
A Unified View of Masked Image Modeling
Paper • 2210.10615 • Published
🇬🇪 Georgian NER Models
My fine-tuned NER models for Georgian
-
stefan-it/autotrain-flair-georgian-ner-xlm_r_large-bs4-e10-lr5e-06-1
Token Classification • Updated • 4 -
stefan-it/autotrain-flair-georgian-ner-xlm_r_large-bs4-e10-lr5e-06-2
Token Classification • Updated -
stefan-it/autotrain-flair-georgian-ner-xlm_r_large-bs4-e10-lr5e-06-3
Token Classification • Updated -
stefan-it/autotrain-flair-georgian-ner-xlm_r_large-bs4-e10-lr5e-06-4
Token Classification • Updated • 1
💼 Fine-Tuned CO-Funer Models
My fine-tuned Flair models on CO-FUN NER Dataset
🧹 Fine-Tuned CleanCoNLL Models
My fine-tuned Flair NER models on CleanCoNLL dataset (with different seeds)
🔧 xLSTM Language Models
My trained xLSTM LMs (under development)