Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
209.6
TFLOPS
1774
284
123
Stefan Schweter
PRO
stefan-it
Follow
Lanine's profile picture
Piroman's profile picture
hrishi4musiq's profile picture
3444 followers
Β·
343 following
https://schweter.bayern
stefan-it
stefan-it
AI & ML interests
Flair Library π, NER & PoS Tagging, LM Pretraining (mostly encoder-only & encoder-decoder), Historical Language Models, German Language Models, Bavarian NLP π₯¨
Recent Activity
upvoted
a
paper
4 minutes ago
Huxley-GΓΆdel Machine: Human-Level Coding Agent Development by an Approximation of the Optimal Self-Improving Machine
upvoted
a
collection
about 1 hour ago
BabyLM 2025
commented
on
a paper
about 1 hour ago
Mask and You Shall Receive: Optimizing Masked Language Modeling For Pretraining BabyLMs
View all activity
Organizations
stefan-it
's models
1,342
Sort:Β Recently updated
stefan-it/nanochat-german-v1
0.6B
β’
Updated
1 day ago
β’
52
β’
1
stefan-it/nanochat-german-base-checkpoint
Updated
2 days ago
stefan-it/nanochat-german-base
0.6B
β’
Updated
3 days ago
β’
17
stefan-it/nanochat-german-tokenizer
Updated
3 days ago
β’
5
stefan-it/ettin-encoder-400m-tokenizer-fix
Fill-Mask
β’
0.4B
β’
Updated
Jul 20
β’
3
stefan-it/flair-ettin-400m-ner-conll03
Updated
Jul 17
stefan-it/ModernBERT-large-tokenizer-fix
Fill-Mask
β’
0.4B
β’
Updated
Jul 16
β’
4
stefan-it/flair-modernbert-large-ner-conll03
Updated
May 9
stefan-it/bert5urk
1B
β’
Updated
Mar 3
β’
33
β’
11
stefan-it/neobert-ner-conll03
0.2B
β’
Updated
Mar 2
β’
8
β’
1
stefan-it/electra-base-gc4-64k-0-cased-discriminator
0.1B
β’
Updated
Mar 1
β’
4
β’
1
stefan-it/electra-base-gc4-64k-100000-cased-discriminator
0.1B
β’
Updated
Mar 1
β’
5
stefan-it/electra-base-gc4-64k-200000-cased-discriminator
0.1B
β’
Updated
Mar 1
β’
4
stefan-it/electra-base-gc4-64k-300000-cased-discriminator
0.1B
β’
Updated
Mar 1
β’
3
stefan-it/electra-base-gc4-64k-400000-cased-discriminator
0.1B
β’
Updated
Mar 1
β’
4
stefan-it/electra-base-gc4-64k-500000-cased-discriminator
0.1B
β’
Updated
Mar 1
β’
1
stefan-it/electra-base-gc4-64k-600000-cased-discriminator
0.1B
β’
Updated
Mar 1
β’
7
stefan-it/electra-base-gc4-64k-700000-cased-discriminator
0.1B
β’
Updated
Mar 1
β’
3
stefan-it/electra-base-gc4-64k-800000-cased-discriminator
0.1B
β’
Updated
Mar 1
β’
4
stefan-it/electra-base-gc4-64k-900000-cased-discriminator
0.1B
β’
Updated
Mar 1
β’
4
stefan-it/electra-base-gc4-64k-1000000-cased-discriminator
0.1B
β’
Updated
Mar 1
β’
2
stefan-it/electra-base-gc4-64k-300000-cased-generator
Fill-Mask
β’
59.5M
β’
Updated
Mar 1
β’
2
stefan-it/electra-base-gc4-64k-400000-cased-generator
Fill-Mask
β’
59.5M
β’
Updated
Mar 1
β’
5
stefan-it/electra-base-gc4-64k-500000-cased-generator
Fill-Mask
β’
59.5M
β’
Updated
Mar 1
β’
3
stefan-it/electra-base-gc4-64k-600000-cased-generator
Fill-Mask
β’
59.5M
β’
Updated
Mar 1
β’
2
stefan-it/electra-base-gc4-64k-700000-cased-generator
Fill-Mask
β’
59.5M
β’
Updated
Mar 1
β’
2
stefan-it/electra-base-gc4-64k-800000-cased-generator
Fill-Mask
β’
59.5M
β’
Updated
Mar 1
β’
3
stefan-it/electra-base-gc4-64k-900000-cased-generator
Fill-Mask
β’
59.5M
β’
Updated
Mar 1
β’
6
stefan-it/electra-base-gc4-64k-1000000-cased-generator
Fill-Mask
β’
59.5M
β’
Updated
Mar 1
β’
5
stefan-it/it5-efficient-small-el32
0.1B
β’
Updated
Feb 24
β’
6
β’
2
Previous
1
2
3
...
45
Next