Set of BERT models with a modified embeddings layer
NeuML
company
Verified
AI & ML interests
Applying machine learning to solve everyday problems
Recent Activity
View all activity
Add knowledge to your txtai agents and processes.
Datasets with medical and scientific literature.
Models distilled with Model2Vec - 100K / 500K / 1M / 2M / 8M parameter variants.
-
NeuML/pubmedbert-base-embeddings-8M
Sentence Similarity • Updated • 27 • 8 -
NeuML/pubmedbert-base-embeddings-2M
Sentence Similarity • Updated • 72 • 3 -
NeuML/pubmedbert-base-embeddings-1M
Sentence Similarity • Updated • 22 • 2 -
NeuML/pubmedbert-base-embeddings-500K
Sentence Similarity • Updated • 48 • 2
Embeddings indexes and datasets for Wikipedia data.
Late interaction models
StaticVectors models to detect language. Exports of FastText that run in NumPy without needing FastText
Models for working with medical and scientific literature.
-
NeuML/pubmedbert-base-embeddings
Sentence Similarity • 0.1B • Updated • 180k • • 150 -
NeuML/pubmedbert-base-embeddings-matryoshka
Sentence Similarity • 0.1B • Updated • 1.61k • • 23 -
NeuML/pubmedbert-base-embeddings-8M
Sentence Similarity • Updated • 27 • 8 -
NeuML/pubmedbert-base-colbert
Sentence Similarity • Updated • 79 • 6
Text to Speech (TTS) models compatible with txtai's TextToSpeech pipeline.
Legacy word vectors (FastText, GloVe, Word2Vec) stored in the StaticVectors format
Set of BERT models with a modified embeddings layer
Late interaction models
Add knowledge to your txtai agents and processes.
StaticVectors models to detect language. Exports of FastText that run in NumPy without needing FastText
Datasets with medical and scientific literature.
Models for working with medical and scientific literature.
-
NeuML/pubmedbert-base-embeddings
Sentence Similarity • 0.1B • Updated • 180k • • 150 -
NeuML/pubmedbert-base-embeddings-matryoshka
Sentence Similarity • 0.1B • Updated • 1.61k • • 23 -
NeuML/pubmedbert-base-embeddings-8M
Sentence Similarity • Updated • 27 • 8 -
NeuML/pubmedbert-base-colbert
Sentence Similarity • Updated • 79 • 6
Models distilled with Model2Vec - 100K / 500K / 1M / 2M / 8M parameter variants.
-
NeuML/pubmedbert-base-embeddings-8M
Sentence Similarity • Updated • 27 • 8 -
NeuML/pubmedbert-base-embeddings-2M
Sentence Similarity • Updated • 72 • 3 -
NeuML/pubmedbert-base-embeddings-1M
Sentence Similarity • Updated • 22 • 2 -
NeuML/pubmedbert-base-embeddings-500K
Sentence Similarity • Updated • 48 • 2
Text to Speech (TTS) models compatible with txtai's TextToSpeech pipeline.
Embeddings indexes and datasets for Wikipedia data.
Legacy word vectors (FastText, GloVe, Word2Vec) stored in the StaticVectors format