rajpurkar/squad
Viewer • Updated • 98.2k • 160k • 364
How to use LLukas22/paraphrase-multilingual-mpnet-base-v2-embedding-all with sentence-transformers:
from sentence_transformers import SentenceTransformer
model = SentenceTransformer("LLukas22/paraphrase-multilingual-mpnet-base-v2-embedding-all")
sentences = [
"That is a happy person",
"That is a happy dog",
"That is a very happy person",
"Today is a sunny day"
]
embeddings = model.encode(sentences)
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [4, 4]How to use LLukas22/paraphrase-multilingual-mpnet-base-v2-embedding-all with Transformers:
# Load model directly
from transformers import AutoTokenizer, AutoModel
tokenizer = AutoTokenizer.from_pretrained("LLukas22/paraphrase-multilingual-mpnet-base-v2-embedding-all")
model = AutoModel.from_pretrained("LLukas22/paraphrase-multilingual-mpnet-base-v2-embedding-all")This model is a fine-tuned version of paraphrase-multilingual-mpnet-base-v2 on the following datasets: squad, newsqa, LLukas22/cqadupstack, LLukas22/fiqa, LLukas22/scidocs, deepset/germanquad, LLukas22/nq.
Using this model becomes easy when you have sentence-transformers installed:
pip install -U sentence-transformers
Then you can use the model like this:
from sentence_transformers import SentenceTransformer
sentences = ["This is an example sentence", "Each sentence is converted"]
model = SentenceTransformer('LLukas22/paraphrase-multilingual-mpnet-base-v2-embedding-all')
embeddings = model.encode(sentences)
print(embeddings)
The following hyperparameters were used during training:
| Epoch | Train Loss | Validation Loss |
|---|---|---|
| 0 | 0.085 | 0.0625 |
| 1 | 0.0598 | 0.0554 |
| 2 | 0.0484 | 0.0518 |
| 3 | 0.0405 | 0.0485 |
| 4 | 0.0341 | 0.0463 |
| 5 | 0.0287 | 0.0454 |
| 6 | 0.0243 | 0.0445 |
| 7 | 0.0207 | 0.0426 |
| 8 | 0.0177 | 0.0424 |
| 9 | 0.0153 | 0.0421 |
| 10 | 0.0134 | 0.0417 |
| 11 | 0.012 | 0.0411 |
| 12 | 0.011 | 0.0414 |
| Epoch | top_1 | top_3 | top_5 | top_10 | top_25 |
|---|---|---|---|---|---|
| 0 | 0.261 | 0.351 | 0.384 | 0.422 | 0.459 |
| 1 | 0.272 | 0.365 | 0.4 | 0.439 | 0.477 |
| 2 | 0.276 | 0.37 | 0.404 | 0.443 | 0.481 |
| 3 | 0.292 | 0.391 | 0.426 | 0.465 | 0.503 |
| 4 | 0.295 | 0.395 | 0.431 | 0.47 | 0.51 |
| 5 | 0.299 | 0.4 | 0.437 | 0.476 | 0.514 |
| 6 | 0.306 | 0.404 | 0.44 | 0.478 | 0.515 |
| 7 | 0.309 | 0.41 | 0.445 | 0.485 | 0.521 |
| 8 | 0.31 | 0.411 | 0.448 | 0.487 | 0.524 |
| 9 | 0.315 | 0.417 | 0.454 | 0.493 | 0.529 |
| 10 | 0.319 | 0.42 | 0.457 | 0.495 | 0.53 |
| 11 | 0.323 | 0.424 | 0.46 | 0.497 | 0.531 |
| 12 | 0.324 | 0.427 | 0.464 | 0.501 | 0.536 |
This model was trained as part of my Master's Thesis 'Evaluation of transformer based language models for use in service information systems'. The source code is available on Github.