ddrg/math_structure_bert

jdrechsel committed on Apr 25

Commit e8eba8d · verified · 1 Parent(s): 027323f

Update README.md

Files changed (1)

README.md +1 -14

@@ -1,16 +1,3 @@
----
-# For reference on model card metadata, see the spec: https://github.com/huggingface/hub-docs/blob/main/modelcard.md?plain=1
-# Doc / guide: https://huggingface.co/docs/hub/model-cards
-{}
----
----
-datasets:
-- ddrg/math_formulas
-- ddrg/math_formula_retrieval
-- ddrg/math_text
-- ddrg/named_math_formulas
----
 # MAMUT Bert (Mathematical Structure Aware BERT)
 <!-- Provide a quick summary of what the model is/does. -->
@@ -30,7 +17,7 @@ This model has been mathematically pretrained based on four tasks/datasets:
 - **[Named Math Formulas (NMF)](https://huggingface.co/datasets/ddrg/named_math_formulas):** Next-Sentence-Prediction (NSP)-like task associating a name of a well known mathematical identity (e.g., Pythagorean Theorem) with a formula representation (and the task is to classify whether the formula matches the identity described by the name)
 - **[Math Formula Retrieval (MFR)](https://huggingface.co/datasets/ddrg/math_formula_retrieval):** NSP-like task associating two formulas (and the task is to decide whether both describe the same mathematical concept(identity))
-[!Training Overview](img/mamutbert-training.png)
+![Training Overview](mamutbert-training.png)
 Compared to bert-base-cased, 300 additional mathematical [LaTeX tokens](added_tokens.json) have been added before the mathematical pre-training.