jdrechsel commited on
Commit
e8eba8d
·
verified ·
1 Parent(s): 027323f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -14
README.md CHANGED
@@ -1,16 +1,3 @@
1
- ---
2
- # For reference on model card metadata, see the spec: https://github.com/huggingface/hub-docs/blob/main/modelcard.md?plain=1
3
- # Doc / guide: https://huggingface.co/docs/hub/model-cards
4
- {}
5
- ---
6
- ---
7
- datasets:
8
- - ddrg/math_formulas
9
- - ddrg/math_formula_retrieval
10
- - ddrg/math_text
11
- - ddrg/named_math_formulas
12
- ---
13
-
14
  # MAMUT Bert (Mathematical Structure Aware BERT)
15
 
16
  <!-- Provide a quick summary of what the model is/does. -->
@@ -30,7 +17,7 @@ This model has been mathematically pretrained based on four tasks/datasets:
30
  - **[Named Math Formulas (NMF)](https://huggingface.co/datasets/ddrg/named_math_formulas):** Next-Sentence-Prediction (NSP)-like task associating a name of a well known mathematical identity (e.g., Pythagorean Theorem) with a formula representation (and the task is to classify whether the formula matches the identity described by the name)
31
  - **[Math Formula Retrieval (MFR)](https://huggingface.co/datasets/ddrg/math_formula_retrieval):** NSP-like task associating two formulas (and the task is to decide whether both describe the same mathematical concept(identity))
32
 
33
- [!Training Overview](img/mamutbert-training.png)
34
 
35
  Compared to bert-base-cased, 300 additional mathematical [LaTeX tokens](added_tokens.json) have been added before the mathematical pre-training.
36
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  # MAMUT Bert (Mathematical Structure Aware BERT)
2
 
3
  <!-- Provide a quick summary of what the model is/does. -->
 
17
  - **[Named Math Formulas (NMF)](https://huggingface.co/datasets/ddrg/named_math_formulas):** Next-Sentence-Prediction (NSP)-like task associating a name of a well known mathematical identity (e.g., Pythagorean Theorem) with a formula representation (and the task is to classify whether the formula matches the identity described by the name)
18
  - **[Math Formula Retrieval (MFR)](https://huggingface.co/datasets/ddrg/math_formula_retrieval):** NSP-like task associating two formulas (and the task is to decide whether both describe the same mathematical concept(identity))
19
 
20
+ ![Training Overview](mamutbert-training.png)
21
 
22
  Compared to bert-base-cased, 300 additional mathematical [LaTeX tokens](added_tokens.json) have been added before the mathematical pre-training.
23