Update README.md
Browse files
README.md
CHANGED
@@ -1,16 +1,3 @@
|
|
1 |
-
---
|
2 |
-
# For reference on model card metadata, see the spec: https://github.com/huggingface/hub-docs/blob/main/modelcard.md?plain=1
|
3 |
-
# Doc / guide: https://huggingface.co/docs/hub/model-cards
|
4 |
-
{}
|
5 |
-
---
|
6 |
-
---
|
7 |
-
datasets:
|
8 |
-
- ddrg/math_formulas
|
9 |
-
- ddrg/math_formula_retrieval
|
10 |
-
- ddrg/math_text
|
11 |
-
- ddrg/named_math_formulas
|
12 |
-
---
|
13 |
-
|
14 |
# MAMUT Bert (Mathematical Structure Aware BERT)
|
15 |
|
16 |
<!-- Provide a quick summary of what the model is/does. -->
|
@@ -30,7 +17,7 @@ This model has been mathematically pretrained based on four tasks/datasets:
|
|
30 |
- **[Named Math Formulas (NMF)](https://huggingface.co/datasets/ddrg/named_math_formulas):** Next-Sentence-Prediction (NSP)-like task associating a name of a well known mathematical identity (e.g., Pythagorean Theorem) with a formula representation (and the task is to classify whether the formula matches the identity described by the name)
|
31 |
- **[Math Formula Retrieval (MFR)](https://huggingface.co/datasets/ddrg/math_formula_retrieval):** NSP-like task associating two formulas (and the task is to decide whether both describe the same mathematical concept(identity))
|
32 |
|
33 |
-
[
|
34 |
|
35 |
Compared to bert-base-cased, 300 additional mathematical [LaTeX tokens](added_tokens.json) have been added before the mathematical pre-training.
|
36 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
# MAMUT Bert (Mathematical Structure Aware BERT)
|
2 |
|
3 |
<!-- Provide a quick summary of what the model is/does. -->
|
|
|
17 |
- **[Named Math Formulas (NMF)](https://huggingface.co/datasets/ddrg/named_math_formulas):** Next-Sentence-Prediction (NSP)-like task associating a name of a well known mathematical identity (e.g., Pythagorean Theorem) with a formula representation (and the task is to classify whether the formula matches the identity described by the name)
|
18 |
- **[Math Formula Retrieval (MFR)](https://huggingface.co/datasets/ddrg/math_formula_retrieval):** NSP-like task associating two formulas (and the task is to decide whether both describe the same mathematical concept(identity))
|
19 |
|
20 |
+

|
21 |
|
22 |
Compared to bert-base-cased, 300 additional mathematical [LaTeX tokens](added_tokens.json) have been added before the mathematical pre-training.
|
23 |
|