madatnlp
/

ke-t5-scratch

Text2Text Generation

generated_from_keras_callback

Model card Files Files and versions Community

madatnlp commited on May 8, 2022

Commit

9ed7f46

·

1 Parent(s): 4f6593b

Training in progress epoch 0

Files changed (2) hide show

README.md +5 -32
tf_model.h5 +1 -1

README.md CHANGED Viewed

@@ -13,9 +13,9 @@ probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [madatnlp/ke-t5-math-py](https://huggingface.co/madatnlp/ke-t5-math-py) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Train Loss: 0.2662
-- Validation Loss: 0.7210
-- Epoch: 27
 ## Model description
@@ -34,41 +34,14 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- optimizer: {'name': 'Adam', 'learning_rate': 0.001, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False}
 - training_precision: float32
 ### Training results
 | Train Loss | Validation Loss | Epoch |
 |:----------:|:---------------:|:-----:|
-| 3.9330     | 1.9960          | 0     |
-| 1.9578     | 1.4484          | 1     |
-| 1.5358     | 1.3202          | 2     |
-| 1.3807     | 1.1251          | 3     |
-| 1.2885     | 1.0331          | 4     |
-| 1.1943     | 1.0004          | 5     |
-| 1.1366     | 0.9263          | 6     |
-| 1.0507     | 0.8866          | 7     |
-| 1.0160     | 0.8788          | 8     |
-| 0.9553     | 0.8301          | 9     |
-| 0.9149     | 0.8480          | 10    |
-| 0.8545     | 0.8021          | 11    |
-| 0.8271     | 0.7890          | 12    |
-| 0.7783     | 0.7549          | 13    |
-| 0.7166     | 0.6960          | 14    |
-| 0.6853     | 0.6828          | 15    |
-| 0.6142     | 0.7129          | 16    |
-| 0.5774     | 0.6368          | 17    |
-| 0.5612     | 0.6432          | 18    |
-| 0.4980     | 0.6483          | 19    |
-| 0.4723     | 0.6485          | 20    |
-| 0.4245     | 0.6569          | 21    |
-| 0.4040     | 0.6494          | 22    |
-| 0.3733     | 0.6970          | 23    |
-| 0.3461     | 0.7069          | 24    |
-| 0.3166     | 0.6597          | 25    |
-| 0.2912     | 0.6372          | 26    |
-| 0.2662     | 0.7210          | 27    |
 ### Framework versions

 This model is a fine-tuned version of [madatnlp/ke-t5-math-py](https://huggingface.co/madatnlp/ke-t5-math-py) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Train Loss: 13.5076
+- Validation Loss: 11.8125
+- Epoch: 0
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- optimizer: {'name': 'Adam', 'learning_rate': 1e-05, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False}
 - training_precision: float32
 ### Training results
 | Train Loss | Validation Loss | Epoch |
 |:----------:|:---------------:|:-----:|
+| 13.5076    | 11.8125         | 0     |
 ### Framework versions

tf_model.h5 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:75047029361c87147fddf308c342e6c668df9506dcc048984e4b9671edda7d05
 size 831509840

 version https://git-lfs.github.com/spec/v1
+oid sha256:ba93083be54c6ab91c48c6dab375b7fb03583825f33c4f2d84d3c75876f8f83a
 size 831509840