End of training

Files changed (4) hide show

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [t5-large](https://huggingface.co/t5-large) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.4813
 ## Model description
@@ -35,8 +35,8 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 4
-- eval_batch_size: 4
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
@@ -46,16 +46,16 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 15.6709       | 1.0   | 26   | 11.0887         |
-| 6.7694        | 2.0   | 52   | 5.8590          |
-| 2.3307        | 3.0   | 78   | 3.1000          |
-| 2.8756        | 4.0   | 104  | 1.5217          |
-| 1.3362        | 5.0   | 130  | 0.9868          |
-| 2.5012        | 6.0   | 156  | 0.7289          |
-| 1.6317        | 7.0   | 182  | 0.5822          |
-| 1.6992        | 8.0   | 208  | 0.5199          |
-| 0.8064        | 9.0   | 234  | 0.4914          |
-| 2.8149        | 10.0  | 260  | 0.4813          |
 ### Framework versions

 This model is a fine-tuned version of [t5-large](https://huggingface.co/t5-large) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.9452
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 8
+- eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 14.6117       | 1.0   | 13   | 12.1354         |
+| 4.7373        | 2.0   | 26   | 5.1054          |
+| 1.7596        | 3.0   | 39   | 3.2154          |
+| 1.3121        | 4.0   | 52   | 2.1497          |
+| 1.1923        | 5.0   | 65   | 1.6547          |
+| 0.3427        | 6.0   | 78   | 1.2120          |
+| 0.3447        | 7.0   | 91   | 1.1350          |
+| 1.8694        | 8.0   | 104  | 0.9754          |
+| 0.7097        | 9.0   | 117  | 0.9623          |
+| 1.4038        | 10.0  | 130  | 0.9452          |
 ### Framework versions

runs/Apr12_15-35-11_MacBook-Pro.local/events.out.tfevents.1712907313.MacBook-Pro.local.4407.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:1e7e83ae40608b43da58766578ce1a6f340e7a429364d640da6387f7d01917a5
+size 5604

runs/Apr12_15-36-28_MacBook-Pro.local/events.out.tfevents.1712907390.MacBook-Pro.local.4444.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:0dc087102cacc1b026c3f90707a67e69536c5436261ac90048ed641606302f8a
+size 35545

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c7f9c10a05dd7ff4231bae0141c5d2701c7c93df002e72140d6cc0add5cc8d39
 size 5112

 version https://git-lfs.github.com/spec/v1
+oid sha256:9ccb753145931f2866595b15a9e1be780b8431d487b1d0a09fba33c6c29d44ed
 size 5112