Commit 27dac23 by cy948 · verified · 1 Parent(s): 92b89b2

End of training

Files changed (1)
  1. README.md +16 -18
README.md CHANGED
@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [bigcode/starcoderbase-1b](https://huggingface.co/bigcode/starcoderbase-1b) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.7299
+- Loss: 0.8171
 
 ## Model description
 
@@ -44,28 +44,26 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_steps: 30
-- training_steps: 1600
+- training_steps: 1400
 
 ### Training results
 
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 1.1981 | 0.0625 | 100 | 1.1007 |
-| 0.9021 | 0.125 | 200 | 0.8910 |
-| 0.7494 | 0.1875 | 300 | 0.8009 |
-| 0.6504 | 0.25 | 400 | 0.7542 |
-| 0.5704 | 0.3125 | 500 | 0.7279 |
-| 0.5081 | 0.375 | 600 | 0.7175 |
-| 0.4696 | 0.4375 | 700 | 0.7109 |
-| 0.4245 | 0.5 | 800 | 0.7151 |
-| 0.3962 | 0.5625 | 900 | 0.7135 |
-| 0.3686 | 0.625 | 1000 | 0.7156 |
-| 0.3514 | 0.6875 | 1100 | 0.7244 |
-| 0.3418 | 0.75 | 1200 | 0.7268 |
-| 0.3317 | 0.8125 | 1300 | 0.7302 |
-| 0.3228 | 0.875 | 1400 | 0.7301 |
-| 0.3211 | 0.9375 | 1500 | 0.7304 |
-| 0.3188 | 1.0 | 1600 | 0.7299 |
+| 0.9892 | 0.0714 | 100 | 0.9351 |
+| 0.7083 | 0.1429 | 200 | 0.7751 |
+| 0.5442 | 0.2143 | 300 | 0.7227 |
+| 0.4403 | 0.2857 | 400 | 0.7127 |
+| 0.3724 | 0.3571 | 500 | 0.7285 |
+| 0.3268 | 0.4286 | 600 | 0.7396 |
+| 0.3057 | 0.5 | 700 | 0.7587 |
+| 0.2798 | 0.5714 | 800 | 0.7704 |
+| 0.2616 | 0.6429 | 900 | 0.7888 |
+| 0.2418 | 0.7143 | 1000 | 0.8010 |
+| 0.2314 | 0.7857 | 1100 | 0.8092 |
+| 0.2268 | 0.8571 | 1200 | 0.8147 |
+| 0.223 | 0.9286 | 1300 | 0.8185 |
+| 0.2207 | 1.0 | 1400 | 0.8171 |
 
 
 ### Framework versions
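
A note on reading the two results tables in this diff: in both runs the reported final loss is the last logged validation loss, not the lowest one. The sketch below is a minimal illustration, assuming the validation losses are transcribed correctly from the tables and were logged every 100 steps; the names `OLD_RUN`, `NEW_RUN`, and `best_checkpoint` are hypothetical and not part of the training code.

```python
# Validation losses copied from the removed (1600-step) and added (1400-step)
# "Training results" tables in this diff, one value per 100 training steps.
# All names here are illustrative; none come from the original training script.
OLD_RUN = [1.1007, 0.8910, 0.8009, 0.7542, 0.7279, 0.7175, 0.7109, 0.7151,
           0.7135, 0.7156, 0.7244, 0.7268, 0.7302, 0.7301, 0.7304, 0.7299]
NEW_RUN = [0.9351, 0.7751, 0.7227, 0.7127, 0.7285, 0.7396, 0.7587, 0.7704,
           0.7888, 0.8010, 0.8092, 0.8147, 0.8185, 0.8171]
EVAL_EVERY = 100  # assumed evaluation interval, inferred from the Step column


def best_checkpoint(val_losses, eval_every=EVAL_EVERY):
    """Return (step, loss) for the lowest validation loss in a run."""
    idx = min(range(len(val_losses)), key=val_losses.__getitem__)
    return (idx + 1) * eval_every, val_losses[idx]


if __name__ == "__main__":
    for name, run in (("old, 1600 steps", OLD_RUN), ("new, 1400 steps", NEW_RUN)):
        step, loss = best_checkpoint(run)
        final = run[-1]
        print(f"{name}: best {loss:.4f} at step {step}, "
              f"final {final:.4f}, gap {final - loss:+.4f}")
```

On these numbers, the old run bottoms out at step 700 (0.7109) and the new run at step 400 (0.7127), while training loss keeps falling in both. That pattern is consistent with overfitting after the minimum, so keeping the checkpoint with the lowest validation loss may be preferable to the final one.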