cy948
/

starcoder-peft-airscript

Generated from Trainer

Model card Files Files and versions Community

cy948 commited on Oct 26, 2024

Commit

27dac23

·

verified ·

1 Parent(s): 92b89b2

End of training

Files changed (1) hide show

README.md +16 -18

README.md CHANGED Viewed

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [bigcode/starcoderbase-1b](https://huggingface.co/bigcode/starcoderbase-1b) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.7299
 ## Model description
@@ -44,28 +44,26 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_steps: 30
-- training_steps: 1600
 ### Training results
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 1.1981        | 0.0625 | 100  | 1.1007          |
-| 0.9021        | 0.125  | 200  | 0.8910          |
-| 0.7494        | 0.1875 | 300  | 0.8009          |
-| 0.6504        | 0.25   | 400  | 0.7542          |
-| 0.5704        | 0.3125 | 500  | 0.7279          |
-| 0.5081        | 0.375  | 600  | 0.7175          |
-| 0.4696        | 0.4375 | 700  | 0.7109          |
-| 0.4245        | 0.5    | 800  | 0.7151          |
-| 0.3962        | 0.5625 | 900  | 0.7135          |
-| 0.3686        | 0.625  | 1000 | 0.7156          |
-| 0.3514        | 0.6875 | 1100 | 0.7244          |
-| 0.3418        | 0.75   | 1200 | 0.7268          |
-| 0.3317        | 0.8125 | 1300 | 0.7302          |
-| 0.3228        | 0.875  | 1400 | 0.7301          |
-| 0.3211        | 0.9375 | 1500 | 0.7304          |
-| 0.3188        | 1.0    | 1600 | 0.7299          |
 ### Framework versions

 This model is a fine-tuned version of [bigcode/starcoderbase-1b](https://huggingface.co/bigcode/starcoderbase-1b) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.8171
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_steps: 30
+- training_steps: 1400
 ### Training results
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
+| 0.9892        | 0.0714 | 100  | 0.9351          |
+| 0.7083        | 0.1429 | 200  | 0.7751          |
+| 0.5442        | 0.2143 | 300  | 0.7227          |
+| 0.4403        | 0.2857 | 400  | 0.7127          |
+| 0.3724        | 0.3571 | 500  | 0.7285          |
+| 0.3268        | 0.4286 | 600  | 0.7396          |
+| 0.3057        | 0.5    | 700  | 0.7587          |
+| 0.2798        | 0.5714 | 800  | 0.7704          |
+| 0.2616        | 0.6429 | 900  | 0.7888          |
+| 0.2418        | 0.7143 | 1000 | 0.8010          |
+| 0.2314        | 0.7857 | 1100 | 0.8092          |
+| 0.2268        | 0.8571 | 1200 | 0.8147          |
+| 0.223         | 0.9286 | 1300 | 0.8185          |
+| 0.2207        | 1.0    | 1400 | 0.8171          |
 ### Framework versions