whathefish commited on
Commit
badd8d7
·
1 Parent(s): 08a576a

Training in progress epoch 0

Browse files
.gitignore ADDED
@@ -0,0 +1 @@
 
 
1
+ checkpoint-*/
README.md ADDED
@@ -0,0 +1,54 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ tags:
4
+ - generated_from_keras_callback
5
+ model-index:
6
+ - name: whathefish/my_awesome_model
7
+ results: []
8
+ ---
9
+
10
+ <!-- This model card has been generated automatically according to the information Keras had access to. You should
11
+ probably proofread and complete it, then remove this comment. -->
12
+
13
+ # whathefish/my_awesome_model
14
+
15
+ This model is a fine-tuned version of [distilbert-base-german-cased](https://huggingface.co/distilbert-base-german-cased) on an unknown dataset.
16
+ It achieves the following results on the evaluation set:
17
+ - Train Loss: 0.5923
18
+ - Validation Loss: 0.5516
19
+ - Train Accuracy: 0.7467
20
+ - Epoch: 0
21
+
22
+ ## Model description
23
+
24
+ More information needed
25
+
26
+ ## Intended uses & limitations
27
+
28
+ More information needed
29
+
30
+ ## Training and evaluation data
31
+
32
+ More information needed
33
+
34
+ ## Training procedure
35
+
36
+ ### Training hyperparameters
37
+
38
+ The following hyperparameters were used during training:
39
+ - optimizer: {'name': 'Adam', 'learning_rate': {'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 2e-05, 'decay_steps': 215, 'end_learning_rate': 0.0, 'power': 1.0, 'cycle': False, 'name': None}}, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-08, 'amsgrad': False}
40
+ - training_precision: float32
41
+
42
+ ### Training results
43
+
44
+ | Train Loss | Validation Loss | Train Accuracy | Epoch |
45
+ |:----------:|:---------------:|:--------------:|:-----:|
46
+ | 0.5923 | 0.5516 | 0.7467 | 0 |
47
+
48
+
49
+ ### Framework versions
50
+
51
+ - Transformers 4.26.1
52
+ - TensorFlow 2.9.0
53
+ - Datasets 2.9.0
54
+ - Tokenizers 0.13.2
config.json ADDED
@@ -0,0 +1,24 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_name_or_path": "distilbert-base-german-cased",
3
+ "activation": "gelu",
4
+ "architectures": [
5
+ "DistilBertForSequenceClassification"
6
+ ],
7
+ "attention_dropout": 0.1,
8
+ "dim": 768,
9
+ "dropout": 0.1,
10
+ "hidden_dim": 3072,
11
+ "initializer_range": 0.02,
12
+ "max_position_embeddings": 512,
13
+ "model_type": "distilbert",
14
+ "n_heads": 12,
15
+ "n_layers": 6,
16
+ "output_past": true,
17
+ "pad_token_id": 0,
18
+ "qa_dropout": 0.1,
19
+ "seq_classif_dropout": 0.2,
20
+ "sinusoidal_pos_embds": true,
21
+ "tie_weights_": true,
22
+ "transformers_version": "4.26.1",
23
+ "vocab_size": 31102
24
+ }
runs/Feb13_22-50-23_Janniss-MBP/1676325028.7793531/events.out.tfevents.1676325028.Janniss-MBP.1424.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:54c17dfed734976bd1e6678347462c2bf7941a4cc2a1b978c8bc4bac482df5e3
3
+ size 5663
runs/Feb13_22-50-23_Janniss-MBP/events.out.tfevents.1676325028.Janniss-MBP.1424.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:41b403c9223ac27bd28562d7d8e02240d34b871680f2665eeea2691a62e1237a
3
+ size 3718
runs/Feb13_23-10-00_Janniss-MBP/1676326204.277919/events.out.tfevents.1676326204.Janniss-MBP.1424.3 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e5c94408898054a6eeab49787430545d49baffeaa464b1d680478e704af35b39
3
+ size 5663
runs/Feb13_23-10-00_Janniss-MBP/events.out.tfevents.1676326204.Janniss-MBP.1424.2 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0b9d629e9ce29ab50577a0574c924f5324399d045c7861519433716449293315
3
+ size 3802
runs/Feb13_23-20-26_Janniss-MBP/1676326830.769512/events.out.tfevents.1676326830.Janniss-MBP.1776.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:da27a9bea0311a32e4e42cd890054be516c2ffa9d29ddf65f4ad5a0e4ace8c33
3
+ size 5663
runs/Feb13_23-20-26_Janniss-MBP/events.out.tfevents.1676326830.Janniss-MBP.1776.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:15015c500fdaf6204810f5385edb2a087f317c042f1831b1a830390b6647d57f
3
+ size 3718
runs/Feb13_23-25-18_Janniss-MBP/1676327122.33038/events.out.tfevents.1676327122.Janniss-MBP.1776.3 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c9a7ca0696d7b526b8a96f0aaa7821865b681e93c54806e17c17a997d08f98d1
3
+ size 5663
runs/Feb13_23-25-18_Janniss-MBP/events.out.tfevents.1676327122.Janniss-MBP.1776.2 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f014a022b2a6c7eded1b57a44a0fe7ad91e1364970eea69a69f94c324169346e
3
+ size 3718
runs/Feb13_23-28-50_Janniss-MBP/1676327335.582318/events.out.tfevents.1676327335.Janniss-MBP.1776.5 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6df736c76216e3f88019a00dc26506e48061eaae2f96fe6fa5254e82810be69d
3
+ size 5663
runs/Feb13_23-28-50_Janniss-MBP/events.out.tfevents.1676327335.Janniss-MBP.1776.4 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b08de06f117a82d7bb88673fa8d74a14aa41a4e950bdc1c7334f4ac733523196
3
+ size 3718
runs/Feb14_08-49-59_Janniss-MBP/1676361003.275512/events.out.tfevents.1676361003.Janniss-MBP.10037.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ec2beada9d862a108b61dda2a29c2c93d8e43e2649099e6b3886780ae24ae043
3
+ size 5663
runs/Feb14_08-49-59_Janniss-MBP/events.out.tfevents.1676361003.Janniss-MBP.10037.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:48bf63eeb537c4ae052eafbd8db5f6ba212933fc7fc8023a202ad2f81dd1716a
3
+ size 3802
special_tokens_map.json ADDED
@@ -0,0 +1,7 @@
 
 
 
 
 
 
 
 
1
+ {
2
+ "cls_token": "[CLS]",
3
+ "mask_token": "[MASK]",
4
+ "pad_token": "[PAD]",
5
+ "sep_token": "[SEP]",
6
+ "unk_token": "[UNK]"
7
+ }
tf_model.h5 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:aa284551b780a26180573930024b66571b3a80f5ca917028d2c23a524949431b
3
+ size 269733656
tokenizer.json ADDED
The diff for this file is too large to render. See raw diff
 
tokenizer_config.json ADDED
@@ -0,0 +1,14 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "cls_token": "[CLS]",
3
+ "do_lower_case": false,
4
+ "mask_token": "[MASK]",
5
+ "model_max_length": 512,
6
+ "name_or_path": "distilbert-base-german-cased",
7
+ "pad_token": "[PAD]",
8
+ "sep_token": "[SEP]",
9
+ "special_tokens_map_file": null,
10
+ "strip_accents": null,
11
+ "tokenize_chinese_chars": true,
12
+ "tokenizer_class": "DistilBertTokenizer",
13
+ "unk_token": "[UNK]"
14
+ }
vocab.txt ADDED
The diff for this file is too large to render. See raw diff