Update README.md
README.md
CHANGED
@@ -7,22 +7,25 @@ license: mit
 
 # Fine-tuning BERT for Fill Mask task on VizWiz dataset
 
-
-* model: bert-base-uncased
-* downstream_tasks: fill-mask
+## Fine-tuning information
+* model: **bert-base-uncased**
+* downstream_tasks: **fill-mask**
 
-
-* vizwiz_data_size: 22K
+## Dataset information
+* vizwiz_data_size: **22K**
+* max_token_len: 78
+### Tokens distribution
+![Token distribution](https://huggingface.co/gokuls/BERT-tiny-sst2/resolve/main/images/tokens_distribution.png)
 
-
-* random_seed: 16
-* max_token_len: 78
-* train_batch_size: 32
-* val_batch_size: 16
-* num_epochs: 5
-* learning_rate: 5e-06
-* split_train: 0.8
-* optimizer: adamw
+## Training information
+* random_seed: **16**
+* max_token_len: **78**
+* train_batch_size: **32**
+* val_batch_size: **16**
+* num_epochs: **5**
+* learning_rate: **5e-06**
+* split_train: **0.8**
+* optimizer: **adamw**
 
-
+## Learning curves
 ![Loss curves](https://huggingface.co/gokuls/BERT-tiny-sst2/resolve/main/images/loss_curves.png)
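
The hyperparameters listed in the README imply a few derived quantities (split sizes, steps per epoch) that are easy to check. Below is a minimal sketch, assuming `22K` means exactly 22,000 examples and that `split_train: 0.8` is the training fraction, with the remainder used for validation. The `FineTuneConfig` class and its property names are hypothetical and do not come from the repository.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class FineTuneConfig:
    """Values copied from the README; the class itself is illustrative."""
    model: str = "bert-base-uncased"
    data_size: int = 22_000  # assumption: "22K" read as exactly 22,000
    random_seed: int = 16
    max_token_len: int = 78
    train_batch_size: int = 32
    val_batch_size: int = 16
    num_epochs: int = 5
    learning_rate: float = 5e-6
    split_train: float = 0.8
    optimizer: str = "adamw"

    @property
    def train_size(self) -> int:
        # Number of examples in the training split.
        return round(self.data_size * self.split_train)

    @property
    def val_size(self) -> int:
        # Assumption: everything outside the training split is validation.
        return self.data_size - self.train_size

    @property
    def train_steps_per_epoch(self) -> int:
        # Batches per epoch, counting a final partial batch if there is one.
        return math.ceil(self.train_size / self.train_batch_size)

    @property
    def val_steps_per_epoch(self) -> int:
        return math.ceil(self.val_size / self.val_batch_size)

    @property
    def total_train_steps(self) -> int:
        # Optimizer steps over the whole run, assuming no gradient accumulation.
        return self.train_steps_per_epoch * self.num_epochs


config = FineTuneConfig()
print(config.train_size, config.val_size)
print(config.train_steps_per_epoch, config.total_train_steps)
```

Under these assumptions the split is 17,600 training and 4,400 validation examples, which gives 550 training steps per epoch and 2,750 over the five epochs.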