filippo571 commited on
Commit
795a3c6
·
1 Parent(s): 4a905ed

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +15 -13
README.md CHANGED
@@ -7,12 +7,11 @@ model-index:
7
  results: []
8
  ---
9
 
10
- <!-- This model card has been generated automatically according to the information Keras had access to. You should
11
- probably proofread and complete it, then remove this comment. -->
12
-
13
  # pull_request_comments_model
14
 
15
- This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on an unknown dataset.
 
 
16
  It achieves the following results on the evaluation set:
17
  - Train Loss: 0.0791
18
  - Train Accuracy: 0.9955
@@ -20,23 +19,26 @@ It achieves the following results on the evaluation set:
20
  - Validation Accuracy: 0.8291
21
  - Epoch: 12
22
 
23
- ## Model description
24
 
25
- More information needed
 
 
 
 
 
 
26
 
27
  ## Intended uses & limitations
28
 
29
- More information needed
30
-
31
- ## Training and evaluation data
32
-
33
- More information needed
34
-
35
- ## Training procedure
36
 
37
  ### Training hyperparameters
38
 
39
  The following hyperparameters were used during training:
 
 
40
  - optimizer: {'name': 'Adam', 'learning_rate': {'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 2e-05, 'decay_steps': 280, 'end_learning_rate': 0.0, 'power': 1.0, 'cycle': False, 'name': None}}, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-08, 'amsgrad': False}
41
  - training_precision: float32
42
 
 
7
  results: []
8
  ---
9
 
 
 
 
10
  # pull_request_comments_model
11
 
12
+ ## Model description
13
+
14
+ This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on a pull request comments dataset.
15
  It achieves the following results on the evaluation set:
16
  - Train Loss: 0.0791
17
  - Train Accuracy: 0.9955
 
19
  - Validation Accuracy: 0.8291
20
  - Epoch: 12
21
 
22
+ ## Training and evaluation data
23
 
24
+ Training and evaluation data used for this model are the pull request comments of the tensorflow repository on GitHub.
25
+ In particular, of all the pull request data (commit comments, review comments, events, exc.) only the rows with Type equal to PC (Pull request Comment) or RC (Review Comment) have been entered into the dataset.
26
+ These comments has been classified into 4 categories:
27
+ 1) ML (Machine Learning), if the comment is about specific machine learning aspects, algorithms exc.
28
+ 2) Code, if the comment concerns either style and documentation in the code or maintainability issues or possible bugs exc.
29
+ 3) Management, if the comment is about management activities like checking an activity status, assign a review to someone, trigger Jenkins CI
30
+ 4) Other, if the comment doesn't belong to any of the above categories
31
 
32
  ## Intended uses & limitations
33
 
34
+ One possible use of this model could be to label the pull request comments, clearly only on GitHub repositories that are about Machine Learning.
35
+ In this way a developer, before reading a comment entirely, can have a preview of what that comment is about.
 
 
 
 
 
36
 
37
  ### Training hyperparameters
38
 
39
  The following hyperparameters were used during training:
40
+ - batch_size: 32
41
+ - num_epochs: 20
42
  - optimizer: {'name': 'Adam', 'learning_rate': {'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 2e-05, 'decay_steps': 280, 'end_learning_rate': 0.0, 'power': 1.0, 'cycle': False, 'name': None}}, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-08, 'amsgrad': False}
43
  - training_precision: float32
44