Update README.md
Browse files
README.md
CHANGED
@@ -132,7 +132,7 @@ for i, prob in enumerate(probabilities):
|
|
132 |
### Training Data
|
133 |
The model was trained on a novel, heterogeneous corpus of 23,969 multi-labeled documents from 11 diverse sources, including government, academia, industry, and civil society, with some sources translated from Japanese. This approach was designed to address the "interpretive diversity" of SDG-related language.
|
134 |
|
135 |
-
For full details on reconstructing the training corpus, please refer to **Supplementary Information
|
136 |
|
137 |
### Evaluation
|
138 |
This model was selected based on its superior generalization performance (especially recall) on external datasets like the OSDG Community Dataset and the SDGi Corpus. On a human-coded sample of scientific articles, the model achieved a macro-averaged **F1-score of 0.623**. For a full breakdown of performance metrics, please see the paper.
|
|
|
132 |
### Training Data
|
133 |
The model was trained on a novel, heterogeneous corpus of 23,969 multi-labeled documents from 11 diverse sources, including government, academia, industry, and civil society, with some sources translated from Japanese. This approach was designed to address the "interpretive diversity" of SDG-related language.
|
134 |
|
135 |
+
For full details on reconstructing the training corpus, please refer to **Supplementary Information S4** in our paper.
|
136 |
|
137 |
### Evaluation
|
138 |
This model was selected based on its superior generalization performance (especially recall) on external datasets like the OSDG Community Dataset and the SDGi Corpus. On a human-coded sample of scientific articles, the model achieved a macro-averaged **F1-score of 0.623**. For a full breakdown of performance metrics, please see the paper.
|