amarmol committed
Commit a899994 · verified
1 Parent(s): c49b2bc

Update README.md

Files changed (1)
  1. README.md +27 -2
README.md CHANGED
@@ -11,7 +11,32 @@ base_model:
  - PlanTL-GOB-ES/roberta-base-bne
  pipeline_tag: text-classification
  library_name: transformers
  ---

- Comment classification model according to their toxicity.
- This model has been obtained by fine-tuning the RoBERTa language model in Spanish (https://huggingface.co/PlanTL-GOB-ES/roberta-base-bne)
  - PlanTL-GOB-ES/roberta-base-bne
  pipeline_tag: text-classification
  library_name: transformers
+ datasets:
+ - gplsi/SocialTOX
  ---

+ # 🧠 Toxicity_model_RoBERTa-base-bne – Spanish Multiclass Toxicity Classifier (Fine-tuned)
+
+ ## 📌 Model Description
+
+ This model is a **fine-tuned version** of `RoBERTa-base-bne`, specifically trained to classify the toxicity level of **Spanish-language user comments on news articles**. It distinguishes between three categories:
+
+ - **Non-toxic**
+ - **Slightly toxic**
+ - **Toxic**
+
+ ---
+
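A minimal inference sketch for the classifier described above, using the `transformers` text-classification pipeline. The repository id `your-namespace/Toxicity_model_RoBERTa-base-bne` is a placeholder, since the Hub path is not stated here, and the label strings the model actually returns (for example `LABEL_0` versus `Non-toxic`) are also an assumption to check against the model's `config.json`.

```python
from typing import Callable, List

# Placeholder repository id (assumption): replace with the model's real Hub path.
MODEL_ID = "your-namespace/Toxicity_model_RoBERTa-base-bne"


def build_classifier(model_id: str = MODEL_ID):
    """Create a text-classification pipeline (downloads weights on first use)."""
    from transformers import pipeline  # lazy import: only needed for real inference

    return pipeline("text-classification", model=model_id)


def classify_comment(text: str, classifier: Callable[..., List[dict]]) -> str:
    """Return the highest-scoring label for a single Spanish comment."""
    # top_k=None requests the scores of every class; inputs longer than
    # 512 tokens are truncated to match the max_seq_length used in training.
    scores = classifier(text, top_k=None, truncation=True, max_length=512)
    return max(scores, key=lambda item: item["score"])["label"]
```

With the real model id filled in, `classify_comment("Este artículo es muy interesante.", build_classifier())` would return the predicted label for that comment.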
+ ## 📂 Training Data
+
+ The model was fine-tuned on the **[SocialTOX dataset](https://huggingface.co/datasets/gplsi/SocialTOX)**, a collection of Spanish-language comments annotated for varying levels of toxicity. These comments come from news platforms and represent real-world online discourse. In this case, a binary classifier was developed, in which the classes *Slightly toxic* and *Toxic* were merged into a single *Toxic* category.
+
+ ---
+
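The class-merging step described above can be sketched as a simple label mapping. The label strings are taken from this README; how SocialTOX actually encodes its labels (strings, integers, column names) is not stated here, so the mapping keys are an assumption to adapt to the real dataset.

```python
from typing import Iterable, List

# Assumed label names, copied from the README rather than from the dataset itself.
MERGED_LABEL = {
    "Non-toxic": "Non-toxic",
    "Slightly toxic": "Toxic",  # folded into the single Toxic class
    "Toxic": "Toxic",
}


def merge_labels(labels: Iterable[str]) -> List[str]:
    """Map three-level toxicity labels onto the two merged classes."""
    # A KeyError on an unknown label is deliberate: it surfaces unexpected
    # annotation values instead of silently assigning them to a class.
    return [MERGED_LABEL[label] for label in labels]
```

Raising on unknown labels rather than defaulting keeps annotation errors visible before training.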
+ ## Training hyperparameters
+ - epochs: 7
+ - learning_rate: 1.51E-06
+ - Adam_epsilon: 2.80E-08
+ - weight_decay: 3.88E-12
+ - batch_size: 16
+ - max_seq_length: 512
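As a rough illustration, the values listed above could be passed to the `transformers` `TrainingArguments` class as follows. This is a sketch under assumptions, not the authors' training script: it assumes the `Trainer` API was used, that the batch size is per device, and that `max_seq_length` was applied at tokenization time. The output directory name is hypothetical.

```python
# Hyperparameters from the README, keyed by TrainingArguments parameter names.
HYPERPARAMS = {
    "num_train_epochs": 7,
    "learning_rate": 1.51e-06,
    "adam_epsilon": 2.80e-08,
    "weight_decay": 3.88e-12,
    "per_device_train_batch_size": 16,  # assumption: batch_size is per device
}

# Not a TrainingArguments field; assumed to be the tokenizer's max_length.
MAX_SEQ_LENGTH = 512


def build_training_arguments(output_dir: str = "toxicity-roberta-bne"):
    """Build TrainingArguments from the listed values (output_dir is hypothetical)."""
    from transformers import TrainingArguments  # lazy import: needs transformers and torch

    return TrainingArguments(output_dir=output_dir, **HYPERPARAMS)
```

Under the same assumption, the sequence limit would be applied when encoding the comments, for example `tokenizer(texts, truncation=True, max_length=MAX_SEQ_LENGTH)`.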