Update README.md
---
datasets:
- yongchao/gptgen_text_detection
metrics:
- accuracy
pipeline_tag: text-classification
---

# BERT-based Classification Model for AI Generated Text Detection

## Model Overview

This BERT-based model is fine-tuned for AI-generated text detection, particularly in the text-to-SQL scenario.

Please note that this model is still in the testing phase; its validity has not been fully evaluated.
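
For reference, here is a minimal usage sketch with the `transformers` text-classification pipeline. The model id below is a placeholder for wherever this checkpoint is published, and the labels follow the default `LABEL_0`/`LABEL_1` convention, which may differ in the actual checkpoint:

```python
from transformers import pipeline

# Placeholder model id; substitute the actual Hub repo id for this checkpoint.
detector = pipeline("text-classification", model="your-username/bert-ai-text-detector")

# Score a candidate question, e.g. one aimed at a text-to-SQL system.
result = detector("List the names of all employees hired after 2020.")
print(result)  # e.g. [{'label': 'LABEL_1', 'score': 0.97}] -- label mapping is checkpoint-specific
```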

## Model Details

- **Architecture**: BERT (bert-base-uncased)
- **Training Data**: The model was trained on a dataset of 2,000 questions labeled as human-written or AI-generated.
- **Training Procedure** (see the training sketch below):
  - **Epochs**: 10
  - **Batch Size**: 16
  - **Learning Rate**: 2e-5
  - **Warmup Steps**: 500
  - **Weight Decay**: 0.01
- **Model Performance** (see the evaluation sketch below):
  - **Accuracy**: 84.5%
  - **Precision**: 1.0
  - **Recall**: 0.845
  - **F1 Score**: 0.916
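
The hyperparameters above map directly onto Hugging Face `TrainingArguments`. As a minimal, hypothetical sketch of the fine-tuning setup (dataset preparation is not specified in this card, so `train_dataset` stands in for a tokenized dataset):

```python
from transformers import (
    AutoModelForSequenceClassification,
    AutoTokenizer,
    Trainer,
    TrainingArguments,
)

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased",
    num_labels=2,  # binary task: human-written vs. AI-generated
)

# Hyperparameters taken from the Training Procedure list above.
training_args = TrainingArguments(
    output_dir="bert-ai-text-detector",
    num_train_epochs=10,
    per_device_train_batch_size=16,
    learning_rate=2e-5,
    warmup_steps=500,
    weight_decay=0.01,
)

trainer = Trainer(
    model=model,
    args=training_args,
    train_dataset=train_dataset,  # placeholder: a tokenized dataset with input_ids and labels
)
trainer.train()
```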
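The reported metrics follow the standard binary-classification definitions (a precision of 1.0 means no human-written text was flagged as AI-generated on the test set). A small illustrative sketch with `scikit-learn`, using made-up labels where 1 = AI-generated:

```python
from sklearn.metrics import accuracy_score, f1_score, precision_score, recall_score

# Toy labels for illustration only; 1 = AI-generated, 0 = human-written.
y_true = [1, 0, 1, 1, 0, 1]
y_pred = [1, 0, 1, 0, 0, 1]

print(accuracy_score(y_true, y_pred))   # fraction of correct predictions
print(precision_score(y_true, y_pred))  # of texts flagged as AI, how many truly are
print(recall_score(y_true, y_pred))     # of AI texts, how many were caught
print(f1_score(y_true, y_pred))         # harmonic mean of precision and recall
```
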
## Limitations and Ethical Considerations
### Limitations
The model may not perform well on text that differs significantly from its training data.
### Ethical Considerations
Be aware of potential biases in the training data that could affect the model's predictions. Ensure that the model is used in a fair and unbiased manner.
## References
- **BERT Paper**: Devlin, J., Chang, M.-W., Lee, K., & Toutanova, K. (2019). BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of NAACL-HLT.
- **Dataset**: [yongchao/gptgen_text_detection](https://huggingface.co/datasets/yongchao/gptgen_text_detection)