Create README.md
Browse files
README.md
ADDED
|
@@ -0,0 +1,31 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
---
|
| 2 |
+
license: cc-by-4.0
|
| 3 |
+
tags:
|
| 4 |
+
- multi-label-classification
|
| 5 |
+
- text-classification
|
| 6 |
+
- onnx
|
| 7 |
+
- web-classification
|
| 8 |
+
- firefox-ai
|
| 9 |
+
- preview
|
| 10 |
+
language:
|
| 11 |
+
- multilingual
|
| 12 |
+
datasets:
|
| 13 |
+
- tshasan/multi-label-web-classification
|
| 14 |
+
base_model: Alibaba-NLP/gte-modernbert-base
|
| 15 |
+
pipeline_tag: text-classification
|
| 16 |
+
---
|
| 17 |
+
|
| 18 |
+
# modernBERT-URLTITLE-classifier-preview
|
| 19 |
+
|
| 20 |
+
## Model Overview
|
| 21 |
+
|
| 22 |
+
This is a **preview version** of a multi-label web classification model fine-tuned from `Alibaba-NLP/gte-modernbert-base`. It classifies websites into multiple categories based on their URLs and titles. The model supports 11 labels: `Uncatergorized`,`News`, `Entertainment`, `Shop`, `Chat`, `Education`, `Government`, `Health`, `Technology`, `Work`, and `Travel`.
|
| 23 |
+
|
| 24 |
+
- **Developed by**: Taimur Hasan
|
| 25 |
+
- **Model Type**: Multi-label Text Classification
|
| 26 |
+
- **Status**: Preview (under active development
|
| 27 |
+
### Architecture
|
| 28 |
+
- **Fine-tuning**: Unfroze the last 4 encoder layers and the pooler
|
| 29 |
+
- **Problem Type**: Multi-label classification
|
| 30 |
+
- **Output Labels**: 11 (`News`, `Entertainment`, `Shop`, `Chat`, `Education`, `Government`, `Health`, `Technology`, `Work`, `Travel`,`Uncatergorized`)
|
| 31 |
+
- **Input Format**: Concatenated string: `"{url}:{title}"`
|