selfconstruct3d
/

cybersec_classifier

Model card Files Files and versions Community

selfconstruct3d commited on 16 days ago

Commit

a967b13

·

verified ·

1 Parent(s): 1fe6cc1

Update README.md

Files changed (1) hide show

README.md +6 -1

README.md CHANGED Viewed

@@ -9,6 +9,11 @@ language:
 This repository hosts a lightweight `scikit-learn`-based MLP classifier trained to distinguish cybersecurity-related content from other text, using sentence-transformer embeddings. It supports English and German input texts.
 ## 📦 Model Details
 - **Architecture**: `MLPClassifier` with hidden layers `(128, 64)`
@@ -52,7 +57,7 @@ X_train_emb = embedder.encode(X_train.tolist(), convert_to_numpy=True, show_prog
 X_test_emb = embedder.encode(X_test.tolist(), convert_to_numpy=True, show_progress_bar=True)
 # Load the trained classifier
-model_path = hf_hub_download(repo_id="selfconstruct3d/cybersec-classifier", filename="cybersec_classifier.pkl")
 model = joblib.load(model_path)
 # Predict

 This repository hosts a lightweight `scikit-learn`-based MLP classifier trained to distinguish cybersecurity-related content from other text, using sentence-transformer embeddings. It supports English and German input texts.
+## 📊 Training Data
+The model was trained on a multilingual dataset of cybersecurity and non-cybersecurity news articles. The dataset is publicly available on Zenodo:
+🔗 [https://zenodo.org/records/16417939](https://zenodo.org/records/16417939)
 ## 📦 Model Details
 - **Architecture**: `MLPClassifier` with hidden layers `(128, 64)`
 X_test_emb = embedder.encode(X_test.tolist(), convert_to_numpy=True, show_progress_bar=True)
 # Load the trained classifier
+model_path = hf_hub_download(repo_id="selfconstruct3d/cybersec_classifier", filename="cybersec_classifier.pkl")
 model = joblib.load(model_path)
 # Predict