AventIQ-AI
/

DistilBERT-SentimentAnalyzer

PyTorch

Safetensors

distilbert

Model card Files Files and versions Community

gautamnancy commited on May 5

Commit

e5340b1

verified ·

1 Parent(s): 53c4a01

Upload READ_ME.md

Browse files

Files changed (1) hide show

READ_ME.md +125 -0

READ_ME.md ADDED Viewed

	@@ -0,0 +1,125 @@

+# DistilBERT Quantized Model for Sentiment Analysis on Yelp Polarity Dataset
+This repository hosts a quantized version of the DistilBERT model, fine-tuned for sentiment analysis tasks on the Yelp Polarity dataset. The model has been optimized using post-training quantization to make it suitable for resource-constrained environments while maintaining high accuracy.
+## Model Details
+- **Model Architecture:** DistilBERT Base Uncased
+- **Task:** Sentiment Analysis
+- **Dataset:** Yelp Polarity
+- **Quantization:** Dynamic Quantization (INT8 on Linear layers)
+- **Fine-tuning Framework:** Hugging Face Transformers
+---
+### Installation
+```sh
+pip install transformers datasets evaluate scikit-learn torch
+from transformers import AutoTokenizer, AutoModelForSequenceClassification
+import torch
+# Load trained model and tokenizer
+model = AutoModelForSequenceClassification.from_pretrained("./results")
+tokenizer = AutoTokenizer.from_pretrained("./results")
+# Set model to eval mode
+model.eval()
+# 10 Sample review texts
+sample_texts = [
+    "The food was absolutely wonderful!",
+    "Terrible experience. I will never come back.",
+    "Average service, but the food was decent.",
+    "I loved the ambiance and the staff was super friendly!",
+    "Worst food I've had in a long time.",
+    "Highly recommend this place for a date night.",
+    "The waiter was rude and the food was cold.",
+    "Amazing pizza, will order again!",
+    "They took too long to serve and it was overpriced.",
+    "Best customer service and delicious desserts!"
+]
+# Predict and print results
+for text in sample_texts:
+    inputs = tokenizer(text, return_tensors="pt", truncation=True, padding=True, max_length=512)
+    with torch.no_grad():
+        outputs = model(**inputs)
+        prediction = torch.argmax(outputs.logits, dim=-1).item()
+        sentiment = "Positive" if prediction == 1 else "Negative"
+    print(f"Text: {text}\\nPredicted Sentiment: {sentiment}\\n")
+Quantization
+import os
+import torch
+from transformers import AutoModelForSequenceClassification, AutoTokenizer
+# Load the fine-tuned model
+model = AutoModelForSequenceClassification.from_pretrained("./results")
+# Apply dynamic quantization
+quantized_model = torch.quantization.quantize_dynamic(
+    model,
+    {torch.nn.Linear},
+    dtype=torch.qint8
+)
+# Define path
+quantized_model_path = "./results/quantized_model"
+# Create directory if it doesn't exist
+os.makedirs(quantized_model_path, exist_ok=True)
+# Save the quantized model weights
+torch.save(quantized_model.state_dict(), f"{quantized_model_path}/pytorch_model.bin")
+# Save config and tokenizer
+model.config.save_pretrained(quantized_model_path)
+tokenizer = AutoTokenizer.from_pretrained("./results")
+tokenizer.save_pretrained(quantized_model_path)
+print("✅ Quantized model saved at:", quantized_model_path)
+Performance Metrics
+Accuracy: Approx. 95% on Yelp Polarity Test Subset
+Precision, Recall, F1-score: Computed during evaluation using scikit-learn
+Fine-Tuning Details
+Dataset
+Source: Yelp Polarity (via Hugging Face Datasets)
+Train samples used: 50,000
+Test samples used: 10,000
+Training
+Number of epochs: 3
+Batch size: 16
+Evaluation strategy: Per epoch
+Learning rate: 2e-5
+Weight decay: 0.01
+Repository-Structure
+.
+├── results/                     # Contains fine-tuned and quantized model files
+│   ├── pytorch_model.bin        # Quantized model weights
+│   ├── config.json              # Model config
+│   └── tokenizer/               # Tokenizer files
+├── logs/                        # Training logs
+├── README.md                    # Model documentation
+Limitations
+The model is trained only on Yelp reviews and may not generalize to other domains.
+Post-training quantization may cause minor accuracy degradation compared to full-precision models.