Spaces:

Blaiseboy
/

BioGPT-chatbot

Sleeping

App Files Files Community

Blaiseboy commited on 10 days ago

Commit

f1ca076

verified ·

1 Parent(s): 84c3636

Upload 4 files

Browse files

Files changed (4) hide show

README.md +98 -0
app.py +184 -0
medical_chatbot.py +447 -0
requirements.txt +33 -0

README.md ADDED Viewed

	@@ -0,0 +1,98 @@

+---
+title: Pediatric Medical Assistant
+emoji: 🩺
+colorFrom: blue
+colorTo: green
+sdk: gradio
+sdk_version: 3.40.0
+app_file: app.py
+pinned: false
+license: mit
+---
+# 🩺 Pediatric Medical Assistant
+An AI-powered medical assistant specialized in pediatric healthcare, powered by BioGPT and advanced medical knowledge retrieval.
+## 🌟 Features
+- **Medical Q&A**: Ask questions about pediatric health conditions, symptoms, and treatments
+- **BioGPT Integration**: Powered by Microsoft's BioGPT, a medical language model trained on biomedical literature
+- **Pediatric Focus**: Specialized knowledge base focused on children's health and medical conditions
+- **Vector Search**: Advanced semantic search using sentence transformers and FAISS
+- **Educational Content**: Provides evidence-based medical information for learning purposes
+## 🚀 How to Use
+1. **Ask Medical Questions**: Type your pediatric health questions in plain English
+2. **Get AI Responses**: Receive evidence-based answers from the medical AI
+3. **Explore Topics**: Ask about symptoms, treatments, prevention, and general pediatric health
+### Example Questions:
+- "What causes fever in children?"
+- "How to treat a child's persistent cough?"
+- "When should I be concerned about my baby's breathing?"
+- "What are the signs of dehydration in infants?"
+- "How can I prevent common childhood infections?"
+## 🛠️ Technology Stack
+- **AI Model**: BioGPT (Microsoft) - Medical language model
+- **Embeddings**: Sentence Transformers for semantic search
+- **Vector Database**: FAISS for efficient similarity search
+- **Interface**: Gradio for user-friendly web interface
+- **Deployment**: Hugging Face Spaces
+## ⚠️ Important Disclaimer
+**This tool is for educational and informational purposes only.**
+- **Not Medical Advice**: The information provided is not intended as medical advice, diagnosis, or treatment
+- **Consult Professionals**: Always consult qualified healthcare professionals for:
+  - Medical emergencies
+  - Diagnosis and treatment decisions
+  - Personalized medical advice
+  - Medication guidance
+- **Educational Use**: This AI assistant is designed to provide general medical education and should supplement, not replace, professional medical consultation
+## 🔧 Technical Details
+- **Model**: BioGPT-Large with fallback to base BioGPT
+- **Knowledge Base**: Curated pediatric medical content
+- **Search Method**: Hybrid vector + keyword search
+- **Response Generation**: Context-aware medical responses
+- **Safety**: Built-in disclaimers and safety reminders
+## 📊 Performance
+- **Response Time**: Typically 2-5 seconds
+- **Knowledge Coverage**: Focused on pediatric medicine
+- **Accuracy**: Based on medical literature training data
+- **Availability**: 24/7 through Hugging Face Spaces
+## 🏥 Medical Specialization
+This assistant specializes in:
+- Pediatric symptoms and conditions
+- Common childhood illnesses
+- Preventive care guidance
+- When to seek medical attention
+- General health education for parents and caregivers
+## 📝 License
+This project is licensed under the MIT License - see the license file for details.
+## 🤝 Contributing
+This is an educational project. For suggestions or improvements, please reach out through appropriate channels.
+## 🔗 Related Resources
+- [BioGPT Research Paper](https://arxiv.org/abs/2210.10341)
+- [Hugging Face Transformers](https://huggingface.co/transformers/)
+- [American Academy of Pediatrics](https://www.aap.org/)
+---
+**Remember**: While this AI can provide helpful medical information, it cannot replace the expertise and judgment of trained healthcare professionals. Always prioritize professional medical care for your child's health needs.

app.py ADDED Viewed

	@@ -0,0 +1,184 @@

+import gradio as gr
+import os
+import torch
+from medical_chatbot import ColabBioGPTChatbot
+def initialize_chatbot():
+    """Initialize the chatbot with proper error handling"""
+    try:
+        print("🚀 Initializing Medical Chatbot...")
+        # Check if GPU is available but use CPU for stability on HF Spaces
+        use_gpu = torch.cuda.is_available()
+        use_8bit = use_gpu  # Only use 8-bit if GPU is available
+        chatbot = ColabBioGPTChatbot(use_gpu=use_gpu, use_8bit=use_8bit)
+        # Try to load medical data
+        medical_file = "Pediatric_cleaned.txt"
+        if os.path.exists(medical_file):
+            chatbot.load_medical_data(medical_file)
+            status = f"✅ Medical file '{medical_file}' loaded successfully! Ready to chat!"
+            success = True
+        else:
+            status = f"❌ Medical file '{medical_file}' not found. Please ensure the file is in the same directory."
+            success = False
+        return chatbot, status, success
+    except Exception as e:
+        error_msg = f"❌ Failed to initialize chatbot: {str(e)}"
+        print(error_msg)
+        return None, error_msg, False
+# Initialize chatbot at startup
+print("🏥 Starting Pediatric Medical Assistant...")
+chatbot, startup_status, medical_file_loaded = initialize_chatbot()
+def generate_response(user_input, history):
+    """Generate response with proper error handling"""
+    if not chatbot:
+        return history + [("System Error", "❌ Chatbot failed to initialize. Please refresh the page and try again.")], ""
+    if not medical_file_loaded:
+        return history + [(user_input, "⚠️ Medical data failed to load. The chatbot may not have access to the full medical knowledge base.")], ""
+    if not user_input.strip():
+        return history, ""
+    try:
+        # Generate response
+        bot_response = chatbot.chat(user_input)
+        # Add to history
+        history = history + [(user_input, bot_response)]
+        return history, ""
+    except Exception as e:
+        error_response = f"⚠️ Sorry, I encountered an error: {str(e)}. Please try rephrasing your question."
+        history = history + [(user_input, error_response)]
+        return history, ""
+# Create custom CSS for better styling
+custom_css = """
+.gradio-container {
+    font-family: 'Segoe UI', Tahoma, Geneva, Verdana, sans-serif;
+}
+.chatbot {
+    height: 500px !important;
+}
+.message {
+    padding: 10px;
+    margin: 5px;
+    border-radius: 10px;
+}
+.user-message {
+    background-color: #e3f2fd;
+    margin-left: 20%;
+}
+.bot-message {
+    background-color: #f5f5f5;
+    margin-right: 20%;
+}
+"""
+# Create Gradio interface
+with gr.Blocks(css=custom_css, title="Pediatric Medical Assistant") as demo:
+    gr.Markdown(
+        """
+        # 🩺 Pediatric Medical Assistant
+        Welcome to your AI-powered pediatric medical assistant! This chatbot uses advanced medical AI (BioGPT)
+        to provide evidence-based information about children's health and medical conditions.
+        **⚠️ Important Disclaimer:** This tool provides educational information only.
+        Always consult qualified healthcare professionals for medical diagnosis, treatment, and personalized advice.
+        """
+    )
+    # Display startup status
+    gr.Markdown(f"**System Status:** {startup_status}")
+    # Chat interface
+    with gr.Row():
+        with gr.Column(scale=4):
+            chatbot_ui = gr.Chatbot(
+                label="💬 Chat with Medical AI",
+                height=500,
+                show_label=True,
+                avatar_images=("👤", "🤖")
+            )
+            with gr.Row():
+                user_input = gr.Textbox(
+                    placeholder="Ask a pediatric health question... (e.g., 'What causes fever in children?')",
+                    lines=2,
+                    max_lines=5,
+                    show_label=False,
+                    scale=4
+                )
+                submit_btn = gr.Button("Send 📤", variant="primary", scale=1)
+        with gr.Column(scale=1):
+            gr.Markdown(
+                """
+                ### 💡 Example Questions:
+                - "What causes fever in children?"
+                - "How to treat a child's cough?"
+                - "When should I call the doctor?"
+                - "What are signs of dehydration?"
+                - "How to prevent common infections?"
+                ### 🔧 System Info:
+                - **Model:** BioGPT (Medical AI)
+                - **Specialization:** Pediatric Medicine
+                - **Search:** Vector + Keyword
+                """
+            )
+    # Event handlers
+    def submit_message(user_msg, history):
+        return generate_response(user_msg, history)
+    # Connect events
+    user_input.submit(
+        fn=submit_message,
+        inputs=[user_input, chatbot_ui],
+        outputs=[chatbot_ui, user_input],
+        show_progress=True
+    )
+    submit_btn.click(
+        fn=submit_message,
+        inputs=[user_input, chatbot_ui],
+        outputs=[chatbot_ui, user_input],
+        show_progress=True
+    )
+    # Footer
+    gr.Markdown(
+        """
+        ---
+        **🏥 Medical AI Assistant** | Powered by BioGPT | For Educational Purposes Only
+        **Remember:** Always consult healthcare professionals for medical emergencies and personalized medical advice.
+        """
+    )
+# Launch configuration for Hugging Face Spaces
+if __name__ == "__main__":
+    # For Hugging Face Spaces deployment
+    demo.launch(
+        server_name="0.0.0.0",  # Required for HF Spaces
+        server_port=7860,       # Default port for HF Spaces
+        show_error=True,        # Show errors for debugging
+        show_tips=False,        # Disable tips for cleaner interface
+        enable_queue=True,      # Enable queue for better performance
+        max_threads=10          # Limit concurrent users
+    )

medical_chatbot.py ADDED Viewed

	@@ -0,0 +1,447 @@

+import os
+import re
+import torch
+import warnings
+import numpy as np
+import faiss
+from transformers import (
+    AutoTokenizer,
+    AutoModelForCausalLM,
+    BitsAndBytesConfig
+)
+from sentence_transformers import SentenceTransformer
+from typing import List, Dict, Optional
+import time
+from datetime import datetime
+# Suppress warnings for cleaner output
+warnings.filterwarnings('ignore')
+class ColabBioGPTChatbot:
+    def __init__(self, use_gpu=True, use_8bit=True):
+        """Initialize BioGPT chatbot optimized for Hugging Face Spaces"""
+        print("🏥 Initializing Medical Chatbot...")
+        self.use_gpu = use_gpu
+        self.use_8bit = use_8bit
+        self.device = "cuda" if torch.cuda.is_available() and use_gpu else "cpu"
+        print(f"🖥️ Using device: {self.device}")
+        self.tokenizer = None
+        self.model = None
+        self.knowledge_chunks = []
+        self.conversation_history = []
+        self.embedding_model = None
+        self.faiss_index = None
+        self.faiss_ready = False
+        self.use_embeddings = True
+        # Initialize components
+        self.setup_biogpt()
+        self.load_sentence_transformer()
+    def setup_biogpt(self):
+        """Setup BioGPT model with fallback to base BioGPT if Large fails"""
+        print("🧠 Loading BioGPT model...")
+        try:
+            # Try BioGPT-Large first
+            model_name = "microsoft/BioGPT-Large"
+            print(f"Attempting to load {model_name}...")
+            if self.use_8bit and self.device == "cuda":
+                quantization_config = BitsAndBytesConfig(
+                    load_in_8bit=True,
+                    llm_int8_threshold=6.0,
+                    llm_int8_has_fp16_weight=False,
+                )
+            else:
+                quantization_config = None
+            self.tokenizer = AutoTokenizer.from_pretrained(model_name)
+            if self.tokenizer.pad_token is None:
+                self.tokenizer.pad_token = self.tokenizer.eos_token
+            self.model = AutoModelForCausalLM.from_pretrained(
+                model_name,
+                quantization_config=quantization_config,
+                torch_dtype=torch.float16 if self.device == "cuda" else torch.float32,
+                device_map="auto" if self.device == "cuda" else None,
+                trust_remote_code=True,
+                low_cpu_mem_usage=True
+            )
+            if self.device == "cuda" and quantization_config is None:
+                self.model = self.model.to(self.device)
+            print("✅ BioGPT-Large loaded successfully!")
+        except Exception as e:
+            print(f"❌ BioGPT-Large loading failed: {e}")
+            print("🔁 Falling back to base BioGPT...")
+            self.setup_fallback_biogpt()
+    def setup_fallback_biogpt(self):
+        """Fallback to microsoft/BioGPT if BioGPT-Large fails"""
+        try:
+            model_name = "microsoft/BioGPT"
+            print(f"Loading fallback model: {model_name}")
+            self.tokenizer = AutoTokenizer.from_pretrained(model_name)
+            if self.tokenizer.pad_token is None:
+                self.tokenizer.pad_token = self.tokenizer.eos_token
+            self.model = AutoModelForCausalLM.from_pretrained(
+                model_name,
+                torch_dtype=torch.float32,
+                trust_remote_code=True,
+                low_cpu_mem_usage=True
+            )
+            if self.device == "cuda":
+                self.model = self.model.to(self.device)
+            print("✅ Base BioGPT model loaded successfully!")
+        except Exception as e:
+            print(f"❌ Failed to load fallback BioGPT: {e}")
+            self.model = None
+            self.tokenizer = None
+    def load_sentence_transformer(self):
+        """Load sentence transformer for embeddings"""
+        try:
+            print("🔮 Loading sentence transformer...")
+            self.embedding_model = SentenceTransformer('all-MiniLM-L6-v2')
+            # Initialize FAISS index (will be populated when data is loaded)
+            embedding_dim = 384  # Dimension for all-MiniLM-L6-v2
+            self.faiss_index = faiss.IndexFlatL2(embedding_dim)
+            self.faiss_ready = True
+            print("✅ Sentence transformer and FAISS index ready!")
+        except Exception as e:
+            print(f"❌ Failed to load sentence transformer: {e}")
+            self.use_embeddings = False
+            self.faiss_ready = False
+    def load_medical_data(self, file_path):
+        """Load and process medical data"""
+        print(f"📖 Loading medical data from {file_path}...")
+        try:
+            if not os.path.exists(file_path):
+                raise FileNotFoundError(f"File {file_path} not found")
+            with open(file_path, 'r', encoding='utf-8') as f:
+                text = f.read()
+            print(f"📄 File loaded: {len(text):,} characters")
+        except Exception as e:
+            print(f"❌ Error loading file: {e}")
+            raise ValueError(f"Failed to load medical data: {e}")
+        # Create chunks
+        print("📝 Creating medical chunks...")
+        chunks = self.create_medical_chunks(text)
+        print(f"📋 Created {len(chunks)} medical chunks")
+        self.knowledge_chunks = chunks
+        # Generate embeddings if available
+        if self.use_embeddings and self.embedding_model and self.faiss_ready:
+            try:
+                self.generate_embeddings_with_progress(chunks)
+                print("✅ Medical data loaded with embeddings!")
+            except Exception as e:
+                print(f"⚠️ Embedding generation failed: {e}")
+                print("✅ Medical data loaded (keyword search mode)")
+        else:
+            print("✅ Medical data loaded (keyword search mode)")
+    def create_medical_chunks(self, text: str, chunk_size: int = 400) -> List[Dict]:
+        """Create medically-optimized text chunks"""
+        chunks = []
+        # Split by paragraphs first
+        paragraphs = [p.strip() for p in text.split('\n\n') if len(p.strip()) > 50]
+        chunk_id = 0
+        for paragraph in paragraphs:
+            if len(paragraph.split()) <= chunk_size:
+                chunks.append({
+                    'id': chunk_id,
+                    'text': paragraph,
+                    'medical_focus': self.identify_medical_focus(paragraph)
+                })
+                chunk_id += 1
+            else:
+                # Split large paragraphs by sentences
+                sentences = re.split(r'[.!?]+', paragraph)
+                current_chunk = ""
+                for sentence in sentences:
+                    sentence = sentence.strip()
+                    if not sentence:
+                        continue
+                    if len(current_chunk.split()) + len(sentence.split()) <= chunk_size:
+                        current_chunk += sentence + ". "
+                    else:
+                        if current_chunk.strip():
+                            chunks.append({
+                                'id': chunk_id,
+                                'text': current_chunk.strip(),
+                                'medical_focus': self.identify_medical_focus(current_chunk)
+                            })
+                            chunk_id += 1
+                        current_chunk = sentence + ". "
+                if current_chunk.strip():
+                    chunks.append({
+                        'id': chunk_id,
+                        'text': current_chunk.strip(),
+                        'medical_focus': self.identify_medical_focus(current_chunk)
+                    })
+                    chunk_id += 1
+        return chunks
+    def identify_medical_focus(self, text: str) -> str:
+        """Identify the medical focus of a text chunk"""
+        text_lower = text.lower()
+        categories = {
+            'pediatric_symptoms': ['fever', 'cough', 'rash', 'vomiting', 'diarrhea'],
+            'treatments': ['treatment', 'therapy', 'medication', 'antibiotics'],
+            'diagnosis': ['diagnosis', 'diagnostic', 'symptoms', 'signs'],
+            'emergency': ['emergency', 'urgent', 'serious', 'hospital'],
+            'prevention': ['prevention', 'vaccine', 'immunization', 'avoid']
+        }
+        for category, keywords in categories.items():
+            if any(keyword in text_lower for keyword in keywords):
+                return category
+        return 'general_medical'
+    def generate_embeddings_with_progress(self, chunks: List[Dict]):
+        """Generate embeddings and add to FAISS index"""
+        print("🔮 Generating embeddings...")
+        try:
+            texts = [chunk['text'] for chunk in chunks]
+            # Generate embeddings in batches
+            batch_size = 32
+            all_embeddings = []
+            for i in range(0, len(texts), batch_size):
+                batch_texts = texts[i:i+batch_size]
+                batch_embeddings = self.embedding_model.encode(batch_texts, show_progress_bar=False)
+                all_embeddings.extend(batch_embeddings)
+                progress = min(i + batch_size, len(texts))
+                print(f"   Progress: {progress}/{len(texts)} chunks processed", end='\r')
+            print(f"\n   ✅ Generated embeddings for {len(texts)} chunks")
+            # Add to FAISS index
+            embeddings_array = np.array(all_embeddings).astype('float32')
+            self.faiss_index.add(embeddings_array)
+            print("✅ Embeddings added to FAISS index!")
+        except Exception as e:
+            print(f"❌ Embedding generation failed: {e}")
+            raise
+    def retrieve_medical_context(self, query: str, n_results: int = 3) -> List[str]:
+        """Retrieve relevant medical context"""
+        if self.use_embeddings and self.embedding_model and self.faiss_ready and self.faiss_index.ntotal > 0:
+            try:
+                # Generate query embedding
+                query_embedding = self.embedding_model.encode([query])
+                # Search FAISS index
+                distances, indices = self.faiss_index.search(
+                    np.array(query_embedding).astype('float32'),
+                    min(n_results, self.faiss_index.ntotal)
+                )
+                # Get relevant chunks
+                context_chunks = []
+                for idx in indices[0]:
+                    if idx != -1 and idx < len(self.knowledge_chunks):
+                        context_chunks.append(self.knowledge_chunks[idx]['text'])
+                if context_chunks:
+                    return context_chunks
+            except Exception as e:
+                print(f"⚠️ Embedding search failed: {e}")
+        # Fallback to keyword search
+        return self.keyword_search_medical(query, n_results)
+    def keyword_search_medical(self, query: str, n_results: int) -> List[str]:
+        """Medical-focused keyword search"""
+        if not self.knowledge_chunks:
+            return []
+        query_words = set(query.lower().split())
+        chunk_scores = []
+        for chunk_info in self.knowledge_chunks:
+            chunk_text = chunk_info['text']
+            chunk_words = set(chunk_text.lower().split())
+            # Calculate relevance score
+            word_overlap = len(query_words.intersection(chunk_words))
+            base_score = word_overlap / len(query_words) if query_words else 0
+            # Boost medical content
+            medical_boost = 0
+            if chunk_info.get('medical_focus') in ['pediatric_symptoms', 'treatments', 'diagnosis']:
+                medical_boost = 0.3
+            final_score = base_score + medical_boost
+            if final_score > 0:
+                chunk_scores.append((final_score, chunk_text))
+        # Return top matches
+        chunk_scores.sort(reverse=True)
+        return [chunk for _, chunk in chunk_scores[:n_results]]
+    def generate_biogpt_response(self, context: str, query: str) -> str:
+        """Generate medical response using BioGPT"""
+        if not self.model or not self.tokenizer:
+            return "Medical model not available. Please check the setup."
+        try:
+            # Create medical prompt
+            prompt = f"""Medical Context: {context[:800]}
+Question: {query}
+Medical Answer:"""
+            # Tokenize
+            inputs = self.tokenizer(
+                prompt,
+                return_tensors="pt",
+                truncation=True,
+                max_length=1024
+            )
+            # Move to device
+            if self.device == "cuda":
+                inputs = {k: v.to(self.device) for k, v in inputs.items()}
+            # Generate response
+            with torch.no_grad():
+                outputs = self.model.generate(
+                    **inputs,
+                    max_new_tokens=150,
+                    do_sample=True,
+                    temperature=0.7,
+                    top_p=0.9,
+                    pad_token_id=self.tokenizer.eos_token_id,
+                    repetition_penalty=1.1
+                )
+            # Decode response
+            full_response = self.tokenizer.decode(outputs[0], skip_special_tokens=True)
+            # Extract generated part
+            if "Medical Answer:" in full_response:
+                generated_response = full_response.split("Medical Answer:")[-1].strip()
+            else:
+                generated_response = full_response[len(prompt):].strip()
+            return self.clean_medical_response(generated_response)
+        except Exception as e:
+            print(f"⚠️ BioGPT generation failed: {e}")
+            return self.fallback_response(context, query)
+    def clean_medical_response(self, response: str) -> str:
+        """Clean and format medical response"""
+        # Remove incomplete sentences and limit length
+        sentences = re.split(r'[.!?]+', response)
+        clean_sentences = []
+        for sentence in sentences:
+            sentence = sentence.strip()
+            if len(sentence) > 10 and not sentence.endswith(('and', 'or', 'but', 'however')):
+                clean_sentences.append(sentence)
+            if len(clean_sentences) >= 3:
+                break
+        if clean_sentences:
+            cleaned = '. '.join(clean_sentences) + '.'
+        else:
+            cleaned = response[:200] + '...' if len(response) > 200 else response
+        return cleaned
+    def fallback_response(self, context: str, query: str) -> str:
+        """Fallback response when BioGPT fails"""
+        sentences = [s.strip() for s in context.split('.') if len(s.strip()) > 20]
+        if sentences:
+            response = sentences[0] + '.'
+            if len(sentences) > 1:
+                response += ' ' + sentences[1] + '.'
+        else:
+            response = context[:300] + '...'
+        return response
+    def handle_conversational_interactions(self, query: str) -> Optional[str]:
+        """Handle conversational interactions"""
+        query_lower = query.lower().strip()
+        # Greetings
+        if any(greeting in query_lower for greeting in ['hello', 'hi', 'hey', 'good morning', 'good afternoon']):
+            return "👋 Hello! I'm your pediatric medical AI assistant. How can I help you with medical questions today?"
+        # Thanks
+        if any(thanks in query_lower for thanks in ['thank you', 'thanks', 'thx']):
+            return "🙏 You're welcome! I'm glad I could help. Remember to consult healthcare professionals for medical decisions. What else can I help you with?"
+        # Goodbyes
+        if any(bye in query_lower for bye in ['bye', 'goodbye', 'see you later']):
+            return "👋 Goodbye! Take care and remember to consult healthcare professionals for any medical concerns. Stay healthy!"
+        return None
+    def chat(self, query: str) -> str:
+        """Main chat function"""
+        if not query.strip():
+            return "Hello! I'm your pediatric medical AI assistant. How can I help you today?"
+        # Handle conversational interactions
+        conversational_response = self.handle_conversational_interactions(query)
+        if conversational_response:
+            return conversational_response
+        if not self.knowledge_chunks:
+            return "Please load medical data first to access the medical knowledge base."
+        if not self.model or not self.tokenizer:
+            return "Medical model not available. Please check the setup and try again."
+        # Retrieve context
+        context = self.retrieve_medical_context(query)
+        if not context:
+            return "I don't have specific information about this topic in my medical database. Please consult with a healthcare professional for personalized medical advice."
+        # Generate response
+        main_context = '\n\n'.join(context)
+        response = self.generate_biogpt_response(main_context, query)
+        # Format final response
+        final_response = f"🩺 **Medical Information:** {response}\n\n⚠️ **Important:** This information is for educational purposes only. Always consult with qualified healthcare professionals for medical diagnosis, treatment, and personalized advice."
+        return final_response

requirements.txt ADDED Viewed

	@@ -0,0 +1,33 @@

+# Core ML and NLP libraries
+torch>=2.0.0,<2.2.0
+transformers>=4.30.0,<4.40.0
+sentence-transformers>=2.2.0,<3.0.0
+accelerate>=0.20.0,<0.25.0
+# Quantization support (for GPU optimization)
+bitsandbytes>=0.41.0,<0.43.0
+# Vector search (CPU version for HF Spaces compatibility)
+faiss-cpu>=1.7.4,<1.8.0
+# Scientific computing
+numpy>=1.21.0,<1.26.0
+scipy>=1.9.0,<1.12.0
+# Gradio for web interface
+gradio>=3.40.0,<4.0.0
+# Essential utilities
+tqdm>=4.64.0
+requests>=2.28.0
+packaging>=21.0
+# Tokenization support
+tokenizers>=0.13.0,<0.16.0
+# System monitoring
+psutil>=5.9.0
+# Additional stability packages
+safetensors>=0.3.0
+huggingface-hub>=0.15.0