Space: ArchCoder/federated-credit-scoring

Transcendental-Programmer committed about 18 hours ago

Commit 0af9146

1 Parent(s): b407fad

fix: fixed server error

Files changed (3)

DEPLOYMENT.md +297 -1
app.py +172 -114
webapp/streamlit_app.py +172 -114

DEPLOYMENT.md CHANGED

@@ -117,4 +117,300 @@ After deployment, you'll have:
 - ✅ Professional presentation of your project
 - ✅ Educational value for visitors
-**Your federated learning demo will be live and working!** 🚀

+**Your federated learning demo will be live and working!** 🚀
+# FinFedRAG Deployment Guide
+## Overview
+This project implements a federated learning framework with RAG capabilities for financial data. The system can be deployed using Docker Compose for local development or Kubernetes for production environments.
+## Debugging and Monitoring
+### Enhanced Debugging Features
+The web application now includes comprehensive debugging capabilities:
+1. **Debug Information Panel**: Located in the sidebar, shows:
+   - Real-time server health status
+   - Recent debug messages and logs
+   - Connection error details
+   - Client simulator status
+2. **Detailed Error Logging**: All operations are logged with:
+   - Connection attempts and failures
+   - Server response details
+   - Timeout and network error handling
+   - Client registration and training status updates
+3. **Real-time Status Monitoring**:
+   - Server health checks
+   - Training progress tracking
+   - Client connection status
+   - Error message history
+### Using the Debug Features
+1. **Enable Debug Mode**: Uncheck "Demo Mode" in the sidebar
+2. **View Debug Information**: Expand the "Debug Information" section in the sidebar
+3. **Monitor Logs**: Check the "Recent Logs" section for real-time updates
+4. **Clear Logs**: Use the "Clear Debug Logs" button to reset the log history
+## Local Development Setup
+### Prerequisites
+- Python 3.8+
+- Docker and Docker Compose
+- Required Python packages (see requirements.txt)
+### Quick Start
+1. **Clone and Setup**:
+   ```bash
+   git clone <repository-url>
+   cd FinFedRAG-Financial-Federated-RAG
+   python -m venv venv
+   source venv/bin/activate  # On Windows: venv\Scripts\activate
+   pip install -r requirements.txt
+   ```
+2. **Start the Federated Server**:
+   ```bash
+   python src/main.py --mode server
+   ```
+3. **Start Multiple Clients** (in separate terminals):
+   ```bash
+   python src/main.py --mode client --client-id client1
+   python src/main.py --mode client --client-id client2
+   python src/main.py --mode client --client-id client3
+   ```
+4. **Run the Web Application**:
+   ```bash
+   streamlit run app.py
+   ```
+### Docker Compose Deployment
+For containerized deployment:
+```bash
+cd docker
+docker-compose up --build
+```
+This will start:
+- 1 federated server on port 8000
+- 3 federated clients
+- All services connected via Docker network
+## Kubernetes Deployment
+### Architecture Overview
+The Kubernetes setup provides a production-ready deployment with:
+- **Server Deployment**: Single federated learning server
+- **Client Deployment**: Multiple federated learning clients (3 replicas)
+- **Service Layer**: Internal service discovery
+- **ConfigMaps**: Configuration management
+- **Namespace Isolation**: Dedicated `federated-learning` namespace
+### Components
+#### 1. Server Deployment (`kubernetes/deployments/server.yaml`)
+```yaml
+- Replicas: 1 (single server instance)
+- Port: 8000 (internal)
+- Config: Mounted from ConfigMap
+- Image: fl-server:latest
+```
+#### 2. Client Deployment (`kubernetes/deployments/client.yaml`)
+```yaml
+- Replicas: 3 (multiple client instances)
+- Environment: SERVER_HOST=fl-server-service
+- Config: Mounted from ConfigMap
+- Image: fl-client:latest
+```
+#### 3. Service (`kubernetes/services/service.yaml`)
+```yaml
+- Type: ClusterIP (internal communication)
+- Port: 8000
+- Selector: app=fl-server
+```
+### Deployment Steps
+1. **Build Docker Images**:
+   ```bash
+   docker build -f docker/Dockerfile.server -t fl-server:latest .
+   docker build -f docker/Dockerfile.client -t fl-client:latest .
+   ```
+2. **Create Namespace**:
+   ```bash
+   kubectl create namespace federated-learning
+   ```
+3. **Create ConfigMaps**:
+   ```bash
+   kubectl create configmap server-config --from-file=config/server_config.yaml -n federated-learning
+   kubectl create configmap client-config --from-file=config/client_config.yaml -n federated-learning
+   ```
+4. **Deploy Services**:
+   ```bash
+   kubectl apply -f kubernetes/services/service.yaml
+   kubectl apply -f kubernetes/deployments/server.yaml
+   kubectl apply -f kubernetes/deployments/client.yaml
+   ```
+5. **Verify Deployment**:
+   ```bash
+   kubectl get pods -n federated-learning
+   kubectl get services -n federated-learning
+   ```
+### Accessing the Application
+#### Option 1: Port Forwarding
+```bash
+kubectl port-forward service/fl-server-service 8080:8000 -n federated-learning
+```
+#### Option 2: Load Balancer
+Modify the service to use LoadBalancer type:
+```yaml
+apiVersion: v1
+kind: Service
+metadata:
+  name: fl-server-service
+  namespace: federated-learning
+spec:
+  type: LoadBalancer  # Changed from ClusterIP
+  selector:
+    app: fl-server
+  ports:
+  - port: 8080
+    targetPort: 8000
+```
+#### Option 3: Ingress Controller
+Create an ingress resource for external access:
+```yaml
+apiVersion: networking.k8s.io/v1
+kind: Ingress
+metadata:
+  name: fl-ingress
+  namespace: federated-learning
+spec:
+  rules:
+  - host: fl.example.com
+    http:
+      paths:
+      - path: /
+        pathType: Prefix
+        backend:
+          service:
+            name: fl-server-service
+            port:
+              number: 8000
+```
+### Monitoring and Debugging in Kubernetes
+1. **View Pod Logs**:
+   ```bash
+   kubectl logs -f deployment/fl-server -n federated-learning
+   kubectl logs -f deployment/fl-client -n federated-learning
+   ```
+2. **Check Pod Status**:
+   ```bash
+   kubectl describe pods -n federated-learning
+   ```
+3. **Access Pod Shell**:
+   ```bash
+   kubectl exec -it <pod-name> -n federated-learning -- /bin/bash
+   ```
+4. **Monitor Resource Usage**:
+   ```bash
+   kubectl top pods -n federated-learning
+   ```
+## Troubleshooting
+### Common Issues
+1. **Connection Refused Errors**:
+   - Check if server is running: `kubectl get pods -n federated-learning`
+   - Verify service exists: `kubectl get services -n federated-learning`
+   - Check pod logs for startup errors
+2. **Client Registration Failures**:
+   - Ensure server is healthy before starting clients
+   - Check network connectivity between pods
+   - Verify ConfigMap configurations
+3. **Training Status Issues**:
+   - Monitor server logs for aggregation errors
+   - Check client participation in training rounds
+   - Verify model update sharing
+### Debug Commands
+```bash
+# Check all resources in namespace
+kubectl get all -n federated-learning
+# View detailed pod information
+kubectl describe pod <pod-name> -n federated-learning
+# Check service endpoints
+kubectl get endpoints -n federated-learning
+# View ConfigMap contents
+kubectl get configmap server-config -n federated-learning -o yaml
+```
+## Production Considerations
+1. **Resource Limits**: Add resource requests and limits to deployments
+2. **Health Checks**: Implement liveness and readiness probes
+3. **Secrets Management**: Use Kubernetes secrets for sensitive data
+4. **Persistent Storage**: Add persistent volumes for model storage
+5. **Monitoring**: Integrate with Prometheus/Grafana for metrics
+6. **Logging**: Use centralized logging (ELK stack, Fluentd)
+## Scaling
+### Horizontal Pod Autoscaling
+```yaml
+apiVersion: autoscaling/v2
+kind: HorizontalPodAutoscaler
+metadata:
+  name: fl-client-hpa
+  namespace: federated-learning
+spec:
+  scaleTargetRef:
+    apiVersion: apps/v1
+    kind: Deployment
+    name: fl-client
+  minReplicas: 3
+  maxReplicas: 10
+  metrics:
+  - type: Resource
+    resource:
+      name: cpu
+      target:
+        type: Utilization
+        averageUtilization: 70
+```
+This deployment guide provides comprehensive information for both local development and production Kubernetes deployment, with enhanced debugging capabilities for better monitoring and troubleshooting.
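
Review note on the `fl-client-hpa` manifest above: the sketch below works through the replica arithmetic that the Kubernetes documentation describes for a single utilization metric, `ceil(currentReplicas * currentMetric / targetMetric)`, clamped to `minReplicas` and `maxReplicas`. The function name `desired_replicas` is hypothetical, and the 10% tolerance band is an assumption based on the controller's documented default. The real controller also accounts for pod readiness, missing metrics, and stabilization windows, so treat this as an approximation rather than the controller's exact behavior.

```python
import math


def desired_replicas(current_replicas, current_cpu_pct, target_cpu_pct=70,
                     min_replicas=3, max_replicas=10, tolerance=0.1):
    """Approximate the HPA decision for one CPU utilization metric.

    Defaults mirror the manifest in this diff (target 70%, 3 to 10 replicas).
    The tolerance value is an assumption, not something the manifest sets.
    """
    ratio = current_cpu_pct / target_cpu_pct
    if abs(ratio - 1.0) <= tolerance:
        # Close enough to the target: keep the current replica count.
        desired = current_replicas
    else:
        desired = math.ceil(current_replicas * ratio)
    # Never go below minReplicas or above maxReplicas.
    return max(min_replicas, min(max_replicas, desired))


if __name__ == "__main__":
    for replicas, cpu in [(3, 140), (3, 35), (6, 280), (3, 72)]:
        print(f"{replicas} replicas at {cpu}% CPU -> {desired_replicas(replicas, cpu)}")
```

With these settings, three clients averaging 140% of their CPU request would scale to six, while three idle clients stay at three because of `minReplicas`. Utilization is measured against CPU requests, so the HPA presumably needs the resource requests listed under "Production Considerations" before it can act.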

app.py CHANGED

@@ -4,9 +4,14 @@ import numpy as np
 import time
 import threading
 import json
 from datetime import datetime
-# Client Simulator Class (moved to top)
 class ClientSimulator:
     def __init__(self, server_url):
         self.server_url = server_url
@@ -14,18 +19,21 @@ class ClientSimulator:
         self.is_running = False
         self.thread = None
         self.last_update = "Never"
     def start(self):
         self.is_running = True
         self.thread = threading.Thread(target=self._run_client, daemon=True)
         self.thread.start()
     def stop(self):
         self.is_running = False
     def _run_client(self):
         try:
-            # Register with server
             client_info = {
                 'dataset_size': 100,
                 'model_params': 10000,
@@ -33,9 +41,11 @@ class ClientSimulator:
             }
             resp = requests.post(f"{self.server_url}/register",
-                               json={'client_id': self.client_id, 'client_info': client_info})
             if resp.status_code == 200:
                 st.session_state.training_history.append({
                     'round': 0,
                     'active_clients': 1,
@@ -43,15 +53,13 @@ class ClientSimulator:
                     'timestamp': datetime.now()
                 })
-                # Simulate client participation
                 while self.is_running:
                     try:
-                        # Get training status
-                        status = requests.get(f"{self.server_url}/training_status")
                         if status.status_code == 200:
                             data = status.json()
-                            # Update training history
                             st.session_state.training_history.append({
                                 'round': data.get('current_round', 0),
                                 'active_clients': data.get('active_clients', 0),
@@ -59,56 +67,127 @@ class ClientSimulator:
                                 'timestamp': datetime.now()
                             })
-                            # Keep only last 50 entries
                             if len(st.session_state.training_history) > 50:
                                 st.session_state.training_history = st.session_state.training_history[-50:]
-                        time.sleep(5)  # Check every 5 seconds
                     except Exception as e:
-                        print(f"Client simulator error: {e}")
                         time.sleep(10)
         except Exception as e:
-            print(f"Failed to start client simulator: {e}")
             self.is_running = False
 st.set_page_config(page_title="Federated Credit Scoring Demo", layout="centered")
-st.title("Federated Credit Scoring Demo (Federated Learning)")
 # Sidebar configuration
 st.sidebar.header("Configuration")
 SERVER_URL = st.sidebar.text_input("Server URL", value="http://localhost:8080")
-DEMO_MODE = st.sidebar.checkbox("Demo Mode (No Server Required)", value=True)
 # Initialize session state
 if 'client_simulator' not in st.session_state:
     st.session_state.client_simulator = None
 if 'training_history' not in st.session_state:
     st.session_state.training_history = []
-st.markdown("""
-This demo shows how multiple banks can collaboratively train a credit scoring model using federated learning, without sharing raw data.
-Enter customer features below to get a credit score prediction from the federated model.
-""")
-# --- Client Simulator ---
 st.sidebar.header("Client Simulator")
-if st.sidebar.button("Start Client Simulator"):
     if not DEMO_MODE:
-        st.session_state.client_simulator = ClientSimulator(SERVER_URL)
-        st.session_state.client_simulator.start()
-        st.sidebar.success("Client simulator started!")
     else:
-        st.sidebar.warning("Client simulator only works in Real Mode")
-if st.sidebar.button("Stop Client Simulator"):
     if st.session_state.client_simulator:
         st.session_state.client_simulator.stop()
         st.session_state.client_simulator = None
-        st.sidebar.success("Client simulator stopped!")
-# --- Feature Input Form ---
 st.header("Enter Customer Features")
 with st.form("feature_form"):
     features = []
@@ -119,124 +198,103 @@ with st.form("feature_form"):
             features.append(val)
     submitted = st.form_submit_button("Predict Credit Score")
-# --- Prediction ---
 if submitted:
     if DEMO_MODE:
-        # Demo mode - simulate prediction
-        with st.spinner("Processing prediction..."):
-            time.sleep(1)  # Simulate processing time
-        # Simple demo prediction based on feature values
-        demo_prediction = sum(features) / len(features) * 100 + 500  # Scale to credit score range
-        st.success(f"Demo Prediction: Credit Score = {demo_prediction:.2f}")
-        st.info("💡 This is a demo prediction. In a real federated system, this would come from the trained model.")
-        # Show what would happen in real mode
-        st.markdown("---")
-        st.markdown("**What happens in real federated learning:**")
-        st.markdown("1. Your features are sent to the federated server")
-        st.markdown("2. Server uses the global model (trained by multiple banks)")
-        st.markdown("3. Prediction is returned without exposing any bank's data")
     else:
-        # Real mode - connect to server
         try:
-            with st.spinner("Connecting to federated server..."):
                 resp = requests.post(f"{SERVER_URL}/predict", json={"features": features}, timeout=10)
             if resp.status_code == 200:
                 prediction = resp.json().get("prediction")
                 st.success(f"Predicted Credit Score: {prediction:.2f}")
-                st.info("🎯 This prediction comes from the federated model trained by multiple banks!")
             else:
-                st.error(f"Prediction failed: {resp.json().get('error', 'Unknown error')}")
         except Exception as e:
-            st.error(f"Error connecting to server: {e}")
-            st.info("💡 Try enabling Demo Mode to see the interface without a server.")
-# --- Training Progress ---
-st.header("Federated Training Progress")
 if DEMO_MODE:
-    # Demo training progress
     col1, col2, col3, col4 = st.columns(4)
     with col1:
-        st.metric("Current Round", "3/10")
     with col2:
-        st.metric("Active Clients", "3")
     with col3:
-        st.metric("Model Accuracy", "85.2%")
     with col4:
-        st.metric("Training Status", "Active")
-    st.info("💡 Demo mode showing simulated training progress. In real federated learning, multiple banks would be training collaboratively.")
 else:
-    # Real training progress
     try:
         status = requests.get(f"{SERVER_URL}/training_status", timeout=5)
         if status.status_code == 200:
             data = status.json()
             col1, col2, col3, col4 = st.columns(4)
             with col1:
-                st.metric("Current Round", f"{data.get('current_round', 0)}/{data.get('total_rounds', 10)}")
             with col2:
-                st.metric("Active Clients", data.get('active_clients', 0))
             with col3:
-                st.metric("Clients Ready", data.get('clients_ready', 0))
             with col4:
-                st.metric("Training Status", "Active" if data.get('training_active', False) else "Inactive")
-            # Show training history
-            if st.session_state.training_history:
-                st.subheader("Training History")
-                history_df = st.session_state.training_history
-                st.line_chart(history_df.set_index('round')[['active_clients', 'clients_ready']])
         else:
-            st.warning("Could not fetch training status.")
     except Exception as e:
-        st.warning(f"Could not connect to server for training status: {e}")
-# --- Server Health Check ---
-if not DEMO_MODE:
-    st.header("Server Health")
-    try:
-        health = requests.get(f"{SERVER_URL}/health", timeout=5)
-        if health.status_code == 200:
-            health_data = health.json()
-            st.success(f"✅ Server is healthy")
-            st.json(health_data)
-        else:
-            st.error("❌ Server health check failed")
-    except Exception as e:
-        st.error(f"❌ Cannot connect to server: {e}")
-# --- How it works ---
-st.header("How Federated Learning Works")
-st.markdown("""
-**Traditional ML:** All banks send their data to a central server → Privacy risk ❌
-**Federated Learning:**
-1. Each bank keeps their data locally ✅
-2. Banks train models on their own data ✅
-3. Only model updates (not data) are shared ✅
-4. Server aggregates updates to create global model ✅
-5. Global model is distributed back to all banks ✅
-**Result:** Collaborative learning without data sharing! 🎯
-""")
-# --- Client Simulator Status ---
 if st.session_state.client_simulator and not DEMO_MODE:
-    st.header("Client Simulator Status")
     if st.session_state.client_simulator.is_running:
-        st.success("🟢 Client simulator is running and participating in federated learning")
-        st.info(f"Client ID: {st.session_state.client_simulator.client_id}")
-        st.info(f"Last update: {st.session_state.client_simulator.last_update}")
     else:
-        st.warning("🔴 Client simulator is not running")
-st.markdown("---")
-st.markdown("""
-*This is a demonstration of federated learning concepts. For full functionality, run the federated server and clients locally.*
-""")

 import time
 import threading
 import json
+import logging
 from datetime import datetime
+# Configure logging
+logging.basicConfig(level=logging.DEBUG)
+logger = logging.getLogger(__name__)
+# Client Simulator Class
 class ClientSimulator:
     def __init__(self, server_url):
         self.server_url = server_url
         self.is_running = False
         self.thread = None
         self.last_update = "Never"
+        self.last_error = None
     def start(self):
         self.is_running = True
         self.thread = threading.Thread(target=self._run_client, daemon=True)
         self.thread.start()
+        logger.info(f"Client simulator started for {self.server_url}")
     def stop(self):
         self.is_running = False
+        logger.info("Client simulator stopped")
     def _run_client(self):
         try:
+            logger.info(f"Attempting to register client {self.client_id} with server {self.server_url}")
             client_info = {
                 'dataset_size': 100,
                 'model_params': 10000,
             }
             resp = requests.post(f"{self.server_url}/register",
+                               json={'client_id': self.client_id, 'client_info': client_info},
+                               timeout=10)
             if resp.status_code == 200:
+                logger.info(f"Successfully registered client {self.client_id}")
                 st.session_state.training_history.append({
                     'round': 0,
                     'active_clients': 1,
                     'timestamp': datetime.now()
                 })
                 while self.is_running:
                     try:
+                        logger.debug(f"Checking training status from {self.server_url}/training_status")
+                        status = requests.get(f"{self.server_url}/training_status", timeout=5)
                         if status.status_code == 200:
                             data = status.json()
+                            logger.debug(f"Training status: {data}")
                             st.session_state.training_history.append({
                                 'round': data.get('current_round', 0),
                                 'active_clients': data.get('active_clients', 0),
                                 'timestamp': datetime.now()
                             })
                             if len(st.session_state.training_history) > 50:
                                 st.session_state.training_history = st.session_state.training_history[-50:]
+                        else:
+                            logger.warning(f"Training status returned {status.status_code}: {status.text}")
+                        time.sleep(5)
+                    except requests.exceptions.Timeout:
+                        logger.warning("Timeout while checking training status")
+                        self.last_error = "Timeout connecting to server"
+                        time.sleep(10)
+                    except requests.exceptions.ConnectionError as e:
+                        logger.error(f"Connection error while checking training status: {e}")
+                        self.last_error = f"Connection error: {e}"
+                        time.sleep(10)
                     except Exception as e:
+                        logger.error(f"Unexpected error in client simulator: {e}")
+                        self.last_error = f"Unexpected error: {e}"
                         time.sleep(10)
+        except requests.exceptions.ConnectionError as e:
+            logger.error(f"Failed to connect to server {self.server_url}: {e}")
+            self.last_error = f"Failed to connect to server: {e}"
+            self.is_running = False
         except Exception as e:
+            logger.error(f"Failed to start client simulator: {e}")
+            self.last_error = f"Failed to start: {e}"
             self.is_running = False
+def check_server_health(server_url):
+    """Check if server is reachable and healthy"""
+    try:
+        logger.debug(f"Checking server health at {server_url}/health")
+        resp = requests.get(f"{server_url}/health", timeout=5)
+        if resp.status_code == 200:
+            logger.info("Server is healthy")
+            return True, resp.json()
+        else:
+            logger.warning(f"Server health check returned {resp.status_code}")
+            return False, f"HTTP {resp.status_code}: {resp.text}"
+    except requests.exceptions.Timeout:
+        logger.error("Server health check timeout")
+        return False, "Timeout"
+    except requests.exceptions.ConnectionError as e:
+        logger.error(f"Server health check connection error: {e}")
+        return False, f"Connection refused: {e}"
+    except Exception as e:
+        logger.error(f"Server health check unexpected error: {e}")
+        return False, f"Unexpected error: {e}"
 st.set_page_config(page_title="Federated Credit Scoring Demo", layout="centered")
+st.title("Federated Credit Scoring Demo")
 # Sidebar configuration
 st.sidebar.header("Configuration")
 SERVER_URL = st.sidebar.text_input("Server URL", value="http://localhost:8080")
+DEMO_MODE = st.sidebar.checkbox("Demo Mode", value=True)
 # Initialize session state
 if 'client_simulator' not in st.session_state:
     st.session_state.client_simulator = None
 if 'training_history' not in st.session_state:
     st.session_state.training_history = []
+if 'debug_messages' not in st.session_state:
+    st.session_state.debug_messages = []
+# Debug section in sidebar
+with st.sidebar.expander("Debug Information"):
+    st.write("**Server Status:**")
+    if not DEMO_MODE:
+        is_healthy, health_info = check_server_health(SERVER_URL)
+        if is_healthy:
+            st.success("✅ Server is healthy")
+            st.json(health_info)
+        else:
+            st.error(f"❌ Server error: {health_info}")
+    st.write("**Recent Logs:**")
+    if st.session_state.debug_messages:
+        for msg in st.session_state.debug_messages[-5:]:  # Show last 5 messages
+            st.text(msg)
+    else:
+        st.text("No debug messages yet")
+    if st.button("Clear Debug Logs"):
+        st.session_state.debug_messages = []
+# Sidebar educational content
+with st.sidebar.expander("About Federated Learning"):
+    st.markdown("""
+    **Traditional ML:** Banks send data to central server → Privacy risk
+    **Federated Learning:**
+    - Banks keep data locally
+    - Only model updates are shared
+    - Collaborative learning without data sharing
+    """)
+# Client Simulator in sidebar
 st.sidebar.header("Client Simulator")
+if st.sidebar.button("Start Client"):
     if not DEMO_MODE:
+        try:
+            st.session_state.client_simulator = ClientSimulator(SERVER_URL)
+            st.session_state.client_simulator.start()
+            st.sidebar.success("Client started!")
+            st.session_state.debug_messages.append(f"{datetime.now()}: Client simulator started")
+        except Exception as e:
+            st.sidebar.error(f"Failed to start client: {e}")
+            st.session_state.debug_messages.append(f"{datetime.now()}: Failed to start client - {e}")
     else:
+        st.sidebar.warning("Only works in Real Mode")
+if st.sidebar.button("Stop Client"):
     if st.session_state.client_simulator:
         st.session_state.client_simulator.stop()
         st.session_state.client_simulator = None
+        st.sidebar.success("Client stopped!")
+        st.session_state.debug_messages.append(f"{datetime.now()}: Client simulator stopped")
+# Main content - focused on core functionality
 st.header("Enter Customer Features")
 with st.form("feature_form"):
     features = []
             features.append(val)
     submitted = st.form_submit_button("Predict Credit Score")
+# Prediction results
 if submitted:
+    logger.info(f"Prediction requested with {len(features)} features")
     if DEMO_MODE:
+        with st.spinner("Processing..."):
+            time.sleep(1)
+        demo_prediction = sum(features) / len(features) * 100 + 500
+        st.success(f"Predicted Credit Score: {demo_prediction:.2f}")
+        st.session_state.debug_messages.append(f"{datetime.now()}: Demo prediction: {demo_prediction:.2f}")
     else:
         try:
+            logger.info(f"Sending prediction request to {SERVER_URL}/predict")
+            with st.spinner("Connecting to server..."):
                 resp = requests.post(f"{SERVER_URL}/predict", json={"features": features}, timeout=10)
             if resp.status_code == 200:
                 prediction = resp.json().get("prediction")
                 st.success(f"Predicted Credit Score: {prediction:.2f}")
+                st.session_state.debug_messages.append(f"{datetime.now()}: Real prediction: {prediction:.2f}")
+                logger.info(f"Prediction successful: {prediction}")
             else:
+                error_msg = f"Prediction failed: {resp.json().get('error', 'Unknown error')}"
+                st.error(error_msg)
+                st.session_state.debug_messages.append(f"{datetime.now()}: {error_msg}")
+                logger.error(f"Prediction failed with status {resp.status_code}: {resp.text}")
+        except requests.exceptions.Timeout:
+            error_msg = "Timeout connecting to server"
+            st.error(error_msg)
+            st.session_state.debug_messages.append(f"{datetime.now()}: {error_msg}")
+            logger.error("Prediction request timeout")
+        except requests.exceptions.ConnectionError as e:
+            error_msg = f"Connection error: {e}"
+            st.error(error_msg)
+            st.session_state.debug_messages.append(f"{datetime.now()}: {error_msg}")
+            logger.error(f"Prediction connection error: {e}")
         except Exception as e:
+            error_msg = f"Unexpected error: {e}"
+            st.error(error_msg)
+            st.session_state.debug_messages.append(f"{datetime.now()}: {error_msg}")
+            logger.error(f"Prediction unexpected error: {e}")
+# Training progress - simplified
+st.header("Training Progress")
 if DEMO_MODE:
     col1, col2, col3, col4 = st.columns(4)
     with col1:
+        st.metric("Round", "3/10")
     with col2:
+        st.metric("Clients", "3")
     with col3:
+        st.metric("Accuracy", "85.2%")
     with col4:
+        st.metric("Status", "Active")
 else:
     try:
+        logger.debug(f"Fetching training status from {SERVER_URL}/training_status")
         status = requests.get(f"{SERVER_URL}/training_status", timeout=5)
         if status.status_code == 200:
             data = status.json()
+            logger.debug(f"Training status received: {data}")
             col1, col2, col3, col4 = st.columns(4)
             with col1:
+                st.metric("Round", f"{data.get('current_round', 0)}/{data.get('total_rounds', 10)}")
             with col2:
+                st.metric("Clients", data.get('active_clients', 0))
             with col3:
+                st.metric("Ready", data.get('clients_ready', 0))
             with col4:
+                st.metric("Status", "Active" if data.get('training_active', False) else "Inactive")
         else:
+            error_msg = f"Training status failed: HTTP {status.status_code}"
+            st.warning(error_msg)
+            st.session_state.debug_messages.append(f"{datetime.now()}: {error_msg}")
+            logger.warning(f"Training status returned {status.status_code}: {status.text}")
+    except requests.exceptions.Timeout:
+        error_msg = "Training status timeout"
+        st.warning(error_msg)
+        st.session_state.debug_messages.append(f"{datetime.now()}: {error_msg}")
+        logger.warning("Training status request timeout")
+    except requests.exceptions.ConnectionError as e:
+        error_msg = f"Training status connection error: {e}"
+        st.warning(error_msg)
+        st.session_state.debug_messages.append(f"{datetime.now()}: {error_msg}")
+        logger.error(f"Training status connection error: {e}")
     except Exception as e:
+        error_msg = f"Training status unexpected error: {e}"
+        st.warning(error_msg)
+        st.session_state.debug_messages.append(f"{datetime.now()}: {error_msg}")
+        logger.error(f"Training status unexpected error: {e}")
+# Client status in sidebar
 if st.session_state.client_simulator and not DEMO_MODE:
+    st.sidebar.header("Client Status")
     if st.session_state.client_simulator.is_running:
+        st.sidebar.success("Connected")
+        st.sidebar.info(f"ID: {st.session_state.client_simulator.client_id}")
+        if st.session_state.client_simulator.last_error:
+            st.sidebar.error(f"Last Error: {st.session_state.client_simulator.last_error}")
     else:
+        st.sidebar.warning("Disconnected")
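
Review note on `ClientSimulator._run_client`: the worker thread appends to `st.session_state.training_history` directly. Streamlit's documentation cautions that `st.session_state` is tied to the script thread's run context, so writes from a background thread may fail or be lost. One possible alternative, sketched below under that assumption, is to let the thread write to its own lock-protected buffer and have the script copy it on each rerun. `HistoryBuffer` is a hypothetical helper, not part of this commit; its 50-entry cap mirrors the existing trimming logic.

```python
import threading
from collections import deque
from datetime import datetime


class HistoryBuffer:
    """Thread-safe, bounded store for training-status snapshots (hypothetical helper)."""

    def __init__(self, max_entries=50):
        self._lock = threading.Lock()
        # deque(maxlen=...) silently drops the oldest entry once the cap is reached.
        self._entries = deque(maxlen=max_entries)

    def append(self, round_number, active_clients):
        """Record one status snapshot; safe to call from the worker thread."""
        entry = {
            "round": round_number,
            "active_clients": active_clients,
            "timestamp": datetime.now(),
        }
        with self._lock:
            self._entries.append(entry)

    def snapshot(self):
        """Return a copy of the entries; safe to call from the script thread."""
        with self._lock:
            return list(self._entries)
```

In this arrangement the simulator would hold a `HistoryBuffer` instance and call `append(...)` inside its polling loop, and the main script would assign `st.session_state.training_history = simulator.history.snapshot()` on each rerun (the `history` attribute name is likewise an assumption). Only the script thread then touches `st.session_state`.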

webapp/streamlit_app.py CHANGED

@@ -4,9 +4,14 @@ import numpy as np
 import time
 import threading
 import json
 from datetime import datetime
-# Client Simulator Class (moved to top)
 class ClientSimulator:
     def __init__(self, server_url):
         self.server_url = server_url
@@ -14,18 +19,21 @@ class ClientSimulator:
         self.is_running = False
         self.thread = None
         self.last_update = "Never"
     def start(self):
         self.is_running = True
         self.thread = threading.Thread(target=self._run_client, daemon=True)
         self.thread.start()
     def stop(self):
         self.is_running = False
     def _run_client(self):
         try:
-            # Register with server
             client_info = {
                 'dataset_size': 100,
                 'model_params': 10000,
@@ -33,9 +41,11 @@ class ClientSimulator:
             }
             resp = requests.post(f"{self.server_url}/register",
-                               json={'client_id': self.client_id, 'client_info': client_info})
             if resp.status_code == 200:
                 st.session_state.training_history.append({
                     'round': 0,
                     'active_clients': 1,
@@ -43,15 +53,13 @@ class ClientSimulator:
                     'timestamp': datetime.now()
                 })
-                # Simulate client participation
                 while self.is_running:
                     try:
-                        # Get training status
-                        status = requests.get(f"{self.server_url}/training_status")
                         if status.status_code == 200:
                             data = status.json()
-                            # Update training history
                             st.session_state.training_history.append({
                                 'round': data.get('current_round', 0),
                                 'active_clients': data.get('active_clients', 0),
@@ -59,56 +67,127 @@ class ClientSimulator:
                                 'timestamp': datetime.now()
                             })
-                            # Keep only last 50 entries
                             if len(st.session_state.training_history) > 50:
                                 st.session_state.training_history = st.session_state.training_history[-50:]
-                        time.sleep(5)  # Check every 5 seconds
                     except Exception as e:
-                        print(f"Client simulator error: {e}")
                         time.sleep(10)
         except Exception as e:
-            print(f"Failed to start client simulator: {e}")
             self.is_running = False
 st.set_page_config(page_title="Federated Credit Scoring Demo", layout="centered")
-st.title("Federated Credit Scoring Demo (Federated Learning)")
 # Sidebar configuration
 st.sidebar.header("Configuration")
 SERVER_URL = st.sidebar.text_input("Server URL", value="http://localhost:8080")
-DEMO_MODE = st.sidebar.checkbox("Demo Mode (No Server Required)", value=False)
 # Initialize session state
 if 'client_simulator' not in st.session_state:
     st.session_state.client_simulator = None
 if 'training_history' not in st.session_state:
     st.session_state.training_history = []
-st.markdown("""
-This demo shows how multiple banks can collaboratively train a credit scoring model using federated learning, without sharing raw data.
-Enter customer features below to get a credit score prediction from the federated model.
-""")
-# --- Client Simulator ---
 st.sidebar.header("Client Simulator")
-if st.sidebar.button("Start Client Simulator"):
     if not DEMO_MODE:
-        st.session_state.client_simulator = ClientSimulator(SERVER_URL)
-        st.session_state.client_simulator.start()
-        st.sidebar.success("Client simulator started!")
     else:
-        st.sidebar.warning("Client simulator only works in Real Mode")
-if st.sidebar.button("Stop Client Simulator"):
     if st.session_state.client_simulator:
         st.session_state.client_simulator.stop()
         st.session_state.client_simulator = None
-        st.sidebar.success("Client simulator stopped!")
-# --- Feature Input Form ---
 st.header("Enter Customer Features")
 with st.form("feature_form"):
     features = []
@@ -119,124 +198,103 @@ with st.form("feature_form"):
             features.append(val)
     submitted = st.form_submit_button("Predict Credit Score")
-# --- Prediction ---
 if submitted:
     if DEMO_MODE:
-        # Demo mode - simulate prediction
-        with st.spinner("Processing prediction..."):
-            time.sleep(1)  # Simulate processing time
-        # Simple demo prediction based on feature values
-        demo_prediction = sum(features) / len(features) * 100 + 500  # Scale to credit score range
-        st.success(f"Demo Prediction: Credit Score = {demo_prediction:.2f}")
-        st.info("💡 This is a demo prediction. In a real federated system, this would come from the trained model.")
-        # Show what would happen in real mode
-        st.markdown("---")
-        st.markdown("**What happens in real federated learning:**")
-        st.markdown("1. Your features are sent to the federated server")
-        st.markdown("2. Server uses the global model (trained by multiple banks)")
-        st.markdown("3. Prediction is returned without exposing any bank's data")
     else:
-        # Real mode - connect to server
         try:
-            with st.spinner("Connecting to federated server..."):
                 resp = requests.post(f"{SERVER_URL}/predict", json={"features": features}, timeout=10)
             if resp.status_code == 200:
                 prediction = resp.json().get("prediction")
                 st.success(f"Predicted Credit Score: {prediction:.2f}")
-                st.info("🎯 This prediction comes from the federated model trained by multiple banks!")
             else:
-                st.error(f"Prediction failed: {resp.json().get('error', 'Unknown error')}")
         except Exception as e:
-            st.error(f"Error connecting to server: {e}")
-            st.info("💡 Try enabling Demo Mode to see the interface without a server.")
-# --- Training Progress ---
-st.header("Federated Training Progress")
 if DEMO_MODE:
-    # Demo training progress
     col1, col2, col3, col4 = st.columns(4)
     with col1:
-        st.metric("Current Round", "3/10")
     with col2:
-        st.metric("Active Clients", "3")
     with col3:
-        st.metric("Model Accuracy", "85.2%")
     with col4:
-        st.metric("Training Status", "Active")
-    st.info("💡 Demo mode showing simulated training progress. In real federated learning, multiple banks would be training collaboratively.")
 else:
-    # Real training progress
     try:
         status = requests.get(f"{SERVER_URL}/training_status", timeout=5)
         if status.status_code == 200:
             data = status.json()
             col1, col2, col3, col4 = st.columns(4)
             with col1:
-                st.metric("Current Round", f"{data.get('current_round', 0)}/{data.get('total_rounds', 10)}")
             with col2:
-                st.metric("Active Clients", data.get('active_clients', 0))
             with col3:
-                st.metric("Clients Ready", data.get('clients_ready', 0))
             with col4:
-                st.metric("Training Status", "Active" if data.get('training_active', False) else "Inactive")
-            # Show training history
-            if st.session_state.training_history:
-                st.subheader("Training History")
-                history_df = st.session_state.training_history
-                st.line_chart(history_df.set_index('round')[['active_clients', 'clients_ready']])
         else:
-            st.warning("Could not fetch training status.")
     except Exception as e:
-        st.warning(f"Could not connect to server for training status: {e}")
-# --- Server Health Check ---
-if not DEMO_MODE:
-    st.header("Server Health")
-    try:
-        health = requests.get(f"{SERVER_URL}/health", timeout=5)
-        if health.status_code == 200:
-            health_data = health.json()
-            st.success(f"✅ Server is healthy")
-            st.json(health_data)
-        else:
-            st.error("❌ Server health check failed")
-    except Exception as e:
-        st.error(f"❌ Cannot connect to server: {e}")
-# --- How it works ---
-st.header("How Federated Learning Works")
-st.markdown("""
-**Traditional ML:** All banks send their data to a central server → Privacy risk ❌
-**Federated Learning:**
-1. Each bank keeps their data locally ✅
-2. Banks train models on their own data ✅
-3. Only model updates (not data) are shared ✅
-4. Server aggregates updates to create global model ✅
-5. Global model is distributed back to all banks ✅
-**Result:** Collaborative learning without data sharing! 🎯
-""")
-# --- Client Simulator Status ---
 if st.session_state.client_simulator and not DEMO_MODE:
-    st.header("Client Simulator Status")
     if st.session_state.client_simulator.is_running:
-        st.success("🟢 Client simulator is running and participating in federated learning")
-        st.info(f"Client ID: {st.session_state.client_simulator.client_id}")
-        st.info(f"Last update: {st.session_state.client_simulator.last_update}")
     else:
-        st.warning("🔴 Client simulator is not running")
-st.markdown("---")
-st.markdown("""
-*This is a demonstration of federated learning concepts. For full functionality, run the federated server and clients locally.*
-""")

 import time
 import threading
 import json
+import logging
 from datetime import datetime
+# Configure logging
+logging.basicConfig(level=logging.DEBUG)
+logger = logging.getLogger(__name__)
+# Client Simulator Class
 class ClientSimulator:
     def __init__(self, server_url):
         self.server_url = server_url
         self.is_running = False
         self.thread = None
         self.last_update = "Never"
+        self.last_error = None
     def start(self):
         self.is_running = True
         self.thread = threading.Thread(target=self._run_client, daemon=True)
         self.thread.start()
+        logger.info(f"Client simulator started for {self.server_url}")
     def stop(self):
         self.is_running = False
+        logger.info("Client simulator stopped")
     def _run_client(self):
         try:
+            logger.info(f"Attempting to register client {self.client_id} with server {self.server_url}")
             client_info = {
                 'dataset_size': 100,
                 'model_params': 10000,
             }
             resp = requests.post(f"{self.server_url}/register",
+                               json={'client_id': self.client_id, 'client_info': client_info},
+                               timeout=10)
             if resp.status_code == 200:
+                logger.info(f"Successfully registered client {self.client_id}")
                 st.session_state.training_history.append({
                     'round': 0,
                     'active_clients': 1,
                     'timestamp': datetime.now()
                 })
                 while self.is_running:
                     try:
+                        logger.debug(f"Checking training status from {self.server_url}/training_status")
+                        status = requests.get(f"{self.server_url}/training_status", timeout=5)
                         if status.status_code == 200:
                             data = status.json()
+                            logger.debug(f"Training status: {data}")
                             st.session_state.training_history.append({
                                 'round': data.get('current_round', 0),
                                 'active_clients': data.get('active_clients', 0),
                                 'timestamp': datetime.now()
                             })
                             if len(st.session_state.training_history) > 50:
                                 st.session_state.training_history = st.session_state.training_history[-50:]
+                        else:
+                            logger.warning(f"Training status returned {status.status_code}: {status.text}")
+                        time.sleep(5)
+                    except requests.exceptions.Timeout:
+                        logger.warning("Timeout while checking training status")
+                        self.last_error = "Timeout connecting to server"
+                        time.sleep(10)
+                    except requests.exceptions.ConnectionError as e:
+                        logger.error(f"Connection error while checking training status: {e}")
+                        self.last_error = f"Connection error: {e}"
+                        time.sleep(10)
                     except Exception as e:
+                        logger.error(f"Unexpected error in client simulator: {e}")
+                        self.last_error = f"Unexpected error: {e}"
                         time.sleep(10)
+        except requests.exceptions.ConnectionError as e:
+            logger.error(f"Failed to connect to server {self.server_url}: {e}")
+            self.last_error = f"Failed to connect to server: {e}"
+            self.is_running = False
         except Exception as e:
+            logger.error(f"Failed to start client simulator: {e}")
+            self.last_error = f"Failed to start: {e}"
             self.is_running = False
+def check_server_health(server_url):
+    """Check if server is reachable and healthy"""
+    try:
+        logger.debug(f"Checking server health at {server_url}/health")
+        resp = requests.get(f"{server_url}/health", timeout=5)
+        if resp.status_code == 200:
+            logger.info("Server is healthy")
+            return True, resp.json()
+        else:
+            logger.warning(f"Server health check returned {resp.status_code}")
+            return False, f"HTTP {resp.status_code}: {resp.text}"
+    except requests.exceptions.Timeout:
+        logger.error("Server health check timeout")
+        return False, "Timeout"
+    except requests.exceptions.ConnectionError as e:
+        logger.error(f"Server health check connection error: {e}")
+        return False, f"Connection refused: {e}"
+    except Exception as e:
+        logger.error(f"Server health check unexpected error: {e}")
+        return False, f"Unexpected error: {e}"
 st.set_page_config(page_title="Federated Credit Scoring Demo", layout="centered")
+st.title("Federated Credit Scoring Demo")
 # Sidebar configuration
 st.sidebar.header("Configuration")
 SERVER_URL = st.sidebar.text_input("Server URL", value="http://localhost:8080")
+DEMO_MODE = st.sidebar.checkbox("Demo Mode", value=True)
 # Initialize session state
 if 'client_simulator' not in st.session_state:
     st.session_state.client_simulator = None
 if 'training_history' not in st.session_state:
     st.session_state.training_history = []
+if 'debug_messages' not in st.session_state:
+    st.session_state.debug_messages = []
+# Debug section in sidebar
+with st.sidebar.expander("Debug Information"):
+    st.write("**Server Status:**")
+    if not DEMO_MODE:
+        is_healthy, health_info = check_server_health(SERVER_URL)
+        if is_healthy:
+            st.success("✅ Server is healthy")
+            st.json(health_info)
+        else:
+            st.error(f"❌ Server error: {health_info}")
+    st.write("**Recent Logs:**")
+    if st.session_state.debug_messages:
+        for msg in st.session_state.debug_messages[-5:]:  # Show last 5 messages
+            st.text(msg)
+    else:
+        st.text("No debug messages yet")
+    if st.button("Clear Debug Logs"):
+        st.session_state.debug_messages = []
+# Sidebar educational content
+with st.sidebar.expander("About Federated Learning"):
+    st.markdown("""
+    **Traditional ML:** Banks send data to central server → Privacy risk
+    **Federated Learning:**
+    - Banks keep data locally
+    - Only model updates are shared
+    - Collaborative learning without data sharing
+    """)
+# Client Simulator in sidebar
 st.sidebar.header("Client Simulator")
+if st.sidebar.button("Start Client"):
     if not DEMO_MODE:
+        try:
+            st.session_state.client_simulator = ClientSimulator(SERVER_URL)
+            st.session_state.client_simulator.start()
+            st.sidebar.success("Client started!")
+            st.session_state.debug_messages.append(f"{datetime.now()}: Client simulator started")
+        except Exception as e:
+            st.sidebar.error(f"Failed to start client: {e}")
+            st.session_state.debug_messages.append(f"{datetime.now()}: Failed to start client - {e}")
     else:
+        st.sidebar.warning("Only works in Real Mode")
+if st.sidebar.button("Stop Client"):
     if st.session_state.client_simulator:
         st.session_state.client_simulator.stop()
         st.session_state.client_simulator = None
+        st.sidebar.success("Client stopped!")
+        st.session_state.debug_messages.append(f"{datetime.now()}: Client simulator stopped")
+# Main content - focused on core functionality
 st.header("Enter Customer Features")
 with st.form("feature_form"):
     features = []
             features.append(val)
     submitted = st.form_submit_button("Predict Credit Score")
+# Prediction results
 if submitted:
+    logger.info(f"Prediction requested with {len(features)} features")
     if DEMO_MODE:
+        with st.spinner("Processing..."):
+            time.sleep(1)
+        demo_prediction = sum(features) / len(features) * 100 + 500
+        st.success(f"Predicted Credit Score: {demo_prediction:.2f}")
+        st.session_state.debug_messages.append(f"{datetime.now()}: Demo prediction: {demo_prediction:.2f}")
     else:
         try:
+            logger.info(f"Sending prediction request to {SERVER_URL}/predict")
+            with st.spinner("Connecting to server..."):
                 resp = requests.post(f"{SERVER_URL}/predict", json={"features": features}, timeout=10)
             if resp.status_code == 200:
                 prediction = resp.json().get("prediction")
                 st.success(f"Predicted Credit Score: {prediction:.2f}")
+                st.session_state.debug_messages.append(f"{datetime.now()}: Real prediction: {prediction:.2f}")
+                logger.info(f"Prediction successful: {prediction}")
             else:
+                error_msg = f"Prediction failed: {resp.json().get('error', 'Unknown error')}"
+                st.error(error_msg)
+                st.session_state.debug_messages.append(f"{datetime.now()}: {error_msg}")
+                logger.error(f"Prediction failed with status {resp.status_code}: {resp.text}")
+        except requests.exceptions.Timeout:
+            error_msg = "Timeout connecting to server"
+            st.error(error_msg)
+            st.session_state.debug_messages.append(f"{datetime.now()}: {error_msg}")
+            logger.error("Prediction request timeout")
+        except requests.exceptions.ConnectionError as e:
+            error_msg = f"Connection error: {e}"
+            st.error(error_msg)
+            st.session_state.debug_messages.append(f"{datetime.now()}: {error_msg}")
+            logger.error(f"Prediction connection error: {e}")
         except Exception as e:
+            error_msg = f"Unexpected error: {e}"
+            st.error(error_msg)
+            st.session_state.debug_messages.append(f"{datetime.now()}: {error_msg}")
+            logger.error(f"Prediction unexpected error: {e}")
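
Review note: in the success branch above, `resp.json().get("prediction")` returns `None` when the key is absent, and formatting `None` with `:.2f` raises `TypeError`. In the failure branch, `resp.json()` raises when the error body is not JSON. Both cases fall through to the generic `except` and are reported as "Unexpected error". One possible way to separate them is sketched below. `parse_prediction` is a hypothetical helper, and the assumption that the server returns a JSON object with a numeric `prediction` field is inferred from the diff rather than confirmed.

```python
import json


def parse_prediction(status_code, body_text):
    """Hypothetical helper: return (ok, value_or_message)."""
    try:
        payload = json.loads(body_text)
    except (json.JSONDecodeError, TypeError):
        # Body was empty, not text, or not valid JSON.
        payload = None

    if status_code != 200:
        # Prefer the server's own error field when the body is a JSON object.
        if isinstance(payload, dict) and "error" in payload:
            return False, f"Prediction failed: {payload['error']}"
        return False, f"Prediction failed: HTTP {status_code}"

    if not isinstance(payload, dict):
        return False, "Prediction failed: response was not a JSON object"

    value = payload.get("prediction")
    # bool is a subclass of int, so exclude it explicitly.
    if isinstance(value, bool) or not isinstance(value, (int, float)):
        return False, "Prediction failed: missing or non-numeric 'prediction'"
    return True, float(value)
```

With `requests`, the call site would pass `resp.status_code` and `resp.text`, then show the result with `st.success` or `st.error` depending on the first element of the tuple.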
+# Training progress - simplified
+st.header("Training Progress")
 if DEMO_MODE:
     col1, col2, col3, col4 = st.columns(4)
     with col1:
+        st.metric("Round", "3/10")
     with col2:
+        st.metric("Clients", "3")
     with col3:
+        st.metric("Accuracy", "85.2%")
     with col4:
+        st.metric("Status", "Active")
 else:
     try:
+        logger.debug(f"Fetching training status from {SERVER_URL}/training_status")
         status = requests.get(f"{SERVER_URL}/training_status", timeout=5)
         if status.status_code == 200:
             data = status.json()
+            logger.debug(f"Training status received: {data}")
             col1, col2, col3, col4 = st.columns(4)
             with col1:
+                st.metric("Round", f"{data.get('current_round', 0)}/{data.get('total_rounds', 10)}")
             with col2:
+                st.metric("Clients", data.get('active_clients', 0))
             with col3:
+                st.metric("Ready", data.get('clients_ready', 0))
             with col4:
+                st.metric("Status", "Active" if data.get('training_active', False) else "Inactive")
         else:
+            error_msg = f"Training status failed: HTTP {status.status_code}"
+            st.warning(error_msg)
+            st.session_state.debug_messages.append(f"{datetime.now()}: {error_msg}")
+            logger.warning(f"Training status returned {status.status_code}: {status.text}")
+    except requests.exceptions.Timeout:
+        error_msg = "Training status timeout"
+        st.warning(error_msg)
+        st.session_state.debug_messages.append(f"{datetime.now()}: {error_msg}")
+        logger.warning("Training status request timeout")
+    except requests.exceptions.ConnectionError as e:
+        error_msg = f"Training status connection error: {e}"
+        st.warning(error_msg)
+        st.session_state.debug_messages.append(f"{datetime.now()}: {error_msg}")
+        logger.error(f"Training status connection error: {e}")
     except Exception as e:
+        error_msg = f"Training status unexpected error: {e}"
+        st.warning(error_msg)
+        st.session_state.debug_messages.append(f"{datetime.now()}: {error_msg}")
+        logger.error(f"Training status unexpected error: {e}")
+# Client status in sidebar
 if st.session_state.client_simulator and not DEMO_MODE:
+    st.sidebar.header("Client Status")
     if st.session_state.client_simulator.is_running:
+        st.sidebar.success("Connected")
+        st.sidebar.info(f"ID: {st.session_state.client_simulator.client_id}")
+        if st.session_state.client_simulator.last_error:
+            st.sidebar.error(f"Last Error: {st.session_state.client_simulator.last_error}")
     else:
+        st.sidebar.warning("Disconnected")
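
Review note: `_run_client` runs in a `threading.Thread` and appends to `st.session_state.training_history` from that thread. As far as I understand Streamlit's threading model, threads created by app code do not carry the script run context, so `st.session_state` may not be reliably usable there. One alternative is to keep the samples on the simulator object behind a lock and let the script copy a snapshot on each rerun. The sketch below assumes a hypothetical `HistoryBuffer` class that is not part of this commit.

```python
import threading
from datetime import datetime


class HistoryBuffer:
    """Hypothetical thread-safe store for status samples; not part of the commit."""

    def __init__(self, max_entries=50):
        if max_entries < 1:
            raise ValueError("max_entries must be at least 1")
        self._lock = threading.Lock()
        self._entries = []
        self._max_entries = max_entries

    def append(self, current_round, active_clients):
        # Called from the worker thread; touches no Streamlit API.
        entry = {
            "round": current_round,
            "active_clients": active_clients,
            "timestamp": datetime.now(),
        }
        with self._lock:
            self._entries.append(entry)
            # Keep only the newest entries, mirroring the 50-entry cap in the diff.
            del self._entries[:-self._max_entries]

    def snapshot(self):
        # Called from the Streamlit script thread; returns a copy.
        with self._lock:
            return list(self._entries)
```

The worker loop would call `append(...)` instead of writing to `st.session_state`, and the main script would read `snapshot()` when rendering, in the same way it already reads `last_error` from the simulator object.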