Spaces:

milwright
/

chatui-helper

Running

App Files Files Community

milwright commited on Jul 15

Commit

1920e6d

verified ·

1 Parent(s): 0b0106d

Delete CLAUDE.md

Browse files

Files changed (1) hide show

CLAUDE.md +0 -296

CLAUDE.md DELETED Viewed

@@ -1,296 +0,0 @@
-# CLAUDE.md
-This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.
-## Project Overview
-Chat UI Helper is a Gradio-based tool for generating and configuring chat interfaces for HuggingFace Spaces. It creates deployable packages with custom assistants, web scraping capabilities, and optional vector RAG functionality.
-## Core Architecture
-### Main Application Flow (`app.py`)
-The application follows a three-tab Gradio interface pattern:
-1. **Configuration Tab**: Space setup, assistant configuration, tool settings
-2. **Sandbox Preview Tab**: Interactive testing with real OpenRouter API integration
-3. **Support Docs Tab**: Comprehensive guidance and templates via `support_docs.py`
-### Template Generation System
-- `SPACE_TEMPLATE` (lines 130-710): Complete HuggingFace Space template with export functionality
-- `generate_zip()` function (lines 869-935): Orchestrates package creation with all dependencies
-- Key template variables: `{system_prompt}`, `{model}`, `{enable_vector_rag}`, `{api_key_var}`, `{grounding_urls}`, `{enable_dynamic_urls}`, `{enable_web_search}`
-### Preview Sandbox Architecture
-- Real OpenRouter API integration in preview mode (`preview_chat_response()` line 1185)
-- URL context testing with dynamic add/remove functionality
-- Configuration-aware responses using exact model and parameters from user configuration
-- Fallback messaging when `OPENROUTER_API_KEY` environment variable not set
-- Legacy tuple format compatibility for Gradio 4.44.1 ChatInterface
-- Comprehensive debugging with enhanced error handling and API response validation
-### Document Processing Pipeline (RAG)
-- **RAGTool** (`rag_tool.py`): Main orchestrator with 10MB file size validation
-- **DocumentProcessor** (`document_processor.py`): PDF/DOCX/TXT/MD parsing with semantic chunking (800 chars, 100 overlap)
-- **VectorStore** (`vector_store.py`): FAISS-based similarity search and base64 serialization
-### Web Scraping Architecture
-Simple HTTP + BeautifulSoup approach with crawl4ai integration:
-- `enhanced_fetch_url_content()` (lines 79-128): Enhanced requests with timeout and user-agent headers
-- Content cleaning: Removes scripts, styles, navigation elements
-- Content limits: ~4000 character truncation for context management
-- URL content caching: `get_cached_grounding_context()` (line 1441) prevents redundant fetches
-- `extract_urls_from_text()` (line 51): Regex-based URL extraction for dynamic fetching
-## Development Commands
-### Environment Setup
-**Important**: This application requires Python ≥3.10 for Gradio 5.x compatibility.
-```bash
-# Recommended: Use Python 3.11+ environment
-python3.11 -m venv venv311
-source venv311/bin/activate  # or venv311\Scripts\activate on Windows
-pip install -r requirements.txt
-```
-### Running the Application
-```bash
-# With virtual environment activated
-python app.py
-```
-### Testing Commands
-```bash
-# Test vector database functionality (requires all RAG dependencies)
-python test_vector_db.py
-# Test RAG fixes and error handling
-python test_rag_fix.py
-# Test OpenRouter API key validation
-python test_api_key.py
-# Test minimal Gradio functionality (for debugging)
-python test_minimal.py
-# Test preview functionality components
-python test_preview.py
-# Test individual RAG components
-python -c "from test_vector_db import test_document_processing; test_document_processing()"
-python -c "from test_vector_db import test_vector_store; test_vector_store()"
-python -c "from test_vector_db import test_rag_tool; test_rag_tool()"
-```
-### Pre-Test Setup for RAG Components
-```bash
-# Create test document for vector database testing
-echo "This is a test document for RAG functionality testing." > test_document.txt
-# Verify all dependencies are installed
-python -c "import sentence_transformers, faiss, fitz; print('RAG dependencies available')"
-```
-## Key Dependencies and Versions
-### Required Dependencies
-- **Gradio ≥4.44.1**: Main UI framework (5.37.0 recommended for Python ≥3.10)
-- **requests ≥2.32.3**: HTTP requests for web content fetching
-- **beautifulsoup4 ≥4.12.3**: HTML parsing for web scraping
-- **python-dotenv ≥1.0.0**: Environment variable management
-### Optional RAG Dependencies
-- **sentence-transformers ≥2.2.2**: Text embeddings
-- **faiss-cpu ==1.7.4**: Vector similarity search
-- **PyMuPDF ≥1.23.0**: PDF text extraction
-- **python-docx ≥0.8.11**: DOCX document processing
-- **numpy ==1.26.4**: Numerical operations
-### Optional Web Search Dependencies
-- **crawl4ai ≥0.2.0**: Advanced web crawling for web search functionality
-- **aiohttp ≥3.8.0**: Async HTTP client for crawl4ai
-## Configuration Patterns
-### Conditional Dependency Loading
-```python
-try:
-    from rag_tool import RAGTool
-    HAS_RAG = True
-except ImportError:
-    HAS_RAG = False
-    RAGTool = None
-```
-This pattern allows graceful degradation when optional vector dependencies are unavailable.
-### Template Variable Substitution
-Generated spaces use these key substitutions:
-- `{system_prompt}`: Combined assistant configuration
-- `{grounding_urls}`: Static URL list for context
-- `{enable_dynamic_urls}`: Runtime URL fetching capability
-- `{enable_vector_rag}`: Document search integration
-- `{enable_web_search}`: Web search integration via crawl4ai
-- `{rag_data_json}`: Serialized embeddings and chunks
-- `{api_key_var}`: Customizable API key environment variable name
-### Access Control Pattern
-- Environment variable `SPACE_ACCESS_CODE` for student access control
-- Global state management for session-based access in generated spaces
-- Security-first approach storing credentials as HuggingFace Spaces secrets
-### RAG Integration Workflow
-1. Documents uploaded through Gradio File component with conditional visibility (`HAS_RAG` flag)
-2. Processed via DocumentProcessor (PDF/DOCX/TXT/MD support) in `process_documents()` function
-3. Chunked and embedded using sentence-transformers (800 chars, 100 overlap)
-4. FAISS index created and serialized to base64 for deployment portability
-5. Embedded in generated template via `{rag_data_json}` template variable
-## Implementation Notes
-### Research Template System (Simplified)
-- **Simple Toggle**: `toggle_research_assistant()` function (line 1704) provides simple on/off functionality
-- **Direct System Prompt**: Enables predefined academic research prompt with DOI verification and LibKey integration
-- **Auto-Enable Dynamic URLs**: Research template automatically enables dynamic URL fetching for academic sources
-- **Template Content**: Academic inquiry focus with DOI-verified sources, fact-checking, and proper citation requirements
-### State Management Across Tabs
-- Extensive use of `gr.State()` for maintaining session data
-- Cross-tab functionality through shared state variables (`sandbox_state`, `preview_config_state`)
-- URL content caching to prevent redundant web requests (`url_content_cache` global variable)
-- Preview debugging with comprehensive error handling and API response validation
-### Gradio Compatibility and Message Format Handling
-- **Target Version**: Gradio 5.37.0 (requires Python ≥3.10)
-- **Legacy Support**: Gradio 4.44.1 compatibility with JSON schema workarounds
-- **Message Format**: Preview uses legacy tuple format `[user_msg, bot_msg]` for ChatInterface compatibility
-- **Generated Spaces**: Use modern dictionary format `{"role": "user", "content": "..."}` for OpenRouter API
-### Security Considerations
-- Never embed API keys or access codes in generated templates
-- Environment variable pattern for all sensitive configuration (`{api_key_var}` template variable)
-- Input validation for uploaded files and URL processing
-- Content length limits for web scraping operations
-## Tool Configuration Changes
-### Code Execution Functionality Removed
-**Important**: Code execution functionality has been completely removed from the application. Do not attempt to re-add it.
-- All `enable_code_execution` parameters and checkboxes have been removed
-- The `toggle_code_execution` function has been removed
-- Code execution logic in preview and generation functions has been removed
-- Generated spaces no longer support code execution capabilities
-### Web Search Integration
-- **Enable Web Search**: Checkbox to enable web search functionality using crawl4ai
-- **Technology**: Uses crawl4ai library with DuckDuckGo for search results
-- **Implementation**: Integrated in both preview mode and generated spaces
-- **Fallback**: Simple HTTP requests if crawl4ai is not available
-## Testing Infrastructure
-### Current Test Structure
-- `test_vector_db.py`: Comprehensive RAG component testing
-- `test_api_key.py`: OpenRouter API validation
-- `test_minimal.py`: Basic Gradio functionality debugging
-- `test_preview.py`: Preview functionality component testing
-### Test Dependencies
-RAG testing requires: `sentence-transformers`, `faiss-cpu`, `PyMuPDF`, `python-docx`
-Core testing requires: `gradio`, `requests`, `beautifulsoup4`, `python-dotenv`
-### Testing Status
-- **Functional**: Four main test files covering core functionality
-- **Usage**: Run individual Python test modules directly
-- **Coverage**: Basic component testing, no automated integration tests
-## Known Issues and Compatibility
-### RAG Processing "Connection errored out" Issue
-- **Issue**: Server crashes or hangs during document processing with "Connection errored out" error
-- **Root Cause**: Memory-intensive embedding model download/initialization causing server timeout
-- **Symptoms**:
-  - `stream.ts:185 Method not implemented.`
-  - `Failed to load resource: net::ERR_INCOMPLETE_CHUNKED_ENCODING`
-  - Server becomes unresponsive during RAG document processing
-- **Solutions**:
-  1. **Use smaller batch sizes**: Reduced from 32 to 16 chunks per batch
-  2. **Improved error handling**: Better feedback for network/memory issues
-  3. **CPU-only processing**: Force CPU usage to avoid GPU/multiprocessing conflicts
-  4. **Environment variables**: Set `TOKENIZERS_PARALLELISM=false` to prevent multiprocessing issues
-  5. **Smaller model**: Default model changed from `sentence-transformers/all-MiniLM-L6-v2` to `all-MiniLM-L6-v2`
-- **Testing**: Run `python test_rag_fix.py` to verify RAG functionality
-- **Prevention**: Process documents one at a time, use smaller files (<5MB)
-### Gradio 4.44.1 JSON Schema Bug
-- **Issue**: TypeError in `json_schema_to_python_type` prevents app startup in some environments
-- **Symptom**: "argument of type 'bool' is not iterable" error during API schema generation
-- **Workaround**: Individual component functions work correctly
-- **Solution**: Upgrade to Gradio 5.x for full compatibility
-### Python Version Requirements
-- **Minimum**: Python 3.9 (for Gradio 4.44.1)
-- **Recommended**: Python 3.11+ (for Gradio 5.x and optimal performance)
-## Common Claude Code Anti-Patterns to Avoid
-### Message Format Reversion
-**❌ Don't revert to:** New dictionary format in preview functions
-```python
-# WRONG - breaks Gradio 4.44.1 ChatInterface
-history.append({"role": "user", "content": message})
-history.append({"role": "assistant", "content": response})
-```
-**✅ Keep:** Legacy tuple format for preview compatibility
-```python
-# CORRECT - works with current Gradio ChatInterface
-history.append([message, response])
-```
-### Template Variable Substitution
-**❌ Don't change:** Template string escaping patterns in `SPACE_TEMPLATE`
-- Keep double backslashes: `\\n\\n` (becomes `\n\n` after Python string processing)
-- Keep double braces: `{{variable}}` (becomes `{variable}` after format())
-- **Reason**: Template undergoes two levels of processing (Python format + HuggingFace deployment)
-### Code Execution Re-Addition
-**❌ Don't re-add:** Code execution functionality has been intentionally removed
-- Do not add `enable_code_execution` parameters back to functions
-- Do not create code execution UI components
-- Do not add code execution logic to preview or generation workflows
-- **Reason**: Code execution functionality was removed by design
-### Conditional Dependency Loading
-**❌ Don't remove:** `HAS_RAG` flag and conditional imports
-```python
-# WRONG - breaks installations without vector dependencies
-from rag_tool import RAGTool
-```
-**✅ Keep:** Graceful degradation pattern
-```python
-# CORRECT - allows app to work without optional dependencies
-try:
-    from rag_tool import RAGTool
-    HAS_RAG = True
-except ImportError:
-    HAS_RAG = False
-    RAGTool = None
-```
-### URL Management and Preview Functionality
-**❌ Don't remove:** Dynamic URL add/remove functionality or real API integration in preview
-- Keep `add_urls()`, `remove_urls()`, `add_chat_urls()`, `remove_chat_urls()` functions
-- Maintain URL count state management with `gr.State()`
-- Keep actual OpenRouter API calls in preview mode when `OPENROUTER_API_KEY` is set
-- **Reason**: Users expect scalable URL input interface and realistic preview testing
-## Development-Only Utilities
-### MCP Servers
-- **Gradio Docs**: Available at https://gradio-docs-mcp.hf.space/gradio_api/mcp/sse
-- Use `gradio_docs.py` utility for development assistance
-- **CRITICAL**: Do NOT import in main application - this is for development tooling only
-Usage for development:
-```bash
-python -c "from gradio_docs import gradio_docs; print(gradio_docs.search_docs('ChatInterface'))"
-```