Spaces:

Agents-MCP-Hackathon
/

DataForge

Runtime error

App Files Files Community

ai-puppy commited on Jun 8

Commit

3f04a52

1 Parent(s): bb43287

save

Browse files

Files changed (2) hide show

README.md +1 -0
app.py +42 -64

README.md CHANGED Viewed

@@ -9,6 +9,7 @@ app_file: app.py
 pinned: false
 license: mit
 short_description: CodeAct Agent to process large data set
 ---
 An example chatbot using [Gradio](https://gradio.app), [`huggingface_hub`](https://huggingface.co/docs/huggingface_hub/v0.22.2/en/index), and the [Hugging Face Inference API](https://huggingface.co/docs/api-inference/index).

 pinned: false
 license: mit
 short_description: CodeAct Agent to process large data set
+tags: agent-demo-track
 ---
 An example chatbot using [Gradio](https://gradio.app), [`huggingface_hub`](https://huggingface.co/docs/huggingface_hub/v0.22.2/en/index), and the [Hugging Face Inference API](https://huggingface.co/docs/api-inference/index).

app.py CHANGED Viewed

@@ -62,8 +62,6 @@ codeact_model = init_chat_model("gpt-4.1-2025-04-14", model_provider="openai")
 # Store uploaded file path globally
 uploaded_file_path = None
-# Chat functionality removed - focusing on file analysis
 def handle_file_upload(file):
     """Handle file upload and store the path globally"""
     global uploaded_file_path
@@ -167,16 +165,30 @@ def run_file_analysis():
     return asyncio.run(analyze_uploaded_file())
 # Create the Gradio interface
-with gr.Blocks(title="DataForge - AI-Powered File Analysis") as demo:
-    gr.Markdown("# 🔍 DataForge - AI-Powered File Analysis")
     gr.Markdown("""
-    Upload any file and ask specific questions for targeted AI analysis. Our guided approach:
-    1. 📋 **Examines** your file structure and patterns automatically
-    2. 🎯 **Generates** specific code guidance based on your question
-    3. 🚀 **Executes** enhanced analysis with improved accuracy
-    **Simply upload a file and ask any question you want!**
     """)
     with gr.Row():
@@ -193,6 +205,7 @@ with gr.Blocks(title="DataForge - AI-Powered File Analysis") as demo:
                 interactive=False
             )
             # Question Section
             gr.Markdown("### ❓ Ask Your Question")
             user_question = gr.Textbox(
@@ -202,25 +215,25 @@ with gr.Blocks(title="DataForge - AI-Powered File Analysis") as demo:
                 value=""
             )
-            analyze_btn = gr.Button("🔍 Run Guided Analysis", variant="primary", size="lg")
-            # Analysis Info
-            gr.Markdown("### ℹ️ How It Works")
             gr.Markdown("""
-            **Guided Analysis Process:**
-            - 🎯 **Question-aware**: Code generation tailored to your specific question
-            - 📋 **Smart examination**: Automatically detects file structure and patterns
-            - 🚀 **Dynamic optimization**: Creates targeted analysis approach
-            - ✅ **Higher accuracy**: Prevents common code generation errors
-            - 🔧 **Quality control**: Built-in validation to avoid syntax issues
             """)
         with gr.Column(scale=2):
             analysis_output = gr.Textbox(
-                label="📊 Guided Analysis Results",
                 lines=25,
                 max_lines=35,
-                placeholder="Upload a file, type your question, and click 'Run Guided Analysis' to see detailed results here...",
                 interactive=False
             )
@@ -238,50 +251,15 @@ with gr.Blocks(title="DataForge - AI-Powered File Analysis") as demo:
     )
     gr.Markdown("---")
-    gr.Markdown("## 💡 Example Questions by File Type")
-    with gr.Accordion("🔐 Security Analysis Questions", open=False):
-        gr.Markdown("""
-        **For Log Files:**
-        - "Find any failed login attempts and suspicious IP addresses"
-        - "Identify potential security threats or anomalies"
-        - "Show me authentication errors and user access patterns"
-        - "Are there any brute force attacks or repeated failures?"
-        **For Access Logs:**
-        - "Detect unusual access patterns or potential intrusions"
-        - "Find requests with suspicious user agents or payloads"
-        - "Identify high-frequency requests from single IPs"
-        """)
-    with gr.Accordion("⚡ Performance Analysis Questions", open=False):
-        gr.Markdown("""
-        **For Application Logs:**
-        - "Which API endpoints are slowest and why?"
-        - "Find performance bottlenecks and response time issues"
-        - "Show me timeout errors and failed requests"
-        - "What are the peak usage times and load patterns?"
-        **For System Logs:**
-        - "Identify resource usage spikes and memory issues"
-        - "Find database query performance problems"
-        - "Show me error rates and system health indicators"
-        """)
-    with gr.Accordion("📈 Data Analysis Questions", open=False):
-        gr.Markdown("""
-        **For CSV/Data Files:**
-        - "Analyze data distribution and find statistical insights"
-        - "Identify outliers and anomalies in the dataset"
-        - "What correlations exist between different columns?"
-        - "Generate a comprehensive data quality report"
-        **For JSON Files:**
-        - "Parse the structure and extract key information"
-        - "Find patterns in nested data and relationships"
-        - "Summarize the main data points and values"
-        """)
 if __name__ == "__main__":
-    print("Starting DataForge application...")
     demo.launch()

 # Store uploaded file path globally
 uploaded_file_path = None
 def handle_file_upload(file):
     """Handle file upload and store the path globally"""
     global uploaded_file_path
     return asyncio.run(analyze_uploaded_file())
 # Create the Gradio interface
+with gr.Blocks(title="DataForge - AI CodeAct Agent") as demo:
+    gr.Markdown("# 🤖 DataForge - AI CodeAct Agent")
     gr.Markdown("""
+    ## 🔑 **AI Writes Code to Analyze Your Data Locally**
+    **Why DataForge handles massive files when other AI tools fail:**
+    ❌ **Other AI Tools**: Upload data to LLM → Hit limits → Fail on large files
+    ✅ **DataForge**: AI writes code → Code processes data locally → No limits!
+    ### 💪 **Key Benefits:**
+    - **♾️ No Size Limits** - Process GB+ files locally
+    - **🛡️ Complete Privacy** - Data never leaves your machine
+    - **⚡ Lightning Fast** - No uploads, pure local processing
+    - **🎯 Custom Analysis** - Code written for your specific question
+    """)
+    # Supported File Types - Simple Version
+    gr.Markdown("## 📋 **Supported Files**")
+    gr.Markdown("""
+    **📊 Data:** CSV, JSON, XML, TSV
+    **📝 Logs:** Application, access, error, audit logs
+    **🗂️ Text:** Any text file, code files, configs
+    **💾 Size:** No limits - handles multi-GB files locally
     """)
     with gr.Row():
                 interactive=False
             )
             # Question Section
             gr.Markdown("### ❓ Ask Your Question")
             user_question = gr.Textbox(
                 value=""
             )
+            analyze_btn = gr.Button("🤖 Activate CodeAct Agent", variant="primary", size="lg")
+            # How it works
+            gr.Markdown("### 🔬 **How It Works**")
             gr.Markdown("""
+            1. **🔍 AI samples** your file structure
+            2. **⚡ AI writes** custom analysis code
+            3. **🚀 Code processes** your entire file locally
+            4. **📊 Results** delivered to you
+            **Your data never leaves your machine!**
             """)
         with gr.Column(scale=2):
             analysis_output = gr.Textbox(
+                label="🤖 CodeAct Agent Analysis Results",
                 lines=25,
                 max_lines=35,
+                placeholder="Upload a file, ask your question, and click 'Activate CodeAct Agent' to watch the AI write and execute custom analysis code in real-time...",
                 interactive=False
             )
     )
     gr.Markdown("---")
+    gr.Markdown("## 💡 **Example Questions**")
+    gr.Markdown("""
+    **🔐 Security:** "Find failed login attempts and suspicious IPs"
+    **⚡ Performance:** "Identify slowest API endpoints and bottlenecks"
+    **📊 Data:** "Analyze statistical patterns and outliers"
+    **🔍 General:** "Summarize key insights and anomalies"
+    """)
 if __name__ == "__main__":
+    print("🤖 Starting DataForge CodeAct Agent Application...")
+    print("🚀 Initializing advanced AI-powered file analysis capabilities...")
     demo.launch()