Spaces:
Sleeping
Sleeping
File size: 1,769 Bytes
2705160 62ad9da 2705160 d123508 2b4218b 2705160 2b4218b 62ad9da |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 |
---
title: Template Final Assignment
emoji: 🕵🏻♂️
colorFrom: indigo
colorTo: indigo
sdk: gradio
sdk_version: 5.25.2
app_file: app.py
pinned: false
hf_oauth: true
hf_oauth_expiration_minutes: 480
short_description: Gemini Agent for GAIA Evaluation
---
# 🧠 Gemini Agent for GAIA Evaluation
This project contains a Gemini-powered CodeAgent built with [smolagents](https://github.com/smol-ai/smol-agents) for use
in the **GAIA Unit 4 Evaluation** on [Hugging Face Spaces](https://hf.co/learn/agents-course/unit0/introduction).
## 🚀 Features
* Uses the **Gemini 2.0 Flash** model via `LiteLLMModel`
* Equipped with essential tools:
* `DuckDuckGoSearchTool` for quick lookups
* `RunPythonFileTool` for executing `.py` scripts
* `ReverseTextTool` for decoding reversed questions
* `download_server` for fetching files from URLs
* Base tools (math, string manipulation, etc.)
## 📋 Evaluation Strategy
The agent reads questions from the GAIA evaluation endpoint, applies reasoning using a system prompt with strict
guidelines, and submits answers back for scoring.
## 🛠️ Setup
1. Clone this repository or Space
2. Set your environment variables:
```
GEMINI_API_KEY=your_api_key_here
SPACE_ID=your_hf_space_id
```
3. Install dependencies:
```bash
pip install -r requirements.txt
```
4. Run locally:
```bash
python app.py
```
Or launch directly via [Hugging Face Spaces](https://huggingface.co/spaces/).
## 🧪 Evaluation Flow
1. Log in to Hugging Face through the UI
2. Click “Run Evaluation & Submit All Answers”
3. The agent will fetch tasks, solve them, and submit results
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference |