Spaces:
Sleeping
Sleeping
Update README.md
Browse files
README.md
CHANGED
@@ -14,25 +14,24 @@ hf_oauth_expiration_minutes: 480
|
|
14 |
|
15 |
# 🧠 GAIA-Agent
|
16 |
|
17 |
-
A **LLM agent**
|
18 |
-
|
19 |
-
|
20 |
-
|
21 |
-
It uses real-time web search, Wikipedia lookups, file parsing, image analysis, audio transcription to provide accurate answers.
|
22 |
|
23 |
## 🛠️ Technical Details
|
24 |
|
25 |
-
| Component
|
26 |
-
|
27 |
-
| Agent
|
28 |
-
| LLM | [Meta LLaMA 4 Maverick 17B
|
29 |
-
| Web
|
30 |
-
| HTML
|
31 |
-
| Wikipedia | [Wikimedia API](https://api.wikimedia.org/wiki/Core_REST_API)
|
32 |
-
| File
|
33 |
-
| Audio
|
34 |
-
|
|
35 |
-
| UI
|
36 |
|
37 |
|
38 |
## 🔐 Setup Instructions
|
@@ -58,7 +57,7 @@ It uses real-time web search, Wikipedia lookups, file parsing, image analysis, a
|
|
58 |
playwright install
|
59 |
sudo apt-get install libgtk-3-0
|
60 |
```
|
61 |
-
5.
|
62 |
```bash
|
63 |
python app.py
|
64 |
```
|
|
|
14 |
|
15 |
# 🧠 GAIA-Agent
|
16 |
|
17 |
+
A general-purpose **LLM agent** built to solve the final assignment of the [Hugging Face Agents course](https://huggingface.co/agents-course), which consists of 20 Level 1 questions from the [GAIA benchmark](https://huggingface.co/datasets/andrewrreed/GAIA).
|
18 |
+
|
19 |
+
The agent uses a variety of tools—web search, Wikipedia extraction, file parsing, audio transcription, and more—to gather evidence and reason through answers. It is built with [LangGraph](https://www.langchain.com/langgraph) and [LangChain](https://www.langchain.com/), and all interactions are tracked with [LangFuse](https://www.langfuse.com/).
|
20 |
+
|
|
|
21 |
|
22 |
## 🛠️ Technical Details
|
23 |
|
24 |
+
| Component | Technology |
|
25 |
+
|-------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|
26 |
+
| **Agent Framework** | [LangGraph](https://www.langchain.com/langgraph) + [LangChain](https://www.langchain.com/) |
|
27 |
+
| **LLM** | [Meta LLaMA 4 Maverick 17B 128E Instruct](https://huggingface.co/meta-llama/Llama-4-Maverick-17B-128E-Instruct) via [Groq](https://groq.com/) |
|
28 |
+
| **Web Search** | [SerperAPI](https://serper.dev/) (Google Search), [requests](https://docs.python-requests.org/), [Playwright](https://playwright.dev/python/) for dynamic content rendering |
|
29 |
+
| **HTML Parsing** | [BeautifulSoup](https://www.crummy.com/software/BeautifulSoup/), [markdownify](https://github.com/matthewwithanm/python-markdownify) to convert HTML to Markdown |
|
30 |
+
| **Wikipedia** | [Wikimedia API](https://api.wikimedia.org/wiki/Core_REST_API) |
|
31 |
+
| **File Parsing** | [Unstructured](https://unstructured.io/) (PDF, DOCX, PPTX), [pandas](https://pandas.pydata.org/) (CSV, TSV, XLSX) |
|
32 |
+
| **Audio Transcription** | [OpenAI Whisper (base)](https://github.com/openai/whisper) |
|
33 |
+
| **Monitoring** | [LangFuse](https://www.langfuse.com/) |
|
34 |
+
| **Frontend UI** | [Gradio](https://www.gradio.app/), based on the [Final Assignment Template](https://huggingface.co/spaces/agents-course/First_agent_template) |
|
35 |
|
36 |
|
37 |
## 🔐 Setup Instructions
|
|
|
57 |
playwright install
|
58 |
sudo apt-get install libgtk-3-0
|
59 |
```
|
60 |
+
5. Run the app with:
|
61 |
```bash
|
62 |
python app.py
|
63 |
```
|