phucdev commited on
Commit
906af2b
·
1 Parent(s): 5bd1c41

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +50 -1
README.md CHANGED
@@ -12,4 +12,53 @@ hf_oauth: true
12
  hf_oauth_expiration_minutes: 480
13
  ---
14
 
15
- Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
12
  hf_oauth_expiration_minutes: 480
13
  ---
14
 
15
+ # 🧠 GAIA-Agent
16
+
17
+ A **LLM agent** designed to solve the final assignment of the [HuggingFace Agents course](https://huggingface.co/agents-course).
18
+ The assignment consists of 20 level 1 questions of the [GAIA benchmark](https://huggingface.co/datasets/andrewrreed/GAIA).
19
+ This agent is built with [LangGraph](https://www.langchain.com/langgraph), [LangChain](https://www.langchain.com/),
20
+ and is tracked using [LangFuse](https://www.langfuse.com/).
21
+ It uses real-time web search, Wikipedia lookups, file parsing, image analysis, audio transcription to provide accurate answers.
22
+
23
+ ## 🛠️ Technical Details
24
+
25
+ | Component/ Tool | Tech Used |
26
+ |---------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------|
27
+ | Agent framework | [LangGraph](https://www.langchain.com/langgraph) + [LangChain](https://www.langchain.com/) |
28
+ | LLM | [Meta LLaMA 4 Maverick 17B 128e](https://huggingface.co/meta-llama/Llama-4-Maverick-17B-128E-Instruct) via [Groq](https://groq.com/) |
29
+ | Web search | [SerperAPI](https://serper.dev/), [requests](https://requests.readthedocs.io/en/latest/), [Playwright](https://playwright.dev/python/) for rendering JSON |
30
+ | HTML parsing | [BeautifulSoup](https://www.crummy.com/software/BeautifulSoup/), [markdownify](https://github.com/matthewwithanm/python-markdownify) for converting HTML to Markdown |
31
+ | Wikipedia | [Wikimedia API](https://api.wikimedia.org/wiki/Core_REST_API) |
32
+ | File parsing | [Unstructured](https://unstructured.io/) (for PDFs, PowerPoint, Word), [pandas](https://pandas.pydata.org/) (for CSV, TSV, Excel) |
33
+ | Audio transcription | [OpenAI Whisper base](https://github.com/openai/whisper) |
34
+ | Agent monitoring | [LangFuse](https://www.langfuse.com/) |
35
+ | UI / API | [Gradio](https://www.gradio.app/) based on the [Final Assignment Template](https://huggingface.co/spaces/agents-course/First_agent_template) |
36
+
37
+
38
+ ## 🔐 Setup Instructions
39
+
40
+ 1. **Clone the repository**:
41
+ ```bash
42
+ git clone https://github.com/phucdev/GAIA-agent.git
43
+ cd GAIA-agent
44
+ ```
45
+ 2. **Install dependencies**:
46
+ ```bash
47
+ pip install -r requirements.txt
48
+ ```
49
+ 3. **Create API keys and set-up environment variables**:
50
+ Use `env.example` to create a `.env` file in the root directory and replace the placeholders with your actual API keys:
51
+ ```plaintext
52
+ GROQ_API_KEY=your_groq_api_key
53
+ SERPER_API_KEY=your_serper_api_key
54
+ ... (other API keys as needed)
55
+ ```
56
+ 4. Optional: For rendering JS with Playwright, you may need some additional setup:
57
+ ```bash
58
+ playwright install
59
+ sudo apt-get install libgtk-3-0
60
+ ```
61
+ 5. Then you can run the app with:
62
+ ```bash
63
+ python app.py
64
+ ```