File size: 1,204 Bytes
241a290
d4a7e07
 
 
 
241a290
 
 
 
 
 
 
d4a7e07
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
---
title: πŸ“š RAG-based PDF Query Application
emoji: πŸ“š
colorFrom: purple
colorTo: blue
sdk: streamlit
sdk_version: 1.42.0
app_file: app.py
pinned: false
---

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

# πŸ“š RAG-based PDF Query Application

This application leverages Retrieval-Augmented Generation (RAG) to enable users to ask questions about a PDF document. The PDF is processed and split into chunks, which are then indexed and used to find the most relevant information when the user submits a query.

## Features

- Upload a PDF and ask questions based on its contents.
- PDF is processed using **PyMuPDF** and text is split into chunks.
- **FAISS** is used to create an index for quick retrieval of relevant text chunks.
- The application uses the **Groq API** for generating answers based on the retrieved context.

## How it works

1. Upload a PDF document.
2. The application extracts text from the PDF and splits it into smaller chunks.
3. A **FAISS index** is built using embeddings of these chunks.
4. When you submit a query, the system retrieves the most relevant chunks and sends them to the **Groq API** to generate an answer.