Spaces: Running

Abdullah Meda committed · Commit c6fe03c · Parent(s): 0955e72

refactoring edits

Files changed:
- README.md +35 -111
- Relevant README 1.md +0 -311
- Relevant README 2.md +0 -618
- app.py +20 -14
- schemas.py +1 -1
- make_docs.py → scripts/make_docs.py +0 -0
- make_rag_db.py → scripts/make_rag_db.py +0 -0
- utils.py +6 -6
README.md
CHANGED
@@ -14,39 +14,19 @@ license: apache-2.0
 short_description: Latest 🤗 documentation for LLMs and AI code editors
 ---
 
-#
-<em>Real-time HuggingFace Documentation for AI Coding Assistants and LLMs</em>
-</p>
-
----
-
-**HfContext7** is a specialized Model Context Protocol (MCP) server designed to provide AI coding assistants and Large Language Models (LLMs) with **real-time, up-to-date documentation** from the HuggingFace ecosystem.
-
-Inspired by the groundbreaking [Context7 MCP Server](https://github.com/upstash/context7), HfContext7 specifically targets the rapidly evolving HuggingFace libraries, ensuring your AI assistant always has the latest and most accurate information.
-
----
-
-## ❌ The Problem We Solve
-
-The HuggingFace ecosystem evolves at lightning speed. New APIs, features, and best practices emerge constantly, making it challenging for LLMs trained on static datasets to keep up. This leads to:
-
-- ❌ Outdated code examples based on old training data
-- ❌ Hallucinated APIs that no longer exist or never existed
-- ❌ Generic answers that don't reflect current HuggingFace best practices
-- ❌ Confusion between similar HuggingFace libraries (Transformers, Diffusers, PEFT, etc.)
-
----
-
-- **Semantic Search**: Leveraging advanced embeddings and vector search (powered by Milvus and OpenAI embeddings) to retrieve highly relevant documentation snippets.
-- **Seamless Integration**: Easily integrates with popular AI coding assistants (Cursor, Claude Desktop, Windsurf, etc.) via MCP.
 
 Simply add `use hfcontext7` to your prompt:
 
@@ -62,108 +42,52 @@ HfContext7 instantly provides your AI assistant with accurate, up-to-date Huggin
 
 ---
 
-
-HfContext7 supports a wide range of HuggingFace libraries, including:
-
-- **Diffusers** – Diffusion models for image/audio generation
-- **PEFT** – Parameter-Efficient Fine-Tuning (LoRA, etc.)
-- **TRL** – Transformer Reinforcement Learning
-- **Datasets** – Access and share datasets
-- **Accelerate** – Simplified distributed training
-- **Text Generation Inference (TGI)** – High-performance inference
-- **Optimum** – Hardware-optimized transformers
-- **AutoTrain** – No-code training platform
-- **bitsandbytes** – 8-bit optimizers and quantization
-
-- **`get_huggingface_documentation`**: Retrieves relevant documentation for a specific topic, optionally filtered by resource names.
 
 ---
 
-### 1. Clone and Install
-
-git clone <repo-url>
-cd hfcontext7
-pip install -r requirements.txt
-```
-
-### 2. Set OpenAI API Key
-
-```bash
-echo "OPENAI_API_KEY=your_key_here" > .env
-```
-
-```
-
-### 4. Run the Server
-
-```bash
-python app.py
-```
 
 ---
 
-### Cursor & Claude Desktop Example
-
-```json
-{
-  "mcpServers": {
-    "hfcontext7": {
-      "command": "python",
-      "args": ["/path/to/hfcontext7/app.py"],
-      "env": {
-        "OPENAI_API_KEY": "your_openai_api_key"
-      }
-    }
-  }
-}
-```
-
----
 
 ---
 
-This project was heavily inspired by the incredible [Context7 MCP Server](https://github.com/upstash/context7) by Upstash, which revolutionized how LLMs access general development documentation. While Context7 provides broad coverage across many frameworks, HfContext7 focuses specifically on the HuggingFace ecosystem, providing deeper, more specialized knowledge for AI/ML development.
-
----
-
-Apache 2.0
-
----
-
-</p>
 short_description: Latest 🤗 documentation for LLMs and AI code editors
 ---
 
+# HFContext7: Up-to-date 🤗 Docs for your LLM
+
+### The Problem: Your LLM is stuck in the past
+
+You ask your AI assistant for a code snippet using the latest `diffusers` feature, and it confidently spits out code that was deprecated six months ago. You're trying to debug a `transformers` pipeline, and the LLM hallucinates parameters that don't exist. Sound familiar?
+
+Large Language Models are powerful, but their knowledge is frozen in time. The Hugging Face ecosystem, however, moves at lightning speed. This knowledge gap leads to wasted time, frustrating debugging sessions, and a reliance on constant tab-switching to find the right documentation page.
+
+### The Solution: Fresh Docs, Right in Your Prompt
+
+**HFContext7** is a Model Context Protocol (MCP) server that acts as a bridge between your AI assistant and the ever-evolving Hugging Face documentation. It provides your LLM with the ability to fetch the **single most relevant** documentation page for your query, ensuring the context it uses is fresh, accurate, and directly from the source.
+
+**Inspired by the (unfortunately closed-source) `Context7` project**, we wanted to build an open-source alternative focused specifically on the rich, complex, and rapidly changing Hugging Face ecosystem.
 
 Simply add `use hfcontext7` to your prompt:
 
 
 ---
 
+### Under the Hood: A Smarter RAG Pipeline
+
+Traditional RAG (Retrieval-Augmented Generation) on large documentation sets can be slow, expensive, and imprecise. Embedding entire libraries' worth of content leads to massive vector databases and often returns noisy, irrelevant chunks.
+
+We took a different, more surgical approach:
+
+1. **Structural Pre-processing:** We first clone the official documentation for major Hugging Face libraries. We parse their structure (`_toctree.yml`) and organize the content into a clean, hierarchical file tree. This preserves the logical layout created by the library authors.
+
+2. **Indexing Paths, Not Pages:** Instead of embedding the full text of each page (which can be huge), we only embed the **file paths**. A path like `Transformers/Main Classes/Trainer.md` contains a wealth of semantic information about the content. This keeps our vector index small, fast, and surprisingly effective.
+
+3. **Two-Step Retrieval Magic:** This is where the magic happens.
+   * **Step 1: Candidate Search:** When you ask a question, we embed your query and perform a semantic search against our index of *file paths*. This instantly gives us the top 50 most likely documentation pages.
+   * **Step 2: LLM-Powered Selection:** We don't just dump all 50 files into the context. Instead, we generate a `tree`-like view of their file structure and present it to a powerful LLM (GPT-4o) along with your original question. The LLM's only job is to analyze this structure and choose the **one file** that is most likely to contain the answer.
+
+This approach is fast, cheap, and highly precise. It leverages the inherent structure of good documentation and uses a powerful reasoning engine for the final selection, ensuring you get the whole, relevant page, not just a random chunk.
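The sketch below illustrates this two-step flow in plain Python. It is a hypothetical outline, not the Space's actual code: the collection name `hf_doc_paths`, the field names, and the prompt wording are assumptions, while `MilvusClient`, the OpenAI embedding function, and `responses.parse` mirror what `app.py` in this commit already uses.

```python
# Hypothetical sketch of the two-step retrieval; schema and prompt are illustrative.
import os

from openai import OpenAI
from pydantic import BaseModel
from pymilvus import MilvusClient, model


class Choice(BaseModel):
    file_id: str  # numeric id of the chosen documentation file, e.g. "1.3.2"


milvus = MilvusClient("milvus.db")  # Milvus Lite, file-backed local database
embed = model.dense.OpenAIEmbeddingFunction(api_key=os.getenv("OPENAI_API_KEY"))
llm = OpenAI()


def retrieve_file_id(question: str) -> str:
    # Step 1: semantic search over embedded *file paths*, not page contents
    hits = milvus.search(
        collection_name="hf_doc_paths",          # assumed collection name
        data=embed.encode_queries([question]),
        limit=50,
        output_fields=["path"],
    )[0]
    candidate_paths = [hit["entity"]["path"] for hit in hits]

    # Step 2: show the candidates as a tree-like listing and let the LLM pick one file
    tree = "\n".join(candidate_paths)
    response = llm.responses.parse(
        model="gpt-4o",
        input=f"Question: {question}\n<tree>\n{tree}\n</tree>\n"
              "Return the id of the single most relevant file.",
        text_format=Choice,
    )
    return response.output_parsed.file_id
```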
---
 
+### Challenges along the way
+
+Building HFContext7 wasn't straightforward. We faced a few key challenges:
+
+* **The "Needle in a Haystack" Problem:** The HF ecosystem is massive. A simple keyword or vector search often returns dozens of tangentially related results. Our two-step retrieval pipeline was born from the need to drastically improve precision and find that one perfect document.
+* **Scalability & Cost:** The idea of embedding the entirety of the HF docs was daunting. It would be slow to process and expensive to host. The path-embedding strategy was our answer, giving us a system that is both performant and cost-effective.
+* **Taming Diverse Structures:** Not all documentation is created equal. We had to write a robust parser to handle the different ways various HF projects structure their `_toctree.yml` files, creating a unified and navigable database. Some libraries, like Hugging Face.js and Sentence Transformers, use completely different documentation structures that don't follow the standard `_toctree.yml` format.
+* **Content Overflow Issues:** Raw markdown files often contain excessive comments, metadata, and navigation links that bloat the LLM's context window without adding value. Cleaning this content while preserving the essential information proved to be a delicate balance.
+* **Infrastructure Limitations:** We initially planned to transition to Hugging Face Inference Providers for a more integrated experience, but couldn't access the $25 HF credits during development due to credit card requirements, forcing us to stick with OpenAI's APIs for now.
 
 ---
 
+### Roadmap for the Future
+
+HFContext7 is just getting started. Here's where we're headed:
+
+- 🗺️ **Expanded Coverage:** Integrating more libraries from the Hugging Face ecosystem, including support for frameworks with non-standard documentation structures like Hugging Face.js and Sentence Transformers.
+- 🎯 **Enhanced Precision:** Moving beyond single-file retrieval to identify and return the most relevant *sections* within a document.
+- 🧑‍💻 **Enhanced Agentic Retrieval:** Building more sophisticated retrieval mechanisms that can provide broader documentation context while maintaining high accuracy, allowing for multi-document synthesis and cross-referencing.
+- 🧹 **Content Optimization:** Implementing smart content cleaning to remove unnecessary markdown comments, metadata, and navigation elements that waste context window space, without losing critical information.
+- 🤗 **HF Native Integration:** Transitioning to Hugging Face Inference Providers for embeddings and LLM calls, creating a fully integrated experience within the HF ecosystem.
+- 🧩 **Enhanced Chunking Strategy:** Implementing a Context7-inspired chunking approach that focuses on examples and creates distinct, semantically meaningful sections for each chunk, improving retrieval precision.
 
---
 
+### Available Tools
+
+This server exposes the following tools to an MCP client:
+
+* `list_huggingface_resources_names()`: Returns a list of all the HF libraries and resources available for querying.
+* `get_huggingface_documentation(topic: str, resource_names: list[str] = [])`: The main workhorse. Takes a topic (your question) and an optional list of resource names to search within, and returns the content of the most relevant documentation page.
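To make the tool surface concrete, here is a hypothetical stdio client session using the `mcp` Python SDK. The command, file path, resource name, and environment variable are assumptions about running this Space locally, not documented usage.

```python
# Hypothetical MCP client session; command, paths, and arguments are illustrative.
import asyncio

from mcp import ClientSession, StdioServerParameters
from mcp.client.stdio import stdio_client


async def main() -> None:
    server = StdioServerParameters(
        command="python",
        args=["app.py"],  # assumed local entry point of this Space
        env={"OPENAI_API_KEY": "your_openai_api_key"},
    )
    async with stdio_client(server) as (read, write):
        async with ClientSession(read, write) as session:
            await session.initialize()
            # Discover which HF libraries can be queried, then fetch one doc page
            resources = await session.call_tool("list_huggingface_resources_names", {})
            docs = await session.call_tool(
                "get_huggingface_documentation",
                {"topic": "How do I attach a LoRA adapter?", "resource_names": ["PEFT"]},
            )
            print(resources, docs)


asyncio.run(main())
```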
Relevant README 1.md
DELETED
@@ -1,311 +0,0 @@
<h1 align="center">Crawl4AI RAG MCP Server</h1>

<p align="center">
  <em>Web Crawling and RAG Capabilities for AI Agents and AI Coding Assistants</em>
</p>

A powerful implementation of the [Model Context Protocol (MCP)](https://modelcontextprotocol.io) integrated with [Crawl4AI](https://crawl4ai.com) and [Supabase](https://supabase.com/) for providing AI agents and AI coding assistants with advanced web crawling and RAG capabilities.

With this MCP server, you can <b>scrape anything</b> and then <b>use that knowledge anywhere</b> for RAG.

The primary goal is to bring this MCP server into [Archon](https://github.com/coleam00/Archon) as I evolve it to be more of a knowledge engine for AI coding assistants to build AI agents. This first version of the Crawl4AI/RAG MCP server will be improved upon greatly soon, especially making it more configurable so you can use different embedding models and run everything locally with Ollama.

## Overview

This MCP server provides tools that enable AI agents to crawl websites, store content in a vector database (Supabase), and perform RAG over the crawled content. It follows the best practices for building MCP servers based on the [Mem0 MCP server template](https://github.com/coleam00/mcp-mem0/) I provided on my channel previously.

The server includes several advanced RAG strategies that can be enabled to enhance retrieval quality:
- **Contextual Embeddings** for enriched semantic understanding
- **Hybrid Search** combining vector and keyword search
- **Agentic RAG** for specialized code example extraction
- **Reranking** for improved result relevance using cross-encoder models

See the [Configuration section](#configuration) below for details on how to enable and configure these strategies.

## Vision

The Crawl4AI RAG MCP server is just the beginning. Here's where we're headed:

1. **Integration with Archon**: Building this system directly into [Archon](https://github.com/coleam00/Archon) to create a comprehensive knowledge engine for AI coding assistants to build better AI agents.

2. **Multiple Embedding Models**: Expanding beyond OpenAI to support a variety of embedding models, including the ability to run everything locally with Ollama for complete control and privacy.

3. **Advanced RAG Strategies**: Implementing sophisticated retrieval techniques like contextual retrieval, late chunking, and others to move beyond basic "naive lookups" and significantly enhance the power and precision of the RAG system, especially as it integrates with Archon.

4. **Enhanced Chunking Strategy**: Implementing a Context 7-inspired chunking approach that focuses on examples and creates distinct, semantically meaningful sections for each chunk, improving retrieval precision.

5. **Performance Optimization**: Increasing crawling and indexing speed to make it more realistic to "quickly" index new documentation to then leverage it within the same prompt in an AI coding assistant.

## Features

- **Smart URL Detection**: Automatically detects and handles different URL types (regular webpages, sitemaps, text files)
- **Recursive Crawling**: Follows internal links to discover content
- **Parallel Processing**: Efficiently crawls multiple pages simultaneously
- **Content Chunking**: Intelligently splits content by headers and size for better processing
- **Vector Search**: Performs RAG over crawled content, optionally filtering by data source for precision
- **Source Retrieval**: Retrieve sources available for filtering to guide the RAG process

## Tools

The server provides essential web crawling and search tools:

### Core Tools (Always Available)

1. **`crawl_single_page`**: Quickly crawl a single web page and store its content in the vector database
2. **`smart_crawl_url`**: Intelligently crawl a full website based on the type of URL provided (sitemap, llms-full.txt, or a regular webpage that needs to be crawled recursively)
3. **`get_available_sources`**: Get a list of all available sources (domains) in the database
4. **`perform_rag_query`**: Search for relevant content using semantic search with optional source filtering

### Conditional Tools

5. **`search_code_examples`** (requires `USE_AGENTIC_RAG=true`): Search specifically for code examples and their summaries from crawled documentation. This tool provides targeted code snippet retrieval for AI coding assistants.

## Prerequisites

- [Docker/Docker Desktop](https://www.docker.com/products/docker-desktop/) if running the MCP server as a container (recommended)
- [Python 3.12+](https://www.python.org/downloads/) if running the MCP server directly through uv
- [Supabase](https://supabase.com/) (database for RAG)
- [OpenAI API key](https://platform.openai.com/api-keys) (for generating embeddings)

## Installation

### Using Docker (Recommended)

1. Clone this repository:
```bash
git clone https://github.com/coleam00/mcp-crawl4ai-rag.git
cd mcp-crawl4ai-rag
```

2. Build the Docker image:
```bash
docker build -t mcp/crawl4ai-rag --build-arg PORT=8051 .
```

3. Create a `.env` file based on the configuration section below

### Using uv directly (no Docker)

1. Clone this repository:
```bash
git clone https://github.com/coleam00/mcp-crawl4ai-rag.git
cd mcp-crawl4ai-rag
```

2. Install uv if you don't have it:
```bash
pip install uv
```

3. Create and activate a virtual environment:
```bash
uv venv
.venv\Scripts\activate
# on Mac/Linux: source .venv/bin/activate
```

4. Install dependencies:
```bash
uv pip install -e .
crawl4ai-setup
```

5. Create a `.env` file based on the configuration section below

## Database Setup

Before running the server, you need to set up the database with the pgvector extension:

1. Go to the SQL Editor in your Supabase dashboard (create a new project first if necessary)

2. Create a new query and paste the contents of `crawled_pages.sql`

3. Run the query to create the necessary tables and functions

## Configuration

Create a `.env` file in the project root with the following variables:

```
# MCP Server Configuration
HOST=0.0.0.0
PORT=8051
TRANSPORT=sse

# OpenAI API Configuration
OPENAI_API_KEY=your_openai_api_key

# LLM for summaries and contextual embeddings
MODEL_CHOICE=gpt-4.1-nano

# RAG Strategies (set to "true" or "false", default to "false")
USE_CONTEXTUAL_EMBEDDINGS=false
USE_HYBRID_SEARCH=false
USE_AGENTIC_RAG=false
USE_RERANKING=false

# Supabase Configuration
SUPABASE_URL=your_supabase_project_url
SUPABASE_SERVICE_KEY=your_supabase_service_key
```

### RAG Strategy Options

The Crawl4AI RAG MCP server supports four powerful RAG strategies that can be enabled independently:

#### 1. **USE_CONTEXTUAL_EMBEDDINGS**
When enabled, this strategy enhances each chunk's embedding with additional context from the entire document. The system passes both the full document and the specific chunk to an LLM (configured via `MODEL_CHOICE`) to generate enriched context that gets embedded alongside the chunk content.

- **When to use**: Enable this when you need high-precision retrieval where context matters, such as technical documentation where terms might have different meanings in different sections.
- **Trade-offs**: Slower indexing due to LLM calls for each chunk, but significantly better retrieval accuracy.
- **Cost**: Additional LLM API calls during indexing.

#### 2. **USE_HYBRID_SEARCH**
Combines traditional keyword search with semantic vector search to provide more comprehensive results. The system performs both searches in parallel and intelligently merges results, prioritizing documents that appear in both result sets.

- **When to use**: Enable this when users might search using specific technical terms, function names, or when exact keyword matches are important alongside semantic understanding.
- **Trade-offs**: Slightly slower search queries but more robust results, especially for technical content.
- **Cost**: No additional API costs, just computational overhead.

#### 3. **USE_AGENTIC_RAG**
Enables specialized code example extraction and storage. When crawling documentation, the system identifies code blocks (≥300 characters), extracts them with surrounding context, generates summaries, and stores them in a separate vector database table specifically designed for code search.

- **When to use**: Essential for AI coding assistants that need to find specific code examples, implementation patterns, or usage examples from documentation.
- **Trade-offs**: Significantly slower crawling due to code extraction and summarization, requires more storage space.
- **Cost**: Additional LLM API calls for summarizing each code example.
- **Benefits**: Provides a dedicated `search_code_examples` tool that AI agents can use to find specific code implementations.

#### 4. **USE_RERANKING**
Applies cross-encoder reranking to search results after initial retrieval. Uses a lightweight cross-encoder model (`cross-encoder/ms-marco-MiniLM-L-6-v2`) to score each result against the original query, then reorders results by relevance.

- **When to use**: Enable this when search precision is critical and you need the most relevant results at the top. Particularly useful for complex queries where semantic similarity alone might not capture query intent.
- **Trade-offs**: Adds ~100-200ms to search queries depending on result count, but significantly improves result ordering.
- **Cost**: No additional API costs - uses a local model that runs on CPU.
- **Benefits**: Better result relevance, especially for complex queries. Works with both regular RAG search and code example search.

### Recommended Configurations

**For general documentation RAG:**
```
USE_CONTEXTUAL_EMBEDDINGS=false
USE_HYBRID_SEARCH=true
USE_AGENTIC_RAG=false
USE_RERANKING=true
```

**For AI coding assistant with code examples:**
```
USE_CONTEXTUAL_EMBEDDINGS=true
USE_HYBRID_SEARCH=true
USE_AGENTIC_RAG=true
USE_RERANKING=true
```

**For fast, basic RAG:**
```
USE_CONTEXTUAL_EMBEDDINGS=false
USE_HYBRID_SEARCH=true
USE_AGENTIC_RAG=false
USE_RERANKING=false
```

## Running the Server

### Using Docker

```bash
docker run --env-file .env -p 8051:8051 mcp/crawl4ai-rag
```

### Using Python

```bash
uv run src/crawl4ai_mcp.py
```

The server will start and listen on the configured host and port.

## Integration with MCP Clients

### SSE Configuration

Once you have the server running with SSE transport, you can connect to it using this configuration:

```json
{
  "mcpServers": {
    "crawl4ai-rag": {
      "transport": "sse",
      "url": "http://localhost:8051/sse"
    }
  }
}
```

> **Note for Windsurf users**: Use `serverUrl` instead of `url` in your configuration:
> ```json
> {
>   "mcpServers": {
>     "crawl4ai-rag": {
>       "transport": "sse",
>       "serverUrl": "http://localhost:8051/sse"
>     }
>   }
> }
> ```
>
> **Note for Docker users**: Use `host.docker.internal` instead of `localhost` if your client is running in a different container. This will apply if you are using this MCP server within n8n!

### Stdio Configuration

Add this server to your MCP configuration for Claude Desktop, Windsurf, or any other MCP client:

```json
{
  "mcpServers": {
    "crawl4ai-rag": {
      "command": "python",
      "args": ["path/to/crawl4ai-mcp/src/crawl4ai_mcp.py"],
      "env": {
        "TRANSPORT": "stdio",
        "OPENAI_API_KEY": "your_openai_api_key",
        "SUPABASE_URL": "your_supabase_url",
        "SUPABASE_SERVICE_KEY": "your_supabase_service_key"
      }
    }
  }
}
```

### Docker with Stdio Configuration

```json
{
  "mcpServers": {
    "crawl4ai-rag": {
      "command": "docker",
      "args": ["run", "--rm", "-i",
               "-e", "TRANSPORT",
               "-e", "OPENAI_API_KEY",
               "-e", "SUPABASE_URL",
               "-e", "SUPABASE_SERVICE_KEY",
               "mcp/crawl4ai"],
      "env": {
        "TRANSPORT": "stdio",
        "OPENAI_API_KEY": "your_openai_api_key",
        "SUPABASE_URL": "your_supabase_url",
        "SUPABASE_SERVICE_KEY": "your_supabase_service_key"
      }
    }
  }
}
```

## Building Your Own Server

This implementation provides a foundation for building more complex MCP servers with web crawling capabilities. To build your own:

1. Add your own tools by creating methods with the `@mcp.tool()` decorator
2. Create your own lifespan function to add your own dependencies
3. Modify the `utils.py` file for any helper functions you need
4. Extend the crawling capabilities by adding more specialized crawlers
Relevant README 2.md
DELETED
@@ -1,618 +0,0 @@
# Context7 MCP - Up-to-date Code Docs For Any Prompt

[](https://context7.com) [](https://smithery.ai/server/@upstash/context7-mcp) [<img alt="Install in VS Code (npx)" src="https://img.shields.io/badge/VS_Code-VS_Code?style=flat-square&label=Install%20Context7%20MCP&color=0098FF">](https://insiders.vscode.dev/redirect?url=vscode%3Amcp%2Finstall%3F%7B%22name%22%3A%22context7%22%2C%22command%22%3A%22npx%22%2C%22args%22%3A%5B%22-y%22%2C%22%40upstash%2Fcontext7-mcp%40latest%22%5D%7D)

[](./docs/README.zh-TW.md) [](./docs/README.zh-CN.md) [](./docs/README.ko.md) [](./docs/README.es.md) [](./docs/README.fr.md) [](./docs/README.pt-BR.md) [](./docs/README.it.md) [](./docs/README.id-ID.md) [](./docs/README.de.md) [](./docs/README.ru.md) [](./docs/README.tr.md) [](./docs/README.ar.md)

## ❌ Without Context7

LLMs rely on outdated or generic information about the libraries you use. You get:

- ❌ Code examples are outdated and based on year-old training data
- ❌ Hallucinated APIs don't even exist
- ❌ Generic answers for old package versions

## ✅ With Context7

Context7 MCP pulls up-to-date, version-specific documentation and code examples straight from the source – and places them directly into your prompt.

Add `use context7` to your prompt in Cursor:

```txt
Create a basic Next.js project with app router. use context7
```

```txt
Create a script to delete the rows where the city is "" given PostgreSQL credentials. use context7
```

Context7 fetches up-to-date code examples and documentation right into your LLM's context.

- 1️⃣ Write your prompt naturally
- 2️⃣ Tell the LLM to `use context7`
- 3️⃣ Get working code answers

No tab-switching, no hallucinated APIs that don't exist, no outdated code generations.

## 📚 Adding Projects

Check out our [project addition guide](./docs/adding-projects.md) to learn how to add (or update) your favorite libraries to Context7.

## 🛠️ Installation

### Requirements

- Node.js >= v18.0.0
- Cursor, Windsurf, Claude Desktop or another MCP Client

<details>
<summary><b>Installing via Smithery</b></summary>

To install Context7 MCP Server for any client automatically via [Smithery](https://smithery.ai/server/@upstash/context7-mcp):

```bash
npx -y @smithery/cli@latest install @upstash/context7-mcp --client <CLIENT_NAME> --key <YOUR_SMITHERY_KEY>
```

You can find your Smithery key in the [Smithery.ai webpage](https://smithery.ai/server/@upstash/context7-mcp).

</details>

<details>
<summary><b>Install in Cursor</b></summary>

Go to: `Settings` -> `Cursor Settings` -> `MCP` -> `Add new global MCP server`

Pasting the following configuration into your Cursor `~/.cursor/mcp.json` file is the recommended approach. You may also install in a specific project by creating `.cursor/mcp.json` in your project folder. See [Cursor MCP docs](https://docs.cursor.com/context/model-context-protocol) for more info.

> Since Cursor 1.0, you can click the install button below for instant one-click installation.

#### Cursor Remote Server Connection

[](https://cursor.com/install-mcp?name=context7&config=eyJ1cmwiOiJodHRwczovL21jcC5jb250ZXh0Ny5jb20vbWNwIn0%3D)

```json
{
  "mcpServers": {
    "context7": {
      "url": "https://mcp.context7.com/mcp"
    }
  }
}
```

#### Cursor Local Server Connection

[](https://cursor.com/install-mcp?name=context7&config=eyJjb21tYW5kIjoibnB4IC15IEB1cHN0YXNoL2NvbnRleHQ3LW1jcCJ9)

```json
{
  "mcpServers": {
    "context7": {
      "command": "npx",
      "args": ["-y", "@upstash/context7-mcp"]
    }
  }
}
```

<details>
<summary>Alternative: Use Bun</summary>

[](https://cursor.com/install-mcp?name=context7&config=eyJjb21tYW5kIjoiYnVueCAteSBAdXBzdGFzaC9jb250ZXh0Ny1tY3AifQ%3D%3D)

```json
{
  "mcpServers": {
    "context7": {
      "command": "bunx",
      "args": ["-y", "@upstash/context7-mcp"]
    }
  }
}
```

</details>

<details>
<summary>Alternative: Use Deno</summary>

[](https://cursor.com/install-mcp?name=context7&config=eyJjb21tYW5kIjoiZGVubyBydW4gLS1hbGxvdy1lbnYgLS1hbGxvdy1uZXQgbnBtOkB1cHN0YXNoL2NvbnRleHQ3LW1jcCJ9)

```json
{
  "mcpServers": {
    "context7": {
      "command": "deno",
      "args": ["run", "--allow-env", "--allow-net", "npm:@upstash/context7-mcp"]
    }
  }
}
```

</details>

</details>

<details>
<summary><b>Install in Windsurf</b></summary>

Add this to your Windsurf MCP config file. See [Windsurf MCP docs](https://docs.windsurf.com/windsurf/mcp) for more info.

#### Windsurf Remote Server Connection

```json
{
  "mcpServers": {
    "context7": {
      "serverUrl": "https://mcp.context7.com/sse"
    }
  }
}
```

#### Windsurf Local Server Connection

```json
{
  "mcpServers": {
    "context7": {
      "command": "npx",
      "args": ["-y", "@upstash/context7-mcp"]
    }
  }
}
```

</details>

<details>
<summary><b>Install in VS Code</b></summary>

[<img alt="Install in VS Code (npx)" src="https://img.shields.io/badge/VS_Code-VS_Code?style=flat-square&label=Install%20Context7%20MCP&color=0098FF">](https://insiders.vscode.dev/redirect?url=vscode%3Amcp%2Finstall%3F%7B%22name%22%3A%22context7%22%2C%22command%22%3A%22npx%22%2C%22args%22%3A%5B%22-y%22%2C%22%40upstash%2Fcontext7-mcp%40latest%22%5D%7D)
[<img alt="Install in VS Code Insiders (npx)" src="https://img.shields.io/badge/VS_Code_Insiders-VS_Code_Insiders?style=flat-square&label=Install%20Context7%20MCP&color=24bfa5">](https://insiders.vscode.dev/redirect?url=vscode-insiders%3Amcp%2Finstall%3F%7B%22name%22%3A%22context7%22%2C%22command%22%3A%22npx%22%2C%22args%22%3A%5B%22-y%22%2C%22%40upstash%2Fcontext7-mcp%40latest%22%5D%7D)

Add this to your VS Code MCP config file. See [VS Code MCP docs](https://code.visualstudio.com/docs/copilot/chat/mcp-servers) for more info.

#### VS Code Remote Server Connection

```json
"mcp": {
  "servers": {
    "context7": {
      "type": "http",
      "url": "https://mcp.context7.com/mcp"
    }
  }
}
```

#### VS Code Local Server Connection

```json
"mcp": {
  "servers": {
    "context7": {
      "type": "stdio",
      "command": "npx",
      "args": ["-y", "@upstash/context7-mcp"]
    }
  }
}
```

</details>

<details>
<summary><b>Install in Zed</b></summary>

It can be installed via [Zed Extensions](https://zed.dev/extensions?query=Context7) or you can add this to your Zed `settings.json`. See [Zed Context Server docs](https://zed.dev/docs/assistant/context-servers) for more info.

```json
{
  "context_servers": {
    "Context7": {
      "command": {
        "path": "npx",
        "args": ["-y", "@upstash/context7-mcp"]
      },
      "settings": {}
    }
  }
}
```

</details>

<details>
<summary><b>Install in Claude Code</b></summary>

Run this command. See [Claude Code MCP docs](https://docs.anthropic.com/en/docs/agents-and-tools/claude-code/tutorials#set-up-model-context-protocol-mcp) for more info.

#### Claude Code Remote Server Connection

```sh
claude mcp add --transport sse context7 https://mcp.context7.com/sse
```

#### Claude Code Local Server Connection

```sh
claude mcp add context7 -- npx -y @upstash/context7-mcp
```

</details>

<details>
<summary><b>Install in Claude Desktop</b></summary>

Add this to your Claude Desktop `claude_desktop_config.json` file. See [Claude Desktop MCP docs](https://modelcontextprotocol.io/quickstart/user) for more info.

```json
{
  "mcpServers": {
    "Context7": {
      "command": "npx",
      "args": ["-y", "@upstash/context7-mcp"]
    }
  }
}
```

</details>

<details>
<summary><b>Install in BoltAI</b></summary>

Open the "Settings" page of the app, navigate to "Plugins," and enter the following JSON:

```json
{
  "mcpServers": {
    "context7": {
      "command": "npx",
      "args": ["-y", "@upstash/context7-mcp"]
    }
  }
}
```

Once saved, enter in the chat `get-library-docs` followed by your Context7 documentation ID (e.g., `get-library-docs /nuxt/ui`). More information is available on [BoltAI's Documentation site](https://docs.boltai.com/docs/plugins/mcp-servers). For BoltAI on iOS, [see this guide](https://docs.boltai.com/docs/boltai-mobile/mcp-servers).

</details>

<details>
<summary><b>Using Docker</b></summary>

If you prefer to run the MCP server in a Docker container:

1. **Build the Docker Image:**

   First, create a `Dockerfile` in the project root (or anywhere you prefer):

   <details>
   <summary>Click to see Dockerfile content</summary>

   ```Dockerfile
   FROM node:18-alpine

   WORKDIR /app

   # Install the latest version globally
   RUN npm install -g @upstash/context7-mcp

   # Expose default port if needed (optional, depends on MCP client interaction)
   # EXPOSE 3000

   # Default command to run the server
   CMD ["context7-mcp"]
   ```

   </details>

   Then, build the image using a tag (e.g., `context7-mcp`). **Make sure Docker Desktop (or the Docker daemon) is running.** Run the following command in the same directory where you saved the `Dockerfile`:

   ```bash
   docker build -t context7-mcp .
   ```

2. **Configure Your MCP Client:**

   Update your MCP client's configuration to use the Docker command.

   _Example for a cline_mcp_settings.json:_

   ```json
   {
     "mcpServers": {
       "Context7": {
         "autoApprove": [],
         "disabled": false,
         "timeout": 60,
         "command": "docker",
         "args": ["run", "-i", "--rm", "context7-mcp"],
         "transportType": "stdio"
       }
     }
   }
   ```

   _Note: This is an example configuration. Please refer to the specific examples for your MCP client (like Cursor, VS Code, etc.) earlier in this README to adapt the structure (e.g., `mcpServers` vs `servers`). Also, ensure the image name in `args` matches the tag used during the `docker build` command._

</details>

<details>
<summary><b>Install in Windows</b></summary>

The configuration on Windows is slightly different compared to Linux or macOS (_`Cline` is used in the example_). The same principle applies to other editors; refer to the configuration of `command` and `args`.

```json
{
  "mcpServers": {
    "github.com/upstash/context7-mcp": {
      "command": "cmd",
      "args": ["/c", "npx", "-y", "@upstash/context7-mcp@latest"],
      "disabled": false,
      "autoApprove": []
    }
  }
}
```

</details>

<details>
<summary><b>Install in Augment Code</b></summary>

To configure Context7 MCP in Augment Code, follow these steps:

1. Press Cmd/Ctrl Shift P or go to the hamburger menu in the Augment panel
2. Select Edit Settings
3. Under Advanced, click Edit in settings.json
4. Add the server configuration to the `mcpServers` array in the `augment.advanced` object

```json
"augment.advanced": {
  "mcpServers": [
    {
      "name": "context7",
      "command": "npx",
      "args": ["-y", "@upstash/context7-mcp"]
    }
  ]
}
```

Once the MCP server is added, restart your editor. If you receive any errors, check the syntax to make sure closing brackets or commas are not missing.

</details>

<details>
<summary><b>Install in Roo Code</b></summary>

Add this to your Roo Code MCP configuration file. See [Roo Code MCP docs](https://docs.roocode.com/features/mcp/using-mcp-in-roo) for more info.

#### Roo Code Remote Server Connection

```json
{
  "mcpServers": {
    "context7": {
      "type": "streamable-http",
      "url": "https://mcp.context7.com/mcp"
    }
  }
}
```

#### Roo Code Local Server Connection

```json
{
  "mcpServers": {
    "context7": {
      "command": "npx",
      "args": ["-y", "@upstash/context7-mcp"]
    }
  }
}
```

</details>

<details>
<summary><b>Install in Zencoder</b></summary>

To configure Context7 MCP in Zencoder, follow these steps:

1. Go to the Zencoder menu (...)
2. From the dropdown menu, select Agent tools
3. Click on the Add custom MCP
4. Add the name and server configuration from below, and make sure to hit the Install button

```json
{
  "command": "npx",
  "args": [
    "-y",
    "@upstash/context7-mcp@latest"
  ]
}
```

Once the MCP server is added, you can easily continue using it.

</details>

## 🔧 Environment Variables

The Context7 MCP server supports the following environment variables:

- `DEFAULT_MINIMUM_TOKENS`: Set the minimum token count for documentation retrieval (default: 10000)

Example configuration with environment variables:

```json
{
  "mcpServers": {
    "context7": {
      "command": "npx",
      "args": ["-y", "@upstash/context7-mcp"],
      "env": {
        "DEFAULT_MINIMUM_TOKENS": "6000"
      }
    }
  }
}
```

## 🔨 Available Tools

Context7 MCP provides the following tools that LLMs can use:

- `resolve-library-id`: Resolves a general library name into a Context7-compatible library ID.
  - `libraryName` (required): The name of the library to search for

- `get-library-docs`: Fetches documentation for a library using a Context7-compatible library ID.
  - `context7CompatibleLibraryID` (required): Exact Context7-compatible library ID (e.g., `/mongodb/docs`, `/vercel/next.js`)
  - `topic` (optional): Focus the docs on a specific topic (e.g., "routing", "hooks")
  - `tokens` (optional, default 10000): Max number of tokens to return. Values less than the configured `DEFAULT_MINIMUM_TOKENS` value or the default value of 10000 are automatically increased to that value.

## 💻 Development

Clone the project and install dependencies:

```bash
bun i
```

Build:

```bash
bun run build
```

<details>
<summary><b>Local Configuration Example</b></summary>

```json
{
  "mcpServers": {
    "context7": {
      "command": "npx",
      "args": ["tsx", "/path/to/folder/context7-mcp/src/index.ts"]
    }
  }
}
```

</details>

<details>
<summary><b>Testing with MCP Inspector</b></summary>

```bash
npx -y @modelcontextprotocol/inspector npx @upstash/context7-mcp
```

</details>

## 🚨 Troubleshooting

<details>
<summary><b>Module Not Found Errors</b></summary>

If you encounter `ERR_MODULE_NOT_FOUND`, try using `bunx` instead of `npx`:

```json
{
  "mcpServers": {
    "context7": {
      "command": "bunx",
      "args": ["-y", "@upstash/context7-mcp"]
    }
  }
}
```

This often resolves module resolution issues in environments where `npx` doesn't properly install or resolve packages.

</details>

<details>
<summary><b>ESM Resolution Issues</b></summary>

For errors like `Error: Cannot find module 'uriTemplate.js'`, try the `--experimental-vm-modules` flag:

```json
{
  "mcpServers": {
    "context7": {
      "command": "npx",
      "args": ["-y", "--node-options=--experimental-vm-modules", "@upstash/[email protected]"]
    }
  }
}
```

</details>

<details>
<summary><b>TLS/Certificate Issues</b></summary>

Use the `--experimental-fetch` flag to bypass TLS-related problems:

```json
{
  "mcpServers": {
    "context7": {
      "command": "npx",
      "args": ["-y", "--node-options=--experimental-fetch", "@upstash/context7-mcp"]
    }
  }
}
```

</details>

<details>
<summary><b>General MCP Client Errors</b></summary>

1. Try adding `@latest` to the package name
2. Use `bunx` as an alternative to `npx`
3. Consider using `deno` as another alternative
4. Ensure you're using Node.js v18 or higher for native fetch support

</details>

## ⚠️ Disclaimer

Context7 projects are community-contributed and while we strive to maintain high quality, we cannot guarantee the accuracy, completeness, or security of all library documentation. Projects listed in Context7 are developed and maintained by their respective owners, not by Context7. If you encounter any suspicious, inappropriate, or potentially harmful content, please use the "Report" button on the project page to notify us immediately. We take all reports seriously and will review flagged content promptly to maintain the integrity and safety of our platform. By using Context7, you acknowledge that you do so at your own discretion and risk.

## 🤝 Connect with Us

Stay updated and join our community:

- 📢 Follow us on [X](https://x.com/contextai) for the latest news and updates
- 🌐 Visit our [Website](https://context7.com)
- 💬 Join our [Discord Community](https://upstash.com/discord)

## 📺 Context7 In Media

- [Better Stack: "Free Tool Makes Cursor 10x Smarter"](https://youtu.be/52FC3qObp9E)
- [Cole Medin: "This is Hands Down the BEST MCP Server for AI Coding Assistants"](https://www.youtube.com/watch?v=G7gK8H6u7Rs)
- [Income Stream Surfers: "Context7 + SequentialThinking MCPs: Is This AGI?"](https://www.youtube.com/watch?v=-ggvzyLpK6o)
- [Julian Goldie SEO: "Context7: New MCP AI Agent Update"](https://www.youtube.com/watch?v=CTZm6fBYisc)
- [JeredBlu: "Context 7 MCP: Get Documentation Instantly + VS Code Setup"](https://www.youtube.com/watch?v=-ls0D-rtET4)
- [Income Stream Surfers: "Context7: The New MCP Server That Will CHANGE AI Coding"](https://www.youtube.com/watch?v=PS-2Azb-C3M)
- [AICodeKing: "Context7 + Cline & RooCode: This MCP Server Makes CLINE 100X MORE EFFECTIVE!"](https://www.youtube.com/watch?v=qZfENAPMnyo)
- [Sean Kochel: "5 MCP Servers For Vibe Coding Glory (Just Plug-In & Go)"](https://www.youtube.com/watch?v=LqTQi8qexJM)

## ⭐ Star History

[](https://www.star-history.com/#upstash/context7&Date)

## 📄 License

MIT
app.py
CHANGED
@@ -15,8 +15,8 @@ from utils import copy_search_results, create_documentation_string, choice_promp
 
 _ = dotenv.load_dotenv()
 
-subprocess.run(["python3", "make_docs.py"])
-subprocess.run(["python3", "make_rag_db.py"])
 
 client = MilvusClient("milvus.db")
 embedding_fn = model.dense.OpenAIEmbeddingFunction(
@@ -37,8 +37,6 @@ def list_huggingface_resources_names() -> list[str]:
     with open("repos_config.json", "r") as f:
         repos = json.load(f)
 
-    print([repo["title"] for repo in repos])
-
     return [repo["title"] for repo in repos]
 
 
@@ -114,10 +112,12 @@ def get_huggingface_documentation(topic: str, resource_names: list[str] = []) ->
         text_format=Response,
     )
 
-
     # Create the documentation string using the file IDs and template
-    documentation_string = create_documentation_string(
 
     # Clean up temporary folder
     shutil.rmtree(temp_folder, ignore_errors=True)
@@ -135,23 +135,29 @@ def load_readme() -> str:
            content = f.read()
 
        # Skip YAML frontmatter if it exists
        if content.startswith("---"):
-            # Find the second '---' line
-            lines = content.split("\n")
-            start_index = 0
            dash_count = 0
-
            for i, line in enumerate(lines):
                if line.strip() == "---":
                    dash_count += 1
                    if dash_count == 2:
                        start_index = i + 1
                        break
-
-
-
-
        return content
 
    except FileNotFoundError:
        return "README.md not found"
 
 _ = dotenv.load_dotenv()
 
+subprocess.run(["python3", "scripts/make_docs.py"])
+subprocess.run(["python3", "scripts/make_rag_db.py"])
 
 client = MilvusClient("milvus.db")
 embedding_fn = model.dense.OpenAIEmbeddingFunction(
 
     with open("repos_config.json", "r") as f:
         repos = json.load(f)
 
     return [repo["title"] for repo in repos]
 
 
         text_format=Response,
     )
 
+    file_id = response.output_parsed.file_id
+
+    print(f"{topic} -> {file_id}")
 
     # Create the documentation string using the file IDs and template
+    documentation_string = create_documentation_string([file_id], temp_folder)
 
     # Clean up temporary folder
     shutil.rmtree(temp_folder, ignore_errors=True)
 
            content = f.read()
 
        # Skip YAML frontmatter if it exists
+        lines = content.split("\n")
+        start_index = 0
+
        if content.startswith("---"):
+            # Find the second '---' line to skip frontmatter
            dash_count = 0
            for i, line in enumerate(lines):
                if line.strip() == "---":
                    dash_count += 1
                    if dash_count == 2:
                        start_index = i + 1
                        break
+
+        # Find the line that starts with "### The Problem: Your LLM is stuck in the past"
+        for i in range(start_index, len(lines)):
+            if lines[i].startswith("### The Problem: Your LLM is stuck in the past"):
+                start_index = i
+                break
+
+        # Join the lines from the target starting point
+        content = "\n".join(lines[start_index:])
        return content
+
    except FileNotFoundError:
        return "README.md not found"
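As a rough reference for how functions like these are typically exposed as MCP tools with the Python SDK's FastMCP helper: the server name, placeholder bodies, and stdio transport in the sketch below are assumptions, not necessarily how `app.py` is wired.

```python
# Sketch of MCP tool registration with FastMCP; names and bodies are placeholders.
from mcp.server.fastmcp import FastMCP

mcp = FastMCP("hfcontext7")


@mcp.tool()
def list_huggingface_resources_names() -> list[str]:
    """Return the HF libraries and resources available for querying."""
    return ["Transformers", "Diffusers", "PEFT"]  # placeholder values


@mcp.tool()
def get_huggingface_documentation(topic: str, resource_names: list[str] = []) -> str:
    """Return the most relevant documentation page for the given topic."""
    return f"(documentation for {topic!r} would be returned here)"  # placeholder


if __name__ == "__main__":
    mcp.run(transport="stdio")
```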
schemas.py
CHANGED
@@ -1,4 +1,4 @@
 from pydantic import BaseModel
 
 class Response(BaseModel):
-
+    file_id: str
make_docs.py → scripts/make_docs.py
RENAMED
File without changes

make_rag_db.py → scripts/make_rag_db.py
RENAMED
File without changes
utils.py
CHANGED
@@ -18,17 +18,17 @@ choice_prompt = Template("""
 
 The user has asked the following question: $question
 
-The goal is get the user the
 
-Here is the tree structure of the documentation. Your task is to return the numeric
-associated with the
 
 <tree>
 $tree_structure
 </tree>
 
-Sample response:
-Top
 
 """.strip())
 
@@ -51,7 +51,7 @@ def create_documentation_string(file_ids, temp_folder):
     # Find the corresponding file in the temp folder
     docs_path = Path(temp_folder) / "docs"
     for file_path in docs_path.rglob("*.md*"):
-        if file_id in str(file_path):
             try:
                 with open(file_path, 'r', encoding='utf-8') as f:
                     content = f.read()
 
 The user has asked the following question: $question
 
+The goal is get the user the 1 most relevant documentation file to answer the question.
 
+Here is the tree structure of the documentation. Your task is to return the numeric id \
+associated with the most relevant .md and .mdx file.
 
 <tree>
 $tree_structure
 </tree>
 
+Sample response: "1.3.2"
+Top 1 file id:
 
 """.strip())
 
     # Find the corresponding file in the temp folder
     docs_path = Path(temp_folder) / "docs"
     for file_path in docs_path.rglob("*.md*"):
+        if file_id + "." in str(file_path):
             try:
                 with open(file_path, 'r', encoding='utf-8') as f:
                     content = f.read()