HanLee committed on
Commit
fd20dfc
1 Parent(s): 36ba8c8

feat: 02_10 wip

Files changed (2)
  1. README.md +2 -3
  2. app/app.py +0 -12
README.md CHANGED
@@ -2,10 +2,9 @@
  This is the repository for the LinkedIn Learning course `Hands-On AI: Building and Deploying LLM-Powered Apps`. The full course is available from [LinkedIn Learning][lil-course-url].

  _See the readme file in the main branch for updated instructions and information._
- ## Lab4: Indexing Documents into Vector Database
- In the previous lab, we enabled document loading and chunked the documents into smaller sub-documents. Now we need to index those chunks into our search engine's vector database so that we can build our Chat with PDF application using the RAG (Retrieval-Augmented Generation) pattern.
- In this lab, we will add OpenAI's embedding model and index the documents we chunked in the previous section into a vector database. We will use [Chroma](https://www.trychroma.com/) as our vector database of choice. Chroma is a lightweight embedding database that can live in memory, similar to SQLite.
+ ## Lab5: Putting it All Together
+ In Lab 2, we created the basic scaffold of our Chat with PDF app. In Lab 3, we added PDF uploading and processing functionality. In Lab 4, we added the capability to index documents into a vector database. Now that we have all the required pieces, it's time to assemble our RAG (retrieval-augmented generation) system using LangChain.

  ## Exercises
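The new Lab 5 section above is about wiring the pieces together: the Chroma search engine built in Lab 4 becomes the retriever for a LangChain question-answering chain. Below is a minimal sketch of that assembly, assuming the `langchain` 0.0.x APIs used in this course (`RetrievalQAWithSourcesChain`, `ChatOpenAI`) and a `search_engine` returned by `create_search_engine` in `app/app.py`; the exact chain, model, and prompt used in the lab may differ.

```python
from langchain.chains import RetrievalQAWithSourcesChain
from langchain.chat_models import ChatOpenAI

# `search_engine` is assumed to be the Chroma vector store returned by
# create_search_engine(docs=..., embeddings=...) in app/app.py below.
llm = ChatOpenAI(model="gpt-3.5-turbo", temperature=0, streaming=True)

chain = RetrievalQAWithSourcesChain.from_chain_type(
    llm=llm,
    chain_type="stuff",
    retriever=search_engine.as_retriever(),
)

# The chain embeds the question, retrieves the most relevant chunks from
# Chroma, and asks the LLM to answer with source citations.
response = chain({"question": "What is this PDF about?"})
print(response["answer"], response["sources"])
```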
app/app.py CHANGED
@@ -92,18 +92,12 @@ def create_search_engine(*, docs: List[Document], embeddings: Embeddings) -> Vec
          client_settings=client_settings
      )
      search_engine._client.reset()
-     ##########################################################################
-     # Exercise 1b:
-     # Now that we have defined our encoder model and initialized our search
-     # engine client, create the search engine from the documents.
-     ##########################################################################
      search_engine = Chroma.from_documents(
          client=client,
          documents=docs,
          embedding=embeddings,
          client_settings=client_settings
      )
-     ##########################################################################

      return search_engine
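With the Exercise 1b scaffolding removed, `create_search_engine` now builds the Chroma index directly from the chunked documents. The sketch below illustrates how the completed helper is typically invoked; `chunked_docs` is a hypothetical stand-in for the `List[Document]` produced by the PDF loading and splitting step, and the surrounding Chainlit app context is assumed.

```python
from langchain.embeddings.openai import OpenAIEmbeddings

# `chunked_docs` stands in for the List[Document] produced earlier in
# on_chat_start(); create_search_engine is the helper defined above.
embeddings = OpenAIEmbeddings(model="text-embedding-ada-002")
search_engine = create_search_engine(docs=chunked_docs, embeddings=embeddings)

# The returned Chroma store supports direct similarity search and can be
# wrapped as a retriever for the Lab 5 RAG chain.
top_chunks = search_engine.similarity_search("What is this document about?", k=4)
retriever = search_engine.as_retriever()
```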
@@ -136,15 +130,9 @@ async def on_chat_start():
      await msg.update()

      # Indexing documents into our search engine
-     ##########################################################################
-     # Exercise 1a:
-     # Add OpenAI's embedding model as the encoder. The most standard one to
-     # use is text-embedding-ada-002.
-     ##########################################################################
      embeddings = OpenAIEmbeddings(
          model="text-embedding-ada-002"
      )
-     ##########################################################################
      try:
          search_engine = await cl.make_async(create_search_engine)(
              docs=docs,