This repository has been archived by the owner on Nov 13, 2024. It is now read-only.

Merge remote-tracking branch 'origin/dev' into document_config
igiloh-pinecone committed Oct 31, 2023
2 parents 9561945 + 2a33760 commit ec0a550
Showing 9 changed files with 9 additions and 25 deletions.
Binary file removed .readme-content/resin-chat.gif
Binary file removed .readme-content/resin-new.gif
Binary file removed .readme-content/resin-start.gif
Binary file removed .readme-content/resin-upsert.gif
26 changes: 5 additions & 21 deletions README.md
@@ -40,7 +40,7 @@ By enhancing language models with access to unlearned knowledge and infinite memory
</ol>
</details>

-## Why Canopy? [TODO: TBD]
+## Why Canopy?

* **Ease of use** - Canopy is installed with a single command and can deploy an AI application in minutes. **Canopy** is designed to be easy to use, easy to integrate with your existing applications, and compatible with the OpenAI /chat/completions API.

@@ -58,9 +58,7 @@ By enhancing language models with access to unlearned knowledge and infinite memory

> More information about the Core Library usage can be found in the [Library Documentation](docs/library.md)
-2. **Canopy Service** - a webservice that wraps the **Canopy Core** and exposes it as a REST API. The service is built on top of FastAPI, Uvicorn and Gunicorn and can be easily deployed in production.
-
-> For the complete documentation please go to: [#TODO: LINK](link.link.com)
+2. **Canopy Service** - a webservice that wraps the **Canopy Core** and exposes it as a REST API. The service is built on top of FastAPI, Uvicorn and Gunicorn and can be easily deployed in production. The service also comes with a built-in Swagger UI for easy testing and documentation. After you [start the server](#3-start-the-canopy-service), you can access the Swagger UI at `http://host:port/docs` (default: `http://localhost:8000/docs`).

3. **Canopy CLI** - Canopy comes with a fully functional CLI that is purposely built to let users quickly test their configuration and application before shipping. The CLI also comes with management operations that allow you to create indexes and load data quickly.

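Since the **Canopy Service** described in item 2 above exposes an OpenAI-compatible /chat/completions REST API, existing OpenAI-based code should mostly only need its base URL changed to talk to a running Canopy deployment. The snippet below is a minimal sketch, not taken from the repository: it assumes the service is running locally on the default `http://localhost:8000` address, and the exact base path of the chat route should be confirmed in the Swagger UI at `http://localhost:8000/docs`.

```python
# Minimal sketch: querying a locally running Canopy service through the OpenAI
# Python client. The base_url is an assumption -- verify the real route in the
# Swagger UI at http://localhost:8000/docs before relying on it.
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:8000",  # assumed Canopy service address/base path
    api_key="unused-placeholder",      # the client requires a value; Canopy may not check it
)

response = client.chat.completions.create(
    model="gpt-3.5-turbo",  # the model name is forwarded to the underlying LLM
    messages=[{"role": "user", "content": "What is Canopy?"}],
)
print(response.choices[0].message.content)
```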
@@ -133,8 +131,6 @@ And follow the CLI instructions. The index that will be created will have a pref

> To learn more about Pinecone Indexes and how to manage them, please refer to the following guide: [Understanding indexes](https://docs.pinecone.io/docs/indexes)
-![](.readme-content/canopy-new.gif)

### 2. Uploading data

You can load data into your **Canopy** Index by simply using the CLI:
@@ -162,8 +158,6 @@ Canopy supports single or multiple files in JSONL or Parquet format. The document

Follow the instructions in the CLI to upload your data.

-![](.readme-content/canopy-upsert.gif)

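The exact document schema is collapsed out of this diff, so the snippet below is only an illustration of what a JSONL upsert file might look like. The field names (`id`, `text`, `source`, `metadata`) are assumptions and should be verified against the repository's documentation before loading real data.

```python
# Hypothetical sketch: writing a small JSONL file in a shape that a Canopy-style
# upsert could consume. Field names are assumed, not taken from this diff.
import json

docs = [
    {
        "id": "doc-1",
        "text": "Canopy is a RAG framework built on top of the Pinecone vector database.",
        "source": "https://example.com/canopy-intro",  # hypothetical source URL
        "metadata": {"topic": "overview"},
    },
]

with open("data.jsonl", "w") as f:
    for doc in docs:
        f.write(json.dumps(doc) + "\n")
```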
### 3. Start the **Canopy** service

The **Canopy** service serves as a proxy between your application and Pinecone. It also handles the RAG part of the application. To start the service, run:
@@ -172,19 +166,14 @@
```bash
canopy start
```

-Now, you should be prompted with standard Uvicorn logs:
+Now, you should be prompted with the following standard Uvicorn message:

```
Starting Canopy service on 0.0.0.0:8000
INFO: Started server process [24431]
INFO: Waiting for application startup.
INFO: Application startup complete.
...
INFO: Uvicorn running on http://0.0.0.0:8000 (Press CTRL+C to quit)
```

-![](.readme-content/canopy-start.gif)


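As a quick sanity check that the service actually came up, you can hit the Swagger UI URL mentioned above. A throwaway sketch, assuming the default host and port:

```python
# Sketch: confirm the Canopy service answers on the default address. Only the
# /docs URL comes from the README text above; the check itself is illustrative.
from urllib.request import urlopen

with urlopen("http://localhost:8000/docs", timeout=5) as resp:
    print("Service is up, HTTP status:", resp.status)
```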
> **_📝 NOTE:_**
>
> The `canopy start` command will keep the terminal occupied. To proceed with the next steps, please open a new terminal window.
@@ -202,8 +191,6 @@ canopy chat

This will open a chat interface in your terminal. You can ask questions and **Canopy** will try to answer them using the data you uploaded.

-![](.readme-content/canopy-chat.gif)

To compare the chat response with and without RAG, use the `--baseline` flag:

```bash
canopy chat --baseline
```

@@ -212,9 +199,6 @@
This will open a similar chat interface window, but will send your question directly to the LLM without the RAG pipeline.

-![](.readme-content/canopy-chat-no-rag.gif)


### 5. Stop the **Canopy** service

To stop the service, simply press `CTRL+C` in the terminal where you started it.
2 changes: 1 addition & 1 deletion config/config.yaml
@@ -116,4 +116,4 @@ chat_engine:
params:
model_name: # The name of the model to use for encoding
text-embedding-ada-002
-batch_size: 100 # The number of document chunks to encode in each call to the encoding model
+batch_size: 400 # The number of document chunks to encode in each call to the encoding model
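For context, `batch_size` here only controls how many document chunks are sent per call to the encoding model, so raising the default from 100 to 400 reduces the number of embedding API round-trips for the same data. A trivial illustration (numbers are hypothetical):

```python
# Rough illustration of what the batch_size knob changes: the number of
# embedding API calls needed to encode a fixed number of chunks.
import math

chunks = 1_000                   # hypothetical number of document chunks
print(math.ceil(chunks / 100))   # old default -> 10 calls
print(math.ceil(chunks / 400))   # new default -> 3 calls
```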
2 changes: 1 addition & 1 deletion src/canopy/knowledge_base/knowledge_base.py
@@ -490,7 +490,7 @@ def _query_index(self,
def upsert(self,
documents: List[Document],
namespace: str = "",
-batch_size: int = 100,
+batch_size: int = 200,
show_progress_bar: bool = False):
"""
Upsert documents into the knowledge base.
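The `upsert` signature above is the only part of the knowledge-base API visible in this diff, so the following is a loose sketch rather than documented usage: the import paths, the `KnowledgeBase` constructor arguments, the `connect()` step, and the `Document` fields are all assumptions to be checked against the library documentation.

```python
# Loose sketch (assumptions marked): upserting documents through the knowledge base.
from canopy.knowledge_base import KnowledgeBase  # assumed import path
from canopy.models.data_models import Document   # assumed import path and fields

kb = KnowledgeBase(index_name="my-index")  # constructor arguments are an assumption
kb.connect()                               # a connect step is assumed to be required

docs = [Document(id="doc-1", text="Canopy example document", source="example")]

# batch_size=200 mirrors the new default introduced by this commit.
kb.upsert(docs, batch_size=200, show_progress_bar=True)
```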
2 changes: 1 addition & 1 deletion src/canopy/knowledge_base/record_encoder/openai.py
@@ -17,7 +17,7 @@ class OpenAIRecordEncoder(DenseRecordEncoder):
def __init__(self,
*,
model_name: str = "text-embedding-ada-002",
-batch_size: int = 100,
+batch_size: int = 400,
**kwargs):
encoder = OpenAIEncoder(model_name)
super().__init__(dense_encoder=encoder, batch_size=batch_size, **kwargs)
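Based only on the `__init__` shown above, constructing the encoder with the new default would look roughly like the sketch below; the import path is an assumption, while the constructor parameters come straight from the diff.

```python
# Sketch based on the constructor shown in this diff; the import path is assumed.
from canopy.knowledge_base.record_encoder import OpenAIRecordEncoder

# batch_size now defaults to 400, i.e. up to 400 chunks per embedding request.
encoder = OpenAIRecordEncoder(model_name="text-embedding-ada-002", batch_size=400)
```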
2 changes: 1 addition & 1 deletion src/canopy_server/api_models.py
@@ -19,7 +19,7 @@ class ContextQueryRequest(BaseModel):

class ContextUpsertRequest(BaseModel):
documents: List[Document]
-batch_size: int = 100
+batch_size: int = 200


class ContextDeleteRequest(BaseModel):
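The `ContextUpsertRequest` model above implies the service accepts a JSON body with a `documents` list and an optional `batch_size`. The request below is a sketch under stated assumptions: the endpoint path and the per-document fields are guesses (the Swagger UI at `/docs` is the authoritative reference); only the top-level `documents` and `batch_size` keys come from the model shown in this diff.

```python
# Sketch: upserting via the REST API. The URL path and document fields are
# assumptions; only "documents" and "batch_size" come from ContextUpsertRequest.
import requests  # third-party dependency

payload = {
    "documents": [
        {"id": "doc-1", "text": "Canopy example document", "source": "example"},
    ],
    "batch_size": 200,  # matches the new default in this commit
}

resp = requests.post("http://localhost:8000/context/upsert", json=payload, timeout=30)
resp.raise_for_status()
print(resp.status_code)
```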
