We'll build an AI fashion assistant: we index a fashion shop dataset from HuggingFace, then set up a RAG chain that processes user queries and generates responses.
RAG (Retrieval-Augmented Generation) is a technique that enhances the knowledge of language models by integrating additional data. A RAG application has two main components:
- Indexing: ingest and index data from a specific source; typically done offline.
- Retrieval and generation: at runtime, process the user's query, retrieve the relevant data, and generate a response.
The data used in this project is the Fashion Shop Dataset from HuggingFace, a collection of questions and answers related to fashion. The Q&A pairs are stored in MongoDB Atlas, with embeddings generated via LangChain. At query time, LangChain embeds the user's query, and the most similar documents are retrieved from MongoDB Atlas via vector search.
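A minimal sketch of that runtime retrieval step (the database, collection, and index names below are assumptions, not the repo's actual values):

```python
# Minimal sketch of the retrieval step; adjust names to your Atlas setup.
import os
from langchain_openai import OpenAIEmbeddings
from langchain_mongodb import MongoDBAtlasVectorSearch

vector_store = MongoDBAtlasVectorSearch.from_connection_string(
    os.environ["MONGODB_CONN_STRING"],
    "fashion_shop.qa",            # assumed "database.collection" namespace
    OpenAIEmbeddings(),           # same embedding model used at ingest time
    index_name="vector_index",    # assumed Atlas vector search index name
)

# Embed the user query and fetch the most similar Q&A documents.
docs = vector_store.similarity_search("What is the store policy for returns?", k=3)
```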
- Project: Shopping Assistant
- Dataset: Fashion Shop Dataset from HuggingFace
- MongoDB Atlas Subscription (Free Tier is fine)
- OpenAI API key
- LangChain API key
- Set up env vars for the OpenAI API key and the MongoDB connection string:
export OPENAI_API_KEY=<your-api-key>
export MONGODB_CONN_STRING=<your-conn-string>
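For reference, a sketch of how the scripts can pick these up (assumes pymongo; most LangChain OpenAI integrations read `OPENAI_API_KEY` from the environment automatically, so no explicit wiring is needed for it):

```python
import os
from pymongo import MongoClient

client = MongoClient(os.environ["MONGODB_CONN_STRING"])
client.admin.command("ping")  # quick connectivity check against Atlas
```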
- Set up env vars for the LangChain API key and tracing, required for the `hub.pull(...)` call in the script:
export LANGCHAIN_TRACING_V2=true
export LANGCHAIN_API_KEY=<your-api-key>
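A sketch of that call; `rlm/rag-prompt` is the RAG prompt commonly pulled in LangChain examples, so treat the exact prompt name as an assumption:

```python
from langchain import hub

# Pull a shared RAG prompt from the LangChain Hub;
# fails without LANGCHAIN_API_KEY set.
prompt = hub.pull("rlm/rag-prompt")
```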
- Create a new Python virtual environment:
python3 -m venv env
- Activate the new environment:
source env/bin/activate
- Install the requirements:
pip3 install -r requirements.txt
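For reference, a plausible requirements.txt for this stack; treat it as an assumption and defer to the file in the repo:

```text
langchain
langchainhub
langchain-openai
langchain-mongodb
pymongo
datasets
```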
The embedding is generated from the combination of the question and answer fields, on the assumption that we want to match the user query against both fields in the database. The embeddings are stored in MongoDB, and a vector search index is created over them (see the sketch below).
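A sketch of what the ingestion step might look like; the dataset id, field names, and database/collection/index names are assumptions:

```python
# Sketch of the ingestion step; adjust names and ids to the real ones.
import os
from datasets import load_dataset
from pymongo import MongoClient
from langchain_mongodb import MongoDBAtlasVectorSearch
from langchain_openai import OpenAIEmbeddings

collection = MongoClient(os.environ["MONGODB_CONN_STRING"])["fashion_shop"]["qa"]
dataset = load_dataset("<huggingface-dataset-id>", split="train")  # placeholder id

# Embed question + answer together so a query can match either field.
texts = [f"{row['question']} {row['answer']}" for row in dataset]

MongoDBAtlasVectorSearch.from_texts(
    texts,
    OpenAIEmbeddings(),
    metadatas=[{"question": r["question"], "answer": r["answer"]} for r in dataset],
    collection=collection,
    index_name="vector_index",  # the Atlas vector search index itself is
)                               # created separately, in the Atlas UI or API
```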
Run the ingestion script:
python3 ingest.py
Run the query script, passing your prompt as an argument:
python3 query.py -q "What is the store policy for returns?"
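For orientation, a sketch of what query.py might contain; the model, namespace, prompt name, and index name are assumptions:

```python
# Sketch of the query script: parse the prompt, retrieve context from
# Atlas, and run a standard LangChain RAG chain over it.
import argparse
import os

from langchain import hub
from langchain_core.output_parsers import StrOutputParser
from langchain_core.runnables import RunnablePassthrough
from langchain_mongodb import MongoDBAtlasVectorSearch
from langchain_openai import ChatOpenAI, OpenAIEmbeddings

parser = argparse.ArgumentParser()
parser.add_argument("-q", "--question", required=True)
args = parser.parse_args()

vector_store = MongoDBAtlasVectorSearch.from_connection_string(
    os.environ["MONGODB_CONN_STRING"],
    "fashion_shop.qa",              # assumed "database.collection" namespace
    OpenAIEmbeddings(),
    index_name="vector_index",      # assumed Atlas index name
)
retriever = vector_store.as_retriever(search_kwargs={"k": 3})

prompt = hub.pull("rlm/rag-prompt")      # commonly used RAG prompt (assumption)
llm = ChatOpenAI(model="gpt-3.5-turbo")  # assumed model

def format_docs(docs):
    return "\n\n".join(doc.page_content for doc in docs)

chain = (
    {"context": retriever | format_docs, "question": RunnablePassthrough()}
    | prompt
    | llm
    | StrOutputParser()
)
print(chain.invoke(args.question))
```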
You can also run the code in a Google Colab notebook. The notebook is available here.