Multimodal E-Commerce Customer Service Chatbot

Problem:

Customers interact with brands via text, images, and voice but most chatbots only handle text. This fragmented approach limits accessibility and creates inconsistent user experiences.

Solution:

A unified, multimodal customer service Chatbot aims to solve real-world challenges in e-commerce. It is designed to handle multimodal inputs including text, voice and images to provide:

Personalized shopping recommendations
Instant FAQ responses
Seamless customer support across your e-commerce platform.

Tools & Technologies

LangChain: To orchestrate various language processing tasks
Groq API: For generating responses (Groq / Llama3)
Chainlit: To build interactive, user-friendly chatbot interface
OpenAI-CLIP: For image embeddings and similarity search
Speech Recognition: Google Speech-to-Text for voice input
Hugging Face Model: For Text embeddings and similarity search

🔗 Workflow Diagram

Text Input: Users type queries, and the chatbot responds with product suggestions or FAQ answers.
Image Input: Using AI models like CLIP, the chatbot analyzes uploaded images (like product photos) to find similar items.
Voice Input: Integrated speech-to-text library which lets users speak their queries which makes interactions smooth and accessible.

---
config:
  theme: forest
  look: handDrawn
---
flowchart LR
    subgraph Data_Sources[Data Sources]
        A[Product Data:<br/>Titles, Prices, FAQs,<br/>Product URLs]
        B[Images]
    end

    subgraph Embeddings[Embedding Models]
        A--> C1[Product Data<br/>Embedding Model]
        B--> C2[Image Embedding<br/>Model]
    end
    C1 --> D1((Product Data Embeds))
    C2 --> D2((Image Embeds))
    subgraph Storage[Vector Store & URLs]
        D1 --> E[Vector Store]
        D2 --> E
        E --> F[Pickle File:<br/>Stores Product URLs]
    end
    subgraph Interaction[User & LLM]
        G((User))
        G -->|Text/Voice/Image Query| H[LLM - Groq & Llama3]
        H -->|Vector Search| E
        E -->|Relevant Info<br/>Similar Image Search / Product Data| H
        H -->|Final Response| G
    end

Installation

This project uses Poetry for dependency management and packaging. Follow these steps to set up your environment:

Clone the repository:

git clone https://github.com/emanalytic/MultiModal-E-Commerce-Customer-Support-Chatbot.git
cd MultiModal-E-Commerce-Customer-Support-Chatbot

Install Poetry: (if not already installed)
```
pipx install poetry
```
Install project dependencies:
```
poetry install
```

Running the Project

Set up your environment variables:
```
GROQ_API_KEY= your_api_key_here  
```
Run the Chainlit App locally:
```
poetry run chainlit run chainlit_ui.py -w
```
Currently, Chainlit lacks built-in audio support.For voice mode, run the app via terminal with main.py using Poetry as well.
```
poetry run python main.py
```

Name		Name	Last commit message	Last commit date
Latest commit History 59 Commits
.idea		.idea
backend		backend
faiss_index		faiss_index
image_index		image_index
.gitignore		.gitignore
README.md		README.md
chainlit.md		chainlit.md
chainlit_ui.py		chainlit_ui.py
config.ini		config.ini
main.py		main.py
prompt.txt		prompt.txt
pyproject.toml~		pyproject.toml~
test_img.png		test_img.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Multimodal E-Commerce Customer Service Chatbot

Problem:

Solution:

Tools & Technologies

🔗 Workflow Diagram

Installation

Running the Project

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

emanalytic/MultiModal-E-Commerce-Customer-Support-Chatbot

Folders and files

Latest commit

History

Repository files navigation

Multimodal E-Commerce Customer Service Chatbot

Problem:

Solution:

Tools & Technologies

🔗 Workflow Diagram

Installation

Running the Project

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages