CLIP Text-to-Image Search Application

This project consists of two main components: a CLIP embedding service and a Next.js image search application. Before running these services, you need to prepare the dataset using the Jupyter notebook.

DEMO

Prerequisites

Docker and Docker Compose
Python 3.8+ (for dataset preparation)
Jupyter Notebook or VSCode with Jupyter extension

Environment Variables

Create a .env file in the root directory with the following content:

Note: For CLIP_MODEL_ID, use any model from Hugging Face that supports 768-dimensional embeddings.

PINECONE_API_KEY=your_pinecone_api_key
PINECONE_INDEX_NAME=your_pinecone_index_name
CLIP_MODEL_ID=your_clip_model_id

Dataset Preparation

Before running the main application, you need to prepare the dataset:

Follow the instructions in the prepare-dataset-notebook/README.md file to set up and run the image-prepare.ipynb notebook.
This notebook will download images, create embeddings, and store them in Pinecone.

Main Application Setup

Build the Docker images:

docker-compose build

Start the services:

docker-compose up

Services

CLIP Fast API Service

A stateless service that converts text or images into embeddings using the CLIP model.
Built with the CLIP model specified in the .env file.
Runs on port 8000.

Next.js Image Search Application

A web application that allows users to search for images using text queries.
Performs embedding search for top-K values on the Pinecone database.
Runs on port 3000.

Usage

Ensure both services are running (docker-compose up).
Access the Next.js application at http://localhost:3000.
Enter a text query to search for similar images in the Pinecone database.

Code Structure

The CLIP Fast API service is defined in the clipfastapi service in the Docker Compose file.
The Next.js Image Search application is defined in the clipimagesearch service.

Troubleshooting

If you encounter issues, ensure all environment variables are correctly set in the .env file.
For CLIP model issues, verify that you're using a compatible model from Hugging Face.

For any persistent problems, please open an issue in the repository.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

CLIP Text-to-Image Search Application

DEMO

Prerequisites

Environment Variables

Dataset Preparation

Main Application Setup

Services

CLIP Fast API Service

Next.js Image Search Application

Usage

Code Structure

Troubleshooting

Files

README.md

Latest commit

History

README.md

File metadata and controls

CLIP Text-to-Image Search Application

DEMO

Prerequisites

Environment Variables

Dataset Preparation

Main Application Setup

Services

CLIP Fast API Service

Next.js Image Search Application

Usage

Code Structure

Troubleshooting