Clone Github repository
I've prepared a Github repository with product data for our made-up shoe store Solemates, we'll start by cloning the Github repo.
note
The image folder with all the shoe images for SoleMates is 134.2 MB
Clone the GitHub repository to get started:
Terminal
git clone https://github.com/norahsakal/solemates-data.git
Then navigate to the cloned folder:
Terminal
cd solemates-data
Inside the folder, you'll find:
- CSV file:
data/solemates_shoe_directory.csv
(product data) - For convenience, a pre-embedded dataset with all text and image embeddings is also available:
data/solemates_shoe_directory_with_embeddings_token_count.csv
- Image folder: footwear (shoe images, ~134 MB)
- Jupyter Notebook folder: full Jupyter Notebook of what we'll walk through
Let's pip install
all the Python dependencies from the requirements.txt
in the next step.