Skip to main content

Clone Github repository

I've prepared a Github repository with product data for our made-up shoe store Solemates, we'll start by cloning the Github repo.

note

The image folder with all the shoe images for SoleMates is 134.2 MB

Clone the GitHub repository to get started:

Terminal
git clone https://github.com/norahsakal/solemates-data.git

Then navigate to the cloned folder:

Terminal
cd solemates-data

Inside the folder, you'll find:

  • CSV file: data/solemates_shoe_directory.csv (product data)
  • For convenience, a pre-embedded dataset with all text and image embeddings is also available: data/solemates_shoe_directory_with_embeddings_token_count.csv
  • Image folder: footwear (shoe images, ~134 MB)
  • Jupyter Notebook folder: full Jupyter Notebook of what we'll walk through

Let's pip install all the Python dependencies from the requirements.txt in the next step.