Imagine I have a database of thousands of documents and can extract each page as an image.
In the Readme, I can see that you need to pass a small number of images.
Is there a way to create embeddings for each image so I can do a retrieval phase (vector similarity) and use a smaller subset of images to respond to user queries?
Thanks!