Is there a way to get embeddings from this model?

Imagine I have a database of thousands of documents and can extract each page as an image.

From the README, it looks like the model only accepts a small number of images at a time.
Is there a way to create embeddings for each image so I can do a retrieval phase (vector similarity) and use a smaller subset of images to respond to user queries?
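
To make the retrieval step concrete, here is a rough sketch of what I have in mind. It assumes each page image has already been turned into a vector somehow; the page IDs and the tiny 3-dimensional vectors are made-up placeholders, and `top_k_pages` is a hypothetical helper, not anything from this repo. Only the shortlisted pages would then be passed to the model.

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)

def top_k_pages(query_vec, page_vecs, k=3):
    """Return the IDs of the k pages most similar to the query, best match first."""
    scored = [(cosine_similarity(query_vec, vec), page_id)
              for page_id, vec in page_vecs.items()]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [page_id for _, page_id in scored[:k]]

# Placeholder embeddings: in practice these would come from an embedding model,
# one vector per page image. IDs and values here are invented for illustration.
page_vecs = {
    "doc1_page1": [0.9, 0.1, 0.0],
    "doc1_page2": [0.0, 1.0, 0.0],
    "doc2_page1": [0.8, 0.2, 0.1],
    "doc2_page2": [0.0, 0.1, 0.9],
}
query_vec = [1.0, 0.0, 0.0]  # placeholder embedding of the user query

shortlist = top_k_pages(query_vec, page_vecs, k=2)
print(shortlist)  # only these pages would be sent to the model
```

With thousands of documents, the brute-force loop would presumably be replaced by a vector index, but the idea is the same.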

Thanks!