Use jest or ava to create unit tests for prompts: the tests drive direct-to-LLM calls to validate each prompt. For example, run each prompt against 10 receipts and consider the prompt validated if it passes on at least 7 of the 10.
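A minimal sketch of the 7-out-of-10 rule, assuming TypeScript and a test runner set up to execute it. All names here (`Receipt`, `PromptRunner`, `Check`, `meetsThreshold`, `validatePrompt`) are hypothetical, and the LLM call is injected as a function so the sketch does not assume any particular client library.

```typescript
// Hypothetical shapes: the note does not define what a receipt or a check looks like.
type Receipt = { id: string; text: string };
type PromptRunner = (prompt: string, receipt: Receipt) => Promise<string>;
type Check = (output: string, receipt: Receipt) => boolean;

// True when the number of passing runs reaches the required minimum.
function meetsThreshold(results: boolean[], minPasses: number): boolean {
  const passes = results.filter(Boolean).length;
  return passes >= minPasses;
}

// Runs one prompt against every receipt and applies the threshold.
// `run` performs the direct-to-LLM call; `check` decides whether one output is acceptable.
async function validatePrompt(
  prompt: string,
  receipts: Receipt[],
  run: PromptRunner,
  check: Check,
  minPasses: number = 7,
): Promise<boolean> {
  const results: boolean[] = [];
  for (const receipt of receipts) {
    try {
      const output = await run(prompt, receipt);
      results.push(check(output, receipt));
    } catch {
      // Assumption: a failed LLM call counts as a failed receipt rather than aborting the run.
      results.push(false);
    }
  }
  return meetsThreshold(results, minPasses);
}
```

Inside a jest or ava test, the boolean returned by `validatePrompt` could be asserted directly, with `run` wrapping whichever LLM client is in use. Because LLM output can vary between runs, the threshold tolerates a few failures instead of requiring all 10 receipts to pass.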
The receipts used should be fetched from our Hugging Face dataset of receipts.
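One possible way to pull the receipts, sketched under assumptions: it uses the public Hugging Face dataset viewer `/rows` endpoint and the global `fetch` available in Node 18+. The dataset id, config, and split are left as parameters because the note does not name them, and `buildRowsUrl` and `fetchReceiptRows` are hypothetical helper names. The optional token is only relevant if the dataset turns out to be private.

```typescript
// Builds the dataset viewer URL for the first `length` rows of a split.
function buildRowsUrl(dataset: string, config: string, split: string, length: number): string {
  const params = new URLSearchParams({
    dataset,
    config,
    split,
    offset: "0",
    length: String(length),
  });
  return `https://datasets-server.huggingface.co/rows?${params.toString()}`;
}

// Fetches rows and returns the raw row objects.
// The column layout depends on the dataset, so rows are typed loosely here.
async function fetchReceiptRows(
  dataset: string,
  config: string,
  split: string,
  length: number = 10,
  token?: string,
): Promise<Record<string, unknown>[]> {
  const headers: Record<string, string> = {};
  if (token) {
    // Assumption: a private dataset needs a Hugging Face access token.
    headers["Authorization"] = `Bearer ${token}`;
  }
  const response = await fetch(buildRowsUrl(dataset, config, split, length), { headers });
  if (!response.ok) {
    throw new Error(`Dataset request failed with status ${response.status}`);
  }
  const body = (await response.json()) as { rows: { row: Record<string, unknown> }[] };
  return body.rows.map((entry) => entry.row);
}
```

Fetching a fixed slice (offset 0, length 10) keeps the test set stable between runs, so pass or fail changes reflect the prompt and the model rather than a different sample of receipts.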
It won't be possible to integrate this with GitHub CI (not in 2025, anyway).