Ideally, html should be skipped and not read aloud https://stackoverflow.com/questions/1732348/regex-match-open-tags-except-xhtml-self-contained-tags might come in handy to use regex to check if tehre is html (this is a joke(