lone surrogates are invalid utf8, so html5gum 0.3.0, which takes &str/String, is not able to handle those.
after merging #25, html5gum will be able to read arbitrary bytes. at this point the expectation might be that lone surrogates produce error tokens, but they do not.
note: lone surrogates have no impact on parsing behavior. only some error tokens are missing from token stream.