Fetch fails when the article text contains illegal characters/bytes

Hi there,

In the last few weeks I have been getting errors with articles from vienna.at. I generated a feed via Creator and let FTR fetch the full article.

In FreshRSS I see literal HTML code in some articles instead of the article text; only the title is correct. Today I investigated this issue. I think Creator is not the problem, but FTR cannot deal with the faulty text of the article.

Example: https://www.vienna.at/funf-fuhrerscheinabnahmen-wegen-raserei-in-wien-liesing/8511634

In the first sentence there are two single bytes with the value 0x02 within the text, marked here with X:

  • Fünf FahrzeuglenXkern wurde im dortigen Bereich der Führerschein an Ort und Stelle abgenomXmen.

These illegal characters are inserted in addition to the regular characters of the text; they do not replace any of them.
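
To illustrate what I mean, here is a minimal Python sketch that locates and strips such bytes. This is not how FTR works internally; it only assumes that the problem is control characters like 0x02, which are not allowed in XML 1.0. The function names `find_illegal_chars` and `strip_illegal_chars` are made up for this example.

```python
import re

# Control characters that XML 1.0 does not allow:
# everything below 0x20 except tab (0x09), LF (0x0A) and CR (0x0D).
ILLEGAL_XML_CHARS = re.compile(r"[\x00-\x08\x0b\x0c\x0e-\x1f]")


def find_illegal_chars(text):
    """Return (position, code point) pairs for each illegal control character."""
    return [(m.start(), ord(m.group())) for m in ILLEGAL_XML_CHARS.finditer(text)]


def strip_illegal_chars(text):
    """Remove the illegal control characters and leave everything else untouched."""
    return ILLEGAL_XML_CHARS.sub("", text)


# The sentence from the article, with the two 0x02 bytes where the X marks are.
sample = (
    "Fünf Fahrzeuglen\x02kern wurde im dortigen Bereich der Führerschein "
    "an Ort und Stelle abgenom\x02men."
)

print(find_illegal_chars(sample))
print(strip_illegal_chars(sample))
```

Since the bytes do not replace any regular character, simply removing them should give back the intended words.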

When fetching this article with ftr.fivefilters.net (v3.10) I got this error:

[screenshot of the error message]

When trying this with my self-hosted FTR 3.9.13, I got the same red error box on a yellow background, but without the black and red text shown in the screenshot above.

Can you intercept this somehow, @fivefilters?

Edit:
The positions within the words look as if they are hidden hyphenation points (syllable breaks).