Unable to get article from Atlas Obscura

When I try to get this feed on my self-hosted 3.9.13 instance, I get an error: https://www.atlasobscura.com/feeds/latest

The error is:

This page contains the following errors:
error on line 2 at column 1: Extra content at the end of the document
Below is a rendering of the page up to the first error.

I do not get this error on the public 3.10 instance. Looking at the debug output, it seems to be the first item in the feed that is causing the problem:
https://www.atlasobscura.com/places/cat-head-delta-blues-folk-art

When I look at the debug, it throws this error, which I think is from Readability:

<b>Deprecated</b>:  mb_convert_encoding(): Handling HTML entities via mbstring is deprecated; use htmlspecialchars, htmlentities, or mb_encode_numericentity/mb_decode_numericentity instead in <b>/var/www/html/fulltext/libraries/readability/JSLikeHTMLElement.php</b> on line <b>59</b><br />

Any suggestions on how to fix this?

The ‘Deprecated’ notice mostly means that your PHP version is too new for your 3.9.13 instance.

If there is no site config file for this domain, FTR tries to predict which part of the page is the relevant content. For that prediction, FTR seems to use methods that are deprecated in newer PHP versions.

If you write your own site config with a body selector, FTR will not try to predict content, as long as it finds your selector pattern in the content.
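As a rough illustration of what such a site config can look like, here is a minimal sketch. The file name and every XPath expression in it are placeholders I made up for the example, not the actual selectors for Atlas Obscura, so treat it as a starting point only.

```
# Hypothetical example, e.g. saved as atlasobscura.com.txt in your custom site config folder
# All XPath expressions below are placeholders, not the real selectors for this site

title: //h1

# Main content block; when this matches, content prediction is skipped
body: //div[@id='article-body']

# A further body line can cover a second page layout
body: //article[contains(@class, 'content')]

# Remove an unwanted element by id or class name (placeholder name)
strip_id_or_class: related-posts

# A sample URL to check the config against
test_url: https://www.atlasobscura.com/places/cat-head-delta-blues-folk-art
```

As far as I know, multiple body lines are tried in order and the first one that matches is used, which is why a site with more than one layout can still be handled in a single file.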

This time it was a little more difficult, because A.O. uses at least two different HTML schemes, so I had to work out two selectors and some other settings so that most content from both schemes is caught by FTR and wallabag (a read-later server that uses the same config files).

Please update your site patterns now. If you find an article which is not shown properly, post the direct URL here. Maybe they use more schemes on their site.

That seems to have done the trick. Thanks so much!
