Trying to get a full extraction from Boing Boing

When I try to add Boing Boing https://boingboing.net to RFT FTR, I’m getting an error:

### This page contains the following errors:

error on line 2 at column 1: Extra content at the end of the document

### Below is a rendering of the page up to the first error.

I suspect this is a problem with the siteconfig for boing boing, but I have no idea how I should fix it. When I try to load it into the point and click extractor, it won’t load the URL at all.

Any suggestions?

Hello @blinkingline, welcome at FiveFilters forum!

‘RFT’? Or did you mean FTR for Fulltext-RSS?

The html structure on BoingBoing has changed some time ago, so our existing config for this domain doesn’t work.

Main problem was the single_page_link-line which doesn’t match any longer to any useful content.

I fixed that config just now. If you are a self-hoster, please go to /admin/update.php and click the update button. For ftr.fivefilters.net you have to wait for the next full hour until the new config takes effect.

If you have further issues, please send a direct link to the articles you try to get, not only the domain.

1 Like

That worked perfectly, Thanks so much!

2 Likes