Cannot get rss-feed for achgut.com any more

Hello five filters support team,

About two weeks ago the following rss feed address stopped getting me any more full-text feeds in newsboat, my text-only feed-reader of choice in Linux Mint 21.3:

https://www.achgut.com/rss

Here is the full debug log:

* APCu is enabled and available on server
* Supplied URL: https://www.achgut.com/rss
* Proxy will not be used
* Caching is enabled...
* Cache key not found in APCu
* ** Loading class HumbleHttpAgent (humble-http-agent/HumbleHttpAgent.php)
* ** Loading class ContentExtractor (content-extractor/ContentExtractor.php)
* ** Loading class SiteConfig (content-extractor/SiteConfig.php)
* --------
* Attempting to process URL as feed
* ** Loading class SimplePie_HumbleHttpAgent (humble-http-agent/SimplePie_HumbleHttpAgent.php)
* ** Loading class DisableSimplePieSanitize (DisableSimplePieSanitize.php)
* Fetching URL (https://www.achgut.com/rss)
* Starting parallel fetch (curl_multi_*)
* Processing set of 1
* ...https://www.achgut.com/rss
* ......adding to pool
* . looking for site config for achgut.com in custom folder
* . looking for site config for achgut.com in standard folder
* ... found site config in standard folder (achgut.com.txt)
* Adding site config to APC cache with key sc.achgut.com
* Cached site config with key achgut.com
* Adding site config to APC cache with key sc.achgut.com.merged
* Cached site config with key achgut.com.merged
* Checking fingerprints...
* No fingerprint matches
* ... site config for global.merged.ex found in APCu
* Appending site config settings from global.txt
* ......user-agent set to: Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)
* ......referer set to: http://www.google.co.uk/url?sa=t&source=web&cd=1
* Sending request...
* Received responses
* ... site config for achgut.com.merged already loaded in this request
* Checking fingerprints...
* No fingerprint matches
* ... site config for global.merged.ex found in APCu
* Appending site config settings from global.txt
* --------
* Constructing a single-item feed from URL
* ** Loading class FeedWriter (feedwriter/FeedWriter.php)
* --------
* Fetching feed items
* Starting parallel fetch (curl_multi_*)
* Processing set of 1
* ...https://www.achgut.com/rss
* ......in memory
* --------
* Processing feed item 1
* Item URL: https://www.achgut.com/rss
* ** Loading class FeedItem (feedwriter/FeedItem.php)
* URL already fetched - in memory (https://www.achgut.com/rss, effective: https://www.achgut.com/rss)
* Done!

I hope you can get the feed working again. Thank you in advance!

ThoLan

Weird, I don’t have problems with my self-hosted Full-Texr RSS (FTR) 3.9.13. But when using ftr.fivfilters.net I got an ‘[unable to retrieve full-text content]’ too. Maybe achgut is blocking the FTR server.

@ThoLan:
Are you self-hosting FTR (version?) or using the hosted service on fivefilters.net|org?

When posting logs or configs, please use the code button </> to prevent the log from being formatted due to * or links or other things.

@HolgerAusB
I am using the paid for hosted service of fivefilters.org. It may be that achgut.com is blocking the paid for FTR server on their end, but I don’t have any means to actually confirm that.
Maybe fivefilters-support could look into that directly?

Hi @ThoLan,

You are right that their servers are blocking requests from the FTR servers. In such cases we suggest users set up a Feed Control account and use the proxy feature.