Hi @HolgerAusB, thanks for these fixes!
they are using JavaScript to generate their text
They are using Javascript to display the text, but thankfully the actual text content is there in the HTML too. So Push to Kindle can get it without the browser extension.
FTR and wallabag can fetch full articles from eltonjohn.com
just by sending the URL to them, while P2K can not.
Do you have a URL I can check? I tried the URL Mario shared and it seems to work.
standard.co.uk
is fixed, but lead image is missing, while it appears in Fulltext-RSS. I don’t know why.
Hmm, that’s interesting. I’ll need to look into it some more, but my guess is it’s due to the width
and height
attributes. All the other images have regular integers, but this one uses ‘auto’ and modern ‘vh’ units. See below…
<img
src="https://static.standard.co.uk/2025/04/09/9/59/SEI245908892.jpeg?trim=366,0,885,0&quality=100&auto=webp&width=1920"
width="auto"
height="100vh"
alt="Sir Elton John on his collaboration with Brandi Carlile, standing up for young artists — and why the future is dance"
class="sc-eqUAAy kRUyJB">
Full-Text RSS doesn’t care so much, but Push to Kindle tries to clean up and remove elements it considers problematic. My guess is that those values are causing the image to be stripped from the output.
I haven’t tested this, but I would guess that adding something like this to the config file will fix it:
strip_attr: //img[@width="auto"]/@width
strip_attr: //img[contains(@height, 'vh')]/@height
Possibly only one of the above is needed.