Pages with problem

Hi. Only a portion of this page is being retrieved: Sir Elton John on his collaboration with Brandi Carlile, standing up for young artists — and why the future is dance | The Standard

This one, too: "Who Believes In Angels?" Song-by-Song - Elton John

Is it fixable?

Hi @mariomarinato,

  • standard.co.uk is fixed, but lead image is missing, while it appears in Fulltext-RSS. I don’t know why.
  • eltonjohn.com is also fixed. But they are using JavaScript to generate their text. So you need the browser extension or bookmarklet at the moment

Hi @fivefilters: FTR and wallabag can fetch full articles from eltonjohn.com just by sending the URL to them, while P2K can not. Maybe you can have a look on this

cheers
Holger

Thank you very much! Take care.

2 Likes

Hi @HolgerAusB, thanks for these fixes!

they are using JavaScript to generate their text

They are using Javascript to display the text, but thankfully the actual text content is there in the HTML too. So Push to Kindle can get it without the browser extension.

FTR and wallabag can fetch full articles from eltonjohn.com just by sending the URL to them, while P2K can not.

Do you have a URL I can check? I tried the URL Mario shared and it seems to work.

standard.co.uk is fixed, but lead image is missing, while it appears in Fulltext-RSS. I don’t know why.

Hmm, that’s interesting. I’ll need to look into it some more, but my guess is it’s due to the width and height attributes. All the other images have regular integers, but this one uses ‘auto’ and modern ‘vh’ units. See below…

<img
src="https://static.standard.co.uk/2025/04/09/9/59/SEI245908892.jpeg?trim=366,0,885,0&amp;quality=100&amp;auto=webp&amp;width=1920"
width="auto"
height="100vh"
alt="Sir Elton John on his collaboration with Brandi Carlile, standing up for young artists — and why the future is dance"
class="sc-eqUAAy kRUyJB">

Full-Text RSS doesn’t care so much, but Push to Kindle tries to clean up and remove elements it considers problematic. My guess is that those values are causing the image to be stripped from the output.

I haven’t tested this, but I would guess that adding something like this to the config file will fix it:

strip_attr: //img[@width="auto"]/@width
strip_attr: //img[contains(@height, 'vh')]/@height

Possibly only one of the above is needed.

1 Like

Hi @fivefilters, eltonjohn did not work as expected 3 hours after I uploaded my config with the link in first post. But about 2 hours after writing my post it suddenly worked. So no more problem on this domain.

For standard-co I’ll try to strip heigth/width on the weekend.

Thank you for pointing this out

Hi @HolgerAusB, sorry to hear about the delay. We’ve been moving to different servers, so that might have affected the automatic updates.

fixed: for standard.co.uk the window wide lead image is now in.