Hi,
Is there a way to preserve iframe and embedded content without completely disabling prune, which is useful to clean the final content?
Thanks
Hard to tell without an example URL, @anarcho.
You could try to explicitly include iframes by appending the following to your existing body selector. I don’t know whether this will work.
Let’s say your existing body selector is
body: //div[@id='main']
Now try expanding this to:
body: //div[@id='main'] | //iframe
If this does not work, you will need prune: no and will have to strip the extra fringe content yourself.
Apologies for slow reply.
One thing to note on the iframe issue and Full-Text RSS: there are two endpoints, extract.php (single-article extraction) and makefulltextfeed.php (feed conversion).
We have a parameter, xss, which sanitises the HTML and removes certain elements, iframes among them. By default this is off for makefulltextfeed.php, because we assume the output is going to a feed reader that has its own sanitisation. For extract.php, xss sanitisation is on and will strip iframes. So if you’re using that endpoint, you have to send xss=0 to preserve iframes. Just be aware that if you’re outputting the results to users, you should do your own sanitisation.
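For anyone calling extract.php from a script, here is a minimal Python sketch of building a request URL with xss=0. The base URL is a hypothetical self-hosted install, the helper name is made up for illustration, and passing the article address as a url query parameter is an assumption; only extract.php and xss=0 come from the post above.

```python
from urllib.parse import urlencode

# Hypothetical base URL of a self-hosted Full-Text RSS install (assumption).
BASE = "https://example.org/full-text-rss/"


def build_extract_url(article_url: str, keep_iframes: bool = True) -> str:
    """Build an extract.php request URL (illustrative helper, not part of the product).

    Assumes the article address is passed as a `url` query parameter.
    When keep_iframes is True, xss=0 is added so sanitisation is switched off.
    """
    params = {"url": article_url}
    if keep_iframes:
        params["xss"] = "0"  # disable the built-in sanitisation so iframes survive
    return BASE + "extract.php?" + urlencode(params)


if __name__ == "__main__":
    print(build_extract_url("https://example.com/post"))
```

With xss=0 the returned HTML is unsanitised, so anything shown to users should go through your own sanitiser first.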