I am increasingly running into the problem that only part of the content is extracted. In this case the headlines are missing. So far I haven’t found a way to include them via the config. Is this a bug?
Config: body: //div[@id='content-left'], alone or combined with //div[@class='body-text__paragraph-header'], with prune: no
Next problem: the website links are missing. Source: https://www.tripsavvy.com/the-best-museums-in-dallas-4767608
Config: tidy: no
body: //div[contains(concat(' ',normalize-space(@class),' '),' chop-content ')] | //ul[contains(concat(' ',normalize-space(@class),' '),' ordered-list__list ')] | //div[@class='mntl-sc-block-location__website']
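One thing that may be worth checking (I have not verified it against the page): the first two parts of the body expression match a class as one token among several, while the last part uses @class='...', which only matches when the whole attribute is exactly that string. The Python sketch below roughly mimics both tests. The function names and the sample class strings are made up for illustration and are not taken from the real markup.

```python
def class_token_match(class_attr: str, token: str) -> bool:
    """Roughly mimic contains(concat(' ', normalize-space(@class), ' '), ' token ')."""
    # normalize-space(): trim the ends and collapse runs of whitespace
    normalized = " ".join(class_attr.split())
    return f" {token} " in f" {normalized} "


def class_exact_match(class_attr: str, token: str) -> bool:
    """Mimic @class='token': the whole attribute value must equal the string."""
    return class_attr == token


# Hypothetical attribute values, assumed for illustration only
single = "mntl-sc-block-location__website"
multiple = "comp  mntl-sc-block-location__website mntl-block"

print(class_exact_match(single, "mntl-sc-block-location__website"))
print(class_exact_match(multiple, "mntl-sc-block-location__website"))
print(class_token_match(multiple, "mntl-sc-block-location__website"))
```

If the link container on the page carries more than one class, the exact test would not select it, and the token-style test used for chop-content might be the safer form for that part as well.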