- HTML output instead of XML output. The page's encoding affects my processing step, which runs the page through the Simple DOM Parser, and special characters end up in the result. I use the html=1 flag. In the source, the less-than and greater-than characters are converted to their entity notations, such as &lt;, and I don't think the DOM parser likes that, so I tried converting them back.
This returns a page that starts with an XML header when you view the source, but an HTML header when you view it in Firebug or the Google Chrome code viewer.
- I need to get rid of the header that says “this is auto generated blah blah” and leave only the title and text of the article.
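The entity conversion mentioned in the first item can be sketched roughly as below. This is a minimal illustration in Python rather than the PHP that the Simple DOM Parser uses, and it is not the actual code from this setup: the function name `decode_entities` and the sample string are made up, and the real feed may escape its markup differently.

```python
import html


def decode_entities(escaped: str) -> str:
    """Turn entity-escaped markup (e.g. &lt;p&gt;) back into real tags."""
    # html.unescape replaces named and numeric character references
    # in a single pass over the string.
    return html.unescape(escaped)


# Hypothetical sample of what the escaped source might look like.
sample = "&lt;h1&gt;Title&lt;/h1&gt;&lt;p&gt;Text&lt;/p&gt;"
print(decode_entities(sample))
```

If the decoded string then contains real tags, it could be handed to the DOM parser in place of the escaped source.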
Please let me know how I can accomplish these two things.
Regards,
Rick