i have tried personally AJAX and file_get_contents to fetch the HTML from the url and parse the sun and rain like images and links.
Here's the one thing things i allow us to date: http://sites-check.com/testing/ajax_parser/
This works fine for most URL, However when I've placed http://www.google.com. It parses all pages but shows 500 internal server error for Google News Link. I've checked other web addresses for Google like google.com.au and there's a same issue which stuck on News URL.
You should check at above link by placing google.com on above link. I've opened up this news URL on browser and it is working correctly.
Any Guess? You will see on Internet panel if this break using the 500 Error.
I've placed onError block on my small AJAX call nevertheless its also not caching individuals error.
I've added the look for request and response Header.
Server Error Log
[Wed May 18 04:19:48 2011] [error] [client 18.104.22.168] Premature end of script headers: ajax_function.php, referer: sites-check.com/testing/ajax_parser [Wed May 18 04:19:48 2011] [error] [client 22.214.171.124] File does not exist: /home/onlinetechs/sites-check.com/internal_error.html, referer: sites-check.com/testing/ajax_parser