curl - Crawling itunes.apple.com -


I'm trying to crawl the apple iitos website. I am getting the output in binary format. For example

Curl-A "Mozilla / 5.0"

returns binary returns.

Can anyone tell me what am I missing?

Thank you

You are retracting binary because the page you quoted HTML / XML is not returning, it is returning an Apple WebObject wget to:

  wget http://itunes.apple.com/us/app/the- Far-islands-by-john-buchan / id327765949? Mt = 8 --2010-08-03 12:38:38: 14- http://itunes.apple.com/us/app/the-far-islands-by- John-buchan / id327765949? Mt = 8 solve itin solving Apple.com ... 17.250.237.16 connect to itunes.apple.com | 17.250.237.16 |: 80 ... Added HTTP request sent, waiting for response ... 200 Apple Web Objects Length: 22 9 00 (22K) [Text / html] Saving them: Id327765949? Mt = 8 '100% [=========================================== ==== 0.05 S in 2010-08-03 12:38:14 (440 KB / s) - `id327765949? Mt = 8 'saved [22 9 00/22 9 00]  

Look for more information, but if you want to crawl it, then you need to use that thing Maybe a browser simulates and can interpret this way - maybe it will work.


Comments

Popular posts from this blog

Eclipse CDT variable colors in editor -

AJAX doesn't send POST query -

wpf - Custom Message Box Advice -