curl - Crawling itunes.apple.com
I'm trying to crawl the Apple iTunes website, but I am getting the output in binary format. For example,
curl -A "Mozilla/5.0"
returns binary output.
Can anyone tell me what I am missing?
Thank you
You are getting back binary because the page you referenced is not returning HTML/XML; it is returning an Apple WebObject. Here is the output from wget:
wget http://itunes.apple.com/us/app/the-far-islands-by-john-buchan/id327765949?mt=8
--2010-08-03 12:38:14--  http://itunes.apple.com/us/app/the-far-islands-by-john-buchan/id327765949?mt=8
Resolving itunes.apple.com... 17.250.237.16
Connecting to itunes.apple.com|17.250.237.16|:80... connected.
HTTP request sent, awaiting response... 200 Apple WebObjects
Length: 22900 (22K) [text/html]
Saving to: `id327765949?mt=8'
100%[===============================================] in 0.05s
2010-08-03 12:38:14 (440 KB/s) - `id327765949?mt=8' saved [22900/22900]
Look up Apple WebObjects for more information, but if you want to crawl the site, you will need to use something that can simulate a browser and interpret the response. Maybe that will work.
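Before reaching for a browser simulator, it may be worth checking what the saved bytes actually are. The sketch below is only a diagnostic under an assumption this thread does not confirm: that the "binary" body might simply be gzip-compressed HTML, which is one common reason a text/html response looks unreadable. The helper name `describe_body` is hypothetical and not part of any library.

```python
import gzip
from typing import Tuple

# The two leading bytes that identify a gzip stream.
GZIP_MAGIC = b"\x1f\x8b"


def describe_body(body: bytes) -> Tuple[str, bytes]:
    """Classify a response body (hypothetical helper).

    Returns ("gzip", decompressed bytes) when the body starts with the
    gzip magic number, ("text", body) when it decodes as UTF-8, and
    ("binary", body) otherwise.
    """
    if body.startswith(GZIP_MAGIC):
        return "gzip", gzip.decompress(body)
    try:
        body.decode("utf-8")
    except UnicodeDecodeError:
        return "binary", body
    return "text", body


if __name__ == "__main__":
    # Stand-in data; a real check would read the file that wget saved.
    sample = gzip.compress(b"<html><body>example</body></html>")
    kind, decoded = describe_body(sample)
    print(kind, decoded[:6])
```

To try it on the real response, read the file wget saved in binary mode (for example `open("id327765949?mt=8", "rb").read()`) and pass the bytes to `describe_body`. If the result is "binary", the compression assumption does not hold and the advice above still applies.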