peterclough12 0 Posted February 15, 2015 Report Share Posted February 15, 2015 I'm scraping links from Amazon into a list. When I navigate to these links using $list item, the browser is changing the contents of the link as well as showing two consequetive copies of the page within the same browser i.e. one on top of the other. It is changing:http://www.amazon.com/s/ref=sr_pg_2?rh=n%3A133140011%2Ck%3Anutribullet+recipe+book&page=2&keywords=nutribullet+recipe+book&ie=UTF8&qid=1423992933 into:http://www.amazon.com/s/ref=sr_pg_2?rh=n:133140011%2Ck:nutribullet+recipe+book&page=2&keywords=nutribullet+recipe+book&ie=UTF8&qid=1423992933 i.e. it seems to be changing '%3A' into ':' as it navigates to the page and obviously has problems loading the page. Any suggestions anybody? Quote Link to post Share on other sites
Pete 121 Posted February 15, 2015 Report Share Posted February 15, 2015 http://www.w3schools.com/tags/ref_urlencode.asp Seems to be your problem look in the url %3A = : Quote Link to post Share on other sites
gavind 6 Posted February 19, 2015 Report Share Posted February 19, 2015 Hi Peter, just a curios here. Did you find out which caused this yet? Quote Link to post Share on other sites
deliter 203 Posted February 28, 2015 Report Share Posted February 28, 2015 that is insane!! two seperate pages of the same page on top of each other! wrap url decode around the items after they have been scraped,so you will know exactly what that link was meant to be,that seems to be url encoded Quote Link to post Share on other sites
Recommended Posts
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.