Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revisionBoth sides next revision
proxy_scraper:faq [2016-01-25 18:32] svenproxy_scraper:faq [2016-05-27 08:13] sven
Line 7: Line 7:
 Tools like ScrapeBox can not handle CONNECT proxies and so you will see different results when using them in that software. Tools like ScrapeBox can not handle CONNECT proxies and so you will see different results when using them in that software.
  
 +
 +**Why does this proxy just work for bing.com?**
 +
 +There was something strange that I discovered when building the tool. When the program searches for new sources, it could happen that it found proxies who are on port 80 but only seem to work for bing.com.
 +You see them in your list with that TAG and the anonymous level been gray it it could not check that.
 +
 +I have checked that closely and really was not able to test that proxy successfully against anything else but bing.
 +Sometimes I was able to access facebook.com or ebay.com with it. Even Twitter.com sometimes but thats it. Those proxies are only useful if you want to get access to those sites.
 +
 +So what type is that I asked myself and opened that IP directly in browser. This is what you get:
 +
 +<code>
 +Invalid URL
 +The requested URL "[no URL]", is invalid.
 +
 +Reference #9.5c957441.1464336353.8984bf0 
 +</code>
 +
 +Having a look at the header showed:
 +<code>
 +HTTP/1.0 400 Bad Request
 +Server: AkamaiGHost
 +Mime-Version: 1.0
 +Content-Type: text/html
 +Content-Length: 208
 +Expires: Fri, 27 May 2016 07:54:48 GMT
 +Date: Fri, 27 May 2016 07:54:48 GMT
 +Connection: close
 +</code>
 +OK so **Server: AkamaiGHost**. This is strange as I never heard anything about this. I searched for information about it and YES! [[https://en.wikipedia.org/wiki/Akamai_Technologies|Wikipedia]] had something for me.
 +And the last sentence on this should get you worried:
 +
 +//The National Security Agency and Federal Bureau of Investigation have reportedly used Facebook's Akamai content delivery network (CDN) to collect information on Facebook users.
 +Akamai has been accused of blocking access to web sites for visitors using Tor.//
 +
 +You better not use those sites for anything other than bing scraping. I would never use that for facebook and you should also stay away from that.
 +To not use them in export files, just uncheck //UNKNOWN// as anonymous level and you rule them out.