Options

The software comes with a lot of options that should be suitable for all tasks.

On this screen you can set up the program's internal proxy server. This lets you use GSA Proxy Scraper with any other software that accepts a proxy setting. By default the proxy runs on localhost (127.0.0.1), port 8080. You can change this to suit your needs and also add authorization (login/password). Once an application (e.g. your browser) uses this proxy server, GSA Proxy Scraper routes its requests through the proxies in random order. Each new request to the proxy uses a new proxy. If you do not want just any proxy to be used, you can restrict the selection by specifying the tags a proxy must have.
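As an illustration of pointing another program at the internal proxy, here is a minimal Python sketch using only the standard library. The default address 127.0.0.1:8080 comes from the text above; the helper names `build_proxy_url`, `make_opener` and `fetch_status` are hypothetical, and it is an assumption that the login/password are accepted as standard HTTP proxy credentials embedded in the proxy URL.

```python
import urllib.request
from urllib.parse import quote


def build_proxy_url(host="127.0.0.1", port=8080, login=None, password=None):
    """Return an http:// proxy URL, embedding credentials if a login is given."""
    if login is not None:
        # Percent-encode credentials so characters like '@' or ':' stay unambiguous.
        auth = f"{quote(login, safe='')}:{quote(password or '', safe='')}@"
    else:
        auth = ""
    return f"http://{auth}{host}:{port}"


def make_opener(proxy_url):
    """Build a urllib opener that sends HTTP and HTTPS requests via proxy_url."""
    handler = urllib.request.ProxyHandler({"http": proxy_url, "https": proxy_url})
    return urllib.request.build_opener(handler)


def fetch_status(opener, url, timeout=30):
    """Fetch url through the opener and return the HTTP status code."""
    with opener.open(url, timeout=timeout) as response:
        return response.status
```

With the scraper's proxy running, something like `fetch_status(make_opener(build_proxy_url()), "http://example.com/")` would send the request through it; `example.com` is only a placeholder target.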

A proxy tag is something like a property you give a proxy. This makes it easy for you to group proxies into a maximum of 8 categories. Some of the tags are also set automatically by the proxy tests (Bing, Google, SSL…).

You can change the name and colour of a tag by clicking on it.
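The idea of requiring certain tags before a proxy is used can be sketched as a simple set filter followed by a random pick. This is only an illustration of the concept, not the program's actual code: the function `pick_proxy`, the dictionary layout and the sample addresses are all assumptions made for the example.

```python
import random


def pick_proxy(proxies, required_tags=(), rng=random):
    """Pick a random proxy whose tags include every required tag.

    Returns None when no proxy carries all of the required tags.
    """
    required = set(required_tags)
    candidates = [p for p in proxies if required <= set(p["tags"])]
    if not candidates:
        return None
    return rng.choice(candidates)


# Hypothetical sample data; the addresses are documentation-only IPs.
proxies = [
    {"address": "192.0.2.10:3128", "tags": {"Google", "SSL"}},
    {"address": "192.0.2.11:8080", "tags": {"Bing"}},
    {"address": "192.0.2.12:80", "tags": {"Google"}},
]
```

Calling `pick_proxy(proxies, ["Google"])` returns one of the two Google-tagged entries at random, while an empty tag list allows any proxy.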

At the bottom you can define how the proxies are tested and how fast. There are two timeouts: the connect timeout, and the timeout that applies once connected and receiving data. If you choose a low timeout, slow proxies will not pass and you are left with only the fast ones.
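To make the difference between the two timeouts concrete, the following Python sketch applies one limit to establishing the TCP connection and a separate one to waiting for data. It is a simplified illustration under assumed names (`probe`, its parameters and its return strings are hypothetical) and does not reproduce the tests the program really runs.

```python
import socket


def probe(host, port, connect_timeout=5.0, read_timeout=10.0, payload=b""):
    """Classify a host as 'ok', 'no_data', 'connect_failed' or 'read_failed'."""
    try:
        # The connect timeout only limits how long the TCP handshake may take.
        sock = socket.create_connection((host, port), timeout=connect_timeout)
    except OSError:
        return "connect_failed"
    with sock:
        # From here on, the read timeout limits each send/receive operation.
        sock.settimeout(read_timeout)
        try:
            if payload:
                sock.sendall(payload)
            data = sock.recv(1024)
        except OSError:
            return "read_failed"
    return "ok" if data else "no_data"
```

Lowering `read_timeout` makes slow responders fall into the `read_failed` bucket, which corresponds to slow proxies not passing the test.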

The number of threads defines how many parallel searches/tests the program will perform.
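What a thread count means in practice can be sketched with a bounded thread pool: at most that many checks run at the same time. The function `check_all` and the `check` callback below are hypothetical names for illustration; how the program schedules its own work internally is not documented here.

```python
from concurrent.futures import ThreadPoolExecutor


def check_all(proxies, check, threads=4):
    """Run check(proxy) for every proxy, using at most `threads` workers.

    Returns a dict mapping each proxy to the result of its check.
    """
    with ThreadPoolExecutor(max_workers=threads) as pool:
        # pool.map yields results in the same order as the input list.
        results = list(pool.map(check, proxies))
    return dict(zip(proxies, results))
```

A higher thread count finishes a large list sooner but opens more simultaneous connections, so the useful maximum depends on the machine and network.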

A website that offers free proxy servers is called a provider within GSA Proxy Scraper. The software comes with over 800 providers. A provider that is checked in the list will be parsed by the automatic searching mode. The maximum quality a provider can have is 100, which would mean that all extracted proxies work. That, however, is very unlikely. The Last Working column shows how many of the extracted proxies were working at the last parsing. As a description you can enter whatever you want.
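Since a quality of 100 means that every extracted proxy works, one plausible reading is that quality is the percentage of working proxies among the extracted ones. The sketch below is written under that assumption; the exact formula the program uses is not stated, and `provider_quality` is a hypothetical name.

```python
def provider_quality(extracted, working):
    """Return the share of working proxies as a whole-number percentage (0-100)."""
    if extracted < 0 or working < 0 or working > extracted:
        raise ValueError("need 0 <= working <= extracted")
    if extracted == 0:
        # Nothing was extracted, so there is nothing to rate.
        return 0
    return round(100 * working / extracted)
```

Under this reading, a provider that yielded 200 proxies of which 30 work would be rated 15.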

There is another option called “Parse extracted links from search-engine-tests”. Certain tests (Google Search, Yahoo, Bing,…) use a proxy as the search query, and every link extracted from the result page is saved. When this option is checked, all those links are parsed for new proxies. New proxy sources are often found that way, and you can later add them here as well. In the proxy list it is shown as “Proxy-Search Links - …”.

Below the list you have various options to manage the providers, such as adding new ones or deleting ones that no longer work.

  • Import - Will open a popup menu where you can import various proxy sources from other programs
  • Add - Will let you add proxy sources in different ways
  • Edit - Will open the editor to fine tune the options for each provider
  • Delete - Will delete all selected providers
  • Clear Cache - This will delete all the cache files for the provider. Cache files hold information about proxies that were extracted from the sources and found to be dead, so further proxy crawling will no longer waste time on them. Clearing the cache is useful only in special cases, for example when all proxies were tagged as down because of network errors.
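The purpose of the provider cache described under Clear Cache can be pictured as a set of known-dead proxies that is consulted before testing. This is a conceptual sketch only: the class `DeadProxyCache` and its methods are hypothetical, and the real cache files and their format are not described here.

```python
class DeadProxyCache:
    """Remember proxies already found dead so they are not tested again."""

    def __init__(self):
        self._dead = set()

    def mark_dead(self, proxy):
        """Record a proxy that failed its test."""
        self._dead.add(proxy)

    def filter_untested(self, extracted):
        """Return only the extracted proxies that are not known to be dead."""
        return [p for p in extracted if p not in self._dead]

    def clear(self):
        """Forget all dead entries, e.g. after a network outage caused false failures."""
        self._dead.clear()
```

After `clear()`, every extracted proxy is tested again, which is why clearing only pays off when the recorded failures are suspected to be wrong.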