Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
Next revisionBoth sides next revision
search_engine_ranker:script_manual [2019-08-09 08:02] – [Data Extraction] svensearch_engine_ranker:script_manual [2020-11-05 09:14] sven
Line 8: Line 8:
   * [[http://forum.gsa-online.de/discussion/13/gsa-script-editor-notepad-ser-edition|GSA Script Editor (Notepad++ SER Edition)]]   * [[http://forum.gsa-online.de/discussion/13/gsa-script-editor-notepad-ser-edition|GSA Script Editor (Notepad++ SER Edition)]]
   * [[http://forum.gsa-online.de/discussion/18057/passing-simple-protections|Tool to Debug Form-Submissions in Chrome Browser]]   * [[http://forum.gsa-online.de/discussion/18057/passing-simple-protections|Tool to Debug Form-Submissions in Chrome Browser]]
 +  * [[https://github.com/seospace/gsapi|Integrate 3rd party tools or custom coded scripts instead of using script-manual]]
  
   * [[http://forum.gsa-online.de/categories/new-engines-platforms|Forum to find help/suggestions]]   * [[http://forum.gsa-online.de/categories/new-engines-platforms|Forum to find help/suggestions]]
Line 44: Line 45:
 |page must have|This parameter is used to check whenever the webpage is usable for this engine or not. The content of this variable has to be present in the webpage (either pure text or html source). The variable can have multiple values separated by a %%|%% where just one has to match**.**\\ \\ //Example: //\\ //page must have1=Powered by XYZ%%|%%XYZ Powered//\\ //page must have2=!not allowed to access this page//\\ //page must have3=Webpage%%|%%Homepage//\\ \\ In the above example you see 3 variables and each of them has to match before a engine is used for that website. The ! parameter in "page must have2" means that the following should not appear on the page.| |page must have|This parameter is used to check whenever the webpage is usable for this engine or not. The content of this variable has to be present in the webpage (either pure text or html source). The variable can have multiple values separated by a %%|%% where just one has to match**.**\\ \\ //Example: //\\ //page must have1=Powered by XYZ%%|%%XYZ Powered//\\ //page must have2=!not allowed to access this page//\\ //page must have3=Webpage%%|%%Homepage//\\ \\ In the above example you see 3 variables and each of them has to match before a engine is used for that website. The ! parameter in "page must have2" means that the following should not appear on the page.|
 |url must have|This parameter is used the same way as "page must have" but for the URL string itself and not for the website content. \\ \\ //Example: url must have1=/wiki///\\ // url must have2=!wikipedia.com//\\ \\ In the above example we will not use this engine for URLs that contain //wikipedia//.com or do not have "/wiki/" in there URL.| |url must have|This parameter is used the same way as "page must have" but for the URL string itself and not for the website content. \\ \\ //Example: url must have1=/wiki///\\ // url must have2=!wikipedia.com//\\ \\ In the above example we will not use this engine for URLs that contain //wikipedia//.com or do not have "/wiki/" in there URL.|
-|fixed url|If no "//search term//" is used, you have to use this tag to indicate that the submission is just happening on one fixed site (usualy some kind of web 2.0  site with blog creation). \\ \\ //Example: fixed url=//[[https://my.opera.com/|https://my.opera.com]]\\ \\ This is than adding the URL to the target URLs of the project and starts a submission to it. |+|fixed url|If no "//search term//" is used, you have to use this tag to indicate that the submission is just happening on one fixed site (usualy some kind of web 2.0  site with blog creation). \\ \\ //Example: fixed url=//%%https://my.opera.com/%%\\ \\ This is than adding the URL to the target URLs of the project and starts a submission to it. |
 |search term|This is used to search for new targets on the internet with the help of search engines like google.\\ \\ //Example: //\\ //search term="Powered by XYZ"%%|%%"Powered by XYZ"//\\ \\ The example above uses two possible search queries. You can use also encoded the search term already if you want with "%90%AF%24". This is useful for queries in some strange language. The program will pick one of the queries randomly so the order is not important.| |search term|This is used to search for new targets on the internet with the help of search engines like google.\\ \\ //Example: //\\ //search term="Powered by XYZ"%%|%%"Powered by XYZ"//\\ \\ The example above uses two possible search queries. You can use also encoded the search term already if you want with "%90%AF%24". This is useful for queries in some strange language. The program will pick one of the queries randomly so the order is not important.|
 |add keyword to search|1 = Add a keyword from the project to the search query\\ 0 = Never add a keyword to the search query\\ 2 = Add just sometimes a keyword to it if it seems to be useful (default)| |add keyword to search|1 = Add a keyword from the project to the search query\\ 0 = Never add a keyword to the search query\\ 2 = Add just sometimes a keyword to it if it seems to be useful (default)|
Line 121: Line 122:
 |verify submission|1 = verify the submission\\ 0 = do not verify the submission but assume that the link is submitted and will be visible there or is already (default) Even though this is the default behaviour, you should set it to "1" as used in most engines.\\ 2 = this submission step is not creating a link but is used for something else.\\ Example: verify submission=1\\ \\ After a successful submission the URL is put to the verification list and checked in intervals if the link appears there.| |verify submission|1 = verify the submission\\ 0 = do not verify the submission but assume that the link is submitted and will be visible there or is already (default) Even though this is the default behaviour, you should set it to "1" as used in most engines.\\ 2 = this submission step is not creating a link but is used for something else.\\ Example: verify submission=1\\ \\ After a successful submission the URL is put to the verification list and checked in intervals if the link appears there.|
 |verify by\\ \\ verify search for|Defines how to verify a submission. Possible value for "verify by":\\ \\ search = try to locate the search form on the page and submit a search using "//verify search for//"\\ url    = downloads the URL in "//verify url//" and checks for the present of "//verify search for//" which is by default your URL\\ txturl = same as "url" but checks also if the link is appearing just as a text (no html link code).\\ email  = logs into your email account and tries to find a link that is coming from the same webpage domain.\\ extractemail = just parses for data in emails\\ without downloading any URL. This might be important for verification URLs that are used later on in scripts.\\ \\ //Example: //\\ //verify by=email+search,url//\\ //verify search for=%website_title%,%url%//\\ \\ This will do actually two verifications (separated by ,). First it tires to log into your email account and checks for links and also checks on the webpage itself using the search form and searching for "Website title". Once found by one of the methods it will use that resulting URL to check for your URL.| |verify by\\ \\ verify search for|Defines how to verify a submission. Possible value for "verify by":\\ \\ search = try to locate the search form on the page and submit a search using "//verify search for//"\\ url    = downloads the URL in "//verify url//" and checks for the present of "//verify search for//" which is by default your URL\\ txturl = same as "url" but checks also if the link is appearing just as a text (no html link code).\\ email  = logs into your email account and tries to find a link that is coming from the same webpage domain.\\ extractemail = just parses for data in emails\\ without downloading any URL. This might be important for verification URLs that are used later on in scripts.\\ \\ //Example: //\\ //verify by=email+search,url//\\ //verify search for=%website_title%,%url%//\\ \\ This will do actually two verifications (separated by ,). First it tires to log into your email account and checks for links and also checks on the webpage itself using the search form and searching for "Website title". Once found by one of the methods it will use that resulting URL to check for your URL.|
-|verify url\\ verify url remove\\ verify url replace|If you use "search", "url" or "txturl" in the variable "verify by" and you don't want to check on the last used URL then you can define a different URL here. Since version 11.71 you can also use multiple URLs seperated by %%|%%.\\ \\ //Example: //\\ //verify url=./index.php//\\ //verify url replace=/edit/;/show///\\ //verify url remove=sub///\\ \\ Imagine the last URL was: "[[http://www.something.com/sub/edit/?a=2|http://www.something.com/]][[http://www.something.com/sub/edit/?a=2|sub/edit/?a=2]]"\\ It would now be "[[http://www.something.com/show/index.php|http://www.something.com/show/index.php]]"|+|verify url\\ verify url remove\\ verify url replace|If you use "search", "url" or "txturl" in the variable "verify by" and you don't want to check on the last used URL then you can define a different URL here. Since version 11.71 you can also use multiple URLs seperated by %%|%%.\\ \\ //Example: //\\ //verify url=./index.php//\\ //verify url replace=/edit/;/show///\\ //verify url remove=sub///\\ \\ Imagine the last URL was: "%%http://www.something.com/sub/edit/?a=2%%"\\ It would now be "%%http://www.something.com/show/index.php%%"|
 |use original url to verify|1 = this will not use the last URL but the URL we started the whole engine with.\\ 0 = use the last URL at the end of the submission (default)\\ \\ //Example: use original url to verify=1//| |use original url to verify|1 = this will not use the last URL but the URL we started the whole engine with.\\ 0 = use the last URL at the end of the submission (default)\\ \\ //Example: use original url to verify=1//|
 |verify interval|Defines in what interval in minutes this verification should take place (default 180). \\ \\ //Example: verify interval=60//\\ \\ This checks each hour if the submission is verified.| |verify interval|Defines in what interval in minutes this verification should take place (default 180). \\ \\ //Example: verify interval=60//\\ \\ This checks each hour if the submission is verified.|
Line 241: Line 242:
  
 |%captcha% *image.php?*|tries to find the captcha that has image.php? in the URL path| |%captcha% *image.php?*|tries to find the captcha that has image.php? in the URL path|
-|%captcha% fixed:http://some-url-with-captcha.com/image.php?rnd=123|is not searching for a captcha but using that fixed URL|+|%captcha% fixed:%%http://some-url-with-captcha.com/image.php?rnd=123%%|is not searching for a captcha but using that fixed URL|
 |%captcha% overlay:*image.php?*|is searching for more than one captcha and combines them to one file| |%captcha% overlay:*image.php?*|is searching for more than one captcha and combines them to one file|