From 25adbdabb47499fe641c7bd9595024ff82667058 Mon Sep 17 00:00:00 2001 From: Xavier Roche Date: Mon, 19 Mar 2012 12:51:31 +0000 Subject: httrack 3.30.1 --- html/step9_opt6.html | 173 +++++++++++++++++++++++++++++++++++++++++++++++++++ 1 file changed, 173 insertions(+) create mode 100644 html/step9_opt6.html (limited to 'html/step9_opt6.html') diff --git a/html/step9_opt6.html b/html/step9_opt6.html new file mode 100644 index 0000000..110b27f --- /dev/null +++ b/html/step9_opt6.html @@ -0,0 +1,173 @@ + + + + + + + HTTrack Website Copier - Offline Browser + + + + + + + + + +
HTTrack Website Copier
+ + + + +
Open Source offline browser
+ + + + +
+ + + + +
+ + + + +
+ + +

Option panel : Spider

+ +
+ +
    +
    +

    + +
  • Accept cookies
  • +
    Accept cookies generated by the remote server +
    If you do not accept cookies, some "session-generated" pages will not be retrieved +


    + +
  • Check document type
  • +
    Define when the engine has to check document type +
    The engine must know the document type, to rewrite the file types. For example, if a link called /cgi-bin/gen_image.cgi generates a gif image, the generated file will not be called "gen_image.cgi" but "gen_image.gif" +
    Avoid "never", because the local mirror could be bogus +


    + +
  • Parse java files
  • +
    Must the engine parse .java files (java classes) to seek included filenames? +
    It is checked by default +


    + +
  • Spider
  • +
    Must the engine follow remote robots.txt rules when they exist? +
    The default is "follow" +


    + +
  • Update hack
  • +
    Attempt to limit transfers by wrapping known bogus responses from servers. + For example, pages with same size will be considered as "up to date", even if the timestamp seems + different. This can be useful for many dynamically generated pages, but this can also cause + not-updated pages in rare cases. +

    + +
  • Tolerant requests
  • +
    Tolerate wrong file size, and make requests compliant with old servers +
    It is unchecked by default, because this option can cause files to become bogus +


    + +
  • Force old HTTP/1.0 requests
  • +
    This option forces the engine to use HTTP/1.0 requests, and avoid HEAD requests. +
    Useful for some sites with old server versions, or with many dynamically generated pages. +


    +
+ +



+

Back to Home

+ + +
+
+
+ + + + + +
+ + + + + + -- cgit v1.2.3