Age | Commit message (Collapse) | Author |
|
|
|
|
|
|
|
|
|
* -Wcast-qual
* -Wmissing-parameter-type
* -Wold-style-definition
|
|
|
|
Cleaned-up page generation
|
|
entry"
|
|
|
|
|
|
|
|
holding address/uri or address/uri/filename rather than passing opaque char* of unknown size.
|
|
|
|
|
|
|
|
|
|
|
|
added the following default flags:
-Wformat
-Wformat-security
-Wmultichar
-Wwrite-strings
fixed several other warnings
|
|
address in memory, leading to have hashtable positions not anymore valid.
This issue was especially triggered when a redirect was processed ("Warning moved treated for .." messages)
* closes: #43
|
|
|
|
|
|
|
|
|
|
|
|
|
|
* protocol was finally dropped in Windows Media Services 2008
* mmsrip is not supported anymore
* some licensing issues regarding the protocol (ha-ha)
|
|
Fixed year notice
|
|
|
|
bug is still out there.
|
|
|
|
|
|
the current page being parsed, NOT the parent page. (alexei dot co at gmail dot com)
* closes: issue #20
|
|
|
|
|
|
|
|
|
|
|
|
related to the way non-ascii characters are being decoded
Rationale:
* inside URI
* non-ascii characters are read with the page encoding, and transformed into UTF-8
* url-escaped %xx are considered utf-8 sequences to be decoded, unless they form invalis sequences (in such case we left them as-is)
* html entities (names, or decimal/hex) are decoded as utf-8 characters
* inside query string
* non-ascii characters are read as binary, and escaped using %xx
* url-escaped %xx are left unless not harmful (alphanum, for example)
* html entities (names, or decimal/hex) are decoded as utf-8 characters and encoded back to the page encoding (possibly using %xx)
* inside hostnames
* non-ascii characters are encoded using IDNA
Example:
* are equivalent in a iso-8859-1 page: http://foo/café.html http://foo/caf%c3%a9.html http://caf&#a9;.html
|
|
|
|
|
|
|
|
Rationale:
* hostname is ASCII, non-ascii characters shall be encoded with IDNA
* URI filenames may embed non-ascii characters, which MUST be UTF-8 encoded
* query string may embed non-ascii characters, which are encoded with the pahe charset into %xx codes
|
|
Fixed HTML entities decoding which was done before charset decoding.
|
|
|
|
|
|
(RFC 3986)" (http://code.google.com/p/httrack/issues/detail?id=12)
|
|
(http://code.google.com/p/httrack/issues/detail?id=11)
|
|
javascript issues (http://code.google.com/p/httrack/issues/detail?id=4)
|
|
(http://code.google.com/p/httrack/issues/detail?id=2)
|
|
setup:
indent -l80 -lc80 -nhnl -nut -bad -bap -bbo -br -brf -bli2 -brs -bls -br -ss
-sai -pmt -nsaw -nsaf -nprs -i2 -ce -npsl -npcs -cs -sob -cdw -nbc -lp
logs:
indent: htsparse.c:364: Warning:old style assignment ambiguity in "=-". Assuming "= -"
indent: htsparse.c:366: Warning:old style assignment ambiguity in "=-". Assuming "= -"
indent: htsparse.c:368: Warning:old style assignment ambiguity in "=-". Assuming "= -"
indent: htsparse.c:370: Warning:old style assignment ambiguity in "=-". Assuming "= -"
indent: htsparse.c:387: Warning:old style assignment ambiguity in "=-". Assuming "= -"
indent: htsparse.c:738: Warning:old style assignment ambiguity in "=*". Assuming "= *"
indent: htsparse.c:907: Warning:old style assignment ambiguity in "=*". Assuming "= *"
indent: htsparse.c:925: Warning:old style assignment ambiguity in "=-". Assuming "= -"
indent: htsparse.c:970: Warning:old style assignment ambiguity in "=-". Assuming "= -"
indent: htsparse.c:971: Warning:old style assignment ambiguity in "=-". Assuming "= -"
indent: htsparse.c:1261: Warning:old style assignment ambiguity in "=*". Assuming "= *"
indent: htsparse.c:1277: Warning:old style assignment ambiguity in "=*". Assuming "= *"
indent: htsparse.c:1410: Warning:old style assignment ambiguity in "=*". Assuming "= *"
indent: htsparse.c:1459: Warning:old style assignment ambiguity in "=*". Assuming "= *"
indent: htsparse.c:1494: Warning:old style assignment ambiguity in "=-". Assuming "= -"
indent: htsparse.c:1504: Warning:old style assignment ambiguity in "=-". Assuming "= -"
indent: htsparse.c:1541: Warning:old style assignment ambiguity in "=-". Assuming "= -"
indent: htsparse.c:1583: Warning:old style assignment ambiguity in "=-". Assuming "= -"
indent: htsparse.c:1597: Warning:old style assignment ambiguity in "=-". Assuming "= -"
indent: htsparse.c:1625: Warning:old style assignment ambiguity in "=-". Assuming "= -"
indent: htsparse.c:2975: Warning:old style assignment ambiguity in "=-". Assuming "= -"
|