2023-05-11 Darshit Shah contrib/make-release: Update regexes to match the NEWS file 2023-05-11 Darshit Shah cfg.mk: Replace uses of filesystem with file system Update NEWS 2023-05-10 Darshit Shah bootstrap.sh: Update script from gnulib gnulib: Pull forward 2023-04-09 jinfuchiang (tiny change) Update deprecated option '--html-extension' to '--adjust-extension' * doc/wget.texi: Update option '--html-extension' to '--adjust-extension'. Renamed in Wget 1.12 to better reflect its behavior. 2023-03-19 Tim Rühsen Don't write core dump if --secure-option value isn't suppported. 2023-02-18 Tim Rühsen * src/retr.c (fd_read_body): Increase bufsize from 8k to 64k * src/retr.c (fd_read_body): Simplify gzip initialization * src/retr.c (fd_read_body): Use MAX instead of max Simplify url_error function. * src/url.c (url_error): simplify, remove url arg, return const char *. * src/url.h (url_error): remove url arg, return const char *. * src/html-url.c (get_urls_file): Simplify call to url_error(), remove call to free(). * src/http.c (metalink_from_http): Likewise. * src/main.c (main): Likewise. * src/metalink.c (retrieve_from_metalink, fetch_metalink_file): Likewise. * src/recur.c (retrieve_tree): Likewise. * src/res.c (res_retrieve_file): Likewise. * src/retr.c (retrieve_url, retrieve_from_file): Likewise. * src/url.c: Refactored url_error() * tests/valgrind-suppressions: Add rule for 'strcmp-sse2.S in libdl.so' 2023-02-17 Tim Rühsen * configure.ac: Remove CC flags -Wc90-c99-compat and -Wlong-long * fuzz/*: Use grep >/dev/null instead of grep -q * src/retr.c: Don't include hash.h Update copyright years * gnulib: Update 2022-12-11 Tim Rühsen * configure.ac: Disable nettle if NTLM is explicitly disabled 2022-12-10 Tim Rühsen * configure.ac: Allow disabling NTLM if nettle present (Savannah #63431) 2022-09-24 Tim Rühsen * fuzz/Makefile.am: Add -lz to fuzzer libs 2022-03-20 Tim Rühsen * src/main.c (print_help): Add --retry-on-host-error to help text Fix HSTS portability by using int64_t instead of time_t. * src/hsts.c: Use int64_t instead of time_t. * src/http.c: Use int64_t for parsing Strict-Transport-Security. src/http.c (time_to_rfc1123): Fix -Wformat-nonliteral * src/warc.c (warc_process_cdx_line): Fix variable type to idx_t * src/main.c (secs_to_human_time): Use snprintf instead of sprintf * src/main.c (main): Remove unused variable * src/netrc.c (test_parse_netrc): Check if HAVE_FMEMOPEN is defined 2022-02-26 Darshit Shah maint: post-release administrivia * NEWS: Add header line for next release. * .prev-version: Record previous version. * cfg.mk (old_NEWS_hash): Auto-update. * NEWS: Update NEWS items for release * .gitignore: Remove empty line at EOF * .gitignore: Update file * src.hsts.c(hsts_read_database): Use SCNd64 for portable format flags * configure.ac: Add some warning flags to ignore * cfg.mk: Remove passing syntax-checks from skip list Fix issues from syntax-check * doc/wget.texi: s/time stamp/timestamp/ * src/ftp-ls.c(clean_line): Same (ftp_parse_vms_ls): Don't cast return value of x*alloc * src/ftp.c: Don't cast return value of alloca * src/css-url.c: wget.h should be the first include * testenv/Makefile.am: Use $(PATH_SEPARATOR) instead of : when declaring a path * testenv/Test-no_proxy-env.py: s/can not/cannot/ * Update Copyright years * bootstrap: Update script * gnulib: Pull forward Cleanup some incorrect uses of AM Conditionals * configure.ac: Replace IRI_IS_ENABLED with WITH_IRI and METALINK_IS_ENABLED with WITH_METALINK * src/Makefile.am: Add the .c and .h files for IRI, Metalink and Xattr support to wget_SOURCES directly when needed instead of adding them as object files. * testenv/Makefile.am: Rename METALINK_IS_ENABLED to WITH_METALINK 2022-02-26 Darshit Shah Replace incorrect usage of AC_LIBOBJ in configure.ac AC_LIBOBJ is to be used for providing replacement functions for compatibility reasons. Not for conditional compilation of entire source files. Autotools provides other methods for achieving that. However, our build system has (ab-)used the AC_LIBOBJ macros for doing this for over 15 years. That stops today. * configure.ac: Replace uses of AC_LIBOBJ with automake conditionals that can be used in Makefiles * src/Makefile.am: Use the defined conditionals to select which files get built. 2022-02-25 Darshit Shah Fix case where installed gettext is newer than minimum version * configure.ac: Use AM_GNU_GETTEXT_REQUIRE_VERSION to specify a minimum version of gettext needed. This should allow newer versions of gettext to be copied for autipoint to use when available. This commit hopefully fixes the annoying issue when building from source on a system that has a newer version of gettext. 2022-02-13 Tim Rühsen * src/netrc.c (test_parse_netrc): Free netrc structure * .gitlab-ci.yml: Fix path to llvm-symbolizer * src/netrc.c (test_parse_netrc): New unit test function 2022-01-31 Aarni Koskela (tiny change) * src/http.c (parse_strict_transport_security): Fix typo in string 2022-01-23 Tim Rühsen * .gitlab-ci.yml (Scan-Build): Allow failure due to two false positives * configure.ac: Use pkg-config for gpgme, libidn2 and nettle 2022-01-22 Tim Rühsen * src/ftp.c: Small cleanups 2022-01-16 Nik Soggia (tiny change) Print newline after dot progress bar in non-verbose mode * src/progress.c (dot_finish): Print new in all progress bar contexts instead of just verbose 2022-01-12 Tim Rühsen * .gitlab-ci.yml: Fix artifact path for the Scan-Build runner 2022-01-09 Tim Rühsen * src/main.c (main): Unlink output document when --unlink is given 2021-12-22 Tim Rühsen fuzz/*.in: Update fuzzer corpora * .gitlab-ci.yml (CoverageReports): Fix artifacts paths * tests/valgrind-suppressions: Fix libidn rule * .gitlab-ci.yml: Fix artifacts paths * tests/valgrind-suppressions: Extend libidn rule * src/log.c (logprintf): Check earlier for verbosity * src/http.c (http_loop): Fix memleak 2021-12-21 Tim Rühsen Switch fuzzing build from C++ to C * Makefile.am (oss-fuzz): Build with $CC instead of $CXX. * README.md: Remove CXX and CXXFLAGS exports. 2021-12-01 Darshit Shah * src/http.c (http_loop): Hide password when printing status with -nv Reported-By: Per Lundberg Closes: #61492 * gnulib: Pull forward * src/hsts.c (hsts_read_database): Read time_t values as long long 2021-12-01 Thomas Niederberger * src/main.c (print_help): Add command line option for TLS 1.3 2021-10-11 Darshit Shah * retr.c (rotate_backups): Non existent files are not errors in this function 2021-10-08 Darshit Shah * ftp.c (ftp_loop_internal): Fix computation of total_downloaded_bytes When continuing a FTP download, or not starting one because the file is already fully retrieved, don't include the size of the file in the total_downloaded_bytes. Only the actual amount of data retrieved over the network should be considered there. Fixes: #61277 Reported-By: Michal Ruprich 2021-09-08 Darshit Shah Remove suprious print statements * src/gnutls.c: Remove fprintf statements. We should never print to console directly. Always honor the log levels. Fixes: #61125 2021-09-07 Darshit Shah maint: post-release administrivia * NEWS: Add header line for next release. * .prev-version: Record previous version. * cfg.mk (old_NEWS_hash): Auto-update. * NEWS: Update NEWS items for release * cfg.mk: Disable the indent check * gnulib: Pull forward 2021-08-21 WB <3572564-wadersgeek@users.noreply.gitlab.com> Fix #60956 (improve --page-requisites) * src/html_url.c (tag_handle_link): Check for "alternate stylesheet", "icon" and "manifest". 2021-08-07 Tim Rühsen * src/html-url.c (tag_handle_meta): Fix integer overflow 2021-07-05 Tim Rühsen * src/recur.c (download_child): Remove temporary robots.txt.tmp 2021-06-14 Josef Moellers Long pathnames patch 2021-06-07 Tim Rühsen * .gitlab-ci.yml: Remove scan-build deps from pages stage * .gitlab-ci.yml: Move scan-build/ to public/ only if it exists 2021-06-06 Tim Rühsen * .gitlab-ci.yml: Move scan-build results to parent directory * .gitlab-ci.yml: Move coverage results to parent directory * .gitlab-ci.yml: Don't allow failure of scan-build * .gitlab-ci.yml: Pages needs Build-Tarball * src/url.c (append_uri_pathel): Add cheap extra check to help static analyzers * src/http.c (gethttp): Add cheap extra check to help static analyzers * .gitlab-ci.yml: Exclude src/css_.o from scan-build * src/main.c (main): Removed unused variable 2021-06-06 Tim Rühsen Revert "Long pathnames patch" This reverts commit affad27664afccdedafd94711d350f368e859be4. Manual tests with very long path names did not work with this patch. We have to wait for a patch including automated tests. 2021-06-06 Tim Rühsen * .lgtm.yml: Add autoconf-archive * .gitlab-ci.yml: Re-arrange stages Remove .git/, .gitmodules and gnulib/ from bootstrap artefacts 2021-06-06 Darshit Shah Updated Gitlab CI Pipeline * .gitlab-ci.yml: COmpletely overhauled CI pipeline with dependencies for a faster and more robust CI. 2021-05-29 jmoellers Long pathnames patch 2021-05-22 Tim Rühsen * fuzz/wget_read_hunk_fuzzer.c: Add explicit cast for C++ compilation * fuzz/wget_ntlm_fuzzer.c: Add explicit cast for C++ compilation 2021-05-03 Tim Rühsen * src/http.c (initialize_request): Send Host HTTP header first This solves an issue where the server expects the Host: header as first one. This seems plausible (ahem) as the Host: header is the only one that is required. 2021-05-02 Tim Rühsen Improve wget_options_fuzzer * fuzz/fuzzer.h: Ignore -Wunused-parameter. * fuzz/wget_options_fuzzer.c: Let getaddrinfo() fail in while fuzzing. * fuzz/wget_options_fuzzer.in/*: Update corpora from OSS-Fuzz. 2021-05-02 Nekun (tiny change) Fix typo in VMS support code * src/utils.c: Remove unpaired brace 2021-04-15 Nils (tiny change) Use "nofollow" instead of "no-follow" in messages * src/html-url.c (get_urls_html_fm): Remove misleading debug message. * src/recur.c (retrieve_tree): Fix no-follow -> follow in DEBUGP. The attribute in html is "nofollow" so it is more consistent to call it so than to hyphenate it. 2021-04-15 Nils (tiny change) Print message for no-follow attribute only if norobots respected * src/html-url.c (get_urls_html_fm): Remove misleading log message. * src/recur.c (retrieve_tree): Add log message into correct if block. Commit e39be3283836b8cb7b9ee320456eefb2a2fda173 added a message that said links will not be followed whenever the nofollow attribute is found in a page. It didn't take into account that with -e robots=off (and equivalents) links will still be followed. This bug has been noticed multiple times: * https://www.reddit.com/r/DataHoarder/comments/mprq89/wget_respects_nofollow_attribute_despite_e/ * https://gist.github.com/simonw/27e810771137408fd7834ad153750c41#gistcomment-3648191 * https://superuser.com/questions/1494761/wget-wont-ignore-no-follow-attributes This commits makes it so that this message is only printed when a nofollow link is found and the norobots convention is respected. 2021-04-15 Tim Rühsen * configure.ac: Add AM_GNU_GETTEXT_VERSION 2021-04-11 Tim Rühsen * testenv/test/base_test.py: Rename valgrind-suppression-ssl -> valgrind-suppressions-ssl * testenv/test/base_test.py: Add --show-leak-kinds=all valgrind argument * tests/WgetTests.pm: Add --show-leak-kinds=all valgrind argument Fix testenv valgrind suppression * testenv/Makefile.am: Remove valgrind-suppressions from EXTRA_DIST. * testenv/valgrind-suppression-ssl: Rename to testenv/valgrind-suppressions-ssl. 2021-04-11 Tim Rühsen * bootstrap.conf: Add 'rename' to gnulib modules The missing module could be a reason for https://savannah.gnu.org/bugs/?60346. Reported-by: jrharris42@gmail.com 2021-04-11 Tim Rühsen * src/wget.h: Use strtoll() for str_to_wgint This fixes a regression reported at https://savannah.gnu.org/bugs/?60353. Reported-by: Michal Ruprich 2021-04-11 Shamil Gumirov (tiny change) Minor output fix to use quote_n() instead of quote() * src/ftp.c (ftp_retrieve_list): change quote to quote_n * src/iri.c (do_conversion): change quote to quote_n * src/url.c (convert_fname): change quote to quote_n The implementation quote() reuses the buffer it returns which leads to printing the same string for each quote() call in one output line. Instead, quote_n() should be used as highlighted in the doc: https://www.gnu.org/software/gnulib/manual/html_node/Quoting.html 2021-04-05 Tim Rühsen * src/Makefile.am: Add metalink.c and xattr.c to EXTRA_wget_SOURCES * testenv/Makefile.am: Allow tarball build without python3 2021-04-04 Tim Rühsen Fix spelling errors * NEWS: trough -> through. * README.checkout: Likewise. * contrib/spell-checker: Add exclusion 'parm' Remove gettext version requirements * bootstrap.conf: Remove gettext version requirements * configure.ac: Likewise. 2021-03-16 Darshit Shah Use a LOG_COMPILER for running fuzz tests This prevents needing a special case in the fuzz tests to detect valgrind usage and a new exec. Instead, we simply detect the environment in a shell script and start the test with valgrind in the first place. * fuzz/test-runner.sh: New script for running the fuzz tests * fuzz/main.c (main): Remove code for re-running under valgrind * fuzz/Makefile.am: Set the LOG_COMPILER to test-runner.sh and remember to add it to the list of extra files for packaging 2021-03-07 Tim Rühsen * configure.ac: Add AC_PROG_CC_C99 2021-03-07 Darshit Shah Bump gettext version to 0.20 * bootstrap.conf: Increase minimum version of gettext needed to 0.20 * configure.ac: Same * bootstrap: Update script * gnulib: Pull forward 2021-03-02 Darshit Shah Fix double free in FTP Code * src/ftp.c(getftp): Don't free `target`. If it is not pointing to targetbuf, then it still pointing to its original location of u->dir. This location will be free'd later. Doing so now causes a double free and hence crashes Wget * tests/Test-ftp-dir.px: New test to show double free error * tests/Makefile.am: Add new test 2021-01-31 Tim Rühsen * doc/wget.texi: Replace '' with in example Needed to hide from texi2pod.pl replacement of '' -> ". Reported-by: Vincent Lefevre 2021-01-23 Tim Rühsen Use gnulib's utime.h * bootstrap.conf: Remove utime-h (included by utime). * configure.ac: Remove header checks for utime.h and sys/utime.h. * src/utils.c: Simply #include . * configure.ac: Detect unresolved AX_ macros 2021-01-16 Tim Rühsen * configure.ac: Use noyywrap for AC_PROG_LEX Fix --quota on systems with 32bit long type * src/init.c (cmd_bytes_sum): Use WGINT_MIN and WGINT_MAX in check. * src/options.h (struct options): Make 'quota' of type wgint. * src/retr.c: Make 'total_downloaded_bytes' of type wgint. * src/utils.h: Fix comment. * src/wget.h: Add WGINT_MIN, remove SUM_SIZE_INT. * configure.ac: Remove obsolete AC_HEADER_STDC * configure.ac: Replace ` with ' * configure.ac: Use yywrap param for AC_PROG_LEX 2021-01-09 Darshit Shah maint: post-release administrivia * NEWS: Add header line for next release. * .prev-version: Record previous version. * cfg.mk (old_NEWS_hash): Auto-update. * NEWS: Update NEWS items for release * gnulib: Pull forward * cfg.mk: Add manual_title for web docs * contrib/make-release: Update to use maintainer-makefile * bootstrap.conf: Add gendocs and gnu-web-doc-update modules * configure.ac(AM_INIT_AUTOMAKE): Create lzip archives when making a distribution Prepare for making releases The maintainer makefile provides targets for making a release. They need these things: * cfg.mk: Set the hash of the NEWS file as it was at the beginning of this release * .prev-version: Create the .prev-version file Prepare NEWS file for new release * NEWS: Fix the copyright notice * NEWS: Add header for new release 2021-01-07 Darshit Shah Use PRId64 to correctly identify the format specifier * src/utils.c: Use PRId64 to correctly identify the format specifier for wgint values. This fixes a warning on 32-bit systems where wgint is a long long int instead of the long int that the format specifier indicated. Reported-by: Jeffrey Walton 2021-01-03 Darshit Shah * src/retr.c(rotate_backups): Simplify logic for handling filename rotation 2021-01-03 Matt Whitlock * configure.ac: Don't use Bashisms 2021-01-01 Darshit Shah Run make update-copyright * gnulib: Pull forward 2020-12-31 Darshit Shah * NEWS: Finalize release * gnulib: Update for release 2020-12-30 Darshit Shah * src/main.c: Disable use-askpass on VMS 2020-12-30 Steven M. Schweda Darshit Shah Fixes for running on VMS time_t on VMS is typically unsigned. (Lazy man's solution to 2038?) I added "(time_t)" type casts to negative values ("-1"), and changed tests to avoid complaints. * src/hsts.c (hsts_add_entry): Explicitly cast potentially negative time values to time_t to handle VMS quirks. (hsts_store_entry): Same (get_hsts_store_filename): Use new ajoin_dir_file function to join filenames (test_hsts_read_database): Same * src/init.c (struct options): use-askpass is not implemented on VMS (ajoin_dir_file): New Function to join filenames in a platform agnostic manner (wgetrc_user_file_name): Use ajoin_dir_file to join paths. Doing this correctly, eliminates the need for a special case on VMS * src/init.h: Add prototype for ajoin_dir_file * src/log.c (check_redirect_output): Ignore on VMS * src/main.c(option_data): Disable use-askpass on VMS (print_help): Same (get_hsts_database): Use ajoin_dir_file to join paths (print_version): Add VMS specific information to Version output * src/utils.c (fork_to_background): Fix signature on VMS 2020-12-29 Tim Rühsen Remove SIZEOF_WGINT as wgint is always int64_t * src/http.c (test_parse_range_header): Remove use of SIZEOF_WGINT. * src/utils.c (human_readable): Remove superfluous HR_NUMTYPE, * (number_to_string): Remove use of SIZEOF_WGINT. * src/utils.h: Remove use of SIZEOF_WGINT and HR_NUMTYPE. * src/wget.h: Remove #define SIZEOF_WGINT. src/wget.h: Cleanup code around wgint 2020-12-29 Darshit Shah Remove portability handling for str[n]casecmp * src/mswindows.c: Gnulib ensures we always have str{n}casecmp * configure.ac: Don't need to define HAVE_STR[N]CASECMP anymore 2020-12-29 Darshit Shah Remove portability handling code for wgint Gnulib's stdint.h module promises a C99 compliant stdint.h file on all platforms. Thus allowing us to directly use the fixed wodth integer type, int64_t wihout needing to resort to all the checks being performed. * src/wget.h: Assume that int64_t is always available and use it * src/mswindows.h: Remove portability code since gnulib handles it * configure.ac: Remove sizeof checks for integer types that are no longer used 2020-12-29 Tim Rühsen * .lgtm.yml: Call autoreconf to support Debian oldstable 2020-12-28 Tim Rühsen Fix --accept-regex/i--reject-regex for FTP * src/ftp.c (ftp_retrieve_glob): Call accept_url() with the full URL Reported-by: Frans de Boer 2020-12-28 Darshit Shah * tests/valgrind-suppressions: Make suppression for libidn false positive more generic * Makefile.am: Clean up gnulib_po/stamp-po tests/valgrind-suppressions: Add suppressions for false positives testenv/Test--https.py: Fix missing import .gitlab-ci.yml: Everyone needs an autoreconf now .gitlab-ci.yml: Even Debian Stable requires running autoreconf Fix usage of Magic number of tests testenv/test/base_test.py: Add new variable SKIP_TEST testenv/Test--https-crl.py: Use SKIP_TEST instead of magic number testenv/Test--https.py: Same testenv/Test-hsts.py: Same testenv/Test-no_proxy-env.py: Same testenv/Test-pinnedpubkey-der-https.py: Same testenv/Test-pinnedpubkey-der-no-check-https.py: Same testenv/Test-pinnedpubkey-hash-https.py: Same testenv/Test-pinnedpubkey-hash-no-check-fail-https.py: Same testenv/Test-pinnedpubkey-pem-fail-https.py: Same testenv/Test-pinnedpubkey-pem-https.py: Same .gitlab-ci.yml: All modern builds need autoreconf .po/POTFILES.in: Remove files containing no translateable strings Fix issues reported by codespell .gitlab-ci.yml: Need --force option to autopoint * .gitlab-ci.yml: Don't need autoreconf everywhere. Only on Arch This (partially) reverts commit 0309d13794303d82529addecfc31f0e1a8c5b97b. configure.ac: Revert to requiring gettext 0.19 .gitlab-ci.yml: Call autoreconf to refresh gettext files * NEWS: Update NEWS items configure.ac: Raise minimum gettext version to 0.20 Run autoupdate on configure.ac * configure.ac, m4/{wget, wget_manywarnings}.m4: Run autoupdate .gitignore: Ignore gnulib_po directory configure.ac: Replace obsolete macro AC_AIX with AC_USE_SYSTEM_EXTENSIONS 2020-12-27 Darshit Shah bootstrap.conf: Remove unneeded text block m4/getpagesize.m4: Don't track autogenerated file 2020-12-27 Darshit Shah Remove unneeded files from gnulib m4/exitfail.m4: Remove unneeded file m4/wchar.m4: Same m4/wctype.m4: Same The latest versions of these files will be copied by gnulib-tool upon its invokation. No need to track them around here 2020-12-27 Darshit Shah Use a separate domain for translating gnulib Use the --po-domain option to gnulib-tool to create a new textdomain that can be used by gnulib files for translations. This way, we don't have to maintain the list of all files that require translations in gnulib. * bootstrap.conf: Use --po-domain and --po-base options to create a separate base for gnulib translations * src/main.c(i18n_initialize): Call bindtextdomain on wget-gnulib to include those translations as well * Makefile.am: Add new directory gnulib_po to SUBDIRS * configure.ac: Generate gnulib_po/Makefile.in * lib/Makefile.am: Set AM_CPPFLAGS to empty since gnulib.mk expects it to be set 2020-12-27 Darshit Shah po/POTFILES.in: Update the list of files that need translation m4/getpagesize.m4: Update from gnulib configure.ac: Increase minimum required version to 2.64 gnulib: Pull forward 2020-12-22 Darshit Shah main.c (main): Warn when trying to use password without username 2020-12-13 Tim Rühsen * src/http.c (http_cleanup): Reset wget_cookie_jar after freeing This silences the wget_options_fuzzer which triggered #28610 on OSS-Fuzz. This issue can not happen with the Wget utility. The fuzzer runs main(),...,cleanup() in a loop which the Wget utility never does. 2020-11-08 Tim Rühsen * src/main.c: Add description to --help output of wait options 2020-07-11 Tomas Hozza testenv: Add test for handling of no_proxy environment variable * testenv/Test-no_proxy-env.py: Added new test for no_proxy env * testenv/Makefile.am: Added the new test to Makefile Added new test with 5 cases, which are testing various combinations of no_proxy environment variable definition and requested URLs. The test is skipped if the system does not support resolution of localhost subdomains to lcalhost address. 2020-07-11 Tomas Hozza testenv: Allow definition of environment variables for wget execuion * testenv/README: Added description for new EnvironmentVariable hook * testenv/conf/environment_variable.py: Added implementation of EnvironmentVariable hook * testenv/test/base_test.py: Modified exec_wget() to enable use of EnvironmentVariable hook Added new test hook called EnvironmentVariables, for defining environment variables when wget is executed in tests. This is handy for testing environment variables, which are accepted by wget. 2020-07-11 Tomas Hozza testenv: HTTPTest.begin() should return exit value * testenv/test/http_test.py: Ensure that HTTPTest.begin() always returns a value Previously the HTTPTest.begin() method always returned None. However this is not consistent with the begin() implementation of the parent class (BaseTest). This change ensures that HTTPTest.begin() returns a value. 2020-07-07 Darshit Shah Minor documentation clarification * src/wget.texi: Clarify behaviour of --quota Thanks to TheWild on IRC for pointing it out 2020-06-29 Tim Rühsen * src/ftp.c (ftp_loop_internal): Check for VERIFCERTERR to avoid SIGABRT There is a bug that causes wget to exit with SIGABRT when trying to receive files through FTP from a server with a certificate that failed the verification. The bug is filed in RedHat Bugzilla for Fedora: https://bugzilla.redhat.com/show_bug.cgi?id=1475861 Reported-by: Artem Egorenkov 2020-06-21 Tim Rühsen * src/host.c (lookup_host): Fix uninitialized pointer access in c-ares code Reported-by: Swapnil More 2020-05-22 Tim Rühsen * src/gnutls.c (ssl_init): Small cleanup fixing output of ncerts 2020-05-01 Tim Rühsen * src/convert.c (downloaded_files_free): Only compile if DEBUG_MALLOC or TESTING is defined * src/netrc.c (free_netrc): Only compile if DEBUG_MALLOC or TESTING is defined 2020-05-01 Вячеслав Петрищев Fix SSL/TLS timeout issues. * connect.c (fd_read, fd_peek): Let implementation take care about timeout. * gnutls.c (_do_handshake, _do_reauth, wgnutls_read_timeout): Fix support for interactive timeout. * gnutls.c (wgnutls_peek): Let wgnutls_read_timeout() take care about timeout. * openssl.c (openssl_read_peek): Fix 0 (-1) timeout. * retr.c (fd_read_body): Avoid wrong 'interactive timeout'. Set interactive to true for bar progress. * src/progress.c (bar_set_params): Set interactive to true. * src/retr.c (fd_read_body): Avoid call fd_read with 0 timeout. 2020-05-01 Вячеслав Петрищев New implementation for openssl_read, openssl_peek ssl_connect_with_timeout (openssl) Add new implementation for openssl_read, openssl_peek ssl_connect_with_timeout (openssl). Thats allow continue read after ETIMEDOUT (if timeout < opt.read_timeout) without 'Retrying', and do not create thread (under MSWin) for every read. Old implementation, (with fix for 'timeout') avaible for build with: -DOPENSSL_RUN_WITHTIMEOUT Add timeout for transport_implementation {reader,peeker}; * src/init.c (cmd_time): Add check for negative value. * src/connect.h (transport_implementation {reader,peeker}): Change function prototype. * src/connect.c (select_fd_nb): New function, aviod conversion to blocked under MSWin. * src/gnutls.c (gnutls_read): Fix: using timeout, seting ETIMEDOUT, conversion to blocked. (gnutls_peek): Likewise. (wgnutls_errstr): Add errmsg for ETIMEDOUT. (_do_handshake) (_do_reauth): Fix conversion to blocked. * src/openssl.c: Add new implementation for openssl_read, openssl_peek, ssl_connect_with_timeout. (init_prng): when option --random-file given warn user if RAND_load_file() fail. 2020-04-18 Tim Rühsen Add missing linefeed in output strings * src/ftp.c (getftp): Add linefeed to error message. * src/res.c (res_parse_from_file): Likewise. * src/utils.c (fopen_stat): Likewise. (open_stat): Likewise. 2020-03-28 Tim Rühsen * src/metalink.c: Include filename.h instead of dosname.h * gnulib: Update 2020-03-24 Tomas Hozza Don't print message about loading crl or ca-cert files with --no-verbose * src/gnutls.c (ssl_init): Use LOG_VERBOSE verbosity for informative message related to loading CRL or CA certificate file. Before change [1], wget didn't produce any output related to loading CA certificates when --no-verbose option has been used. When --no-verbose option is used, only error messages and basic information should get printed. Information about loading CRL or CA certificate is probably not a basic information. Any error when loading the CRL or CA certificate will be still printed with --no-verbose. Some users rely on wget not printing such information and they consider it a regression. Reported as https://bugzilla.redhat.com/show_bug.cgi?id=1807267 [1] http://git.savannah.gnu.org/cgit/wget.git/commit/?id=e4a8fe84e2b813b65d91aec29298eecabe4850a5 2020-03-06 Tim Rühsen Add NTLM fuzzer * configure.ac: Create WITH_NTLM conditional. * fuzz/Makefile.am: Add wget_ntlm_fuzzer. * fuzz/wget_ntlm_fuzzer.c: New file. * fuzz/wget_ntlm_fuzzer.in/*: Initial fuzz corpora. * src/http.c (check_auth): Fix memory leak * src/http-ntlm.c (mkhash): Fix read buffer overflow 2020-03-05 Tim Rühsen * README.checkout: Fix gperf typo and tab indentation * .gitlab-ci.yml: Generate online docs and coverage reports for Gitlab 2020-03-05 Tim Rühsen Add code coverage reporting * configure.ac: Add AX_CODE_COVERAGE. * Makefile.am: Add rules 'check-coverage' and 'fuzz-coverage'. * fuzz/Makefile.am: Amend LDADD, AM_CFLAGS and AM_CPPFLAGS. * src/Makefile.am: Amend LDADD, AM_CFLAGS and AM_CPPFLAGS. This add a new configure option, --enable-code-coverage. With this option enabled, generate test code coverage with make code-coverage and generate coverage for the fuzz test corpora with make fuzz-coverage 2020-03-05 Tim Rühsen * src/http.c: Don't cast 3rd param to request_set_header. * README: Fix typo 'can retrieves' -> 'can retrieve' 2020-03-03 Darshit Shah * doc/wget.texi: Add maintainers and mention wget2 doc/wget.texi: Clarify usage of --level 2020-03-03 Tim Rühsen * src/http.c (gethttp): Remove unneeded free of hstat.(message|error) * src/http.c (gethttp): Fix memleaks Reported-by: Вячеслав Петрищев * src/http.c (read_response_body): Fix memleak Reported-by: Вячеслав Петрищев 2020-02-29 Tim Rühsen * contrib/spell-checker: Add lightening to exceptions Always build ssl_cleanup * src/gnutls.c (ssl_cleanup): Don't build conditionally. * src/openssl.c (ssl_cleanup): Likewise 2020-02-28 Tim Rühsen * doc/wget.texi: Fix spelling error Reported-by: Jim Cathey 2020-02-27 Tim Rühsen * src/convert.c (convert_cleanup): Free downloaded_css_set * src/openssl.c (ssl_cleanup): Only compile for DEBUG_MALLOC or TESTING * src/gnutls.c (ssl_cleanup): Only compile for DEBUG_MALLOC or TESTING * src/netrc.c (netrc_cleanup): Only compile for DEBUG_MALLOC or TESTING * src/log.c (log_cleanup): Only compile for DEBUG_MALLOC or TESTING * src/host.c (host_cleanup): Only compile for DEBUG_MALLOC or TESTING * src/spider.c (spider_cleanup): Only compile for DEBUG_MALLOC or TESTING * src/res.c (res_cleanup): Only compile for DEBUG_MALLOC or TESTING * src/convert.c (convert_cleanup): Only compile for DEBUG_MALLOC or TESTING * src/html-url.c (cleanup_html_url): Only compile for DEBUG_MALLOC or TESTING * src/http.c (http_cleanup): Free hashtable basic_authed_hosts * src/cookies.c (cookie_jar_delete): Set psl to NULL after calling psl_free * src/init.c (cleanup): Call ssl_cleanup and connect_cleanup Add ssl_cleanup to free memory * src/ssl.h: Add declaration for ssl_cleanup. * src/openssl.c: Add ssl_cleanup stub. * src/gnutls.c: Implement ssl_cleanup. * src/http.c (http_cleanup): Call invalidate_persistent to free resources * src/cookies.c: Free PSL in cookie_jar_delete * src/connect.c (connect_cleanup): Only needed with DEBUG_MALLOC or TESTING * src/http.c (http_loop): Avoid unneeded strdup 2020-02-27 Вячеслав Петрищев * src/http.c (gethttp): Fix memleak 2020-02-23 Tim Rühsen * src/http.c: Do not reveal OS type to server via User-Agent: header Following the "privacy by design" principle, which is now European law by virtue of the GDPR since 2018, the client should not transmit the operating system. Fixes #57884 Reported-by: Bruno Haible 2020-02-22 Tim Rühsen * tests/unit-tests.c: Fix 'multiple definition of...' with gcc 10 Split unique_name into two functions with just one arg * src/http.c (check_file_output): Call unique_name_passthrough. * src/metalink.c (badhash_suffix): Call unique_name. * src/url.c (url_file_name): Call unique_name_passthrough. * src/utils.c: Split unique_name, amend description. * src/utils.h: Amend unique_name, add unique_name_passthrough. * .gitlab-ci.yml: Set tag 'linux' explicitly for each runner 2020-02-22 Tim Rühsen * src/http.c (print_response_line): Fixed comparison Introduced in 68868bbb3737d26c197f39edafc5d526334426b0 (13.2.2020). Reported-by: Gisle Vanem 2020-02-21 Tim Rühsen Add valgrind suppressions * testenv/valgrind-suppression-ssl: Add gnutls suppressions * tests/valgrind-suppressions: Likewise * tests/valgrind-suppressions-ssl: Likewise Add --gen-suppressions=all to valgrind options * testenv/test/base_test.py: Add --gen-suppressions=all to valgrind options * tests/Test-proxied-https-auth.px: Likewise * tests/WgetTests.pm: Likewise * cfg.mk: Exclude sc_prohibit_gnu_make_extensions * src/retr.c (fd_read_body): Check timer before accessing (Coverity #600494) * src/warc.c (warc_write_start_record): Check return values of fseek, fflush (Coverity #1419650) * src/ftp-ls.c (eat_carets): Fix caret decoding issue (Coverity #1419653) * src/warc.c (warc_write_block_from_file): Check for write error (Coverity #1419654) * src/warc.c (warc_write_end_record): Check return value of fseeko (Coverity #1419657) Add valgrind suppressions for HTTPS tests (GnuTLS false positive) * testenv/valgrind-suppression-ssl: Add new rule * tests/valgrind-suppressions: Likewise * tests/valgrind-suppressions-ssl: Likewise 2020-02-20 Tim Rühsen * src/init.c (setoptval): Add missing comma (fix compilation) * .gitlab-ci.yml: Add spell-checking to VPATH/Debian 2020-02-20 Tim Rühsen Fix typos found by codespell */*: Fix typos Thanks to https://fossies.org/features.html#codespell Reported-by: Jens Schleusener 2020-02-20 Tim Rühsen * contrib/spell-checker: Add new file * src/init.c (setoptval): Silence Coverity (Coverity #1419648) * src/main.c (init_switches): Remove unneeded code (Coverity #1419658) * src/ftp-ls.c (ftp_parse_vms_ls): Fix use of uninitialized stack mem (Coverity #609373) 2020-02-19 Tim Rühsen * src/retr.c (retrieve_url): Fix memory leak (Coverity #1316408) * src/http.c (set_file_timestamp): Fix stack memory leak (Coverity #1419655) * src/main.c (main): Initialize variable (Coverity #1419660) * .gitlab-ci.yml: Fix linebreak * src/http.c (metalink_from_http): Fix memleak (Coverity #1419661, #1419656) * .gitlab-ci.yml: Add 'coverity' runner * contrib/mk_authors: Create list of authors and translators since last version * src/progress.c (eta_to_human_short): Silence -Wformat-overflow 2020-02-14 Tim Rühsen * .gitlab-ci.yml: Don't copy gnulib manually * NEWS: Update Remove alloca includes and macros * bootstrap.conf: Remove gnulib module 'alloca'. * src/sysdep.h: Remove include of alloca.h. * src/wget.h: Likewise, remove macros BOUNDED_TO_ALLOCA and STRDUP_ALLOCA. * src/utils.h: Remove alloca_array * src/convert.c (convert_links_in_hashtable): Remove use of alloca * src/convert.c (local_quote_string): Remove use of alloca * src/convert.c (replace_attr_refresh_hack): Remove use of alloca * gnulib: Update * bootstrap.conf: Use fnmatch-gnu instead of fnmatch * src/convert.c (write_backup_file): Remove use of alloca * src/utils.c (fnmatch_nocase): Remove FNM_CASEFOLD emulation, gnulib takes care of it * src/cookies.c (cookie_header): Remove use of alloca * src/cookies.c (cookie_header): Remove use of alloca * src/cookies.c (cookie_handle_set_cookie): Remove use of alloca * src/cookies.c (parse_set_cookie): Explicitly convert from double to time_t * src/cookies.c (parse_set_cookie): Remove use of alloca * src/warc.c (warc_uuid_str): Add second param to windows_uuid_str * src/cookies.c (parse_set_cookie): Remove use of alloca * src/main.c (main): Code clean, reduce allocations * src/main.c: Modify macro IF_SSL to work with latest struct changes * configure.ac: List 'no' for --with-ssl in help output 2020-02-13 Tim Rühsen * src/ftp.c (ftp_retrieve_dirs): Remove use of alloca * src/ftp.c (ftp_retrieve_list): Remove use of alloca * src/ftp.c (ftp_loop_internal): Remove use of alloca * src/ftp.c (getftp): Remove use of alloca * src/http.c: Include xstrndup.h unconditionally * src/ftp-basic.c (ftp_request): Remove use of alloca * src/html-parse.c (name_allowed): Remove use of alloca * src/netrc.c (search_netrc): Remove use of alloca * src/init.c (parse_line): Remove use of alloca Remove use of alloca in setoptval * src/init.c (setoptval): Remove use of alloca * src/init.h: Add define MAX_LONGOPTION * src/main.c (struct cmdline_option): Make 'long_name' a char array * src/http.c (check_auth): Remove use of alloca * src/http.c (print_response_line): Remove use of alloca * src/http.c (request_set_user_header): Remove use of alloca * src/http.c (metalink_from_http): Remove use of alloca * src/http.c (basic_authentication_encode): Remove use of alloca * src/http.c (gethttp): Remove use of alloca * src/http.c (gethttp): Remove use of alloca * src/http.c (set_file_timestamp): Remove use of alloca * src/main.c (main): Remove use of alloca * src/url.c (append_uri_pathel): Replace alloca by fixed array / sprintf * src/url.c (append_uri_pathel): Replace alloca by fixed array / xmalloc * src/utils.c (make_directory): Replace alloca by fixed array / xmalloc * src/utils.c (unique_name_1): Replace alloca by xmalloc Add buffer size param to warc_uuid_str * src/http.c (gethttp): Add buffer size param to warc_uuid_str. * src/warc.c: Likewise * src/warc.h: Likewise 2020-02-12 Tim Rühsen * src/http-ntlm.c (ntlm_output): Remove alloca * src/http-ntlm.c (mkhash): Replace alloca by fixed length array * src/http-ntlm.c (ntlm_input): Replace alloca by fixed length array * src/gnutls.c (ssl_init): Replace alloca by fixed length arrays * src/retr.c (fd_read_body): Fix signedness warnings * src/retr.c (rotate_backups): Replace alloca by fixed length arrays * src/wget.h: Add inline to _unhex * src/metalink.h: Include dirname.h, remove declaration of last_component * configure.ac: Add -Wno-undef -Wno-float-equal to gcc's WARN_CFLAGS * src/convert.c (convert_links): Fix previous commit * src/convert.c (convert_links): Add \n (Fixes #57795) 2020-01-25 Ander Juaristi Do not overwrite restval if len is smaller * src/http.c (http_loop): overwrite 'restval' only if 'len' is greater than it. Else substract 'len' to 'restval'. When retrying a request, only send a 'Range' header if the previous request returned data. 2020-01-24 Вячеслав Петрищев Fix and cleanup progress bar code *src/progress.c (struct dot_progress) accumulated, rows: Type changed to wgint (print_row_stats): Fix missing unit name 'T' (dot_update): Add ability to reduce dot_draw runtime (bar_update): Avoid integer overflow * src/progress.c: Fix MINIMUM_SCREEN_WIDTH * src/progress.c (prepare_filename): Replace zero-width mbc (\xe2\x80\x8b, ...) Reenabling filename scrolling code in progress bar * src/progress.c: Reenabling filename scrolling code (bar_create): Reenabling file name scrolling. (bar_create): Memory allocation, filename preparation. (bar_finish): Filename memory deallocation. (prepare_filename): New function, replaces unprintable chars. Fix multibyte handling in progress bar * src/progress.c (count_cols): Fix return value if invalid multibyte sequence was encountered (bar_draw): Fix xrealoc size 2020-01-10 Tim Rühsen * src/progress.c (print_row_stats): Fix two integer overflows * .gitlab-ci.yml: Fix path for llvm-symbolizer * fuzz/wget_progress_fuzzer.in/*: Add fuzzer corpora * src/retr.c (calc_rate): Remove unwanted assertions Fix buffer overflows in progress 'bar' code * src/progress.c (progress_interactive_p): Sanitize input. (progress_update): Likewise. (bar_create): Use larger BUF_LEN. (bar_create): Remove superfluous memset. (bar_create): Fix filename layout. (bar_create): Remove filename scrolling code, it caused many buffer overflows later in bar_create. (bar_create): Support TB/s download speed. Add new fuzzer for the progress bar code * Makefile.am: Add wget_progress_fuzzer. * wget_progress_fuzzer.c: New file. 2020-01-10 Darshit Shah Update copyright year to 2020 * cfg.mk: Set update-copyright to use intervals 2019-12-27 Tim Rühsen Fix segfault in progress bar in certain locales * src/progress.c (create_image): Protect memset from negative count Reported-by: JunDong Xie 2019-12-27 Tim Rühsen (tiny change) * src/progress.c (count_cols): Fix return value if USE_NLS_PROGRESS_BAR is undefined Taking patch from #54126 Reported-by: Vyacheslav 2019-12-27 Tim Rühsen * src/progress.c: Allow const names for set_progress_implementation. * src/progress.c (dot_draw): Avoid integer overflow * fuzz/README.md: Add --enable-assert to fuzzing build * src/progress.c (print_row_stats): Fix UB if eta < 0 * src/progress.c (dot_update, dot_finish): Sanitize input * src/retr.c (calc_rate): Fix division by 0 * src/retr.c (calc_rate): Add TB/s * src/progress.c (create_image): Sanitize input param 'dl_total_time' * fuzz/README.md: Update clang instructions 2019-12-26 Lauri Nurmi (tiny change) Mark note to translators properly and elaborate its contents. * src/progress.c (create_image): Likewise The note was not even picked by xgettext to be included in the .pot, so barely any translators even saw the note. Also, this is one of the most cryptic messages to translate in wget, unless the translator happens to guess where the string is used, or looks at the source. 2019-12-26 Lauri Nurmi (tiny change) * src/main.c (print_help): Don't cut translatable string mid-sentence. Complicates translating. 2019-11-24 Tim Rühsen * fuzz/wget_read_hunk_fuzzer.c: Use cast to fix C++ error 2019-11-21 Tim Rühsen * src/Makefile.am: Add build_info.c to EXTRA_DIST * .gitlab-ci.yml: Skip tests on tarball build due to missing perl * tests/WgetTests.pm: Support to set the wget executable for testing * .gitlab-ci.yml: Add --with-ssl=no * .gitlab-ci.yml: Allow failure for Valgrind and Scan-Build CI runner * src/ftp.c (delelement): Make sure free'd pointer is set to NULL * configure.ac: Remove -fno-sanitize-recover=integer With this option on, several overflows from gnulib code (all harmless and on purpose) would break our tests. 2019-11-21 Tim Rühsen Replace XDIGIT_TO_NUM by _unhex() with proper unsigned handling * src/wget.h: Replace XDIGIT_TO_NUM by _unhex() * src/html-parse.c (decode_entity): Use _unhex() * src/host.c (is_valid_ipv6_address): Use _unhex() * src/url.c (url_unescape_1): Use unsigned char to avoid UB * src/res.c (free_specs): Likewise Code taken from Wget2. XDIGIT_TO_NUM had a signed issue, detected by scan-build static anylyzer. 2019-11-21 Tim Rühsen * src/connect.c (select_fd): Check for negative fd * .gitlab-ci.yml: Enable Windows executables for MinGW runner * .gitlab-ci.yml: Add CI runners from Wget2 2019-11-21 Tomas Hozza testenv: enable running tests on different wget binary * testenv/README: Document $WGET_PATH * testenv/test/base_test.py: Use $WGET_PATH instead ../src/wget if set Previously tests in testenv/ directory were run only on wget binary which was built from sources in src/ directory. However as a wget maintainer in a Linux distribution, I would like to be able to run upstream tests on the wget binary distributed with the distribution. This change enables one to define WGET_PATH environment variable to a path to wget binary which should be used by tests. 2019-11-21 Tim Rühsen * AUTHORS: Add Darshit Shah and Tim Rühsen as maintainers 2019-11-19 Tim Rühsen * src/cookie.c: Fix cookie file header to be recognized by 'file' * .mailmap: Allow mapping contributors for statistics 2019-10-03 Darshit Shah * configure.ac: Ignore -Wchkp which is deprecated Update gnulib * gnulib: Pull submodule * bootstrap: Update file from gnulib 2019-05-30 Tim Rühsen Revert "Add . to perl path for all perl tests" This reverts commit 79be99aff4717a75ec7531d8b3a4ec96341754ca. /usr/bin/env 8.28 and below doesn't know -S. 2019-05-30 Darshit Shah Revert " Hi, Thank you again Darshit for your response. The RejectHeaderField rule rejects ANY header" I accidentally commited this patch. This reverts commit 4ce2f93600827f47f7fea5e7eeb224232cef3c3d. 2019-05-30 Darshit Shah * .gitignore: Update gitignore file 2019-05-30 sulfastor Hi, Thank you again Darshit for your response. The RejectHeaderField rule rejects ANY header of the header field while RejectHeader rejects ONLY the specified full header. Since we wanted to be sure a header field is not sent to the server we wrote this rule. * doc/wget.texi: Added --disable-header documentation. * fuzz/wget_options_fuzzer.dict: Update with --disable-header inputs. * src/http.c (disabled_header): Checks for disabled headers (request_set_header): Doesn't let header to be set if disabled (gethttp): frees disabled header to let overriding * src/init.c (cmd_dis_header), (check_user_disabled_header) added new option disabled_headers. * src/main.c: added new option --disable-header, added help description * src/options.h: added new option --disable-header * src/utils.h (vec_remove_header) * src/utils.c (vec_remove_header) removes all header instances from vector * testenv/Makefile.am: Added new test files * testenv/server/http/http_server.py: Added new rule RejectHeaderField * testenv/conf/reject_header_field.py: Added new rule RejectHeaderField * testenv/README: Added help description for new rule * testenv/Test-disable-default-headers.py: Test without using --header * testenv/Test-disable-headers-after.py: Test using --header before --disable-header * testenv/Test-disable-headers-before.py: Test using --header after --disable-header 2019-05-28 Tim Rühsen * src/log.c (log_vprintf_internal): Don't log twice 2019-05-25 AviSoomirtee (tiny change) * src/openssl.c (ssl_init): post handshake auth for OpenSSL 2019-05-24 Tim Rühsen * bootstrap.conf: Explicitly set source_base and test_base 2019-05-07 Tim Rühsen * src/convert.c (convert_basename): Do not pass NULL to strrchr() * src/url.c (append_uri_pathel): Check for possible NULL dereference * fuzz/main.c: Fix paths for WIN32 * bootstrap.conf: Work around VPATH issue * configure.ac: Update gettext version to 0.19.3 * bootstrap.conf: Remove rsync as prerequisite './bootstrap --skip-po' won't need wget nor rsync * bootstrap.conf: Replace prerequisite xz by gzip 2019-05-03 Tim Rühsen Add sanitizer flags for ./configure * bootstrap.conf: Add 'warning' gnulib module * configure.ac: Add --enable-fsanitize-* flags * tests/Makefile.am: Add compiler flags for unit testing 2019-05-02 Tim Rühsen Add --enable-manywarnings from wget2 * .gitignore: Exclude /lib/Makefile.am * bootstrap.conf: Use --makefile-name=gnulib.mk with gnulib-tool * configure.ac: Use wget_MANYWARNINGS() * lib/Makefile.am: Define empty noinst_LIBRARIES and MAINTAINERCLEANFILES * m4/wget_manywarnings.m4: New file from GNU Wget2 * src/Makefile.am: Use WARN_CFLAGS for AM_CFLAGS 2019-04-28 Tim Rühsen Add CONTRIBUTING.md * CONTRIBUTING.md: New file for contributors * README: Mention CONTRIBUTING.md 2019-04-26 Tim Rühsen * fuzz/*.in/*: Update corpora from OSS-fuzz * fuzz/get_ossfuzz_corpora: Remove -f from unzip, mkdir *.in/ * .gitlab-ci.yml: Update Gitlab CI WORKDIR to /usr/local 2019-04-26 Eneas U de Queiroz (tiny change) * src/openssl.c: fix ssl_init for openssl 1.1.1 ssl_init fails with openssl 1.1.1 when openssl.cnf is not found. Redundant calls to intialization functions were removed as OPENSSL_config takes care of them for openssl versions < 1.1.0. For versions > 1.1.0, OPENSSL_init_ssl is preferred. 2019-04-11 Leif Ryge (tiny change) * doc/wget.texi: Fix typo in man page 2019-04-11 raminfp [log] miss check If logging is inhibited 2019-04-05 Tim Rühsen * src/iri.c (do_conversion): Remove unneeded debug lines 2019-04-05 Darshit Shah * NEWS: Update NEWS for new release 2019-04-05 Tim Ruehsen Fix a buffer overflow vulnerability * src/iri.c(do_conversion): Reallocate the output buffer to a larger size if it is already full 2019-04-05 Darshit Shah * NEWS: Update NEWS for new release 2019-04-03 Tim Rühsen * src/openssl.c (ssl_init): Check for X509_V_FLAG_PARTIAL_CHAIN 2019-04-01 Darshit Shah * gnulib: Pull forward 2019-03-26 Tim Rühsen * fuzz/main.c (test_all_from): Fix indentation 2019-03-25 Tim Rühsen Fix corner case in processing server response * src/http.c (response_head_terminator): Don't access uninitialized data * fuzz/wget_read_hunk_fuzzer.c: Sync response_head_terminator() Add new fuzzer wget_read_hunk_fuzzer.c * fuzz/Makefile.am: Add wget_read_hunk_fuzzer * fuzz/wget_read_hunk_fuzzer.c: New file * fuzz/wget_read_hunk_fuzzer.in/*: Fuzz corpora * src/connect.c: Add connect_cleanup() * src/connect.h: Add prototype for connect_cleanup() 2019-03-03 Tim Rühsen * fuzz/wget_netrc_fuzzer.c: Fix fuzzer * fuzz/wget_ftpls_fuzzer.c: Fix fuzzer 2019-02-22 Jeffrey Walton * src/openssl.c (ssl_init): Trust partial cert chain 2019-02-20 Tim Rühsen * src/ftp-ls.c (ftp_parse_vms_ls): Use snprintf instead of strcpy/strcat 2019-02-19 Darshit Shah * src/html-url.c(get_urls_html_fm): Add message in verbose mode with no-follow attribute 2019-02-19 Tim Rühsen Fix fuzz/ tests for OpenBSD * fuzz/wget_*_fuzzer.c: Take care when calling exit() * fuzz/Makefile.am: Remove hard-coded gcc flags Fix STDERR closing/restoring in fuzzers * fuzz/fuzzer.h: Add CLOSE_STDERR and RESTORE_STDERR * fuzz/wget_*_fuzzer.c: Use CLOSE_STDERR and RESTORE_STDERR * configure.ac: Rearrange AM_ICONV before gl_INIT 2019-02-19 Tim Rühsen Do not hard-code -ldl in fuzz/Makefile.am * configure.ac: Search for dlopen and add library to $FUZZ_LIBS * fuzz/Makefile.am: Link with $FUZZ_LIBS instead of -ldl This fixes linking on BSD systems. Reported-by: Nam Nguyen 2019-02-14 Tim Rühsen * doc/wget.texi: Correct --logfile -> --output-file 2019-02-10 Darshit Shah Update gnulib Update copyright statements 2019-01-23 Leon Klingele (tiny change) docs: --no-cache also sets the 'Cache-Control: no-cache' header * doc/wget.texi: Add Cache-Control to docs * src/wget.h: Add Cache-Control to comment of SEND_NOCACHE 2019-01-20 Tim Rühsen * .gitlab-ci.yml: Add minimal build * src/init.c (cleanup): Check HAVE_HSTS Reported-by: Simon Dales 2018-12-31 André Wolski NTLM restart authentication (trivial change) * src/http-ntlm.c (ntlm_input): Continue on NTLMSTATE_LAST, error on NTLMSTATE_TYPE3 The code comes from the cURL project. Thanks to Daniel Stenberg for donating the code. https://lists.gnu.org/archive/html/bug-wget/2018-12/msg00030.html 2018-12-28 Tim Rühsen * contrib/spell-checker: Remove trailing whitespace Fix typos detected by codespell (via contrib/spell-checker) * contrib/spell-checker: Add script for spell checking 2018-12-27 Tim Rühsen * fuzz/Makefile.am: Fix order of libraries for linking * src/wget.h: #undef _Noreturn when building with C++ * src/gnutls.c (ssl_connect_wget): Fix call to gnutls_set_default_priority() 2018-12-26 Tim Rühsen * NEWS: Add release changes for 1.20.1 2018-12-26 Tim Rühsen Don't save user/pw with --xattr Also the Referer info is reduced to scheme+host+port. * src/ftp.c (getftp): Change params of set_file_metadata() * src/http.c (gethttp): Change params of set_file_metadata() * src/xattr.c (set_file_metadata): Remove user/password from origin URL, reduce Referer value to scheme/host/port. * src/xattr.h: Change prototype of set_file_metadata() 2018-12-26 Tim Rühsen Don't use extended attributes (--xattr) by default * src/init.c (defaults): Set enable_xattr to false by default * src/main.c (print_help): Reverse option logic of --xattr * doc/wget.texi: Add description for --xattr Users may not be aware that the origin URL and Referer are saved including credentials, and possibly access tokens within the urls. 2018-12-13 Tim Rühsen * .travis.yml: Email to wget-dev instead bug-wget mailing list 2018-11-30 Darshit Shah * NEWS: Prepare for new version 2018-11-13 Darshit Shah * contrib/make-release: Add a small checklist for pending tasks Prepare NEWS for new release * configure.ac: gnulib now expects autoconf >=2.63 * gnulib: Update library 2018-11-13 Jay Satiro * src/init.c: Stop freeing the pointer returned by ws_mypath() .. since ws_mypath() saves the address it returns in a static pointer for reuse, to also be returned in later calls. 2018-11-13 Darshit Shah * src/ftp.c(ftp_retrieve_glob): Honor {accept,reject}-regex switches as well * src/ftp.c (ftp_retrieve_glob): Refactor to prevent looping over listing multiple times 2018-11-11 Tim Rühsen * .gitlab-ci.yml: Split into GnuTLS and OpenSSL build * Makefile.am: dist clean po/stamp-po Remove auto-generated files from po/ Add VPATH build 2018-11-09 Tim Rühsen Revert "Bail out on unexpected 416 server errors" This reverts commit 6f3b9959935ad7640bcf48a0a93848ed25ff8963. The code is obviously wrong, see https://savannah.gnu.org/bugs/?54963 Also, the example from the original post doesn't work any more. With other words, the broken server behavior has been fixed meanwhile. 2018-11-09 Rosen Penev (tiny change) openssl: Do not use engines when OpenSSL does not support * src/openssl.c: Check for OPENSSL_NO_ENGINE before including openssl/engine.h and before calling ENGINE_load_builtin_engines() Fixes compilation with no engines compiled. 2018-11-09 Tim Rühsen Fix HTTPS Perl tests * tests/SSLTest.pm: Rename server cert and key file * tests/Test-https*.px: Fix and remove OpenSSL hard-coding * tests/certs/create-certs.sh: Script to generate test files * tests/certs/*-template.txt: GnuTLS template files for certs and crl * tests/certs/*.pem: Keys, certs, crls * tests/certs/README: Removed commands, link to create-certs.sh 2018-10-28 Kapus, Timotej (tiny change) Replace some loops with string.h functions * src/init.c: Replace loop with strspn * src/url.c: Replace loop with strrchr 2018-10-26 Luiz Angelo Daros de Luca (tiny change) * .gitmodules: Use https:// instead of git:// for gnulib git:// does not work over http proxy * src/host.c (sufmatch): Fix dot-prefixed domain matching Current sufmatch does not match when domain is dot-prefixed. The example of no_proxy in man (.mit.edu) does use a dot-prefixed domain. 2018-10-26 Tim Rühsen * src/convert.c (convert_links): Fix fallthrough 2018-10-22 Darshit Shah * bootstrap: Update script from gnulib * gnulib: Update library 2018-10-19 Tim Rühsen * .lgtm.yml: New file to add LGTM to Gitlab.com CI 2018-10-16 Tim Rühsen * configure.ac: Fix build issue with libgpgme 2018-10-14 Tim Rühsen * fuzz/*_fuzzer.in/*: Update fuzzer corpora 2018-10-08 Nikos Mavrogiannopoulos Enable post-handshake auth under gnutls on TLS1.3 2018-09-20 Tim Rühsen * src/http.c (resp_new): Fix code to avoid false positive by clang * src/convert.c (convert_links): Fix code to avoid false positive by clang 2018-09-19 Tim Rühsen Add support for PCRE2 pattern matching * configure.ac: Check for libpcre2-8 * src/init.c (choices): Test for HAVE_LIBPCRE2 * src/main.c (main): Set regex compile and match functions * src/options.h: Test for HAVE_LIBPCRE2 * src/utils.c: Include pcre2.h, add functions compile_pcre2_regex() and match_pcre2_regex() * src/utils.h: Declare compile_pcre2_regex() and match_pcre2_regex() Fixes #54677 Reported-by: Noël Köthe 2018-09-07 Tim Rühsen Add . to perl path for all perl tests * tests/*.px: Add -I . to the shebang This allows perl test to be run from tests/ directory, e.g. via ./Test--post-file.px 2018-09-07 Tomas Hozza Add TLS 1.3 support for GnuTLS * doc/wget.texi: Add "TLSv1_3" to --secure-protocol * src/gnutls.c (set_prio_default): Use GNUTLS_TLS1_3 where needed Wget currently allows specifying "TLSv1_3" as the parameter for --secure-protocol option. However it is only implemented for OpenSSL and in case wget is compiled with GnuTLS, it causes wget to abort with: GnuTLS: unimplemented 'secure-protocol' option value 6 GnuTLS contains TLS 1.3 implementation since version 3.6.3 [1]. However currently it must be enabled explicitly in the application of it to be used. This will change after the draft is finalized. [2] However for the time being, I enabled it explicitly in case "TLSv1_3" is used with --secure-protocol. I also fixed man page to contain "TLSv1_3" in all listings of available parameters for --secure-protocol [1] https://lists.gnupg.org/pipermail/gnutls-devel/2018-July/008584.html [2] https://nikmav.blogspot.com/2018/05/gnutls-and-tls-13.html 2018-08-29 Tomas Korbar Avoid creating empty wget-log when using -O and -q in background * src/log.c (check_redirect_output): Check for quiet mode 2018-08-27 Tomas Hozza * src/warc.c (warc_write_cdx_record): Fix RESOURCE LEAK found by Coverity Error: RESOURCE_LEAK (CWE-772): - REAL ERROR wget-1.19.5/src/warc.c:1376: alloc_fn: Storage is returned from allocation function "url_escape". wget-1.19.5/src/url.c:284:3: alloc_fn: Storage is returned from allocation function "url_escape_1". wget-1.19.5/src/url.c:255:3: alloc_fn: Storage is returned from allocation function "xmalloc". wget-1.19.5/lib/xmalloc.c:41:11: alloc_fn: Storage is returned from allocation function "malloc". wget-1.19.5/lib/xmalloc.c:41:11: var_assign: Assigning: "p" = "malloc(n)". wget-1.19.5/lib/xmalloc.c:44:3: return_alloc: Returning allocated memory "p". wget-1.19.5/src/url.c:255:3: var_assign: Assigning: "newstr" = "xmalloc(newlen + 1)". wget-1.19.5/src/url.c:258:3: var_assign: Assigning: "p2" = "newstr". wget-1.19.5/src/url.c:275:3: return_alloc: Returning allocated memory "newstr". wget-1.19.5/src/url.c:284:3: return_alloc_fn: Directly returning storage allocated by "url_escape_1". wget-1.19.5/src/warc.c:1376: var_assign: Assigning: "redirect_location" = storage returned from "url_escape(redirect_location)". wget-1.19.5/src/warc.c:1381: noescape: Resource "redirect_location" is not freed or pointed-to in "fprintf". wget-1.19.5/src/warc.c:1387: leaked_storage: Returning without freeing "redirect_location" leaks the storage that it points to. \# 1385| fflush (warc_current_cdx_file); \# 1386| \# 1387|-> return true; \# 1388| } \# 1389| url_escape() really returns a newly allocated memory and it leaks when the warc_write_cdx_record() returns. The memory returned from url_escape() is usually stored in a temporary variable in other parts of the project and then freed. I took the same approach. 2018-08-27 Tomas Hozza * src/warc.c (warc_write_start_record): Fix potential RESOURCE LEAK In warc_write_start_record() function, the reutrn value of dup() is directly used in gzdopen() call and not stored anywhere. However the zlib documentation says that "The duplicated descriptor should be saved to avoid a leak, since gzdopen does not close fd if it fails." [1]. This change stores the FD in a variable and closes it in case gzopen() fails. [1] https://www.zlib.net/manual.html Error: RESOURCE_LEAK (CWE-772): wget-1.19.5/src/warc.c:217: open_fn: Returning handle opened by "dup". wget-1.19.5/src/warc.c:217: leaked_handle: Failing to save or close handle opened by "dup(fileno(warc_current_file))" leaks it. \# 215| \# 216| /* Start a new GZIP stream. */ \# 217|-> warc_current_gzfile = gzdopen (dup (fileno (warc_current_file)), "wb9"); \# 218| warc_current_gzfile_uncompressed_size = 0; \# 219| 2018-08-27 Tomas Hozza * src/utils.c (open_stat): Fix RESOURCE LEAK found by Coverity Error: RESOURCE_LEAK (CWE-772): wget-1.19.5/src/utils.c:914: open_fn: Returning handle opened by "open". [Note: The source code implementation of the function has been overridden by a user model.] wget-1.19.5/src/utils.c:914: var_assign: Assigning: "fd" = handle returned from "open(fname, flags, mode)". wget-1.19.5/src/utils.c:921: noescape: Resource "fd" is not freed or pointed-to in "fstat". [Note: The source code implementation of the function has been overridden by a builtin model.] wget-1.19.5/src/utils.c:924: leaked_handle: Handle variable "fd" going out of scope leaks the handle. \# 922| { \# 923| logprintf (LOG_NOTQUIET, _("Failed to stat file %s, error: %s\n"), fname, strerror(errno)); \# 924|-> return -1; \# 925| } \# 926| #if !(defined(WINDOWS) || defined(__VMS)) This seems to be a real issue, since the opened file descriptor in "fd" would leak. There is also additional check below the "fstat" call, which closes the opened "fd". 2018-08-27 Tomas Hozza * src/http.c (http_loop): Fix RESOURCE LEAK found by Coverity Error: RESOURCE_LEAK (CWE-772): wget-1.19.5/src/http.c:4486: alloc_fn: Storage is returned from allocation function "url_string". wget-1.19.5/src/url.c:2248:3: alloc_fn: Storage is returned from allocation function "xmalloc". wget-1.19.5/lib/xmalloc.c:41:11: alloc_fn: Storage is returned from allocation function "malloc". wget-1.19.5/lib/xmalloc.c:41:11: var_assign: Assigning: "p" = "malloc(n)". wget-1.19.5/lib/xmalloc.c:44:3: return_alloc: Returning allocated memory "p". wget-1.19.5/src/url.c:2248:3: var_assign: Assigning: "result" = "xmalloc(size)". wget-1.19.5/src/url.c:2248:3: var_assign: Assigning: "p" = "result". wget-1.19.5/src/url.c:2250:3: noescape: Resource "p" is not freed or pointed-to in function "memcpy". [Note: The source code implementation of the function has been overridden by a builtin model.] wget-1.19.5/src/url.c:2253:7: noescape: Resource "p" is not freed or pointed-to in function "memcpy". [Note: The source code implementation of the function has been overridden by a builtin model.] wget-1.19.5/src/url.c:2257:11: noescape: Resource "p" is not freed or pointed-to in function "memcpy". [Note: The source code implementation of the function has been overridden by a builtin model.] wget-1.19.5/src/url.c:2264:3: noescape: Resource "p" is not freed or pointed-to in function "memcpy". [Note: The source code implementation of the function has been overridden by a builtin model.] wget-1.19.5/src/url.c:2270:7: identity_transfer: Passing "p" as argument 1 to function "number_to_string", which returns an offset off that argument. wget-1.19.5/src/utils.c:1776:11: var_assign_parm: Assigning: "p" = "buffer". wget-1.19.5/src/utils.c:1847:3: return_var: Returning "p", which is a copy of a parameter. wget-1.19.5/src/url.c:2270:7: noescape: Resource "p" is not freed or pointed-to in function "number_to_string". wget-1.19.5/src/utils.c:1774:25: noescape: "number_to_string(char *, wgint)" does not free or save its parameter "buffer". wget-1.19.5/src/url.c:2270:7: var_assign: Assigning: "p" = "number_to_string(p, url->port)". wget-1.19.5/src/url.c:2273:3: noescape: Resource "p" is not freed or pointed-to in function "full_path_write". wget-1.19.5/src/url.c:1078:47: noescape: "full_path_write(struct url const *, char *)" does not free or save its parameter "where". wget-1.19.5/src/url.c:2287:3: return_alloc: Returning allocated memory "result". wget-1.19.5/src/http.c:4486: var_assign: Assigning: "hurl" = storage returned from "url_string(u, URL_AUTH_HIDE_PASSWD)". wget-1.19.5/src/http.c:4487: noescape: Resource "hurl" is not freed or pointed-to in "logprintf". wget-1.19.5/src/http.c:4513: leaked_storage: Variable "hurl" going out of scope leaks the storage it points to. \# 4511| { \# 4512| printwhat (count, opt.ntry); \# 4513|-> continue; \# 4514| } \# 4515| else There are two conditional branches, which call continue, without freeing memory potentially allocated and pointed to by"hurl" pointer. In fase "!opt.verbose" is True and some of the appropriate conditions in the following if/else if construction, in which "continue" is called, are also true, then the memory allocated to "hurl" will leak. 2018-08-27 Tomas Hozza * src/http.c (check_auth): Fix RESOURCE LEAK found by Coverity Error: RESOURCE_LEAK (CWE-772): wget-1.19.5/src/http.c:2434: alloc_fn: Storage is returned from allocation function "xmalloc". wget-1.19.5/lib/xmalloc.c:41:11: alloc_fn: Storage is returned from allocation function "malloc". wget-1.19.5/lib/xmalloc.c:41:11: var_assign: Assigning: "p" = "malloc(n)". wget-1.19.5/lib/xmalloc.c:44:3: return_alloc: Returning allocated memory "p". wget-1.19.5/src/http.c:2434: var_assign: Assigning: "auth_stat" = storage returned from "xmalloc(4UL)". wget-1.19.5/src/http.c:2446: noescape: Resource "auth_stat" is not freed or pointed-to in "create_authorization_line". wget-1.19.5/src/http.c:5203:70: noescape: "create_authorization_line(char const *, char const *, char const *, char const *, char const *, _Bool *, uerr_t *)" does not free or save its parameter "auth_err". wget-1.19.5/src/http.c:2476: leaked_storage: Variable "auth_stat" going out of scope leaks the storage it points to. \# 2474| /* Creating the Authorization header went wrong */ \# 2475| } \# 2476|-> } \# 2477| else \# 2478| { Error: RESOURCE_LEAK (CWE-772): wget-1.19.5/src/http.c:2431: alloc_fn: Storage is returned from allocation function "url_full_path". wget-1.19.5/src/url.c:1105:19: alloc_fn: Storage is returned from allocation function "xmalloc". wget-1.19.5/lib/xmalloc.c:41:11: alloc_fn: Storage is returned from allocation function "malloc". wget-1.19.5/lib/xmalloc.c:41:11: var_assign: Assigning: "p" = "malloc(n)". wget-1.19.5/lib/xmalloc.c:44:3: return_alloc: Returning allocated memory "p". wget-1.19.5/src/url.c:1105:19: var_assign: Assigning: "full_path" = "xmalloc(length + 1)". wget-1.19.5/src/url.c:1107:3: noescape: Resource "full_path" is not freed or pointed-to in function "full_path_write". wget-1.19.5/src/url.c:1078:47: noescape: "full_path_write(struct url const *, char *)" does not free or save its parameter "where". wget-1.19.5/src/url.c:1110:3: return_alloc: Returning allocated memory "full_path". wget-1.19.5/src/http.c:2431: var_assign: Assigning: "pth" = storage returned from "url_full_path(u)". wget-1.19.5/src/http.c:2446: noescape: Resource "pth" is not freed or pointed-to in "create_authorization_line". wget-1.19.5/src/http.c:5203:40: noescape: "create_authorization_line(char const *, char const *, char const *, char const *, char const *, _Bool *, uerr_t *)" does not free or save its parameter "path". wget-1.19.5/src/http.c:2476: leaked_storage: Variable "pth" going out of scope leaks the storage it points to. \# 2474| /* Creating the Authorization header went wrong */ \# 2475| } \# 2476|-> } \# 2477| else \# 2478| { Both "pth" and "auth_stat" are allocated in "check_auth()" function. These are used for creating the HTTP Authorization Request header via "create_authorization_line()" function. In case the creation went OK (auth_err == RETROK), then the memory previously allocated to "pth" and "auth_stat" is freed. However if the creation failed, then the memory is never freed and it leaks. 2018-08-27 Tomas Hozza * src/ftp.c (getftp): Fix RESOURCE LEAK found by Coverity Error: RESOURCE_LEAK (CWE-772): wget-1.19.5/src/ftp.c:1493: alloc_fn: Storage is returned from allocation function "fopen". wget-1.19.5/src/ftp.c:1493: var_assign: Assigning: "fp" = storage returned from "fopen(con->target, "wb")". wget-1.19.5/src/ftp.c:1811: leaked_storage: Variable "fp" going out of scope leaks the storage it points to. \# 1809| if (fp && !output_stream) \# 1810| fclose (fp); \# 1811|-> return err; \# 1812| } \# 1813| It can happen, that "if (!output_stream || con->cmd & DO_LIST)" on line #1398 can be true, even though "output_stream != NULL". In this case a new file is opened to "fp". Later it may happen in the FTPS branch, that some error will occure and code will jump to label "exit_error". In "exit_error", the "fp" is closed only if "output_stream == NULL". However this may not be true as described earlier and "fp" leaks. On line #1588, there is the following conditional free of "fp": /* Close the local file. */ if (!output_stream || con->cmd & DO_LIST) fclose (fp); Therefore the conditional at the end of the function after "exit_error" label should be modified to: if (fp && (!output_stream || con->cmd & DO_LIST)) fclose (fp); This will ensure that "fp" does not leak in any case it sould be opened. 2018-08-11 Tomas Hozza Don't limit the test suite HTTPS server to TLSv1 In Fedora, we are implementing crypto policies, in order to enhance the security of user systems. This is done on the system level by global configuration. It may happen that due to the active policy, only TLSv1.2 or higher will be available in crypto libraries. While wget as a client will by default determine the minimal TLS version supported by both client and server, the HTTPS server implementation in testenv/ hardcodes use of TLSv1. As a result all HTTPS related tests fail in case a more hardened crypto policy is set on the Fedora system. This change removes the explicit TLS version setting and leaves the determination of the minimal supported TLS version on the server and client. More information about Fedora change can be found here: https://fedoraproject.org/wiki/Changes/StrongCryptoSettings 2018-06-13 Tim Rühsen * src/gnutls.c (ssl_check_certificate): Fix grammar of error msg Reported-by: Nicholas Sielicki * fuzz/Makefile.am: Remove libtool LTLIB... from LDADD * src/http.c (http_loop): Fix --retry-on-host-error 2018-06-13 ethus3h (tiny change) Add new option --retry-on-host-error * doc/wget.texi: Add docs for --retry-on-host-error * src/http.c (http_loop): Add code for HOSTERR * src/init.c: Add option --retry-on-host-error * src/main.c: Likewise * src/options.h: Add options.retry_on_host_error 2018-05-29 Tim Rühsen Save original data to WARC file * src/retr.c (write_data): Cleanup, (fd_read_body): Write to WARC before uncompressing Fixes: #53968 2018-05-10 Tim Rühsen * fuzz/get_ossfuzz_corpora: Speed up corpora download 2018-05-09 Tim Rühsen * src/main.c (print_version): Silence UBSAN message * src/utils.ci (file_exists_p): Fix stat(NULL,...) * src/hsts.c (open_hsts_test_store): Fix unlink(NULL) * src/hash.c: Silence UBSAN for hash functions * fuzz/*_fuzzer.in: Update corpora from OSS-Fuzz * fuzz/get_ossfuzz_corpora: Fix path 2018-05-08 Tim Rühsen * src/hsts.h: Fix header guard * src/version.h: Add header guard * src/host.c (wait_ares): Remove void assignment Reported-by: Josef Moellers 2018-05-06 Tim Rühsen Update NEWS file for new release 2018-05-06 Tim Rühsen Fix cookie injection (CVE-2018-0494) * src/http.c (resp_new): Replace \r\n by space in continuation lines Fixes #53763 "Malicious website can write arbitrary cookie entries to cookie jar" HTTP header parsing left the \r\n from continuation line intact. The Set-Cookie code didn't check and could be tricked to write \r\n into the cookie jar, allowing a server to generate cookies at will. 2018-05-06 Tim Rühsen * tests/Test-https-weboftrust.px: Skip test, needs cert regen Fix make syntax-check * cfg.mk: Add fuzzer reproducers to exception list * po/POTFILES.in: Add src/spider.c Fix HTTPS tests * tests/Test-https-badcerts.px: Fix test return value * tests/Test-https-crl.px: Likewise * README: How to create certs with GnuTLS's certtool * tests/certs/revokedcrl.pem: Recreated revocation * tests/certs/server.crt: Recreated server cert with no expiry * tests/certs/test-ca-cert.pem: Recreated CA cert with no expiry * src/init.c: Bring new --ciphers into right order in options array 2018-05-05 Ander Juaristi * doc/wget.texi: Add description for --ciphers 2018-05-05 Ander Juarist OpenSSL: Better seeding of PRNG * src/openssl.c (init_prng): keep gathering entropy even though we already have enough (ssl_connect_with_timeout_callback): reseed PRNG again just before the handshake Reported-by: Jeffrey Walton 2018-05-05 Ander Juaristi Enhance SSL/TLS security This commit hardens SSL/TLS a bit more in the following ways: * Explicitly exclude NULL authentication and the 'MEDIUM' cipher list category. Ciphers in the 'HIGH' level are only considered - this includes all symmetric ciphers with key lengths larger than 128 bits, and some ('modern') 128-bit ciphers, such as AES in GCM mode. * Allow RSA key exchange by default, but exclude it when Perfect Forward Secrecy is desired (with --secure-protocol=PFS). * Introduce new option --ciphers to set the cipher list that the SSL/TLS engine will favor. This string is fed directly to the underlying TLS library (GnuTLS or OpenSSL) without further processing, and hence its format and syntax are directly dependent on the specific library. Reported-by: Jeffrey Walton 2018-04-28 Tim Rühsen * src/netrc.c (parse_netrc_fp): Fix two memleaks Add new fuzzer for the .netrc parser * fuzz/wget_netrc_fuzzer.c: New fuzzer * fuzz/wget_netrc_fuzzer.dict: Fuzzer dictionary * fuzz/wget_netrc_fuzzer.in: Initial corpora * src/ftp.c (getftp): Amend call to search_netrc() * src/http.c (initialize_request): Likewise * src/netrc.c: Cleanup, prepare code for fuzzing * src/netrc.h: Cleanup 2018-04-27 Tim Rühsen * src/utils.c (match_tail): Fix unsigned integer overflow Add new fuzzer for the Set-Cookie parser * fuzz/Makefile.am: Add wget_cookie_fuzzer * fuzz/wget_cookie_fuzzer.c: New fuzzer * fuzz/wget_cookie_fuzzer.dict: Fuzzers dictionary * fuzz/wget_cookie_fuzzer.in: Initial corpora 2018-04-26 Tim Rühsen Fix buffer overflow in CSS parser * src/css-url.c (get_uri_string): Check input length * fuzz/wget_css_fuzzer.repro/buffer-overflow-6600180399865856: Add reproducer corpus Fixes OSS-Fuzz issue #8033. This is a long standing bug affecting all versions <= 1.19.4. 2018-04-26 Tim Rühsen Fix buffer overflow in CSS parser * src/css-url.c (get_urls_css): Check input string length * fuzz/wget_css_fuzzer.repro/negative-size-param-5724866467594240: Add reproducer corpus Fixes OSS-Fuzz issue #8032. This is a long standing bug affecting all versions <= 1.19.4. 2018-04-26 Tim Rühsen Exclude fuzz corpora from tarball * fuzz/Makefile.am: Do not include corpora in tarball * fuzz/main.c: SKIP if corpora directory isn't found (make check) The fuzz corpora are thousands of files, not needed for a standard build from a distribution tarball. The reproducers of former issues are being included for regression testing. 2018-04-26 Tim Rühsen * tests/Makefile.am: Add -I/src to AM_CPPFLAGS 2018-04-26 Tim Rühsen Add CSS slowness reproducer (fixed) * fuzz/wget_css_fuzzer.repro/slowness-6275836549267456: New file This file created an extreme CPU usage with the old CSS parser. 2018-04-26 Tim Rühsen Update CSS grammar from 1.x to 2.2 * src/css-tokens.h: Add enums and fixate values * src/css.l: Include config.h, ignore several compiler warnings, update the grammar to CSS 2.2 Fixes OSS-Fuzz issue #8010 (slowness issue). This is a long standing bug affecting all versions <= 1.19.4. Some crafted CSS input was extremely slow / CPU wasting, so it could be used as a DOS attack against website scanning. The code/grammar changes were backported from Wget2.x. 2018-04-25 Tim Rühsen * src/res.c (add_path): Fix memleak (parsing robots.txt) Fixes OSS-Fuzz issue #8005. This is a long standing bug affecting all versions <= 1.19.4. * src/ftp-ls.c (ftp_parse_winnt_ls): Fix integer overflow Fixes OSS-Fuzz issue #7999. This is a long standing bug affecting all versions <= 1.19.4. 2018-04-24 Tim Rühsen Add new fuzzer for the URL parser * fuzz/Makefile.am: Add wget_url_fuzzer * fuzz/wget_url_fuzzer.c: New fuzzer * fuzz/wget_url_fuzzer.in: Initial corpora Add new fuzzer for robots.txt parsing * fuzz/Makefile.am: Add wget_robots_fuzzer * fuzz/wget_robots_fuzzer.c: New fuzzer * fuzz/wget_robots_fuzzer.in: Initial corpora * fuzz/README.md: Add CFLAGS for undefined sanitizer * src/ftp-ls.c (ftp_parse_winnt_ls): Fix integer overflow * src/ftp-ls.c (ftp_parse_vms_ls): Fix integer overflow by left shift * src/ftp-ls.c (ftp_parse_unix_ls): Fix integer overflow in date parsing 2018-04-22 Tim Rühsen * src/ftp-ls.c (ftp_parse_winnt_ls): Fix heap-buffer-overflow Fixes OSS-Fuzz issue #7931. This is a long standing bug affecting all versions <= 1.19.4. * src/ftp-ls.c (ftp_parse_winnt_ls): Fix heap-buffer-overflow Fixes OSS-Fuzz issue #7930. This is a long standing bug affecting all versions <= 1.19.4. * fuzz/wget_ftpls_fuzzer.in: Update corpora 2018-04-21 Tim Rühsen * src/ftp-ls.c (eat_carets): Fix heap-buffer-overflow * src/ftp-ls.c (ftp_parse_winnt_ls): Fix memleak * src/ftp-ls.c (ftp_parse_vms_ls): Fix heap-buffer-overflow * src/ftp-ls.c (ftp_parse_vms_ls): Fix heap-buffer-overflow * src/ftp-ls.c (ftp_parse_vms_ls): Fix memleak Add new fuzzer for the FTP listing parsers * fuzz/Makefile.am: Add wget_ftpls_fuzzer * fuzz/wget_ftpls_fuzzer.c: New fuzzer * fuzz/wget_ftpls_fuzzer.dict: Fuzzer dictionary * fuzz/wget_ftpls_fuzzer.in/starter: Starting corpus * src/ftp-ls.c: Parsing function take FILE * as argument, new function ftp_parse_ls_fp() * src/ftp.c: Remove static from freefileinfo() * src/ftp.h: Add ftp_parse_ls_fp() and freefileinfo() * fuzz/run-clang.sh: Remove -detect_leaks=0 from fuzzer command line * src/main.c (main): Fix memleak for fuzzing/testing * src/init.c: Fix fuzzing in case ~/.wgetrc doesn't exist 2018-04-20 Tim Rühsen Fix fuzzer build for C++ * fuzz/wget_css_fuzzer.c: Include wget.h outside 'extern "C"', undef fopen_wgetrc directly after wget.h * fuzz/wget_html_fuzzer.c: Likewise * fuzz/Makefile.am: Add -I/lib to oss-fuzz builds Add new HTML parser fuzzer * fuzz/Makefile.am: Add wget_html_fuzzer * fuzz/wget_html_fuzzer.c: New fuzzer * fuzz/wget_html_fuzzer.dict: HTML dictionary for fuzzing * fuzz/wget_html_fuzzer.in: Initial corpora * src/html-url.c: Add new function get_urls_html_fm() * src/html-url.h: Add ne function get_urls_html_fm() * src/wget.h: Fix define for fopen_wgetrc() * fuzz/wget_css_fuzzer.c: Fix build * fuzz/wget_css_fuzzer.in/*: Update fuzzer corpora * src/css-url.c (get_uri_string): Fix buffer overflow (read) * src/iri.h: Fix C++ compile error 2018-04-19 Tim Rühsen * src/http.c: Download and scan CSS files in spider mode * src/css-url.c (get_urls_css): Call yylex_destroy() to reset CSS scanner Add new fuzzer wget_css_fuzzer.c * fuzz/Makefile.am: Add wget_css_fuzzer.c * fuzz/wget_css_fuzzer.c: New fuzzer 2018-04-18 Tim Rühsen * src/html-url.h: Include needed header files * wget_options_fuzzer.in/*: Update fuzzer corpora * fuzz/README.md: Add CXXFLAGS and more configure options Fix oss-fuzz issue with exit() * src/wget.h: Define exit() as exit_wget() * fuzz/wget_options_fuzzer.c: Implement exit_wget() and cleanup 2018-04-17 Tim Rühsen * fuzz/wget_options_fuzzer.c: Declare fopen_* as C functions Fix fopen/stdin issues with fuzzing * fuzz/wget_options_fuzzer.c: Add fopen_wget() and fopen_wgetrc() * src/utils.c: Use fopen_wgetrc() for config files, don't read from stdin when fuzzing * src/wget.h: Define fopen as fopen_wget when fuzzing, define fopen_wgetrc as fopen when not fuzzing * configure.ac: AC_DEFINE FUZZING if --enable-fuzzing was given * fuzz/wget_options_fuzzer.c: Write fuzzer crash reports * src/log.c: Don't check_redirect_output() when fuzzing * src/main.c (promt_for_password): Avoid getpass() when fuzzing Fix double fclose() with -d while fuzzing * src/ftp.c (ftp_loop_internal): Set warc_tmp to NULL after ffclose() * src/init.c (cleanup): Set output_stream to NULL after fclose() * src/log.c (log_close): Set global stream vars to NULL after closing * src/recur.c (retrieve_tree): Set rejectedlog to NULL after closing * src/warc.c (warc_close): Set stream vars to NULL after closing * src/main.c (main): Don't background if TESTING * src/init.c (initialize): Return error, don't exit() 2018-04-16 Tim Rühsen * src/init.c (cmd_use_askpass): Return false on error * src/utils.c (compile_posix_regex): Hard-code string to regcomp regcomp() may be too cpu + memory intensive for fuzzing. See https://sourceware.org/glibc/wiki/Security%20Exceptions Fix 2 more memleaks * src/init.c (initialize): Use global var for wgetrc filename * src/iri.c (find_locale): Return strdup'ed locale string * src/options.h (struct options): Add wgetrcfile * src/init.c (cleanup): Set output_stream to NULL after closing Fix homedir memory leaks * src/hsts.c: Use opt.homedir * src/init.c: Likewise * src/main.c: Likewise * src/netrc.c: Likewise * src/options.h (struct options): Add homedir * src/main.c (main): Free opt.encoding_remote properly * src/host.c (wait_ares): Free ptimer * src/init.c (cleanup): Free regex objects properly * src/init.c (cleanup): Never call cleanup() twice * src/init.c (cmd_bytes_sum): Fix integer over- and underflow * src/main.c (save_hsts): Free hsts_store after closing Use strtol() instead of selfmade function * src/init.c (cmd_number): Use strtol() instead of selfmade function * bootstrap.conf: Add strtol gnulib module * src/hsts.c (hsts_hash_func): Allow integer overflow * init.c (cmd_spec_mirror): Fix uninitialzed stack variable * src/init.c (cleanup): Free more variables * wget_options_fuzzer.in: Add corpora directory * fuzz/wget_options_fuzzer.c: Suppress error messages from wget * src/utils.c (fopen_stat): Early return to allow fuzzing/fmemopen * src/init.c (initialize): Free mem before exit() Add OSS-Fuzz infrastruture * Makefile.am: Add fuzz/ to SUBDIRS * cfg.mk: Fix 'make syntax-check' * configure.ac: Add --enable-fuzzing * fuzz/Makefile.am: New file * fuzz/README.md: New file * fuzz/fuzzer.h: New file * fuzz/get_all_corpora: New file * fuzz/get_ossfuzz_corpora: New file * fuzz/glob_crash.c: New file * fuzz/main.c: New file * fuzz/run-afl.sh: New file * fuzz/run-clang.sh: New file * fuzz/view-coverage.sh: New file * fuzz/wget_options_fuzzer.c: New file * fuzz/wget_options_fuzzer.dict: New file * src/init.c (cleanup): Free more resources * src/main.c (init_switches): Initialize only once, (print_usage): Don't print if TESTING is defined * src/utils.h: Include wget.h 2018-04-05 Tim Rühsen Move unit-test code to tests/ * src/Makefile.am: Remove test.c and test.h * src/test.c: Rename to tests/unit-tests.c * src/test.h: Rename to tests/unit-tests.h * tests/Makefile.am: Add unit-tests.c and unit-tests.h * src/hsts.c: Amend #include * src/http.c: Likewise * src/init.c: Likewise * src/metalink.c: Likewise * src/res.c: Likewise * src/url.c: Likewise * src/utils.c: Likewise * src/main.c: Rename main() -> main_wget() for unit tests 2018-03-14 Tim Rühsen Fix some issues found by 'infer' 2018-03-08 Tim Rühsen * src/openssl.c: Fix build for OpenSSL 1.1.0 without TLS1_3_VERSION Add docs for --secure-protocol=TLSv1_3 * doc/wget.texi: Likewise 2018-03-08 Loganaden Velvindron (tiny change) Add TLS1.3 support for OpenSSL build * src/init.c: Add 'tlsv1_3 for --secure-protocol * src/openssl.c (ssl_init): Enable TLS1.3 if possible * src/options.h: Add secure_protocol_tlsv1_3 * doc/wget.texi: Add description of TLSv1_3 2018-03-07 Tim Rühsen * src/main.c: Add help text for --retry-on-http-error Reported-by: Giovanni Tirloni 2018-03-01 Tim Rühsen * src/url.c (convert_fname): Fix invalid free on iconv_open() failure Reported-by: Volkmar Klatt 2018-02-21 Tim Rühsen * src/mswindows.c: Fix prototype of fork_to_background() Reported-by: Gisle Vanem 2018-02-09 Tim Rühsen Fix warning to not print binary IP address * tests/Test-https-badcerts.px: Likewise * tests/Test-https-clientcert.px: Likewise * tests/Test-https-crl.px: Likewise * tests/Test-https-pfs.px: Likewise * tests/Test-https-selfsigned.px: Likewise * tests/Test-https-tlsv1.px: Likewise * tests/Test-https-tlsv1x.px: Likewise * tests/Test-https-weboftrust.px: Likewise Use gnulib's utime() * bootstrap.conf: Add modules utime and utime-h * src/utils.c (touch): Remove own code for gnulib's utime() 2018-02-09 Tim Rühsen Fix logging in background mode * ../src/main.c: Re-init logfile if changed for background mode * ../src/utils.c: fork_to_background() returns whether logfile changed * ../src/utils.h: Set return type bool for fork_to_background() Fixes: #53020 Reported-by: Noël Köthe 2018-02-09 Tim Rühsen * src/http.c: Fix two typos in comments 2018-01-31 Tim Rühsen * testenv/conf/expected_files.py: Ignore pubring.gpg Reported-by: Arkadiusz Miśkiewicz * bootstrap.conf: Replace gnulib module gettext by gettext-h 2018-01-22 Tim Rühsen Mention list and bugtracker for --help and in man page * doc/wget.texi: Mention list and bugtracker in man page * src/main.c: Mention list and bugtracker for --help 2018-01-21 Darshit Shah * contrib/make-release: Add another release script Update NEWS file for new release Fix typo in documentation 2018-01-21 Darshit Shah Switch off compression by default Gzip compression has a number of bugs which need to be ironed out before we can support it by default. Some of these stem from a misunderstanding of the HTTP spec, but a lot of them are also due to many web servers not being compliant with RFC 7231. With this commit, I am marking GZip compression support as experimental in GNU Wget pending further investigation and the addition of tests. * src/init.c (defaults): Switch of compression support by default * docs/wget.texi: State that compression is experimental 2018-01-21 Darshit Shah Pull gnulib module forward Revert "* src/init.c (defaults): Set compression_none as the default compression" This reverts commit 8283ac08467016283277578b96533febb8ce1adb. * src/init.c (defaults): Set compression_none as the default compression 2018-01-20 Reiji * src/http.c (gethttp): Fix bug that prevented all files from being decompressed 2018-01-19 Tim Rühsen * src/host.c (sufmatch): Fix to domain matching 2018-01-17 Gisle Vanem * src/netrc.c: Fix Standalone compilation of netrc file 2018-01-14 Darshit Shah * NEWS: Update NEWS file for new release * bootstrap: Use the faster gnulib-tool.py script if possible Sync the bootstrap script from gnulib Update Copyright years Replace HTTP urls with HTTPS where valid Update gnulib * src/netrc.c: Search for the correct netrc file on Windows 2018-01-10 Tim Rühsen * src/http.c: Exclude *.gz and *.tgz from decompression 2017-12-31 Tim Rühsen * src/utils.c (wg_pin_peer_pubkey): Fix format warning 2017-12-31 Peter Wu Avoid redirecting output to file when tcgetpgrp fails * src/log.c (check_redirect_output): tcgetpgrp can return -1 (ENOTTY), be sure to check whether a valid controlling terminal exists before redirecting. Fixes: #51181 2017-12-11 Darshit Shah Add new test for 416 responses * testenv/server/http/http_server.py: If there are multiple requests in which the requested range is unsatisfiable, then send a body in the in the 2nd response onwards * testenv/Test-416.py: New test to check how Wget handles 416 responses 2017-12-08 Darshit Shah Don't assume a 416 response has no body * http.c(gethttp): In case of a 416 response, try to drain the socket of any bytes before reusing the connection Reported-By: Iru Cai 2017-11-26 Tim Rühsen Support building with OpenSSL 1.1 w/o deprecated features * src/openssl.c (ssl_init): Fix code for the subject's issue Reported-by: Matthew Thode 2017-11-25 Tim Rühsen Avoid link conversion after 304 Not Modified * src/http.c (gethttp): Handle 304 before setting document content type Fixes: #52404 Reported-by: Ben Fuchs 2017-11-16 YX Hao Fix printing mutibyte chars as unprintable chars on Windows * src/log.c (get_warc_log_fp): Fix return value to stderr * src/main.c (main): Init logging as soon as possible, fix locale/charset on Windows 2017-11-16 Tim Rühsen * tests/Test-https-weboftrust.px: Fix/add sslport 2017-11-15 YX Hao Convert remote path to local encoding * src/url.c (url_file_name): Convert remote path to local encoding 2017-11-10 Tim Rühsen Do not use must-revalidate in Cache-Control header As the bug report states, 'must-revalidate' is a request directive. Fixes #52379 2017-11-06 Darshit Shah Fix Segfault due to derefencing null ptr * src/http.c(gethttp): When Encoding is gzip, ensure that the Content-Type Header was actually seen. Without this, the "type" variable is null causing a Segfault. Reported-By: Noël Köthe 2017-11-03 Tim Rühsen * src/http.c: Fix H_REDIRECTED * src/http.c: Add support for HTTP status code 308 2017-10-30 Tim Rühsen Fix uninitialized value messages in tests * tests/Test-https-*.px: Use correct variable in error message Reported-by: Jeffrey Walton 2017-10-30 Tim Rühsen Ignore dirmngr.conf, gpg.conf in test directories * testenv/conf/expected_files.py: Ignore dirmngr.conf, gpg.conf Libgpgme creates certain files if $HOME doesn't contain the .gnupg directory. These files disturb some metalink tests if we don't ignore them. Reported-by: Arkadiusz Miśkiewicz 2017-10-27 Tim Rühsen * contrib/release: Cleanup, use lzip instead of xz 2017-10-26 Tim Rühsen Fix heap overflow in HTTP protocol handling (CVE-2017-13090) * src/retr.c (fd_read_body): Stop processing on negative chunk size Reported-by: Antti Levomäki, Christian Jalio, Joonas Pihlaja from Forcepoint Reported-by: Juhani Eronen from Finnish National Cyber Security Centre 2017-10-26 Tim Rühsen Fix stack overflow in HTTP protocol handling (CVE-2017-13089) * src/http.c (skip_short_body): Return error on negative chunk size Reported-by: Antti Levomäki, Christian Jalio, Joonas Pihlaja from Forcepoint Reported-by: Juhani Eronen from Finnish National Cyber Security Centre 2017-10-26 Tim Rühsen Update for release 1.19.2 * NEWS: Add news for 1.19.2 * doc/wget.texi: Fix 1.20 to 1.19.2 2017-10-25 YX Hao Avoid unnecessary UTF-8 encoded fallback (trivial change) * src/retr.c (retrieve_url): Check for changed URL on redirect 2017-09-27 Tim Rühsen Add GNU extensions to .netrc parsing src/netrc.c (parse_netrc): Add 'port' and 'force' extensions Reported-by: September 20Tim Landscheidt 2017-09-18 Josef Moellers Bail out on unexpected 416 server errors * src/http.c (gethttp): Stop on 416 if file is incomplete 2017-08-04 Tim Schlueter Add gzip Content-Encoding decompression * src/http.c (struct http_stat): Add remote_encoding field. (read_response_body): Enable gzip decompression. (initialize_request): Send gzip Accept-Encoding header. (gethttp): Decompress files with gzip Content-Encoding. * src/retr.c: include zlib.h. (zalloc): New function. (zfree): New function. (fd_read_body): Decompress gzip data. * src/retr.h (fd_read_body enum): Add rb_compressed_gzip flag. Add --compression option * doc/wget.texi: Add --compression documentation. * src/init.c (cmd_spec_compression): New function. (commands[]): Add opt.compression. (defaults): Set default opt.compression value. * src/main.c (option_data[]): Add struct for --compression. (print_help, help[]): Add description for --compression. (main): Add incompatibility checks for --compression. * src/options.h (struct options): Add compression enum and field. Adjust Extension based on Content-Encoding * doc/wget.texi (--adjust-extension, adjust_extension): Updated documentation. * src/http.c (encoding_t): New enum. (struct http_stat): Add local_encoding field. (gethttp): --adjust-extension based on Content-Encoding. 2017-07-31 Darshit Shah Document gperf as a requirement 2017-07-28 Tim Rühsen * src/url.c (url_scheme): Use ASCII version of strncasecmp 2017-07-28 Tim Rühsen Fix misuse of strncasecmp * src/http.c (set_content_type): Use c_strcasecmp instead of strncasecmp See issue bug #51576 2017-07-09 Tim Rühsen Fix python test suite for GnuTLS 3.5.12+ * testenv/Test-*.py: Replace 127.0.0.1 by localhost * testenv/certs/server-template.cfg: Likewise * testenv/certs/server-cert.pem: Regenerate * testenv/certs/server-crl.pem: Likewise * testenv/test/base_test.py: Hardcode 'localhost' as server domain Reported-by: Ludovic Courtès 2017-06-13 Tim Rühsen Check for 304 response before applying --adjust-extension * src/http.c (gethttp): Move 304 code before --adjust-extension code This fixes applying --adjust-extension in combination with 304 HTTP responses. It could lead to .html extensions to arbitrary files. Reported-by: anfractuosity 2017-06-13 Tim Rühsen Fix buffer overflow in Public Key Pinning * src/utils.c (wget_base64_decode): Add param for destination size, (wg_pubkey_pem_to_der): Amend call to wget_base64_decode(), (wg_pin_peer_pubkey): Likewise and fix code style. * src/utils.h: Add param to wget_base64_decode() * src/http-ntlm.c (ntlm_input): Amend call to wget_base64_decode() * src/http.c (skip_content_type): Likewise Fixes #51227 2017-06-02 Tim Rühsen * doc/wget.texi: Mention --no-config 2017-05-16 Tim Rühsen * testenv/Test-recursive-basic.py: Check crawled files Fix two Metalink tests if $HOME is changed * conf/expected_files.py (gen_local_fs_snapshot): Skip processing of 'pubring.kbx' 2017-05-15 Tomas Hozza Add command line option to disable use of .netrc Although internally code uses option for (not) reading .netrc for credentials, it was not possible to turn this behavior off on command line. Note that it was possible to turn it off using wgetrc. Idea for this change came from Bruce Jerrick (bmj001@gmail.com). Reference: https://bugzilla.redhat.com/show_bug.cgi?id=1425097 2017-05-15 Tomas Hozza Fixed getting of credentials from .netrc There seemed to be a copy&paste error in http.c code, which decides whether to get credentials from .netrc. In ftp.c "user" and "pass" variables are char*, while in http.c, these are char**. For this reason they should be dereferenced when determining if password and user login is set to some value. Also since both variables are dereferenced on lines above the changed code, it does not really make sense to check if they are NULL. This patch is based on fix from Bruce Jerrick . Fedora bug: https://bugzilla.redhat.com/show_bug.cgi?id=1425097 2017-05-15 Tomas Hozza Added tests for HTTP authentication using credentials from .netrc Getting credentials from .netrc has been broken from time to time, thus adding a test coverage to prevent regressions. Also added setting of "HOME" environment variable when executing wget, to make sure LocalFiles like .netrc, which are created just for the test, are actually used. 2017-05-15 Tim Rühsen Fix Test-https-badcerts.px * util/createcerts.sh: Set date of invalid cert to +20 years * tests/certs/invalid.crt: Re-generated * tests/certs/invalid.key: Re-generated 2017-05-14 Tim Rühsen * util/createcerts.sh: Fix double equal (syntax-check) Fix Test-https-badcerts.px to work with GnuTLS * tests/Test-https-badcerts.px: Remove OpenSSL requirement * tests/certs/expired.crt: Re-created with valid dates * tests/certs/expired.key: Likewise * tests/certs/invalid.crt: Likewise * tests/certs/invalid.key: Likewise 2017-05-14 Vijo Cherian New shell script to create the certs and keys required for TLS tests * util/createcerts.sh: New file 2017-05-11 Tim Rühsen Fix HTTPS testing for stricter OpenSSL * testenv/certs/README: Amend cert creation extensions * testenv/certs/ca-cert.pem: Created without OCSP signing purpose Having the OCSP signing purpose set made newer versions of OpenSSL fail due to stricter checking. Test version of OpenSSL was 1.1.0e. 2017-05-11 Tim Rühsen * src/iri.c (idn_encode): Better IDNA 2003 compatibility 2017-05-10 Tim Rühsen * .gitlab-ci.yml: Also test OpenSSL build * .gitlab-ci.yml: Remove installation of texlive 2017-05-10 Tim Rühsen Add certs/wotca.pem to avoid temp. file creation * tests/certs/wotca.pem: New file * tests/Test-https-weboftrust.px: Remove creation of wotca.pem This change let Test-https-weboftrust.px survive VPATH builds. 2017-05-10 Tim Rühsen * cfg.mk: Exclude ^tests/certs/.* from syntax-check * po/POTFILES.in: Remove spider.c (syntax-check) * tests/Test-https-selfsigned.px: Add newline at EOF (syntax-check) 2017-05-10 Vijo Cherian Removed all uses of temp files. Added needed files to GIT Made CRL related files in the repo, instead of trying to generate them 2017-05-09 Tim Rühsen Cleanup on exit in Test-https-*.px * tests/Test-https-*.px: Cleanup on exit Auto-generate interca.conf and rootca.conf * configure.ac: Add interca.conf.in and rootca.conf.in to AC_CONFIG_FILES * tests/certs/interca.conf: Removed * tests/certs/rootca.conf: Removed * tests/certs/interca.conf.in: New file * tests/certs/rootca.conf.in: New file Fix path and VPATH issues of new https/TLS tests * tests/SSLTest.pm: Use $srcdir in read-only files, (_setup_server): Remove unneeded path fixation code * tests/Test-https-*.px: Use $srcdir for read-only files, use $cdir for writable files * tests/certs/interca.conf: Adjust paths * tests/certs/rootca.conf: Adjust paths Check for test server name resolution in tests * tests/Test-https-*.px: Skip if test server name resolution fails * tests/Makefile.am: Enable Test-https-badcerts again Fix WgetFeature.pm to allow multiple required features * tests/WgetFeature.cfg: Remove file * tests/WgetFeature.pm: Extend to multiple features, cleanup * .gitlab-ci.yml: Add wgettestingserver to /etc/hosts * tests/SSLServer.pm: Check for IO::Socket::SSL * tests/Test-https-*: Change server port to <= 32767 Move https test server ports from >32767 to <= 32767 * Test-https-badcerts.px: Change port * Test-https-crl.px: Likewise * Test-https-weboftrust.px: Likewise * tests/Makefile.am: Add SSLTest.pm and SSLServer.pm to EXTRA_DIST * tests//Makefile.am: Disable Test-https-badcerts.px Add Gitlab CI (Debian) * .gitlab-ci.yml: New file 2017-05-06 Tim Rühsen * .travis.yml: Use trusty for libidn2-dev 2017-05-04 Tim Rühsen * tests/certs/test-ca-key.pem: Add newline at EOF Add static HOSTSALIAS file * tests/certs/wgethosts: New file * tests/Test-https-*.px: Remove creation of wgethosts file 2017-05-04 Vijo Cherian Added new tests for SSL * tests/Test-https-badcerts.px : New file * tests/Test-https-clientcert.px : New file * tests/Test-https-crl.px : New file * tests/Test-https-weboftrust.px : New file * tests/certs/interca.conf : New file * tests/certs/rootca.conf : New file * tests/certs/test-ca-key.pem : New file Added all new SSL / HTTPS tests to make check Added Test for SSL Web of Trust, accept only if CA chain of trust is intact. Added a test script for client certificate Added Test for crlfile option of wget Added test to make sure that wget doesn't accept expired or invalid certs Some clean up : 1, Removed cause of warnings from perl & other cosmetic changes 2, Fix make -j 4 check such that it passes all tests 2017-05-04 Tomas Hozza Mention TLSv1_1 and TLSv1_2 as secure-protocol values in help * src/main.c: The --secure-protocol option accepts also values TLSv1_1 and TLSv1_2, as mentioned in the man page. However the help message doesn't mention these two values. This patch adds TLSv1_1 and TLSv1_2 as possible values to the help message. 2017-05-03 Tim Rühsen * src/http.c (gethttp): Support Wayback Machine's X-Archive-Orig-last-modified 2017-04-28 Vijo Cherian Added new tests for SSL * tests/Test-https-badcerts.px : New file * tests/Test-https-clientcert.px : New file * tests/Test-https-crl.px : New file * tests/Test-https-weboftrust.px : New file * tests/certs/interca.conf : New file * tests/certs/rootca.conf : New file * tests/certs/test-ca-key.pem : New file Added all new SSL / HTTPS tests to make check Added Test for SSL Web of Trust, accept only if CA chain of trust is intact. Added a test script for client certificate Added Test for crlfile option of wget Added test to make sure that wget doesn't accept expired or invalid certs Some clean up : Removed cause of warnings from perl & other cosmetic changes 2017-04-19 Vijo Cherian Added new tests for SSL * tests/SSLServer.pm: New file * tests/SSLTest.pm: New file * tests/Test-https-pfs.px: New file * tests/Test-https-selfsigned.px: New file * tests/Test-https-tlsv1.px: New file * tests/Test-https-tlsv1x.px: New file * tests/certs/server.crt: New file * tests/certs/server.key: New file * tests/certs/test-ca-cert.pem: New file Added 4 new test scripts all for SSL. Added base pm for SSL testing. Added SSL tests for TLSv1, TLSv1_1 and PFS. Added test for self signed cert : check that it fails without --no-check-certificate and passes with that flag. 2017-04-18 Tim Rühsen * src/utils.c: Remove non-portable __builtin_unreachable() 2017-04-18 Tim Rühsen Skip iconv() usage if HAVE_ICONV is not defined This helps on broken iconv implementations, e.g. Solaris. Reported-by: Mojca Miklavec 2017-04-18 Tim Rühsen Mention 'bash' for executing ./bootstrap Reported-by: Mojca Miklavec 2017-04-18 Tim Rühsen * bootstrap.conf: Make 'sed' options more portable 'sed -i' works differently on FreeBSD. Reported-by: Mojca Miklavec 2017-04-16 Tim Rühsen Fix charset transcoding issue for non-reversible codepoints * src/url.c: Check iconv() against 0, not -1 On some libiconv implementations, unknown codepoints become encoded as ?, e.g. when converting a non-ascii codepoint to ASCII. This results in ambigious file names which also fails our tests. 2017-04-16 Tim Rühsen * src/iri.c: Fix WIN32 idn2_free, forgotten code 2017-04-14 Darshit Shah * src/init.c: Set flstats correctly when using WGETRC env var 2017-04-08 Tim Rühsen Fix use of idn2_free() * src/connect.c (connect_to_ip): Use xfree() instead of idn2_free() * src/host.c (lookup_host): Use xfree() instead of idn2_free() * src/iri.h: Do not include idn2.h * src/url.c (url_free): Use xfree() instead of idn2_free() * src/url.h (struct url): Remove 'idn_allocated' from struct Reported-by: Gisle Vanem 2017-04-05 Anton Yuzhaninov Fix perl warnings in tests * tests/FTPServer.pm: Escape '{' in RE to fix warnings * tests/FTPTest.pm: Likewise * tests/HTTPServer.pm: Likewise * tests/HTTPTest.pm: Likewise * tests/Test-proxied-https-auth-keepalive.px: Likewise * tests/Test-proxied-https-auth.px: Likewise Escape '{' in RE to fix warnings: Unescaped left brace in regex is deprecated, passed through in regex; marked by <-- HERE in m/{{ <-- HERE port}}/ 2017-04-01 klemens Fix typos in comments 2017-03-31 Tim Rühsen * src/metalink.c (retrieve_from_metalink): Fix len in memset() 2017-03-24 Tim Rühsen Add gnulib module group-member 2017-03-24 Vijo Cherian Safeguards against TOCTTOU * src/utils.h: Add struct file_stat_s declaration, change prototypes of file_exists_p(), add prototypes for fopen_stat() and open_stat(). * src/utils.c: Extend file_exists_p(), new function fopen_stat() and open_stat(), add new param for file_exists_p(). * src/init.h: Add param file_stats_t to run_wgetrc(). * src/ftp.c: Amend calls to extended functions. * src/hsts.c: Likewise. * src/http.c: Likewise. * src/init.c: Likewise. * src/main.c: Likewise. * src/metalink.c: Likewise. * src/retr.c: Likewise. * src/url.c: Likewise. Added fopen_stat() and open_stat() that checks to makes sure the file didn't change underneath us. Return error from file_exists_p(). Added a way to return error from this file without major surgery to the callers. Fixes: #20369 2017-03-24 Tim Rühsen Update gnulib 2017-03-22 Christof Horschitz * src/warc.c (warc_write_cdx_record): Escape URLs 2017-03-20 Mike Frysinger Include libunistring headers only when used * src/iri.c: Check for libidn2 < 0.14 to include libunistring headers The unistring functions are used only when an older version of libidn2 is used, so don't include its headers either w/newer libdin2 versions. 2017-03-18 Tim Rühsen Fix links to www.robotstxt.org * NEWS: Fix links * doc/wget.texi: Likewise * src/res.c: Likewise Reported-by: Noël Köthe 2017-03-13 Tim Rühsen * tests/WgetTests.pm: Add -d to Wget test options 2017-03-08 Tim Rühsen Include for Windows Reported-by: Gisle Vanem 2017-03-08 Tim Rühsen Fix updating HSTS entries * src/hsts.c (hsts_store_entry): Always update 'created' field Fixes: #50490 Reported-by: Deian Stefan, Atyansh Jaiswal, Jonathan Luck 2017-03-06 Tim Rühsen Fix CRLF injection in Wget host part * src/url.c (url_parse): Reject control characters in host part of URL Reported-by: Orange Tsai 2017-03-04 Benjamin Esham * src/warc.c: Use warc_write_header_uri for all WARC-Target-URI fields The WARC spec requires that all URIs be enclosed in angle brackets. This was being done in most cases, but not for "WARC-Target-URI" fields in WARC blocks of type "response", "resource", "revisit", and "metadata". 2017-02-16 Tim Rühsen Fix 504 status handling * src/http.c (gethttp): Move 504 handling to correct place. (http_loop): Fix memeory leak. * testenv/server/http/http_server.py: Add Content-Length header on non-2xx status codes with a body Reported-by: Adam Sampson 2017-02-16 YX Hao * src/url.c (url_file_name): Do not charset convert local directory In a non-ASCII environment, the local path may contain non-ASCII characters. The server responded file name must be converted before it is concatenated to the local path. Conversion after concatenation may result in 'iconv' errors. 2017-02-15 Tim Rühsen * configure.ac: Remove manually resetting of LIBICONV variable Fixes: #48193 Reported-by: ilovezfs Reported-by: Charles * bootstrap.conf: Call gperf to create lib/unicase/special-casing-table.h * bootstrap.conf: Fix latest gnulib to work with gperf < 3.1 2017-02-14 Darshit Shah Pull GNULib to latest. * gnulib: Pull to latest commit. This fixes an issue with Gperf 3.1 2017-02-12 Tim Rühsen * src/main.c: Remove double 'verbose' option Fixes: #50290 2017-02-11 Tim Rühsen * NEWS: update isrc/http.c (check_retry_on_http_error): Fix gcc warning 2017-02-11 Tom Szilagyi Add support for --retry-on-http-error * doc/wget.text: Add documentation * src/http.c: Add function check_retry_on_http_error () * src/init.c: Add opt.retry_on_http_error * src/main.c: Add struct for retry-on-http-error to option_data[] * src/options.h: Add retry_on_http_error to struct options 2017-02-11 Tim Rühsen Revert "Add support for --retry-on-http-error" This reverts commit 977276374d1be7f171f5da8633368025c88b9eee. 2017-02-10 Tim Rühsen Change libtool library deps to non-libtool deps Reported-by: Yousong Zhou Fixes: #50260 2017-02-09 Tom Szilagyi Add support for --retry-on-http-error * doc/wget.texi: Add description for --retry-on-http-error * src/http.c (gethttp): Consider given HTTP response codes as non-fatal, transient errors. Supply a comma-separated list of 3-digit HTTP response codes as argument. Useful to work around special circumstances where retries are required, but the server responds with an error code normally not retried by Wget. Such errors might be 503 (Service Unavailable) and 429 (Too Many Requests). Retries enabled by this option are performed subject to the normal retry timing and retry count limitations of Wget. Using this option is intended to support special use cases only and is generally not recommended, as it can force retries even in cases where the server is actually trying to decrease its load. Please use it wisely and only if you know what you are doing. Example use and a starting point for manual testing: wget --retry-on-http-error=429,503 http://httpbin.org/status/503 2017-02-09 Tim Rühsen * tests/WgetTests.pm: Add --no-config to wget invocation Without --no-config, wgetrc commands may disturb the tests. Reported-by: Dagobert Michelsen 2017-02-06 Tim Rühsen * tests/WgetTests.pm (_verify_download): Fix failure message Reported-by: Zhiming Wang * src/http.c (initialize_request): Fix regression in .netrc auth Reported-by: Axel Reinhold * src/iri.c (idn_encode): Fix memory leak 2017-02-06 Tim Rühsen Remove skipping libunistring with --disable-iri * configure.ac: Remove checking for IRI=no On certain systems, gnulib falls back to libunistring for mbtowc() and wcwidth() (used in src/progress.c). Reported-by: Zhiming Wang 2017-02-04 Tim Rühsen * bootstrap.conf: Add gnulib module wcwidth Fix include/define clash with gnulib's unlink module * src/options.h: Rename options.unlink to options.unlink_requested * src/init.c: Replace options unlink member by unlink_requested * src/http.c: Likewise * src/ftp.c: Likewise * bootstrap.conf: Specify unicode modules more exact * src/xattr.h: Fix #define fsetxattr for MacOS and FreeBSD Reported-by: Zhiming Wang 2017-02-03 Tim Rühsen * contrib/release: New release helper script * cfg.mk: Fix syntax-check * doc/announcement_1.19.txt: Add new file * src/utils.c: Move macro FMT_MAX_LENGTH into scope * src/utils.c: Fix -Wformat= warnings * src/gnutls.c: Fix -Wformat= warnings * src/iri.c: Remove unused macro IDNA_FLAGS * src/iri.c: Remove use of __func__ macros * src/http.c: Fix -Wformat= warnings * src/progress.c: Remove unused macro move_to_end * src/html-parse.c: Remove unused macro SKIP_NON_WS * src/hsts.c: Remove unused macro CHECK_EXPLICIT_PORT * src/hsts.c: Fix -Wformat= warnings * src/hash.c: Explicitly convert float to int * src/ftp-ls.c: Fix -Wformat= warnings * src/ftp.c (ftp_retrieve_list): Add default to switch * src/css-url.h: Remove redundant declaration * src/ftp.c: Fix -Wformat= warning * src/http.c (test_parse_range_header): Fix constants 2017-02-02 Dale R. Worley Improve documentation of --trust-server-names. 2017-02-02 Tim Rühsen * src/url.c (mkalldirs): Add newline to log message * src/cookies.c (check_domain_match): Add newline to DEBUG lines 2017-01-17 Tim Rühsen * NEWS: update * README.checkout: Link HTTPS where possible, update flex homepage * README.checkout: Fix libidn to libidn2 2017-01-13 Tim Rühsen * src/iri.c: Use TR46 non-transitional for toASCII conversion * src/main.c: Fix _Noreturn compiler warnings * src/utils.c: Fix _Noreturn compiler warning * src/init.c (setval_internal): Fix sign compare warning 2017-01-13 Tim Rühsen Replace home-grown portability code by gnulib modules * bootstrap.conf: Add intprops, inttypes, limits-h, signal-h, stat, sys_types * src/ftp.c: Replace 'struct_stat' by 'struct stat' * src/hsts.c: Likewise * src/http.c: Likewise * src/main.c: Likewise * src/netrc.c: Likewise * src/retr.c: Likewise * src/url.c: Likewise * src/utils.c: Likewise * src/sysdep.h: Remove old portability code Further portability issues should be addressed by gnulib. 2017-01-13 Tim Rühsen Replace WGET_* m4 macros by gnulib modules * bootstrap.conf: Add hostent, inet_ntop, nanosleep, utimens * configure.ac: Remove WGET_STRUCT_UTIMBUF, WGET_FNMATCH, WGET_NANOSLEEP, WGET_POSIX_CLOCK, WGET_NSL_SOCKET * m4/wget.m4: Likewise * src/Makefile.am: Add $(LIB_NANOSLEEP) $(LIB_POSIX_SPAWN) to LDADD * tests/Makefile.am: Likewise * src/host.c (print_address): Use inet_ntop also for IPV4 2017-01-13 Tim Rühsen Remove libidn vulnerability work-around * src/iri.c (_utf8_is_valid): Removed Since we are using libidn2 for IDNs, we no longer need this work-around. 2017-01-13 Tim Rühsen * src/iri.c (idn_encode): Use TR46 transitional if available * gnulibx: Update 2017-01-07 Tim Rühsen Fix previous commit 2427ca4ac090dc6cc0c527f4ac4fc5a4bd468eb1 2017-01-06 vijeth-aradhya Fix http.c and ftp.c passwd logic error * src/ftp.c (getftp): Fix password/user selection * src/http.c (initialize_request): Likewise Before, netrc password won over interactive --ask-password but now --ask-password wins after change of program logic Fixes Issue #48811 2016-12-31 Giuseppe Scrivano * src/main.c (main): Add missing \n in error message 2016-12-28 Giuseppe Scrivano Check that fd_set has not fds bigger than FD_SETSIZE * src/connect.c: check that the fd is not bigger than FD_SETSIZE before using FD_SET. An fd_set cannot hold fds bigger than FD_SETSIZE, causing out-of-bounds write to a buffer on the stack. Reported by: Jann Horn 2016-12-20 Nikos Mavrogiannopoulos Avoid calling the gnutls priority functions multiple times * src/gnutls.c (ssl_connect_wget): Call gnutls_set_default_priority() for --secure-protocol=auto (default). The patch fixes a behavior that may have unintended side-effects in certain gnutls versions. Instead use the default priorities when no options are given. 2016-12-19 Tim Rühsen Print debug message when skipping certain recursive downloads * src/recur.c (retrieve_tree): Print debug message instead silently skipping recursive downloads. 2016-12-14 Rahul Bedarkar Rename base64_{encode,decode} (trivial patch) * src/http-ntlm.c: Rename base64_{encode,decode} * src/http.c: Likewise * src/utils.c: Likewise * src/utils.h: Likewise When statically linking with gnutls, we get definition clash error for base64_encode which is also defined by gnutls. To prevent definition clash, rename base64_{encode,decode} 2016-12-11 Tim Rühsen Add support for psl_latest() * configure.ac: Add check for psl_latest(), remove --with-psl-file * src/cookies.c (check_domain_match): Use psl_latest() if available 2016-11-09 Piotr Wajda Respect -o parameter again * log.c: don't choose log output dynamically when opt.lfilename is set Regression introduced by dd5c549f6af8e1143e1a6ef66725eea4bcd9ad50 Reported-by: Dale R. Worley 2016-11-07 Tim Rühsen Move Wget from IDN2003 (libidn) to IDN2008 (libidn2) * .travis.yml: Install libidn2-dev instead libidn11-dev. * bootstrap.conf: Add modules libunistring-optional, unistr/base, unicase/tolower. * configure.ac: Check for libidn2. * src/Makefile.am: Add $(LTLIBUNISTRING) to LDADD. * tests/Makefile.am: Set LDADD similar to LDADD in src/Makefile.am * src/connect.c: Use libidn2 code instead of libidn. * src/host.c: Likewise. * src/iri.c: Likewise. * src/iri.h: Likewise. * src/options.h: Likewise. * src/url.c: Likewise. * src/url.h: Likewise. * src/log.c: Fix C99 comment. IDN2003 should not be used any more due to security concerns. We use libunistring (resp. the unicode code from gnulib) for lowercasing UTF-8 before we give data to libidn2. TR#46 is missing, no support in libidn2 nor in libunistring. 2016-11-03 Tim Rühsen Use -O0 in contrib scripts * contrib/check-hard: Use -O0 instead of -O2 to speed up script. * contrib/travis-ci: Likewise. Update gnulib 2016-10-22 Tim Rühsen * src/metalink.c: Fix typo 'suceeded' -> 'succeeded' Reported-by: Göran Uddeborg , Anders Jonsson 2016-10-21 losgrandes Fixes #45790: wget prints it's progress even when background * src/log.c: Use tcgetpgrp(STDIN_FILENO) != getpgrp() to determine when to print to STD* or logfile. Deprecate log_request_redirect_output function. Use different file handles for STD* and logfile, to easily switch between them when changing fg/bg. * src/log.h: Make redirect_output function externally linked. * src/main.c: Don't use deprecated log_request_redirect_output function. Use redirect_output instead. * src/mswindows.c: Don't use deprecated log_request_redirect_output function. Use redirect_output instead. Fixes #46584: wget --spider always returns zero exit status * src/ftp.c: Return error as exit value if even one file doesn't exist 2016-10-12 Tim Rühsen * src/http.c (gethttp): Accept 206 for request w/o Range header Fixes: #49319 2016-10-07 Tim Rühsen * tests/Makefile.am: Let WGETRC work on *nix and Windows Properly include iconv.h * src/iri.c: Check HAVE_ICONV to include iconv.h * src/url.c: Same * bootstrap.conf: Add pipe-posix for pipe() 2016-10-07 Tim Rühsen Amend redirection behavior * src/recur.c (descend_redirect): Ignore WG_RR_LIST and WG_RR_REGEX for redirections. * testenv/Makefile.am: Add Test-recursive-redirect.py * testenv/Test-recursive-redirect.py: New test Test-recursive-redirect.py written by Dale R. Worley. Reported-by: "Dale R. Worley" 2016-09-30 Matthew White New: Metalink/XML v3 python class, update tests to use this class * testenv/misc/metalinkv3_xml.py: New Metalink/XML v3 python class * testenv/Test-metalink-xml-abspath-trust.py: Update test to use the new Metalink/XML v3 python class * testenv/Test-metalink-xml-abspath.py: Update test to use the new Metalink/XML v3 python class * testenv/Test-metalink-xml-absprefix-trust.py: Update test to use the new Metalink/XML v3 python class * testenv/Test-metalink-xml-absprefix.py: Update test to use the new Metalink/XML v3 python class * testenv/Test-metalink-xml-continue.py: Update test to use the new Metalink/XML v3 python class * testenv/Test-metalink-xml-emptyprefix-trust.py: Update test to use the new Metalink/XML v3 python class * testenv/Test-metalink-xml-homepath-trust.py: Update test to use the new Metalink/XML v3 python class * testenv/Test-metalink-xml-homepath.py: Update test to use the new Metalink/XML v3 python class * testenv/Test-metalink-xml-homeprefix-trust.py: Update test to use the new Metalink/XML v3 python class * testenv/Test-metalink-xml-homeprefix.py: Update test to use the new Metalink/XML v3 python class * testenv/Test-metalink-xml-nohash.py: Update test to use the new Metalink/XML v3 python class * testenv/Test-metalink-xml-nourls.py: Update test to use the new Metalink/XML v3 python class * testenv/Test-metalink-xml-prefix-trust.py: Update test to use the new Metalink/XML v3 python class * testenv/Test-metalink-xml-prefix.py: Update test to use the new Metalink/XML v3 python class * testenv/Test-metalink-xml-relpath-trust.py: Update test to use the new Metalink/XML v3 python class * testenv/Test-metalink-xml-relpath.py: Update test to use the new Metalink/XML v3 python class * testenv/Test-metalink-xml-relprefix-trust.py: Update test to use the new Metalink/XML v3 python class * testenv/Test-metalink-xml-relprefix.py: Update test to use the new Metalink/XML v3 python class * testenv/Test-metalink-xml-size.py: Update test to use the new Metalink/XML v3 python class * testenv/Test-metalink-xml-trust.py: Update test to use the new Metalink/XML v3 python class * testenv/Test-metalink-xml.py: Update test to use the new Metalink/XML v3 python class 2016-09-30 Matthew White New: --metalink-over-http Content-Type/Disposition Metalink/XML processing * src/http.c (metalink_from_http): Process the Content-Type header. Add an application/metalink4+xml URL as metalink metaurl. If the option opt.content_disposition is true, the Content-Disposition's filename is the metaurl's name * doc/wget.texi: Update --content-disposition and --metalink-over-http * doc/metalink-standard.txt: Update doc. Content-Type/Disposition processing through --metalink-over-http. Update download naming system about --trust-server-names and --content-disposition * testenv/Makefile.am: Add new files * testenv/Test-metalink-http-xml-type.py: New file. Metalink/HTTP Content-Type/Disposition header automated Metalink/XML tests * testenv/Test-metalink-http-xml-type-trust.py: New file. Metalink/HTTP Content-Type/Disposition header with --trust-server-names automated Metalink/XML tests * testenv/Test-metalink-http-xml-type-content.py: New file. Metalink/HTTP Content-Type/Disposition header with --content-disposition automated Metalink/XML tests * testenv/Test-metalink-http-xml-type-trust-content.py: New file. Metalink/HTTP Content-Type/Disposition header with --trust-server-names and --content-disposition automated Metalink/XML tests Process the Content-Type header, identify an application/metalink4+xml file. The Content-Disposition could provide an alternate name through the "filename" field for the metalink xml file. Respectively, the cli options --metalink-over-http and --content-disposition are required. When Metalink/XML auto-processing, to use the Content-Disposition's filename, the cli option --trust-server-names is also required. 2016-09-30 Matthew White Bugfix: Set NULL variable due to --content-disposition to Metalink origin * src/http.c (http_loop): Prevent SIGSEGV when hstat.local_file is NULL, opt.content_disposition has a role in leaving the value unset * src/http.c (gethttp): If hs->local_file is NULL (aka http_loop()'s hstat.local_file), set it to the value of hs->metalink->origin New: --trust-server-names saves Metalink/HTTP xml files using the "name" field * src/metalink.c (retrieve_from_metalink): If opt.trustservernames is true, use the basename of the metaurl's name to save the xml file * doc/metalink-standard.txt: Update doc. With --trust-server-names any Metalink/HTTP Link application/metalink4+xml file is saved using the basename of the "name" field, if any. Update Metalink/HTTP examples * testenv/Makefile.am: Add new file * testenv/Test-metalink-http-xml-trust-name.py: New file. Metalink/HTTP automated Metalink/XML, save xml files using the "name" field tests 2016-09-30 Matthew White Bugfix: Detect when a metalink:file doesn't have any hash * src/metalink.c (retrieve_from_metalink): Reject any metalink:file without hashes. Prompt the error and switch to the next file * testenv/Makefile.am: Add new file * testenv/Test-metalink-xml-nohash.py: New file. Metalink/XML with no hashes tests Prevent SIGSEGV. 2016-09-30 Matthew White Bugfix: Detect malformed base64 Metalink/HTTP Digest header * src/http.c (metalink_from_http): Fix hash_bin_len type. Use ssize_t instead than size_t. Reject -1 as base64_decode() return value * testenv/Makefile.am: Add new file * testenv/Test-metalink-http-baddigest.py: New file. Metalink/HTTP malformed base64 Digest header tests On malformed base64 input, ssize_t base64_decode() returns -1. Such value is too big for a size_t variable, and used as xmalloc() value will exaust all the memory. 2016-09-30 Matthew White New option --metalink-index to process Metalink application/metalink4+xml * NEWS: Mention the effect of --metalink-index over Metalink * src/init.c: Add new option metalinkindex (opt.metalink_index), initialize to -1 * src/main.c: Add new option metalink-index (--metalink-index=NUMBER) * src/options.h: Add new option metalink_index (int) * src/metalink.h: Add declaration of functions fetch_metalink_file(), replace_metalink_basename() * src/metalink.c: Add functions fetch_metalink_file() simple file fetch, replace_metalink_basename() replace file basename * src/metalink.c (retrieve_from_metalink): New. Process Metalink application/metalink4+xml of opt.metalink_index ordinal number * doc/wget.texi: Add new option metalink-index (--metalink-index) documentation * doc/metalink-standard.txt: Updated doc. Add documentation about Metalink application/metalink4+xml metaurls download naming system * doc/metalink-standard.txt: Update Metalink/XML and HTTP examples * testenv/Makefile.am: Add new files * testenv/Test-metalink-http-xml.py: New file. Metalink/HTTP automated Metalink/XML "application/metalink4+xml" --metalink-index tests * testenv/Test-metalink-http-xml-trust.py: New file. Metalink/HTTP automated Metalink/XML "application/metalink4+xml" --metalink-index retrieval with --trust-server-names tests WARNING: Do not use lib/dirname.c (dir_name) to get the directory name, it may append a dot '.' character to the directory name. 2016-09-30 Matthew White Bugfix: Prevent sorting when there are less than two elements * src/utils.c (stable_sort): Add condition nmemb > 1, sort only when there is more than one element Prevent SIGSEGV. 2016-09-30 Matthew White New: Parse Metalink/HTTP header for application/metalink4+xml * src/http.c (metalink_from_http): Parse Metalink/HTTP header for metaurls application/metalink4+xml media types * src/metalink.h: Add function declaration metalink_meta_cmp() * src/metalink.c: Add function metalink_meta_cmp() compare metalink metaurls priorities Add Metalink/HTTP application/metalink4+xml media types as metaurls to the metalink variable that will be used to download the files. 2016-09-30 Matthew White New test: Metalink shall not concatenate '/' to an empty directory prefix * testenv/Makefile.am: Add new file * testenv/Test-metalink-xml-emptyprefix-trust.py: New file. Metalink/XML empty directory prefix (--directory-prefix '') tests Detect a '/' character wrongfully concatenated to an empty directory prefix '' (not NULL), resulting in an absolute path as '/dir/file', instead than 'dir/file'. 2016-09-30 Matthew White Bugfix: Remove surrounding quotes from Metalink/HTTP key's value * src/metalink.h: Add declaration of function dequote_metalink_string() * src/metalink.c: Add function dequote_metalink_string() remove surrounding quotes from string, \' or \" * src/metalink.c (find_key_value, find_key_values): Call dequote_metalink_string() to remove the surrounding quotes from the parsed value * src/metalink.c (test_find_key_value, test_find_key_values): Add quoted key's values for unit-tests * testenv/Makefile.am: Add new file * testenv/Test-metalink-http-quoted.py: New file. Metalink/HTTP quoted values tests Some Metalink/HTTP keys, like "type" [2], may have a quoted value [1]: Link: ; rel=describedby; type="application/metalink4+xml" Wget was expecting a dequoted value from the Metalink module. This patch addresses this problem. References: [1] Metalink/HTTP: Mirrors and Hashes 1.1. Example Metalink Server Response https://tools.ietf.org/html/rfc6249#section-1.1 [2] Additional Link Relations 6. "type" https://tools.ietf.org/html/rfc6903#section-6 2016-09-30 Matthew White Bugfix: Process Metalink/XML url strings containing white spaces and CRLF * src/metalink.h: Add declaration of function clean_metalink_string() * src/metalink.c: Add directive #include "xmemdup0.h" * src/metalink.c: Add function clean_metalink_string() remove leading and trailing white spaces and CRLF from string * src/metalink.c (retrieve_from_metalink): Remove leading and trailing white spaces and CRLF from url resource mres->url * testenv/Makefile.am: Add new file * testenv/Test-metalink-xml-urlbreak.py: New test. Metalink/XML white spaces and CRLF in url resources tests White spaces and CRLF are not automatically removed by libmetalink from url strings. The Wget's Metalink module was unable to process such url strings. This patch implements the processing of such url strings cleaning off leading and trailing white spaces and CRLF. If a parsed Metalink/XML url string contains strings separated by CRLF, only the first of the series is accepted. 2016-09-30 Matthew White New test: Detect when there are no good Metalink url resources * testenv/Makefile.am: Add new file * testenv/Test-metalink-xml-nourls.py: New file. Metalink/XML unknown urls tests Test if when there are no good Metalink url resources there is any segmentation fault. 2016-09-30 Matthew White New: Metalink file size mismatch returns error code METALINK_SIZE_ERROR * src/wget.h (uerr_t): Add error code METALINK_SIZE_ERROR to enum * src/metalink.c (retrieve_from_metalink): Use boolean variable size_ok, when false set retr_err to METALINK_SIZE_ERROR * testenv/Makefile.am: Add new file * testenv/Test-metalink-xml-size.py: New file. Metalink/XML file size tests () Before this patch, no appropriate error code was returned to inform a file size mismatch. This patch introduces the error code METALINK_SIZE_ERROR to inform a file size mismatch. 2016-09-30 Matthew White New: Metalink/XML and Metalink/HTTP file naming safety rules * NEWS: Mention the effect of --trust-server-names over Metalink * src/metalink.h: Add declaration of function append_suffix_number() * src/metalink.c: Add function append_suffix_number() append number to string * src/metalink.c (retrieve_from_metalink): Safer Metalink/XML and Metalink/HTTP download naming system, opt.trustservernames based * doc/metalink-standard.txt: Update doc. Explain new Metalink/XML and Metalin/HTTP download naming system and --trust-server-names role * testenv/Makefile.am: Add new files * testenv/Test-metalink-xml-continue.py: Update test. Metalink/XML continue/keep existing files (HTTP 416) with --continue tests * testenv/Test-metalink-xml.py: Update test. Metalink/XML naming tests * testenv/Test-metalink-xml-trust.py: New file. Metalink/XML naming tests with --trust-server-names * testenv/Test-metalink-xml-abspath.py: Update test. Metalink/XML absolute path tests * testenv/Test-metalink-xml-abspath-trust.py: New file. Metalink/XML absolute path tests with --trust-server-names * testenv/Test-metalink-xml-relpath.py: Update test. Metalink/XML relative path tests * testenv/Test-metalink-xml-relpath-trust.py: New file. Metalink/XML relative path tests with --trust-server-names * testenv/Test-metalink-xml-homepath.py: Update test. Metalink/XML home path and ~ (tilde) tests * testenv/Test-metalink-xml-homepath-trust.py: New file. Metalink/XML home path and ~ (tilde) tests with --trust-server-names * testenv/Test-metalink-xml-prefix.py: New file. Metalink/XML naming tests with --directory-prefix * testenv/Test-metalink-xml-prefix-trust.py: New file. Metalink/XML naming tests with --directory-prefix and --trust-server-names * testenv/Test-metalink-xml-absprefix.py: New file. Metalink/XML absolute --directory-prefix tests * testenv/Test-metalink-xml-absprefix-trust.py: New file. Metalink/XML absolute --directory-prefix tests with --trust-server-names * testenv/Test-metalink-xml-relprefix.py: New file. Metalink/XML relative --directory-prefix tests * testenv/Test-metalink-xml-relprefix-trust.py: New file. Metalink/XML relative --directory-prefix tests with --trust-server-names * testenv/Test-metalink-xml-homeprefix.py: New file. Metalink/XML home --directory-prefix tests * testenv/Test-metalink-xml-homeprefix-trust.py: New file. Metalink/XML home --directory-prefix tests with --trust-server-names The option --trust-server-names allows to use the file names parsed from a Metalink/XML file. Without --trust-server-names, the safety mechanism provides secure and predictable file names. 2016-09-30 Matthew White New document: Metalink/XML and Metalink/HTTP standard reference * doc/metalink-standard.txt: New doc. Implemented and recommended Metalink/XML and Metalink/HTTP standard features 2016-09-30 Matthew White Enforce Metalink file name verification, strip directory if necessary * NEWS: Mention the use of a safe Metalink destination path * src/metalink.h: Add declaration of functions get_metalink_basename(), last_component(), metalink_check_safe_path() * src/metalink.c: Add directive #include "dosname.h" * src/metalink.c: Add function get_metalink_basename() to return the basename of a file name, strip w32's drive letter prefixes * src/metalink.c (retrieve_from_metalink): Enforce Metalink file name verification, if the file name is unsafe try its basename * doc/metalink.txt: Update document. Explain --directory-prefix The function get_metalink_basename() uses FILE_SYSTEM_PREFIX_LEN to catch any 'C:D:file' (w32 environment), then it removes each drive letter prefix, i.e. 'C:' and 'D:'. Unsafe file names contain an absolute, relative, or home path. Safe paths can be verified by libmetalink's metalink_check_safe_path(). 2016-09-27 Matthew White Implement Metalink/XML --directory-prefix option in Metalink module * NEWS: Mention the effect of --directory-prefix over Metalink * src/metalink.c (retrieve_from_metalink): Add opt.dir_prefix as prefix to the metalink:file name mfile->name * doc/metalink.txt: Update document. Explain --directory-prefix When --directory-prefix= is used, set the top of the retrieval tree to prefix. The default is . (the current directory). Metalink/XML and Metalink/HTTP files will be downloaded under prefix. 2016-09-27 Matthew White Change mfile->name to filename in Metalink module's messages * src/metalink.c (retrieve_from_metalink): Change mfile->name to filename when referring to the downloaded file The file name could have been changed by unique_create() (or by any other mean) before downloading. Use the name of the downloaded file (filename) when printing output which refer to it. 2016-09-27 Matthew White Add file size computation in Metalink module * NEWS: Mention Metalink's file size verification * src/metalink.c (retrieve_from_metalink): Add file size computation * doc/metalink.txt: Update document. Remove resolved bugs Reject downloaded files when they do not agree with their Metalink/XML metalink:size: https://tools.ietf.org/html/rfc5854#section-4.2.14 At the moment of writing, Metalink/HTTP headers do not provide a file size field. This information could be obtained from the Content-Length header field: https://tools.ietf.org/html/rfc6249#section-7 2016-09-27 Matthew White Update Metalink/XML tests and add a new test for home paths * testenv/Test-metalink-xml-relpath.py: Update test * testenv/Test-metalink-xml-homepath.py: New file. Reject home paths * testenv/Makefile.am: Add new file When --input-metalink= is used, each metalink:file name is verified by libmetalink's metalink_check_safe_path(). By design, absolute, relative and home paths are rejected. At the moment of writing, when --metalink-over-http is used, absolute, relative, and home paths aren't a concern. The destination file name is a combination of URL's file name and cli's "Directory Options" handled by src/url.c (url_file_name). 2016-09-27 Matthew White Bugfix: Keep the download progress when alternating metalink:url * NEWS: Mention the effects of --continue over Metalink * src/metalink.c (retrieve_from_metalink): On download error, resume output_stream with the next mres->url. Keep fully downloaded files started with --continue, otherwise rename/remove the file * testenv/Makefile.am: Add new file * testenv/Test-metalink-xml-continue.py: New file. Metalink/XML continue/keep existing files (HTTP 416) with --continue tests Before this patch, with --continue, existing and/or fully retrieved files which fail the sanity tests were renamed (--keep-badhash), or removed. This patch ensures that --continue doesn't rename/remove existing and/or fully retrieved files (HTTP 416) which fail the sanity tests. 2016-09-27 Matthew White Bugfix: Fix NULL filename and output_stream in Metalink module * NEWS: Mention the Metalink "path/file" name format handling * src/metalink.c (retrieve_from_metalink): Fix NULL filename, set filename to the right "path/file" value * src/metalink.c (retrieve_from_metalink): Fix NULL output_stream, set output_stream to filename when it is created by retrieve_url() * src/metalink.c (retrieve_from_metalink): Add RFC5854 comments about proper metalink:file "path/file" name format handling * doc/metalink.txt: Update document. Remove resolved bugs If unique_create() cannot create/open the destination file, filename and output_stream remain NULL. If fopen() is used instead, filename always remains NULL. Both functions cannot create "path/file" trees. Setting filename to the right value is sufficient to prevent SIGSEGV generating from testing a NULL value. This also allows retrieve_url() to create a "path/file" tree through opt.output_document. Reading NULL as output_stream, when it shall not be, leads to wrong results. For instance, a non-NULL output_stream tells when a stream was interrupted, reading NULL instead means to assume the contrary. This patch conforms to the RFC5854 specification: The Metalink Download Description Format 4.1.2.1. The "name" Attribute https://tools.ietf.org/html/rfc5854#section-4.1.2.1 2016-09-19 Matthew White Add metalink description * doc/metalink.txt Evaluation of "Directory Options" on the command line interacting with the option '--input-metalink=file': $ wget --input-metalink=file 2016-09-19 Matthew White Use python .replace instead than re.sub in Metalink tests * testenv/Test-metalink-http.py: Use python .replace * testenv/Test-metalink-xml.py: Use python .replace * testenv/Test-metalink-xml-abspath.py: Use python .replace * testenv/Test-metalink-xml-relpath.py: Use python .replace Use python .replace instead than re.sub, remove 'import re'. 2016-09-19 Matthew White Fix: Change Metalink/XML v3 file name into test.metalink * testenv/Test-metalink-xml-abspath.py: Change Metalink/XML v3 file name from test.meta4 into test.metalink * testenv/Test-metalink-xml-relpath.py: Change Metalink/XML v3 file name from test.meta4 into test.metalink * testenv/Test-metalink-xml.py: Change Metalink/XML v3 file name from test.meta4 into test.metalink 2016-09-13 Tim Rühsen Add two Metalink/XML tests * testenv/Test-metalink-xml-abspath.py: Reject absolute paths * testenv/Test-metalink-xml-relpath.py: Reject relative paths * testenv/Makefile.am: Add both new files to metalink tests 2016-09-09 Tim Rühsen Fix crash on 'srcset' inline URIs * src/html-url.c (tag_handle_img): Check append_url() for NULL return value before dereference. Crashed reproducable with parsing srcset="data:..." inline data. Reported-by: Coverity 2016-09-09 Tim Rühsen * src/hsts.c (hsts_store_open): NULL check param for fclose(). Reported-by: Coverity * src/ftp-ls.c (ftp_parse_winnt_ls): Fix memset params * src/utils.c (stable_sort): Use xmalloc instead of malloc 2016-09-08 Tim Rühsen * src/ftp-ls.c (ftp_parse_winnt_ls): Initialize struct fileinfo cur Reported-by: Coverity Add const to url param of some functions * src/http.c: Add const to first param of initialize_request(), initialize_proxy_configuration(), establish_connection(), check_file_output(), check_auth(), gethttp(), http_loop(). * src/http.h: Add const to first param of http_loop(). * Makefile.am: Remove trailing empty line * src/recur.c (retrieve_tree): Fix possible NULL dereference Reported-by: Coverity * src/http.c (initialize_request): Fix check for user Reported-by: Coverity * src/retr.c (retrieve_url): NULL check mynewloc Reported-by: Coverity * src/utils.c (stable_sort): Reduce tmp allocation size Reported-by: Coverity * Makefile.am: Add target 'check-valgrind' Fix some issues detected by Coverity * src/connect.c (connect_to_ip): Check return value of setsockopt. * src/ftp.c (ftp_retrieve_list): Check return value of chmod. * src/http.c (digest_authentication_encode): Cleanup code. * src/init.c (setval_internal): Explicitely check comind range. * src/main.c (main): Explicitely check optarg. * src/retr.c (retr_rate): Use snprintf instead sprintf, (retrieve_from_file): More verbose error message, (rotate_backups): Use snprintf instead sprintf, check return value of rename(). * src/url.c (mkalldirs): Check return value of unlink(). * src/utils.c (strdupdelim): Explicitely check beg and end for NULL, (merge_vecs): Fix sizeof argument to char *, (stable_sort): Use malloc instead of alloca. Code cleanup for --use-askpass * bootstrap.conf: Add xmemdup0 and strpbrk. * src/init.c (cmd_use_askpass): Add 'const' to char *, remove check for file existence. * src/main.c (run_use_askpass): C89 compat init of argv, added \n to error messages, fixed stripping of \n and \r from input, make run_use_askpass and use_askpass static. 2016-09-07 Tim Rühsen * src/http.c (check_file_output): Replace asprintf by aprint 2016-09-04 Tim Rühsen * testenv/README: Remove obsolete references to TEST_NAME 2016-09-03 Liam R. Howlett Add --use-askpass=COMMAND support * doc/wget.texi: Add --use-askpass to documentation. * src/init.c: Add cmd_use_askpasss to set opt.use_askpass based on argument, WGET_ASKPASS, and SSH_ASKPASS environment variables. opt.wget-askpass is freed in cleanup () * src/main.c: Update options & add spawn process of opt.use_askpass command. * src/options.h: Addition of string use_askpass. * src/url.c: Function scheme_leading_string to access the leading string of a parsed url. * src/url.h: Prototype for scheme_leading_string for returning the leading string. * bootstrap.conf: Add posix_spawn to gnulib_modules This adds the --use-askpass option which is disabled by default. --use-askpass=COMMAND will request the username and password for a given URL by executing the external program COMMAND. If COMMAND is left blank, then the external program in the environment variable WGET_ASKPASS will be used. If WGET_ASKPASS is not set then the environment variable SSH_ASKPASS is used. If there is no value set, an error is returned. If an error occurs requesting the username or password, wget will exit. 2016-09-02 Dale R. Worley Add tests for recursion and redirection. * testenv/Test-recursive-basic.py: New file. Test basic recursion * testenv/Test-recursive-include.py: New File. Recursion test with include directories * testenv/Test-redirect.py: New File. Basic redirection tests * testenv/Makefile.am: Add new tests to makefile Sort test names into order. * testenv/Makefile.am: Sort all the python tests in alphabetical order Corrections and amplifications to test documentation * testenv/README: Update documentation to meet current project status * testenv/Test-Proto.py: Same 2016-08-24 Giuseppe Scrivano Append .tmp to temporary files * src/http.c (struct http_stat): Add `temporary` flag. (check_file_output): Append .tmp to temporary files. (open_output_stream): Refactor condition to use hs->temporary instead. Reported-by: "Misra, Deapesh" Discovered by: Dawid Golunski (http://legalhackers.com) 2016-08-24 Tim Rühsen Limit file mode to u=rw on temp. downloaded files * bootstrap.conf: Add gnulib modules fopen, open. * src/http.c (open_output_stream): Limit file mode to u=rw on temporary downloaded files. Reported-by: "Misra, Deapesh" Discovered by: Dawid Golunski (http://legalhackers.com) 2016-08-21 Giuseppe Scrivano Fix some make syntax-check issues cfg.mk: Skip .der files. testenv/certs/server-template.cfg: Remove empty final line. testenv/certs/ca-template.cfg: Likewise. 2016-08-17 Tim Rühsen * src/css-url.c (get_urls_css): Fix memory leak * src/html-url.c (get_urls_html): Fix memory leak 2016-08-17 Tim Rühsen Improve PSL cookie checking * configure.ac: Add --with-psl-file to set a PSL file * src/cookies.c (check_domain_match): Load PSL_FILE with fallback to built-in data. This change allows package maintainers to make Wget use the latest PSL (DAFSA or plain text), without updating libpsl itself. E.g. Debian now comes with a DAFSA binary within the 'publicsuffix' package which allows very fast loading (no parsing or processing needed). 2016-08-10 Tobias Stoeckmann Fix stack overflow with way too many cookies * src/cookies.c (cookie_header): Use heap instead of stack. * src/http.c (request_send): Likewise. If wget has to handle an insanely large amount of cookies (~700,000 on 32 bit systems or ~530,000 on 64 bit systems), the stack is not large enough to hold these pointers, leading to undefined behaviour according to POSIX; expect a segmentation fault in real life. ;) 2016-08-09 Tobias Stoeckmann Fix signal race condition The signal handler for SIGALRM calls longjmp, but the handler is installed before the jump target has been initialized. If another process sends SIGALRM right between handler installation and target initialization, the jump leads to undefined behavior. This can easily be fixed by moving the signal handler installation into the "SETJMP == 0" conditional block, which means that the target has just been initialized. * src/utils.c: call signal after SETJMP. 2016-08-05 Jeffery To Remove hyphens from command names * src/init.c: Remove hyphens from command names * src/main.c: Likewise Options with hyphens (or underscores) in their command name cannot be set in a wgetrc file. 2016-08-04 Tim Rühsen * src/metalink.c (badhash_suffix): Fix quoting 2016-08-04 Matthew White Add new option --keep-badhash to keep Metalink's files with a bad hash * src/init.c: Add keepbadhash * src/main.c: Add keep-badhash * src/options.h: Add keep_badhash * doc/wget.texi: Add docs for --keep-badhash * src/metalink.h: Add prototypes badhash_suffix(), badhash_or_remove() * src/metalink.c: New functions badhash_suffix(), badhash_or_remove(). (retrieve_from_metalink): Call badhash_or_remove() on download error With --keep-badhash, append .badhash to Metalink's files with checksum mismatch. (retrieve_from_metalink): unique_create() may append another suffix to avoid overwriting existing files. Without --keep-badhash, remove downloaded files with checksum mismatch (this conforms to the old behaviour). 2016-08-03 Tim Rühsen * src/metalink.c: Remove C++ style comments 2016-08-03 Matthew White Add gnulib modules 'link', 'unlink' and 'symlink' * bootstrap.conf: Add 'link', 'unlink' and 'symlink' 2016-08-03 Matthew White New: --continue continues partially downloaded Metalink's files * src/metalink.c (retrieve_from_metalink): Continue file download if opt.always_rest is true Without --continue, download as a new file with an unique name (this conforms to the old behaviour). 2016-08-03 Matthew White Add support for Metalink's md2, and md4 hashes * bootstrap.conf: Add crypto/md2, and crypto/md4 * src/metalink.c (retrieve_from_metalink): Add md2, and md4 support This patch adds support for the deprecated (insecure) md2, and md4 Message-Digest algorithms to the Metalink module. 2016-08-03 Matthew White Add support for Metalink's md5, sha1, sha224, sha384, and sha512 hashes * bootstrap.conf: Add crypto/sha512 * src/metalink.c (retrieve_from_metalink): Add md5, sha1, sha224, sha384, and sha512 support Metalink's checksum verification was limited to sha256. This patch adds support for md5, sha1, sha224, sha384, and sha512. 2016-07-27 Sean Burford Style fixes and DEBUG on setxattr failure. * src/ftp.c: Fix style. * src/http.c: Likewise. * src/xattr.h: Likewise. * src/xattr.c: Likewise, (write_xattr_metadata): Print debug msg on error. 2016-07-22 Sean Burford Keep fetched URLs in POSIX extended attributes * configure.ac: Check for xattr availability * src/Makefile.am: Add xattr.c * src/ftp.c: Include xattr.h. (getftp): Set attributes if enabled. * src/http.c: Include xattr.h. (gethttp): Add parameter 'original_url', set attributes if enabled. (http_loop): Add 'original_url' to call of gethttp(). * src/init.c: Add new option --xattr. * src/main.c: Add new option --xattr, add description to help text. * src/options.h: Add new config member 'enable_xattr'. * src/xatrr.c: New file. * src/xattr.h: New file. These attributes provide a lightweight method of later determining where a file was downloaded from. This patch changes: * autoconf detects whether extended attributes are available and enables the code if they are. * The new flags --xattr and --no-xattr control whether xattr is enabled. * The new command "xattr = (on|off)" can be used in ~/.wgetrc or /etc/wgetrc * The original and redirected URLs are recorded as shown below. * This works for both single fetches and recursive mode. The attributes that are set are: user.xdg.origin.url: The URL that the content was fetched from. user.xdg.referrer.url: The URL that was originally requested. Here is an example, where http://archive.org redirects to https://archive.org: $ wget --xattr http://archive.org ... $ getfattr -d index.html user.xdg.origin.url="https://archive.org/" user.xdg.referrer.url="http://archive.org/" These attributes were chosen based on those stored by Google Chrome https://bugs.chromium.org/p/chromium/issues/detail?id=45903 and curl https://github.com/curl/curl/blob/master/src/tool_xattr.c 2016-07-02 Noël Köthe Fix typos * ChangeLog-2014-12-10: invokation -> invocation * doc/wget.texi: invokation -> invocation * src/main.c: seperated -> separated * src/options.h: seperated -> separated * testenv/README: invokation -> invocation * testenv/conf/wget_commands.py: invokation -> invocation 2016-06-30 Tim Rühsen Fix creating docs when make uses 'sh -e' * doc/Makefile.am: Save fallback for pod2man --utf8 Reported-by: Jérémie Courrèges-Anglas 2016-06-30 Tim Rühsen Fix compilation for OpenSSL 1.1.0 * src/openssl.c (ssl_init): Use SSL_is_init_finished() instead of SSL_state(), conditionally skip SSLeay function calls The python test suite makes SSL_peek() hang, consuming 100% CPU time. This does not happen on real world TLS connections, though, but needs investigations. 2016-06-29 Tim Rühsen Add script to generate test certs non-interactive * Test-pinnedpubkey-hash-https.py: Read hashed pubkey from file * Test-pinnedpubkey-hash-no-check-fail-https.py: Use invalid hash * certs/make_ca.sh: New script to generate test certs non-interactive * certs/ca-template.cfg: New file (template for CA cert) * certs/server-template.cfg: New file (template for server cert) * certs/server-pubkey-sha256.base64: New file (pubkey sha256 hash) 2016-06-27 Ander Juaristi Bypass world-writable checks on Windows * src/hsts.c (hsts_file_access_valid): we should check for "world-writable" files only on Unix-based systems. It's difficult to mimic the same behavior on Windows, so it's better to just not do it. Reported-by: Gisle Vanem Reported-by: Eli Zaretskii 2016-06-14 Tim Rühsen Update gnulib and bootstrap * gnulib: Sync gnulib submodule with upstream * bootstrap: Update to latest version from gnulib/build-aux/ 2016-06-12 Tim Rühsen Use ICONV_CONST to avoid type warning for iconv() * src/iri.c (do_conversion): Cast 2. param of iconv() to 'ICONV_CONST char **' * src/url.c (convert_fname): Likewise Remove check for HAVE_ICONV in src/url.c * src/url.c: Remove check for HAVE_ICONV Add gnulib module 'langinfo' * bootstrap.conf: Add 'langinfo' Include gnulib fcntl.h instead of sys/fcntl.h * src/gnutls.c: Include gnulib fcntl.h Add libraries to LDADD for wget * src/Makefile.am: Add $(GETADDRINFO_LIB) $(HOSTENT_LIB) $(INET_NTOP_LIB) $(LIBSOCKET) $(LIB_CLOCK_GETTIME) $(LIB_CRYPTO) $(LIB_SELECT) $(LTLIBICONV) $(LTLIBINTL) $(LTLIBTHREAD) $(SERVENT_LIB) to LDADD 2016-06-10 Tim Rühsen Explicitely mention Python3 in README.checkout * README.checkout: Python test suite needs Python3 2016-06-09 Giuseppe Scrivano NEWS: update 2016-06-09 Giuseppe Scrivano ftp: understand --trust-server-names on a HTTP->FTP redirect If not --trust-server-names is used, FTP will also get the destination file name from the original url specified by the user instead of the redirected url. Closes CVE-2016-4971. * src/ftp.c (ftp_get_listing): Add argument original_url. (getftp): Likewise. (ftp_loop_internal): Likewise. Use original_url to generate the file name if --trust-server-names is not provided. (ftp_retrieve_glob): Likewise. (ftp_loop): Likewise. 2016-06-07 Tim Rühsen Fix warnings for --disable-iri configure flag * src/iri.h: Fix #define for parse_charset * src/html-url.c: Surround some IRI code parts by #ifdef ENABLE_IRI * src/http.c: Likewise * src/iri.h: Likewise * src/recur.c: Likewise * src/retr.c: Likewise Fix warning about redefinition of MAP_FAILED * src/sysdep.h: Removed definition of MAP_FAILED * src/utils.c: Check and define MAP_FAILED after including sys/mmap.h 2016-06-03 Tim Rühsen Add new Test for missing scheme behavior * testenv/Makefile.am: Add Test-missing-scheme-retval.py * testenv: New file Test-missing-scheme-retval.py Idea and Perl implementation by Zdenek Dohnal 2016-05-27 Tim Rühsen Fallback to pod2man without utf-8 on error * doc/Makefile.am: Fallback to pod2man without utf-8 on error 2016-05-26 Ander Juaristi Correct HSTS debug message * src/main.c (save_hsts): save the in-memory HSTS database to a file only if something changed. * src/hsts.c (struct hsts_store): new field 'changed'. (hsts_match): update field 'changed' accordingly. (hsts_store_entry): update field 'changed' accordingly. (hsts_store_has_changed): new function. * src/hsts.h (hsts_store_has_changed): new function. Check the HSTS file is not world-writable * hsts.c (hsts_file_access_valid): check that the file is a regular file, and that it's not world-writable. (hsts_store_open): if the HSTS database file does not meet the above requirements, disable HSTS at all. 2016-05-24 Tim Rühsen Improve description of Perl libraries needed for test suite * README.checkout: Improve text about Perl libraries 2016-05-23 Tim Rühsen Remove special handling for Emacs in progress bar code * src/progress.c: Remove special 'emacs' code Fixes #47989 2016-04-25 Jernej Simončič Fix xsleep() for Windows (trivial change) * src/mswindows.c (xsleep): Fix check for number of seconds 2016-04-17 Sergio Gelato More accurate log message from do_conversion() * src/iri.c (do_conversion): More accurate log message 2016-04-17 Tim Rühsen Include sys/select.h if HAVE_LIBCARES * src/hosts.c: Include sys/select.h if HAVE_LIBCARES Reported-by: Gisle Vanem 2016-04-17 Gisle Vanem Fix Windows gnulib/c-ares incompatibility of select() * src/host.c: Undef 'select' on Windows 2016-04-15 Tim Rühsen Set X flags for python tests * testenv/*.py: Set eXecute flags Regression from commit 926e42d4678689195a0bbed210c6d027db7cc390 2016-04-11 Ander Juaristi Strictly comply with RFC 6797 * src/hsts.c (hsts_store_entry): strictly comply with RFC 6797. RFC 6797 states in section 8.1 that the UA's cached information should only be updated if: "either or both of the max-age and includeSubDomains header field value tokens are conveying information different than that already maintained by the UA." 2016-04-11 Ander Juaristi Correct HSTS database file description * src/hsts.c (hsts_store_dump): s/[:port]// 2016-04-11 moparisthebest Implement tests for new pinnedpubkey option * testenv/Makefile.am: Add new tests * testenv/Test-pinnedpubkey-der-https.py: New test * testenv/Test-pinnedpubkey-der-no-check-https.py: New Test * testenv/Test-pinnedpubkey-hash-https.py: New test * testenv/Test-pinnedpubkey-hash-no-check-fail-https.py: New test * testenv/Test-pinnedpubkey-pem-fail-https.py: New test * testenv/Test-pinnedpubkey-pem-https.py: New test * testenv/certs/README: How to generate public keys with openssl tool * testenv/certs/server-pubkey.der: New key file (DER format) * testenv/certs/server-pubkey.pem: New key file (PEM format) Implement --pinnedpubkey option to pin public keys * doc/wget.texi: Add description for --pinnedpubkey * src/gnutls.c: New function pkp_pin_peer_pubkey(), (ssl_check_certificate): Check pinned cert via pkp_pin_peer_pubkey() * src/init.c: Add option --pinnedpubkey * src/main.c: Add option --pinnedpubkey * src/openssl.c: New function pkp_pin_peer_pubkey(), (ssl_check_certificate): Check pinned cert via pkp_pin_peer_pubkey() * src/options.h: Add new option variable 'pinnedpubkey' * src/utils.c: New functions wg_pubkey_pem_to_der(), wg_pin_peer_pubkey() * src/utils.h: Add prototype for wg_pin_peer_pubkey() 2016-04-11 Tim Rühsen Use test file name for temp working directory name * testenv/test/base_test.py (__init__): Use test file name for name, remove 'name' parameter * testenv/test/http_test.py (__init__): Remove 'name' parameter * testenv/*.py: Remove TEST_NAME Using a fixed string (TEST_NAME) to build the working directory for testing caused random failures (or successes) when tests share the same TEST_NAME value. Not easy to find without digging into the python test suite code. We now use the test file name, which is unique within the test environment. 2016-04-11 Tim Rühsen Fix testenv/Test--rejected-log.py * testenv/Test--rejected-log.py: Add missing tabs in expected output Fail python tests when post_hook errors * testenv/test/base_test.py (__exit__): Return self.tests_passed (__test_cleanup): Set self.tests_passed to False on exception 2016-03-29 Darshit Shah Print the fingerprint instead of the raw pointer in debugging message * src/metalink.c (retrieve_from_metalink): Fix debug message to print the fingerprint instead of a pointer. * Do not delete the ChangeLog file since it is required by the Makefile and breaks compilation 2016-03-29 Darshit Shah Revert "Print the fingerprint instead of the raw pointer in debugging message" This reverts commit b916595168b6eb0f8868a67a9d214d5e0022871f. 2016-03-28 Tim Rühsen Fixed URLs and references in wget.texi * wget.texi: Replace server.com by example.com, replace ftp://wuarchive.wustl.edu by https://example.com, use HTTPS instead of HTTP where possible, fix list archive reference, remove reference to wget-notify@addictivecode.org, change bugtracker URL to bugtracker on Savannah, replace yoyodyne.com by example.com, fix URL to VMS port 2016-03-25 Giuseppe Scrivano * metalink.c (retrieve_from_metalink): Fix typo Print the fingerprint instead of the raw pointer in debugging message * src/metalink.c (retrieve_from_metalink): Fix debug message to print the fingerprint instead of a pointer. 2016-03-23 Tim Rühsen Add options --bind-dns-address and --dns-servers * README.checkout: Add description for libares * configure.ac: Add check for libares * doc/wget.texi: Add docs for the new options * src/build_info.c.in: Add +/-cares for --version output * src/host.c: (merge_address_lists): New static function (address_list_from_hostent): New static function (wait_ares): New static function (callback): New static function (lookup_host): Add libares resolver code * src/init.c: Add new options, (cleanup): Add cleanup code * src/main.c: Add global libares channel variable (cmdline_option option_data): Add new options (print_help): Add short descriptions (main): Add libares init code * src/options.h (struct options): Add option members The new options allow to specify alternative DNS servers and an alternate packet route for the resolver packets. Wget has to built with libares, enabled at configure time by ./configure --with-cares. 2016-03-16 Tim Rühsen Fix SNI server names with trailing dot(s) * src/gnutls.c (ssl_connect_wget, ssl_check_certificate): Fix SNI server name * src/openssl.c (ssl_connect_wget, ssl_check_certificate): Fix SNI server name Fixes #47408 2016-03-10 Tim Rühsen Fix links to original Robots Exclusion Standard * doc/wget.texi: Fix links 2016-03-05 Darshit Shah Fix assertion in Progress bar * src/progress.c (create_image): Fix off-by-one error in assert() statement for progress bar width. Reported-By: Gisle Vanem 2016-03-03 Giuseppe Scrivano src/url.c: fix make syntax-check 2016-03-03 Maks Orlovich Parse attributes, they have image URLs. * src/convert.h: Add link_noquote_html_p to permit rewriting URLs deep inside attributes without adding extraneous quoting * src/convert.c (convert_links): Honor link_noquote_html_p * src/html_url.c (tag_handle_img): New function. Add srcset parsing. 2016-03-01 Darshit Shah Sanitize value sent to memset to prevent SEGFAULT 2016-03-01 Darshit Shah Update documentation about bahviour of -c * docs/wget.texi: -c will restart download from scratch if server does not support RANGE. Reported-By: David Chavez http://stackoverflow.com/questions/30147332/unexpected-behavior-of-wget 2016-02-27 Tim Rühsen Fix writing WARC-Target-URI value src/warc.c: Add function warc_write_header_uri(), Use it for creating WARC-Target-URI Fixes #47281 2016-02-13 Darshit Shah Remove pointer to unused Mailing List * MAILING-LISTS: Remove pointer to old, unused mailing list for bug reports 2016-02-11 Tim Rühsen Retain value of errno in logprintf(), logputs() even better * src/log.c (logprintf,logputs): Save&Restore value of errno Reported-by: Gisle Vanem 2016-02-10 Tim Rühsen Retain value of errno in logprintf() * src/log.c (logprintf): Save&Restore value of errno Reported-by: Gisle Vanem 2016-02-01 Tim Rühsen Set AM_SILENT_RULES to yes by default * configure.ac: Set AM_SILENT_RULES to yes by default 2016-02-01 Ander Juaristi Enforce 'RejectHeader' rule in tests * server/http/http_server.py (_Handler.RejectHeader): enforce 'RejectHeader' rule. 2015-12-20 Tim Rühsen Fix Test-iri-forced-remote * tests/Test-iri-forced-remote.px: Fix encodings 2015-12-18 Eli Zaretskii Support non-ASCII URLs * src/url.c [HAVE_ICONV]: Include iconv.h and langinfo.h. (convert_fname): New function. [HAVE_ICONV]: Convert file name from remote encoding to local encoding. (url_file_name): Call convert_fname. (filechr_table): Don't consider bytes in 128..159 as control characters. * tests/Test-ftp-iri.px: Fix the expected file name to match the new file-name recoding. State the remote encoding explicitly on the Wget command line. * NEWS: Mention the URI recoding when built with libiconv. 2015-12-18 Giuseppe Scrivano * NEWS: Prepare new development cycle 2015-12-17 Tim Rühsen Cleanup code * src/iri.c (do_conversion): Code cleanup 2015-12-17 Eli Zaretskii Set URI encoding when redirected * src/retr.c (retrieve_url): Set URI on redirection 2015-12-17 Tim Rühsen Remove requesting X/Open 5, POSIX 1995 * src/sysdep.h: Remove #define _XOPEN_SOURCE 500 2015-12-16 Eli Zaretskii Avoid hanging on MS-Windows when invoked with --connect-timeout * src/connect.c (connect_to_ip) [WIN32]: Don't call fd_close if the connection timed out, to avoid hanging. 2015-12-15 Tim Rühsen Fix iconv conversion * src/iri.c: Kick out the last converted character from iconv() Thanks to Eli Zaretskii for suggesting the fix. Reported-by: "Andries E. Brouwer" 2015-12-14 Tim Rühsen Let Test-k survive on CygWin and Windows * tests/Test-k.px: Use --restrict-file-names for CygWin/Windows filename requirements. 2015-12-13 Ander Juaristi Fix leak in HSTS code * src/hsts.c (hsts_store_open): close fp if open. Remove unused variable in ftp code * src/ftp.c (getftp): fix compiler warning for unused variable. 2015-12-11 Giuseppe Scrivano Tag release 1.17.1 * NEWS: Update. * gnulib: sync from upstream. 2015-12-11 Tim Rühsen Remove ABOUT-NLS from git * ABOUT-NLS: Remove from repository, it will be autogenerated. * .gitignore: Add ABOUT-NLS and build-aux/ar-lib 2015-12-11 Jernej Simončič * src/metalink.c: Specify 'rb' as mode to open file 2015-12-10 Ángel González * doc/wget.texi: add hint for self-signed certificates 2015-12-10 Ander Juaristi Fix Coverity issues * src/ftp.c (getftp): on error, close the file and attempt to remove it before exiting. * src/hsts.c (hsts_store_open): update modification time in the end. 2015-12-10 Darshit Shah Faster Travis Builds * contrib/travis-ci: Test under different languages only when all the features are enabled. This covers the maximum number of strings. For the other option permutations, test only in the default C locale Fix remaining bugs in progress bar implementation * src/progress.c (create_image): Ensure that the entire screen width is drawn everytime to prevent any artefacts from leaking through. 2015-12-09 Darshit Shah Add bug-wget to list of travis recipients * .travis.yml: Add bug-wget to list of Travis Report Recipients Eliminate more compiler warnings * src/options.h (CHECK_CERT_MODES): Remove C99 style comma after last value * src/progress.c (create_image): Do not mix statements and declarations * src/init.c (cmd_boolean_internal): Mark unused parameters Re-enable test on multibyte locale in Travis * .travis.yml: Use Russian locale instead of Japanese since it seems to be more complete * contrib/travis-ci: Re-enable testing on a Russian locale Fix progress bar assertion with multibyte locales * src/progress.c (bar_create): Define size of progress buffer explicitly (create_image): Clean up progress bar image creation. Use memset instead of for loops to create arrays of the same byte. 2015-12-06 Darshit Shah Introduce Travis Integration * .travis.yml: Configuration file for Travis-CI * contrib/travis-ci: Script to run on travis. Similar to check-hard but modified for travis. * tests/valgrind-suppressions{-ssl}: Add extra suppressions to prevent a Valgrind False Positive errors in an old version Since Travis currently supports only public repositories on GitHub, the support for automated testing through Travis will be done using my Clone of Wget on GitHub at: https://github.com/darnir/wget.git Any commits pushed to this repository will trigger a build on Travis. 2015-12-04 Tim Rühsen SKIP SSL/TLS tests if configured without it * testenv/Makefile.am: Set SSL_TESTS env variable * testenv/Test--https-crl.py, testenv/Test--https.py, testenv/Test-hsts.py: Return 77 (SKIP) if SSL/TLS is not configured 2015-12-03 Tim Rühsen Add Test-hsts.py to SSL_TESTS * testenv/Makefile.am: Add Test-hsts.py to SSL_TESTS 2015-12-03 Ygal Blum Fix compilation when without-ssl is selected 2015-12-03 Darshit Shah Include Metalink and GPG information in version * src/build_info.c.in: Include the presence of Metalink and GPGME features in the output for wget --version 2015-12-03 Giuseppe Scrivano Add --check-certificate=quiet * doc/wget.texi: Add documentation for --check-certificate=quiet. * src/options.h (enum CHECK_CERT_MODES): New enum. * src/init.c (cmd_check_cert): New static function. (cmd_boolean_internal): Likewise. * src/gnutls.c (ssl_check_certificate): Handle CHECK_CERT_QUIET. * src/openssl.c (ssl_check_certificate): Handle CHECK_CERT_QUIET. 2015-11-24 Tim Rühsen Fix regression in HTTP authentication * src/http.c (initialize_request): Fix wrong params to search_netrc() Regression introduced in commit 29850e77 Reported-by: Axel Reinhold 2015-11-23 Tim Rühsen Fix SIGSEGV in -N / --content-disposition combination * src/http.c (http_loop): Fix SIGSEGV Reported-by: "Schleusener, Jens" 2015-11-20 Ander Juaristi Fix potential NULL pointer dereference * src/gnutls.c (ssl_connect_wget): check for NULL before calls 2015-11-20 Giuseppe Scrivano * configure.ac: change gettext version to 0.18.1 2015-11-19 Ikey Doherty configure.ac: Use correct gettext version 2015-11-19 Tim Rühsen Fix HSTS memory issue + test code issue * src/hsts.c (hsts_find_entry): Fix freeing memory (hsts_remove_entry): Remove freeing host member (hsts_match): Free host member here (hsts_store_entry): Free host member here (test_url_rewrite): Fix 'created' value (test_hsts_read_database): Fix 'created' value Reported-by: Dagobert Michelsen 2015-11-17 Tim Rühsen Include errno.h instead of sys/errno.h (Solaris issue) * src/metalink.c: Include errno.h instead of sys/errno.h Reported-by: Dagobert Michelsen 2015-11-17 Darshit Shah Fix compile error when IPv6 is disabled * src/ftp-basic.c: The code for the new FTPS functionality was unintentionally inside a #ifdef IPV6 block. Move the code around so that it is defined even when IPV6 isn't used Use gnulib module flock to provide function * bootstrap.conf: Use module flock from gnulib to provide it on other platforms such as Windows. 2015-11-16 Darshit Shah Eliminate NDEBUG redefined warnings * src/wget.h: Define NDEBUG only if it hasn't been defined before 2015-11-15 Giuseppe Scrivano NEWS: prepare new release cycle Prepare release 1.17 * gnulib: sync with upstream. * NEWS: Update. * src/main.c: Change the copyright year. 2015-11-03 Tim Rühsen Document combination of -nc and -O Fixes #46359 2015-11-03 Tim Rühsen Do not download/save file on error when --spider enabled * src/http.c (gethttp,http_loop): Do not download/save file on error when --spider is enabled and not working recursive. Reported-by: Сковорода Никита Андреевич chalkerx@gmail.com Fixes #45821 2015-10-27 Tim Rühsen Fix URL conversion for colons in filenames * src/convert.c (construct_relative): Prepend './' to filename * tests/Test-k.px: Amend test to succeed 2015-10-15 Tim Rühsen Adjust indentation of --no-use-server-timestamps in help output * src/main.c: Adjust indentation of --no-use-server-timestamps 2015-10-13 Ander Juaristi Added --convert-file-only option * src/convert.c (convert_links_in_hashtable, convert_links): test for CO_CONVERT_BASENAME_ONLY. (convert_basename): new function. * src/convert.h: new constant CO_CONVERT_BASENAME_ONLY. * src/init.c, src/main.c, src/options.h: new option "--convert-file-only". * doc/wget.texi: updated documentation. Reviewed-by: Gabriel Somlo 2015-10-12 Darshit Shah Fix Test-ftp-pasv-not-supported.px * tests/Test-ftp-pasv-not-supported.px: We do *NOT* expect any downloaded files. Also, do not negate the Test response. The test originally expected a downloaded file, but this is not true. As a result, the test would fail and return exit code 1. This was presumably the reason why the test result was negated before returning to the shell. Fix this issue, so that the test runs correctly without any hacks. 2015-10-12 Darshit Shah Fix make distcheck failures in Perl SSL Tests * tests/Makefile.am: Add valgrind-suppressions-ssl to EXTRA_DIST * tests/Test-proxied-https-auth-keepalive.px: Find valgrind in correct path during make distcheck * tests/Test-proxied-https-auth.px: Same 2015-10-12 christian fafard Skip HTTPS perl tests if IO::Socket::SSL not installed * tests/Test-proxied-https-auth-keepalive.px: Skip test if perl module IO::Socket::SSL is not installed (trivial change). * tests/Test-proxied-https-auth.px: Skip test if perl module IO::Socket::SSL is not installed (trivial change). 2015-10-09 Ander Juaristi Fix potential race condition * src/hsts.c (hsts_read_database): get an open file handle instead of a file name. (hsts_store_dump): get an open file handle instead of a file name. (hsts_store_open): open the file and pass the open file handle. (hsts_store_save): lock the file before the read-merge-dump process. Reported-by: Daniel Kahn Gillmor 2015-10-09 Ander Juaristi Fix HSTS merge bug * src/hsts.c (hsts_store_merge): call hsts_new_entry() if the entry does not exist in the database. When merging the existing HSTS database on disk with the one on memory, the entries that were on disk but not on memory were ignored. Thus, only the existing entries were merged. This behavior was only triggered when more than one Wget processes were using the same HSTS database simultaneously. This commit fixes the bug by adding the new entries to the on-memory database if they were not found there. 2015-09-28 Giuseppe Scrivano testenv/Test--rejected-log.py: Remove trailing white spaces 2015-09-28 Tim Rühsen Handle TLS rehandshakes in GnuTLS code * src/gnutls.c: New static function _do_handshake() * src/gnutls.c (wgnutls_read_timeout): Handle rehandshake * src/gnutls.c (wgnutls_write): Handle rehandshake * src/gnutls.c (ssl_connect_wget): Move handshake code into _do_handshake() Fixes #46061 2015-09-22 Darshit Shah Add tests for missing qop in digest auth * testenv/test-auth-both.py: Add qop parameter for digest auth * testenv/test-auth-digest.py: Same * testenv/conf/authentication.py: Support additional parameters for authentication * testenv/servers/http/http_server.py: Same Do not test for impossible qop value * http.c (digest_authentication_encode): Wget already errors out if qop != "auth". Then it makes no sense to test for qop == "auth-int" later on. Currently, Wget does not support the "auth-int" qop value and till nobidy requests, it may remain so. 2015-09-22 Darshit Shah Fix #46024. Support RFC 2069 Digest Authentication * http.c (digest_authentication_encode): Some servers are still using the obsolete RFC 2069 Digest Authentication. Allow Digest authentication without the qop parameter for this. Reported-by: Andreas Longwitz 2015-09-21 Darshit Shah Revert "Disable progress bar when wget is backgrounded (trivial patch)" This reverts commit e6247325633d7d0007906f88d573a3bfa37307a7. 2015-09-20 Giuseppe Scrivano NEWS: cite FTPS support 2015-09-14 Ander Juaristi Added support for FTPS * doc/wget.texi: updated documentation to reflect the new FTPS functionality. * src/ftp-basic.c (ftp_greeting): new function to read the server's greeting. (ftp_login): greeting code was previously here. Moved to ftp_greeting to support FTPS implicit mode. (ftp_auth): wrapper around the AUTH TLS command. (ftp_ccc): wrapper around the CCC command. (ftp_pbsz): wrapper around the PBSZ command. (ftp_prot): wraooer around the PROT command. * src/ftp.c (get_ftp_greeting): new static function. (init_control_ssl_connection): new static function to start SSL/TLS on the control channel. (getftp): added hooks to support FTPS commands (RFCs 2228 and 4217). (ftp_loop_internal): test for new FTPS error codes. * src/ftp.h: new enum 'prot_level' with available FTPS protection levels + prototypes of previous functions. New flag for enum 'wget_ftp_fstatus' to track whether the data channel has some security mechanism enabled or not. * src/gnutls.c (struct wgnutls_transport_context): new field 'session_data'. (wgnutls_close): free GnuTLS session data before exiting. (ssl_connect_wget): save/resume SSL/TLS session. * src/http.c (establish_connection): refactor ssl_connect_wget call. (metalink_from_http): take into account SCHEME_FTPS as well. * src/init.c, src/main.c, src/options.h: new command line/wgetrc options. (main): in recursive downloads, check for SCHEME_FTPS as well. * src/openssl.c (struct openssl_transport_context): new field 'sess'. (ssl_connect_wget): save/resume SSL/TLS session. * src/retr.c (retrieve_url): check new scheme SCHEME_FTPS. * src/ssl.h (ssl_connect_wget): refactor. New parameter of type 'int *'. * src/url.c. src/url.h: new scheme SCHEME_FTPS. * src/wget.h: new FTPS error codes. * src/metalink.h: support FTPS scheme. 2015-09-10 Christian Neukirchen Disable progress bar when wget is backgrounded (trivial patch) * src/progress.c (create_image): progress only when in foreground Sometimes I start wget, but the remote site is too slow, so I rather want to run it in background, however when I simply use job control for that, wget will keep spewing the progress bar all over my terminal. I have found the SIGHUP/SIGUSR1 feature to redirect output to a log file, but I think the following small patch is even more useful, since the progress bar will simply resume when wget is foregrounded again (also, the final message is still printed to the terminal in any case): 2015-09-04 Hubert Tarasiuk Add information about libmetalink and GnuPG * README.checkout: Optional dependencies and URL references. 2015-09-02 Hubert Tarasiuk Do not free Metalink structure if not initialized * src/main.c (main): Move metalink_delete to the conditional block. 2015-09-01 Ander Juaristi Updated HSTS documentation * doc/wget.texi: updated HSTS documentation. Reported-by: Daniel Kahn Gillmor 2015-09-01 Ander Juaristi Extra debug traces for HSTS. * src/main.c (load_hsts, save_hsts): added DEBUGP() calls to signal reads and saves of the HSTS database file. 2015-08-31 Darshit Shah Fix coding style violation in last commit * http.c (test_parse_range_header): Declare loop variable explicitly. Not in gnu99 standard. 2015-08-30 Darshit Shah Add unit test for parse_content_range() method * http.c (test_parse_range_header): New function to test the function for parsing the HTTP/1.1 Content-Range header. * test.[ch]: Same * http.c (parse_content_range): Fix parsing code. Fail on scenarios mentioned in rfc 7233. 2015-08-30 Tim Rühsen Fix null pointer dereference * src/metalink.c (gpg_skip_verification): Check output_stream before fclose Fix leaks found by Coverity * src/http.c (parse_strict_transport_security): Free c_max_age (open_output_stream): Fix indentation * src/iri.c (locale_to_utf8): Free new 2015-08-29 Tim Rühsen Fix two leaks foudn by Coverity * src/http.c (gethttp): Do not leak 'message'. * src/main.c (format_and_print_line): Do not leak 'line_dup'. Fix resource leak discovered by Coverity * src/retr.c (retrieve_url): Don't leak local_file. 2015-08-29 Darshit Shah Fix memory leaks in unit-test * hsts.c (get_hsts_store_filename): Free the homedir value (close_hsts_test_store): Actually free the store struct too (test_hsts_new_entry): Pass store to close_hsts_test_store() (test_hsts_url_rewrite_superdomain): Same (test_hsts_url_rewrite_congruent): Same (test_hsts_read_database): Same and homedir and store filename * http.c (test_parse_content_disposition): Free the returned filename * url.c (test_append_uri_pathel): Free allocated string Fix mixed-indentation in http.c * http.c: Fix mix indentation. Visual change only. 2015-08-27 Tim Rühsen Suppress debug output when strings may contain password * iri.c (do_conversion): Do not print out converted strings if they contain an '@'. That could be an URL with embedded password. Fixes #45825 2015-08-26 Ander Juaristi Fix resource leak. * src/http.c (parse_strict_transport_security): Freed memory to avoid resource leak. Comply with GNU coding style. 2015-08-22 Tim Rühsen Fix typo in NEWS * NEWS: Change typo --accept-reject into --accept-regex Reported-by: grarpamp 2015-08-21 Jookia <166291@gmail.com> Clarify that links are being converted. * src/convert.c: Add 'links in' after 'Converted %d' and 'Converting %s'. Removed useless TODOs. * testenv/Test--reject-log.py: Remove TODOs. 2015-08-15 Miquel Llobet Fixed #44516 -o- not logging to stdout src/log.c (log_init): check for hypen on filename, set stdout 2015-08-15 Daniele Calore Fix #40426: Allow -r -O- only if FILE is regular * main.c: added check of "-r -O FILE" option combination allow only if FILE is a regular file (bug #40426) 2015-08-15 Darshit Shah Fix var name conflicts with math.h and wingdi.h * src/recur.c (reject_reason): Rename all enum members to WG_RR_xx. * src/recur.c (retrieve_tree, download_child, write_reject_log_reason): Same 2015-08-14 Tim Rühsen Let bootstrap/autoreconf work without GPGME installed * configure.ac: Check for existance of AM_PATH_GPGME 2015-08-11 Tim Rühsen Fix IP address exposure in FTP code * src/ftp.c (getftp): Do not use PORT when PASV fails. * tests/FTPServer.px: Add pasv_not_supported server flag. * tests/Makefile.am: Add Test-ftp-pasv-not-supported.px * tests/Test-ftp-pasv-not-supported.px: New test Fix IP address exposure when automatically falling back from passive mode to active mode (using the PORT command). A behavior that may be used to expose a client's privacy even when using a proxy. 2015-08-07 Tim Rühsen Fix C89 compliancy in HSTS test code * src/hsts.c (test_hsts_new_entry): Move variable assignment before code Fix C89 compliancy in latest code * src/recur.c: Declare variables before code (write_reject_log_url): Use const keyword where appropriate Use the 'default' switch statement Use xfree() instead of free() Renamed variable f -> fp (write_reject_log_reason): Use const keyword where appropriate Use the 'default' switch statement Renamed variable f -> fp Renamed variable r -> reason 2015-08-07 Tim Rühsen Remove redundant definition of _GNU_SOURCE * src/warc.c: Remove definition of _GNU_SOURCE _GNU_SOURCE is already defined in config.h 2015-08-07 Giuseppe Scrivano NEWS: update 2015-08-07 Jookia <166291@gmail.com> Rewrite the --rejected-log test using the new framework. * tests/Test--rejected-log.px: Remove old test. * testenv/Test--rejected-log.py: Create new test. Replace variables before comparing expected files. * expected_files.py: Use formatted_content instead of file.content. 2015-08-06 Jookia <166291@gmail.com> Add option to write URL rejections to a tab-delimited CSV log. * main.c: Add "--rejected-log" option. * init.c: Add "rejectedlog" command. * options.h: Add "rejected_log" parameter string. * wget.texi: Add brief documentation on new --rejected-log option. * recur.c: Optionally log details of URLs not traversed. Add reject_reason enum. (download_child_p -> download_child): Return a reject_reason. (descend_redirect_p -> descend_redirect): Return a reject_reason. (retrieve_tree): Support logging reasons for rejection. Add write_reject_log_header that writes a CSV format header to a file. Add write_reject_log_url that writes a url struct to a file in CSV format. Add write_reject_log_reason that writes the URL and parent URL as well as the rejection reason to a CSV file. * Test--rejected-log.px: Add a basic test for the --rejected-log command. * tests/Makefile.am: Run Test--rejected-log.px. This allows you to figure out why URLs are being rejected and some context around it. CSV is used as the output format since it can be used easily parsed, it's delimited by tabs instead of commas to allow using all (quoted) URL characters and includes column names which may be used for compatibility. 2015-08-04 Tim Rühsen Fix memory leak in HSTS code * src/main.c (get_hsts_database): Free 'home' variable void uninitialized variable in metalink code * src/metalink.c: Init retr_err with METALINK_MISSING_RESOURCE * src/wget.h: Add enum METALINK_MISSING_RESOURCE 2015-07-24 Darshit Shah Fix function name collision with OpenSSL library * src/utils.[ch], src/http.c, src/metalink.c: Rename function hex_to_string() to wg_hex_to_string sine it collides with a similarly named function in OpenSSL Library. Fix configure options for metalink * configure.ac: Ensure metalink support can be properly disabled 2015-07-22 Alex Henrie Make the filename marquee a proper marquee * src/progress.c: Start the marquee in the middle of the available space and do not restart it until all of the text has scrolled out of view. 2015-07-20 Giuseppe Scrivano NEWS: cite HSTS Fix metalink tests testenv/Test-metalink-http.py: initialize HTTP test server testenv/Test-metalink-xml.py: initialize HTTP test server 2015-07-20 Ander Juaristi Enhancements in testsuite engine + new HSTS test. * testenv/Makefile.am: added new test 'Test-hsts.py'. * testenv/Test-hsts.py: new test for HSTS. * testenv/conf/domains.py: new hook to override domain list. * testenv/test/base_test.py: (__init__): new optional parameter for tests 'req_protocols'. (get_domain_addr): set the instance variables 'addr' and 'port'. Return address as an array (domain, port) instead of string. (gen_cmd_line): take into account domain and port. * testenv/test/http_test.py (__init__): new optional parameter 'req_protocols'. (setup): new function. Call to server_setup() decoupled from begin() and moved here. (begin): call to superclass to maintain backward compatibility. Removed call to server_setup(). This patch adds a new parameter to the test suite called 'req_protocols', and a new function called 'setup'. The ability for tests to be able to set some extra parameters such as the actual requested protocols (with 'req_protocols') became obvious when support for HSTS was added to Wget, where the requested URI and the actual executed URI do not have to be the same. This new parameter is optional and if not specified, the test suite behaves as before. Also, the new function 'setup' is provided as a means to start the test HTTP server, but not launch the test yet (this is done when calling 'begin', as usual), in case we want to query the address and port in which the test server listens. If 'setup' is not called, it is automatically invoked when calling 'begin'. With these measures, we preserve backward-compatibility with existing tests. 2015-07-20 Ander Juaristi Added support for HSTS. * Makefile.am: Added new source files hsts.c and hsts.h. * http.c (parse_strict_transport_security): new function for STS header parsing. (gethttp): update the HSTS store. * http.h: new include "hsts.h". * init.c: new options --hsts and --hsts-file. * main.c (get_hsts_database, load_hsts, save_hsts): new functions. New options --no-hsts and --hsts-file added to help. (main): load and save HSTS store. * options.h: new variables for supporting --hsts and --hsts-file. * retr.c (retrieve_url): rewrite the URI according to the HSTS policy before entering http_loop. * test.c, test.h: new unit tests for HSTS. * utils.c, utils.h (countchars): new function. * wget.h: new preprocessor check. * hsts.c, hsts.h: new files with the HSTS engine implementation. Added support for HTTP Strict Transport Security (HSTS), as defined by RFC 6797. 2015-07-20 Giuseppe Scrivano NEWS: cite metalink support fix compiler warnings * src/utils.h: Include * src/recur.c: Include "exits.h" 2015-07-20 Hubert Tarasiuk Test preferred location in Metalink-over-HTTP test case. * testenv/Test-metalink-http.py: Ensure preferred location is handled properly. Geolocation support for Metalink resources. * doc/wget.text: Add information about --preferred-location. * src/init.c: Add --preferred-location option. * src/main.c (option_data): Handle --preferred-location argument. (main): Sort resources based on location if requested. * src/metalink.c (metalink_res_cmp): Compare based on location if priority and preference are equal. * src/options.h (options): Add preferred_location option. Support at most one file signature. Adapt comments to libmetalink 0.13. * src/metalink.c (retrieve_from_metalink): Add comment about new libmetalink version. Do not iterate over signatures - support just one. Move some Metalink-related code from http.c to metalink.c. * src/http.c: Move find_key_value, has_key, find_key_values. * src/metalink.c: To here. * src/metalink.h: Make them non-static and add prototypes here. Unit test for find_key_values. * src/http.c: Add test_find_key_values. * src/test.c (main): Run new test. * src/test.h: Add test_find_key_values. Unit test for has_key. * src/http.c: Add test_has_key. * src/test.c (main): Run new test. * src/test.h: Add test_has_key. Unit test for find_key_value. * src/http.c: Add test_find_key_value. * src/test.c (main): Run new test. * src/test.h: Add test_find_key_value. Test case for Metalink over HTTP. * testenv/Test-metalink-http.py: New test. * testenv/Makefile.am: Add to test list. Support multiple headers with same name in Python test suite. * testenv/README: Describe how to use repeated header name. * testenv/server/http/http_server.py (finish_headers): Send all values from list if the header value is a Python list. Test case for Metalink in XML. * testenv/Test-metalink-xml.py: New test. * testenv/Makefile.am: Add file for automake. Start HTTP test only when calling begin(). * testenv/test/http_test.py: Move self.do_test() from __init__ to begin(). Metalink support. * bootstrap.conf: Add crypto/sha256 * configure.ac: Look for libmetalink and GPGME * doc/wget.texi: Add --input-metalink and --metalink-over-http options description. * po/POTFILES.in: Add metalink.c * src/Makefile.am: Add new translation unit (metalink.c) * src/http.c (http_stat): Add metalink field. (free_stat): Free metalink field. (find_key_value): Find value of given key in header string. (has_key): Check if token exists in header string. (find_key_values): Find all key=value pairs in header string. (metalink_from_http): Obtain Metalink metadata from HTTP response. (gethttp): Call metalink_from_http if requested. (http_loop): Request Metalink metadata from HTTP response if should be. Fall back to regular download if no Metalink metadata found. * src/init.c: Add --input-metalink and --metalink-over-http options * src/main.c (option_data): Handle --input-metalink and --metalink-over-http cmd arguments. (print_help): Print --input-metalink option description. (main): Retrieve files from Metalink file * src/metalink.c (retrieve_from_metalink): Download files described by metalink. (metalink_res_cmp): Comparator for resources priority-sorting. * src/metalink.h: Create header for metalink.c (RES_TYPE_SUPPORTED): Define supported resources media. (DEFAULT_PRI): Default mirror priority for Metalink over HTTP. (VALID_PRI_RANGE): Valid priority range. * src/options.h (options): Add input_metalink option and metalink_over_http options. * src/utils.c (hex_to_string): Convert binary data to ASCII-hex. * src/utils.h (hex_to_string): Add prototype. * src/wget.h: Add metalink-related error enums Add METALINK_METADATA flag for document type. 2015-07-15 Romain Bentz Add NULL value check to fix #45289 * src/recur.c (retrieve_tree): Check return value of url_parse() 2015-07-14 Tim Rühsen Let HTTPS tests XFAIL when no TLS support configured * configure.ac: Export WITH_SSL for use in Makefile.am * testenv/Makefile.am: Add HTTPS tests to XFAIL_TESTS when !WITH_SSL Reported-by: Ander Juaristi 2015-07-01 Tim Rühsen Change function params to const in src/iri.[ch] * iri.h, iri.c: Added const attribute for params of parse_charsset(), check_encoding_name(), idn_encode(), idn_decode(), remote_to_utf8(), set_uri_encoding(), set_content_encoding(). Work around a libidn <= 1.30 vulnerability * src/iri.c: Add _utf8_is_valid() to check UTF-8 sequences before passing them to idna_to_ascii_8z(). 2015-06-27 Ángel González Fix wgetrc filename creation for Windows * init.c/wgetrc_file_name: Remove obsolete code in WINDOWS code path Reported-by: Gisle Vanem 2015-06-16 Darshit Shah Add valgrind suppression files for HTTPS tests * testenv/test/base_test.py: Use Valgrind SSL suppressions file for tests * testenv/valgrind-suppression-ssl, tests/valgrind-suppression-ssl: Add new suppression files to suppress OpenSSL errors in valgrind * tests/test-proxied-https-auth.px: Use the valgrind SSL suppressions file for the test * tests/test-proxied-https-auth-keepalive.px: Same 2015-06-14 Darshit Shah contrib/check-hard: Indentation and spacing cleanup * contrib/check-hard: Reduce the amount of text output to the screen. Also implement some indentation and whitespace cleanups. 2015-06-14 Tim Rühsen Fix usage of CFLAGS in contrib/check-hard * contrib/check-hard: Set CFLAGS per command line instead of using export. 'make distcheck' changes CFLAGS. So using ./configure -C together with exported CFLAGS fails. Setting CFLAGS per command line works smoothly. 2015-06-13 Tim Rühsen Move test_* function protoypes from test.c to test.h * src/test.c: Remove test_* function prototypes, make tests_run static * src/test.h: Add test_* function protoypes 2015-05-23 Giuseppe Scrivano NEWS: cite --if-modified-since gnulib: update gnulib 2015-05-22 Hubert Tarasiuk Include --if-modified-since option in user manual. * doc/wget.texi: Add --if-modified-since section. Support If-Modified-Since header in timestamping mode. * src/wget.h: Add IF_MODIFIED_SINCE enum for dt. Add TIMECONV_ERR enum to uerr_t. * src/http.c (time_to_rfc1123): Convert time_t do http time. * src/http.c (initialize_request): Include If-Modified-Since header if appropriate. * src/http.c (set_file_timestamp): Separate this code from check_file_output. * src/http.c (check_file_output): Use set_file_timestamp. * src/http.c (gethttp): Handle properly 304 return code and 200 if server ignores If-Modified-Since headers. * src/http.c (http_loop): Load filename to hstat if condget was requested, use IF_MODIFIED_SINCE if requested and current timestamp can be obtained. Add --if-modified-since option * src/init.c: Add to commands array. * src/main.c: Add to cmdline_option. Add to help message. * src/options.h: Add to options struct. Add test for condget requests. * testenv/Test-condget.py: the test * testenv/Makefile.am: add to tests list Support conditional GET in testenv server. * src/exc/server_error.py: Add exception for GET to HEAD fallback. * src/server/http/http_server.py: Do not send body if 304 return code requested for a file. Implement timestamp support for local files in testenv * testenv/README: Change timestamp format definition * testenv/conf/local_files.py: Set proper timestamps 2015-05-20 Pär Karlsson Fix undeclared loop variable in Perl test suite Reported-by: Hubert Tarasiuk 2015-05-12 Ander Juaristi Make sure Wget does not unescape reserved chars. * testenv/Test-reserved-chars.py: New file. * testenv/Makefile.am: Added new test Test-reserved-chars.py. When following redirections, Wget should not unescape the reserved characters that might appear in target URLs. 2015-05-12 Ander Juaristi Fixed incorrect handling of reserved chars. * src/iri.c (do_conversion): Call url_unescape_except_reserved, instead of url_unescape. * src/url.c (url_unescape_1): New static function. (url_unescape): Calls url_unescape_1 with mask zero. Preserves same behavior as before. Only code changes. (url_unescape_except_reserved): New function. * src/url.h: Added prototype for url_unescape_except_reserved(). When the locale is US-ASCII, URIs that contain special characters in them are converted to IRIs according to RFC 3987, section 3.2 "Converting URIs to IRIs". 2015-05-07 Darshit Shah Fix documentation for update_speed_ring() * progress.c (update_speed_ring): The comment for the function incorrectly stated that the function uses thirty samples from the past instead of twenty. Reported-By: Yi Li 2015-05-04 Darshit Shah Remove shadowed variable in http.c * http.c (gethttp): Rename err to conn_err to prevent shadowed variable 2015-05-04 Steven Schubiger paramcheck: use explicit quoting for here-docs * util/paramcheck.pl: Adjust here-docs 2015-05-03 Tim Ruehsen Fix format specifier warning * src/utils.c (aprintf): Use %d for int argument 2015-05-01 Nikolay Merinov Fix timestamping and continue behaviour with ftp protocol. * src/ftp.c (ftp_loop_internal): Add option `force_full_retrieve' that force to retrieve full file. (ftp_retrieve_list): Pass `true' as `force_full_retrieve' option to `ftp_loop_internal' if we want to download file with newer timestamp than local copy. 2015-04-27 Rohit Mathulla openssl: Read cert from private key file when needed * src/openssl.c (ssl_init): Assign opt.cert_{file, type} from opt.private_key(_type) Fix double free bug in SSL code * src/openssl.c, src/gnutls.c (ssl_init): Copy options using xstrdup 2015-04-20 Hubert Tarasiuk Single exit point and common cleanup code in gethttp * src/http.c (gethttp): Common cleanup for type, message, req, resp, head. Single exit point. 2015-04-17 Tim Rühsen Silence warning in perl test suite * tests/WgetTests.pm: Use string comparisons for $valgrind variable 2015-04-17 Tim Rühsen Check memory allocations in WARC code * src/warc.c: Remove some memory allocations, use xmalloc instead of malloc Reported-by: Bill Parker 2015-04-17 Tim Rühsen Add more const usage to function params * warc.c, warc.h: Add const specifier to several function args 2015-04-14 Darshit Shah PEP8'ify the Python Test Suite * testenv/conf/{__init__,authentication,files_crawled, hook_sample,reject_header,server_files}.py: Aesthetic changes to meet Python PEP8 guidelines * testenv/exc/{server_error,test_failed}.py: Same * testenv/misc/{colour_terminal,wget_file}.py: Same * testenv/server/http/http_server.py: Same * testenv/test/base_test.py: Same 2015-04-13 Ander Juaristi Improved test suite documentation * README.checkout: Added reference to Python. New section "Testing and development". Updated information about test suites. Added support for GDB in Python tests * base_test.py (gen_cmd_line): Check GDB_TESTS environment variable. 2015-04-13 Tim Ruehsen Added GDB support to Perl tests * WgetTests.pm (run): Check GDB_TESTS environment variable. 2015-04-11 Hubert Tarasiuk Add Valgrind suppression for libidn.so at idna_to_ascii_4z * tests/WgetTests.pm (run): Include suppression file when running Valgrind. * tests/valgrind-suppressions: Add suppression for idn_to_ascii_4z. * tests/Makefile.am: Add valgrind-suppressions to EXTRA_DIST. 2015-04-11 Ángel González Remove memory leak in idn_encode. * src/iri.c (idn_encode): Free buffer from remote_to_utf8 when needed; give meaningful names to variables; remove excessive comment. 2015-04-10 Hubert Tarasiuk Fix error in free_vec. * src/utils.c (free_vec): Increment pointer instead of its value. Reported-by: Gisle Vanem 2015-04-10 Ángel González Fix const usage in iri.c * src/iri.c (remote_to_utf8): Do not qualify with const the output pointer. (do_conversion): Use the provided input parameter as const. (idn_encode): casts to remote_to_utf8 parameters are no longer needed. * src/iri.h: Adjusted remote_to_utf8 prototype. * src/url.c: It is no longer necessary to cast new_url to const char. 2015-04-06 Miquel Llobet Fixed #44628 honoring RFC 6266 content-disposition src/http.c (parse_content_disposition): stores filename* and filename separately and choses filename* if available. (test_parse_content_disposition): added new tests. 2015-04-02 Giuseppe Scrivano NEWS: cite last change 2015-04-02 Steven M. Schweda Add option to restrict filenames used VMS. * src/options.h (enum restrict_files_os): Define "restrict_vms". * src/init.c (defaults) [__VMS]: Set "opt.restrict_files_os" to "restrict_vms". (cmd_spec_restrict_file_names): honor "vms". * src/url.c (filechr_not_unix): Define "filechr_not_vms". (filechr_table): Update for VMS. (append_uri_pathel): Honor opt.restrict_files_os. (FN_QUERY_SEP): Update for VMS. (FN_QUERY_SEP_STR): Update for VMS. 2015-04-01 Hubert Tarasiuk Change semantics of resp_free and request_free in http.c * src/http.c (resp_free): Change the semantics of this function. (request_free): Change the semantics of this function. (initialize_request): Adjust request_free call. (establish_connection): Adjust request_free, resp_free calls. (gethttp): Adjust request_free, resp_free calls. Do not free request in establish_connection; do it in gethttp * src/http.c (establish_connection): Do not free request here (it is * never allocated here). * src/http.c (gethttp): Free request before returning if error in * establish_connection encountered. Transform read_header label and goto into a loop * src/http.c (gethttp): Replace label and goto statement with a do loop. 2015-03-31 Hubert Tarasiuk Factor out set_content_type function from gethttp * src/http.c (gethttp): Move some code in... (set_content_type): ... a new function. 2015-03-18 Giuseppe Scrivano Factor out some gethttp code * src/http.c (gethttp): Move some code in... (open_output_stream): ... a new function. Factor out some auth gethttp code * src/http.c (gethttp): Move some code in... (check_auth): ... a new function. Factor out some gethttp code * src/http.c (gethttp): Move some code in... (check_file_output): ... a new function. Factor out some connection initialization code for gethttp * src/http.c (gethttp): Move some initialization code in... (establish_connection): ... a new function. Factor out some proxy initialization code for gethttp * src/http.c (gethttp): Move some initialization code in... (initialize_proxy_configuration): ... a new function. Factor out some initialization code for gethttp * src/http.c (gethttp): Move some initialization code in... (initialize_request): ... a new function. 2015-03-18 Tim Ruehsen src/ftp.c: make sure warc_tmp becomes closed before return Reported-by: Coverity bug #1188044 src/http.c: fix error return of digest_authentication_encode() Reported-by: Coverity bug #1188036 2015-03-17 Rohan Prinja Convert wget.texi to UTF-8 * doc/wget.texi: convert to UTF-8 by adding @documentencoding * doc/Makefile.am: Pass argument --utf8 to pod2man. 2015-03-16 Ander Juaristi Alamos * doc/wget.texi: The default is GnuTLS, not OpenSSL. 2015-03-16 Darshit Shah testenv/http_server: Allow overriding Headers We would like to override existing headers in the response from the server to allow the test cases to define their own headers. This patch allows this behaviour through the use of the add_header() method while also allowing sending multiple headers of the same name. 2015-03-14 Darshit Shah retr.c: Fix memory leak in retrieve_from_file() Reported by: Coverity Bug 1188045 html-url.c: Fix potential memory leaks Reported by: Coverity Bug 1188050 main.c: Fix two potential memory leaks Reported by: Coverity bug 1188048 retr.c: Fix two memory leaks when proxy URL is bad Reported by: Coverity bug 1188047 2015-03-13 Darshit Shah Docs: --post-file is binary data Wget considers the file mentioned in the --post-file argument as a binary file and does not strip any control characters. The lack of this information in the documentation can cause a lot of headaches debugging for a simple issue 2015-03-09 Giuseppe Scrivano maint: update copyright year ranges to include 2015 NEWS: Prepare new development cycle 2015-03-09 Yousong Zhou Fix --content-on-error option handling. * src/http.c: Log --content-on-error downloads. * src/retr.c (retrieve_url): Register the download of an error page when --content-on-error is specified. testenv: add test case Test--convert-links--content-on-error.py. * testenv/Makefile.am: Register the new test. * testenv/Test--convert-links--content-on-error.py: Add test case for the combination of --convert-links and --content-on-error.py. testenv: fix http_server.py with Response and Authentication. * testenv/exc/server_error.py: Try writing file content for GET request even if there is a Response rule present. * testenv/server/http/http_server.py: Likewise. testenv: improve color output a bit. * testenv/misc/colour_terminal.py: Only print color codes when stdout isatty(). testenv: typo and style fix. * testenv/server/http/http_server.py(BaseTest): Add docstring; use raw string for regex. * testenv/server/http/http_server.py(_Handler): Typo fix. * testenv/conf/server_files.py(ServerFiles): Code style change for readability plus another typo fix. 2015-03-09 Giuseppe Scrivano NEWS: tag 1.16.3 2015-03-08 Anderson Goulart src/main.c (--no-verbose): don't show progress bar Fixes #44431 2015-03-06 Darshit Shah main.c: Use assertion to test buffer size 2015-03-02 Darshit Shah main.c: Need to explicitly disallow show_progress in -q 2015-02-28 Giuseppe Scrivano NEWS: add other items and tag 1.16.2 gnulib: sync gnulib 2015-02-27 Giuseppe Scrivano NEWS: Update file 2015-02-23 Eli Zaretskii warc.c: native uuid generation on Windows * warc.c (windows_uuid_str) [WINDOWS]: New function specific to MS-Windows. (warc_uuid_str) [WINDOWS]: If windows_uuid_str succeeds, use its result; otherwise use the fallback method. 2015-02-18 Gisle Vanem idn: use idn_free() to free allocated libidn memory xfree() might crash on libidn memory on Windows. From 'man idn_free': "Under Windows, different parts of the same application may use different heap memory, and then it is important to deallocate memory allocated within the same module that allocated it. This function makes that possible." 2015-02-10 Tim Rühsen gettext: Use gnulib's gettext.h for compatibility Fixes issues with gettext on Solaris Reported-by: Kiyoshi KANAZAWA src/openssl.c: Use SSL_state() instead of ssl_st.state Changes in OpenSSL 1.0.2 API hides ssl_st structure members. Reported-by: Gisle Vanem 2015-01-31 Giuseppe Scrivano * README.checkout: Remove duplicated word * cfg.mk (VC_LIST_ALWAYS_EXCLUDE_REGEX): Add ChangeLog-2014-12-10. 2015-01-27 Tim Rühsen README.checkout: Mention libio-socket-ssl-perl to be installed 2015-01-22 Dagobert Michelsen tests/Makefile.am: Solaris needs libintl and libiconv for testsuite 2015-01-20 Darshit Shah progress bar: Allow display on stderr alongwith -o This commit causes the --show-progress option to print the progress bar to stderr even when a logfile was explicitly provided on the command line. Such a combination allows a user to log the output of Wget while simultaneously keeping track of the download status. 2015-01-18 Darshit Shah gnulib: Pull submodule forward 2015-01-16 Mathieu Parent src/connect.c: More verbose error message (tiny change) This fixes Debian bug #144076. 2015-01-04 Tim Ruehsen src/ftp-basic.c: Accept 5-digit port numbers in EPSV responses Reported-by: Adam Sampson 2015-01-02 Tim Rühsen README.checkout: Add libpcre, pkg-config, libgnutls Add description for libpcre Add description for pkg-config Add description of GnuTLS Add 2014,2015 copyright year Use ASCII copyright characters 2014-12-27 Tim Ruehsen src/http.c: Revert commit d81a8d5f56fca2746c4ad83d764bd24adcfbe927 The removal of the 'redundant' condition was a failure. Fixes: #43876 Reported-by: Sean Jensen-Grey 2014-12-25 Tim Ruehsen configure.ac: Skip pkg-config for opensl and gnutls when prefix is given Make --with-libssl-prefix and --with-libgnutls-prefix do the right thing, no matter if pkg-config is installed or not. Reported-by: Charles Diza 2014-12-25 Tim Ruehsen src/main.c, src/warc.c: Use gnulib's base_name() instead of basename() Reported-by: Eli Zaretskii 2014-12-24 Giuseppe Scrivano Generate distributed ChangeLog from git log * bootstrap.conf (gnulib_modules): Add "gitlog-to-changelog". * Makefile.am: Generate the ChangeLog file from the git log and the old ChangeLog data. merge ChangeLog files in ChangeLog-2014-12-10. * ChangeLog: truncate file. * ChangeLog.README: Likewise. * doc/ChangeLog: Likewise. * msdos/ChangeLog: Likewise. * src/ChangeLog: Likewise. * testenv/ChangeLog: Likewise. * tests/ChangeLog: Likewise. * ChangeLog-2014-12-10: New file. * Makefile.am: Distribute ChangeLog-2014-12-10. 2014-12-21 Eli Zaretskii src/utils.c: Don't include termios.h on Windows 2014-12-21 Tim Ruehsen tests/FTPServer.pm: Fix ftp tests for Windows The Single Unix strftime format specifier %e is not supported by MS Windows. Instead we now use %d. Reported-By: Eli Zaretskii 2014-12-21 Tim Rühsen gnulib: Use basename() from gnulib module 'dirname' Avoid basename incompatibilities between POSIX and GNU implementations. Also, libgen.h isn't needed any more which increases compatibility. 2014-12-21 Giuseppe Scrivano contrib/check-hard: delete trailing whitespaces and repeated word 2014-12-21 Benno Schulenberg * src/main.c (print_help): Indent description continuation lines by two spaces. * src/main.c (print_help): Don't use a possessive instead of a plural. * src/main.c (print_help): Don't use the backtick as a left quote mark in the help text. * src/main.c (print_help): Remove the full stops from option descriptions. And use semicolons to separate multiple phrases in a description. * src/main.c (print_help): Make all option descriptions start with a lowercase letter. Also spell URL consistently in all uppercase, and fix the grammar. 2014-12-20 Tim Ruehsen doc/wget.texi: Add 'https_only' in 'wgetrc commands' section Reported-By: Eli Zaretskii 2014-12-20 Eli Zaretskii tests/WgetTests.pm: Print timestamps when tests fail due to wrong timestamps 2014-12-19 Tim Rühsen sample.wgetrc: Fix wrong command 'locale' into 'localencoding' Reported-by: Eli Zaretskii wget.texi: Clarify wgetrc command syntax Reported-by: Eli Zaretskii 2014-12-18 Tim Rühsen configure.ac: Fix libuuid and uuid_create detection --with-libuuid now explictly asks for libuuid. --without-libuuid ignores libuuid and tries to use libc builtin functions. Else try builtin functions first, libuuid second and fallback to Wget's own function. 2014-12-17 Yuriy M. Kaminskiy src/log.c: Fix unexpected termination in logprintf (tiny change) When errno was set to EPIPE before call to logprintf (e.g. during close of SSL connection that was reset by peer), it will unexpectedly terminate wget. It should exit only when EPIPE was triggered by logging code. Regression by 0b5b100fc9aaa54257ffa16a083a862c031c335b 2014-12-17 Tim Rühsen configure.ac: Check for RAND_egd() in pkg-config success code src/main.c: Conditionally print help for --random-file and --egd-file --random-file is only relevant when compiled with either OpenSSL or LibreSSL. --egd-file is only relevant when compiled with OpenSSL src/init.c: Fix indentation for crlfile option wget.texi: Document --random-file and --egd-file as OpenSSL only 2014-12-17 Jérémie Courrèges-Anglas openssl: Detect the availability of RAND_egd (tiny change) Alternatives like LibreSSL don't provide RAND_egd() anymore. Fixes compilation on OpenBSD. Fix use of uuid libc functions (tiny change) * Always attempt to detect uuid.h and uuid_create(). * Split libuuid and uuid.h implementations of warc_uuid_str(), since those APIs vary significantly. * Correctly use the uuid.h functions 2014-12-16 Darshit Shah Revert "ChangeLog: Move all ChangeLog files to .pre-gitlog" This reverts commit fcd3b3c473be25c4967fd8139d976e76faaef1b6. Turns out that removing the ChangeLog files causes the Wget build to fail. While this issue is investigated and sorted out, the commit is reversed to allow people to be able to build Wget from master 2014-12-16 Tim Rühsen contrib: Added script contrib/check-hard to check Wget status This script automates some tests, e.g. testing with and without valgrind, using different configure options. It is aimed at developers to execute before pushing commits. 2014-12-16 Cong Ma ftp: fix invalid pointer dereference in getftp() (tiny change) The pointer respline in use after being passed to ftp_response() may be uninitialized if ftp_response() fails. Ensure that respline be used after checking the return value of ftp_response(). 2014-12-15 Darshit Shah ChangeLog: Move all ChangeLog files to .pre-gitlog From v1.16.1 onwards, Wget no longer maintains an active ChangeLog file. Instead the ChangeLog will be automatically generated on each release through gnulib's gitlog-to-changelog script. However, the old versions of the ChangeLog files are retained for reference. These files are renamed with a .pre-gitlog appended to their filenames. Also removed ChangeLog.README file which is not required anymore 2014-12-12 Tim Rühsen src/iri.c: Call xstrndup instead of strndup strndup() does not exist in some older C libraries, also xstrndup() exits on memory allocation failures. bootstrap.conf: Add module 'getline' This function does not exist in some older C libraries. 2014-12-12 Tim Rühsen src/iri.c: Use c_strcasestr instead of strcasestr This also fixes a problem with strcasestr not being in the boostrap.conf module list. Reported-by: Kiyoshi KANAZAWA 2014-12-11 Friedrich Haubensak Add support for older versions of flex (tiny change) E.g. flex 2.5.4 (Solaris 10) does not like a space after -o. 2014-12-10 Tim Rühsen Check for pcre.h in configure.ac 2014-12-11 Tim Ruehsen * src/Makefile.am: Support older versions of flex 2014-12-10 Tim Ruehsen * configure.ac: Check for pcre.h 2014-12-08 Giuseppe Scrivano * cfg.mk (local-checks-to-skip): Add sc_cast_of_argument_to_free. 2014-12-05 Tim Ruehsen * src/Makefile.am: Removed reference to cmpt.c 2014-12-05 Tim Ruehsen * configure.ac: Fix message for when GnuTLS has not been found. 2014-12-05 Tim Ruehsen * bootstrap.conf: Add modules fnmatch, memrchr, strcase, strptime, strtoll, timegm Remove module strcasecmp Use module spawn-pipe instead of deprecated module pipe 2014-12-04 Darshit Shah * src/wget.h: Define MIX and MAX macros globally for all files * src/gnutls.c, retr.c: Delete declaration of MIN macro * src/http.c, progress.c: Delete declaration of MIN and MAX macros 2014-12-04 Darshit Shah * src/ftp.c (getftp): Remove a call to assert(1) and replace with error message. Also fix indentation 2014-12-03 Jérémie Courrèges-Anglas (tiny change) * src/openssl.c (ssl_init): Allow pfs to be used with protocols other than TLSv1. 2014-12-03 Gisle Vanem * src/openssl.c (ssl_init): Fix C89 warning 2014-12-03 Gisle Vanem * src/mswindows.c (ws_cleanup): Fix compiler error 2014-12-01 Tim Ruehsen * src/connect.c, iri.c, url.c, warc.c: Fix issues reported by static code analysis tool 'parfait'. Closes: #41235 Reported-by: Jiri Kukacka 2014-11-29 Tim Ruehsen * src/utils.h: xfree() sets argument to NULL after freeing, removed xfree_null() * src/connect.c, cookies.c, ftp-basic.c, ftp-ls.c, ftp.c hash.c, host.c, html-url.c, http.c, init.c, iri.c, log.c, main.c, mswindows.c, netrc.c, openssl.c, recur.c, res.c, retr.c, url.c, warc.c: Replaced xfree_null() by xfree() 2014-11-28 Tim Ruehsen * src/main.c: Fix length of program_argstring, fix regression introduced with commit 9217b864d865cf1766e1589d138c8fc17d3d3d5c. 2014-11-28 Tim Ruehsen * src/gettext.h: Removed this file * src/wget.h: Do not include gettext.h, include libintl.h directly, define _() as gettext() shortcut. The header file gettext.h is simply not needed. 2014-11-28 Tim Ruehsen * src/Makefile.am: Removed gettext.h 2014-11-27 Darshit Shah * src/cookies.c, gettext.h, init.c, retr.c, url.c, warc.c: Replace usage of free() with xfree() macro. 2014-11-26 Tim Ruehsen * tests/HTTPServer.pm: Substitue server port in HTTP headers This allows us to use references like "Location" => "http://localhost:{{port}}/whatever in test cases. 2014-11-26 Tim Ruehsen * testenv/Makefile.am: Removed Test-auth-both.py from XFAIL_TESTS 2014-11-26 Tim Ruehsen * src/recur.c: Fix blacklisting of URLs to download Fixes a reported crash and prevents multiple downloads of the same file in case the URL is escaped in different ways. The crash has been Reported-by: Frédéric 2014-11-26 Tim Ruehsen * src/recur.c: Add space after function names 2014-11-26 Tim Ruehsen * src/http.c: Select strongest challenge from WWW-Authenticate, support multiple challenges per header line. 2014-11-26 Tim Ruehsen * src/gnutls.c (ssl_connect_wget): Implement missing code for --secure-protocol=TLSv1_1|TLSv1_2, print message before abort() for missing protocol implementations. * src/openssl.c (ssl_init): Print message before abort() for missing protocol implementations. 2014-11-26 Giuseppe Scrivano * testenv/Makefile.am (TESTS): Add Test-redirect-crash.py. * testenv/Test-redirect-crash.py: New File. 2014-11-25 Pär Karlsson * src/Makefile.am: Added missing version.h to wget_SOURCES 2014-11-25 Darshit Shah * bootstrap.conf: Add flex as a prerequisite to building Wget from the repository sources 2014-11-24 Tim Ruehsen * src/wget.h, test.c, main.c: Make program_name and program_argstring const 2014-11-24 Tim Ruehsen * src/warc.c: Fix C89 compile errors using ./configure --without-libuuid 2014-11-24 Tim Ruehsen * src/warc.c, warc.h, http.c: Add size of buffer to warc_timestamp() 2014-11-24 Tim Ruehsen * src/retrc.c: Fix possibly uninitialized variable 2014-11-22 Darshit Shah * src/wget.h: Make extern declaration for program_name 2014-11-22 Darshit Shah * src/wget.h: Make extern declaration for program_argstring. * src/warc.c: Remove extern declaration * src/test.c: Make program_argstring a char *, instead of const char *. 2014-11-22 Darshit Shah * src/version.h: New file. Add extern declarations for globally shared strings * src/Makefile.am: Have version.c import version.h * src/main.c: Import version.h and remove old extern declarations * src/http.c: Same * src/warc.c: Same 2014-11-22 Darshit Shah * src/version.h: Add extern for compilation_string * src/main: Remove declaration for compilation_String 2014-11-22 Darshit Shah * src/utils.c (abort_run_with_timeout): The sig parameter is not used. Mark it as such. (abort_run_with_timeout): One implementation of this function did not specify _Noreturn. Add it. 2014-11-22 Darshit Shah * src/retr.h: Declare extern variable numurls * src/http.c: Remove extern declaration for numurls * src/ftp.c: Same 2014-11-22 Darshit Shah * src/hosts.c (address_list_set_faulty): Add a helpful error message for when a logical assumption fails 2014-11-22 Darshit Shah * src/cookies.c (discard_matching_cookie): Replace defensive assert with a logical check and error message 2014-11-22 Darshit Shah * build-aux/build_info.pl: Include version.h in the final C file generated by the script. 2014-11-21 Tim Ruehsen * testenv/server/http/http_server.py: Fixed typo Blackisted to Blacklisted 2014-11-21 Daniel Stenberg * src/iri.c (remote_to_utf8): Fix compiler warning 2014-11-20 Tim Ruehsen * src/progress.c: Fix code for undefined USE_NLS_PROGRESS_BAR Reported-by: Gisle Vanem 2014-11-20 Tim Ruehsen * src/cookies.c, ftp-basic.c, ftp-ls.c, ftp.c, html-url.c, http.c, init.c, iri.c, main.c, progress.c, res.c, url.c, utils.c, wget.h: Replaced strcasecmp and strncasecmp by c_strcasecmp and c_strncasecmp where only ASCII string comparison is wanted (instead of locale dependent). Fixes possible issues with Wget running in a turkish locale. 2014-11-20 Tim Ruehsen * src/Removed form feeds (\014) from *.c and *.h 2014-11-20 Tim Ruehsen * configure.ac: Fix libpsl<0.6.0 detection 2014-11-20 Tim Ruehsen * NEWS: remove form feeds 2014-11-20 Tim Ruehsen * src/cookies.c, ftp-basic.c, ftp.c, gnutls.c, host.c, html-url.c, http.c, main.c, progress.c, retr.c, warc.c: Trivial fixes for C89 compliancy. 2014-11-20 Gisle Vanem * src/http.c (parse_content_disposition): Fix C89 issue found by MSVC 16 2014-11-20 Gisle Vanem * src/build_info.c.in: Assume large file for WINDOWS 2014-11-20 Darshit Shah * configure.ac: Use pkg-config for libpsl detection config status for libpsl is reflected by $with_libpsl 2014-11-20 Darshit Shah * configure.ac: Use autoconf macros where available 2014-11-20 Darshit Shah * configure.ac: Change option to --enable-assert. Assertions are disabled by default. * NEWS: Edit to reflect default option. * README.checkout: Document --enable-assert for developers 2014-11-20 Darshit Shah * configure.ac: Add the -Wextra flag when no CFLAGS are set 2014-11-20 Darshit Shah * .gitignore: po/stamp-po should be ignored 2014-11-19 Tim Ruehsen * configure.ac: Check for random() 2014-11-19 Tim Ruehsen * src/utils.c (random_float, random_number): Use random() and srandom() if available. Reported-by: Coverity scanner 2014-11-19 Tim Ruehsen * src/openssl.c (ssl_check_certificate): Fix memory leak 2014-11-19 Tim Ruehsen * src/gnutls.c (ssl_init): Report load failure of cert files from cert directory only with --debug. Fixes #41331. 2014-11-19 Tim Ruehsen * src/ftp-basic.c (ftp_epsv): Fix loop check Reported-by: Coverity scanner 2014-11-19 Darshit Shah * testenv/Test-504.py: Add new test case to show how Wget handles 504 Gateway Timeouts * testenv/Makefile.am: Add Test-504.py to TESTS 2014-11-19 Darshit Shah * src/utils.c (compile_posix_regex): Fix memory leak due to unfreed regex pointer 2014-11-19 Darshit Shah * src/progress.c (create_image): Do not scroll filename if amount of scrolling is less than MIN_SCROLL_TEXT * src/retr.c (fd_read_body): Do not print the directory prefix in the progress bar 2014-11-19 Darshit Shah * src/main.c (format_and_print_line): Fix potential memory leak 2014-11-19 Darshit Shah * src/exits.c (get_status_for_err): GATEWAYTIMEOUT is a Server Error and Wget's exit code should reflect that * src/wget.h: Add GATEWAYTIMEOUT as an error * src/http.c (http_loop): Gateway timeout errors should be non-fatal 2014-11-18 Tim Ruehsen * src/Fix warnings from clang-analyzer 3.6 gnutls.c:457:3: warning: Value stored to 'err' is never read err = 0; http-ntlm.c:477:5: warning: Value stored to 'size' is never read size = (size_t) snprintf (ntlmbuf, sizeof(ntlmbuf), http.c:1479:3: warning: Attempt to free released memory xfree_null (hs->error); 2014-11-17 Tim Ruehsen * testenv/server/http/http_server.py: allow case-insensitive auth-type, send BasIc and DIgest to provoke Wget failures with turkish locales 2014-11-17 Tim Ruehsen * bootstrap.conf (gnulib_modules): Add module c-strcase 2014-11-17 Tim Ruehsen * src/main.c: code cleanup for redirect_output_signal() 2014-11-17 Tim Ruehsen * src/http.c: use c_strncasecmp() in BEGINS_WITH macro and in STARTS macro 2014-11-17 Tim Ruehsen * src/ftp.h: removed unused variable ftp_last_respline * src/init.c, test.c, warc.c: removed 'make check'compiler warnings 2014-11-16 Tom Li * src/progress.c (create_image): Ensure that the last character of the filename is scrolled to 2014-11-16 Darshit Shah * po/stamp-po: Remove autogenerated file from checked out sources 2014-11-16 Darshit Shah * gnulib: Update submodule * bootstrap: Update script from latest gnulib 2014-11-16 Darshit Shah * .gitignore: Sort gitignore file based on directories 2014-11-15 Darshit Shah * testenv/certs/README: Remove trailing whitespaces 2014-11-15 Darshit Shah * src/progress.c (create_image): Fix assertion that checks progress bar length Should fix bug #43593 2014-11-15 Darshit Shah * configure.ac: Add AC_HEADER_ASSERT. Enables --disable-assert option 2014-11-12 Witchakorn Kamolpornwijit (tiny change) * src/html-parse.c (map_html_tags): Fix range check 2014-11-12 Tim Ruehsen * src/openssl.c (ssl_init): Fix error handling for CRL loading 2014-11-11 Tim Ruehsen * testenv/recreated the certs and the CRL file, also amended certs/README 2014-11-11 Tim Ruehsen * src/openssl.c (ssl_init): Load CRL file given by --crl-file 2014-11-11 Tim Ruehsen * src/main.c: Fix missing _Noreturn declaration specifiers Reported-by: Gisle Vanem 2014-11-11 Tim Ruehsen * src/http.c (gethttp): Always add Accept-Encoding: identity Fixes bug #40819 Reported-by: Noël Köthe 2014-11-10 Tim Ruehsen * testenv/removed certs/wget-cert.pem * testenv/added certs/ca-cert.pem, certs/ca-key.pem, certs/server-cert.pem, certs/server-key.pem and certs/server-crl.pem * testenv/added certs/README for description how to generate the certs and keys * testenv/server/http/http_server.py: amended to work with new certs/ files * testenv/added Test--https-crl.py to test --crl-file * testenv/Makefile.am: added Test--https-crl.py * testenv/Test--https.py: use --ca-certificate of --no-check-certificates 2014-11-10 Tim Ruehsen * src/openssl.c: Fix compile-time check for TLSv1.1 and TLSv1.2 Reported-by: Velemas Vosak 2014-11-09 Darshit Shah * src/http.c (gethttps): Honour keep-alive when server responds with a 416 Range not Satisfiable 2014-11-09 Darshit Shah * src/http.c (gethttp): Remove redundant condition evaluation 2014-11-06 Tim Ruehsen * src/init.c, main.c, options.h: added new option --crl-file for specifying a CRL (Certificate Revocation List) file. * src/gnutls.c: load CRL file given by --crl-file Reported-by: Noël Köthe 2014-11-06 Tim Ruehsen * doc/wget.texi: added description for --crl-file 2014-11-05 Tim Ruehsen * tests/Test-stdouterr.px: fix skip exit code to 77 2014-11-05 Tim Ruehsen * tests/Makefile.am: added $(LIB_CLOCK_GETTIME) to unit test linkage Reported-by: kdevel@vogtner.de 2014-11-05 Tim Ruehsen * configure.ac: check for python3 * Makefile.am: only use python test suite if python3 found 2014-11-05 Giuseppe Scrivano * contrib/tsocked-wget (TSOCKS_CONF_FILE): Remove empty new-line. 2014-11-04 Tim Ruehsen * src/iri.c (do_conversion): fix quote() misuse * src/iri.c (remote_to_utf8): fix segfault on do_conversion() error Reported-by: Mikael Magnusson 2014-11-04 Darshit Shah * README.checkout: Bootstrap script is ./bootstrap not ./bootstrap.sh 2014-11-03 Tim Ruehsen * tests/added new test Test-proxied-https-auth-keepalive.px 2014-11-03 Tim Ruehsen * tests/Test-proxied-https-auth.px: add valgrind support, sync parent / child (client / server) 2014-11-03 Tim Ruehsen * src/log.c: turn on automatic format check for log_vprintf_internal() 2014-11-03 Pär Karlsson * tests/WgetFeature.pm: Fixed finding of WgetFeature.cfg when using separate build dirs. * tests/WgetFeature.pm: Fixed warning from BLOCK eval, made it indo EXPR eval 2014-11-03 Jakub Čajka * src/iri.c (remote_to_utf8): Fixed assumption that char is signed. Fixes fellowing test case failures idn-cmd-utf8, idn-robots-utf8, if char is unsigned. 2014-11-01 Mike Frysinger * configure.ac: Use pkg-config to check for zlib presence and fallback to the previous code if that doesn't work. * configure.ac: Use pkg-config to check for libuuid presence and fallback to the previous code if that doesn't work. * configure.ac: Use pkg-config to check for pcre presence and fallback to the previous code if that doesn't work. * configure.ac: Use pkg-config to check for openssl presence and fallback to the previous code if that doesn't work. * configure.ac: Use pkg-config to check for gnutls presence and fallback to the previous code if that doesn't work. 2014-11-01 Giuseppe Scrivano * src/http.c (gethttp): Do not check for "Connection: close" header only when the Content-Length is specified. 2014-11-01 Darshit Shah * src/progress.c (create_image): Extra padding for size_grouped_diff has already been added. Do not add that again. (create_image): Assert that the progress bar being drawn is lesser than the size of the screen. 2014-11-01 Darshit Shah * configure.ac: Fix check for libpsl 2014-10-31 Tim Ruehsen * src/openssl.c: make _get_rfc2253_formatted() static 2014-10-31 Pär Karlsson * tests/WgetTests.pm: Proper conditional operators, tidied up code, idiomatic improvements as per modern Perl best practices. * tests/WgetFeature.pm: Tidied up code, idiomatic improvements for readability * tests/FTPServer.pm: Tidied up code (perltidy -gnu) * tests/FTPTest.pm: Likewise * tests/HTTPServer.pm: Likewise * tests/HTTPTest.pm: Likewise 2014-10-30 Mike Frysinger * tests/WgetFeature.pm: fix skip exit code to 77 2014-10-30 Mike Frysinger * src/openssl.c: fix implicit decl warning 2014-10-29 Tim Ruehsen * src/openssl.c: print cert subject and issuer RFC2253 compliant * src/openssl.c (ssl_check_certificate): removed memory leak 2014-10-29 Tim Ruehsen * src/http.c (gethttp): use CLOSE_INVALIDATE instead of fd_close 2014-10-29 Tim Ruehsen * src/http.c (gethttp): fix memory leak 2014-10-29 Tim Ruehsen * src/http.c (CLOSE_FINISH): always set fd invalid after close 2014-10-29 Peter Meiser (tiny change) * src/openssl.c (ssl_init) [! OPENSSL_NO_SSL3]: Add guard for OpenSSL without SSLv3. 2014-10-28 Tim Ruehsen * tests/Test-proxied-https-auth.px: synchronize client and server 2014-10-28 Tim Ruehsen * configure.ac: check for strlcpy() 2014-10-28 Tim Ruehsen * configure.ac: add --enable-valgrind-tests 2014-10-28 Tim Ruehsen * src/utils.c: added strlcpy(), concat_strings() rewritten * src/utils.h: added strlcpy() 2014-10-28 Tim Ruehsen * src/ftp.c (ftp_loop_internal): Fix memory leak * src/http.c (gethttp): Fix two memory leaks * src/init.c (cleanup): Fix compile errors and warnings * src/init.c (cleanup): Free more opt variables * src/main.c (main): Fix two memory leaks * src/retr.c (retrieve_url): Fix two memory leaks * src/retr.c (retrieve_url): Add iri to call to url_parse() * src/url.c (url_parse): Fix memory leak 2014-10-28 Giuseppe Scrivano * tests/Makefile.am: Replace $FOO with @VAR@. 2014-10-28 Giuseppe Scrivano * src/Makefile.am: Replace $FOO with @VAR@. 2014-10-28 Giuseppe Scrivano * doc/Makefile.am: Replace $FOO with @VAR@. 2014-10-28 Giuseppe Scrivano * cfg.mk (local-checks-to-skip): Add sc_makefile_at_at_check. * Makefile.am: Replace $FOO with @VAR@. 2014-10-25 Tim Ruehsen * tests/WgetTests.pm (run): generate valgrind command line if requested * tests/Makefile.am: set/export VALGRIND_TESTS 2014-10-25 Tim Ruehsen * testenv/test/base_test.py (gen_cmd_line): generate valgrind command line if requested * testenv/README: amend description of VALGRIND_TESTS * testenv/Makefile.am: set/export VALGRIND_TESTS 2014-10-22 Ángel González * src/css-url.c (get_uri_string): Honor the specified length argument. 2014-10-21 Matthew Atkinson (tiny change) * src/http.c (gethttp): Always send Content-Length header when method is POST, PUT, or PATCH even with no post body, as some servers will reject the request otherwise. 2014-10-16 Tim Ruehsen * src/url.c (url_parse): little code cleanup * src/html-url.c (get_urls_html): HTTP header Content-Type charset preceeds http-equiv * src/iri.c (do_conversion): moved iconv code completely into the function * src/iri.c (do_conversion): call url_unescape to fix charset conversion * src/iri.c (remote_to_utf8): use strcasecmp to compare encoding * src/gnutls.c (ssl_connect_wget): do not use SSLv3 except explicitely requested * src/openssl.c (ssl_init): do not use SSLv3 except explicitely requested 2014-10-16 Tim Ruehsen * doc/wget.texi (Download Options): update --secure-protocol description 2014-10-08 Nikolay Morozov Sergey Lvov * src/init.c (cmd_spec_secure_protocol): Add support for TLS v1.1 and TLS v1.2 protocols * src/openssl.c (ssl_init): Add support for OpenSSL engines 2014-10-08 Darshit Shah * testenv/Makefile.am: Fix EXTRA_DIST variable for make distcheck * testenv/server/http/http_server.py (HTTPServer.__init__): Fix how CERTFILE is found when running make dist / make distcheck. 2014-10-06 Tim Ruehsen * tests/Test-iri.px: fixed encodings * tests/Test-iri-forced-remote.px: fixed encodings * tests/Test-iri-percent.px: fixed encodings * tests/Test-idn-meta.px: fixed encodings 2014-10-02 Tim Ruehsen * tests/tests/WgetTests.pm: use filename as default test name * tests/tests/Test-*.px: removed redundant 'name => ...' 2014-10-01 Tim Ruehsen * tests/tests/Makefile.am: remove run-px, fixing dist-check 2014-10-01 Tim Ruehsen * tests/Test--post-file.px: name => "Test--post-file", fixing race condition. * tests/Test-N-no-info.px: name => "Test-N-no-info", fixing race condition. * tests/Test-N-smaller.px: name => "Test-N-smaller", fixing race condition. * tests/Test-c-shorter.px: name => "Test-c-shorter", fixing race condition. * tests/Test-proxy-auth-basic.px: name => "Test-proxy-auth-basic", fixing race condition. * tests/Test-proxied-https-auth.px: removed debug print line. 2014-10-01 Darshit Shah * testenv/Makefile.am: Run the tests in Python's Optimizedmode * testenv/conf/__init__.py (gen_hook): Use try..except instead of if..else * testenv/misc/color_terminal.py: System and check will not change while a test is run. Do not test for them on every invocation of printer() * testenv/server/http/http_server.py: The ssl and re modules are required by specific functions. Load them lazily (HTTPSServer.__init__): Lazy load ssl module here (_handler.parse_range_header): Lazy load re module here (_Handler.get_rule_list): get() can return a default value. Use it (_Handler.guess_type): Same (_Handler.is_authorized): Unused function artefact. Remove (_Handler.reject_headers): Unused function artefact. Remove 2014-09-30 Tim Ruehsen * testenv/test/base_test.py: Add --track-origins=yes to valgrind testing 2014-09-25 Tim Ruehsen * tests/Makefile.am: Modified to use parallel test harness * tests/Test-proxied-https-auth.px: get $top_srcdir from ENV * tests/run-px: removed * tests/WgetTest.pm.in: removed * tests/WgetTest.pm: get $top_srcdir from ENV 2014-09-25 Tim Ruehsen * configure.ac: removed WgetTest.pm.in * .gitignore: removed WgetTest.pm 2014-09-12 Darshit Shah * bootstrap.conf: Add GNULib module mbiter 2014-09-08 Darshit Shah * src/init.c (defaults): Set retr-symlinks to true by default. This changes a default setting of wget. Fixes security bug CVE-2014-4877 2014-09-08 Darshit Shah * src/ftp.c (ftp_retrieve_glob): Also check for invalid entries along with harmful filenames (is_valid_entry): New function. Check if the provided node is a valid entry in a listing file. 2014-09-08 Darshit Shah * doc/wget.texi (symbolic links): Update documentation of retr-symlinks to reflect the new default. Add warning about potential security issues with --retr-symlinks=yes. 2014-08-08 Darshit Shah * testenv/conf/__init__.py: Add extra newline according to PEP8 * testenv/conf/{authentication,expect_header,expected_files,expected_ret_code, files_crawled,hook_sample,local_files,reject_header,response,send_header, server_files,urls,wget_commands}.py: Add docstrings explaining the conf file and how it should be used * testenv/server/http/http_server (InvalidRangeHeader): Clear TODO and eliminate this exception. Use ServerError for all such purposes. (_Handler): Remove reference to InvalidRangeHeader (_handler.parse_range_header): User ServerError instead of InvalidRangeHeader (_Handler.do_GET): Add docstring (_Handler.do_POST): Add docstring. Also create an empty dict for rules if no rules are supplied. Send the Location header as suggested in RFC 7231 (_Handler.do_PUT): Don't pop the server file already. Push it to later in .. (_Handler.send_put): .. Here. If the file exists respond with a 204 No Content message and pop the file for replacement. Do not send the Content-Length, Content-Type headers since PUT requests should not respond with data. (_Handler.parse_auth_header): Fit line within 80 chars (_Handler.check_response): Better visual indent (_Handler.authorize_digest): Better visual indent. (_Handler.expect_headers): Remove unused function (_Handler.guess_type): Fix indentation (HTTPd): Add newline according to PEP8 guidelines (HTTPSd): Fix indentation (StoppableHTTPServer): Add docstring (HTTPSServer): Fix indentation (WgetHTTPRequestHandler): Merge class into _handler. (_Handler): Add docstring (_Handler.parse_range_header): Fix indentation (ServerError): Split exception into separate file ... * testenv/exc/server_error.py: ... Here * testenv/misc/colour_terminal.py: Add docstring, fix indentation * testenv/test/base_test.py: Fix visual indent * testenv/test/http_test.py: Fit within 80 char lines 2014-08-04 Darshit Shah * testenv/conf/server_conf.py: Delete file. Server configuration is now done via the server_conf() method. * testenv/server/http/http_server.py (StppableHTTPServer.server_sett): Delete method required by the above hook (HTTPd.server_sett): Same 2014-08-03 Giuseppe Scrivano * doc/wget.texi (Download Options): Fix texinfo warning. 2014-07-28 Ángel González * contrib: Created contrib folder. * contrib/tsocked-wget: Added wrapper for usage with socks proxy. 2014-07-26 Darshit Shah * testenv/Test-*.py: Remove the '-d' switch from WGET_OPTIONS. * testenv/test/base_test (BaseTest.gen_cmd_line): Add --debug and --no-config to the list of switches passed to wget unconditionally. 2014-07-25 Darshit Shah * .gitignore: Add a gitignore file for the project. 2014-07-23 Darshit Shah * testenv/test/base_test.py (BaseTest.gen_cmd_line): Add support for running all tests through valgrind if the relevant environment variable is set * testenv/conf/expected_ret_code (ExpectedRetCode.__call__): Valgrind returns error code 45 when it detects a memory leak. * testenv/Readme: Update with details about valgrind tests 2014-07-23 Darshit Shah * src/http.c (gethttp): Fix a memory leak when retrying authorization (gethttp): Fix memory leak when trying to parse content disposition headers (http_loop): Assigning a new value to *local)file without freeing the old one causes a memory leak (http_loop): Free the HTTP message and error strings before continuing loop 2014-07-22 Darshit Shah * testenv/(README): Remove old TODO and document SERVER_WAIT variable 2014-07-22 Darshit Shah * configure.ac: Fix broken code for detecting libpsl 2014-07-21 Darshit Shah * src/cookies.c (check_domain_match): Fix a potential memory leak when checking cookie domain names 2014-07-21 Darshit Shah * configure.ac: Fix check for Libpsl 2014-07-21 Daniel Stenberg * src/main.c (print_help): HTTP Method is a part of the Request not Header 2014-07-07 Tomas Hozza * src/iri.c (locale_to_utf8): Fix checking of iconv_open return code. 2014-07-05 Darshit Shah * src/http.c (gethttp): Fix indentation of conditional block (gethttp): Remove unneeded variable 2014-07-05 Darshit Shah * src/cookies.c (check_domain_match): Libpsl requires that all domain names passed to it be in utf8 lower case. 2014-07-03 Darshit Shah * src/wget.h (uerr_t): Remove unused error codes * src/http.c: (http_loop): Remove reference to unused error code 2014-06-30 Giuseppe Scrivano * src/convert.c (local_quote_string): Initialize newname. 2014-06-29 Giuseppe Scrivano * src/warc.c (warc_write_date_header): Avoid out-of-scope variable usage. 2014-06-28 Giuseppe Scrivano * src/Makefile.am (wget_SOURCES): Remove space-tab indentation. 2014-06-28 Giuseppe Scrivano * cfg.mk (local-checks-to-skip): Remove some checks. 2014-06-27 Giuseppe Scrivano * cfg.mk (VC_LIST_ALWAYS_EXCLUDE_REGEX): Ignore msdos/* and vms/* from syntax-check. 2014-06-22 Giuseppe Scrivano * src/warc.c [HAVE_UUID_CREATE]: Include (warc_uuid_str) [HAVE_UUID_CREATE]: Use uuid_create and uuid_to_string to generate the UUID. Reported by: Alex Zimnitsky 2014-06-22 Giuseppe Scrivano * configure.ac: Add check for uuid_create. 2014-06-22 Darshit Shah (tiny change) * src/progress.c (create_image): Align percentage download output better. 2014-06-22 Darshit Shah * testenv/(conf.files_crawled): diff is a set object and needs explicit str conversion. 2014-06-21 Nikita Vetrov (tiny change) * src/openssl.c: Add one-time call OPENSSL_config in ssl_init. 2014-06-21 Giuseppe Scrivano * src/mswindows.c (fake_fork_child): Fix build error. Reported by: Gisle Vanem . 2014-06-19 Giuseppe Scrivano * src/mswindows.c (fake_fork_child): Revert dinamic allocation of info->lfilename. Reported by: Gisle Vanem . 2014-06-16 Giuseppe Scrivano * src/http.c (gethttp): Set "Connection: Keep-Alive" with keep-alive connections when using a proxy too. Reported by: Thorsten Schroeteler . 2014-06-16 Darshit Shah * src/test.c: Include locale.h header for gettext. 2014-06-16 Darshit Shah * src/main.c (print_help, print_version): Mark as noreturn. * src/utils.c (memfatal, abort_run_with_timout): Same 2014-06-11 Giuseppe Scrivano * tests/Makefile.am: Remove @VAR@ with $FOO. * tests/FTPTest.pm: Remove terminating empty lines. * tests/HTTPServer.pm: Likewise. * tests/HTTPTest.pm: Likewise. * tests/Test--httpsonly-r.px: Likewise. * tests/Test--no-content-disposition-trivial.px: Likewise. * tests/Test--no-content-disposition.px: Likewise. * tests/Test--spider-fail.px: Likewise. * tests/Test--spider-r--no-content-disposition-trivial.px: Likewise. * tests/Test--spider-r--no-content-disposition.px: Likewise. * tests/Test--spider-r-HTTP-Content-Disposition.px: Likewise. * tests/Test--spider-r.px: Likewise. * tests/Test--spider.px: Likewise. * tests/Test--start-pos--continue.px: Likewise. * tests/Test--start-pos.px: Likewise. * tests/Test-E-k-K.px: Likewise. * tests/Test-E-k.px: Likewise. * tests/Test-HTTP-Content-Disposition-1.px: Likewise. * tests/Test-HTTP-Content-Disposition-2.px: Likewise. * tests/Test-HTTP-Content-Disposition.px: Likewise. * tests/Test-N--no-content-disposition-trivial.px: Likewise. * tests/Test-N--no-content-disposition.px: Likewise. * tests/Test-N-HTTP-Content-Disposition.px: Likewise. * tests/Test-N-current.px: Likewise. * tests/Test-N-no-info.px: Likewise. * tests/Test-N-old.px: Likewise. * tests/Test-N-smaller.px: Likewise. * tests/Test-N.px: Likewise. * tests/Test-O--no-content-disposition-trivial.px: Likewise. * tests/Test-O--no-content-disposition.px: Likewise. * tests/Test-O-HTTP-Content-Disposition.px: Likewise. * tests/Test-O-nc.px: Likewise. * tests/Test-O-nonexisting.px: Likewise. * tests/Test-O.px: Likewise. * tests/Test-Restrict-Lowercase.px: Likewise. * tests/Test-Restrict-Uppercase.px: Likewise. * tests/Test-auth-basic.px: Likewise. * tests/Test-auth-no-challenge-url.px: Likewise. * tests/Test-auth-no-challenge.px: Likewise. * tests/Test-auth-with-content-disposition.px: Likewise. * tests/Test-c-full.px: Likewise. * tests/Test-c-partial.px: Likewise. * tests/Test-c-shorter.px: Likewise. * tests/Test-c.px: Likewise. * tests/Test-cookies-401.px: Likewise. * tests/Test-cookies.px: Likewise. * tests/Test-ftp--start-pos.px: Likewise. * tests/Test-ftp-bad-list.px: Likewise. * tests/Test-ftp-iri-disabled.px: Likewise. * tests/Test-ftp-iri-fallback.px: Likewise. * tests/Test-ftp-iri-recursive.px: Likewise. * tests/Test-ftp-iri.px: Likewise. * tests/Test-ftp-list-Multinet.px: Likewise. * tests/Test-ftp-list-UNIX-hidden.px: Likewise. * tests/Test-ftp-list-Unknown-a.px: Likewise. * tests/Test-ftp-list-Unknown-hidden.px: Likewise. * tests/Test-ftp-list-Unknown-list-a-fails.px: Likewise. * tests/Test-ftp-list-Unknown.px: Likewise. * tests/Test-ftp-pasv-fail.px: Likewise. * tests/Test-ftp-recursive.px: Likewise. * tests/Test-ftp.px: Likewise. * tests/Test-i-ftp.px: Likewise. * tests/Test-i-http.px: Likewise. * tests/Test-idn-cmd-utf8.px: Likewise. * tests/Test-idn-cmd.px: Likewise. * tests/Test-idn-headers.px: Likewise. * tests/Test-idn-meta.px: Likewise. * tests/Test-idn-robots-utf8.px: Likewise. * tests/Test-idn-robots.px: Likewise. * tests/Test-iri-disabled.px: Likewise. * tests/Test-iri-forced-remote.px: Likewise. * tests/Test-iri-list.px: Likewise. * tests/Test-iri-percent.px: Likewise. * tests/Test-iri.px: Likewise. * tests/Test-k.px: Likewise. * tests/Test-meta-robots.px: Likewise. * tests/Test-nonexisting-quiet.px: Likewise. * tests/Test-noop.px: Likewise. * tests/Test-np.px: Likewise. * tests/Test-proxy-auth-basic.px: Likewise. * tests/Test-restrict-ascii.px: Likewise. * tests/Test-stdouterr.px: Likewise. * tests/WgetTest.pm.in: Likewise. 2014-06-11 Giuseppe Scrivano * src/ftp.c: Replace main() with main in comments. * src/http.c: Likewise. * src/init.c: Likewise. * src/cmpt.c (day_of_the_week): Remove repeated word in a comment. * src/utils.c: Remove repeated word in a comment. * src/gnutls.c: Do not include . * src/connect.c: Do not depend on always defined macros. * src/convert.c: Likewise. * src/css-url.c: Likewise. * src/hash.c: Likewise. * src/mswindows.h: Likewise. * src/netrc.c: Likewise. * src/progress.c: Likewise. * src/sysdep.h: Likewise. * src/utils.c: Likewise. * src/wget.h: Likewise. * src/Makefile.am: Remove @VAR@ with $FOO. * src/DESCRIP_MODS.MMS: Remove terminating empty lines. * src/DESCRIP_SRC.MMS: Likewise. * src/convert.c: Likewise. * src/exits.c: Likewise. * src/http.c: Likewise. * src/init.c: Likewise. * src/iri.c: Likewise. * src/res.c: Likewise. * src/spider.c: Likewise. * src/test.c: Likewise. * src/test.h * src/url.c: Likewise. * src/utils.c: Likewise. * src/vms.c: Likewise. * src/decc_ver.c (main): Ensure bindtextdomain is called. * src/hash.c (main): Likewise. * src/html-parse.c: Likewise. * src/netrc.c: Likewise. * src/test.c: Likewise. * src/trunc.c: Likewise. 2014-06-11 Giuseppe Scrivano * msdos/Makefile.DJ: Remove terminating empty lines. * msdos/Makefile.WC: Likewise. * msdos/msdos.c: Likewise. 2014-06-11 Giuseppe Scrivano * doc/Makefile.am: Remove @VAR@ with $FOO. * doc/fdl.texi: Remove terminating empty lines. 2014-06-11 Giuseppe Scrivano * NEWS: Remove repeated word. * po/POTFILES.in: Add lib/regcomp.c. * configure.ac: Do not depend on always defined macros. * vms/vms.h: Likewise. * Makefile.am: Remove @VAR@ with $FOO. * util/Makefile.am: Likewise. * ABOUT-NLS: Remove terminating empty lines. * MAILING-LIST: Likewise. * util/rmold.pl: Likewise. * util/trunc.c (main): Ensure bindtextdomain is called. 2014-06-10 Giuseppe Scrivano * src/wget.h: Remove trailing whitespaces. * src/retr.c (getproxy): Return a dinamically allocated string and... (retrieve_from_file, retrieve_url, url_uses_proxy): ...fix the caller to handle it. * src/init.c (home_dir): Replace strncpy with strdup. * src/mswindows.c (struct fake_fork_info): Make lfilename a pointer. (fake_fork_child): Replace strncpy with strdup. * src/http.c (ensure_extension): Replace strncpy with memcpy, not much better but make "make syntax-check" happy and we know the size. * src/ftp.c (getftp): Add parameter last_expected_bytes. (ftp_loop_internal): Pass parameter last_expected_bytes to getftp. * src/ftp-basic.c: Remove declaration of ftp_last_respline. (ftp_response): Do not set ftp_last_respline. * src/css-url.c (get_uri_string): Replace strncpy with strdup. * src/vms.c (set_vms_name): Replace strncpy with strdup. * src/exits.c: Move WGET_EXIT_* definitions to... * src/exits.h: ...here. Add WGET_EXIT_GENERIC_ERROR, WGET_EXIT_PARSE_ERROR. Remove WGET_EXIT_MINIMUM. * src/init.c: Fix calls to exit(). * src/log.c: Likewise. * src/main.c: Likewise. * src/mswindows.c: Likewise. * src/netrc.c: Likewise. * src/utils.c: Likewise. * src/warc.c: Likewise. * src/Test-stdouterr.px: Likewise. 2014-06-10 Giuseppe Scrivano * doc/wget.texi: Remove trailing whitespaces. * doc/fdl.texi: Copy from gnulib. 2014-06-10 Giuseppe Scrivano * build-aux/build_info.pl: Remove trailing whitespaces. * configure.ac: Likewise. 2014-06-08 Giuseppe Scrivano * src/main.c: Make `program_name' not static. * src/cookies.c [HAVE_PSL]: Include only when HAVE_PSL is defined. 2014-05-30 Darshit Shah * src/cookies.c (check_domain_match): Use libpsl to check if the cookie domain is valid. Also remove unneeded test for numeric addresses. 2014-05-30 Darshit Shah * src/connect.{c,h}, convert.{c,h}, cookies.{c,h}, ftp-ls.c, ftp.h, gettext.h, hash.h, host.h, html-parse.h, html-url.h, http.c, init.c, main.c, mswindows.c, netrc.h, openssl.c, options.h, ptimer.h, recur.c, retr.c, sysdep.h, url.h, vms.c, warc.c, wget.h: Whitespace and formatting changes only. 2014-05-30 Darshit Shah * configure.ac: Allow compilation without libpsl. * README.checkout: Add libpsl as a dependency. 2014-05-24 Giuseppe Scrivano * gnulib: update module. 2014-05-24 Darshit Shah * doc/wget.texi: Document noscroll parameter to progress=bar 2014-05-20 Darshit Shah * src/utils.c (human_readable): Add new parameters, acc and decimals for the function. acc decides the number under which decimal values are shown for a certain value, while decimals decides the number of decimal digits displayed * src/utils.h (human_readable): Update declartion of the function * src/ftp.c (print_length): Update call to human_readable * src/http.c (gethttp): Same * src/init.c (defaults): Enable scrolling filenames by default * src/main.c (main): Update call to human_readable * src/options.h (options): Add new option noscroll * src/progress.c (create_image): Update the look of the progress bar. Human readable download size by default and add support for noscroll. (dot_set_params, bar_set_params): Change paramter to char * since strtok needs to modify the string. (bar_set_params): Add support for noscroll parameter to bar. 2014-05-03 Tim Ruehsen * src/retr.c (retrieve_url): fixed memory leak 2014-05-03 Tim Ruehsen * src/ftp-ls.c (ftp_parse_vms_ls): Explicitly typecast strlen's output * src/ftp.c (getftp): Prevent declaration of shadow variable * src/html-url.c (tag_handle_base, tag_handle_form, tag_handle_link, tag_handle_meta): Compiler directive to ignore unused variable * src/html-url.h (cleanup_html_url): Add function declaration * src/http-ntlm.c (ntlm_input, mkhash): Use ssize_t to declare variables that store sizes. (mkhash): Explicitly typecast output of c_toupper (short_pair): Add typecasts to prevent compiler warnings (ntlm-output): Fix datatypes of various variables * src/http.c (gethttp): Prevent declaration of shadow variable err (gethttp): remove unreachable code (test_parse_content_disposition): Fix variable declarations and use countof() macro * src/init.c (run_command): Prevent declaration of ghost variable (cmd_string, cmd_string_uppercase, cmd_file, cnd_vector, cmd_directory_vector, cmd_spec_dirstruct, cmd_spec_header, cmd_spec_warc_header, cmd_spec_htmlify, cmd_spec_mirror, cmd_spec_prefer_family, cmd_spec_progress, cmd_spec_recursive, cmd_spec_regex_type, cmd_spec_restrict_file_names, cmd_spec_report_speed, cmd_spec_timeout, cmd_spec_useragent, cmd_spec_verbose): Add compiler directive to ignore unused variable (cleanup_html_url, spider_cleanup): Remove declarations (test_commands_sorted): Cleanup code (test_cmd_spec_restrict_file_names): Use correct data types * src/iri.c (remote_to_utf8): Use more verbose variable name * src/main.c (init_switches): Prevent declaration of ghost variable * src/netrc.c (netrc_cleanup): Create cleanup function on common naming scheme * src/netrc.h (netrc_cleanup): Declare function * src/openssl.c (openssl_write, openssl_errstr): Compiler directive to ignore unused parameter (openssl_errstr): Explcicitly typecasr output of ASN1_STRING_length * src/options.h (struct options): Declare includes and excludes as const char ** * src/progress.c (progress_interactive_p, progress_handle_sigwinch): Compiler directive to ignore unused parameter * src/res.c (test_is_robots_txt_url): Fix datatypes * src/retr.c (line_terminator): Add compiler directive for ignoring unused paramter. Remove unused variable. * src/spider.h (spider_cleanup): Declare function * src/test,c (main): declare unused paramter * src/test.h (test_*): Declare functions * src/url.c (url_string): Explicit typecast of password strings (run_test): Declare *test, struct tests and *expected_result as const (test_path_simplify, test_append_uri_pathl, test_are_urls_equal): Fix datatypes * src/utls.c (fork_to_background): Be more verbose when errors occur (dir_matches_p, test_dir_matches_p): Declare char * as const as required (base64_encode, base64_decode, get_max_length): Set correct return type (match_pcre_regex, match_posix_regex): Use correct datatypes and typecasts (test_subdir_p): static const struct * src/utils.h (base64_encode, base64_decode, get_max_length): Fix return type * src/warc.c (ward_write_cdx_record): unused parameter (struct hash_table *warc_cdx_dedup_table): Declare as static 2014-05-01 Darshit Shah (tiny change) * src/progress.c (dot_finish): Do not print extra newlines when not in verbose mode. (Purely aesthetic change) (get_eta): Add extra space when eta is printed. (create_image): Remove erroneous space from being added to progress bar when filename > MAX_FILENAME_LEN (create_image): Remove extra space before printed download speeds Make the filename a fixed width column. 2014-05-01 Darshit Shah * doc/wget.texi: Add documentation for --show-progress 2014-05-01 Benjamin Goose * src/http.c: Fix small memory leak 2014-04-22 Tim Ruehsen * src/http.c (gethttp): Fix 204 response handling * tests/Test-204.px: added file * tests/run-px: added Test-204.px * tests/Makefile.am: added Test-204.px 2014-04-22 Giuseppe Scrivano * tests/Makefile.am (EXTRA_DIST): Add missing Test--start-pos.px, Test-ftp--start-pos.px and Test--start-pos--continue.px. 2014-04-19 Darshit Shah * src/log.h (log_options): Add new logging options, LOG_PROGRESS. All progress bar related output should use LOG_PROGRESS. * src/log.c (CHECK_VERBOSE): Implement LOG_PROGRESS output * src/progress.c (dot_create, print_row_stats, dot_update, dot_finish, bar_finish, display_image): Output progress information through LOG_PROGRESS (progress_implementation, dot_create, bar_create, progress_create): The progress bar create functions accept an extra paramter for the filename of the local file (bar_progress): Add new variable to store filename of currently downloading file (bar_finish): Aesthetic change. Print two newlines if in verbose mode, else only one. (MACRO): Define new macro, MIN (create_image): Implement displaying filename in progress bar output Filename tick implementation copied from Giuseppe's patch on parallel-wget * src/progress.h (progress_create): Accept another parameter for filename * src/http.c (gethttp): Remove unnecessary conditional (read_response_body): Send local filename to fd_read_body so that it can be printed on the progress bar * src/main.c (option_data): Add new switch, --show-progress (main): If in verbose output, show progress bar by default (main): Set progress implemetation when displaying progress bar (no_prefix): Increase buffer size to 2048 to prevent overflows * src/init.c (commands): Add new command, showprogress (defaults): By default initialize show_progress to false * src/options.h (options): Add new option, show_progress * src/retr.c (fd_read_body): Accept new parameter for filename of currently downlaoding file (fd_read_body): Create and update progress bar when opt.show_progress is set (fd_read_body): Display progress information in Windows console titlebars * src/retr.h (fd_read_body): Update declaration * src/ftp.c (getftp): Send filename of the local file so that it can be printed with the progress bar. 2014-03-26 Darshit Shah * src/ftp.c (getftp): Rearrange parameters to fix compiler warning * src/utils.c (get_dir_matches_p): Do not pass a const char** to a function that expects char** 2014-03-25 Daniel Stenberg * src/url.c (shorten_length): Remove unused function. 2014-03-19 Yousong Zhou * src/init.c, main.c, options.h: Add option --start-pos for specifying start position of a download. * src/http.c: Utilize opt.start_pos for HTTP download. * src/ftp.c: Utilize opt.start_pos for FTP retrieval. 2014-03-13 Zihang Chen * testenv/test: (new package) package for test case classes * testenv/WgetTest.py: Split into test/base_test.py and test/http_test.py. * testenv/Test-*.py: Optimize the imports according to changes of WgetTest.py 2014-03-13 Zihang Chen * testenv/server: (new package) package for the server classes * testenv/server.http: (new package) package for HTTP server * testenv/server.ftp: (new package) package for FTP server * testenv/HTTPServer.py: Move to server/http/http_server.py. Also change the CERTFILE to '../certs/wget-cert.pem'. * testenv/FTPServer.py: Move to server/ftp/ftp_server.py. * testenv/WgetTest.py: Optimize import respect to the server classes. 2014-03-13 Zihang Chen * testenv/misc: (new package) package for miscellaneous modules * testenv/ColourTerm.py: Move to package misc and rename to colour_terminal.py, add print_color functions to reduce the use of string literals like "BLUE", "RED" etc. * testenv/WgetTest.py: (CommonMethods.Server_setup): Change invocation to printer to print_blue. (CommonMethods.FilesCrawled): Change invocation to printer to print_red. (HTTPTest.__init__): Change invocations to printer to print_red and print_green respectively. 2014-03-13 Zihang Chen * testenv/exc: (new package) package for miscellaneous exceptions * testenv/WgetTest.py: Move TestFailed to exc/test_failed.py. 2014-03-13 Zihang Chen * testenv/conf: (new package) package for rule classes and hook methods * testenv/WgetTest.py: (CommonMethods.Authentication): Move to conf/authentication.py. (CommonMethods.ExpectHeader): Move to conf/expect_header.py. (CommonMethods.RejectHeader): Move to conf/reject_header.py. (CommonMethods.Response): Move to conf/response.py. (CommonMethods.SendHeader): Move to conf/send_header.py. (CommonMethods.ServerFiles): Move to conf/server_files.py. (CommonMethods.LocalFiles): Move to conf/local_files.py. (CommonMethods.ServerConf): Move to conf/server_conf.py. (CommonMethods.WgetCommands): Move to conf/wget_commands.py. (CommonMethods.Urls): Move to conf/urls.py. (CommonMethods.ExpectedRetcode): Move to conf/expected_retcode.py. (CommonMethods.ExpectedFiles): Move to conf/expected_files.py. (CommonMethods.FilesCrawled): Move to conf/files_crawled.py. (CommonMethods.__check_downloaded_files): Rename to _check_downloaded_files, so that the method is callable from outside the class. (CommomMethods.get_server_rules): Modify so that it utilizes the conf package. (HTTPTest): Add a method hook_call(configs, name) to reduce duplications in pre_hook_call, call_test and post_hook_call utilizing the conf package. * testenv/conf/hook_sample.py: (new file) sample for hooks * testenv/conf/rule_sample.py: (new file) sample for rules * testenv/REAMDE: Update sections about customizing rules and hooks. 2014-03-13 Zihang Chen * testenv/base_test.py: (CommonMethods): Rename to BaseTest. (BaseTest): Implement __init__ method where the class-wide variables are initialized. Also variable names like `xxx_list` is renamed to its plural form, e.g. `server_list` => `servers`. (BaseTest.init_test_env): Remove name argument due to its unnecessarity. (BaseTest.get_test_dir): Because the path of the test directory is needed in multiple methods, this method is implemented. (BaseTest.get_domain_addr): Rewrite the return statement utilizing str formatting (which is more Pythonic). (BaseTest.get_cmd_line): Rename to gen_cmd_line. Change the variables with capitcal characters to lower ones. Also, the nested for loop is rewritten to a plain loop using the zip function. (BaseTest.__gen_local_filesys): Rename to gen_local_fs_snapshot. Move to ExpectedFiles in conf/expected_files.py and is marked as a static method. Refactor to a less verbose implementation. (BaseTest._check_downloaded_files): Rename to __call__ to agree with the invocation in test case classes. Move to ExpectedFiles in conf/expected_files.py. (BaseTest.get_server_rules): Refactor to a more Pythonic form utilizing dict.items() and is marked static. (BaseTest.stop_server): (new method) an abstract method which should stop the currently using servers. (BaseTest.instantiate_server_by): (new method) an abstract method which should instantiate a server instance according to the given argument. (BaseTest.__enter__): (new method) method which initialize the context manager (BaseTest.__exit__): (new method) method that finilize the context manager and deal with the exceptions during the execution of the with statement, subclasses can override this method for extensibility * testenv/http_test.py: (HTTPTest.__init__): Add call to super.__init__. Default values of pre_hook, test_params, post_hook are set to None to avoid a subtle bug of Python. Argument servers is renamed to protocols. (HTTPTest.Server_setup): Move to BaseTest and rename to server_setup. Calls to pre_hook_call, call_test, post_hook_call are removed. (HTTPTest.hook_call, pre_hook_call, call_test, post_hook_call): Move to BaseTest for that both HTTP test cases and FTP test cases may use these methods. (HTTPTest.init_HTTP_Server, init_HTTPS_Server): Merge and rename to instantiate_server_by to implement the abstract method in BaseTest. (HTTPTest.stop_HTTP_Server): Rename to stop_server to implement the abstract method in BaseTest. Also, pull out the part where remaining requests are gathered into a new method request_remaining. (BaseTest.act_retcode): Rename to ret_code because ExpectedRetCode is moved out from BaseTest, so the name act_retcode is actually a bit verbose. * testenv/conf/expected_ret_code.py: (ExpectedRetCode.__call__): Rewrite the str into a more readable form. * testenv/conf/files_crawled.py: (FilesCrawled.__call__): Refactor this method into a more Pythonic form utilizing the zip function. * testenv/conf/local_files.py: (LocalFiles__call__): Rewrite this method with the recommended with statement. * testenv/conf/server_conf.py: (ServerConf.__call__): Rewrite this method due to BaseTest.server_list is renamed to BaseTest.servers. * testenv/conf/server_files.py: (ServerFiles.__call__): Refactor the nested for loop into a plain one utilizing the zip function. * testenv/conf/urls.py: (URLs): Rename url_list to urls. * testenv/conf/wget_commands.py: (WgetCommands): Rename command_list to commands, rename test_obj.options to test_obj.wget_options. * testenv/Test--https.py, Test-Proto.py, Test-Parallel-Proto.py: Argument servers is changed to protocols due to change in the signature of HTTPTest.__init__. 2014-03-13 Zihang Chen * testenv/WgetTest.py: Move WgetFile to package misc. * testenv/README: Modify documentation respect to WgetFile. * testenv/Test-*.py: Optimize imports about WgetFile. 2014-03-13 Zihang Chen * testenv/Test-Proto.py: Fix a typo (line 71: server to servers). 2014-03-04 Giuseppe Scrivano * src/http.c (modify_param_value, extract_param): Aesthetic change. 2014-02-24 Yousong Zhou (tiny change) * tests/tests/Test--httpsonly-r.px: Add feature constraint on https. 2014-02-24 Giuseppe Scrivano * gnulib: update module. Reported by: Darshit Shah . 2014-02-14 Vladimír Pýcha (tiny change) * src/http.c (parse_content_disposition, extract_param) (append_value_to_filename, digest_authentication_encode): URL-decode the filename parameter of Content-Disposition HTTP header if it is encoded. This is related to --content-disposition. New parameter of extract_param(), "is_url_encoded". Add argument NULL to the call of extract_param() in digest_authentication_encode(). * src/http.h: Add the new parameter to the declaration of extract_param(). * src/cookies.c (parse_set_cookie, test_cookies): Add argument NULL to the calls of extract_param(). * src/url.c (url_unescape): Remove "static" modifier. * src/url.h: Add declaration of url_unescape(). 2014-02-13 Yousong Zhou * tests/Wget.pm.in: Exclude existing files from the check of unexpected downloads. 2014-02-13 Yousong Zhou * tests/Test--start-pos.px: Test --start-pos for HTTP downloads. * tests/Test-ftp--start-pos.px: Test --start-pos for FTP downloads. * tests/Test--start-pos--continue.px: Test the case when --start-pos and --continue were both specified. 2014-02-13 Yousong Zhou * tests/FTPServer.pm: Fix the handling of TYPE command and avoid endless loop when doing binary mode RETR. 2014-02-10 Yousong Zhou * doc/wget.texi: Add documentation for --start-pos. 2014-02-06 Giuseppe Scrivano * src/main.c (print_version): Move copyright year out of the localized string and update it. 2014-02-06 Giuseppe Scrivano * configure.ac: Update copyright years. 2014-01-29 Darshit Shah * src/main.c: Remove pre-processor variable WHEN_DEBUG (option_data[]): Do not fail on --debug even if debug support is not compiled in. (main): Explicitly set opt.debug to false in case debugging support was not compiled. * src/init.c (commands[]): Support --debug wven when support is not compiled in. * src/options.h: Same 2014-01-23 Lars Wendler (tiny change) * tests/Test--post-file.px: Do not fail when wget has no debug support. 2014-01-17 Darshit Shah * src/init.c (commands[]): Add --no-config. * src/options.h: Same. * src/main.c (option_data[]): Same. (print_help): Same. (main): If --no-config is set, then do not read the wgetrc files. 2014-01-05 Håkon Vågsether (tiny change) * src/http.c (http_loop): Fix checking the URL length when filename is specified. 2014-01-02 Darshit Shah * testenv/WgetTest.py (CommonMentods.exec_wget): Wait for n seconds before calling the Wget executable. 2014-01-02 Darshit Shah * testenv/Makefile.am: Add new Test--https.py to list of tests and EXTRA_DIST. Also replace all tabs with spaces in file for conformity. * testenv/Test--https.py: New test to check if Wget works correctly with HTTPS servers * testenv/HTTPServer.py: Import new modules for use in HTTPS Servers (HTTPSServer): New class that generates a SSL-wrapped socket for use in a HTTPS Server. (HTTPSd): HTTPS daemon class. Analogous to the HTTPd class * testenv/WgetTest.py: Define global variables HTTP and HTTPS to reflect Server types (CommonMethods.exec_wget): Add the protocol information to the URL before passing it to wget (HTTPTest.__init__): Edit syntax. The servers variable now accepts a list of servers defined by their type. E.g. HTTP, HTTPS. (HTTPTest.Server_setup): Reflect change in type of variable servers. However, we maintin the value of self.servers to allow most of the code to remain unchanged. (HTTPTest.init_HTTPS_Server): Initialize a HTTPS Server * testenv/Test-Parallel-Proto.py: Edit to reflect slight change in Test Fiel Syntax. * testenv/Test-Proto.py: Same 2013-12-29 Giuseppe Scrivano * src/init.c (home_dir): Remove useless 'if'. * src/warc.c (warc_start_new_file): Likewise. (warc_process_cdx_line): Likewise. (warc_write_response_record): Likewise. 2013-12-29 Giuseppe Scrivano * doc/wget.texi: Update to GFDL 1.3. 2013-12-27 Darshit Shah * testenv/WgetTest.py: Add modeline (CommonMethods.ServerConf): New pre-test hook that sets BaseHTTPRequestHandler class variables in all available servers * testenv/HTTPServer.py (HTTPd.ServerConf): Call the respective method in the Server to set the class variables (StoppableHTTPServer.server_sett): Set the handler class variables 2013-12-26 Tim Ruehsen * src/gnutls.c (ssl_connect_wget): Fix connect timeout failure 2013-12-26 Darshit Shah * testenv/WgetTest.py (HTTPTest.call_test): Correct the call to stop_HTTP_Server. 2013-12-25 Darshit Shah * testenv/WgetTest.py (CommonMehtods.exec_wget): Catch and handle exception if the Wget executable is not found at src/wget (HTTPTest.call_test): In case of error during execution, remove all existing servers before quitting 2013-12-22 Mike Frysinger * doc/sample.wgetrc: add links to the manual. 2013-12-22 Giuseppe Scrivano * gnulib: add git submodule. 2013-12-15 Darshit Shah * testenv/WgetTest.py (HTTPTest.HTTP_setup): Rename to Server_setup so it can be easily reused for other non-HTTP servers. (HTTPTest.__init__): Call Server_setup instead of HTTP_setup (HTTPTest.Server_setup): Split into three more functions, that handle pre-hooks, test execution and post-hooks respectively. (HTTPTest.pre_hook_call): Set up and execute the pre-test hooks. Code split from HTTPTest.Server_setup (HTTPTest.call_test): Execute wget and log exit code. Code split from HTTPTest.Server_setup (HTTPTest.post_hook_call): Set up and execute post-test hooks. Code split from HTTPTest.Server_setup 2013-11-04 Darshit Shah * tests/Makefile.am: Add new tests introduced in last commit to EXTRA_DIST. Reported by: Andrea Urbani 2013-11-02 Giuseppe Scrivano * src/progress.c (struct progress_implementation): New method draw. (bar_update): Split into... (bar_draw): ...this new function. (dot_update): Split into... (dot_draw): New function. (progress_update): Also invoke draw on `current_impl'. 2013-11-02 Giuseppe Scrivano * src/http.c (gethttp): Increase max header value length to 512. 2013-10-30 Giuseppe Scrivano * src/http.c (skip_short_body): Remove assert which is always true. Reported by: David Binderman 2013-10-26 Bykov Aleksey * src/utils.c (match_tail): Fix cookies reject * src/ftp-ls.c (ftp_parse_unix_ls): Fix parsing month name in uppercase 2013-10-22 Ángel González * bootstrap.conf (gnulib_modules): Add module xstrndup. 2013-10-17 Andrea Urbani * tests/FTPServer.pm (GetBehavior): new routine. * tests/FTPServer.pm (get_list): new parameter to skip hidden files * tests/Test-ftp-list-Multinet.px: Test LIST on a "UNIX MultiNet Unix Emulation" system that returns an empty content when "LIST -a" is requested (probably because no "-a" files exist) * tests/Test-ftp-list-Unknown.px: Test LIST on a "Unknown ftp service" system that returns an empty content when "LIST -a" is requested (probably because no "-a" files exist) * tests/Test-ftp-list-Unknown-a.px: Test LIST on a "Unknown ftp service" system that recognises "LIST -a" as "give me the -a file" and there is a "-a" file + other two files. "LIST -a" will return only "-a", "LIST" all the three files. * tests/Test-ftp-list-Unknown-hidden.px: Test LIST on a "Unknown ftp service" system that recognises "LIST -a" as an "UNIX Type: L8" system (show me also the hidden files) and there is an hidden file. * tests/Test-ftp-list-Unknown-list-a-fails.px: Test LIST on a "Unknown ftp service" system that raises an error on "LIST -a" command. * tests/Test-ftp-list-UNIX-hidden.px: Test LIST on a "UNIX Type: L8" system that recognises "LIST -a" as "show me also the hidden files" and there is an hidden file. 2013-10-17 Andrea Urbani * src/ftp.c (getftp): force "LIST" or "LIST -a" according to the remote system type. If the remote system is not known, it tries, only the first time, "LIST -a", after "LIST" and decides which one to use. For more information look for "__LIST_A_EXPLANATION__" * src/ftp.h (enum ustype): New ustype enum. * src/ftp.h (ftp_syst): New enum ustype *unix_type parameter. * src/ftp.h (ftp_list): Removed enum stype rs parameter, added. bool avoid_list_a, bool avoid_list, bool *list_a_used parameters. * src/ftp.h (wget_ftp_fstatus): New AVOID_LIST_A, AVOID_LIST, LIST_AFTER_LIST_A_CHECK_DONE values. * src/ftp-basic.c (ftp_list): it handles the new avoid_list_a, avoid_list and list_a_used parameters. * src/ftp.h (ftp_syst): it stores information about the "215 UNIX" systems into the new unix_type parameter. 2013-10-14 Giuseppe Scrivano * testenv/Makefile.am (XFAIL_TESTS): Remove Test--spider-r.py. 2013-10-10 Giuseppe Scrivano * tests/Test-idn-robots-utf8.px: Remove -H. * tests/Test-idn-cmd.px: Likewise. * tests/Test-idn-cmd-utf8.px: Likewise. Suggested by: Tim Ruehsen 2013-10-10 Giuseppe Scrivano * src/url.c (url_parse): Try to convert UTF-8 URLs to IDN. * src/html-url.c (append_url): Parse URLs specifying an IRI structure. 2013-10-07 Tim Ruehsen * tests/Test-idn-robots.px: added punycoded and escaped URLs to follow removed -H 2013-10-06 Tim Ruehsen * doc/wget.texi: add/explain quoting of wildcard patterns 2013-10-06 Giuseppe Scrivano * testenv/Makefile.am (EXTRA_DIST): Distribute test files. 2013-09-16 Darshit Shah * testenv/README: Update documentation 2013-09-14 Darshit Shah * testenv/HTTPServer.py (StoppableHTTPServer): Define object variable request_headers which stores a list of requests received by the server (StoppableHTTPServer.get_req_headers): Return the list of Request Headers stored by the server (_Handler.do_HEAD): Send the Request MEthod string for identification (_Handler.do_GET): Same (_Handler.__log_request): Log the request in Request_Headers list (_Handler.send_head): Make a call to __log_request * testenv/Test--spider-r.py: Add new list, Request_List, which contains all the requests that Wget is expected to send. This will allow for fine-grained tests on recursive downloading. * testenv/WgetTest.py (CommonMethods.FilesCrawled): New Post-Test Hook, that ensures that all the expected Files on the server were accessed as expected. (HTTPTest.stop_HTTP_server): On stopping server, asks it to respond with list of all requests it received. 2013-09-13 Tim Ruehsen * src/recur.c (download_child_p): fix compile error when configured using --without-ssl. 2013-09-13 Tim Ruehsen * configure.ac: added a summary of build options fixed some indentations removed the unconditionally adding of libz with --with-ssl removed -lgcrypt and -lgpg-error for gnutls 2013-09-13 Darshit Shah * testenv/WgetTest.py (CommonMethods._replace_substring): New method that will replace a substring delimited by {{ }} characters by the value of self. variable (CommonMethods.WgetCommands): Use the _replace_substring () call to replace the substrings in the the command line. (CommonMethods.ServerFiles): Run the _replace_substring () method on the File contents too. 2013-09-13 Darshit Shah * testenv/Test--spider-r.py: Test retrieval in recursive spider mode. * testenv/Makefile.am: add new file 2013-09-13 Darshit Shah * testenv/HTTPServer.py (_Handler.do_HEAD): If requested path is /, respond with /index.html (_Handler.do_HEAD): Smartly guess value of Content-Type Header from file extension (_Handler.guess_type): Use a preset list of extensions and Content-Type strings. If the extension matches one in the list, use that string, else default to "text/plain" 2013-09-11 Darshit Shah * testenv/WgetTest.py (CommonMethods.exec_wget): Expect domain_list instead of domain. (CommonMethods.get_cmd_line): Same. Generate command line by prepending to each file it's respective domain string (CommonMethods.ServerFiles): Generate file_list and server_rules for each Server and set the config details (HTTPTest): New named parameter, servers which signifies number of servers to spawn (HTTPTest.HTTP_setup): This method now takes servers as a new parameter. Instead of storing server and domain, we now store server_list and domain_list. Each server must be initialized through a loop. (HTTPTest.stop_HTTP_server): Stop all servers in a loop. * testenv/Test-Parallel-Proto.py: Prototype test file for multiple servers. 2013-09-10 Darshit Shah * testenv/WgetTest.py (HTTPTest.stop_HTTP_server): With the threaded servers, we can simply use the socketserver.shutdown() method to close the server instead of sending a QUIT command * testenv/HTTPServer.py (StoppabelHTTPServer.serve_forever): Delete method. No need to override this method anymore. (WgetHTTPRequestHandler.do_QUIT): No longer required (HTTPd): Rename self.server to self.server_inst to reduce ambiguity when referenced from WgetTest 2013-09-09 Tim Ruehsen * src/gnutls.c (ssl_connect_wget): changed checking of option "PFS" to be better prepared for some kinds of backports. Reported by: Daniel Kahn Gillmor 2013-09-08 Darshit Shah * testenv/README (File Structure): Add explanation about various variables used consistently across all tests. 2013-09-07 Tim Ruehsen * src/gnutls.c (ssl_connect_wget): use gnutls_check_version() to check if option "PFS" is available Reported by: Daniel Kahn Gillmor 2013-09-07 Darshit Shah * testenv/Test-Post.py: Test basic functionality for sending HTTP POST requests using the --method command * testenv/Makefile.am: Add new test 2013-09-07 Darshit Shah * testenv/HTTPServer.py: Remove bunch of old code artefacts * testenv/WgetTest.py: Same 2013-09-07 Darshit Shah * testenv/HTTPServer.py (StoppableHTTPServer.server_conf): Change global variable fileSys to an object variable. This is good programming practice and required for parallel-wget support. (StoppableHTTPServer.server_forever): Edit overridden method to remove the global queue variable. No longer required under the new working (WgetHTTPRequestHandler.do_QUIT): Don't push fileSys through the queue (_Handler): Rename class __Handler to _Handler to match Python's encapsulation rules (_Handler.do_POST): fileSys is now an object variable of the server (_Handler.do_PUT): Same (_Handler.send_put): Same (_Handler.send_head): Same (HTTPd): New class that wraps around the server for Threading (create_server): Make new object of HTTPd. (spawn_server): Start the thread created through create_server (ret_fileSys): Removed method. No longer required. * testenv/WgetTest.py (HTTPTest.__init__): Don't explicitly set self.act_retcode. Instead toggle tests_passed boolean to set the correct return code. (HTTPTest.HTTP_setup): We no longer call HTTPServer.spawn_server to start a new instance of the server. (HTTPTest.init_HTTP_server): We no longer call the old create_server(), spawn_server() methods. Instead use the new HTTPd class interface to create new instances of the server (HTTPTest.stop_HTTP_server): Don't ask server to return fileSys. 2013-09-06 Darshit Shah * testenv/WgetTest.py (CommonMethods.__check_downloaded_files): Print a unified diff in case there is a mismatch in the file contents 2013-09-06 Darshit Shah * testenv/README: New section on pending work. Will keep updating this to keep track of work that remains to be done on this implementation 2013-09-06 Darshit Shah * testenv/HTTPServer.py (WgetHTTPRequestHandler.test_cookies): Comment out the old test_cookies code. This is no longer used and was causing problems with expected cookies. The code will soon be removed anyways * testenv/Test-cookie.py: Add new test for basic cookie functionality * testenv/Test-cookie-401.py: Ensure cookies are saved during a 401 response * testenv/Test-cookie-expires.py: Ensure that the Expires field is correctly handled * testenv/Test-cookies-domain-mismatch.py: Ensure that mismatched domains are handled by Wget * testenv/Makefile.am: Add the new tests 2013-09-05 Darshit Shah * testenv/Test-auth-with-content-disposition.py: Add test that ensures Content Disposition works alongwith authentication * testenv/Makefile.am: Add new test 2013-09-04 Tim Ruehsen * doc/sample.wgetrc: added "secureprotocol" example * doc/wget.texi (HTTPS (SSL/TLS) Options): Document PFS. 2013-09-04 Darshit Shah * testenv/Test-c-full.py: Test Continue options * testenv/Makefile.am: Add Test-c-full.py and Test-O 2013-09-03 Tim Ruehsen * src/main.c: Add new value 'PFS' to --secure-protocol to enforce the so-called Perfect Forward Security. * src/init.c (cmd_spec_secure_protocol): added secure_protocol_pfs * src/openssl.c, gnutls.c, options.h: likewise 2013-09-02 Darshit Shah * testenv/Makefile.am: Add new Test * testenv/Test-Head.py: New Test to ensure HEAD requests are handled correctly 2013-08-31 Darshit Shah * testenv/WgetTest.py: Remove import module defaultdict. (CommonMethods.get_server_rules): server_rules should be a dict, not a defaultdict (list). * testenv/HTTPServer.py (WgetHTTPRequestHandler.get_rule_list): If rule does not exist, return None. Not an emppty list. (WgetHTTPRequestHandler.test_cookies): Rule variable is not a list. (__Handler.send_cust_headers): Same. (__Handler.custom_response): Same. (__Handler.is_authorized): Same. (__Handler.expect_headers): Same. (__Handler.reject_headers): Same. 2013-08-31 Darshit Shah * testenv/README: Explain that TEST_NAME needs to be unique * testenv/Test-auth-no-challenge.py: Edit non-unique TEST_NAME 2013-08-31 Darshit Shah * testenv/README: (newfile) Simple help / instructions about using the Test Environment. * testenv/Makefile.am: (newfile) Makefile for the Test Environment. Uses the Automake Parallel Test Harness * testenv/WgetTest.py: (newfile) Base module that executes the Test. * testenv/HTTPServer.py: (newfile) Contains the custom HTTP Server for the Test Environment. Creates an instance of http.server in Python3. * testenv/FTPServer.py: (newfile) Overrides methods from pyftpdlib for use in the Test Environment. ** Work under progress **. * testenv/ColourTerm.py: (newfile) A custom module to output coloured text to the terminal. Known to work on POSIX shells. * testenv/Test-Proto.py: (newfile) A prototype Test File. This should be copied when writing a new Test Case. * testenv/Test-Content-disposition-2.py: Test Content Disposition clobbering * testenv/Test-Content-disposition.py: Test Content Disposition Headers * testenv/Test-O.py: Test Output filename command * testenv/Test-auth-basic-fail.py: Test returncode on auth failure * testenv/Test-auth-basic.py: Test Basic Auth negotiation * testenv/Test-auth-both.py: Test handling of Multiple auth providers. This test currently fails. * testenv/Test-auth-digest.py: Test Digest Auth Negotiation * testenv/Test-auth-no-challenge-url.py: Ensure --auth-no-challenge is handled when auth details are in-URL. * testenv/Test-auth-no-challenge.py: Ensure --auth-no-challenge is honoured * testenv/Test-auth-retcode.py: Ensure correct return code after 403 Forbidden response. 2013-08-31 Darshit Shah * testenv/HTTPTest.py (ServerError): Define new Exception for handling internal control flow. (StoppableHTTPServer.SendHeader): Simply pass. Do nothing. Adding functionality here seems to crash for no apparent reason. (stoppableHTTPServer.send_cust_headers): Minor optimization. No need for extra variable. (__Handler.Response): Handle explicit Response Code Rules (__Handler.Authentication): Handle Authentication rules (__Handler.handle_auth): Actual worker method for authentication (__Handler.ExpectHeader): Ensure Expected Headers are received (__Handler.RejectHeader): Ensure Blacklisted Headers are NOT received (__Handler.send_HEAD): Dynamically call server rule functions based on the self.rules list. This feature will later be added to POST/PUT, etc 2013-08-31 Darshit Shah * configure.ac: Add testenv/Makefile to AC_CONFIG_FILES. * Makefile.am: Add testenv to SUBDIRS 2013-08-23 Tim Ruehsen * doc/sample.wgetrc: added "httpsonly" example 2013-08-22 Tim Ruehsen * tests/Makefile.am (EXTRA_DIST): Add Test--httpsonly-r.px. * tests/run-px (tests): Likewise. * tests/Test--httpsonly-r.px: New file. 2013-08-22 Tim Ruehsen * src/main.c: Add new option --https-only. * src/options.h: Likewise. * src/recur.c (download_child_p): add check for HTTPS. 2013-08-22 Tim Ruehsen * doc/wget.texi: added description for --https-only 2013-08-13 Hrvoje Niksic * doc/wget.texi (Download Options): Fix misspelling. 2013-08-09 Tim Ruehsen * src/gnutls.c (ssl_init): Prevent CA files from being loaded twice if possible. * src/gnutls.c (ssl_check_certificate): Added some error messages * src/gnutls.c: Fixed some compiler warnings 2013-08-08 Will Dietz (tiny change): * src/main.c (format_and_print_line): Wrap correctly long tokens. 2013-07-23 Tim Ruehsen * configure.ac: Remove AM_CONDITIONAL HAVE_NETTLE. Reported by: Darshit Shah . 2013-07-16 Darshit Shah * src/wget.h (err_t): Added new errors, ATTRMISSING and UNKNOWNATTR to handle missing attributes and Unknown attribute values respectively in HTTP Headers. * src/exits.c (get_status_for_err): ATTRMISSING is a Protocol Error while UNKNOWNATTR is a general error, presumably because of a feature that is not yet implemented. * src/http.c (gethttp): Call create_authorization_line () separately. In case the auth_err flag has been set with an error, handle it and exit. * src/http.c (create_authorization_line): Pass a pointer, auth_err to set the flag for different kinds of errors encountered. * src/http.c (http_loop): Handle the errors raised by the authentication handlers. * src/http.c (digest_authentication_encode): Pass pointer auth_err to set the error flags. Set qop to NULL in case the value of the qop / algorithm attribute is unknown to Wget. Set an appropriate error too. 2013-07-13 Tim Ruehsen * src/http.c (digest_authentication_encode): Fix a crash when the algorithm is not specified in the server response. Free dynamic memory used by the function when the function exits. * src/http-ntlm.c [HAVE_NETTLE]: Include and . (setup_des_key) [HAVE_NETTLE]: New function to deal with libnettle. (calc_resp) [HAVE_NETTLE]: Add support for libnettle. (mkhash) [HAVE_NETTLE]: Likewise. Reported by: Tim Ruehsen . 2013-07-13 Tim Ruehsen * configure.ac: check for libnettle when GNU TLS is used. 2013-07-13 Steven M. Schweda * src/warc.c (warc_tempfile): Fix a portability issue on VMS. 2013-07-12 Giuseppe Scrivano * src/http.c (digest_authentication_encode): Set default value of `algorithm' to "MD5". Check if `qop' is not-NULL before access it. 2013-07-11 Tomas Hozza * src/ftp.c (ftp_loop): Use ftp_retrieve_glob() also in case --preserve-permissions was specified. 2013-07-11 Tomas Hozza * doc/wget.texi: Document --regex-type and --preserve-permissions. 2013-07-11 Tim Ruehsen * src/gnutls.c (ssl_connect_wget): respect connect timeout. 2013-07-11 Karsten Hopp * src/openssl.c (struct openssl_read_args, struct scwt_context): New struct. (openssl_read, ssl_connect_with_timeout_callback): New function. (ssl_connect_wget): respect connect timeout. 2013-07-10 Giuseppe Scrivano * src/http.c (read_response_body) [ALLOW_CLOBBER]: Move definition to.. * src/options.h (struct options): Make `backups' an int. * src/url.h [ALLOW_CLOBBER]: .. Here. Do not clobber when backups are used. * src/url.c (url_file_name): Use the ALLOW_CLOBBER macro instead of repeating the code. 2013-07-09 Giuseppe Scrivano * doc/wget.texi (Download Options): Add documentation for --backups. (Wgetrc Commands): Add documentation for backups. Reported by: Tomas Hozza . 2013-07-08 Steven M. Schweda * src/retr.c (rotate_backups): Support for VMS files. 2013-06-26 Darshit Shah * src/http.c (gethttp): Reverse change by commit 90896 that prevented downloading response body data to non-GET Request methods. 2013-06-22 Ángel González * src/init.c (cmd_string_uppercase): Rewrite function. 2013-06-19 Ciprian Vieru (tiny change) * src/html-url.c: Define TAG_TD, TAG_TH, TAG_VIDEO, TAG_AUDIO, TAG_SOURCE. (tag_url_attributes, known_tags): Likewise. 2013-06-19 Tim Ruehsen * src/connect.c (socket_ip_address): zero out ip address structure to avoid access to uninitialized values by inet_ntop(). * src/ftp.c (ftp_loop_internal): fix segfault caused by warc_tmp NULL pointer. 2013-06-17 Dave Reisner (tiny change) * doc/texi2pod.pl: Fix formatting error that causes build to fail with Perl 5.18 2013-06-17 Darshit Shah * doc/wget.texi (POST): Explain the new redirection rules. * doc/wget.texi (Other HTTP Methods): Same. * doc/wget.texi (body-data): Fix typo in description. 2013-06-13 Darshit Shah * src/http.c (gethttp): Follow RFC 2616 and httpbis specifications when handling redirections. Do not suspend the method on 301/302 redirects. (gethttp): If method if not GET, we do not intend to download anything. * src/main.c (main): Set spider mode when opt.method is HEAD. This will prevent Wget from downloading any file. * src/retr.c (SUSPEND_METHOD): Rename macro SUSPEND_POST_DATA to SUSPEND_METHOD to more accurately reflect its use. Similarly rename related variables. 2013-05-21 Ray Satiro * src/url.c (url_file_name): Use MAX_PATH in Windows. 2013-05-17 Bykov Aleksey * bootstrap: Add `mkostemp' 2013-05-14 Bykov Aleksey * src/warc.c (warc_tempfile): For fix "Could not open temporary WARC manifest file." issue in Win system force to use `mkostemp(filename, O_TEMPORARY)' instead of `mkstemp(filename)'. Thank to Angel Gonzalez for help. 2013-05-14 Tim Ruehsen * src/cookies.c (cookie_jar_load): Replaced read_whole_file() by getline(). * src/init.c (run_wgetrc): Likewise. * src/netrc.c (parse_netrc): Likewise. * src/utils.c: Likewise. * src/ftp.c (getftp): Likewise. * src/ftp-ls.c (ftp_parse_unix_ls, ftp_parse_winnt_ls, ftp_parse_vms_ls): Likewise. * src/ftp-ls.c (clean_line): Accept the string length as parameter. * src/ftp-ls.c: Replaced indent tabs by spaces. * src/ftp.c: Likewise. * src/utils.c: Removed read_whole_file() definition. * src/netrc.c: Removed read_whole_file() definition for STANDALONE. * src/utils.h: Removed read_whole_file() declaration. 2013-05-10 Darshit Shah (tiny change) * doc/wget.texi (No of tries): Fix typo to make it clear that --tries option sets number of tries not retries. Reported by: Hauke Hoffman 2013-05-09 Tim Ruehsen * src/utils.c (acceptable): use standard string functions instead of self-written code. (match_tail): Likewise. (suffix): Likewise. (has_wildcards_p): Likewise. (test_subdir_p): Fix some warnings. (test_dir_matches_p): Likewise. 2013-05-05 mancha (tiny change) * src/gnutls.c (ssl_connect_wget): Don't abort on non-fatal alerts received during handshake. For example, when connecting to servers using TSL-SNI that send warning-level unrecognized_name alerts. 2013-05-04 Darshit Shah * src/init.c (cmd_string_uppercase): Fix issue that cased invalid headers when converting to uppercase. 2013-05-01 Giuseppe Scrivano * src/init.c: Declare `cmd_string_uppercase'. (commands): Now `method' uses cmd_string_uppercase. (cmd_string_uppercase): New method * src/http.c (gethttp): Do not transform opt.method to uppercase. Reported by: Stefano Lattarini 2013-05-01 Gijs van Tulder * src/retr.c (retrieve_url): New variable `saved_method'. (SUSPEND_POST_DATA): Save the method to `saved_method'. (RESTORE_POST_DATA): Restore the method from `saved_method'. 2013-04-28 Giuseppe Scrivano * doc/wget.texi (Types of Files...): Document --accept-regex, --reject-regex. Reported by: Tomas Hozza . 2013-04-26 Tomas Hozza (tiny change) * src/log.c (redirect_output): Use DEFAULT_LOGFILE in diagnostic message when `logfile' is NULL. * src/utils.c (unique_create): Ensure `logfile' has always a value. 2013-04-24 Darshit Shah * src/http.c (gethttp): Remove check for opt.post_data and opt.post_file_name. * src/main.c (main): Change location in code where --post-data and --post-file options are converted to --body-data --body-file. 2013-04-21 Gijs van Tulder * src/http.c: Copy opt.body_data to the WARC file, instead of opt.post_data (the old option). 2013-04-14 Giuseppe Scrivano * doc/wget.texi (Download Options): Document "mega" dot style. 2013-04-12 Gijs van Tulder * src/warc.c: Generate unique UUIDs for the manifest and the record holding the command-line arguments. Write the manifest to a "metadata" record to follow the WARC implementation guidelines. * src/warc.h: Declare new function warc_write_metadata_record. 2013-04-05 Darshit Shah * doc/doc/wget.texi: Fix ambiguous wording in --post-data section. Make it clear that wget does not check for the format of the post-data. * doc/doc/wget.texi: Add documentation for --method, --body-data and --body-file. 2013-03-31 Gijs van Tulder * src/warc.c: Correctly write the field length in the skip length field of .warc.gz files. (Following the GZIP spec in RFC 1952.) 2013-03-20 Tomas Hozza * src/http.c (gethttp): Set "sock" to -1 if it's not and we have no persistent connection 2013-03-15 Darshit Shah * src/http.c (post_file): Rename function to body_file_send to more accurately reflect its use. * src/http.c (gethttp): Add support for --method, --body-data and --body-file * src/init.c (commands): Same. * src/options.h (options): Same. * src/main.c (option_data): Same. * src/main.c (print_help): Add --method command. * src/main.c (main): Make old --post-{data,file} commands aliases to --method. Add sanity checks for --method, --body-data and --body-file. * src/retr.c (SUSPEND_POST_DATA): Edit Macro Definition to use body_data. * src/retr.c (RESTORE_POST_DATA): Same. 2013-03-12 Darshit Shah * tests/Makefile.am (EXTRA_DIST): Add Test--post-file.px. * tests/run-px (tests): Likewise. * tests/Test--post-file.px: New file. 2013-03-12 Darshit Shah * src/http.c (gethttp): Make wget return FILEBADFILE error and abort if post-file does not exist. * src/http.c (http_loop): Handle FILEBADFILE as a valid err. * src/exits.c (get_status_for_err): Mark FILEBADFILE as an IO error. 2013-02-15 Darshit Shah * src/cookies.c (cookie_handle_set_cookie): Set cookie->discard_requested to true on domain mismatch. 2013-01-30 Pavel Mateja (tiny change) * src/http.c (gethttp): Specify "Host" for CONNECT method. 2012-12-20 Tim Ruehsen * src/gnutls.c (ssl_connect_wget): added +VERS-SSL3.0 to fix --secure-protocol=SSLv2/SSLv3. 2012-12-09 Giuseppe Scrivano * src/main.c (main): Keep looking for "config" until there are arguments to parse. Reported by: Adrien Dumont * src/test.c: Include "wget.h". 2012-12-08 Michael Stapelberg (tiny change) * src/retr.c (retrieve_url): Set iri->orig_url to NULL after it is freed. 2012-11-26 Giuseppe Scrivano * src/wget.h (MAX_INT_TO_STRING_LEN): Define macro. * src/warc.c (warc_write_block_from_file): Use `MAX_INT_TO_STRING_LEN' to find the buffer size. (warc_write_cdx_record): Likewise. 2012-11-24 Giuseppe Scrivano * src/warc.c (warc_write_block_from_file): Use `number_to_string' to convert the content-length to a string. 2012-11-24 Gijs van Tulder * src/warc.c (warc_write_cdx_record): Use `number_to_string' to convert the offset to a string. 2012-11-15 Giuseppe Scrivano * src/retr.c (write_data): Fix comment. 2012-11-14 Ángel González * src/warc.c (warc_sha1_stream_with_payload): Fix compilation under gcc -std=c89. 2012-11-13 Giuseppe Scrivano * src/retr.c (fd_read_body): Correctly check the return code from write_data. Reported by: Torsten Scheck 2012-11-09 Tim Ruehsen * tests/HTTPServer.pm: added check for must-not-match request-header * tests/Test-cookies.px: check cookie deletion and cookie domain matching 2012-10-08 Stefano Lattarini (tiny change) docs: fix errors and warnings with Texinfo 5 Or rather, with the development version 4.13.90, which will eventually become Texinfo 5.0. * doc/wget.texi: Use '@item' instead of '@itemx' in several places, as Texinfo 5 refuses to process an '@itemx' that is not preceded by an '@item'. 2012-10-07 Tim Ruehsen Giuseppe Scrivano * src/utils.c (get_max_length): If `pathconf' is not available fallback to PATH_MAX. 2012-10-07 Ray Satiro * src/url.c: Change the functions of a growable string object to null terminate the string. * src/url.c (append_null): New function to null terminate a growable string object. * src/url.c (shorten_length): New function to shorten the length of a growable string object. 2012-10-07 Giuseppe Scrivano * configure.ac: Check for patchconf. 2012-10-06 Giuseppe Scrivano * src/http.c (http_loop): Send a HEAD request when -c and --content-disposition are used together. 2012-10-03 Merinov Nikolay * src/ftp.c (ftp_loop_internal): Ignore --no-clobber option when receiving directory listing. 2012-09-29 Merinov Nikolay * src/ftp-ls.c (ftp_parse_winnt_ls): Support filename extracting with new listing format. 2012-09-29 Tim Ruehsen * src/url.h (CHOMP_BUFFER): Add definition. * src/url.c (url_file_name): New local variables `fname_len_check' and `max_length'. Check that the length of the file name is acceptable. * src/utils.h (get_max_length): Declare function. * src/utils.c (get_max_length): New function. 2012-09-28 Steven Schubiger * src/src/recur.c (retrieve_tree): Combine duplicated code. 2012-09-23 Merinov Nikolay * m4/wget.m4 (WGET_FNMATCH): Add AC_LANG_SOURCE into AC_COMPILE_IFELSE in order to silence autoconf 2.68 warning. 2012-09-20 Giuseppe Scrivano * bootstrap: Update from gnulib. 2012-09-03 Tim Ruehsen * src/http.c (digest_authentication_encode): Add support for RFC 2617 MD5-sess authentication algorithm. Feature request and testing by: Avinash 2012-09-02 Nguyễn Thái Ngọc Duy (tiny change) * src/src/main.c (main): mark more strings for translation. * src/src/mswindows.c (fork_to_background): Likewise. * src/src/recur.c (download_child_p): Likewise. 2012-09-02 Nguyễn Thái Ngọc Duy (tiny change) * po/POTFILES.in: Add more files. 2012-08-29 Rohit Mathulla (tiny change) * src/html-url.c (get_urls_file): Convert shorthand URLs. 2012-08-28 Tim Ruehsen * doc/wget.texi: remove -nv from --report-speed 2012-08-28 Tim Ruehsen * src/gnutls.c (ssl_check_certificate): deinit gnutls_x509_crt_t. * src/gnutls.c (ssl_init): don't error if CA directory is empty. 2012-08-25 Hrvoje Niksic * src/warc.c (warc_find_duplicate_cdx_record): Use hash_table_get instead of hash_table_get_pair. 2012-08-21 (tiny change) * src/connect.c (connect_to_ip) [ENABLE_IPV6]: Attempt to use IPv6. * src/http.c (gethttp): Likewise. 2012-08-04 mancha (tiny change) * doc/wget.texi: Export ENVIRONMENT to the man page. 2012-07-08 Steven Schubiger * src/exits.h: Fix comment. * src/exits.c: Likewise. 2012-07-08 Giuseppe Scrivano * bootstrap: Update from gnulib. * bootstrap.conf (gnulib_extra_files): Remove $build_aux/missing. * lib/Makefile.am: Delete file. 2012-07-07 Tim Ruehsen (digest_authentication_encode): Add support for RFC 2617 Digest Access Authentication. 2012-07-07 Giuseppe Scrivano * src/http.c (http_loop): Fix log message. * src/main.c (main): Likewise. Reported by: Petr Pisar 2012-07-07 Giuseppe Scrivano * src/html-url.c (cleanup_html_url): Remove "static" modifier. * src/init.c (cleanup_html_url): Likewise. Reported by: Mike Frysinger . 2012-07-03 Steven Schubiger * src/init.c: Include warc.h for warc_close in cleanup function. 2012-06-17 Giuseppe Scrivano * src/wget.h: Define `CLOSEFAILED'. * src/init.c: Include "exits.h". (cleanup): Check `fclose' failure. * src/exits.c (get_status_for_err): Handle `CLOSEFAILED'. 2012-06-16 Giuseppe Scrivano * tests/Makefile.am (EXTRA_DIST): Add Test-stdouterr.px. * tests/run-px (tests): Likewise. * tests/Test-stdouterr.px: New file. 2012-06-16 Giuseppe Scrivano * src/main.c (main): Move some cleanup related function to... * src/init.c (cleanup): ...here. * src/main.c: Do not include "stdout.h". (main): Do not register `close_stdout' at exit. Reported by: Micah Cowan . 2012-06-16 Giuseppe Scrivano * bootstrap.conf (gnulib_modules): Remove `closeout'. Reported by: Micah Cowan . 2012-06-09 Giuseppe Scrivano * src/main.c (print_help): Move --report-speed under the section "Logging and input file". 2012-06-09 Giuseppe Scrivano * doc/wget.texi (Logging and Input File Options): Document "--report-speed". (HTTPS (SSL/TLS) Options): Document WARC. * doc/texi2pod.pl: Revert change from 2011-08-06. 2012-06-06 Giuseppe Scrivano * src/main.c (print_help): Rename --bits to --report-bps. (cmdline_options): Likewise. * src/init.c (commands): Rename --report-bps to --report-speed. (cmd_spec_report_speed): New function. * src/options.h (struct options): Rename `bits_fmt' to `report_bps'. * src/main.c (print_help): Rename --bits to --report-bps. (cmdline_options): Likewise. * src/init.c (commands): Likewise * src/progress.c (create_image): Adjust caller. * src/retr.c (retr_rate): Likewise. * src/utils.c (convert_to_bits): Likewise. 2012-06-04 Tim Ruehsen * src/main.c (main): Check for filename != NULL. * src/warc.c (warc_process_cdx_line): Fix memory leak. * src/utils.c (match_posix_regex, compile_posix_regex): Remove dead assignment. * src/openssl.c (ssl_init): Fix old-style function definition. 2012-06-02 Giuseppe Scrivano * src/connect.c: Include and . 2012-05-31 Ángel González * convert.c: fix segfault on wrong urls (bug 36570) 2012-05-30 Gijs van Tulder * src/warc.c: Fix segfault if CDX record is not found. 2012-05-26 Mike Frysinger * src/warc.c: Change type of `warc_current_gzfile' to gzFile. 2012-05-26 Giuseppe Scrivano * src/warc.c (warc_load_cdx_dedup_file): Change type of `line_length' to ssize_t. Suggested by: Ángel González 2012-05-19 illusionoflife (tiny change) * src/convert.c (register_html,register_css): Fixed functions signature to not accept unused argument * src/retr.c (retrieve_url): Changed register_{css,html} usage according new signature. 2012-05-18 Tim Ruehsen * src/gnutls.c (wgnutls_poll): Honor the specified `timeout' value. (wgnutls_peek): Likewise. 2012-05-16 Giuseppe Scrivano * src/warc.h: Cut length lines to 80 columns. * src/warc.c: Likewise. 2012-05-14 Tim Ruehsen * src/gnutls.c (wgnutls_read_timeout): removed warnings, moved fcntl stuff outside loop. * src/hash.h (hash_table_put): Make argument "value" const. * src/hash.c (hash_table_put): Make argument value const. Cast `value' to void. * src/http.c (request_set_header): Make argument `name' const. Cast `value' and `name' to void*. (request_remove_header): Make argument `name' const. * src/url.c (url_file_name): Make `index_filename' static. * src/warc.h (warc_write_cdx_record): Make `url', `timestamp', `mime_type', `payload_digest', `redirect_location', `warc_filename', response_uuid' arguments const. Make `checksum' const. * src/warc.c (warc_write_date_header): Make the `timestamp' argument const. Make `extension' const. (warc_write_cdx_record): Make `url', `timestamp', `mime_type', `payload_digest', `redirect_location', `warc_filename', response_uuid' arguments const. Make `checksum' const. 2012-05-13 Tim Ruehsen * src/gnutls.c (credentials): Change type to gnutls_certificate_credentials_t. (ssl_init): Do not use deprecated types. (ssl_connect_wget): Likewise. 2012-05-13 Giuseppe Scrivano * doc/wget.texi (Types of Files): Document --accept-regex and --reject-regex. 2012-05-13 Giuseppe Scrivano * bootstrap.conf (gnulib_modules): Add `git-version-gen'. * build-aux/bzr-version-gen: Remove file. * configure.ac: Invoke `build-aux/git-version-gen' to get the dist version. * Makefile.am (EXTRA_DIST): Distribute build-aux/git-version-gen instead of build-aux/bzr-version-gen. 2012-04-22 Tim Ruehsen * src/main.c (main): Dynamically allocate `opt.progress_type'. 2012-04-21 Tim Ruehsen * src/ftp-basic.c (ftp_pasv): Fix memory leak. * src/http.c (gethttp): Fix memory leak. * src/ftp.c (getftp): Silent compiler warning. 2012-04-13 Tim Ruehsen (tiny change) * src/warc.c (warc_load_cdx_dedup_file): Fix a memory leak by freeing `lineptr'. 2012-04-11 Gijs van Tulder * src/init.c: Add --accept-regex, --reject-regex and --regex-type. * src/main.c: Likewise. * src/options.c: Likewise. * src/recur.c: Likewise. * src/utils.c: Add regex-related functions. * src/utils.h: Add regex-related functions. 2012-04-11 Gijs van Tulder * bootstrap.conf (gnulib_modules): Include module `regex'. * configure.ac: Check for PCRE library. 2012-04-07 Daniel Kahn Gillmor (tiny change) * src/gnutls.c (key_type_to_gnutls_type): New function. (ssl_init): Use correctly the specified gnutls certificate. 2012-04-01 Giuseppe Scrivano * src/gnutls.c (wgnutls_read_timeout): Ensure timer is freed. * src/gnutls.c (wgnutls_read_timeout): Do not use timer if it is not allocated. Reported by: Xu Zhongxing 2012-04-01 Gijs van Tulder * src/html-url.c: Prevent crash on incomplete STYLE tag. 2012-03-30 Tim Ruehsen (tiny change) * src/warc.c: make warc_uuid_str() implementation depend on HAVE_LIBUUID. 2012-03-30 Tim Ruehsen * src/url.c: Use empty query in local filenames. 2012-03-30 Tim Ruehsen * src/convert.c (convert_links_in_hashtable): Mmake it static. * src/cookies.c (parse_set_cookie): Remove empty else branches. * src/css-url.c: Include "css-url.h". (get_uri_string): Make it static. * src/css-url.h (get_urls_css): Add protoype. * src/gnutls.c (ssl_init): Add prototype. * src/html-parse.c (tagstack_push): Make it static. * src/html-parse.c (tagstack_pop): Make it static. * src/html-parse.c (tagstack_find): Make it static. * src/html-url.c (cleanup_html_url): Make it static. * src/progress.c (count_cols): Make it static. * src/progress.c (get_eta): Make it static. * src/retr.h (convert_to_bits): Remove prototype. * src/util.h (convert_to_bits): Add prototype. * src/spider.c (spider_cleanup): Make it static. * src/warc.c (warc_write_start_record): Add prototype. * src/warc.c (warc_write_end_record): Add prototype. * src/warc.c (warc_start_cdx_file): Add prototype. * src/warc.c (warc_init): Add prototype. * src/warc.c (warc_load_cdx_dedup_file): Add prototype. * src/warc.c (warc_write_metadata): Add prototype. * src/warc.c (warc_close): Add prototype. * src/warc.c (warc_tempfile): Add prototype. * src/warc.c (warc_write_warcinfo_record): Make it static. * src/warc.c (warc_load_cdx_dedup_file): Make it static. * src/warc.c (warc_write_metadata): Make it static. * src/warc.h (warc_init): Fix prototype. * src/warc.h (warc_close): Fix prototype. * src/warc.h (warc_tempfile): Fix prototype. 2012-03-29 Tim Ruehsen (tiny change) * src/utils.c (library): Include . 2012-03-25 Ray Satiro * src/build_info.c.in: Check that HAVE_LIBSSL32 is defined when OpenSSL is used. 2012-03-25 Ray Satiro * configure.ac: Fix build under mingw when OpenSSL is used. 2012-03-25 Giuseppe Scrivano * src/utils.c: Include . * src/ptimer.c: Include . * src/connect.c: Include , , . Reported by: Ray Satiro . 2012-03-20 Ángel González * bootstrap.conf (gnulib_modules): Add modules `ftello', `mkstemp' and `strtok_r'. 2012-03-07 Steven Schubiger * src/init.c (wgetrc_user_file_name): Correct typo. 2012-03-06 Sasikantha Babu * src/utils.c (convert_to_bits): Added new function convert_to_bits to convert bytes to bits. * src/retr.c (calc_rate): Modified the function to handle --bits option and download rate calculated as bits per sec (SI-prefix) for --bits otherwise bytes (IEC-prefix). (retr_rate): Rates will display in bits per sec for --bits. * src/options.h (struct opt): Added --bit option bool variable bits_fmt. * src/main.c (print_help) : Added help for --bit. * src/init.c: Defined command for --bit option. * src/retr.h: Added function prototype. 2012-02-26 Giuseppe Scrivano * src/main.c: Include "closeout.h" (main): Register close_stdout at exit. 2012-02-26 Giuseppe Scrivano * bootstrap.conf (gnulib_modules): Add module `closeout'. 2012-02-23 Giuseppe Scrivano * src/main.c (main): Write diagnostic messages to `stderr' not to `stdout'. * src/main.c (main): Fail gracefully if `malloc' fails. * src/gnutls.c (wgnutls_read): Remove unused variables `timer' and `flags'. 2012-02-17 Steven Schubiger * src/warc.c: Add license header. 2012-02-01 Gijs van Tulder * src/warc.c: Fix large file support with ftello, fseeko. * src/warc.h: Fix large file support. * src/http.c: Fix large file support. 2012-01-27 Gijs van Tulder * src/retr.c (fd_read_body): If the response is chunked, the chunk headers are now written to the WARC file, making the WARC file an exact copy of the HTTP response. 2012-01-27 Gijs van Tulder * src/retr.c (fd_read_body): Fix a memory leak with chunked responses. * src/http.c (skip_short_body): Fix the same memory leak. 2012-01-09 Sasikantha Babu (tiny change) * src/connect.c (connect_to_ip): properly formatted ipv6 address display. (socket_family): New function - returns socket family type. * src/http.c (gethttp): properly formatted ipv6 address display. 2012-01-09 Gijs van Tulder * src/init.c: Disable WARC compression if zlib is disabled. * src/main.c: Do not show the 'no-warc-compression' option if zlib is disabled. * src/warc.c: Do not compress WARC files if zlib is disabled. 2012-01-09 Gijs van Tulder * configure.ac: Always try to use libz, even without SSL. 2011-12-12 Giuseppe Scrivano * Makefile.am (EXTRA_DIST): Add build-aux/bzr-version-gen. Reported by: Elan Ruusamäe . 2011-12-11 Giuseppe Scrivano * util/trunc.c (main): Call `close' on the fd and check for errors. Reported by: . 2011-11-09 Gijs van Tulder * src/warc.c: Call gzdopen() with wb9 instead of wb+9, which fails on zlib version >= 1.2.4. 2011-11-04 Steven Schweda * src/warc.c [! WINDOWS]: Include . (warc_write_warcinfo_record): Assign a new allocated buffer and free it on errors. 2011-11-01 Steven Schweda * src/gnutls.c (ssl_init): Ensure GNU TLS is loaded only once. 2011-10-23 Giuseppe Scrivano * bootstrap.conf (gnulib_modules): Include module `vsnprintf'. 2011-10-16 Steven Schubiger * util/paramcheck.pl: Match 1 or more times where applicable. (extract_entries): Return a copy instead of reference. 2011-10-07 Steven Schweda * src/connect.c: Add HAVE_SYS_SELECT_H and HAVE_SYS_SOCKET_H conditions on includes of and , respectively. * src/ftp.c (getftp): Move BIN_TYPE_TRANSFER macro into VMS-specific section. On VMS, use Stream_LF attributes for listing files. Pass BIN_TYPE_FILE to fopen_excl() instead of constant-everywhere "true". * src/ftp.c (ftp_retrieve_list): Restore lost test of opt.preserve_perm (--preserve-permissions) on the chmod() operation. * src/init.c, main.c: Remove "deprecated" from opt.preserve_perm (--preserve-permissions). * src/init.c (initialize): Use distinct messages for errors in C macro SYSTEM_WGETRC and environment-variable SYSTEM_WGETRC. Avoid use of C macro SYSTEM_WGETRC when it's not defined. * src/log.c (log_close): Avoid closing logfp when it's stderr. * src/main.c (print_help): Restore --preserve-permissions. * src/main.c (main): Avoid using a negative value of longindex as a subscript (for long_options[]) when searching for "--config". * src/main.c (main): Exit the program using exit() instead of "return". (VMS handles these differently, and exit() is better.) * src/openssl.c (ssl_init): Add type cast (SSL_METHOD *) to newly "const" "meth" argument to accommodate OpenSSL version 0.9.8, where that argument is not "const" in the OpenSSL function (SSL_CTX_new). * src/test.c: Declare "program_argstring". * src/utils.c (fopen_excl): Comment typography. * src/warc.h: New file. * src/warc.c: New file. 2011-10-02 Henrik Holst (tiny change) * doc/wget.texi (HTTP Options): Document option --content-on-error. 2011-10-02 Henrik Holst (tiny change) * src/http.c (gethttp): If 'contentonerror' is used then do not skip the http body on 4xx and 5xx errors. * src/init.c (commands): Add 'contentonerror'. * src/main.c (print_help, option_data): Add new option 'contentonerror' to make wget not skip the http content on 4xx and 5xx errors. * src/options.h: New variable 'content_on_error'. 2011-09-27 Karl Berry (tiny change) * doc/wget.texi: Make dir entry consistent with others. 2011-09-19 Giuseppe Scrivano * src/main.c (print_version): Update copyright year. (print_version): Fix typo. 2011-09-13 Giuseppe Scrivano * src/ftp.c (ftp_retrieve_glob): Propagate correctly the `res' error code. 2011-09-07 Giuseppe Scrivano * src/http.c (gethttp): Don't inhibit arest request if opt.timestamping is set. Reported by 2011-09-06 Jakob Matthes (tiny change) * src/main.c (print_version): Do not exit prematurely when --help is passed. 2011-09-04 Christian Jullien (tiny change) * src/gnutls.c: Include . 2011-09-04 Alan Hourihane (tiny change) * configure.ac: Check for libz when gnutls is used. 2011-09-02 Mojca Miklavec (tiny change) * src/main.c (print_version): Do not exit prematurely when --version is passed. 2011-08-30 Giuseppe Scrivano * src/gnutls.c (wgnutls_read_timeout): Use the non blocking socket only for `gnutls_record_recv'. Set errno to ETIMEDOUT on a read timeout. 2011-08-29 Giuseppe Scrivano * src/gnutls.c (wgnutls_read_timeout): New function. (wgnutls_read): Use wgnutls_read_timeout. (wgnutls_peek): Likewise. 2011-08-27 Giuseppe Scrivano * src/main.c (print_help): Exit with an error status if print to stdout fails. (print_usage): Change method signature and return a status code. (print_version): Likewise. 2011-08-26 Giuseppe Scrivano * src/gnutls.c: Include "ptimer.h". (wgnutls_read): Honor read timeout. * src/openssl.c (ssl_init): Make `meth' const. 2011-08-26 Giuseppe Scrivano * configure.ac: Under mingw don't check for static OpenSSL libraries if the shared version was already found. Suggested by: Ray Satiro . 2011-08-25 Giuseppe Scrivano * src/utils.c [HAVE_UTIME && HAVE_UTIME_H]: Include . [HAVE_UTIME && HAVE_SYS_UTIME_H]: Include . (touch) [HAVE_UTIME: Prefers utime over futimens when it is available. It was reported that Cygwin has a not working futimens. 2011-08-25 Giuseppe Scrivano * configure.ac: Check for `utime'. 2011-08-19 Giuseppe Scrivano * src/init.c (home_dir) [MSDOS]: Move local variable `len' here. 2011-08-18 Giuseppe Scrivano * doc/texi2pod.pl: Don't assume the perl executable is under /usr/bin/. 2011-08-18 Giuseppe Scrivano * src/http.c (gethttp): Fix a memory leak on some errors. Free the head buffer. * src/Makefile.am: Use an additional file "css_.c" which in turn includes "wget.h" and immediately "css.c". * src/http.c (gethttp): Reset chunked_transfer_encoding on redirections. 2011-08-13 Giuseppe Scrivano * src/retr.c (fd_read_body): Ensure max is not already defined. * src/mswindows.h (snprintf): Remove definition. (vsnprintf): Likewise. 2011-08-12 Giuseppe Scrivano * src/css.l: Remove include "wget.h". Reported by: Perry Smith . 2011-08-11 Giuseppe Scrivano * src/utils.c (abort_run_with_timeout): Use sigprocmask instead of sigsetmask. * src/gnutls.c (ssl_connect_wget): Remove call to deprecated function `gnutls_certificate_type_set_priority'. * src/Makefile.am (version.c): Don't invoke hg to set version string. 2011-08-11 Giuseppe Scrivano * bootstrap.conf (gnulib_modules): Add `sigprocmask'. * configure.ac: Do not hardcode GNU TLS and OpenSSL libraries. * bootstrap.conf (gnulib_modules): Include module iconv. * configure.ac: Allow --with-libgnutls-prefix and --with-libssl-prefix Suggested by: Karl Berry * build-aux/bzr-version-gen (TAG): Consider only the last tag. 2011-08-10 Giuseppe Scrivano Fix a linker error on systems with an older gnutls version. * src/gnutls.c (ssl_connect_wget) [HAVE_GNUTLS_PRIORITY_SET_DIRECT]: Use gnutls_priority_set_direct. [! HAVE_GNUTLS_PRIORITY_SET_DIRECT]: Use gnutls_protocol_set_priority. Reported by: Karl Berry 2011-08-10 Giuseppe Scrivano * configure.ac: Print usage string for --with-ssl. Reported by: Karl Berry * configure.ac: Check for `gnutls_priority_set_direct' when gnutls is used. Reported by: Karl Berry 2011-08-09 Giuseppe Scrivano * build-aux/bzr-version-gen: Fix some portability issues. 2011-08-06 Jochen Roderburg * src/host.c (lookup_host): Ensure it doesn't print more IPs than available ones. 2011-08-06 Giuseppe Scrivano * src/init.c (defaults): Set default value for show_all_dns_entries. * src/host.c (lookup_host): If `showalldnsentries' is used then print all the IP corresponding to a DNS entry. * src/init.c (commands): Add `showalldnsentries'. Suggested by: Witold Baryluk * src/http.c (gethttp): Add the Cache-Control HTTP header when --no-cache is specified. Reported by: Коренберг Марк . 2011-08-06 Giuseppe Scrivano * doc/wget.texi (Wgetrc Commands): Document show_all_dns_entries. * doc/Makefile.am (wget.pod): Pass the VERSION value to texi2pod. * doc/texi2pod.pl: Update from GCC. 2011-08-05 Giuseppe Scrivano * src/utils.c (acceptable): Accept always the file if it is the specified output destination. Reported by: Shai Berger 2011-08-05 Giuseppe Scrivano * src/gnutls.c (ssl_connect_wget): Use `gnutls_priority_set_direct' instead of the deprecated `gnutls_protocol_set_priority'. 2011-08-02 Giuseppe Scrivano * src/cookies.c (parse_set_cookie): If the value is quoted, do not modify it. Reported by: Nirgal Vourgère 2011-07-29 Giuseppe Scrivano * src/log.c (logprintf): Exit immediately on a SIGPIPE error. Reported by: Noèl Köthe . 2011-07-28 Noèl Köthe (tiny change) * doc/wget.texi (HTTP Options): Fix typo. 2011-07-26 Giuseppe Scrivano * src/retr.c (retrieve_url): Do not register redirects when in spider mode. 2011-07-26 Giuseppe Scrivano * doc/wget.info (cookies): Remove reference to --cookies. Reported by: Noèl Köthe. 2011-07-26 Carlos Martín Nieto (tiny change) * src/init.c (home_dir): Allocate path buffer dinamically. 2011-07-20 Merinov Nikolay * src/iri.c (remote_to_utf8): Add test for non-ASCII symbols with UTF-8 URI encoding. * src/res.c (res_retrieve_file): Fix url_parse call. 2011-07-05 Giuseppe Scrivano * doc/wget.texi (Recursive Retrieval Options): Make clearer that recursion, by default, uses 5 levels. Reported by: Marc Deop . 2011-06-08 Giuseppe Scrivano * src/retr.c (retrieve_from_file): Parse the url careless if IRI is enabled. Reported by: Volker Kuhlmann . 2011-06-03 Merinov Nikolay * tests/Test-idn-cmd-utf8.px: Added test for idn with utf-8 local encoding. * tests/Test-idn-robots-utf8.px: Added test for idn with utf-8 local encoding and robots.txt file. * tests/Makefile.am, run-px: Add new tests. 2011-05-26 Steven Schweda * src/connect.c [HAVE_SYS_SOCKET_H]: Include . [HAVE_SYS_SELECT_H]: Include . 2011-05-25 Giuseppe Scrivano * bootstrap.conf (gnulib_modules): Add `strerror_r-posix'. 2011-05-24 Giuseppe Scrivano * src/retr.c (fd_read_body): Define max. 2011-05-23 Giuseppe Scrivano * src/retr.c (fd_read_body): Be sure the buffer size is at least 8Kb. BUFSIZ may assume very small values with a negative impact on the performances. * src/mswindows.h [NEED_GAI_STRERROR]: remove definition for gai_strerror. 2011-05-19 Giuseppe Scrivano * COPYING: Fix the copyright years. Reported by: Brett Smith . 2011-05-15 Ray Satiro * src/connect.c (select_fd) [WINDOWS]: Ensure the socket is blocking. * src/mswindows.h: Add declaration for set_windows_fd_as_blocking_socket. * src/mswindows.c (set_windows_fd_as_blocking_socket): New function. * src/openssl.c (openssl_read): Prevent loops on read errors. 2011-04-24 Giuseppe Scrivano * src/html-url.c: Include "exists.h". * src/gnutls.c (wgnutls_peek): Remove local variable `ret'. 2011-04-21 Daniel Manrique (tiny change) * src/main.c (main): Set exit status when invalid host name given in command line. * src/html-url.c (get_urls_file): Set exit status when invalid host name given in input file. 2011-04-19 Ray Satiro * configure.ac: Adjust indentation. 2011-04-19 Giuseppe Scrivano * tests/Makefile.am (LIBS): Add $(LIB_CLOCK_GETTIME). 2011-04-19 Giuseppe Scrivano * src/gnutls.c: Do not include . * src/gnutls.c (wgnutls_peek): Ensure there is data available before attempt a read on the blocking socket. * src/Makefile.am (LIBS): Add $(LIB_CLOCK_GETTIME) * src/utils.c: Include . Do not include . (touch): Use `futimens' instead of `utimes'. * src/mswindows.h: Do not include . Remove macro definition for mkdir. 2011-04-19 Giuseppe Scrivano * bootstrap.conf (gnulib_modules): Add `mkdir'. 2011-04-18 Giuseppe Scrivano * src/utils.c: Include . Do not include . (touch): Use `utimes' instead of `utime'. * src/openssl.c (openssl_read): Fix build error. 2011-04-17 Giuseppe Scrivano * src/wget.h [HAVE_LIBSSL32]: Define HAVE_SSL. * src/gnutls.c (wgnutls_peek): Use correctly FIONBIO codes under Mingw32. Reported by: Ray Satiro 2011-04-17 Giuseppe Scrivano * configure.ac: Do not check the host_os twice and if windres is available. Reported by: Ray Satiro 2011-04-16 Ray Satiro 2011-04-16 Giuseppe Scrivano * configure.ac: Detect dynamically linked OpenSSL libraries under mingw32. 2011-04-14 Giuseppe Scrivano * bootstrap: Update from gnulib. * bootstrap.conf (gnulib_modules): Add `pipe' and `sigpipe'. * .cvsignore: Remove file. * .hgignore: Likewise. * .symlinks: Likewise. * bootstrap.conf (gnulib_modules): Add `mbtowc and `unlocked-io'. 2011-04-13 Giuseppe Scrivano * src/openssl.c (openssl_poll): Check if the specified timeout is zero after SSL_pending. (openssl_peek): Make the call non-blocking. 2011-04-11 Cristian Rodríguez (tiny change) * src/openssl.c (ssl_init) [! OPENSSL_NO_SSL2]: Use SSLv2 only when available. 2011-04-07 Ray Satiro (tiny change) (wgnutls_read): Check for the GNUTLS_E_AGAIN return code in the recv tight loop. (wgnutls_write): Likewise. (wgnutls_peek): Likewise. 2011-04-07 Giuseppe Scrivano * src/gnutls.c (wgnutls_peek): New local variable `read'. Use `read' instead of `ret' to store the number of read bytes. Reported by: Ray Satiro 2011-04-07 Giuseppe Scrivano * src/gnutls.c (ssl_init): Allow X509 v1 certificates. Suggested by: Ray Satiro (wgnutls_peek): Remove tight loop around gnutls_record_recv. 2011-04-04 Giuseppe Scrivano * tests/Makefile.am (LIBS): Remove @LIBSSL@ @W32LIBS@ 2011-04-04 Giuseppe Scrivano * src/openssl.c [WINDOWS]: Include . (openssl_read): Retry the read on SSL_ERROR_WANT_READ. * src/host.c [WINDOWS]: Include and . Suggested by: Ray Satiro . * src/Makefile.am (LIBS): Remove @LIBSSL@ @W32LIBS@ 2011-04-04 Giuseppe Scrivano * configure.ac: Use AC_CHECK_LIB to look for the openssl library. 2011-04-03 Giuseppe Scrivano * src/gnutls.c: Include . (wgnutls_peek): Make the socket non blocking before attempt a read. * src/gnutls.c: Include . (wgnutls_peek) [F_GETFL]: Use fcntl. (wgnutls_peek) [! F_GETFL]: Use ioctl. 2011-04-03 Giuseppe Scrivano * bootstrap.conf (gnulib_modules): Add `fcntl'. (gnulib_modules): Add `ioctl'. 2011-03-31 Giuseppe Scrivano * src/recur.c (download_child_p): When --no-parent is used, check that the ports are the same only in case the same protocol is used. Reported by: Karl Berry * src/res.c (res_parse): Add new line to debug messages. 2011-03-26 Giuseppe Scrivano * configure.ac: Fix the gnutls detection. 2011-03-21 Giuseppe Scrivano * src/iri.h (parse_charset) [!ENABLE_IRI]: Use the parameter to avoid a compiler warning. * src/iri.h: Declare `dummy_iri' as extern not static. * src/main.c [!ENABLE_IRI]: Define `dummy_iri'. (main) [!ENABLE_IRI]: Initialize `dummy_iri'. 2011-03-21 Giuseppe Scrivano * doc/wget.texi: Do not cite the current maintainer. Reported by: Micah Cowan . 2011-03-21 Giuseppe Scrivano * bootstrap: Update from gnulib. 2011-03-19 Giuseppe Scrivano * bootstrap.conf (buildreq): Update build prerequisites list. 2011-03-11 Giuseppe Scrivano * src/ftp.c (getftp): Fix some memory leaks. * src/ftp-ls.c (ftp_parse_winnt_ls): Likewise. Reported by: Zhenbo Xu . 2011-03-02 Tomasz Buchert (tiny change) * src/http.c (ensure_extension): Do not adjust the extension if the file ends in .htm. 2011-02-22 Steven Schubiger * src/http.c (gethttp, http_loop): Move duplicated code which is run when an existing file is not to be clobbered to a function. (get_file_flags): New static function. 2011-02-22 Gilles Carry * src/main.c (prompt_for_password): Use stderr instead of stdout to prompt password. This allows to use --output-document=- and --ask-password simultaneously. Without this, redirecting stdout makes password prompt invisible and mucks up payload such as in this example: wget --output-document=- --ask-password -user=foo \ http://foo.com/tarball.tgz | tar zxf - 2010-12-28 Adrien Nader (tiny change) * src/main.c [WINDOWS]: Include and . (main) [WINDOWS]: Set the stdout file mode to binary. Submitted also by: Martin Panter 2010-12-22 Giuseppe Scrivano * doc/wget.texi (HTTP Options): Remove sentence which doesn't reflect the wget behaviour when -k -K are used with -E. Reported by: pike-wget@kw.nl. 2010-12-10 Evgeniy Philippov (tiny change) * src/main.c (main): Initialize `total_downloaded_bytes'. 2010-12-09 Giuseppe Scrivano * src/main.c (main): If --no-clobber and --convert-links are used at the same time, use only the latter. Reported by: Romain Vimont 2010-12-07 Jessica McKellar (tiny change) * vms/WGET.HLP: Make help message clearer. 2010-12-07 Jessica McKellar (tiny change) * src/main.c: Make help message clearer. 2010-12-02 Giuseppe Scrivano * src/build_info.c.in: Do not list md5. 2010-12-01 Giuseppe Scrivano * src/cmpt.c: Include inconditionally. * src/connect.c: Likewise. * src/convert.c: Likewise. * src/ftp-basic.c: Likewise. * src/ftp-ls.c: Likewise. * src/ftp.c: Likewise. * src/gnutls.c: Likewise. * src/http.c: Likewise. * src/init.c: Likewise. * src/log.c: Likewise. * src/main.c: Likewise. * src/openssl.c: Likewise. * src/progress.c: Likewise. * src/ptimer.c: Likewise. * src/recur.c: Likewise. * src/retr.c: Likewise. * src/url.c: Likewise. * src/utils.c: Likewise. * src/retr.c (fd_read_body): Dinamically allocate `dlbuf'. 2010-11-27 Reza Snowdon * src/init.c (initialize): If 'ok' is not zero; exit, as this indicates there was a problem parsing 'SYSTEM_WGETRC'. Fixes bug #20370. 2010-11-22 Giuseppe Scrivano Revert last commit. * src/ftp.c (getftp): When count > 0, don't append to the existing file. * src/http.c (gethttp): Likewise. 2010-11-21 Giuseppe Scrivano * src/ftp.c (getftp): If `count' is bigger than zero, try to resume the download, not restart it. * src/http.c (gethttp): Likewise. Suggested by: Evgeny Kapun . 2010-11-20 Filipe Brandenburger (tiny change) * src/http.c (gethttp): Repeat a POST request on a 307 response. * src/retr.c (retrieve_url): Use NEWLOCATION_KEEP_POST. * src/wget.h: Define NEWLOCATION_KEEP_POST. 2010-11-19 Giuseppe Scrivano * src/ftp.c (getftp): Accept new argument `count'. When `count' is bigger than zero, assume the file can be overwritten. (ftp_loop_internal): Pass new argument `count' to `getftp'. * src/http.c (gethttp): Accept new argument `count'. When `count' is bigger than zero, assume the file can be overwritten. (http_loop): Pass new argument `count' to `gethttp'. Reported by: Evgeny Kapun . 2010-10-26 Giuseppe Scrivano * doc/wget.texi (Download Options): Remove unclear statement about the --waitretry option. Reported by: Manfred Koizar . 2010-10-24 Jessica McKellar (tiny change) * src/main.c (main): Print the total download time as part of the summary for downloads using the recursive or page requisites options. Fixes bug #21359. 2010-10-24 Jessica McKellar (tiny change) * NEWS: Mention the change to the the summary for recursive downloads. 2010-10-24 Giuseppe Scrivano * src/gnutls.c (wgnutls_peek): Do not return an error when `gnutls_record_recv' fails but some data is already available. 2010-10-23 Giuseppe Scrivano * tests/Makefile.am (LIBS): Remove @LIBGNUTLS@ and use @W32LIBS@ as last component. 2010-10-23 Giuseppe Scrivano * src/Makefile.am (LIBS): Remove @LIBGNUTLS@ and use @W32LIBS@ as last component. * src/gnutls.c (wgnutls_peek): Block until data is available. 2010-10-23 Giuseppe Scrivano * configure.ac: Add check for libgpg-error and libgcrypt. 2010-10-21 Giuseppe Scrivano * src/retr.c (retrieve_url): Do not register HTML files twice. Reported by: Manfred Koizar . 2010-10-18 Manfred Koizar (tiny change) * src/html-url.c (tag_handle_link): Do not assume external links type to be always "text/html". 2010-10-16 Giuseppe Scrivano * src/connect.c (socket_ip_address): Initialize `sockaddr' to zero. * src/ftp-basic.c (ftp_epsv): Remove dead assignment to `s'. * src/ftp-ls.c (ftp_parse_vms_ls): Remove dead assignment to `len'. * src/http.c (skip_short_body): Remove dead assignment to `ret'. * src/init.c (wgetrc_user_file_name): Remove dead assignment to `home'. * src/main.c (main): Silent warning reported by clang. (http_loop): Silent warning reported by clang. * src/retr.c (retrieve_url): Likewise. * src/url.c (init_seps): Remove dead assignment to `p'. 2010-10-11 Steven Schubiger * src/ftp.c (getftp): Initialize `targ'. 2010-10-11 Giuseppe Scrivano * src/ftp.c (getftp): Fix indentation. 2010-10-08 Ivanov Anton (tiny change) * src/html-url.c (check_style_attr): Skip quotes when they are present. 2010-10-06 Giuseppe Scrivano * src/main.c (main): Remove unused variable `status'. 2010-09-25 Merinov Nikolay * src/init.c: Add "unlink" command into command list. * src/main.c: Add "unlink" option into option_data list. * src/options.h: Adding unlink field into struct options. * src/wget.h: Addind UNLINKERR error into uerr_t enum. * src/exits.c (get_status_for_err): define status for UNLINKERR. * src/ftp.c (getftp): unlink file if exists, instead clobbering. (ftp_loop_internal): processing UNLINKERR. * src/http.c (gethttp): unlink file if exists, instead clobbering. (http_loop): processing UNLINKERR. 2010-09-25 Merinov Nikolay * doc/wget.texi (Download Options): Document --unlink option. 2010-09-24 Dennis, CHENG Renquan Fix problem when content-disposition is used with recursive downloading. * src/url.h (url_file_name): Add a new argument `replaced_filename'. * src/url.c (url_file_name): Likewise. * src/http.c (parse_content_disposition): Do not add a prefix to the return value. (test_parse_content_disposition): Adjust tests. (gethttp): Pass additional parameter to `url_file_name'. (http_loop): Likewise. * src/ftp.c (ftp_loop_internal, ftp_get_listing, ftp_retrieve_list) (ftp_loop): Likewise. 2010-09-14 Giuseppe Scrivano * src/convert.c (local_quote_string): Accept new parameter `no_html_quote'. Inhibit HTML quoting when `no_html_quote'. (convert_links): Disable HTML quoting in CSS files. Reported by: Manuel Reinhardt . 2010-09-13 Giuseppe Scrivano * doc/wget.texi (Recursive Accept/Reject Options): Remove superfluous dot. Reported by: Snader_LB. 2010-09-12 Mike Frysinger Fix some tests failures. * tests/Test-iri-forced-remote.px: Use --trust-server-names to the cmdline variable. * tests/Test-iri-list.px: Likewise. * tests/Test-iri.px: Likewise. 2010-09-12 Giuseppe Scrivano * src/main.c (print_help): Align --trust-server-names help text with the rest of the text. Suggested by: Mike Frysinger . 2010-09-06 Giuseppe Scrivano * lib/Makefile.am: Fix typo. 2010-08-20 Giuseppe Scrivano * src/convert.c: Include "iri.h". (convert_links_in_hashtable): Parse the url, if necessary, before check if it is present in the dowloaded url map. Reported by: . 2010-08-09 Giuseppe Scrivano Fix a problem with HTTP/1.0 proxies. * src/http.c (gethttp): Specify the connection header when keep alive connections are not used. Use persistent connections with proxies supporting them. * src/http.c (gethttp): Specify Proxy-Connection to the HTTP proxy. 2010-08-08 Reza Snowdon * src/main.c (main): inserted 'defaults'. Added additional 'getopt_long' while loop to search and apply a user specified config file before any other options. New variables 'retconf', 'use_userconfig', 'confval', 'userrc_ret', 'config_opt'. * src/init.c: Include stdbool.h. (commands): Added config details. (defaults): Removed static. (wgetrc): Removed static. (initialize): Removed 'defaults ()', changed 'int ok' to 'bool ok'. * src/options.h: New variable 'choose_config'. * src/init.h (defaults): exported function. (run_wgetrc): exported function. 2010-08-08 Reza Snowdon * doc/wget.texi: Added information about the config option to the 'Overview' section and a description of the option in 'Logging and Input File Options'. 2010-08-08 Giuseppe Scrivano * Makefile.am (EXTRA_DIST): Remove configure.bat. 2010-08-06 Giuseppe Scrivano * src/css-url.c (get_uri_string): Skip empty urls. 2010-07-30 Giuseppe Scrivano * src/html-url.h (struct map_context): Remove member `tail'. * src/html-url.c (append_url): Append the new url ordered by `position'. (get_urls_html): Do not initialize `ctx.tail'. * src/css-url.c (get_urls_css_file): Do not initialize `ctx.tail'. 2010-07-29 Giuseppe Scrivano * src/gnutls.c (wgnutls_peek): Don't read more data if the buffered peek data is sufficient. 2010-07-28 Giuseppe Scrivano * src/http.h (http_loop): Add new argument `original_url' * src/http.c (http_loop): Add new argument `original_url'. Use `original_url' to get a filename if `trustservernames' is false. * src/init.c (commands): Add "trustservernames". * src/options.h (library): Add variable `trustservernames'. * src/main.c (option_data): Add trust-server-names. (print_help): Describe --trust-server-names. * src/retr.c (retrieve_url): Pass new argument to `http_loop'. 2010-07-28 Alon Bar-Lev (tiny change) * doc/texi2pod.pl: Use the warnings module only when it is available. 2010-07-25 John Trengrove (tiny change) * src/ftp.h: Added enum `parsetype'. Modified struct to hold parsetype. * src/ftp-ls.c (ftp_parse_unix_ls): Default to TT_DAY. Change to TT_HOUR_MIN if hours/minutes parsed. (ftp_parse_winnt_ls): Default to TT_HOUR_MIN. (ftp_parse_vms_ls): Default to TT_HOUR_MIN. (ftp_index): Print only if fileinfo struct value ttype set to TT_HOUR_MIN. 2010-07-24 Giuseppe Scrivano * configure.bat: Remove file. 2010-07-20 Leonid Petrov * src/ftp.c (getftp): Don't attempt to retrieve the file if it is already completely. (ftp_loop_internal): Force the length to be zero for symlinks. 2010-07-20 Alan Jenkins (tiny change) * src/http.c (gethttp): Check content-length was set before trying to compare it with the already downloded filesize. 2010-07-18 Giuseppe Scrivano * src/http.c (gethttp): If -N is used, don't exit immediately if the content length is less or equal than the existing file size. Specify "Range" only if -N is not used. Reported by: Caleb Cushing . 2010-07-14 Giuseppe Scrivano * src/ftp.c (max): Remove definition. (getftp): New variable `got_expected_bytes'. Let `expected_bytes' be the value SIZE returns, otherwise use the bytes count returned by RETR. Suggested by Daniel Stenberg . 2010-07-14 Giuseppe Scrivano * src/ftp.c (max): Add definition. (getftp): Consider for `expected_bytes' the greatest value between the bytes count returned by SIZE and the bytes count returned by RETR. Reported by: Jozua . 2010-07-11 Giuseppe Scrivano * src/http.c (http_loop): New variable `force_full_retrieve'. If the remote file is newer that the local one then inhibit -c. Reported by: Caleb Cushing . 2010-07-11 Giuseppe Scrivano * configure.ac (AC_CHECK_SIZEOF): Quote argument. Reported by: Jochen Roderburg . 2010-07-09 Giuseppe Scrivano * src/css-url.c (get_urls_css_file): Use `wget_read_file' instead of `read_file'. Use `wget_read_file_free' instead of `read_file_free'. * src/convert.c (convert_links): Likewise. * src/html-url.c (get_urls_html, get_urls_file): Likewise. * src/res.c (res_parse_from_file): Likewise. * src/utils.h: Rename `read_file' to `wget_read_file'. Rename `read_file_free' to `wget_read_file_free'. * src/utils.c (wget_read_file): Renamed from `read_file'. (wget_read_file_free): Renamed from `read_file_free'. 2010-07-09 Giuseppe Scrivano * bootstrap.conf (gnulib_modules): Add `vasprintf'. Remove `asprintf'. 2010-07-09 Giuseppe Scrivano * bootstrap.conf (buildreq): Relax gettext version to 0.17. 2010-07-05 Giuseppe Scrivano * src/gnutls.c (ssl_init): New local variables `ca_directory' and `dir'. Load default root certificates under the `ca_directory' directory. 2010-07-05 Giuseppe Scrivano * bootstrap.conf (gnulib_modules): Add `asprintf'. 2010-07-03 Giuseppe Scrivano * src/gnutls.c (ssl_connect_wget): New local variable `allowed_protocols'. Honor the --secure-protocol flag. 2010-06-22 Giuseppe Scrivano * configure.ac: By default use GNU TLS not OpenSSL. 2010-06-20 Giuseppe Scrivano * src/main.c (print_help): Fix help string for --random-wait. Its range is [0.5*wait, 1.5*wait) not [0, 2*wait). Reported by: Tom Mizutani . 2010-06-17 Giuseppe Scrivano * windows: Remove directory. * Makefile.am (SUBDIRS): Remove windows. * configure.ac: Don't generate windows/Makefile. 2010-06-15 Giuseppe Scrivano * src/gnutls.c [WIN32]: Include "w32sock.h". (FD_TO_SOCKET): Add definition. (ssl_connect_wget): Get the real socket handle by FD_TO_SOCKET. Define FD_TO_SOCKET if it is not yet defined. 2010-06-15 Giuseppe Scrivano * m4/wget.m4 (WGET_STRUCT_SOCKADDR_STORAGE): Guard header inclusions. (TYPE_STRUCT_SOCKADDR_IN6): Likewise. (MEMBER_SIN6_SCOPE_ID): Likewise. (PROTO_INET6): Likewise. * configure.ac: Don't check for `getaddrinfo'. * bootstrap.conf (gnulib_modules): Add `getaddrinfo' module. 2010-06-14 Giuseppe Scrivano * src/gnutls.c: Include . (struct wgnutls_transport_context): Remove `peekstart'. (ssl_connect_wget): Renamed from `ssl_connect'. (wgnutls_poll): New variable `ctx'. (wgnutls_read): Don't use `ctx->peekstart'. (wgnutls_peek): Likewise. Don't attempt to read if there is not ready data. 2010-06-14 Giuseppe Scrivano * src/http.c (http_loop): Always send a HEAD request when -N is used together with --content-disposition. Reported by: Jochen Roderburg . 2010-06-10 Muthu Subramanian K (tiny change) * src/http.c (http_loop): Increase `total_downloaded_bytes' by `hstat.rd_size', not `hstat.len'. 2010-06-10 Giuseppe Scrivano * src/main.c (print_version): Drop the current maintainer line. 2010-06-10 Giuseppe Scrivano * src/Makefile.am (build_info.c): Generate the `build_info.c' file under the build directory. Patch by: Daniel Stenberg . 2010-06-10 Giuseppe Scrivano * configure.ac (AM_INIT_AUTOMAKE): Remove dist-bzip2 dist-lzma from automake options. Reported by: Daniel Stenberg . 2010-06-10 Giuseppe Scrivano * bootstrap.conf (buildreq): Add definition. 2010-06-04 Giuseppe Scrivano * tests/Test--no-content-disposition-trivial.px: Use /usr/bin/env to find the perl interpreter. * tests/Test--no-content-disposition.px: Likewise. * tests/Test--spider-fail.px: Likewise. * tests/Test--spider-r--no-content-disposition-trivial.px: Likewise. * tests/Test--spider-r--no-content-disposition.px: Likewise. * tests/Test--spider-r-HTTP-Content-Disposition.px: Likewise. * tests/Test--spider-r.px: Likewise. * tests/Test--spider.px: Likewise. * tests/Test-E-k-K.px: Likewise. * tests/Test-E-k.px: Likewise. * tests/Test-HTTP-Content-Disposition-1.px: Likewise. * tests/Test-HTTP-Content-Disposition-2.px: Likewise. * tests/Test-HTTP-Content-Disposition.px: Likewise. * tests/Test-N--no-content-disposition-trivial.px: Likewise. * tests/Test-N--no-content-disposition.px: Likewise. * tests/Test-N-HTTP-Content-Disposition.px: Likewise. * tests/Test-N-current.px: Likewise. * tests/Test-N-no-info.px: Likewise. * tests/Test-N-old.px: Likewise. * tests/Test-N-smaller.px: Likewise. * tests/Test-N.px: Likewise. * tests/Test-O--no-content-disposition-trivial.px: Likewise. * tests/Test-O--no-content-disposition.px: Likewise. * tests/Test-O-HTTP-Content-Disposition.px: Likewise. * tests/Test-O-nc.px: Likewise. * tests/Test-O-nonexisting.px: Likewise. * tests/Test-O.px: Likewise. * tests/Test-Restrict-Lowercase.px: Likewise. * tests/Test-Restrict-Uppercase.px: Likewise. * tests/Test-auth-basic.px: Likewise. * tests/Test-auth-no-challenge-url.px: Likewise. * tests/Test-auth-no-challenge.px: Likewise. * tests/Test-auth-retcode.px: Likewise. * tests/Test-auth-with-content-disposition.px: Likewise. * tests/Test-c-full.px: Likewise. * tests/Test-c-partial.px: Likewise. * tests/Test-c-shorter.px: Likewise. * tests/Test-c.px: Likewise. * tests/Test-cookies-401.px: Likewise. * tests/Test-cookies.px: Likewise. * tests/Test-ftp-bad-list.px: Likewise. * tests/Test-ftp-iri-disabled.px: Likewise. * tests/Test-ftp-iri-fallback.px: Likewise. * tests/Test-ftp-iri-recursive.px: Likewise. * tests/Test-ftp-iri.px: Likewise. * tests/Test-ftp-pasv-fail.px: Likewise. * tests/Test-ftp-recursive.px: Likewise. * tests/Test-ftp.px: Likewise. * tests/Test-i-ftp.px: Likewise. * tests/Test-i-http.px: Likewise. * tests/Test-idn-cmd.px: Likewise. * tests/Test-idn-headers.px: Likewise. * tests/Test-idn-meta.px: Likewise. * tests/Test-idn-robots.px: Likewise. * tests/Test-iri-disabled.px: Likewise. * tests/Test-iri-forced-remote.px: Likewise. * tests/Test-iri-list.px: Likewise. * tests/Test-iri-percent.px: Likewise. * tests/Test-iri.px: Likewise. * tests/Test-k.px: Likewise. * tests/Test-meta-robots.px: Likewise. * tests/Test-nonexisting-quiet.px: Likewise. * tests/Test-noop.px: Likewise. * tests/Test-np.px: Likewise. * tests/Test-proxied-https-auth.px: Likewise. * tests/Test-proxy-auth-basic.px: Likewise. * tests/Test-restrict-ascii.px: Likewise. Reported by sci-fi@hush.ai. 2010-06-04 Giuseppe Scrivano * build-aux/build_info.pl: Use /usr/bin/env to find the perl interpreter. * util/paramcheck.pl: Likewise. * util/rmold.pl: Likewise. Reported by sci-fi@hush.ai. 2010-06-03 Giuseppe Scrivano * src/http.c (http_loop): Do not send a HEAD request before the GET request when --content-disposition is used. Reported by: alex_wh@mail.ru. 2010-06-03 Giuseppe Scrivano * configure.ac (W32LIBS): Remove -lwsock32. 2010-06-03 Giuseppe Scrivano * bootstrap.conf (gnulib_modules): Add iconv-h. 2010-05-31 Giuseppe Scrivano * src/sysdep.h (_SVID_SOURCE): Remove definition. (_BSD_SOURCE): Remove definition. 2010-05-31 Giuseppe Scrivano * src/recur.c (retrieve_tree): Remove variable `up_error_code'. * src/iri.c (open_locale_to_utf8): Remove function. 2010-05-31 Giuseppe Scrivano * src/html-parse.c (map_html_tags): Fix condition for closed tag. Handle the case '>' is not specified after '/'. 2010-05-31 Giuseppe Scrivano * src/css.l: Use option "nounput". (YY_NO_INPUT): Add macro definition. 2010-05-30 Giuseppe Scrivano * src/html-parse.c (NAME_CHAR_P): Consider '<' an invalid character. (advance_declaration): Close the tag if '<' is found. (map_html_tags): Likewise. 2010-05-29 Giuseppe Scrivano * tests/Makefile.am (EXTRA_DIST): Add Test-auth-retcode.px. * tests/run-px (tests): Likewise. * tests/Test-auth-retcode.px: New file. 2010-05-27 Giuseppe Scrivano * src/main.c (print_usage): Accept a new parameter `error'. (main): Print diagnostic messages to stderr, not stdout. 2010-05-27 Giuseppe Scrivano * src/main.c (main): Exit with failure when -k is specified and -O is not a regular file. 2010-05-27 Giuseppe Scrivano * src/http.c (gethttp): Consider new cookies retrieved when the authorization fails. 2010-05-27 Giuseppe Scrivano * src/Makefile.am (wget_SOURCES): Remove `snprintf.c'. * src/snprintf.c: Remove file. 2010-05-27 Giuseppe Scrivano * doc/wget.texi (Download Options): Document that -k can be used with -O only with regular files. 2010-05-27 Giuseppe Scrivano * bootstrap.conf (gnulib_modules): Add `snprintf' module. * configure.ac: Remove check for the `snprintf' function. 2010-05-25 Giuseppe Scrivano * src/host.c (sufmatch): Do not consider zero length entries. * src/http.c (H_10X): New macro. (gethttp): Silently ignore 1xx responses. 2010-05-24 Giuseppe Scrivano * src/iri.h (dummy_iri): Define static. * src/css-tokens.h (css_tokens): Remove variable. 2010-05-23 Giuseppe Scrivano * src/http.c (http_atotm): Count the NUL character when copying the locale string. Reported by: Florian Weimer 2010-05-16 Giuseppe Scrivano * tests/Makefile.am (../md5/libmd5.a): Remove rule. (LDADD): Remove MD5_LDADD. 2010-05-16 Giuseppe Scrivano * src/Makefile.am (LDADD): Remove MD5_LDADD. (MD5_LDADD): Remove definition. (AM_CPPFLAGS): Remove MD5_CPPFLAGS. (version.c): Remove dependency from MD5_LDADD. (../md5/libmd5.a): Remove rule. * src/ftp-opie.c: Include "md5.h". Do not include "gen-md5.h". (skey_response): New variable ctx. Use md5 functions from the gnulib module instead of functions defined previously in the "gen-md5.h" file. * src/gen-md5.h: Remove file. * src/gen-md5.c: Likewise. * src/http.c: Include "md5.h". Do not include "gen-md5.h". (digest_authentication_encode): New variable ctx. Use md5 functions from the gnulib module instead of functions defined previously in the "gen-md5.h" file. 2010-05-16 Giuseppe Scrivano * md5: Remove directory. * bootstrap.conf (gnulib_modules): Add crypto/md5. * configure.ac: Remove any check for md5 libraries. * Makefile.am (ACLOCAL_AMFLAGS): Remove -I md5/m4. (SUBDIRS): Remove md5. 2010-05-15 Giuseppe Scrivano * src/Makefile.am: Quote any path using $(top_srcdir) or $(top_builddir). (version.c): Remove dependency from configure.ac and Makefile.am. * src/css.l: Include "wget.h". 2010-05-15 Giuseppe Scrivano * bootstrap.conf (gnulib_modules): Add "getopt-gnu". Remove "getopt". 2010-05-14 Giuseppe Scrivano * bootstrap (gnulib_path): Default to "gnulib" if it doesn't have a value. Redirect "git clone" stderr to stdout. 2010-05-09 Giuseppe Scrivano * build-aux/bzr-version-gen: New file. * Makefile.am (EXTRA_DIST): Add .version. (BUILT_SOURCES): New defition. (.version): New rule. (dist-hook): Likewise. * configure.ac (AC_INIT): Use build-aux/bzr-version-gen to generate the version string. 2010-05-08 Giuseppe Scrivano * tests/Makefile.am: Update copyright years. 2010-05-08 Giuseppe Scrivano * src/Makefile.am: Update copyright years. * src/cmpt.c: Likewise. * src/connect.h: Likewise. * src/convert.c: Likewise. * src/convert.h: Likewise. * src/cookies.c: Likewise. * src/cookies.h: Likewise. * src/css-tokens.h: Likewise. * src/css-url.c: Likewise. * src/css-url.h: Likewise. * src/css.l: Likewise. * src/exits.c: Likewise. * src/exits.h: Likewise. * src/ftp-basic.c: Likewise. * src/ftp-ls.c: Likewise. * src/ftp-opie.c: Likewise. * src/ftp.c: Likewise. * src/ftp.h: Likewise. * src/gen-md5.c: Likewise. * src/gen-md5.h: Likewise. * src/gettext.h: Likewise. * src/gnutls.c: Likewise. * src/hash.c: Likewise. * src/hash.h: Likewise. * src/host.c: Likewise. * src/host.h: Likewise. * src/html-parse.c: Likewise. * src/html-parse.h: Likewise. * src/html-url.c: Likewise. * src/html-url.h: Likewise. * src/http-ntlm.c: Likewise. * src/http-ntlm.h: Likewise. * src/http.c: Likewise. * src/http.h: Likewise. * src/init.c: Likewise. * src/init.h: Likewise. * src/iri.c: Likewise. * src/iri.h: Likewise. * src/log.c: Likewise. * src/log.h: Likewise. * src/main.c: Likewise. * src/mswindows.c: Likewise. * src/mswindows.h: Likewise. * src/netrc.c: Likewise. * src/netrc.h: Likewise. * src/openssl.c: Likewise. * src/options.h: Likewise. * src/progress.c: Likewise. * src/progress.h: Likewise. * src/ptimer.c: Likewise. * src/ptimer.h: Likewise. * src/recur.c: Likewise. * src/recur.h: Likewise. * src/res.c: Likewise. * src/res.h: Likewise. * src/retr.c: Likewise. * src/retr.h: Likewise. * src/spider.c: Likewise. * src/spider.h: Likewise. * src/ssl.h: Likewise. * src/sysdep.h: Likewise. * src/test.c: Likewise. * src/test.h: Likewise. * src/url.c: Likewise. * src/url.h: Likewise. * src/utils.c: Likewise. * src/utils.h: Likewise. * src/wget.h: Likewise. 2010-05-08 Giuseppe Scrivano * doc/Makefile.am: Update copyright years. * doc/fdl.texi: Likewise. * doc/texi2pod.pl: Likewise. * doc/wget.texi: Likewise. 2010-05-08 Giuseppe Scrivano * Makefile.am: Update copyright years. * build-aux/build_info.pl: Likewise. * configure.ac: Likewise. * configure.bat: Likewise. * doc/Makefile.am: Likewise. * doc/fdl.texi: Likewise. * doc/texi2pod.pl: Likewise. * doc/wget.texi: Likewise. * m4/exitfail.m4: Likewise. * m4/getpagesize.m4: Likewise. * m4/wchar.m4: Likewise. * m4/wctype.m4: Likewise. * m4/wget.m4: Likewise. * md5/Makefile.am: Likewise. * md5/dummy.c: Likewise. * md5/m4/00gnulib.m4: Likewise. * md5/m4/gnulib-cache.m4: Likewise. * md5/m4/gnulib-common.m4: Likewise. * md5/m4/gnulib-comp.m4: Likewise. * md5/m4/gnulib-tool.m4: Likewise. * md5/m4/include_next.m4: Likewise. * md5/m4/longlong.m4: Likewise. * md5/m4/md5.m4: Likewise. * md5/m4/multiarch.m4: Likewise. * md5/m4/stddef_h.m4: Likewise. * md5/m4/stdint.m4: Likewise. * md5/m4/wchar.m4: Likewise. * md5/m4/wchar_t.m4: Likewise. * md5/m4/wint_t.m4: Likewise. * md5/md5.h: Likewise. * md5/stddef.in.h: Likewise. * md5/stdint.in.h: Likewise. * md5/wchar.in.h: Likewise. * msdos/config.h: Likewise. * msdos/msdos.c: Likewise. * po/POTFILES.in: Likewise. * util/Makefile.am: Likewise. * util/paramcheck.pl: Likewise. * util/rmold.pl: Likewise. 2010-05-07 Giuseppe Scrivano * tests/Makefile.am (LIBS): Add definition. (LDADD): Add LIBS. 2010-05-07 Giuseppe Scrivano * src/gnutls.c (wgnutls_close): Use always `close', not `closesocket'. * src/openssl.c (openssl_close): Use always `close', not `closesocket'. (ssl_connect_wget): Get the real socket handle by FD_TO_SOCKET. Define FD_TO_SOCKET if it is not yet defined. * src/Makefile.am (libunittest_a_CPPFLAGS): Add -I$(top_builddir)/lib. * src/mswindows.h: Always include and . Do not include . [! INHIBIT_WRAP]: Remove macro's socket, bind, connect, accept, recv, send, select, getsockname, getpeername, setsockopt, closesocket. Remove wrapped_socket, wrapped_bind, wrapped_connect, wrapped_listen, wrapped_accept, wrapped_recv, wrapped_send, wrapped_select, wrapped_getsockname, wrapped_getpeername, wrapped_setsockopt, wrapped_closesocket prototypes. * src/mswindows.c: Remove wrapped_socket, wrapped_bind, wrapped_connect, wrapped_listen, wrapped_accept, wrapped_recv, wrapped_send, wrapped_select, wrapped_getsockname, wrapped_getpeername, wrapped_setsockopt, wrapped_closesocket functions. Remove WRAP macro. * src/host.h [WINDOWS]: Include not . * src/connect.c: Include and on any platform, gnulib ensures they exist. Remove `read', `write', `close' macros. * src/Makefile.am: Copy build_info.c.in only if we are working in a VPATH. (AM_CPPFLAGS): Add directory with generated gnulib files. 2010-05-07 Giuseppe Scrivano * configure.ac: Don't call macro WGET_SOCKLEN_T. (W32LIBS): Add -lws2_32. * Makefile.am (EXTRA_DIST): Remove autogen.sh. * bootstrap.conf (gnulib_modules): Use new modules from gnulib: accept, bind, close, connect, getpeername, getsockname, listen, setsockopt. * m4/wget.m4 (WGET_SOCKLEN_T): Remove macro. * po/wget.pot: Remove. 2010-05-07 Giuseppe Scrivano * configure.ac (ALL_LINGUAS): Remove. 2010-05-06 Giuseppe Scrivano * src/Makefile.am (build_info.c): Generate build_info.c in the builddir, not srcdir. 2010-05-06 Giuseppe Scrivano * bootstrap: New file. * bootstrap.conf: New file. * lib/Makefile.am: include gnulib.mk. * autogen.sh: Removed. * GNUmakefile: Likewise. * INSTALL: Likewise. * build-aux/announce-gen: Likewise. * build-aux/compile: Likewise. * build-aux/config.guess: Likewise. * build-aux/config.rpath: Likewise. * build-aux/config.sub: Likewise. * build-aux/depcomp: Likewise. * build-aux/gnupload: Likewise. * build-aux/install-sh: Likewise. * build-aux/mdate-sh: Likewise. * build-aux/missing: Likewise. * build-aux/mkinstalldirs: Likewise. * build-aux/texinfo.tex: Likewise. * build-aux/update-copyright: Likewise. * build-aux/useless-if-before-free: Likewise. * build-aux/vc-list-files: Likewise. * build-aux/ylwrap: Likewise. * lib/DESCRIP_DEPS.MMS: Likewise. * lib/DESCRIP_MODS.MMS: Likewise. * lib/DESCRIP_SRC.MMS: Likewise. * lib/alloca.c: Likewise. * lib/alloca.in.h: Likewise. * lib/c-ctype.c: Likewise. * lib/c-ctype.h: Likewise. * lib/config.charset: Likewise. * lib/errno.in.h: Likewise. * lib/error.c: Likewise. * lib/error.h: Likewise. * lib/exitfail.c: Likewise. * lib/exitfail.h: Likewise. * lib/fseeko.c: Likewise. * lib/getdelim.c: Likewise. * lib/getline.c: Likewise. * lib/getopt.c: Likewise. * lib/getopt.in.h: Likewise. * lib/getopt1.c: Likewise. * lib/getopt_int.h: Likewise. * lib/getpagesize.c: Likewise. * lib/getpass.c: Likewise. * lib/getpass.h: Likewise. * lib/gettext.h: Likewise. * lib/intprops.h: Likewise. * lib/localcharset.c: Likewise. * lib/localcharset.h: Likewise. * lib/lseek.c: Likewise. * lib/mbrtowc.c: Likewise. * lib/mbsinit.c: Likewise. * lib/memchr.c: Likewise. * lib/memchr.valgrind: Likewise. * lib/quote.c: Likewise. * lib/quote.h: Likewise. * lib/quotearg.c: Likewise. * lib/quotearg.h: Likewise. * lib/realloc.c: Likewise. * lib/ref-add.sin: Likewise. * lib/ref-del.sin: Likewise. * lib/stdbool.in.h: Likewise. * lib/stddef.in.h: Likewise. * lib/stdint.in.h: Likewise. * lib/stdio-impl.h: Likewise. * lib/stdio-write.c: Likewise. * lib/stdio.in.h: Likewise. * lib/stdlib.in.h: Likewise. * lib/str-two-way.h: Likewise. * lib/strcasecmp.c: Likewise. * lib/strcasestr.c: Likewise. * lib/streq.h: Likewise. * lib/strerror.c: Likewise. * lib/string.in.h: Likewise. * lib/strings.in.h: Likewise. * lib/strncasecmp.c: Likewise. * lib/unistd.in.h: Likewise. * lib/verify.h: Likewise. * lib/wchar.in.h: Likewise. * lib/wctype.in.h: Likewise. * lib/xalloc-die.c: Likewise. * lib/xalloc.h: Likewise. * lib/xmalloc.c: Likewise. * m4/00gnulib.m4: Likewise. * m4/alloca.m4: Likewise. * m4/codeset.m4: Likewise. * m4/errno_h.m4: Likewise. * m4/error.m4: Likewise. * m4/extensions.m4: Likewise. * m4/fseeko.m4: Likewise. * m4/getdelim.m4: Likewise. * m4/getline.m4: Likewise. * m4/getopt.m4: Likewise. * m4/getpass.m4: Likewise. * m4/gettext.m4: Likewise. * m4/glibc21.m4: Likewise. * m4/gnulib-cache.m4: Likewise. * m4/gnulib-common.m4: Likewise. * m4/gnulib-comp.m4: Likewise. * m4/gnulib-tool.m4: Likewise. * m4/iconv.m4: Likewise. * m4/include_next.m4: Likewise. * m4/inline.m4: Likewise. * m4/lib-ld.m4: Likewise. * m4/lib-link.m4: Likewise. * m4/lib-prefix.m4: Likewise. * m4/localcharset.m4: Likewise. * m4/locale-fr.m4: Likewise. * m4/locale-ja.m4: Likewise. * m4/locale-zh.m4: Likewise. * m4/longlong.m4: Likewise. * m4/lseek.m4: Likewise. * m4/malloc.m4: Likewise. * m4/mbrtowc.m4: Likewise. * m4/mbsinit.m4: Likewise. * m4/mbstate_t.m4: Likewise. * m4/memchr.m4: Likewise. * m4/mmap-anon.m4: Likewise. * m4/multiarch.m4: Likewise. * m4/nls.m4: Likewise. * m4/po.m4: Likewise. * m4/progtest.m4: Likewise. * m4/quote.m4: Likewise. * m4/quotearg.m4: Likewise. * m4/realloc.m4: Likewise. * m4/stdbool.m4: Likewise. * m4/stddef_h.m4: Likewise. * m4/stdint.m4: Likewise. * m4/stdio_h.m4: Likewise. * m4/stdlib_h.m4: Likewise. * m4/strcase.m4: Likewise. * m4/strcasestr.m4: Likewise. * m4/strerror.m4: Likewise. * m4/string_h.m4: Likewise. * m4/strings_h.m4: Likewise. * m4/unistd_h.m4: Likewise. * m4/wchar_t.m4: Likewise. * m4/wint_t.m4: Likewise. * m4/xalloc.m4: Likewise. * maint.mk: Likewise. * po/Makefile.in.in: Likewise. * po/Makevars: Likewise. * po/Rules-quot: Likewise. * po/be.po: Likewise. * po/bg.po: Likewise. * po/boldquot.sed: Likewise. * po/ca.po: Likewise. * po/cs.po: Likewise. * po/da.po: Likewise. * po/de.po: Likewise. * po/el.po: Likewise. * po/en_GB.po: Likewise. * po/eo.po: Likewise. * po/es.po: Likewise. * po/et.po: Likewise. * po/eu.po: Likewise. * po/fi.po: Likewise. * po/fr.po: Likewise. * po/ga.po: Likewise. * po/gl.po: Likewise. * po/he.po: Likewise. * po/hr.po: Likewise. * po/hu.po: Likewise. * po/id.po: Likewise. * po/it.po: Likewise. * po/ja.po: Likewise. * po/lt.po: Likewise. * po/nb.po: Likewise. * po/nl.po: Likewise. * po/pl.po: Likewise. * po/pt.po: Likewise. * po/pt_BR.po: Likewise. * po/quot.sed: Likewise. * po/ro.po: Likewise. * po/ru.po: Likewise. * po/sk.po: Likewise. * po/sl.po: Likewise. * po/sr.po: Likewise. * po/sv.po: Likewise. * po/tr.po: Likewise. * po/uk.po: Likewise. * po/vi.po: Likewise. * po/zh_CN.po: Likewise. * po/zh_TW.po: Likewise. 2010-05-05 Giuseppe Scrivano * src/http.c (gethttp): Give a nicer message on a malformed status line. 2010-05-04 Giuseppe Scrivano * src/http.c (gethttp): Check `resp_status' return code and handle errors. 2010-05-04 Giuseppe Scrivano * AUTHORS: Added myself. 2010-05-03 Giuseppe Scrivano * configure.ac: Fix a sanity check by the AC_CONFIG_SRCDIR macro. 2010-05-01 Giuseppe Scrivano * src/http.c (request_send): Specify 1.1 as HTTP version for requests. (skip_short_body): Accept new parameter `chunked'. New variable `remaining_chunk_size'. Handle the chunked transfer encoding. (gethttp): New variable `chunked_transfer_encoding`. Set `keepalive' by default to true. Check if the server is using the chunked transfer encoding. * src/retr.h: Define `rb_chunked_transfer_encoding'. * src/retr.c (fd_read_body): New variable `chunked'. New variable `remaining_chunk_size'. Handle the chunked transfer encoding. 2010-05-01 Giuseppe Scrivano * NEWS: Mention support for HTTP/1.1. 2010-03-04 Steven Schubiger * src/ftp.c (ftp_loop_internal): Omit input file from being deleted after retrieval. * src/retr.c (input_file_url): New function. * src/retr.h: Add declaration for input_file_url. 2010-03-02 Steven Schubiger * src/retr.c (retrieve_from_file): Improve checking for a URL by invoking url_valid_scheme instead of url_has_scheme. * src/url.c (url_valid_scheme): New function. * src/url.h: Add declaration for url_valid_scheme. 2010-03-02 Steven Schubiger * src/retr.c (retrieve_from_file): Free memory of input file. 2010-03-01 Steven Schubiger * tests/Test-i-ftp.px: Test --input-file in conjunction with FTP. * tests/run-px, Makefile.am (EXTRA_DIST): Added Test-i-ftp.px. 2010-03-01 Steven Schubiger * src/retr.c (retrieve_url): Retrieve the local filename from ftp_loop. (retrieve_from_file): Return if there's no input file. * src/ftp.c (ftp_loop_internal): Duplicate the local filename into retrieve_url's scope when a valid reference is being passed. (ftp_loop): Call ftp_loop_internal here with passing a reference to the local filename, elsewhere with NULL. * src/ftp.h: Adjust declaration of ftp_loop. 2010-02-26 Steven Schubiger * tests/Test-i-http.px: Test --input-file in conjunction with HTTP. * tests/run-px, Makefile.am (EXTRA_DIST): Added Test-i-http.px. 2010-02-25 Steven Schubiger * tests/FTPServer.pm (FTPServer::new): Substitute port placeholders in content of files to be retrieved via FTP. 2010-01-27 Paul Townsend (tiny change) * src/retr.c (fd_read_body): Be sure to measure timer when time has elapsed, too, not just when we've made progress. 2010-01-13 Micah Cowan * src/openssl.c: Fixed some mixed declarations-and-code. 2010-01-09 Micah Cowan * src/init.c, main.c, http.c, ftp.c, options.h: Renamed --no-match-server-timestamps to --no-use-server-timestamps, and add it to print_help. 2010-01-09 Micah Cowan * doc/wget.texi (Download Options): Documented --no-use-server-timestamps. 2010-01-09 Jeremy Olexa (tiny change) * src/init.c: Deal with systems which provide PATH_MAX in sys/param.h rather than limits.h. 2010-01-09 David Holman (tiny change) * src/init.c, main.c, http.c, ftp.c, options.h: Added support for --no-match-server-timestamps. 2009-12-03 Peter Rosin * src/snprintf.c (dopr): Output %p as unsigned. 2009-10-24 Petr Pisar * src/openssl.c: Implement support for (multiple) subjectAltNames in X509 certificates, not just the commonName. 2009-10-14 Steven Schubiger * tests/Test-E-k-K.px, Test-cookies-401.px, Test-ftp-bad-list.px, Test-iri-list.px, Test-iri.px: Removed -d from invocation. Patch by Mike Frysinger. 2009-10-09 Steven Schweda * src/utils.c: Added characters " : \ | to the list of normal caret-escaped (ODS5) characters in the char_prop[] table. 2009-10-09 Steven Schweda * New VMS MMS/MMK builders, to accommodate the new source tree structure: lib/DESCRIP_DEPS.MMS Dependencies (lib) lib/DESCRIP_MODS.MMS Modules (lib) lib/DESCRIP_SRC.MMS Main (lib) md5/DESCRIP_DEPS.MMS Dependencies (md5) md5/DESCRIP_MODS.MMS Modules (md5) md5/DESCRIP_SRC.MMS Main (md5) src/DESCRIP_DEPS.MMS Dependencies (src) src/DESCRIP_MODS.MMS Modules (src) src/DESCRIP_SRC.MMS Main (src) vms/DESCRIP.MMS Main (global) vms/DESCRIP_MKDEPS.MMS Dependency generator vms/DESCRIP_SRC.MMS Main (main) vms/DESCRIP_SRC_CMN.MMS Main (common) vms/DESCRIP_SRC_FLAGS.MMS Compiler and linker flags vms/COLLECT_DEPS.COM Dependency processor vms/CONFIG_EXTRACT.COM Extract AC_INIT from configure.ac vms/WGET_MULTINET.OPT Link options for (old) MultiNet vms/WGET_SSL_HP.OPT Link options for HP SSL vms/WGET_SSL_O.OPT Link options for OpenSSL * Other VMS-specific files: vms/alloca.h Dummy alloca.h. vms/config.h_vms Manually crafted config.h vms/stdint.h Dummy stdint.h vms/vms.h Declarations, prototypes for vms.c vms/vms_ip.h Helper for netdb.h vms/VMS_NOTES.TXT Instructions, notes vms/WGET.HLP Basic VMS HELP 2009-10-09 Micah Cowan * src/main.c: Fix declaration of compiled_features. * src/build_info.c.in: Adapt to new input format. Added a check for large-file support. Replaced the "openssl" and "gnutls" advertisements with a single "ssl/foo" advertisement. 2009-10-09 Micah Cowan * build_aux/build_info.pl: Reworked the input format. Eliminated support, and need, for arbitrary #if blocks. Introduced "choices", and explicitly open the .c file rather than print to STDOUT, so we avoid creating the file if we find problems with the input. Options are advertised in alphabetical order. 2009-09-30 Micah Cowan * src/sysdep.h: Unconditionally include stdbool.h (gnulib has it for systems that don't supply it.) * src/vms.c: Added functions vms_basename(), used to extract the bare executable name from argv[0], and vms_getpass(), a VMS-specific replacement for GNU getpass(). vms.c: Added global string constants: compilation_string (NULL), link_string (NULL), and version_string (real). 2009-09-27 Micah Cowan * tests/Test-idn-cmd.px, Test-idn-headers.px, Test-idn-meta.px, Test-idn-robots.px, Test-proxy-auth-basic.px: Removed --debug from invocation (in case it wasn't built with --debug support). 2009-09-24 Steven Schweda * src/main.c (print_version): Remove VMS-conditional build info string; the new VMS builders use OS_TYPE. * src/ftp.c (getftp): Fix accidental use of target -> targ. 2009-09-24 Micah Cowan * vms/vms.c: Moved to src/src.c. 2009-09-24 Micah Cowan * tests/Test-ftp-iri-disabled.px: Fix name "Test-ftp-iri" -> "test-ftp-iri-disabled" * tests/Test-ftp-iri-fallback.px: Fix name "Test-ftp-iri" -> "test-ftp-iri-fallback" 2009-09-24 Micah Cowan * src/vms.c: Moved from vms/vms.c. * src/sysdep.h: Use proper prototypes for snprintf, vsnprintf (thanks to Steven Schweda). 2009-09-24 Micah Cowan * src/host.c (host_errstr): host_errstr should return const char *. 2009-09-22 Micah Cowan * src/Makefile.am (version.c): Explicitly tell Mercurial where the top-source repository should be, so it doesn't accidentally use the repository for a containing directory (for instance, if the user's home directory is managed by a Mercurial repo). Also ensures we find the repository, even if the build directory is outside the source directory. * src/wget.h: define WINDOWS if _WIN32 or __WIN32__ is defined, but not __CYGWIN__. * src/mswindows.c (xsleep): Check for availability of the sleep function, in addition to the usleep function. (get_winsock_error, windows_strerror): Removed (gnulib already supplies this functionality). * src/mswindows.h [NEED_GAI_STRERROR]: define gai_strerror to (gnulib's) strerror, rather than windows_strerror. Removed error macro definitions provided by gnulib. (windows_strerror): Removed (gnulib already supplies this functionality). * src/host.c: Don't declare h_errno when building on Windows. * src/Makefile.am (LIBS): Added @W32LIBS@. (EXTRA_wget_SOURCES): Removed (it's in LIBOBJ now). (version.c): Removed dependency on $(LDADD), since it may include dependencies that are not part of the Wget sources (-lwsock32, for instance). * src/openssl.c (ssl_check_certificate): Avoid reusing the same buffer space for successive quoted arguments. Thanks to Steven Schweda for pointing out the problem. 2009-09-22 Micah Cowan * configure.ac: Added "sleep" and "symlink" to AC_CHECK_FUNCS, removing the hard-coded definition of HAVE_SYMLINK. When running on MinGW, compile mswindows.c, and link against libwsock32. 2009-09-22 Arvind Jamuna Dixit * src/http.c (extract_param, parse_content_disposition): Support RFC 2231 while parsing Content-Disposition header 2009-09-21 Micah Cowan * vms/VMS-WGET.COM: "the the" -> "the". * po/POTFILES.in: Removed some files that are not using gettext. * po/*.po: Updated from translationproject.org. 2009-09-21 Micah Cowan * src/progress.c (update_speed_ring): "the the" -> "the". * src/css-url.c, iri.c, mswindows.c: Removed assert.h inclusion (not using it). 2009-09-20 Micah Cowan * INSTALL: Various minor adjustments to bring it up to date. 2009-09-11 Steven Schweda * src/utils.c, utils.h (fopen_excl): Make second argument an int, rather than a bool (so it can handle the appropriate VMS version of the flag). * src/main.c (print_version): Don't print LOCALEDIR unless NLS is enabled. * src/http.c (gethttp): Pass FOPEN_BIN_FLAG, instead of true. * src/ftp.c (ftp_retrieve_glob): Cast strcmp to assign to cmp, for a silly HP build environment. 2009-09-09 Micah Cowan * configure.ac: Add bz2 and lzma dists. 2009-09-08 Steven Schubiger * src/main.c, init.c: Mark the --preserve-permissions and --html-extension option as deprecated. 2009-09-08 Micah Cowan * po/*.po: Updated from translationproject.org. 2009-09-07 Micah Cowan * tests/run-px: Exit with a failure if there were any tests with "unknown" exit statuses. * tests/Test-auth-with-content-disposition.px: New. Test Content-Disposition support when HTTP authentication is required. * tests/run-px, Makefile.am (EXTRA_DIST): Added Test-auth-with-content-disposition.px. * tests/FTPServer.pm (FTPServer::run): Pass "server behavior" information to newly-constructed FTPPaths object. (FTPPaths::initialize): Accept "server behavior" hash. (FTPPaths::_format_for_list): If server behavior has "bad_list" set, then always report 0 for the size. * tests/Test-ftp-bad-list.px: Added. Attempts to reproduce bug 22403... but doesn't. * tests/run-px, Makefile.am (EXTRA_DIST): Added Test-ftp-bad-list.px. 2009-09-07 Micah Cowan * Makefile.am (distuninstallcheck_listfiles): Don't complain if /usr/share/info/dir and /etc/wgetrc are left behind after an uninstall. * po/Rules-quot (mostlyclean-quot): Add en_US.po for remvoal by mostlyclean. (en_US.po-update): Behave properly for VPATH builds. 2009-09-06 Micah Cowan * tests/WgetTest.pm.in (_setup): Don't expect error codes from _setup_server; none are returned. (quotechar, _show_diff): Added facilities for expounding on where output didn't match expectations. (_verify_download): Use _show_diff. * tests/FTPTest.pm (_setup_server): Pass value of server_behavior to FTPServer initialization. * tests/Test-ftp-pasv-fail.px: Added. * tests/run-px, Makefile.am (EXTRA_DIST): Added Test-ftp-pasv-fail.px. * tests/WgetTest.pm.in: Added "server_behavior" to the set of accepted initialization values. * tests/FTPServer.pm (__open_data_connection): Add "server_behavior" to the set of accepted initialization values. (run): Honor the 'fail_on_pasv' server behavior setting, to trigger the Wget getftp glitch. 2009-09-06 Micah Cowan * src/ftp.c (getftp, ftp_loop_internal): Separate "len" input/output parameter (with different meanings for input and output), into two separate parameters, one input (passed_expected_bytes) and one output (qtyread). Fixes bug #26870. 2009-09-06 Gisle Vanem * msdos/Makefile.WC: Added compilation of new file msdos.c. Simplified; no need to have explicit rules for files in ./lib and ./md5. Just use the suffix search path mechanism in wmake. 2009-09-06 Gisle Vanem * msdos/Makefile.DJ: Added compilation of new file msdos.c. 2009-09-05 Steven Schubiger * tests/run-px: Introduce two new diagnostics: Skip and Unknown. * tests/WgetFeature.pm (import): Parse the version output of Wget and assert the availability of a feature. * tests/WgetFeature.cfg: Messages to be printed in absence of a required feature. * tests/Test-ftp-iri-disabled.px, Test-ftp-iri-fallback.px, Test-ftp-iri-recursive.px, Test-ftp-iri.px, Test-idn-cmd.px, Test-idn-headers.px, Test-idn-meta.px, Test-idn-robots.px, Test-iri-forced-remote.px, Test-iri-list.px, Test-iri-percent.px, Test-iri.px: Use WgetFeature.pm to check for the presence of the IDN/IRI feature. * tests/Test-proxied-https-auth.px: Replace grepping for a feature with loading WgetFeature.pm at compile-time. * tests/Makefile.am: Add WgetFeature.pm and WgetFeature.cfg to EXTRA_DIST. 2009-09-05 Steven Schubiger * src/retr.h: Declare set_local_file() to avoid build warnings. 2009-09-05 Micah Cowan * tests/Test-ftp-recursive.px: Added. * tests/run-px, Makefile.am (EXTRA_DIST): Added Test-ftp-recursive.px. * tests/FTPTest.pm (_setup_server): Don't construct the "input" directory's contents, just pass the URLs structure to FTPServer->new. * tests/FTPServer.pm: Rewrote portions, so that the server now uses the information from the %urls hash directly, rather than reading from real files. Added an FTPPaths package to the file. 2009-09-05 Micah Cowan * configure.ac: If we can't find idna.h, check to see if it's because we need to add /usr/include/idn to the inclusion path (for OpenSolaris). 2009-09-05 Gisle Vanem * src/connect.c, init.c, main.c, openssl.c, options.h: Replace "ifdef MSDOS" with "ifdef USE_WATT32" since DOS-targets in fact use the Watt-32 tcp/ip stack. 2009-09-05 Gisle Vanem * msdos/config.h: Added 'HAVE_ALLOCA_H' for Watcom 1.5+. Added 'USE_WATT32' since all DOS-targets use the Watt-32 tcp/ip stack. Added meaningless dummy LOCALEDIR. * msdos/Makefile.WC: A much needed update. Added rules for many files in ./lib, css.c and version.c. * msdos/Makefile.DJ: A much needed update. Added rules for css.c and version.c. 2009-09-04 Steven Schubiger * configure.ac: Place gl_EARLY and md5_EARLY before the gettext macros in order to silence autoconf warnings. 2009-09-04 Micah Cowan * tests/WgetTest.pm.in (run): Error-checking improvements. 2009-09-04 Micah Cowan * src/Makefile.am: Move build_info.c to wget_SOURCES from nodist_wget_SOURCES, reduce dependencies, and invoke build_info.pl in its new home, $(top_srcdir)/build-aux. 2009-09-04 Micah Cowan * doc/wget.texi (Time-Stamping): "older" -> "not newer". * doc/Makefile.am (install.man, install.wgetrc): Use $(mkinstalldirs), not $(top_srcdir)/mkinstalldirs. 2009-09-04 Micah Cowan * Makefile.am (EXTRA_DIST): build_info.pl -> build-aux/build_info.pl * build-aux/build_info.pl: Moved from top directory. * md5/*: Updated md5 from gnulib. * configure.ac: Configured build-aux/ as auxiliarry directory. * build-aux/compile, build-aux/config.guess, build-aux/config.rpath, build-aux/config.sub, build-aux/depcomp, build-aux/install-sh, build-aux/link-warning.h, build-aux/mdate-sh, build-aux/missing, build-aux/mkinstalldirs, build-aux/texinfo.tex, build-aux/useless-if-before-free, build-aux/vc-list-files, build-aux/ylwrap: Moved from top directory. * build-aux/announce-gen: Imported from gnulib. * build-aux/update-copyright: Imported from gnulib. * build-aux/gnupload: Imported from gnulib. * lib/Makefile.am, m4/gnulib-cache.m4, m4/gnulib-comp.m4: Adjusted for announce-gen, update-copyright, and gnupload. 2009-09-03 gettextize * m4/gettext.m4: Upgrade to gettext-0.17. * m4/iconv.m4: Upgrade to gettext-0.17. * m4/lib-link.m4: Upgrade to gettext-0.17. * m4/po.m4: Upgrade to gettext-0.17. * po/Makefile.in.in: Upgrade to gettext-0.17. * configure.ac (AM_GNU_GETTEXT_VERSION): Bump to 0.17. 2009-09-03 Micah Cowan * src/ftp-ls.c (ftp_parse_vms_ls): Replace use of localtime_r with localtime, as not all platforms have localtime_r. * src/Makefile.am (wget_SOURCES): Rearranged some of the file order, so .c files and .h files are apart. Added gettext.h, so that it shows up in the dist. 2009-09-03 Micah Cowan * NEWS: Give credit to jff for SSL security fix, call attention to IRI support's dependence on libidn and libiconv, and note that --html-extension is still accepted, though deprecated. * lib/*, m4/*: Updated gnulib. * lib/getpagesize.c, lib/memchr.c, lib/memchr.valgrind, lib/stddef.in.h, lib/str-two-way.h, lib/strcasecmp.c, lib/strcasestr.c, lib/strings.in.h, lib/strncasecmp.c, m4/getpagesize.m4, m4/memchr.m4, m4/mmap-anon.m4, m4/stddef_h.m3, m4/strcase.m4, m4/strcasestr.m4, m4/strings_h.m4, m4/wchar_t.m4: Added, via gnulib --import strcasestr. * configure.ac: Move AM_GNU_GETTEXT below AC_AIX, to shut up autoconf warnings. 2009-09-02 Micah Cowan * tests/Makefile.am (unit-tests): explicit dependency is unnecessary (and harmful, as it overrides the automatic one). 2009-09-02 Micah Cowan * src/gettext.h: Refreshed from gettext 0.17. * src/test.c: Added definition of program_name variable, required by lib/error.c (libgnu.a). Doesn't cause problems until you try to build on a non-GNU system... * src/build_info.c.in: Removed useless "+gettext" feature ad. 2009-09-02 Micah Cowan * po/Rules-quot (en_US.po-update): Remove use of GNU make's non-portable $^ variable. 2009-09-01 Steven Schubiger * tests/Makefile.am: Add Test-cookies.px, Test-cookies-401.px and Test-restrict-ascii.px to EXTRA_DIST. 2009-09-01 Micah Cowan * tests/Makefile.am (../src/libunittest.a): Make it a phony target, so we always make sure to get up-to-date unit-test runs. 2009-08-31 Steven Schubiger * tests/Makefile.am: Add Test-k.px to EXTRA_DIST. 2009-08-29 Steven Schubiger * tests/run-px: Add Test-k.px to the list. * tests/Test-k.px: Test escaping of semicolons in local file strings. 2009-08-29 Steven Schubiger * src/convert.c (local_quote_string): Percent-encode semicolons in local file strings. 2009-08-27 Micah Cowan * tests/run-px: Added Test-cookies.px, Test-cookies-401.px * tests/Test-cookies.px: Basic testing to make sure Wget doesn't send cookies; no path/domain checking. * tests/Test-cookies.px: Test to make sure Wget heeds cookies when they are sent with a 401 response (#26775). * tests/HTTPServer.pm (send_response): Don't try to substitute port in response body, if there isn't one. (verify_request_headers): Avoid uninitialized warning when an expected header isn't provided by Wget. 2009-08-27 Micah Cowan * tests/WgetTest.pm.in (run): Shift the errcode right by 8 binary places. * tests/Test--spider-fail.px, Test--spider-r--no-content-disposition.px, Test--spider-r--no-content-disposition-trivial.px, Test--spider-r-HTTP-Content-Disposition.px, Test--spider-r.px, Test-O-nonexisting.px, Test-cookies-401.px, Test-nonexisting-quiet.px: Adjusted "expected error code"; Wget's exit codes have changed. 2009-08-27 Micah Cowan * src/wget.h (uerr_t): added new VERIFCERTERR code for SSL certificate problems. Marked exit codes that are defined but never used (at least, the ones I could find). * src/retr.c, retr.h (retrieve_url): Added a new boolean argument to determine whether an exit status should be recorded. (retrieve_from_file): Adjust to new retrieve_url signature. * src/res.c (res_retrieve_file): Don't have retrieve_url record an exit status for robots.txt. * src/recur.c (retrieve_tree): Adjust to new retrieve_url signature. * src/main.c (main): Use the exit status stored by retrieve_url. * src/http.c (gethttp): Distinguish certificate verification problems from SSL connection issues. (http_loop): Handle newly-created VERIFCERTERR error code. * src/exits.c, exits.h: Newly added. * src/Makefile.am (wget_SOURCES): Add exits.c and exits.h. 2009-08-27 Micah Cowan * src/http.c (gethttp): Make sure Wget heeds cookies when they are sent with a 401 response; or any other sort of response for that matter (#26775). 2009-08-27 Micah Cowan * doc/texi2pod.pl: Handle @asis in table-element formatting. * doc/wget.texi (Exit Status): Document new exit codes. 2009-08-27 Micah Cowan * NEWS: Mention the changes to exit codes. 2009-08-27 Micah Cowan * NEWS: Add mention of the NUL characters SSL security fix. 2009-08-19 Micah Cowan * src/openssl.c (ssl_check_certificate): Only warn about an attack if the hostname would otherwise have matched. Also some formatting cleanup. 2009-08-19 Joao Ferreira * src/openssl.c (ssl_check_certificate): Detect embedded NUL characters in the SSL certificate common name. 2009-08-17 Tony Lewis * src/http.c (gethttp): Ensure that we parse Content-Length before we attempt to refer to its value. Without this fix, NTLM support was completely buggered. #27192 2009-08-09 Michael Baeuerle * src/ftp.c: #include for strcasecmp. 2009-08-02 Micah Cowan * doc/wget.texi (Option Syntax): "This is a complete equivalent of" -> "This is completely equivalent to". Thanks to Reuben Thomas for catching this. 2009-07-28 Micah Cowan * src/main.c (option_data): Rename --html-extension to --adjust-extension. (print_help): Ditto. * src/options.h (struct option): Rename html_extension to adjust_extension. * src/http.c (gethttp): Ditto. * src/convert.c (local_quote_string): Ditto. * src/init.c (commands): Add "adjustextension", and reflect rename change for opt.adjust_extension, for both "adjustextension" and "htmlextension". 2009-07-28 Micah Cowan * doc/wget.texi (Download Options): Document "lowercase", "uppercase", and the new "ascii" specifier for --restrict-file-names. (HTTP Options): Rename --html-extension to --adjust-extension. (Wgetrc Commands): Rename html_extension to adjust_extension. 2009-07-28 Micah Cowan * NEWS: Mention some more previously undocumented items, the new "ascii" specifer for --restrict-file-names, and the renaming of --html-extension to --adjust-extension. 2009-07-27 Petr Pisar * src/main.c (print_help): Fixed a couple typos. 2009-07-27 Petr Pisar * po/Makevars (MSGID_BUGS_ADDRESS): Fixed. 2009-07-27 Micah Cowan * tests/Test-restrict-ascii.px: New. * tests/run-px: Added Test-restrict-ascii.px. 2009-07-27 Micah Cowan * src/options.h (struct options): Added restrict_files_nonascii boolean field. * src/url.c (FILE_CHAR_TEST): Add check for chars outside the ASCII range. * src/init.c (defaults): Add restrict_files_nonascii to initialization. (cmd_spec_restrict_file_names): Allow parsing of "ascii" keyword. 2009-07-27 Marcel Telka * src/iri.c (do_conversion): Typo: invalide -> invalid 2009-07-26 Micah Cowan * tests/Test-ftp-iri.px, Test-ftp-iri-fallback.px, Test-ftp-iri-recursive.px, Test-ftp-iri-disabled.px, Test-idn-cmd.px, Test-idn-robots.px: Adjust wget invocations, replacing --locale with --local-encoding. 2009-07-26 Micah Cowan * src/main.c (option_data): Rename --locale option to --local-encoding (print_help): Document --no-iri, --local-encoding, and --remote-encoding, within usage message. Remove defunct --preserve-permissions. 2009-07-26 Micah Cowan * doc/wget.texi (Download Options): Change --iri item to --no-iri; rename --locale to --local-encoding. (Wgetrc Commands): Document iri, local_encoding, remote_encoding, ask_password, auth_no_challenge, and keep_session_cookies. 2009-07-23 Micah Cowan * src/progress.c (get_eta): Change "Translation note" in comment to "TRANSLATORS", so it actually appears in wget.pot. 2009-07-10 Micah Cowan * util/paramcheck.pl (find_documentation): Added. (emit_undocumented_opts): Check for documentation in both TexInfo and --help string. 2009-07-07 Steven Schubiger * tests/Makefile.am: Add IDN/IRI test files and Test-meta-robots.px to EXTRA_DIST. 2009-07-06 Micah Cowan * src/main.c (print_help): Improve documentation of --base. 2009-07-06 Micah Cowan * doc/wget.texi (Logging and Input File Options): Alter description of --input-file, implying that --force-html isn't necessary when the input is in HTML file. Improve accuracy of --base description. (Wgetrc Commands): Improve accuracy of "base" description. (HTTP Options): Clarify operation of --post-file. 2009-07-05 Petr Pisar * src/main.c (print_version): Mark initial line for translation, along with (env), (user), and (system). Change copyright year to 2009. * src/http.c (http_loop): Space after colon. * src/gnutls.c (ssl_check_certificate): Use quote function, rather than explicit quotes. 2009-07-05 Micah Cowan * tests/Test-meta-robots.px: Added. * tests/run-px: Add Test-meta-robots.px to the list. 2009-07-05 Micah Cowan * src/html-url.c (tag_handle_meta): Handle meta name="robots" properly: deal with whitespace, commas after... * src/netrc.c (parse_netrc): Rename local-scope variable "quote" to "qmark", to avoid conflict with the function name. 2009-07-05 Micah Cowan * src/Makefile.am (version.c): Add dependency on configure.ac. * src/iri.c: Mark some strings for translation. 2009-07-05 Micah Cowan * po/Rules-quot: Added targets to build en@{quot,boldquot}.po * po/POTFILES.in: Added src/gnutls.c, src/iri.c. * po/*.po: Updated translations from TP. New translation: Lithuanian. * lib/*, md5/*: Updated gnulib. 2009-07-04 Steven Schweda * vms/COLLECT_DEPS.COM, vms/config.h_vms, vms/decc_ver.c, vms/DESCRIP_CONFIG.MMS, vms/DESCRIP_DEPS.MMS, vms/DESCRIP_MKDEPS.MMS, vms/DESCRIP.MMS, vms/DESCRIP_SRC.MMS, vms/vms.c, vms/vms.h, vms/vms_ip.h, vms/vms_name_fix.sh, vms/VMS_NOTES.TXT, vms/VMS-WGET.COM, vms/WGET.HLP, vms/WGET_MULTINET.OPT, vms/WGET.OPT, vms/WGET_SSL_HP.OPT, vms/WGET_SSL.OPT: Added. 2009-07-04 Steven Schweda * src/wget.h (ORIG_SFX): Macro added, to supply an alternative "_orig" suffix on VMS. * src/connect.c, host.c, host.h, main.c: Include "vms_ip.h" rather than on VMS systems. * src/convert.c (write_backup_file): Use "_orig" rather than ".orig" on VMS. * src/ftp-basic.c (ftp_list): Don't use LIST -a on VMS servers. * src/ftp.c [__VMS]: Include "vms.h" * src/ftp.c (getftp): Disable some unhelpful "for VMS" code. Pass the OS type to ftp_list. (getftp) [__VMS]: Alter the filename as required, and invoke fopen with extra optional arguments. (ftp_retrieve_list): Set permissions before setting the times. * src/ftp-ls.c [__VMS]: Various improvements. * src/http.c (SET_USER_AGENT): Added. Include operating-system info in the User-Agent header. (gethttp): Use ORIG_SFX macro. (gethttp) [__VMS]: invoke fopen with extra arguments to communicate file type. (http_loop): Add cast to time_t. * src/init.c (wgetrc_user_file_name) [__VMS]: Adjustments for where to find the .wgetrc. * src/log.c: Adjustments to improve handling of log files on VMS. * src/main.c [__VMS]: Added --ftp-stmlf option (and associated documentation). Adjust description of backup-suffixes, as they now differ for VMS. (print_help, print_version): Include OS type in version information. (main) [__VMS]: Invoke fopen with extra options, to specify file type information. * src/netrc.c (search_netrc) [__VMS]: Fix .netrc-finding. * src/openssl.c, openssl.h (ssl_connect_wget): Renamed from ssl_connect. * src/options.h (struct options): Add ftp_stmlf field. * src/retr.c (write_data) [__VMS]: Avoid fflush. * src/url.c [__VMS]: Include vms.h. * src/url.c (url_file_name) [__VMS]: Handle naming as required for VMS. * src/utils.c [__VMS]: Include vms.h. Various added facilities for VMS needs. 2009-07-04 Steven Schubiger * src/url.c (url_parse): If an URL scheme is invalid, distinguish between an unsupported or missing scheme. * src/url.c: Add a "missing scheme" entry to parse_errors. 2009-07-04 Micah Cowan * src/main.c (print_version): Allow localization of the version-info labels, eschew attempts at alignment (which is complicated when handling translated strings), and avoid using printf() with variable-stored format strings that lack conversion specifications. (format_and_print_line): For similar reasons, don't calculate line-continuation tabulation based on the number of bytes in a string. 2009-07-03 Micah Cowan * tests/Test-ftp-iri-disabled.px, Test-iri-disabled.px: --iri=no --> --no-iri 2009-07-03 Micah Cowan * src/iri.h (iri_dup): Provide macro definition for when IRIs are disabled. * src/Makefile.am (LIBS): Added @LIBICONV@. (wget_SOURCES): Added iri.h. (EXTRA_wget_SOURCES): Added iri.c, so it gets packaged even if IRI support was disabled. 2009-07-03 Micah Cowan * doc/wget.texi (Download Options): --iri=no --> --no-iri (Contributors): Add Saint Xavier. 2009-07-03 Micah Cowan * configure.ac: Ensure LIBICONV is empty if IRIs are disabled. * AUTHORS: Added Ted Mielczarek and Saint Xavier. * NEWS: Added items for IRI support, new --version information. 2009-07-02 Micah Cowan * src/recur.c (url_enqueue): Quote enqueue/dequeue debug messages. * src/html-url.c (append_url): Change "merge()" quoting style from locale_quoting_style to escape_quoting_style. 2009-07-01 Steven Schubiger * src/Makefile.am: Add a rule to generate build_info.c and list the build_info.c.in file in EXTRA_DIST. Adjust elsewhere where needed. * src/build_info.c: Remove this static source file. * src/build_info.c.in: Data for generation of build_info.c. 2009-07-01 Steven Schubiger * Makefile.am: Add build_info.pl to EXTRA_DIST. * build_info.pl: Generate build_info.c from data. 2009-07-01 Micah Cowan * tests/HTTPServer.pm (send_response): Invocation of verify_request_headers, to support testing of Wget-sent header values. (verify_request_headers): Added. * tests/Test-iri.px: Added verification checks for Referer values. 2009-07-01 Micah Cowan * src/retr.c (retrieve_url): Use the existing "redirect" label, instead of superfluous "second_try". Removed no-longer-accurate debug statement. Use the "newloc" parameter to store the fallback URL, when IRI version was rejected. * src/recur.c (retrieve_tree): Always use the parsed URL for tracking the Referer, since that's the one we actually requested (if there's a difference in terms of percent-encodings and such). 2009-06-29 Micah Cowan * tests/WgetTest.pm.in (_cleanup): Allow cleanup of test directories to be skipped at user discretion. * tests/run-px, Test-iri-percent.px, Test-ftp-iri-recursive.px: Added test for percent-coded value preservation, FTP recursion when IRI support's on. 2009-06-29 Micah Cowan * src/html-url.c (append_url): Quote some more arguments that might contain characters that are inappropriate to display for the current locale. * src/retr.c (retrieve_from_file): Be sure to pass iri information when parsing an input-fle url, and be sure to hand a clean iri struct, off to retrieve_tree and retrieve_url. * src/iri.c, iri.h (iri_dup): Added. * src/retr.c (retrieve_url): Re-parse for IRI fallback. * src/main.c (main): Set up iri before the url_parse invocation (so we can use it). 2009-06-20 Micah Cowan * doc/wget.texi (Contributors): Added Jay Krell. 2009-06-20 Jay Krell * src/sysdep.h (_ALL_SOURCE): (small change) Define the _ALL_SOURCE macro on INTERIX systems. (I switched the location from ftp.c to sysdep.h --mjc) 2009-06-15 Micah Cowan * src/ftp.c (getftp): If we can't accept the connection, return CONERROR, not whatever the contents of err happens to be. Fixes bug #25015. * src/retr.c (fd_read_body): Make both args to progress_create consistent, resulting in an accurate progress display. Fixes bug #24948. 2009-06-14 Phil Pennock (tiny change) * src/host.h: Declare `is_valid_ip_address'. * src/host.c (is_valid_ip_address): New function. * src/http.c (gethttp): Specify the hostname to ssl_connect_wget. * src/gnutls.c (ssl_connect_wget): Specify the server name. * src/openssl.c (ssl_connect_wget): Likewise. * src/ssl.h: Change method signature for ssl_connect_wget. 2009-06-14 Micah Cowan * tests/Makefile.am (EXTRA_DIST): Include all the tests, run-px, and certs/, to make distcheck happy. 2009-06-14 Micah Cowan * src/Makefile.am (wget_SOURCES): css-tokens.h needs to ship with dist, too. 2009-06-14 Micah Cowan * po/Makefile.in.in (distclean): remove en_US.po, too. * Makefile.am: Include md5 as a subdir unconditionally. It may result in useless compilation, and additional risk of breaking a build of something that isn't actually needed, but otherwise it's too much of a hassle to manage a failure-free distcheck. 2009-06-14 Micah Cowan * doc/Makefile.am (wget.pod): $(srcdir)/version.texi -> version.texi 2009-06-13 Micah Cowan * src/init.c: Rename setval_internal_wrapper to setval_internal_tilde, ensure we don't "replace" the tilde unless it's actually present. Clean up some minor GNU style issues. 2009-06-13 Julien Pichon * src/init.c: Handle tilde-expansion in wgetrc commands, without resorting to setting/unsetting globals to change behavior in one call location. 2009-06-12 Micah Cowan * src/host.c: Include before . Not required by POSIX any more, but some older systems (such as FreeBSD 4.1) still need it, and it doesn't seem like it could hurt... * src/build_info.c (library): Handle "https" as a feature in its own right, apart from "gnutls" and "openssl". * src/host.c: Declare h_errno if no declaration is provided. Idea thanks to Maciej W. Rozycki. 2009-06-12 Micah Cowan * doc/wget.texi (Download Options): More accuracy on what happens when -nd is used with -r or -p. 2009-06-12 Micah Cowan * configure.ac: Check for h_errno declaration. Idea thanks to Maciej W. Rozycki. 2009-06-11 Xin Zou * src/http.c (gethttp): Fix some memory leaks. 2009-06-11 Micah Cowan * src/http.c (http_atotm): Handle potential for setlocale's return value to be static storage. Thanks to Benjamin Wolsey . * src/sysdep.h: Need NAMESPACE_TWEAKS on non-Linux glibc-based systems, too. Thanks to Robert Millan. 2009-06-11 Micah Cowan * doc/wget.texi (Contributors): Added Xin Zou, Benjamin Wolsley, and Robert Millan. 2009-06-11 Joao Ferreira * doc/wget.texi (Option Syntax): Fixed contradictory and confusing explanation of --folow-ftp and negation. 2009-06-11 Benjamin Wolsey * tests/Test-proxied-https-auth.px: Take an optional argument for the top source directory, so we can find the cert and key. * tests/run-px: Provide the top source directory as an argument, so scripts can find their way around. 2009-06-10 Micah Cowan * doc/sample.wgetrc: Add "https_proxy" to the proxy examples. Thanks to Martin Paul for the suggestion. 2009-05-28 Steven Schubiger * src/ftp.c (ftp_get_listing): Update the "listing file" string after calling ftp_loop_internal(). 2009-05-27 Steven Schubiger * src/ftp.c (ftp_get_listing): Duplicate the "listing file" string to avoid memory corruption when FOPEN_EXCL_ERR is encountered. 2009-05-17 Steven Schubiger * src/progress.c (eta_to_human_short): Fix the remaining hours to be displayed. Spotted by Tadeu Martins (#26411). 2009-04-24 Micah Cowan * src/hash.c: Change stdint.h inclusion to use HAVE_STDINT_H, not C99 check. * src/connect.c: stdint.h inclusion added. Thanks to Markus Duft for a similar patch. 2009-04-20 Micah Cowan * src/Makefile.am (version.c): Fix unportable use of "echo -n". 2009-04-13 Steven Schubiger * src/ftp.c (ftp_retrieve_list): Move the duplicated code that determines the local file to a function. * src/http.c (http_loop): Likewise. * src/retr.c (set_local_file): New function. 2009-04-11 Steven Schubiger * tests/run-px: Skip testing with real rc files by setting SYSTEM_WGETRC and WGETRC to /dev/null. 2009-04-11 Steven Schubiger * src/init.c (initialize): Run a custom SYSTEM_WGETRC when provided as an environment variable. 2009-03-03 Steven Schubiger * src/ftp.c, src/http.c, src/main.c, src/recur.h, tests/Makefile.am: Update the copyright years. 2009-02-27 Steven Schubiger * src/ftp.c (ftp_loop_internal): Don't claim for FTP retrievals when writing to standard output either that the document has been saved. Addresses bug #20520 again. 2009-02-27 Gisle Vanem * src/main.c (main): "freopen (NULL,.." causes an assertion in MSVC debug-mode. I.e. NULL isn't legal. But the "CONOUT$" device works fine. 2009-02-25 Benjamin Wolsey * tests/Makefile.am (run-px-tests): Ensure run-px is run from srcdir. * tests/run-px: Include modules from srcdir. 2009-02-21 Steven Schubiger * src/http.c (http_loop): When a document is written to standard output, don't claim it has been saved to a file. Addresses bug #20520. 2009-02-18 Steven Schubiger * src/recur.h: Remove the dangling declaration for recursive_cleanup(). 2009-02-01 Gerardo E. Gidoni * src/main.c, recur.c, recur.h, res.c, retr.c, retr.h: restructured code to avoid multiple 'url_parse' calls. 2009-01-23 Steven Schubiger * util/freeopts, util/rmold.pl, util/trunc.c: Remove unnecessary whitespace. 2008-12-04 Micah Cowan (not copyrightable) * tests/run-px, Test-idn-robots.px: Added test for robots-file downloads. * tests/Test-idn-cmd.px, Test-idn-meta.px, Test-idn-headers.px: Fix test names. 2008-11-27 Saint Xavier * src/http.c (gethttp): Move authentication code before filename allocation avoiding fallbacking on default filename because "Content-Disposition" header wasn't present before authentcation has been completed. Fixes bug #24862. 2008-11-26 Micah Cowan (not copyrightable) * tests/Test-ftp-iri-disabled.px, Test-ftp-iri-fallback.px, Test-ftp-iri.px, Test-idn-cmd.px, Test-idn-headers.px, Test-idn-meta.px, Test-iri-disabled.px, Test-iri-forced-remote.px, Test-iri-list.px, Test-iri.px: More module-scope warnings. 2008-11-25 Steven Schubiger * tests/WgetTest.pm.in: Remove the magic interpreter line; replace -w with lexical warnings. 2008-11-16 Steven Schubiger * src/main.c: Declare and initialize the numurls counter. * src/ftp.c, http.c: Make the counter visible here and use it. * src/options.h: Remove old declaration from options struct. 2008-11-15 Steven Schubiger * src/init.c (defaults): Set default waitretry value. 2008-11-15 Steven Schubiger * doc/sample.wgetrc: Comment the waitretry "default" value, because there is a global one now. * doc/wget.texi (Download Options): Mention the global default value. 2008-11-14 Steven Schubiger * src/main.c (format_and_print_line): Use a custom format string for printing leading spaces. 2008-11-13 Steven Schubiger * tests/FTPServer.pm, FTPTest.pm, HTTPServer.pm, HTTPTest.pm, WgetTest.pm.in: Clean up leftover whitespace. 2008-11-13 Micah Cowan * src/http.c (gethttp): Don't do anything when content-length >= our requested range. 2008-11-12 Steven Schubiger * tests/Test-auth-basic.px, Test-auth-no-challenge.px, Test-auth-no-challenge-url.px, Test-c-full.px, Test-c-partial.px, Test-c.px, Test-c-shorter.px, Test-E-k-K.px, Test-E-k.px, Test-ftp.px, Test-HTTP-Content-Disposition-1.px, Test-HTTP-Content-Disposition-2.px, Test-HTTP-Content-Disposition.px, Test-N-current.px, Test-N-HTTP-Content-Disposition.px, Test-N--no-content-disposition.px, Test-N--no-content-disposition-trivial.px, Test-N-no-info.px, Test--no-content-disposition.px, Test--no-content-disposition-trivial.px, Test-N-old.px, Test-nonexisting-quiet.px, Test-noop.px, Test-np.px, Test-N.px, Test-N-smaller.px, Test-O-HTTP-Content-Disposition.px, Test-O-nc.px, Test-O--no-content-disposition.px, Test-O--no-content-disposition-trivial.px, Test-O-nonexisting.px, Test-O.px, Test-proxy-auth-basic.px, Test-Restrict-Lowercase.px, Test-Restrict-Uppercase.px, Test--spider-fail.pxm, Test--spider.px, Test--spider-r-HTTP-Content-Disposition.px, Test--spider-r--no-content-disposition.px, Test--spider-r--no-content-disposition-trivial.px, Test--spider-r.px: Enforce lexically scoped warnings. * tests/Test-proxied-https-auth.px, run-px: Place use strict before use warnings. 2008-11-12 Steven Schubiger * tests/FTPServer.pm, FTPTest.pm, HTTPServer.pm, HTTPTest.pm: Remove the magic interpreter line, because it cannot be used fully. Substitute -w with use warnings. 2008-11-12 Steven Schubiger * src/main.c (print_version): Hand the relevant xstrdup/xfree calls back to format_and_print_line(). 2008-11-12 Micah Cowan * src/ftp-ls.c (ftp_index): HTML-escape dir name in title, h1, a:href. 2008-11-12 Alexander Belopolsky * src/url.c, url.h (url_escape_unsafe_and_reserved): Added. * src/ftp-ls.c (ftp_index): URL-escape, rather than HTML-escape, the filename appearing in the link. 2008-11-11 Steven Schubiger * src/main.c (format_and_print_line): Move both the memory allocating and freeing bits upwards to print_version(). 2008-11-11 Micah Cowan * tests/HTTPServer.pm (handle_auth): Allow testing of --auth-no-challenge. * tests/Test-auth-no-challenge.px, Test-auth-no-challenge-url.px: Added. * tests/run-px: Add Test-auth-no-challenge.px, Test-auth-no-challenge-url.px. 2008-11-10 Saint Xavier * src/http.c: Make --auth-no-challenge works with user:pass@ in URLs. 2008-11-10 Micah Cowan * doc/Makefile.am (EXTRA_DIST): Removed no-longer-present README.maint (shouldn't have been there in the first place). * doc/wget.texi (Mailing Lists): Added information aboug Gmane portal, added subsection headings. Update node pointers. 2008-11-10 Micah Cowan * MAILING-LIST: Mention Gmane, introduce subsections. 2008-11-07 Steven Schubiger * tests/run-px: Use some colors for the summary part of the test output to strengthen the distinction between a successful or failing run. 2008-11-06 Steven Schubiger * tests/run-px: When executing test scripts, invoke them with the current perl executable name as determined by env. 2008-11-06 Micah Cowan * tests/run-px: Use strict (thanks Steven Schubiger!). 2008-11-05 Micah Cowan * src/ftp.c (print_length): Should print humanized "size remaining" only when it's at least 1k. 2008-11-05 Micah Cowan * doc/wget.texi: Move --no-http-keep-alive from FTP Options to HTTP Options. (Mailing List): Mention moderation for unsubscribed posts, and archive location. 2008-11-05 Micah Cowan * MAILING-LIST: Mention moderation for unsubscribed posts, and archive location. 2008-11-04 Micah Cowan * doc/wget.texi, fdl.texi: Updated to FDL version 1.3. 2008-10-31 Micah Cowan * src/main.c (print_version): Add information about the mailing list. 2008-10-31 Micah Cowan * doc/wget.texi (Mailing List): Update info to reflect change to bug-wget@gnu.org. 2008-10-31 Micah Cowan * MAILING-LIST: Update information. * NEWS: Add mention of mailing list move. 2008-10-31 Alexander Drozdov * src/retr.c (fd_read_hunk): Make assert deal with maxsize == 0. * src/ftp-ls.c (clean_line): Prevent underflow on empty lines. 2008-10-26 Gisle Vanem * src/main.c (format_and_print_line): Put variables on top of blocks (not all compilers are C99). Add an extra '\n' if SYSTEM_WGETRC isn't defined and printed. 2008-09-30 Steven Schubiger * doc/wget.texi (Wgetrc Commands): Add default_page, save_headers, spider and user_agent to the list of recognized commands. 2008-09-10 Michael Kessler * doc/wget.texi (Robot Exclusion): Fixed typo "downloads" -> "download" 2008-09-09 Micah Cowan * tests/Test-idn-cmd.px: Added. * tests/run-px: Added Test-idn-cmd.px. 2008-09-09 Micah Cowan * src/init.c (home_dir): Save the calculated value for home, to avoid duplicated work on repeated calls. (wgetrc_file_name) [WINDOWS]: Define and initialize home var. * src/build_info.c, main.c: Remove unnecessary extern vars system_wgetrc and locale_dir. * src/main.c: Define program_name for lib/error.c. 2008-09-09 Gisle Vanem * src/url.c (url_error): Use aprintf, not asprintf. 2008-09-02 Gisle Vanem * src/mswindows.h: Must ensure is included before we redefine ?vsnprintf(). 2008-08-28 Micah Cowan * tests/HTTPServer.pm (run): Allow distinguishing between hostnames, when used as a proxy. * tests/Test-idn-headers.px, Test-idn-meta.px: Added. * tests/run-px: Added Test-idn-headers.px, Test-idn-meta.px. * tests/Test-proxy-auth-basic.px: Use the full URL, rather than just the path (made necessary by the accompanying change to HTTPServer.pm). 2008-08-14 Xavier Saint * tests/Test-iri-list.px : Fetch files from a remote list. 2008-08-08 Steven Schubiger * src/main.c, utils.h: Removed some dead conditional DEBUG_MALLOC code. 2008-08-03 Xavier Saint * tests/Test-iri.px : HTTP recursive fetch for testing IRI support and fallback. * tests/Test-iri-disabled.px : Same file structure as Test-iri.px but with IRI support disabled * tests/Test-iri-forced-remote.px : There's a difference between ISO-8859-1 and ISO-8859-15 for character 0xA4 (respectively currency sign and euro sign). So with a forced ISO-8859-1 remote encoding, wget should see 0xA4 as a currency sign and transcode it correctly in UTF-8 instead of using the ISO-8859-15 given by the server. * tests/Test-ftp-iri.px : Give a file to fetch via FTP in a specific locale and expect wget to fetch the file UTF-8 encoded. * tests/Test-ftp-iri-fallback.px : Same as above but wget should fallback on locale encoding to fetch the file. * tests/Test-ftp-iri.px : Same as Test-ftp-iri.px but with IRI support disabled. The UTF-8 encoded file should not be retrieved. 2008-08-03 Xavier Saint * doc/wget.texi : Add option descriptions for the three new options --iri, --locale and --remote-encoding related to IRI support. * doc/sample.wgetrc : Add commented lines for the three new command iri, locale and encoding related to IRI support. 2008-08-03 Steven Schubiger * src/url.c, url.h (url_error): Better messages for unsupported schemes, especially https. * src/html-url.c, recur.c, retr.c: Adjust to new url_error invocation, and free result. 2008-08-03 Micah Cowan * src/main.c (print_help): Added --default-page. 2008-08-03 Micah Cowan * src/build_info.c, css-url.c: #include wget.h, not config.h. 2008-08-03 Micah Cowan * doc/wget.texi: Don't set UPDATED; already set by version.texi. (HTTP Options): Add --default-page option. 2008-08-01 Joao Ferreira * src/init.c, main.c, options.h, url.c: Added option --default-page to support alternative default names for index.html 2008-08-01 Joao Ferreira * NEWS: Added option --default-page to support alternative default names for index.html 2008-07-17 Steven Schubiger * src/retr.c (retrieve_from_file): When given an URL as input file, use it as baseref if none was specified and treat the input file as HTML if its content type is text/html. * src/init.c (cleanup): Free the memory associated with the base option (when DEBUG_MALLOC is defined). 2008-07-17 Steven Schubiger * doc/wget.texi (Logging and Input File Options): Document for --input-file and according remote input file URLs, the implicit enforcement of treating a document as HTML and the possible baseref assumption. 2008-07-02 Xavier Saint * src/iri.c, iri.h : New function idn_decode() to decode ASCII encoded hostname to the locale. * src/host.c : Show hostname to be resolved both in locale and ASCII encoded. 2008-06-30 Micah Cowan * NEWS: Entries for 1.11.4. * AUTHORS: Added Steven Schubiger. 2008-06-29 Micah Cowan * doc/wget.texi : Added Joao Ferreira, Mike Frysinger, Alain, Guibert, Madhusudan Hosaagrahara, Jim Paris, Kenny Parnell, Benno Schulenberg, and Pranab Shenoy. Added Steven Schubiger to the "Special Thanks" section. 2008-06-28 Steven Schubiger * src/retr.c (retrieve_from_file): Allow for reading the links from an external file (HTTP/FTP). 2008-06-26 Xavier Saint * src/iri.c, iri.h : New functions locale_to_utf8() and idn_encode() adding basic capabilities of IRI/IDN. * src/url.c : Convert URLs from locale to UTF-8 allowing a basic support of IRI/IDN 2008-06-26 Xavier Saint * configure.ac : IRIs support required libiconv, check it. 2008-06-25 Steven Schubiger * src/ftp.c (getftp): When spidering a FTP URL, emit a diagnostic message if the remote file exists. 2008-06-24 Steven Schubiger * src/http.c (http_loop): Replace escnonprint() occurence with a quotearg_style() call. 2008-06-24 Micah Cowan * src/ftp-ls.c (ftp_index): Don't assume time_t* is compatible with long*. Fixes crash on Windows, and probably other systems. 2008-06-22 Steven Schubiger * src/http.c: Explicitly initialize and deallocate the message string used by the -nv --spider functionality. 2008-06-22 Steven Schubiger * src/http.c: Make -nv --spider include the file's name when it exists. 2008-06-22 Micah Cowan * tests/Test-proxied-https-auth.px: Shift exit code so it falls in the lower bits, and actually fails when it should. Use dynamic port, instead of static port. Only run the test if our Wget was built with HTTPS support. * tests/certs/server-cert.pem, certs/server-key.pem: Apparently failed to add these from 1.11.x repo. Fixed. 2008-06-22 Micah Cowan * src/Makefile.am (version.c): Fixed version string invocation so it once again can't result in unterminated strings, made all the string vars pointers-to-const, and moved line lengths below 80 (in Makefile.am, not in version.c). 2008-06-19 Xavier Saint * src/iri.c, iri.h : New function check_encoding_name() as a preliminary encoding name check. * src/main.c, iri.c : Make use of check_encoding_name(). 2008-06-19 Xavier Saint * src/iri.c, iri.h : Add a new function find_locale() to find out the local system encoding. * src/main.c : Make use of find_locale(). 2008-06-19 Xavier Saint * src/iri.c : Include missing stringprep.h file and add a cast. * src/init.c : set a default initial value for opt.enable_iri, opt.locale and opt.encoding_remote. 2008-06-19 Xavier Saint * src/html-url.c : Add "content-type" meta tag parsing for retrieving page encoding. * src/iri.h : Make no-op version of parse_charset() return NULL. 2008-06-16 Micah Cowan * src/http.c (http_loop): When hstat.len is higher than the successfully completed content's length, but it's because we _set_ it that way, don't abort. 2008-06-14 Xavier Saint * src/iri.c, iri.h : New files. * src/Makefile.am : Add files iri.h and conditional iri.c. * src/build_info.c : Add compiled feature "iri". * src/http.c : include iri.h and parse charset from Content-Type header. * src/init.c, main.c, options.h : if an options isn't supported at compiled time, don't get rid off it and show a dummy message instead if they are used. 2008-06-14 Xavier Saint * configure.ac: Add support for IRIs 2008-06-13 Micah Cowan * src/build_info.c: ENABLE_NTLM, not HAVE_NTLM; distinguish OpenSSL from "ssl". 2008-06-13 Micah Cowan * doc/wget.texi (Mailing List): The wget-notify mailing list no longer receives commit notifications from the source repository. (Internet Relay Chat): Activity isn't quite so low any more, remove notice to that effect. 2008-06-13 Madhusudan Hosaagrahara * src/Makefile.am, main.c, init.c, init.h, build_info.c: Adds build information to the --version command line option. Fixes bug #20636. 2008-06-12 Micah Cowan * tests/FTPServer.pm, FTPTest.pm, HTTPServer.pm, HTTPTest.pm, Test--no-content-disposition-trivial.px, Test--no-content-disposition.px, Test--spider-fail.px, Test--spider-r--no-content-disposition-trivial.px, Test--spider-r--no-content-disposition.px, Test--spider-r-HTTP-Content-Disposition.px, Test--spider-r.px, Test--spider.px, Test-E-k-K.px, Test-E-k.px, Test-HTTP-Content-Disposition-1.px, Test-HTTP-Content-Disposition-2.px, Test-HTTP-Content-Disposition.px, Test-N--no-content-disposition-trivial.px, Test-N--no-content-disposition.px, Test-N-HTTP-Content-Disposition.px, Test-N-current.px, Test-N-no-info.px, Test-N-old.px, Test-N-smaller.px, Test-N.px, Test-O--no-content-disposition-trivial.px, Test-O--no-content-disposition.px, Test-O-HTTP-Content-Disposition.px, Test-O-nonexisting.px, Test-O.px, Test-Restrict-Lowercase.px, Test-Restrict-Uppercase.px, Test-auth-basic.px, Test-c-full.px, Test-c-partial.px, Test-c.px, Test-ftp.px, Test-nonexisting-quiet.px, Test-noop.px, Test-np.px, Test-proxied-https-auth.px, Test-proxy-auth-basic.px, WgetTest.pm.in: Use whatever ports are available, rather than hard-coded ones. * tests/run-px: More summary info, explicit exit code. * tests/Makefile.am: Reinstate "run-px-tests" as a dependency for the "check" target. * tests/WgetTest.pm.in: Draw more attention to the fact that WgetTest.pm is a generated file. * tests/Test-proxied-https-auth.px: Better cleanup, so next test can open the port. 2008-06-01 Micah Cowan * src/main.c [WINDOWS]: Reopen stdout in binary mode, when -O - is given. 2008-05-31 Micah Cowan * src/html-url.c, http.c: Avoid casts in a couple spots. 2008-05-31 Micah Cowan * tests/Test-N-current.px: Ensure we catch failures. * tests/Test-N-old.px: Make it test only the timestamp, and not the content length in addition. * tests/Test-N-smaller.px, Test-N-no-info.px: added. * tests/Test-c-partial.px: Improve checking that the file was partially retrieved, rather than overwritten. * tests/run-px: Added Test-N-smaller.px, Test-N-no-info.px. * tests/HTTPServer.pm: Return 416 for fully-retrieved content, rather than 206 with a zero content-length. 2008-05-30 Henri Häkkinen * src/cookies.c, ftp-basic.c, hash.c, html-url.c, http-ntlm.c, http.c, init.c, log.c, main.c, progress.c, ptimer.c, spider.c, url.c, utils.c: Minor changes to silence warnings when using -Wall. 2008-05-29 Micah Cowan * po/*.po: Updated from TP (the 1.11.3 set). * po/POTFILES.in: Added some more files from lib/, remove src/xmalloc.c. * po/quot.sed, po/boldquot.sed: Automatic handling of quotearg's ` and '. 2008-05-26 Steven Schubiger * src/ftp.c (getftp): Replace last remaining invocation of escnonprint with gnulib quote. 2008-05-23 Micah Cowan * tests/Test--spider.px: Make test expect 0 return code. 2008-05-22 Micah Cowan * tests/Makefile.am (run-px-tests): Replaced ugly list of tests with run-px Perl script to manage running them. * tests/run-px: Added. * tests/FTPServer.pm (run): Avoid re-forking. Fixes bug #20458. 2008-05-19 Micah Cowan * src/main.c (main): Password prompt should be done only once (not once per argument), and should be done prior to the background fork. 2008-05-17 Steven Schubiger * doc/wget.texi (Download Options): Change documentation to reflect the new default value for --prefer-family. (Wgetrc Commands): Same, for prefer_family wgetrc command. 2008-05-17 Steven Schubiger * src/init.c (defaults): Set the preferred IP family to `none' by default. 2008-05-17 Micah Cowan * src/main.c (main): Handle Ctrl-D on command-line. 2008-05-17 Kenny Parnell (cmd_spec_prefer_family): Initialize prefer_family to prefer_none. 2008-05-15 Steven Schubiger * src/ftp.c (getftp): Verify that the file actually exists in FTP, by checking it against the listing. 2008-05-15 Micah Cowan * src/main.c (prompt_for_password): Use the quote module. 2008-05-15 Micah Cowan * NEWS: Entry for --ask-password. 2008-05-14 Micah Cowan * src/ftp.c (ftp_retrieve_list): Symlinks and other filenames should be fully quoted. 2008-05-14 Joao Ferreira * src/main.c, src/http.c, src/ftp.c: -nc is now working in conjunction with '-O file'. 2008-05-12 Micah Cowan * src/main.c (main): Downgrade "-N with -O" to a warning, and switch it off to avoid confusing messages. 2008-05-12 Micah Cowan * doc/wget.texi (Download Options): -N with -O downgraded to a warning. 2008-05-12 Micah Cowan * NEWS: Translations and -N/-O. 2008-04-30 Steven Schubiger * src/main.c (main): New code that handles prompting for passwords when specified explicitly via command-line option (using gnulib's getpass-gnu module). (main): Include the getpass header. * src/init.c: Add "ask-password" to the list of recognized commands. * src/options.h: Add an according boolean member to the options struct. * src/sysdep.h: Comment the defines __EXTENSIONS__ and _GNU_SOURCE out, because they're now defined independently by config.h. 2008-04-30 Steven Schubiger * lib/getdelim.c, lib/getline.c, lib/getpass.c, lib/getpass.h, lib/realloc.c, lib/stdio.h, lib/stdio.in.h, lib/stdlib.h, lib/stdlib.in.h: Imported from gnulib. * m4/eoverflow.m4, m4/extensions.m4, m4/getdelim.m4, m4/getline.m4, m4/getpass.m4, m4/malloc.m4, m4/realloc.m4, m4/stdio_h.m4, m4/stdlib_h.m4: Imported from gnulib. * md5/stdint.h: Imported from gnulib. * GNUmakefile: Updated from gnulib. * lib/Makefile.am, lib/getopt.c, lib/unistd.in.h: Updated from gnulib. * m4/gnulib-cache.m4, m4/gnulib-common.m4, m4/gnulib-comp.m4, m4/include_next.m4, m4/unistd_h.m4: Updated from gnulib. * md5/Makefile.am, md5/m4/gnulib-cache.m4, md5/m4/gnulib-common.m4, md5/m4/gnulib-comp.m4, md5/m4/include_next.m4, md5/m4/md5.m4, md5/m4/stdint.m4, md5/md5.c, md5/md5.h, md5/stdint.in.h, md5/wchar.in.h: Updated from gnulib. 2008-04-30 Steven Schubiger * doc/wget.texi : Document the --ask-password option. 2008-04-30 Micah Cowan * src/progress.c (create_image): Fix glitch where too many spaces are printed on lines that don't display the ETA, in multibyte locales. 2008-04-30 Micah Cowan * NEWS: Added documentation for changes made in 1.11.2. 2008-04-27 Rabin Vincent * src/http.c (http_loop): Fix return for the case where we don't download a file because of -nc. 2008-04-27 Micah Cowan * src/url.c (path_simplify): Go back to allowing leading ".." in paths, but only for FTP URLs. (test_path_simplify): Add scheme-specificness to tests, adapt for mu_run_test. * src/test.c (all_tests): Add test_path_simplify. * src/main.c (main): Downgrade -r, -p with -O to a warning rather than an error; elaborate just a bit more for other -O combination cases. 2008-04-27 Micah Cowan * doc/wget.texi (Download Options) <-O>: Elaborate on why certain options make poor combinations with -O. 2008-04-26 Micah Cowan * tests/Makefile.am, Test-proxied-https-auth.px: Added a test for accessing password-protected HTTPS URLs through a proxy (via CONNECT). 2008-04-26 Micah Cowan * src/http.c (gethttp): Move proxy CONNECT handling to below the retry_with_auth label, to deal with properly reconnecting to proxies when we need to authenticate. 2008-04-25 Micah Cowan * src/Makefile.am: -I foo -> -Ifoo. 2008-04-24 Micah Cowan * src/main.c: Revised usage description of --convert-links to apply to CSS as well as to HTML. 2008-04-24 Micah Cowan * doc/wget.texi: Adjusted documentation to account for CSS support; added Ted Mielczarek to contributors. 2008-04-24 Micah Cowan * NEWS: Removed info about move to Automake, Gnulib. Added item about the addition of CSS support. 2008-04-23 Micah Cowan * src/utils.c (test_dir_matches_p): Added a test for the case described in issue #20518. 2008-04-22 Ted Mielczarek * src/css.lex, css-url.c, css-url.h: Added to implement support for parsing CSS in Wget. * src/convert.c: Convert links in CSS files, too. * src/convert.h (convert_options): Added for options link_css_p, link_expect_css. * src/convert.h: Added prototype for new register_css function. * src/html-parse.c: Added support for parsing element content, in addition to tag starts and ends. * src/html-parse.h (taginfo): Added delimiter fields for element content. * src/html-url.h: Added. * src/html-url.c (append_url): No longer internal-linkage only. Now takes position and size as explicit parameters. * src/html-url.c: Use new html-url.h header, add support for handling of "style" HTML attributes. Mark URIs obtained from link tags with rel="stylesheet" with link_expect_css. Adapt uses of append_url to supply the newly-added parameters for position and size. * src/http.c: Add detection for when the content-type is text/css; and ensure that such files have the ".css" filename extension, when --convert-links is active. * src/recur.h: Remove declarations for functions found in html-url.c (moved to html-url.h). * src/recur.c: Add support for culling links from CSS files, too, and tracking for when we're expecting the file to be CSS (even when its content type isn't text/css). * src/retr.c (retrieve_url): Add registration of CSS files. * src/wget.h: Added TEXTCSS to dt flags enum. * src/Makefile.am: Added css.lex, css-url.c, css-url.h, html-url.h to wget_SOURCES. 2008-04-22 Ted Mielczarek * configure.ac: Added check for lex. 2008-04-22 Steven Schubiger * src/http.c (print_response_line): Changed to make responses always be logged, even in --quiet mode, if --server-response was specified. This is to bring http.c's handling of the situation in line with ftp.c's. 2008-04-22 Rabin Vincent * src/ftp.c (ftp_get_listing): Only remove .listing if it has been created. 2008-04-22 Pranab Shenoy * src/init.c: Added test_commands_sorted unit test to check is commands are sorted. Fixes bug #21245. * src/test.c: Added test_commands_sorted to the test suite. 2008-04-22 Mike Frysinger * src/Makefile.am: Move @LIBS@ after other libraries, for better static-linking support. Fixes bug #22143. 2008-04-22 Mike Frysinger * doc/sample.wgetrc: Added prefer_family example. Resolves bug #22142. 2008-04-22 Micah Cowan * ylwrap: Added via automake -ac. 2008-04-22 Micah Cowan * src/Makefile.am, css.lex, css.l: Renamed css.lex to css.l. * src/recur.c (retrieve_tree): Fix typo to allow text/css files to be parsed. 2008-04-22 Jim Paris * src/openssl.c (ssl_init): Enable combined certificate/key in single file (apparent regression from ~1.9). Resolves issue #22767. 2008-04-22 Alain Guibert * src/test.h (mu_run_test): Move declaration before statements, for C90 conformance. Fixes bug #22789. 2008-04-16 Steven Schubiger * src/sysdep.h: Comment the defines __EXTENSIONS__ and _GNU_SOURCE out, because they're now defined independently by config.h. 2008-04-16 Steven Schubiger * src/ftp.c: Use Gnulib's quote function for printing filenames and such. * src/connect.c: Likewise. * src/convert.c: Likewise. * src/cookies.c: Likewise. * src/ftp-opie.c: Likewise. * src/gnutls.c: Likewise. * src/init.c: Likewise. * src/log.c: Likewise. * src/mswindows.c: Likewise. * src/openssl.c: Likewise. * src/progress.c: Likewise. * src/recur.c: Likewise. * src/res.c: Likewise. * src/utils.c: Likewise. 2008-04-14 Steven Schubiger * src/http.c: Use Gnulib's quote function for printing filenames and such. * src/wget.h: #include "quote.h". 2008-04-14 Micah Cowan * GNUmakefile, lib/Makefile.am, lib/error.c, lib/error.h, lib/exitfail.c, lib/exitfail.h, lib/getopt.c, lib/intprops.h, lib/quote.c, lib/quote.h, lib/quotearg.c, lib/quotearg.h, lib/stdlib.in.h, lib/strerror.c, lib/string.in.h, lib/unistd.in.h, lib/wchar.in.h, lib/wctype.in.h, lib/xalloc-die.c, lib/xalloc.h, lib/xmalloc.c, m4/error.m4, m4/exitfail.m4, m4/extensions.m4, m4/gnulib-cache.m4, m4/gnulib-comp.m4, m4/include_next.m4, m4/inline.m4, m4/mbrtowc.m4, m4/mbstate_t.m4, m4/quote.m4, m4/quotearg.m4, m4/stdlib_h.m4, m4/strerror.m4, m4/string_h.m4, m4/unistd_h.m4, m4/wchar.m4, m4/wctype.m4, m4/wint_t.m4, m4/xalloc.m4, md5/Makefile.am, md5/m4/gnulib-cache.m4, md5/m4/gnulib-comp.m4, md5/m4/include_next.m4, md5/m4/md5.m4, md5/m4/stdint.m4, md5/md5.c, md5/md5.h, md5/stdint.in.h, md5/wchar.in.h: Update from Gnulib, and add the "quote" module. 2008-04-12 Rabin Vincent * src/mswindows.c (fake_fork_child): Don't create a logfile for --background when --quiet is used, but not --server-response. Fixes bug #20917. * src/utils.c (fork_to_background): Likewise. 2008-04-12 Micah Cowan * src/utils.c (aprintf): Minor formatting changes to Alex's code (80- column limit, concatenated string literals, avoiding nesting levels), and removed invocation of free (since we're aborting anyway). 2008-04-11 Micah Cowan * src/ftp.c (getftp, ftp_loop_internal): Don't append to an existing .listing when --continue is used. Fixes bug #22825. Thanks to Rabin Vincent for pointing the way with a suggested fix! 2008-04-11 Micah Cowan * doc/wget.texi : Added Julien Buty, Alexander Dergachev, and Rabin Vincent. 2008-04-11 Alexander Dergachev * src/utils.c (aprintf): Now we are setting limits (1 Mb) for text buffer when we use non-C99 vsnprintf. 2008-04-10 Micah Cowan * tests/Makefile.am, Test-proxy-auth-basic.px: Added a test for accessing password-protected URLs through a proxy. 2008-04-10 Alexander Dergachev * src/xmalloc.c, xmalloc.h (memfatal): Now exported; accepts an "unknown" value for the attempted allocation size. * src/utils.c (aprintf): Now calls memfatal, instead of aborting. 2008-03-24 Micah Cowan * doc/wget.texi : Mentioned various caveats in the behavior of accept/reject lists, deprecate current always-download-HTML feature. Added @noindent to a couple of appropriate spots. 2008-03-24 Micah Cowan * NEWS: Added documentation change re: --no-parents, and various caveats on accept/reject lists behavior. Rearranged some items in order of priority. 2008-03-20 Micah Cowan * ABOUT-NLS: Reinstated, but with a message mentioning that gettext is not included. * Makefile.am: Removed "test" target; "check" should be used instead (and "test" was mildly broken, anyway). 2008-03-19 Micah Cowan * src/utils.c (test_dir_matches_p): More tests related for dir_matches_p. 2008-03-17 Micah Cowan * src/connect.c: Include sys/time.h to support use of the select function on older systems. 2008-03-17 Micah Cowan * doc/wget.texi : Mention importance of trailing slashes to --no-parents. 2008-02-14 Micah Cowan * ABOUT-NLS: Removed. 2008-02-11 Benno Schulenberg * src/http.c: More accurate and descriptive messages for when a file won't be retrieved during spider-mode. 2008-02-10 Micah Cowan * src/http.c: Added existence_checked member to the http_stat struct. (gethttp): Mark hs->existence_checked when we've checked whether a file-to-download exists; so we don't check it again if the connection gets lost (and potentially pick a new "unique" name). This fixes bug 22251. * src/progress.c (create_image): Add space for an extra column in the "eta" portion of the progress bar image; to deal with too-long Czech translation. * src/main.c, http.c, init.c: Added --auth-no-challenge option, to bring back 1.10.2 unsafe auth behavior when needed. This fixes bug #22242. 2008-02-10 Micah Cowan * doc/wget.texi : Added documentation of --auth-no-challenge. 2008-02-10 Micah Cowan * NEWS: Added note re interrupted files resulting in renames, and new --auth-no-challenge option. 2008-02-07 Micah Cowan * src/progress.c (create_image): Remove assertion on exceeding screen width, which given the less-than-robust code there, can be broken by a number of factors (such as large file downloads). 2008-02-06 Micah Cowan * src/progress.c (countcols): Use strlen() when mbtowc or wcwidth not available (or not using NLS). * src/utils.c: Ensure we use single-byte separators when not doing NLS progress-bars. * src/wget.h: Determine whether to use NLS for progress-bars, based on whether wcwidth and mbtowc are available. 2008-02-06 Micah Cowan * doc/wget.ṫexi : Remove references to no-longer-supported socks library. 2008-02-06 Micah Cowan * configure.ac (AC_CHECK_FUNCS): Added check for mbtowc. * NEWS: Added notes regarding fixes for the localized progress bar and --no-clobber wasted GET request. * po/be.po: Added from the TP. 2008-02-03 Micah Cowan * src/progress.c (create_image): Use number of characters/columns consumed, rather than number of bytes, to determine how much of a line we've used. Fixes assertion errors and field alignment bugs (#22161, #20481) (get_eta, count_cols): Added to support the changes for create_image. * src/http.c (http_loop): Put no-clobber logic back into http_loop, before starting to fetch, for when we're not doing content-disposition. 2008-02-03 Micah Cowan * configure.in: Add checks for wchar.h, wcwidth function (to support column-counting in progress.c). * NEWS: Added line for 1.11.1. * util/README, util/Makefile.am, util/trunc.c: Added a small utility program to create files of arbitrary size (useful for testing certain situations with --continue). 2008-01-31 Micah Cowan * util/README, util/dist-wget, util/download-netscape.html, util/download.html, util/update_po_files.sh, util/wget.spec: Removed (obsolete and/or incomplete). * Makefile.am: Removed no-longer-existant util stuff from extra_DIST (but added the README). 2008-01-31 Micah Cowan * src/http.c (gethttp): Don't derive hs->contlen from possibly invalid/missing Content-Length; instead, get the appropriate value from the Content-Range header values. (parse_content_range): Handle '*' instance-length field. 2008-01-31 Micah Cowan * doc/wget.texi: Ensure that license info appears in the info version of the manual. 2008-01-28 Micah Cowan * po/en@quot.po, po/en@boldquot.po, po/en_US.po: Updated translations for copyright year in --version. * po/Rules-quot: Make en@*-update should create wget.pot. * configure.ac: Ensure that en_US appears in ALL_LINGUAS exactly once. 2008-01-25 Micah Cowan * tests/Makefile.am: Updated copyright year. 2008-01-25 Micah Cowan * src/main.c: Added notes to translators regarding (C), diacritics in names. * src/Makefile.am, cmpt.c, connect.c, connect.h, convert.c, cookies.c, cookies.h, ftp-basic.c, ftp-ls.c, ftp-opie.c, ftp.c, ftp.h, gen-md5.c, gen-md5.h, gnutls.c, hash.c, hash.h, host.c, host.h, html-parse.c, html-parse.h, html-url.c, http-ntlm.c, http-ntlm.h, http.c, http.h, init.c, init.h, log.c, log.h, main.c, mswindows.c, mswindows.h, netrc.c, netrc.h, openssl.c, options.h, progress.c, progress.h, ptimer.c, ptimer.h, recur.c, recur.h, res.c, res.h, retr.c, retr.h, spider.c, spider.h, ssl.h, sysdep.h, test.c, test.h, url.c, url.h, utils.c, utils.h, wget.h, xmalloc.c, xmalloc.h: Updated copyright year. 2008-01-25 Micah Cowan * msdos/config.h: Updated copyright year. 2008-01-25 Micah Cowan * Makefile.am, NEWS, README, configure.ac, configure.bat, m4/wget.m4, po/POTFILES.in, util/Makefile.am, util/dist-wget, util/rmold.pl, files: Updated copyright year. 2008-01-25 Micah Cowan * doc/Makefile.am, wget.texi: Updated copyright year. 2008-01-24 Micah Cowan * configure.ac: Added en_US LINGUA (generated). * po/Rules-quot: Added rule to copy en_US.po from en@quot.po. * po/boldquot.sed, po/quot.sed: Translate _all_ apostrophes we find, not just the ones used for quotes; and add rules to use the copyight symbol, and write Hrvoje's last name properly. ^_^ * po/en@quot.po, po/en@boldquot.po: Updated by new rules. * po/en_US.po: Added. 2008-01-23 Micah Cowan * tests/Makefile.am: Add libmd5 to unit-tests. 2007-12-10 Micah Cowan * src/main.c: The option is --content-disposition, not --no-content-disposition (at the moment). 2007-12-10 Micah Cowan * doc/wget.texi: Document the --content-disposition option (and not just the .wgetrc setting). 2007-12-10 Micah Cowan * NEWS: Removed developer-only notices (Autoconf, TODO, PATCHES, GNUTLS). 2007-12-08 Hrvoje Niksic * src/ftp.c (ftp_retrieve_glob): Print both arguments of fnmatch in fnmatch error message. (ftp_retrieve_glob): Don't match with fnmatch if we're only supposed to get one file. 2007-12-07 Micah Cowan * src/Makefile.am: Plug in vars to include stuff from $(top_srcdir)/md5 when appropriate. 2007-12-07 Micah Cowan * lib/Makefile.am, lib/c-ctype.c, lib/c-ctype.h, lib/gettext.h, lib/stdbool.in.h, lib/unistd.in.h, m4/gnulib-cache.m4, m4/gnulib-common.m4, m4/gnulib-comp.m4, m4/unistd_h.m4: Updated from gnulib. * Makefile.am, configure.ac: Plugged in the md5/ stuff. * lib/md5.c, lib/md5.h, lib/stdint.in.h, lib/wchar.in.h, m4/longlong.m4, m4/md5.m4, m4/stdint.m4, m4/wchar.m4: Moved to md5/. * md5/Makefile.am, md5/dummy.c, md5/m4/gnulib-cache.m4, md5/m4/gnulib-common.m4, md5/m4/gnulib-comp.m4, md5/m4/gnulib-tool.m4, md5/m4/include_next.m4, md5/m4/longlong.m4, md5/m4/md5.m4, md5/m4/stdint.m4, md5/m4/wchar.m4, md5/md5.c, md5/md5.h, md5/stdint.in.h, md5/wchar.in.h: Moved/copied from lib/, m4/; updated from gnulib. * m4/ulonglong.m4: Removed (via update from gnulib). 2007-12-06 Micah Cowan * doc/wget.texi: "the the" -> "the" 2007-12-05 Micah Cowan * src/utils.c (subdir_p): Handle the case where d1 is "". * src/convert.c (convert_all_links): Don't return without deallocating timer. 2007-12-05 Micah Cowan * doc/wget.texi : Explicitly mention that --content-disposition has known issues. 2007-12-05 Micah Cowan * NEWS: Reword warnings regarding --content-disposition. 2007-11-28 Micah Cowan * tests/Makefile.am: Updated license exception for OpenSSL, per the SFLC. 2007-11-28 Micah Cowan * msdos/config.h: Updated license exception for OpenSSL, per the SFLC. 2007-11-28 Micah Cowan * src/Makefile.am, cmpt.c, connect.c, connect.h, convert.c, convert.h, cookies.c, cookies.h, ftp-basic.c, ftp-ls.c, ftp-opie.c, ftp.c, ftp.h, gen-md5.c, gen-md5.h, gnutls.c, hash.c, hash.h, host.c, host.h, html-parse.c, html-parse.h, html-url.c, http-ntlm.c, http-ntlm.h, http.c, http.h, init.c, init.h, log.c, log.h, main.c, mswindows.c, mswindows.h, netrc.c, netrc.h, openssl.c, options.h, progress.c, progress.h, ptimer.c, ptimer.h, recur.c, recur.h, res.c, res.h, retr.c, retr.h, safe-ctype.c, safe-ctype.h, spider.c, spider.h, ssl.h, sysdep.h, test.c, test.h, url.c, url.h, utils.c, utils.h, wget.h, xmalloc.c, xmalloc.h: Updated license exception for OpenSSL, per the SFLC. 2007-11-28 Micah Cowan * Makefile.am, README, autogen.sh, configure.bat, configure.in, m4/wget.m4, util/Makefile.am, util/dist-wget: Updated license exception for OpenSSL, per the SFLC. 2007-10-30 Micah Cowan * src/main.c (main): Declare argv parameter as char **argv, rather than char *const *argv. This fixes usage of getopt_long, regardless of whether getopt_long has a constified argv or not. 2007-10-23 Micah Cowan * lib/stdbool.in.h, lib/stdint.in.h: gnulib-tool --update. Includes fix for broken stdbool.h on Tru64. 2007-10-22 Micah Cowan * po/*.po: Refresh from TP and update-po. * lib/Makefile.am, m4/gnulib-cache.m4, m4/longlong.m4, m4/ulonglong.m4, maint.mk: gnulib-tool --update. Includes fix for maint.mk with old versions of gzip. 2007-10-22 Gisle Vanem * src/mswindows.c: Move INHIBIT_WRAP macro definition up with wget.h inclusion. 2007-10-18 Steven Schweda * src/sysdep.h: #include as well as , to work around a glitch on Tru64 systems. 2007-10-18 Micah Cowan * tests/Makefile.am: Add dependency for unit_tests on libgnu.a. 2007-10-18 Micah Cowan * src/Makefile.am: version.c should not be distributed. Removed config-post.h. Add version.c dependency and gnulib include path for libunittest.a. * src/sysdep.h: Got contents of config-post.h * src/config-post.h: Removed. * src/wget.h: #include "config.h". * src/alloca.c, cmpt.c, connect.c, convert.c, cookies.c, ftp-basic.c, ftp-ls.c, ftp-opie.c, ftp.c, gen-md5.c, gnutls.c, hash.c, host.c, html-parse.c, html-url.c, http-ntlm.c, http.c, init.c, log.c, main.c, mswindows.c, netrc.c, openssl.c, progress.c, ptimer.c, recur.c, res.c, retr.c, spider.c, url.c, utils.c, xmalloc.c: Use wget.h at very top, and instead of config.h. 2007-10-18 Micah Cowan * po/POTFILES.in: Removed no-longer-existing or generated files. * autogen.sh: Reinstated, in case we have to do something at some point other than autoreconf. * Makefile.am: Put autogen.sh back in EXTRA_DIST. Just in case someone needs to rebuild configure. * configure.ac: Removed config-post.h inclusion from bottom of generated config.h. 2007-10-16 Micah Cowan * README: Draw attention to wiki:PatchGuidelines. 2007-10-15 Micah Cowan * src/Makefile.am: Remove intermediary hg-id file generation, make version.c-generation more portable. 2007-10-15 Gisle Vanem * msdos/config.h: Added some HAVE_* for djgpp 2.04 and Watcom 1.7+. * msdos/Makefile.DJ: rewritten for including some files from ../lib. * msdos/Makefile.WC: Ditto. Handling source-files out-of current directory makes compliation a bit more painfull. AFAICS, one must use explicit rules. 2007-10-14 Micah Cowan * src/cmpt.c, cookies.c, ftp-basic.c, ftp-ls.c, ftp.c, hash.c, host.c, html-parse.c, html-url.c, http-ntlm.c, http.c, init.c, log.c, main.c, netrc.c, openssl.c, res.c, url.c, utils.c, wget.h: Replace uses of ISSPACE, etc with c_isspace, etc. * src/gnu-md5.c, gnu-md5.h: Removed, in deference to gnulib. * src/Makefile.am: Removed gnu-md5.h from wget_SOURCES. * src/gen-md5.c: Changed #inclusion of gnu-md5.h to md5.h (gnulib's). * src/recur.c (download_child_p): Print error if unlink of robots.txt fails. * src/main.c (main): --spider or --delete-after now implies --no-directories (thanks, Josh Williams). 2007-10-14 Micah Cowan * configure.ac: Let gnulib handle builtin MD5 functionality. * NEWS: Mention gnulib. 2007-10-14 Joshua David Williams * src/recur.c (download_child_p): Remove robots.txt if --delete-after or --spider is on. 2007-10-13 Micah Cowan * src/Makefile.am: Make version.c depend on Wget dependencies (source files, plus such things as LIBOBJS so we get things like http-ntlm.c). Removed getopt.[ch], as we're now using gnulib for these. 2007-10-13 Micah Cowan * doc/wget.texi : Replaced mention of no-longer included PATCHES file with link to relevant Wgiki page. * doc/wget.texi : Added new section. 2007-10-13 Micah Cowan * GNUMakefile, maint.mk: Added as part of the gnulib-ization. * Makefile.am: gnulib-ized. * configure.ac: gnulib-ized. Removed built-in getopt checks. 2007-10-12 Micah Cowan * PATCHES: Removed. * NEWS: Updated info about source repositories, removal of PATCHES file. 2007-10-10 Micah Cowan * src/http-ntlm.c: Include openssl/opensslv.h explicitly, instead of hoping it'll be included by accident in openssl/des.h. 2007-10-10 Micah Cowan * doc/wget.texi : Fixed "doewnloads" typo. 2007-10-09 gettextize * m4/gettext.m4: New file, from gettext-0.16.1. * m4/iconv.m4: New file, from gettext-0.16.1. * m4/lib-ld.m4: Upgrade to gettext-0.16.1. * m4/lib-link.m4: Upgrade to gettext-0.16.1. * m4/lib-prefix.m4: Upgrade to gettext-0.16.1. * m4/nls.m4: New file, from gettext-0.16.1. * m4/po.m4: New file, from gettext-0.16.1. * m4/progtest.m4: New file, from gettext-0.16.1. * po/Makefile.in.in: Upgrade to gettext-0.16.1. * po/Rules-quot: New file, from gettext-0.16.1. * po/boldquot.sed: New file, from gettext-0.16.1. * po/en@boldquot.header: New file, from gettext-0.16.1. * po/en@quot.header: New file, from gettext-0.16.1. * po/insert-header.sin: New file, from gettext-0.16.1. * po/quot.sed: New file, from gettext-0.16.1. * po/remove-potcdate.sin: New file, from gettext-0.16.1. 2007-10-09 Micah Cowan * src/gettext.h: Imported from /usr/share/gettext, fuller handling of --disable-nls. * src/wget.h: Remove logic for handling lack of NLS (now in gettext.h). * src/main.c: Use gettext's ENABLE_NLS rather than HAVE_NLS. * src/Makefile.am: added @LIBINTL@ to LIBS (though it probably belongs in LDADD, along with everything else currently assigned to LIBS). 2007-10-09 Micah Cowan * configure.in: Renamed to configure.ac * configure.ac: Renamed from configure.in. Added invocations of AM_GNU_GETTEXT, etc. Added en@quot and en@boldquot pseudo-LINGUA support. * ABOUT-NLS: Added back in (required by autoreconf :\). * Makefile.am: Added ABOUT-NLS and msdos/Makefile.WC to EXTRA_DIST. * m4/wget.m4: Removed no-longer-used NLS stuff. * Makefile.in.in: Restore previous policy of not updating .po's unless explicitly asked (via update-po). 2007-10-09 Gisle Vanem * src/mswindows.c: 'argc' and 'argv' in 'windows_main()' are no longer needed. Hence simply the prototype. Free 'exec_name' at exit. 2007-10-08 Micah Cowan * src/http.c (http_loop): Add send_head_first conditional back around code that needs it, but not around the last-modified header-parsing stuff this time. Removed no-longer-useful (was it ever?) restart_loop boolean, continuing unconditionally at end of send_head_first conditional block (if we haven't jumped out). 2007-10-08 Micah Cowan * doc/wget.texi: Credit to Ralf Wildenhues for automakifying patches. 2007-10-08 Micah Cowan * AUTHORS: Credit to Ralf Wildenhues for automakifying patches. 2007-10-05 Ralf Wildenhues * tests/WgetTest.pm.in: wget is built in the build tree. Use an absolute path to the binary. * tests/Makefile.in: Removed, replaced by Makefile.am. * tests/Makefile.am: Converted from Makefile.in. 2007-10-05 Ralf Wildenhues * po/Makefile.in.in: Since `distdir' is used now, adjust DISTFILES to the missing ChangeLog file. Add trivial targets ps, pdf, html. * Makefile.in: Removed, replaced by Makefile.am. * Makefile.am: Converted from Makefile.in. * util/Makefile.in: Removed, replaced by Makefile.am. * util/Makefile.am: Converted from Makefile.in. * configure.in: Adjust for automake support. 2007-10-05 Ralf Wildenhues * doc/Makefile.in: Removed, replaced by Makefile.am. * doc/Makefile.am: Converted from Makefile.in. 2007-10-05 Micah Cowan * config.guess, config.sub, install-sh: Update from versions found in /usr/share/automake/. * autogen.sh: Removed, in favor of just running autoreconf. 2007-10-04 Micah Cowan * src/http.c (http_loop): We've got_name if content_disposition support isn't on; make sure we continue properly in that case, even though we're not sending HEAD. * src/Makefile.in: Removed, replaced by Makefile.am. * src/Makefile.am: Converted from Makefile.in. 2007-10-03 Micah Cowan * doc/wget.texi : Cleaned up alphabetization, more consistent use of underscores. Added a description of the content_disposition wgetrc command. 2007-10-03 Micah Cowan * NEWS: Note missing functionality from GnuTLS support. Call out attention to content_disposition's experimental status. 2007-10-02 Stephen Gildea * src/connect.c (connect_to_host): Warn about host lookup failures. 2007-10-02 Micah Cowan * src/ftp.c (getftp, ftp_loop_internal), http.c (http_loop), main (main): Use datetime_str instead of time_str, for those who have potentially long-running sessions. Based on suggestions by Saso Tomat and Steven M. Schweda . * src/http.c (gethttp): Warn about host lookup failures. Adjusted from Stephen Gildea's patch. 2007-10-02 Gisle Vanem * src/ftp.c: Use "_listing" for MSDOS (".listing" is illegal). * src/url.c: Update comment for 'filechr_not_windows'. * src/utils.c: Include for 'getpid()' on Watcom. 2007-10-02 Gisle Vanem * msdos/config.h: Removed unused defines, added needed 'HAVE_*' defines. * msdos/Makefile.DJ: rewritten to be used from './src' directory. Added '-DOPENSSL_NO_KRB5' for OpenSSL build. Target is now wget.exe. * msdos/Makefile.WC: Added for building with OpenWatcom targeting 32-bit DOS (DOS32A extender). 2007-10-01 Micah Cowan * doc/wget.texi: Updated information in Mailing Lists, Reporting Bugs. Added Web Site section, and add information about Mac OS X, MS-DOS, and VMS in Portability. 2007-09-27 Micah Cowan * doc/wget.texi: Removed "for more details" from parenthesese enclosing @pxref{}s, so that texi2pod.pl knows to remove the whole reference. Made some gramattical improvements, and strengthened the recommendation to use the info manual instead. * doc/texi2pod.pl: Brought in some updates from the GCC version. Not an entire update, since a couple "fixes" there breaks stuff here. 2007-09-25 Micah Cowan * tests/Makefile.in: Use EXEEXT instead of exeext. 2007-09-25 Micah Cowan * src/Makefile.in: Use EXEEXT instead of exeext. 2007-09-25 Micah Cowan * configure.in: Remove unnecessary heuristic to generate exeext variable, since AC_PROG_CC and others automatically set EXEEXT. Pointed out by Steve Kenton . 2007-09-24 Jochen Roderburg * src/http.c (http_zero): Remove no-longer-used local_size variable. Fixes bug #21057. 2007-09-24 Gisle Vanem * src/connect.c, init.c, main.c, openssl.c, options.h, sysdep.h, url.c, utils.c: Added support for building on MS-DOS. 2007-09-24 Gisle Vanem * msdos/Makefile.DJ, config.h: Added to support building on MS-DOS via DJGPP. 2007-09-12 Micah Cowan * src/http.c (http_loop): Remove send_head_first from condition for parsing timestamp. 2007-09-12 Micah Cowan * doc/wget.texi: Expanded the description of -O. Clarified the detection of elements as "patterns" versus "suffixes" in -A, -R. Describe -p in relation to -nc. 2007-09-12 Micah Cowan * AUTHORS: Added... me... * TODO: file removed, bugtracker is authoritative source for planned changes. 2007-08-29 Micah Cowan * src/openssl.c (ssl_init): Re un-const-ified the meth local variable, to match current versions of openssl. * src/spider.c: Removed visited_url function, as it may be very inefficient. (print_broken_links): Removed traversal of referrers, until such time as a more efficient implementation can be written. * src/spider.h: Replaced declaration of visited_url with an empty-bodied, function-like macro. 2007-08-27 Micah Cowan * src/wget.h: Added macro replacement for ngettext, for environs that lack NLS. 2007-08-27 Gisle Vanem * src/mswindows.c (run_with_timeout): Ensure that the correct conversion specification is used for the return result of the GetLastError function. * src/getopt.c: Fix missing (but, accidentally, legal) comment delimiter after licensing text. * src/recur.c (retrieve_tree): Inserted missing cast for strip_auth. Includes adjustment by Ralf Wildenhues. * src/openssl.c (ssl_init): const-ified the meth local variable. * src/main.c: Include all the static function definitions in the "#ifndef TESTING" clause, leaving just the definitions for exec_name (not set), and opt. * src/utils.c (run_with_timeout): Now returns bool, to align with declaration in utils.h. 2007-08-26 Micah Cowan * src/spider.c (print_broken_links): Fixed incorrect plurals msgid usage, switched to use ngettext function. 2007-08-26 Micah Cowan * po/POTFILES.in: Added spider.c. 2007-08-24 Micah Cowan * src/http.c (http_loop): Introduced time_came_from_head boolean flag, to help avoid parsing the same Last-Modified header twice. Replaced spidering returns of RETRUNNEEDED for some situations, to RETROK, as otherwise it will be interpreted as an error. RETRUNNEEDED appears never to be referenced outside of http.c (and wget.h), and, when returned by gethttp, is translated by http_loop to RETROK. * src/url.c (are_urls_equal): Don't call getchar_from_escaped_string if u2 is shorter than u1. (getchar_from_escaped_string): Don't decode reserved characters. Handle illegally appearing '%'s as literal '%'s. Ensure hex digits before attempting to decode. (test_are_urls_equal): Added tests to handle u2 shorter than u1, and %2f not treated the same as /. * src/spider.c (in_url_list_p): Don't call are_urls_equal if one of them is NULL. 2007-08-24 Micah Cowan * po/no.po: removed; replaced by nb.po (per the translation project coordinator, Benno Schulenberg). 2007-08-23 Joshua David Williams * src/spider.c (in_url_list_p): Removed the bool verbose argument 2007-08-22 Micah Cowan * Makefile.in: Exclude .svn directories and below from distribution. 2007-08-22 Mauro Tortonesi * src/http.c (http_loop): Fall back to GET if HEAD fails with a 500 or 501 error code. 2007-08-21 Mauro Tortonesi * tests/WgetTest.pm.in: Added support for timestamping of pre-existing files. * tests/Test-N-current.px: Fixed broken test logic. * tests/Makefile.in: Updated list of automatically run tests. * tests/Test-HTTP-Content-Disposition.px: Added -e contentdisposition=on option, since now HTTP Content-Disposition header support is turned off by default. * tests/Test-HTTP-Content-Disposition-1.px: Ditto. 2007-08-21 Mauro Tortonesi * src/http.c (http_loop): Send preliminary HEAD request if -N is given and the destination file exists already. 2007-08-10 Ralf Wildenhues * src/url.c (append_uri_pathel): Do not assume dest string to be zero-terminated. (test_append_uri_pathel): Terminate string to fix test failure. 2007-08-10 Mauro Tortonesi * tests/Test--spider--no-content-disposition-trivial.px: Added new tests for validation of HTTP Content-Disposition header support logic. In particular, these tests check wget's behavior for every combination of --spider [-r] and -e contentdisposition=on/off options. * tests/Test--spider-r-HTTP-Content-Disposition.px: Ditto. * tests/Test--spider-HTTP-Content-Disposition.px: Ditto. * tests/Test--spider--no-content-disposition.px: Ditto. * tests/Test--spider-r--no-content-disposition-trivial.px: Ditto. * tests/Test--spider-r--no-content-disposition.px: Ditto. 2007-08-10 Mauro Tortonesi * src/http.c (http_loop): Fixed HTTP HEAD requests logic when --spider is given. 2007-08-09 Ralf Wildenhues * src/url.c (url_string): Use comparison, not assignment, in check for auth_mode == URL_AUTH_HIDE_PASSWD. 2007-08-09 Ralf Wildenhues * m4/wget.m4 (WGET_PROCESS_PO, AM_PATH_PROG_WITH_TEST): Add missing M4 quotation. Delete serial number. 2007-08-09 Micah Cowan * src/http.c (http_loop): If we got a HEAD and then a GET, and the GET had a timestamp, use that one, not any we may have gotten from the HEAD. 2007-08-09 Micah Cowan * NEWS: Timestamping from most recent response. 2007-08-08 Micah Cowan * src/init.c (defaults): Content disposition will not be default, since it currently results in extra round-trips. 2007-08-08 Micah Cowan * NEWS: Call attention to the fact that Content-Disposition is not enabled by default. 2007-08-07 Micah Cowan * configure.in: Fix --with-libssl-prefix failure by replacing usage of sh "if" statement with "AS_IF" macros, to force AC_REQUIRE'd macros to be expanded before the conditional statement body. * NEWS: Note that configure.in now requires autoconf >= 2.61, to support AS_IF and its expansion of AC_REQUIREs. 2007-07-31 Micah Cowan * src/http.c (gethttp): Set contlen = -1 when we encounter a negative-valued Content-Length header, so we don't consider it an internal error later on and call abort(). 2007-07-29 Micah Cowan * src/url.h, url.c (url_string): Replaced bool arg of the url_string function with enum url_auth_mode, with added option to completely remove user/pass auth information. * src/http.c, ftp.c, url.c, recur.c: Adapted call to url_string function to fit new usage. * src/recur.c (retrieve_tree): Remove auth info from Referer header. 2007-07-29 Micah Cowan * NEWS: No more auth before challenge. No more auth info in Referer. New --max-redirect option. 2007-07-28 Micah Cowan * src/options.h, init.c, retr.c, main.c: renamed opt maxredirect field to max_redirect, for improved consistency. * src/init.c: changed max_redirect parser from cmd_number_inf to cmd_number, as infinite redirects may not be appropriate. Alternatively, if cmd_number_inf should be used, then opt.max_redirect's value should be checked a bit differently in retr.c, to allow for the "infinite" meaning of zero. 2007-07-28 Micah Cowan * doc/wget.texi : Added --max-redirect option. 2007-07-25 Micah Cowan * tests/HTTPServer.pm (run, send_response): Farmed out some logic from the run method into a separate one named send_response, which was then modified to handle simple authentication testing. (handle_auth): Added to handle simple authentication testing. (verify_auth_basic): Checks to make sure Basic credentials are valid. (verify_auth_digest): Stub added; always fails test. * tests/Makefile.in: Added Test-auth-basic.px to list of automatically run tests. * tests/Test-auth-basic: Simple basic authentication test; mainly just lets the server do its testing. Its current purpose is just to ensure that correct basic creds are sent, but never until a challenge has been sent. 2007-07-25 Micah Cowan * src/http.c (create_authorization_line) (basic_authentication_encode, known_authentication_scheme_p) (load_cookies): Moved declarations up. (basic_authed_hosts): Added. Tracks what hosts have issued Basic challenge and been given the global username, password. (maybe_send_basic_creds): Added. Sends Basic creds for hosts that have issued Basic challenges. (register_basic_auth_host): Added. Instantiates basic_authed_hosts if necessary, then registers the host that has issued a challenge. (gethttp) : Only send authentication credentials after we've received a challenge from that host. This is a stop-gap fix until a proper fix can be implemented; still isn't quite right, as we should only be sending credentials automatically for authenticated paths and below, and not for the entire host. 2007-07-16 Joshua David Williams * src/test.h: tests made more verbose; now displays the name of each test run. 2007-07-16 Joshua David Williams * src/options.h: added maxredirect to options struct * src/init.c: added maxredirect to list of variables * src/retr.c (retrieve_url): replaced MAX_REDIRECTIONS with opt.maxredirect * src/main.c: added option --max-redirect 2007-07-10 Mauro Tortonesi * tests/Test--no-content-disposition.px: Added new tests for validation of HTTP Content-Disposition header support logic. In particular, these tests check wget's behavior for every combination of -N/-O and -e contentdisposition=on/off options. * tests/Test--no-content-disposition-trivial.px: Ditto. * tests/Test-N-HTTP-Content-Disposition.px: Ditto. * tests/Test-N--no-content-disposition.px: Ditto. * tests/Test-N--no-content-disposition-trivial.px: Ditto. * tests/Test-O-HTTP-Content-Disposition.px: Ditto. * tests/Test-O--no-content-disposition.px: Ditto. * tests/Test-O--no-content-disposition-trivial.px: Ditto. 2007-07-10 Mauro Tortonesi * src/http.c (http_loop): Fixed the HTTP requests logic. Now it skips the preliminary HEAD request if either -O or --no-content-disposition are given, and neither --spider and -N are given. 2007-07-09 Micah Cowan * README, util/wget.spec: Removed references to wget.sunsite.dk. 2007-07-05 Micah Cowan * tests/Makefile.in: Updated GPL reference to version 3 or later, removed FSF address. 2007-07-05 Micah Cowan * src/cmpt.c, connect.c, connect.h, convert.c, convert.h: * src/cookies.c, cookies.h, ftp-basic.c, ftp.c, ftp.h, ftp-ls.c: * src/ftp-opie.c, gen-md5.c, gen-md5.h, getopt.c, getopt.h, gnu-md5.c: * src/gnu-md5.h, gnutls.c, hash.c, hash.h, host.c, host.h: * src/html-parse.c, html-parse.h, html-url.c, http.c, http.h: * src/http-ntlm.c, http-ntlm.h, init.c, init.h, log.c, log.h, main.c: * src/Makefile.in, mswindows.c, mswindows.h, netrc.c, netrc.h: * src/openssl.c, options.h, progress.c, progress.h, ptimer.c: * src/ptimer.h, recur.c, recur.h, res.c, res.h, retr.c, retr.h: * src/safe-ctype.c, safe-ctype.h, spider.c, spider.h, ssl.h, sysdep.h: * src/test.c, test.h, url.c, url.h, utils.c, utils.h, wget.h: * src/xmalloc.c, xmalloc.h: Updated GPL reference to version 3 or later, removed FSF address. 2007-07-05 Micah Cowan * doc/fdl.texi: Changed to match the version in gnulib. * doc/Makefile.in: * doc/texi2pod.pl: * doc/texinfo.tex: Updated GPL reference to version 3 or later, removed FSF address. * doc/wget.texi: Slightly reworded the FDL license invocation. Replaced the maintainer reference. Removed the GPL text from the manual. * doc/gpl.texi: Removed due to discontinuation of reference in Wget manual. 2007-07-05 Micah Cowan * AUTHORS: Draw attention to previous maintainers. * autogen.sh, config.guess, config.sub, configure.bat: * configure.in, m4/wget.m4, Makefile.in, util/dist-wget: * util/Makefile.in, util/rmold.pl: Updated GPL reference to version 3 or later, removed FSF address. * README: Updated reference to maintainer, and updated GPL reference to version 3 or later. * COPYING: Replaced with verson 3. 2007-07-04 Mauro Tortonesi * src/http.c (http_loop): Skip HEAD request and start immediately with GET if -O is given. 2007-06-14 Mauro Tortonesi * tests/FTPServer.pm: Added FTP testing support. * tests/FTPTest.pm: Ditto. * tests/Test-ftp.px: Ditto. 2007-02-02 Hrvoje Niksic * src/netrc.c: Don't make netrc_list static, as it prevents compilation with DEBUG_MALLOC. * src/utils.c (aprintf): Don't use vasprintf when DEBUG_MALLOC is requested because, in that case, we want the calls to malloc to be coming from us. 2007-02-02 Hrvoje Niksic * src/http.c (print_server_response): Escape non-printable characters in server respone. 2007-01-23 Hrvoje Niksic * src/progress.c (create_image): Check for ETA overflow. (print_row_stats): Ditto. 2007-01-23 Hrvoje Niksic * src/cookies.c (parse_set_cookie): Would erroneously discard cookies with unparsable expiry time. 2007-01-09 Steven M. Schweda * src/ftp-basic.c (ftp_syst): Fixed segfault if response text is missing. 2007-01-09 Mauro Tortonesi * src/init.c (cmd_spec_prefer_family): Small fix to get rid of a gcc warning about strict-aliasing violation. 2006-12-29 Gisle Vanem * src/mswindows.c: Avoid a warning if 'ws_hangup()' is unused. 2006-12-27 Mauro Tortonesi * src/http.c (parse_content_disposition): Consider directory prefix, if specified. 2006-12-22 Mauro Tortonesi * tests/HTTPTest.pm: Don't ignore initial '/' character in requested URLs. 2006-11-21 Hrvoje Niksic * src/retr.c (retrieve_from_file): Ditto. (url_uses_proxy): New function. * src/main.c (main): Don't check for opt.use_proxy when deciding whether to call retrieve_url or retrieve_tree; check whether the proxy would be used for *this* URL. 2006-11-10 Mauro Tortonesi * tests/Test-np.px: Added test for -np. * tests/HTTPTest.pm: Ignore initial '/' character in requested URLs. 2006-10-17 Mike Grant * src/ftp.c (ftp_loop_internal): Would incorrectly skip changing working directory when retrying after a failed FTP attempt. Originally reported by Nate Eldredge. 2006-10-12 Mauro Tortonesi * tests/Test1.px: Renamed to Test-noop.px. * tests/Test-noop.px: Ditto. * tests/Test2.px: Renamed to Test-N.px. * tests/Test-N.px: Ditto. * tests/Test3.px: Renamed to Test-nonexisting-quiet.px. * tests/Test-nonexisting-quiet.px: Ditto. * tests/Test4.px: Renamed to Test-O-nonexisting.px. * tests/Test-O-nonexisting.px: Ditto. * tests/Test5.px: Renamed to Test-HTTP-Content-Disposition.px. * tests/Test-HTTP-Content-Disposition.px: Ditto. * tests/Test6.px: Renamed to Test-HTTP-Content-Disposition-1.px. * tests/Test-HTTP-Content-Disposition-1.px: Ditto. * tests/Test7.px: Renamed to Test-HTTP-Content-Disposition-2.px. * tests/Test-HTTP-Content-Disposition-2.px: Ditto. * tests/Test8.px: Replaced by Test--spider-r.px. * tests/Test9.px: Renamed to Test-Restrict-Lowercase.px. * tests/Test-Restrict-Lowercase.px: Ditto. * tests/Test10.px: Renamed to Test-Restrict-Uppercase.px. * tests/Test-Restrict-Uppercase.px: Ditto. * tests/Test--spider.px: Added test for spider mode. * tests/Test--spider-fail.px: Added failing test for spider mode. * tests/Test--spider-r.px: Added test for recursive spider mode. * tests/Test-c.px: Added test for --continue mode. * tests/Test-c-full.px: Added test for --continue mode. * tests/Test-c-partial.px: Added test for --continue mode. * tests/Test-O.px: Added test for -O. * tests/Test-N-current.px: Added test for -N. * tests/Test-N-old.px: Added test for -N. * tests/Test-E-k.px: Added test for -E -k. * tests/Test-E-k-K.px: Added test for -E -k -K. 2006-10-12 Mauro Tortonesi * src/convert.c (downloaded_file): Fixed bug which used to break -E -k -K mode. 2006-08-28 Noèl Köthe * Makefile.in: Fixed a DESTDIR-related bug. 2006-08-28 Mauro Tortonesi * src/http.c: #include'd spider.h to get rid of compiler warnings. * src/main.c: Ditto. * src/recur.c: Ditto. 2006-08-24 Mauro Tortonesi * src/Makefile.in: Added spider.c to the list of files to compile and spider.h to the list of header files. Updated copyright information. * src/http.c: Major changes to recursive spider mode. Now for every resource we are supposed to check, we send a HEAD request to find out if it exists. If the resource is a HTML file, we retrieve it and parse it to discover links to other resources. * src/recur.c: Ditto. * src/res.c (res_retrieve_file): Disable opt.timestamping and opt.spider when retrieving robots.txt. Updated copyright information. * src/convert.c: Moved code tracking broken links to spider.c. * src/convert.h: Ditto. * src/spider.c: Created new file to keep track of visited URLs in spider mode. * src/spider.h: Ditto. 2006-08-21 Mauro Tortonesi * src/http.c: Fixed timestamping-related bug. 2006-08-17 Mauro Tortonesi * tests/HTTPServer.pm: Added support for Range header. 2006-08-16 Mauro Tortonesi * src/http.c: Fixed bug which broke --continue feature. Now if -c is given, http_loop sends a HEAD request to find out the destination filename before resuming download. 2006-08-08 Hrvoje Niksic * src/utils.c (datetime_str): Avoid code repetition with time_str. 2006-07-21 Hrvoje Niksic * src/init.c (commands): Correctly place "contentdisposition". 2006-07-17 Daniel Richard G. * Makefile.in: Added DESTDIR='$(DESTDIR)' to MAKEDEFS. 2006-07-14 Mauro Tortonesi * tests/Test4.px: Fixed wrong expected behaviour. 2006-07-14 Mauro Tortonesi * src/sysdep.h: If intptr_t isn't defined, simply typedef it to long. * src/http.c: Added explicit cast to int in logprintf call to remove compiler warnings on 64-bit platforms. * src/connect.c: Added a few casts to intptr_t to remove compiler warnings on 64-bit platforms. * src/main.c: Disable -r, -p and -N when -O is used. Disable -k when -O is used and multiple URLs are given. Update maintainer information. * src/all: Update copyright information. 2006-07-14 Mauro Tortonesi * configure.in: Check for intptr_t. 2006-07-10 Mauro Tortonesi * doc/wget.texi: Fixed rendering of --no-proxy description in the man page. Added information about current maintainer. 2006-07-10 KJKHyperion * src/url.c (filechr_table): Mark DEL (0x7f) as a control character and | as a character Windows can't handle. 2006-06-28 Mauro Tortonesi * src/res.c: Implemented is_robots_txt_url function for detection of robots.txt URLs and related test routine. * src/res.h: Ditto. * src/url.c: Implemented are_urls_equal function for URL comparison and related testing routine. * src/url.h: Ditto. * src/convert.c: Fixes for recursive spider mode: don't consider non-existing robots.txt as a broken link, and use are_urls_equal instead of strcasecmp for referrer URLs comparison. * src/test.c: Call tests routines for are_urls_equal and is_robots_txt_url. 2006-06-28 Mauro Tortonesi * doc/wget.texi: Removed invariant status to the GPL and GFDL sections. Changed UPDATED to Jun 2006. Updated copyright notice to include 2006. 2006-06-27 Hrvoje Niksic * configure.in: We're no longer using strtoimax. 2006-06-26 Hrvoje Niksic * src/wget.h (wgint): Typedef to any 64-bit (or larger) type we can find, not necessarily off_t or long. 2006-06-26 Hrvoje Niksic * src/cmpt.c (strtoll): Check for overflow and underflow without relying on (technically) undefined behavior. Don't assume that strtoll_type is 64 bits wide. 2006-06-26 Hrvoje Niksic * doc/wget.texi (Recursive Accept/Reject Options): Document --ignore-case. 2006-06-21 Hrvoje Niksic * src/utils.c (base64_encode): Cast void pointer to char * before doing arithmetic. 2006-06-20 Hrvoje Niksic * src/utils.c (base64_encode): Made TBL const. (base64_decode): Made the base64_char_to_value table const. 2006-06-20 Hrvoje Niksic * doc/wget.texi (Download Options): Add missing word. Reported by Adrian Knoth. 2006-06-19 Hrvoje Niksic * src/utils.c (base64_encode): Would read past end of STR. Reported by rick@eckle.org. 2006-06-19 Hrvoje Niksic * src/utils.c (base64_encode): Made the DATA pointer void * so the callers can pass it any kind of pointer (including both signed and unsigned char pointers). (base64_decode): Ditto for DEST. 2006-06-13 Mauro Tortonesi * tests/Test9.px: Added test for --restrict-file-names=lowercase option. * tests/Test10.px: Added test for --restrict-file-names=uppercase option. 2006-06-13 Mauro Tortonesi * src/options.h (struct options): Introduced member restrict_files_case to keep track of preferences on character case restrictions for filenames. * src/init.c: Modified defaults and cmd_spec_restrict_file_names to support character case restrictions for filenames. Added test_cmd_spec_restrict_file_names unit test. * src/url.c: Modified append_uri_pathel to support character case restrictions for filenames. Added test_append_uri_pathel unit test. * src/test.c: Added test_cmd_spec_restrict_file_names and test_append_uri_pathel to the list of unit tests to run. 2006-06-12 Tony Lewis * src/main.c: Improved CHEN Peng's patch by proposing a simpler logic. 2006-06-12 Mauro Tortonesi * src/retr.c (retrieve_from_file): Use retrieve_tree and automatically turn on opt.follow_ftp in case of recursive FTP retrieval through HTTP proxy. * src/main.c: Automatically turn on opt.follow_ftp in case of recursive FTP retrieval through HTTP proxy. 2006-06-12 CHEN Peng * src/main.c: Use retrieve_tree in case of recursive FTP retrieval through HTTP proxy. 2006-05-26 Mauro Tortonesi * tests/HTTPServer.pm: Fixed bug when returning 404. Improved logging. * tests/Test.pm: Added support for command lines which use an absolute path for the Wget binary. 2006-05-26 Mauro Tortonesi * tests/HTTPServer.pm: Added synchronization between client and server processes to prevent the test to start before the server is ready. * tests/HTTPTest.pm: Ditto. * tests/Test.pm: Ditto. * tests/Test1.px: Removed unneeded ../src/ from command line. * tests/Test2.px: Ditto. * tests/Test3.px: Ditto. * tests/Test4.px: Ditto. * tests/Test5.px: Ditto. * tests/Test6.px: Ditto. * tests/Test7.px: Ditto. * tests/Test8.px: Added test for recursive spider mode. 2006-05-25 Mauro Tortonesi * src/convert.c: Added mechanisms to keep track broken links. * src/convert.h: Ditto. * src/wget.h: Reordered and enumerated uerr_t constants. * src/recur.c: Fixes to support recursive spider mode. * src/http.c: Ditto. * src/main.c: Print broken links in case of recursive spider mode. * src/retr.c: Changed interface of retrieve_url. * src/retr.h: Ditto. * src/ftp.c: Changed interface of ftp_loop. * src/ftp.h: Ditto. * src/res.c: Minor change to reflect changes in interface of retrieve_url. 2006-05-18 Lawrence Jones * src/ftp-ls.c (ftp_parse_unix_ls): Correct size parsing, add size and filename debugging output. 2006-04-28 Mauro Tortonesi * tests/Test5.px: Added test for HTTP Content-Disposition support. * tests/Test6.px: Ditto. * tests/Test7.px: Ditto. 2006-04-28 Mauro Tortonesi * src/http.c: If Content-Disposition header is present, allow unique filename generation unless -nc is given. Permit to disable parsing of Content-Disposition header. * src/options.h: Added option --no-content-disposition to disable parsing of HTTP Content-Disposition header. * src/init.c: Ditto. * src/main.c: Ditto. 2006-04-27 Mauro Tortonesi * tests/HTTPServer.pm: Serve index.html if no filename is given. * tests/Test.pm: Added support for pre-existing files. 2006-04-11 Hrvoje Niksic * src/hash.c (TOLOWER): Wrap macro arg in parentheses. 2006-04-08 Hrvoje Niksic * src/http.c (parse_content_disposition): Doc fix. 2006-03-15 Mauro Tortonesi * src/utils.c: Restricted operational semantics of frontcmp and proclist from generic strings to directory names and them to subdir_p and dir_matches_p respectively. Applied George Ogata's one line patch to restrict algorithm of subdir_p to full directory name matching. Added testcases for subdir_p and dir_matches_p. * src/utils.h: Changed all frontcmp occurrences to subdir_p. * src/recur.c: Ditto. * src/test.c: Changed type returned by test functions from char * to const char *. Added test_subdir_p and test_dir_matches_p to the list of tests to run. * src/http.c (test_parse_content_disposition): Changed return type from char * to const char *. 2006-03-14 Mauro Tortonesi * src/recur.c (struct queue_element): Changed type of html_allowed member to bool. 2006-03-09 Mauro Tortonesi * src/ftp.c (ftp_list): Try `LIST -a' command first and revert to `LIST' in case of failure. 2006-03-06 Hrvoje Niksic * src/hash.c (TOLOWER): Fix definition when STANDALONE. Reported by Beni Serfaty. 2006-03-02 Mauro Tortonesi * src/http.c (http_loop): Fixed recursive HTTP retrieval. 2006-02-28 Hrvoje Niksic * src/url.c (find_last_char): Define in terms of memrchr. * src/cmpt.c (memrchr): Define it on systems that don't have it. * src/http.c (extract_param): New function for parsing header values with parameters. (parse_content_disposition): Use it. Don't allow slashes and backslashes in the file name. 2006-02-28 Hrvoje Niksic * src/http.c (extract_param): Declare extern so it can be used from other files. (extract_param): Return error for empty name. 2006-02-28 Hrvoje Niksic * configure.in: Check for memrchr. 2006-02-27 Hrvoje Niksic * src/url.c (path_simplify): Don't preserve ".." at beginning of path. Suggested by Frank McCown. 2006-02-25 Hrvoje Niksic * src/http.c (gethttp): Only use FILE.N.html if FILE.html exists. 2006-02-09 Hrvoje Niksic * src/mswindows.c (run_with_timeout): Made thread_hnd non-static. 2006-02-05 Hrvoje Niksic * src/retr.c (sleep_between_retrievals): Sleep at a minimum of 1/2 of the specified wait period. 2006-02-05 Hrvoje Niksic * doc/wget.texi (Download Options): Document the modified meaning of --random-wait. 2006-02-05 Hrvoje Niksic (Download Options): Changed "a recent article" to "a 2001 article" in the description of --random-wait, since the article in question is not really recent. 2006-02-03 Mauro Tortonesi * src/http.c: Fixed support for Content-Disposition header. * src/test.c: Added test_parse_content_disposition to the list of unit tests to run. 2006-02-03 Hrvoje Niksic * src/utils.c (number_to_string): Don't use sprintf for printing WGINT_MIN; simply divide n by 10 and defer printing the last digit. (number_to_string): Removed the SPRINTF_WGINT macro. 2006-02-02 Hrvoje Niksic * src/hash.c: Don't define countof if it's already defined. * src/hash.c: Obtain the definition of uintptr_t when standalone. 2006-01-30 Mauro Tortonesi * src/http.c: Changed output format. Removed excessively verbose debugging output. 2006-01-24 Mauro Tortonesi * tests/HTTPServer.pm: Enhanced logging support. * tests/HTTPTest.pm: Updated to new test format. * tests/Test.pm: Improved test setup, verification and cleanup. Major refactoring. * tests/Test1.px: Updated to new test format. * tests/Test2.px: Updated to new test format. * tests/Test3.px: Added new test for quiet download of nonexistent URL. * tests/Test4.px: Added new test for quiet download of nonexistent URL with --output-document option. 2005-12-07 Mauro Tortonesi * src/http.c: Fixed pre-download verbose output which was broken by HTTP code refactoring. 2005-12-05 Mauro Tortonesi * tests/HTTPServer.pm: Refactored as a subclass of HTTP::Daemon. Removed the old run method and renamed the old run_daemon method to run. Added support for partial * tests/Testing.pm: Renamed to HTTPTest.pm. * tests/HTTPTest.pm: Refactored as a subclass of Test. Renamed Run_HTTP_Test to run, verify_download to _verify_download and added support for timestamp checking. * tests/Test.pm: Added Test class as the super class of every testcase. * tests/test1: Renamed to Test1.px. * tests/Test1.px: Refactored as an instance of the HTTPTest class. * tests/Test2.px: Added -N HTTP test. 2005-11-23 Mauro Tortonesi * src/http.c: Refactored HTTP code. If -O is not used, the new code delays the choice of the file name where the downloaded resource will be saved until the HTTP headers have been retrieved. Added support for Content-Disposition header. 2005-11-19 Hrvoje Niksic * src/hash.c (INVALID_PTR): Use uintptr_t instead of unsigned long. (hash_pointer): Don't assume a pointer fits in `unsigned long'. 2005-11-19 Hrvoje Niksic * configure.in: Check for uintptr_t. 2005-11-15 Hrvoje Niksic * doc/wget.texi: Document https_proxy. 2005-11-02 Mauro Tortonesi * tests/HTTPServer.pm: Added basic support for HTTP testing. * tests/Testing.pm: Added basic support for feature testing (only HTTP testing is supported at the moment). * tests/test1: Added basic HTTP test. 2005-11-02 Mauro Tortonesi * src/Makefile.in: Removed support for unit testing (now it is in tests/Makefile.in). 2005-11-02 Mauro Tortonesi * Makefile.in: Improved support for unit testing. * configure.in: Ditto. 2005-10-27 Mauro Tortonesi * src/Makefile.in: Added basic support for unit testing. * src/test.c: Ditto. * src/test.h: Ditto. 2005-10-27 Mauro Tortonesi * Makefile.in: Added basic support for unit testing. 2005-10-13 Daniel Stenberg * src/http-ntlm.c (ntlm_output): Fixed buffer overflow vulnerability. 2005-10-09 Russ Allbery * src/snprintf.c: Remove round to round_int and pow10 to pow10_int, to avoid warnings from GCC 4.0. 2005-10-05 Mauro Tortonesi * src/retr.c: Changed semantics of no_proxy_match. 2005-09-17 Hrvoje Niksic * src/retr.c (retr_rate): Rename parameter from MSECS to SECS since it no longer holds milliseconds. 2005-09-17 Hrvoje Niksic * src/main.c (main): Don't print the summary if nothing has been downloaded. 2005-09-02 Hrvoje Niksic * doc/sample.wgetrc: Rewrite the "passive FTP" paragraph to better reflect reality. 2005-09-01 Hrvoje Niksic * src/progress.c: Introduce symbolic constants for "magic" values of 0.2 and 0.9, REFRESH_INTERVAL and ETA_REFRESH_INTERVAL. 2005-08-27 Hrvoje Niksic * src/hash.c (hash_table_map): Rename to hash_table_for_each and update callers. Document the meaning of the callback's return value. (hash_table_iterate): New function. (hash_table_iter_next): Likewise. Update most places that used hash_table_for_each to use the iteration, which doesn't require a temporary function with explicit state management. 2005-08-27 Hrvoje Niksic * src/cmpt.c (strtoll): Correctly handle strtoll("0x", ptr, 0) and strtoll("0x", ptr, 0) -- in both cases *ptr must be set to the position of 'x', not after it. 2005-08-26 Stepan Kasal * configure.in: Abort configure if --with-ssl given but SSL unavailable. Use HAVE_LIBSSL and HAVE_LIBGNUTLS symbols provided by AC_LIB_HAVE_LINKFLAGS instead of inventing new ones. 2005-08-26 Jeremy Shapiro * src/openssl.c (ssl_init): Set SSL_MODE_AUTO_RETRY. 2005-08-26 Albert Chin * src/Makefile.in: Use @datadir@. Define localedir as $(datadir)/locale. 2005-08-23 Hrvoje Niksic * src/host.c (address_list_from_ipv4_addresses): Use IP_INADDR_DATA. 2005-08-12 Hrvoje Niksic * src/wget.h: Renamed strtoll_return to strtoll_type. 2005-08-11 Hrvoje Niksic * src/progress.c (eta_to_human_short): Switch to days when printing more than 48h rather than 100h. (It's not immediately apparent how many days there are in 83h.) 2005-08-11 Hrvoje Niksic * src/cmpt.c (strtoll): Define it if missing on the system and if Wget needs it. * src/mswindows.c (str_to_int64): Move to cmpt.c and rename to strtoll. 2005-08-11 Hrvoje Niksic * configure.in: Check for strtoll and strtoimax. 2005-08-10 Hrvoje Niksic * src/host.c (print_address): Always use inet_ntop when IPv6 is enabled. * src/host.h (ip_address): Simplify the data union. 2005-08-09 Vasil Dimov * src/ftp.c (getftp): Don't free RESPLINE if ftp_response returns a status other than FTPOK. 2005-08-09 Hrvoje Niksic * src/mswindows.c (inet_ntop): Also handle IPv4 addresses for completeness. 2005-08-09 Hrvoje Niksic * src/http.c (gethttp): Don't read more than the amount of data specified by the content-length header. 2005-08-09 Hrvoje Niksic * doc/wget.texi (Wgetrc Commands): Removed documentation for the now deleted command "kill_longer". 2005-08-04 Giuseppe Bonacci * src/ftp-ls.c (ftp_parse_unix_ls): Remember the position of the previous token instead of backtracking back to it. 2005-07-08 Hrvoje Niksic * src/utils.c (aprintf): Use vasprintf where available. 2005-07-08 Hrvoje Niksic * src/url.c (rewrite_shorthand_url): Simplify code using aprintf and strspn. 2005-07-08 Hrvoje Niksic * configure.in: Remove -Wno-implicit from default GCC warning flags. 2005-07-08 Hrvoje Niksic * configure.in: Don't check for symlink, which is expected to exist. Check for asprintf. 2005-07-08 Gisle Vanem * src/mswindows.h: Include process.h to get getpid() declaration. 2005-07-07 Hrvoje Niksic * src/openssl.c (ssl_check_certificate): Print custom error messages for frequent X509 certificate problems. 2005-07-07 Hrvoje Niksic * src/mswindows.h: Define an alias for stat and fstat, as requested by config-compiler.h. (gai_strerror): Define to windows_strerror if NEED_GAI_STRERROR is defined. 2005-07-07 Hrvoje Niksic * src/gnutls.c (ssl_check_certificate): Check for the validity of the presented X509 certificate. 2005-07-07 Hrvoje Niksic * configure.bat: Copy the common config.h and config-compiler.h. 2005-07-06 Hrvoje Niksic * src/utils.c: Unconditionally include . 2005-07-06 Hrvoje Niksic * src/utils.c (fnmatch_nocase): New function. (proclist): Use it instead of fnmatch when opt.ignore_case is requested. (in_acclist): Respect opt.ignore_case. (frontcmp): Respect opt.ignore_case. * src/options.h (struct options): New flag opt.ignore_case. 2005-07-06 Hrvoje Niksic * src/sysdep.h: Add a full declaration of fnmatch.h. 2005-07-06 Hrvoje Niksic * src/ptimer.c: Measure time in seconds rather than milliseconds. Adjusted all callers. 2005-07-06 Hrvoje Niksic * src/mswindows.h: Use strtoll where available. 2005-07-06 Hrvoje Niksic * src/http.c (gethttp): When freeing MESSAGE, take into account that it can be NULL. 2005-07-06 Hrvoje Niksic * configure.in: Don't check for setjmp.h. 2005-07-06 Hrvoje Niksic * Makefile.in: Also use @LIBGNUTLS@ to build LIBS. 2005-07-05 Hrvoje Niksic * src/wget.h (or): Define HAVE_SSL when either HAVE_OPENSSL or HAVE_GNUTLS are defined. * src/gnutls.c: New file. 2005-07-05 Hrvoje Niksic * src/http.c (gethttp): Don't print the request write error message twice. 2005-07-05 Hrvoje Niksic * src/cmpt.c (timegm): Remove unused variable. 2005-07-05 Hrvoje Niksic * src/cmpt.c (timegm): Handle years after 2099. 2005-07-05 Hrvoje Niksic * src/cmpt.c (timegm): Don't call mktime; simply count the seconds between 1970-01-01 and the specified date. 2005-07-05 Hrvoje Niksic * configure.in: Add check for GnuTLS if --with-ssl=gnutls is used. 2005-07-04 Hrvoje Niksic * src/xmalloc.c (debugging_free): Prefix hex pointer value with "0x" when printing. 2005-07-04 Hrvoje Niksic * src/utils.c (base64_decode): Don't silently tolerate non-base64 non-white-space characters in the base64 stream. 2005-07-04 Hrvoje Niksic * src/utils.c (NEXT_BASE64_CHAR): Rename to NEXT_CHAR and simplify to get the next non-whitespace character. 2005-07-04 Hrvoje Niksic * src/openssl.c (openssl_errstr): Separate error messages with "; ". 2005-07-04 Hrvoje Niksic * src/openssl.c (openssl_errstr): Instead of always using a large static buffer, only allocate the error string when there is an actual error. 2005-07-04 Hrvoje Niksic * src/connect.c (LAZY_RETRIEVE_INFO): Make last_tick unsigned to match transport_map_modified_tick. 2005-07-04 Hrvoje Niksic * src/config-post.h (alloca): Updated declaration to not enumerate all Windows compilers. 2005-07-03 Hrvoje Niksic * src/url.c (path_end): Skip separators appropriate for the scheme. (strpbrk_or_eos): Remove gcc-specific version, as the optimization it tried to perform no longer applies. 2005-07-03 Hrvoje Niksic * src/mswindows.h: Also wrap accept() and listen(). 2005-07-03 Hrvoje Niksic * src/ftp.c (getftp): Ditto. * src/http.c (gethttp): Use fd_errstr. * src/connect.c (fd_register_transport): Restructure parameters to include only a single structure that describes transport implementation. * src/openssl.c (openssl_errstr): New function: dump SSL error strings into a static buffer and return a pointer to the buffer. * src/connect.c (fd_errstr): New function; returns transport-specific error message, or strerror(errno) if transport doesn't supply one. 2005-07-03 Hrvoje Niksic * po/POTFILES.in: Include src/ptimer.c. 2005-07-02 Hrvoje Niksic * src/version.c: Don't use "cvs" in version name, since we're not using CVS anymore. 2005-07-02 Hrvoje Niksic * src/url.c (url_parse): Would crash when parsing fragments. Support fragments for FTP URLs too. 2005-07-02 Hrvoje Niksic * src/progress.c (create_image): Ditto. * src/retr.c (retr_rate): Display smaller rate numbers with greater precision. 2005-07-02 Hrvoje Niksic * src/http.c (response_head_terminator): Minor optimization. * src/retr.c (fd_read_hunk): Call terminator with pointer to the start of the data and the pointer to the current data. Changed all callers. 2005-07-02 Hrvoje Niksic * src/http.c (gethttp): Skip error message body in the keep-alive case. 2005-07-02 Hrvoje Niksic * src/host.c: Remove extraneous definition of netdb.h. 2005-07-02 Hrvoje Niksic * src/host.c: Don't include "connect.h" now that we no longer have socket_has_inet6. 2005-07-01 Hrvoje Niksic * src/url.c (url_parse): Make sure u->params is not initialized for http/https URLs. (url_parse): Don't crash on garbage following []-delimited IPv6 addresses. 2005-07-01 Hrvoje Niksic * src/main.c (print_help): Don't refer to the non-existent -nr in description of --mirror. 2005-07-01 Hrvoje Niksic * configure.in: Mention in message that the "GNU" md5 implementation is in fact built-in to Wget. 2005-06-30 Hrvoje Niksic * src/http.c (gethttp): Explicitly document the different cases when generating the Host header. 2005-06-30 Hrvoje Niksic * src/host.c (pretty_print_address): Renamed to just print_address. Clarify documentation. 2005-06-30 Hrvoje Niksic * src/host.c (pretty_print_address): Handle error result from inet_ntop. 2005-06-30 Gisle Vanem * src/mswindows.c (inet_ntop): New function. Print IPv6 addresses using WSAAddressToString. 2005-06-29 Hrvoje Niksic * src/wget.h: Remove obsolete definition of with_thousand_seps_sum. 2005-06-29 Hrvoje Niksic * src/utils.c: Unconditionally include locale.h. 2005-06-29 Hrvoje Niksic * src/utils.c (random_number): Use lrand48 if available. (random_float): Use drand48 if available. 2005-06-29 Hrvoje Niksic * src/ptimer.c: Include sys/time.h to get struct timeval. 2005-06-29 Hrvoje Niksic * src/main.c: Check for both SIGHUP and SIGUSR1 before using them. 2005-06-29 Hrvoje Niksic * src/main.c (secs_to_human_time): Use print_decimal when printing total download time in seconds. * src/progress.c (print_row_stats): Use it to print total download time at the end of the download. (create_image): Ditto. * src/utils.c (print_decimal): New function; print small decimal numbers with more precision than large ones. * src/progress.c (print_row_stats): New function. Print ETA after the download rate at the end of each row. 2005-06-29 Hrvoje Niksic * src/gnu-md5.h: Unconditionally include limits.h. 2005-06-29 Hrvoje Niksic * m4/wget.m4 (WGET_WITH_NLS): Don't check for locale.h. 2005-06-29 Hrvoje Niksic * configure.in: Test for $LIBSSL instead of the old $ssl_success when deciding which MD5 to use. 2005-06-29 Hrvoje Niksic * configure.in: Require Autoconf 2.59. 2005-06-29 Hrvoje Niksic * configure.in: Check for drand48. 2005-06-28 Hrvoje Niksic * src/init.c (parse_line): Check for alphanumerics. 2005-06-28 Hrvoje Niksic * src/ftp.c (getftp): Delete trailing newlines from LIST output so lines don't come out with trailing \015\012 with -S. 2005-06-28 Hrvoje Niksic * src/(dot_create): Remove unnecessary casts. 2005-06-28 Hrvoje Niksic * doc/wget.texi (Logging and Input File Options): Don't claim that --base requires --force-html to work. 2005-06-27 Hrvoje Niksic * src/wget.h (PTR_FORMAT): Cast the result of sizeof to int before passing it to printf's %*. * src/retr.h: Declare output_stream and output_stream_regular. * src/ftp.h: Declare ftp_last_respline. * src/convert.h: Declare dl_url_file_map. * src/http.h: New file. 2005-06-27 Hrvoje Niksic * src/utils.c (with_thousand_seps): Handle negative numbers. 2005-06-27 Hrvoje Niksic * src/utils.c (human_readable): Remove intermediary cast to long; MSVC has problems casting *unsigned* __int64 to double. 2005-06-27 Hrvoje Niksic * src/utils.c (get_grouping_data): Force the use of separators in C locale. 2005-06-27 Hrvoje Niksic * src/utils.c (get_grouping_data): Force separator to "." rather than " " when "," is taken. 2005-06-27 Hrvoje Niksic * src/sysdep.h: Use the system-provided fnmatch by default. 2005-06-27 Hrvoje Niksic * src/progress.c (set_progress_implementation): Type COLON as const char *. 2005-06-27 Hrvoje Niksic * src/progress.c (dot_update): Remove unused variable row_qty. 2005-06-27 Hrvoje Niksic * src/mswindows.h: Remove superfluous includes. 2005-06-27 Hrvoje Niksic * src/main.c (i18n_initialize): Set all locale categories. * src/http.c (http_atotm): Temporarily set locale to "C". 2005-06-27 Hrvoje Niksic * src/http.c (mktime_from_utc): Renamed to timegm and moved to cmpt.c. Don't compile it if GNU timegm is available. (http_atotm): Use timegm. 2005-06-27 Hrvoje Niksic * src/http.c (http_atotm): Correctly query the old locale value. 2005-06-27 Hrvoje Niksic * src/http.c (gethttp): Improve "POST data file missing" error message. 2005-06-27 Hrvoje Niksic * src/cookies.c: Make cookies_now static. 2005-06-27 Hrvoje Niksic * src/config-post.h (alloca): Don't #define alloca under MinGW32, which defines it in malloc.h, included from mswindows.h. 2005-06-27 Hrvoje Niksic * src/config-post.h (alloca): Amend alloca declaration to take care of all Win32 compilers, not just MSVC and MinGW. 2005-06-26 Hrvoje Niksic * src/utils.c (with_thousand_seps): Correctly implement thousand seps consisting of more than one character. 2005-06-26 Hrvoje Niksic * src/progress.c (create_image): Mark the "eta" string for translation. 2005-06-26 Hrvoje Niksic * src/main.c (secs_to_human_time): Ditto. * src/progress.c (create_image): Print more exact duration of very short downloads. * src/main.c (secs_to_human_time): Don't translate time suffixes "h", "m", and "s", which are not strictly SI, but are "accepted for use with SI". (secs_to_human_time): Print really small intervals as 0s, not 0.00s. 2005-06-26 Hrvoje Niksic * src/main.c (main): Print the downloaded and quota amounts with the "human_readable" function. * src/ftp.c (print_length): Ditto. * src/http.c (gethttp): Don't display thousand separators. * src/utils.c (with_thousand_seps): Rewritten to respect locale settings and to be type size agnostic. 2005-06-26 Hrvoje Niksic * src/html-url.c (get_urls_file): Don't explicitly set entry->next to NULL since entry is already zeroed out. 2005-06-26 Hrvoje Niksic * src/ftp.c (getftp): Always invoke SIZE, not only when continuing a download. * src/main.c (main): Ditto here. * src/progress.c (create_image): When the download is finished, print how long it took. 2005-06-26 Hrvoje Niksic * src/config-post.h: Replace the alloca declaration with the one from the latest Autoconf manual. This should remove a warning with GCC on AIX. 2005-06-26 Hrvoje Niksic * m4/wget.m4: Use proper GPL header. 2005-06-26 Gisle Vanem * src/mswindows.h: Define gai_strerror under MinGW. 2005-06-25 Hrvoje Niksic * src/utils.c (with_thousand_seps_sum): Now defined only if SUM_SIZE_INT is double. * src/wget.h (SUM_SIZE_INT): Instead of bothering with long, long long, __int64, and friends, simply either use wgint or double, end of story. Since we know how to print either, we no longer need LARGE_INT_FMT. * src/sysdeps.h (LARGE_INT): Renamed to SUM_SIZE_INT to better reflect its intent, and moved to wget.h. 2005-06-25 Hrvoje Niksic * src/utils.c (with_thousand_seps_sum): Decrease buffer size so it cannot overrun add_thousand_seps's buffer. 2005-06-25 Hrvoje Niksic * src/utils.c (human_readable): Divide with 1024 instead of shifting so the operation can work with non-integer N. 2005-06-25 Hrvoje Niksic * src/utils.c (SPRINTF_WGINT): The correct format is %I64d, not just %I64. 2005-06-25 Hrvoje Niksic * src/progress.c (eta_to_human): New logic for more human-readable ETA. 2005-06-25 Hrvoje Niksic * src/http.c (http_loop): Don't warn about wildcards in HTTP URLs if globbing isn't requested in the first place. * src/retr.c (retrieve_url): Temporarily turn off globbing when processing HTTP->FTP redirects. 2005-06-25 Hrvoje Niksic * src/cookies.c (parse_set_cookies): Cast pointer subtraction to int before using it with %d; AIX compiler warns on this. Reported by Jens Schleusener. 2005-06-25 Hrvoje Niksic * src/Makefile.in: No need to clean .libs. 2005-06-25 Hrvoje Niksic * doc/wget.texi (Download Options): Update -4/-6 documentation to reflect the fact that we no longer use AI_ADDRCONFIG. 2005-06-25 Hrvoje Niksic * Makefile.in: No need to clean .libs. 2005-06-25 Hrvoje Niksic * Makefile.in: Instead of creating configure.bat from configure.bat.in, simply make sure the correct EOL style in checked out of the repository. 2005-06-25 Hrvoje Niksic * Makefile.in (DISTFILES): Don't split the sed invocation across several lines, Solaris make passes the backslashes to sed literally. 2005-06-24 Hrvoje Niksic * src/init.c: opt.verbose must be declared as int. 2005-06-24 Hrvoje Niksic * src/http.c (gethttp): Don't prepend / here. * src/cookies.c (cookie_handle_set_cookie): Prepend / to PATH. (cookie_header): Ditto. 2005-06-24 Hrvoje Niksic * doc/gpl.texi (GNU General Public License): Split GPL text into a separate file and include it from wget.texi. Used the latest template from gnu.org with the updated address of the FSF. 2005-06-24 Hrvoje Niksic * configure.in: Use AC_LIB_HAVE_LINKFLAGS instead of AC_LIB_LINKFLAGS when checking for library functions. * configure.in: Don't waste time checking for headers and functions we know must be there. But manually AC_DEFINE the functions that might be missing from non-Unix systems. 2005-06-24 Hrvoje Niksic * configure.in: Move the large file check further up. Only check for endianness if GNU md5 is used (it being the only file that needs endianness information). 2005-06-24 Hrvoje Niksic * configure.in: Include m4/*.m4. * aclocal.m4: Renamed to m4/wget.m4. 2005-06-24 Hrvoje Niksic * configure.in: Enable the user to turn off SSL autodetection and disable SSL using --without-ssl. * Makefile.in ($(srcdir)/stamp-h.in): Remove the aclocal.m4 dependencies. 2005-06-24 Hrvoje Niksic * configure.in: Don't indent #include lines. 2005-06-23 Mauro Tortonesi * libtool.m4, ltmain.sh: Deleted. * configure.in: Replaced ugly libtool-based check for OpenSSL libs with a simpler config.rpath-based approach. * Makefile.in, src/Makefile.in: Removed libtool support. * m4/lib-link.m4, m4/lib-prefix.m4, m4/lib-ld.m4, config.rpath: config.rpath macros taken from gettext 0.14.5. 2005-06-23 Hrvoje Niksic * util/dist-wget: Port to subversion. 2005-06-23 Hrvoje Niksic * src/utils.c (read_file): Ditto. * src/main.c (main): Use struct_fstat. * src/mswindows.h (struct_fstat): Define a struct_fstat to deal with the fact that Borland 5.5 has 64-bit stat, but not 64-bit fstat! 2005-06-23 Hrvoje Niksic * src/sysdep.h: Remove code that deals with Watcom. 2005-06-23 Hrvoje Niksic * src/cmpt.c (strpbrk): Removed. (mktime): Removed. Include . 2005-06-23 Hrvoje Niksic * doc/wget.texi (Contributors): Updated list of principal contributors. 2005-06-23 Hrvoje Niksic * configure.in: Don't check for strpbrk and mktime. 2005-06-22 Hrvoje Niksic * src/sysdep.h: Include the stdbool.h/_Bool/bool blurb from Autoconf. 2005-06-22 Hrvoje Niksic * src/openssl.c, connect.c, host.c: Replace instances of #ifdef ENABLE_DEBUG if (opt.debug) {...} #endif with IF_DEBUG {...}. * src/main.c: Rename the IF_DEBUG defined here to WHEN_DEBUG. * src/wget.h (IF_DEBUG): New macro. (DEBUGP): Define in terms of IF_DEBUG. 2005-06-22 Hrvoje Niksic * src/init.c (cmd_lockable_boolean): Removed. 2005-06-22 Hrvoje Niksic * src/http.c (gethttp): Only handle --set-cookies (and assert that cookie jar exists) if opt.cookies is true. Failure to do so triggered the assert when --no-cookies was used and the server sent a Set-Cookie header. Ouch! 2005-06-22 Hrvoje Niksic * src/cookies.c (struct cookie): Use 1-bit bitfields for booleans which makes the structure takes less space at no cost in complexity. 2005-06-22 Hrvoje Niksic * src/connect.c (select_fd): Expect select() to exist. * src/utils.c (xsleep): Always use select() as sleep fallback on non-Windows platforms. * src/ptimer.c: Delete the implementation of PTIMER_TIME. * src/main.c: Assume existence of signal(), test for different signal names instead. * src/cmpt.c: Better document reasons why certain functions are included. 2005-06-22 Hrvoje Niksic * src/all: Use bool instead of int and false/true instead of 0/non-0 for boolean variables and values. 2005-06-22 Hrvoje Niksic * src/Makefile.in: Remove the manually maintained dependency list; make all object files depend on every header. 2005-06-22 Hrvoje Niksic * src/Makefile.in ($(OBJ)): Add the config.h dependency. 2005-06-22 Hrvoje Niksic * doc/wget.texi: Use the more standard authorship phrase "and others". 2005-06-22 Hrvoje Niksic * doc/wget.texi (Wgetrc Commands): Remove the "lockable boolean" feature. 2005-06-22 Hrvoje Niksic * doc/wget.texi (Overview): Remove explicit vertical spacing. 2005-06-22 Hrvoje Niksic * doc/wget.texi (Mailing List): Remove reference to the wget-cvs list, which no longer exists. 2005-06-22 Hrvoje Niksic * doc/texinfo.tex: Update with a non-prehistoric version. 2005-06-22 Hrvoje Niksic * doc/texi2pod.pl: Locate perl using the "env" program, so we don't need to modify texi2pod. * doc/Makefile.in (wget.pod): Work with texi2pod.pl directly instead of generating it from texi2pod.pl.in. 2005-06-22 Hrvoje Niksic * configure.in: Don't check for signal.h. Remove the AC_HEADER_TIME check. Remove the test for ANSI C prototypes. 2005-06-22 Hrvoje Niksic * configure.in: Check for C99 conformant stdbool.h. 2005-06-22 Hrvoje Niksic * configure.in: Assume existence of gettimeofday and select. gettimeofday exists on all platforms we care about (except for Windows where Windows-specific functions are used instead), and select exists virtually everywhere. * configure.in: Assume existence of strerror, signal, strstr, and memmove, which are all required by ANSI C. 2005-06-22 Hrvoje Niksic * config.sub, config.guess: Updated from canonical location. 2005-06-22 Hrvoje Niksic * autogen.sh: New file. * Makefile.svn: Deleted, replaced with the even simpler (and more standard) `autogen.sh' script. 2005-06-22 Hrvoje Niksic * README.svn: Renamed to README.checkout. Edited to mention the autogen.sh script. 2005-06-22 Hrvoje Niksic * MAILING-LIST: Remove reference to the obsolete `wget-cvs' mailing list. 2005-06-21 Hrvoje Niksic * Makefile.cvs: Renamed to Makefile.svn. * README.cvs: Renamed to README.svn. 2005-06-20 Hrvoje Niksic * src/main.c, ptimer.c, sysdep.h, utils.c: Use #elif to simplify reading of chained if-else-else-else-... statements. 2005-06-20 Hrvoje Niksic * src/hash.c: Rename "mapping" to "cell" to avoid confusion with the term "mapping" (or "map") sometimes being used for the entire hash table. Also rename "non-empty" to "occupied" for easier reading of if (!NON_EMPTY (...)) ... . 2005-06-20 Hrvoje Niksic * src/all: Return type of signal handlers is `void'. Include signal.h unconditionally. * src/all: Don't explicitly cast values returned by malloc. We no longer support ancient compilers that don't declare malloc, and we never supported C++ builds. 2005-06-20 Hrvoje Niksic * doc/ansi2knr.1: Removed. 2005-06-20 Hrvoje Niksic * configure.in: Don't check for the return type of signal handlers; C89 requires it to be void. 2005-06-19 Hrvoje Niksic * src/init.c (cmd_lockable_boolean): Don't recognize literal "2" and "-1" values; unlike 0 and 1, those should never be exposed to the user. Update the error message to be more self-consistent, as proposed by Benno Schulenberg. 2005-06-19 Hrvoje Niksic * src/all: Don't declare errno. Include both time.h and sys/time.h, as long as sys/time.h exists. Don't dereference function pointers when invoking the functions they point to. * src/cmpt.c (memmove): Remove function mandated by C89. (strerror): Ditto. (strstr): Ditto. * src/all: Undo the K&R-ization changes from 2005-05-03. * src/all: Remove support for K&R compilers: use C89 function declarations, remove definition of PARAMS, remove support for varargs, and remove ansi2knr. Assume the presence of time.h, string.h, and other headers mandated by C89. 2005-06-19 Hrvoje Niksic * aclocal.m4: Remove support for K&R compilers. 2005-06-18 Hrvoje Niksic * src/http.c (gethttp): When -E is in use, check for file existence after appending ".html" to the name and modify the file name if necessary. 2005-06-18 Hrvoje Niksic * src/http.c (gethttp): Don't free "head" before using it to save headers. 2005-06-17 Hrvoje Niksic * src/connect.c (socket_has_inet6): Removed. * src/host.c (lookup_host): Don't use the AI_ADDRCONFIG getaddrinfo hint. 2005-06-16 Hrvoje Niksic * src/init.c (cmd_lockable_boolean): Improve the error message so it lists the more useful values first. 2005-06-16 Hrvoje Niksic * doc/wget.texi (Logging and Input File Options): It's --no-verbose, not --non-verbose. 2005-06-15 Hrvoje Niksic * src/http.c (gethttp): Also report the system error when the POST data file is missing. 2005-06-15 Hrvoje Niksic * src/host.h (ip_address): Remove the trailing comma from the type enum in the no-IPv6 case. * src/main.c (struct cmdline_option): Remove the trailing comma from the enum. Reported by Jens Schleusener. 2005-06-15 Benno Schulenberg * src/ftp.c, http.c, connect.c, cookies.c, html-url.c, init.c, res.c: Gettext-ize messages that were previously missed. 2005-06-06 Keith Moore * doc/Makefile.in: Fix a harmless (but annoying) installation warning. 2005-05-30 Hrvoje Niksic * src/url.c (strpbrk_or_eos): Check for a recent GCC version before using the statement-as-expression extension. 2005-05-30 Hrvoje Niksic * src/init.c (cmd_spec_header): Don't split the string along the commas using cmd_vector; just append the new value using vec_append instead. * src/utils.c (vec_append): New function. 2005-05-30 Hrvoje Niksic * src/http.c (gethttp): Don't attempt to "skip short body" if we're issuing a HEAD request (in which case the response head is not followed by a body). 2005-05-30 Hrvoje Niksic * doc/wget.texi (HTTP Options): Removed statement that redirect in response to POST is "technically disallowed", which I cannot find in rfc2616 nor in rfc1945. Even if that were technically the case, the widespreadedness of such responses would make the prohibition irrelevant. 2005-05-27 Andreas Beckmann * src/html-url.c (tag_handle_link): Mark the content from the tag as expecting HTML. 2005-05-24 Hrvoje Niksic * src/http.c (http_atotm): Document the origin of the "cookie date" format. 2005-05-21 Hrvoje Niksic * src/init.c (setval_internal): Report exact command name alongside the "display name". 2005-05-18 Hrvoje Niksic * src/cookies.c (update_cookie_field): Explicitly cast -1 to time_t to cope with systems where time_t is unsigned. * src/cookies.c: Remove unnecessary casts to time_t from values already of that type. 2005-05-17 Hrvoje Niksic * src/ftp.c (ftp_loop_internal): Same here. * src/http.c (http_loop): Don't clobber the file when -c is specified and the first attempt to retrieve the file fails. 2005-05-16 Hrvoje Niksic * src/openssl.c (ssl_check_certificate): Print all issues with a certificate. 2005-05-16 Hrvoje Niksic * src/init.c (cleanup): Don't free the non-existent opt.ftp_acc. Free the SSL-related stuff. 2005-05-16 Hrvoje Niksic * src/ftp.c (print_length): Consistently print \n at end of the "Length" line. 2005-05-16 Hrvoje Niksic * src/ftp-basic.c: Don't xfree() the line returned by ftp_response if the returned code is not FTPOK. 2005-05-14 Hrvoje Niksic * src/openssl.c (ssl_connect): Announce the beginning and the end of the SSL handshake when in debug mode. * src/wget.h (PTR_FORMAT): New macro for easier printing of pointer values. Use %0*lx along with PTR_FORMAT instead of %p. 2005-05-14 Hrvoje Niksic * src/http.c (gethttp): Would forget to close the connection when keep_alive was not used. 2005-05-14 Hrvoje Niksic * doc/wget.texi (Overview): Document --[no-]proxy as primarily being used to turn *off* the use of proxies. 2005-05-13 Hrvoje Niksic * src/openssl.c (pattern_match): Document the code that decides whether "*" matches ".". 2005-05-12 Hrvoje Niksic * src/url.c (rewrite_shorthand_url): Don't rewrite "https://host" to "ftp://https//host" when SSL is not used. 2005-05-12 Hrvoje Niksic * src/ftp-ls.c (ftp_index): Use %d to print the port number, which is now int. From Steven M. Schweda's VMS patches. 2005-05-11 Hrvoje Niksic * src/openssl.c (ssl_init): Always use SSL_VERIFY_NONE, so that the handshake finishes even if the certificate is invalid. That way ssl_check_server_identity can provide better diagnostics on why the verification failed. 2005-05-11 Hrvoje Niksic * src/openssl.c (ssl_check_server_identity): Renamed to ssl_check_certificate because it does more than just checking the server's identity. (ssl_check_certificate): Tell the user about --no-check-certificate. 2005-05-11 Hrvoje Niksic * src/openssl.c (pattern_match): New function. (ssl_check_server_identity): Treat peer certificate common name as wildcard. 2005-05-11 Hrvoje Niksic * doc/wget.texi (HTTPS (SSL/TLS) Options): Explain certificate checking in more detail. 2005-05-10 Hrvoje Niksic * src/res.c (res_register_specs): Correctly pass pointers to hash_table_get_pair. 2005-05-10 Hrvoje Niksic * src/openssl.c (ssl_check_server_identity): Print certificate subject and issuer. 2005-05-10 Hrvoje Niksic * src/http.c (gethttp): Call ssl_check_server_identity. * src/openssl.c (ssl_check_server_identity): New function, verifies that the host name in the certificate matches the actual host name. (verify_cert_callback): Removed, since it didn't do anything except returning the preverify_ok argument. * src/connect.c (fd_transport_context): Allow retrieval of the context pointer registered with fd_register_transport. 2005-05-10 Hrvoje Niksic * configure.in: Test for OpenSSL includes we actually need. 2005-05-09 Hrvoje Niksic * src/openssl.c (verify_cert_callback): Renamed from verify_callback. Always return the received "ok" value. Print the X509 name in debug mode. (ssl_init): Enable partial writes in SSL context. 2005-05-08 Hrvoje Niksic * src/netrc.c (parse_netrc): Explicitly check for assignment != NULL to silence warning from Borland C. * src/url.c (sync_path): Don't unnecessarily increment p. (url_parse): Don't unnecessarily set url_encode to NULL just prior to return from the function. 2005-05-08 Hrvoje Niksic * src/log.c (escnonprint_internal): Place variable declarations before other statements. 2005-05-08 Hrvoje Niksic * src/init.c (run_command): Correctly interpret the return value of parse_line. (commands): Re-alphabetize. 2005-05-08 Hrvoje Niksic * src/http.c (http_loop): Check for wildcards in the URL path component, not in the whole URL. * src/ftp.c (ftp_loop): Check for wildcards in URL path before unescaping, so the users can escape globbing metacharacters with % escapes. 2005-05-08 Hrvoje Niksic * src/html-url.c: Include recur.h. * src/http.c (request_new): Define as accepting no args. 2005-05-08 Hrvoje Niksic * doc/texi2pod.pl.in: Allow an "EXAMPLES" section. 2005-05-07 Hrvoje Niksic * src/url.c (strpbrk_or_eos): Made inline. Use strchr(s, '\0') for finding the NUL char position. 2005-05-07 Hrvoje Niksic * src/url.c (decide_copy_method): Renamed to char_needs_escaping. Since it now returns only two possible values, change it to return boolean (int). 2005-05-07 Hrvoje Niksic * src/url.c (decide_copy_method): Never cause reencode_escapes to decode % escapes; it is too intrusive and breaks some servers. 2005-05-07 Hrvoje Niksic * src/http.c (gethttp): When tunnelling SSL traffic over proxy with CONNECT, we're really talking to the remote server directly. Because of this, the request-line argument must be the URL path rather than the whole URL, as it would be when using regular proxies. Reported by Charles Lane. 2005-05-07 Hrvoje Niksic * src/ftp-basic.c (ftp_request): Prevent newlines in VALUE causing inadvertent sending of multiple FTP commands. 2005-05-06 Hrvoje Niksic * src/main.c (print_help): Fix wording of --secure-protocol help text. 2005-05-06 Hrvoje Niksic * src/init.c (cmd_spec_useragent): Allow empty User-Agent. * src/http.c (gethttp): Don't print "unknown authentication scheme" for failed Basic authentication. (SET_USER_AGENT): Don't set user-agent if opt.useragent is empty. (gethttp): Use alloca for allocation of www_authenticate. 2005-05-06 Hrvoje Niksic * src/cmpt.c (strstr): Updated from glibc 2.3.5. 2005-05-06 Hrvoje Niksic * doc/wget.texi (HTTP Options): Document empty user-agent. 2005-05-06 Hrvoje Niksic * doc/wget.texi (Download Options): Explain that the read timeout really refers to idle timeout. (Download Options): Mention that decimal and subsecond values may be used for timeouts. 2005-05-06 Hrvoje Niksic * configure.in: Set MAKEINFO to "true" so build doesn't fail for users without either makeinfo or the pre-packaged info files. 2005-05-06 Hrvoje Niksic * Makefile.in ($(srcdir)/stamp-h.in): Don't print the line with the comment about running autoheader. 2005-05-05 Hrvoje Niksic * src/utils.c (touch): Set access time to current time. 2005-05-05 Hrvoje Niksic * src/url.c (url_unescape): Don't unescape %00, it effectively truncates the string. 2005-05-05 Hrvoje Niksic * src/retr.c (limit_bandwidth_reset): Reset sleep_adjust. (limit_bandwidth): Don't allow huge "adjustment" values that result from being suspended for a while. 2005-05-05 Hrvoje Niksic * src/main.c (main): Propagate option name to setoptval. * src/init.c (setoptval): Accept another argument, OPTNAME. Propagate that argument as the option name independently of the actual command, determined by command_by_name(com). 2005-05-05 Hrvoje Niksic * src/log.c (copy_and_escape): Slightly reduce code repetition between the two loops. 2005-05-05 Hrvoje Niksic * src/log.c (copy_and_escape): Replace the FOR_URI argument with a slightly more general mechanism for specifying different kinds of escape. 2005-05-05 Hrvoje Niksic * src/init.c (parse_line): Make the return value indicate whether there was a syntax error or a setter failed. (run_wgetrc): Return an indication on whether an error has been encountered. (initialize): Abort if there have been errors running either .wgetrc file. 2005-05-05 Hrvoje Niksic * src/http.c (http_atotm): Zero out the whole struct tm being passed to strptime. 2005-05-05 Hrvoje Niksic * src/ftp-basic.c (ftp_response): Fix printing FTP server response. 2005-05-05 Hrvoje Niksic * doc/wget.texi: We're using GFDL 1.2, not 1.1. 2005-05-05 Hrvoje Niksic * doc/wget.texi (Contributors): Updated. 2005-05-05 Charles C.Fu * src/utils.c (proclist): Strip leading slash when calling fnmatch too, otherwise wildcard comparisons always fail. 2005-05-04 Hrvoje Niksic * src/wget.h: If gettext was found but libtintl.h wasn't, declare gettext's return type to avoid type mismatches. 2005-05-03 Hrvoje Niksic * src/url.c (url_parse): Rename label `error' to avoid conflict with identifier. * src/retr.c (fd_read_body): Rename label `out' to avoid conflict with identifier. (fd_read_hunk): Use explicit double constant. (retrieve_from_file): Don't use string concatenation. (sleep_between_retrievals): Make sure xsleep is called with a `double' argument. (no_proxy_match): Define as static, like it is declared. * src/progress.c (bar_create): Use 0.0 instead of 0 because K&R compilers can't automatically promote it. * src/http-ntlm.c (ntlm_output): Replace \xHH sequences with \OOO for the sake of old compilers. * src/ftp.c (ftp_loop_internal): Don't use string concatenation. * src/http.c (request_send): Use explicit double constants when calling fd_read and fd_write. (post_file): Ditto. (gethttp): Ditto. (skip_short_body): Ditto. * src/ftp-basic.c: When calling fd_write, specify the last argument as a `double' constant for the sake of K&R compilers which don't see the prototype and therefore can't promote it to double automatically. * src/cookies.c (cookie_jar_load): Rename abort label to abort_cookie to avoid name conflict in K&R compilers. 2005-05-02 Hrvoje Niksic * INSTALL: Document environment variables affecting configure, especially $CC. * INSTALL: Mention that make install requires root. 2005-04-29 Hrvoje Niksic * src/ptimer.c (posix_init): Since we allow _POSIX_MONOTONIC_CLOCK==0, it is not enough to check for _POSIX_MONOTONIC_CLOCK-0, we must also check for defined(_POSIX_MONOTONIC_CLOCK). 2005-04-29 Hrvoje Niksic * configure.in: Don't set ipv6 to yes only because struct sockaddr_in6 was found. Stop the rest of the IPv6 checks when one check fails. Abort if IPv6 was explicitly requested, but not found. 2005-04-28 Hrvoje Niksic * windows/Makefile.top.bor: Use MAKEDIR for make clean too. 2005-04-28 Hrvoje Niksic * windows/Makefile.src.bor: Don't suppress unreachable code warning. 2005-04-28 Hrvoje Niksic * windows/Makefile.src.bor: Don't delete executables other than wget.exe. Delete various auxilliary files created by the Borland build process. 2005-04-28 Hrvoje Niksic * src/init.c (commands): Wrap the use of opt.random_file in #ifdef HAVE_SSL. 2005-04-28 Hrvoje Niksic * src/http.c (gethttp): Correctly set the user agent. 2005-04-28 Hrvoje Niksic * src/ftp.c, hash.c, connect.c, host.c, http.c: Remove unreached code, such as "break" following return or abort. 2005-04-28 Hrvoje Niksic * NEWS: Advertise new-style syntax for --no-dns-cache instead of --dns-cache=off. 2005-04-28 Hrvoje Niksic * INSTALL: Mention --disable-ntlm. 2005-04-28 Herold Heiko * windows/wget.dep: Rename gen_sslfunc.c to openssl.c. 2005-04-27 Mauro Tortonesi * src/main.c: Map --ftp-password, --http-password and --proxy-password to the new ftppassword, httppassword and proxypassword commands respectively. Document the --user and --password options in the help string. 2005-04-27 Mauro Tortonesi * src/ftp.c: Add support for --user and --password. * src/http.c: Add support for --user and --password. * src/init.c: Deprecated ftppasswd, httppasswd, login, passwd and proxypasswd commands. Added ftppassword, ftpuser, httppassword, password, proxypassword and user commands. * src/main.c: Renamed --ftp-passwd to --ftp-password. Added --ftp-user, --http-password, --password, --proxy-password and --user. Deprecated --http-passwd and --proxy-passwd. Added documentation for new options and removed documentation for deprecated options in the help string. * src/options.h (struct options): Added user and passwd members to handle --user and --password respectively. Renamed ftp_acc and ftp_pass members to ftp_user and ftp_passwd for consistency. 2005-04-27 Mauro Tortonesi * doc/wget.texi: Fixed a broken reference to Security Considerations section in tex-generated documents (like the man page). 2005-04-27 Mauro Tortonesi * doc/wget.texi: Document --user, --password, --ftp-user and the corresponding Wgetrc command. Renamed --ftp-passwd to --ftp-password, --http-passwd to --http-passwd and --proxy-passwd to proxy_password. Renamed ftp_passwd to ftp_password, http_passwd to http_passwd and proxy_passwd to proxy_password. Removed documentation for the deprecated login command. 2005-04-27 Mauro Tortonesi * NEWS: Mention the new --ftp-user, --ftp-password, --user and --password options, the name changes for --http-passwd and --proxy-passwd and the deprecation of login and passwd commands. 2005-04-27 Hrvoje Niksic * src/openssl.c (ssl_init): Ditto. * src/options.h (struct options): Allow separate specification of key type and certificate type. * src/init.c (cmd_spec_cert_type): Provide a "der" synonym for "asn1" certificate encoding. 2005-04-27 Hrvoje Niksic * src/openssl.c (init_prng): Disable the weak random seed by default. * src/http.c (gethttp): Simplify SSL initialization; disable SSL when anything goes wrong with the initialization. * src/options.h (struct options): New option opt.random_file. 2005-04-27 Hrvoje Niksic * src/main.c (print_help): Advertise "DER", not "ASN1". 2005-04-27 Hrvoje Niksic * src/init.c: Wrap private key commands in IF_SSL. 2005-04-27 Hrvoje Niksic * src/init.c (cmd_spec_useragent): Free the old value of opt.useragent before setting the new one. 2005-04-27 Hrvoje Niksic * doc/wget.texi: Improve wording of command descriptions. 2005-04-27 Hrvoje Niksic * doc/wget.texi: Document the new form of SSL/TLS options. 2005-04-27 Hrvoje Niksic * doc/wget.texi (HTTPS (SSL/TLS) Options): Document --random-file. 2005-04-27 Hrvoje Niksic * doc/wget.texi (HTTP Options): Mention --keep-session-cookies when documenting --post-data. 2005-04-27 Hrvoje Niksic * doc/wget.texi (Download Options): Fix bind address cindex entry that broke concept index generation. 2005-04-26 Hrvoje Niksic * src/openssl.c: Renamed "gen_sslfunc.c" to "openssl.c" and "gen_sslfunc.h" to "openssl.h". This reflects the intent of openssl.c encapsulating the OpenSSL-specific code. 2005-04-26 Hrvoje Niksic * src/init.c: Renamed "closure" (a synonym for context in some cultures) to "place", which more accurately reflects the usage. 2005-04-26 Hrvoje Niksic * src/http.c (gethttp): Call skip_short_body only if keep_alive is in use. (gethttp): Send the User-Agent header with the CONNECT request as well. 2005-04-26 Hrvoje Niksic * src/gen_sslfunc.c (ssl_init): Use default locations for loading the certificate bundles. (ssl_init_prng): Disable the cryptographically weak PRNG initialization fallback. * src/init.c: Renamed SSL command-line arguments and wgetrc commands. (defaults): Check the server certificate by default. 2005-04-26 Hrvoje Niksic * src/cookies.c (cookie_handle_set_cookie): Delete the part of the path after the trailing slash. * src/http.c (gethttp): Call cookie_handle_set_cookie with path that begins with '/'. 2005-04-26 Hrvoje Niksic * doc/wget.texi (Option Syntax): Document boolean options. Include the option syntax in the man page. (Directory Options): Removed stray text after --protocol-directories. 2005-04-26 Hrvoje Niksic * doc/wget.texi (HTTP Options): Improved entry on --keep-session-cookies. 2005-04-26 Hrvoje Niksic * doc/wget.texi (Directory Options): Removed stray text after --protocol-directories. 2005-04-25 Hrvoje Niksic * src/main.c (option_data): Removed support for the undocumented flag --use-proxy. 2005-04-25 Hrvoje Niksic * src/main.c (option_data): Don't treat -Y as a boolean switch; treat it as a value switch instead, so "-Y off" continues to work. 2005-04-25 Hrvoje Niksic * doc/wget.texi (Download Options): Document --retry-connrefused. 2005-04-25 Hrvoje Niksic * doc/wget.texi (Advanced Usage): Don't advertise the non-existent `-s' option. 2005-04-25 Hrvoje Niksic * doc/Makefile.in (wget.info): Depend on version.texi as well. * doc/wget.texi: Simplify copyright. Replace remaining instances of --OPTION=off with --no-OPTION. 2005-04-24 Hrvoje Niksic * src/utils.c (aprintf): Delete unreachable statement. 2005-04-24 Hrvoje Niksic * src/host.c (cmp_prefer_ipv4): New function. (cmp_prefer_ipv6): New function. (lookup_host): Use the appropriate comparator according to opt.prefer_family. * src/init.c: New option prefer_family. * src/host.c (is_valid_ipv6_address): Spell NS_* constants in lower case to avoid clash with system headers. (lookup_host): Reorder the addresses so that IPv4 ones come first. * src/utils.c (stable_sort): New function. 2005-04-24 Hrvoje Niksic * src/connect.c (retryable_socket_connect_error): Return 0 for ENETUNREACH and EHOSTUNREACH. 2005-04-24 Hrvoje Niksic * doc/wget.texi (Download Options): Don't claim that -6 accepts mapped IPv4 addresses. 2005-04-24 Hrvoje Niksic * doc/wget.texi (Download Options): Document --prefer-family. 2005-04-23 Hrvoje Niksic * src/utils.c (base64_encode): Treat input as unsigned chars. Required for correct encoding of binary stuff. 2005-04-23 Hrvoje Niksic * src/http-ntlm.c: Format the function definitions in an ansi2knr-friendly fashion. 2005-04-23 Hrvoje Niksic * src/cmpt.c: Reenable the memmove implementation for systems that lack it. * src/http.c (gethttp): Store the "authorized" state of the persistent connection. (request_remove_header): New function. (gethttp): Don't send the "Basic" authentication if the connection is already authorized. 2005-04-23 Hrvoje Niksic * doc/wget.texi: Documented the SSL command-line options. 2005-04-23 Hrvoje Niksic * doc/wget.texi (Wgetrc Commands): Document ftp_passwd. (FTP Options): Document --ftp-passwd. 2005-04-23 Hrvoje Niksic * doc/texi2pod.pl.in: First process @@ then @}, so @samp{-wget@@} is interpreted correctly. 2005-04-22 Hrvoje Niksic * src/http.c (gethttp): Handle multiple WWW-Authentication headers, only one of which is recognized. Those are sent by IIS with NTLM authorization. (create_authorization_line): Propagate information whether authorization is finished. (gethttp): Only stop authorization when it's really finished, not after fixed two steps. 2005-04-22 Hrvoje Niksic * po/eo.po: Added Esperanto translation. 2005-04-21 Hrvoje Niksic * src/main.c (print_help): Print the EGD option outside the cluster of SSL options. 2005-04-21 Hrvoje Niksic * src/http-ntlm.c (ntlm_output): Fix setting the domain. Suggested by Sami Krank. 2005-04-21 Hrvoje Niksic * src/gen_sslfunc.c (ssl_init): Fix warning message text; mark the message as translatable. 2005-04-21 Hrvoje Niksic * po/vi.po: Added Vietnamese translation. 2005-04-20 Mauro Tortonesi * src/connect.c: Set IPV6_V6ONLY socket option when -6 switch is used. 2005-04-20 Hrvoje Niksic * doc/wget.texi: Document behavior of -6 wrt mapped IPv4 addresses. 2005-04-20 Hrvoje Niksic * doc/wget.texi: Document IPv6 related options. 2005-04-20 FUJISHIMA Satsuki * src/http.c (request_set_header): Fix the check whether a new header needs to be allocated. 2005-04-18 Hrvoje Niksic * src/utils.c (base64_encode): Use the parameter order that makes more sense. Return the length of the base64 written. Updated all callers. 2005-04-18 Hrvoje Niksic * doc/wget.texi: Update mailing list information. 2005-04-18 Hrvoje Niksic * doc/wget.texi (Download Options): Don't claim that --no-dns-cache is necessary for dyndns servers -- it's not. 2005-04-18 Hrvoje Niksic * MACHINES: Removed. 2005-04-17 Hrvoje Niksic * src/snprintf.c (fmtstr): Declare VALUE as const char *. Based on patch by Russ Allbery. 2005-04-17 Hrvoje Niksic * src/snprintf.c (fmtfp): More correct handling of significant digit count with %g -- 0.002 has one significant digit, not three. 2005-04-17 Hrvoje Niksic * src/http.c (request_set_header): Free NAME when VALUE is NULL and freeing the header name is requested. 2005-04-16 Hrvoje Niksic * src/snprintf.c: Use the PARAMS macro to handle prototypes. Write function definitions in the ansi2knr-friendly way. (fmtstr): If string precision is specified, don't read VALUE past it. (dopr): Actually print %g and %e formats. (fmtfp): Fix a bug that caused 0.01 to be printed as 0.1. (fmtfp): Use LLONG in floating point conversions to be able to convert more digits. (fmtfp): Interpret precision as number of significant digits with %g. (fmtfp): Omit trailing decimal zeros with %g. * src/snprintf.c: Don't include because none of it is used. Include strings.h/string.h, as per Autoconf. 2005-04-16 Hrvoje Niksic * src/retr.c (fd_read_body): Respect read timeout with non-interactive or no progress gauge -- treat ETIMEDOUT specially only when progress_interactive. Reported by FUJISHIMA Satsuki. 2005-04-16 FUJISHIMA Satsuki * src/http.c (gethttp): Don't use HTTP/1.0 persistent connections over proxy. 2005-04-15 Hrvoje Niksic * src/ptimer.c: Use _POSIX_TIMERS - 0 > 0, which handles the case when _POSIX_TIMERS is defined but empty, as well as the case when it is undefined. Do the same with the check for _POSIX_MONOTONIC_CLOCK. Suggested by Larry Jones. 2005-04-15 Hrvoje Niksic * src/ptimer.c: Check that _POSIX_TIMERS is defined as well as > 0. Problem reported by Steven M. Schweda. 2005-04-14 Hrvoje Niksic * src/http.c (skip_short_body): Print the skipped body data in debug mode. (skip_short_body): Don't skip more than 4k of body data. (skip_short_body): Return whether the skipping was successful. (gethttp): If skip_short_body failed, invalidate the connection. 2005-04-12 Gisle Vanem * src/ftp.c (getftp): Ditto. * src/http.c (gethttp): Open the output file in binary mode. 2005-04-11 Hrvoje Niksic * src/xmalloc.c (register_ptr): Warn the user to increase SZ prior to aborting. 2005-04-11 Hrvoje Niksic * src/hash.c: Conditionalize including config.h on HAVE_CONFIG_H instead of on STANDALONE. 2005-04-11 Hrvoje Niksic * src/gen_sslfunc.c (ssl_print_errors): Made static. (ssl_print_errors): Eliminate potentially dangerous fixed-size error buffer. Passing NULL to ERR_error_string causes it to use its own static buffer, which is fine for our purposes. * src/gen_sslfunc.c: Include gen_sslfunc.h. Make the declarations of ssl_init and ssl_conect match the actual definitions. (ssl_connect): Simply return a boolean, the SSL context is no longer useful as a return value. * src/retr.c: Don't include SSL headers; we don't need them. 2005-04-11 Hrvoje Niksic * src/convert.c (convert_cleanup): Free converted_files. 2005-04-11 Hrvoje Niksic * src/config-post.h: Don't define _VA_LIST under Solaris; it breaks compilation with GCC 3.4 under Solaris 10. (It was defined to avoid a warning with previous versions of GCC.) 2005-04-10 Hrvoje Niksic * src/wget.h (DEBUGP): Use __builtin_expect to give a hint to GCC that opt.debug is false in the vast majority of cases. 2005-04-10 Hrvoje Niksic * src/config-post.h: Declare alloca as void *, not char *. 2005-04-09 Hrvoje Niksic * src/url.c: Use "static const" in preference to "const static". Sun's cc warns that "storage class after type is obsolescent". * src/url.c (urlchr_table): Don't mark ~ as unsafe, too many broken web sites are confused when ~ is changed to %7E. Their servers redirect /%7Efoo/ to /~foo/, which Wget again accesses using %7E, causing further redirections, therefore looping infinitely. See Debian bug #301624 for an example. 2005-04-09 Hrvoje Niksic * src/recur.c (download_child_p): When -p is used, (temporarily) ignore accept/reject rules for HTMLs, even when they are at the maximum recursion depth. That is because with -p we are, if necessary, overstepping the max. depth to get the requisites. 2005-04-09 Hrvoje Niksic * src/http-ntlm.c (ntlm_output): Use "char", not "unsigned char" for ntlmbuf. Our base64 functions accept char anyway. (ntlm_output): Join up the format string, since we nominally support K&R compilers. (ntlm_output): Ditto. 2005-04-09 Hrvoje Niksic * src/alloca.c: Include wget.h to be able to use xmalloc. In addition to defining malloc to xmalloc, also define free to xfree. 2005-04-08 Larry Jones * doc/Makefile.in (wget.info): Don't use $< in an explicit rule. 2005-04-08 Hrvoje Niksic * windows/Makefile.src: Compile ptimer.c and http-ntlm.c. 2005-04-08 Hrvoje Niksic * src/ptimer.c: Use Windows timers under Cygwin, whose POSIX timer implementation is incomplete. 2005-04-08 Hrvoje Niksic * src/ptimer.c (struct ptimer): Remove the unused initialized field. * src/ptimer.c: Renamed function parameters from WT (which used to stand for wget_timer or wtimer) to PT. 2005-04-08 Hrvoje Niksic * src/ptimer.c (ptimer_init): In fact, _POSIX_MONOTONIC_CLOCK needs to be >= 0. * src/ptimer.c (ptimer_init): Check whether _POSIX_MONOTONIC_CLOCK is defined instead of whether it's greater than 0. glibc defines it to 0, but still makes it available via sysconf. 2005-04-08 Hrvoje Niksic * src/ptimer.c (ptimer_init): Explicitly check that _POSIX_MONOTONIC_CLOCK is *both* defined and >=0. (Undefined symbols are >=0.) 2005-04-08 Hrvoje Niksic * src/ptimer.c (ptimer_diff): Fix typo affecting Windows build. 2005-04-08 Hrvoje Niksic * src/ptimer.c (posix_init): Be smarter about choosing clocks. In decreasing order of preference, use CLOCK_MONOTONIC, CLOCK_HIGHRES, and CLOCK_REALTIME. (ptimer_allocate): Removed. * src/ptimer.c: Refactor the code by cleanly separating the architecture-dependent code from the architecture-independent code. 2005-04-08 Hrvoje Niksic * src/mswindows.c (str_to_int64): Rename OVERFLOW and UNDERFLOW to INT64_OVERFLOW and INT64_UNDERFLOW, since those names are used. 2005-04-08 Hrvoje Niksic * configure.in: When checking for OpenSSL headers, check for all the ones that Wget is using. 2005-04-08 Hrvoje Niksic * configure.in: Use it. * aclocal.m4 (WGET_POSIX_CLOCK): Check whether -lrt is needed to use POSIX clock functions like clock_gettime. 2005-04-08 Hrvoje Niksic * Makefile.in ($(srcdir)/stamp-h.in): Don't attempt to run autoheader automatically; it breaks things with fresh CVS builds. 2005-04-07 Hrvoje Niksic * src/ptimer.c: New file. Move the "wtimer" functions from utils.c to this file and rename them to ptimer_. 2005-04-07 Hrvoje Niksic * src/host.c (NO_ADDRESS): Define NO_ADDRESS only after the system headers have been included. 2005-04-06 Hrvoje Niksic * src/http.c (pconn): Include NTLM data, which is per-connection. (known_authentication_scheme_p): Recognize NTLM authorization. (create_authorization_line): Call ntlm_input and ntlm_output. * src/http-ntlm.c: New file, donated by Daniel Stenberg and originally written for curl, heavily modified for Wget. * src/utils.c (base64_encode): Relocated from http.c, since it is now used by http-ntlm.c, and will possibly be used elsewhere. (base64_decode): New function, originally based on code from GNU recode. 2005-04-06 Hrvoje Niksic * configure.in: Allow the user to disable NTLM authorization. Make sure NTLM is disabled if OpenSSL is unavailable. If NTLM is *explicitly* requested and OpenSSL is unavailable, abort. * configure.in: Renamed USE_* to ENABLE_*. 2005-04-05 Mauro Tortonesi * src/Makefile.in: removed string_t.c from list of source files. 2005-04-04 Mauro Tortonesi * src/string_t.c: Removed. * src/string_t.h: Removed. 2005-04-02 Hrvoje Niksic * src/url.c (rewrite_shorthand_url): Only accept recognized schemes. That way "foo:80" will correctly be rewritten to "http://foo:80" instead of left unchanged and ultimately rejected because of "unsupported scheme foo". 2005-04-02 Hrvoje Niksic * src/ftp.c (ftp_loop): Ditto. * src/ftp-basic.c (ftp_pasv): Use the xzero shorthand for memset(0). (ftp_lpsv): Ditto. 2005-03-31 Hrvoje Niksic * src/utils.c (number_to_string): Avoid explicit 64-bit constants; construct them by multiplication at compile-time. * src/utils.c, elsewhere: Don't append "L" to 32-bit integer constants; we aren't really compilable on 16-bit systems anyway. * src/hash.c (prime_size): Remove primes larger than 2^31, but include 2^31-1, which is prime. 2005-03-30 Hrvoje Niksic * src/utils.c (string_set_to_array): New function. * src/convert.c: Replace the use of "slists" with sets/hash-tables, which in fact suit the intended purpose much better. downloaded_html_list is removed altogether. 2005-03-29 Hrvoje Niksic * src/utils.c (numdigit): More correct handling of negative numbers. 2005-03-29 Hrvoje Niksic * src/ftp.h (enum): Rename GLOBALL, GETALL, and GETONE to GLOB_GLOBALL, GLOB_GETALL, and GLOB_GETONE to avoid conflict with Linux headers. 2005-03-23 Hrvoje Niksic * po/POTFILES.in: Removed headers.c and rbuf.c. 2005-03-22 Joseph Caretto * doc/texi2pod.pl.in: Handle asis again. It used to work (see the 2001-12-11 entry), but the local change was lost in the upgrade to 1.4. 2005-03-21 Hrvoje Niksic * src/http.c (gethttp): Print the human-readable size. * src/ftp.c (getftp): Print the human-readable size of the file to be downloaded. * src/utils.c (human_readable): New function. * src/utils.c: Renamed "legible" to "with_thousand_seps", "legible_large_int" to "with_thousand_seps_large", and "legible_1" to "add_thousand_seps". 2005-03-21 Hrvoje Niksic * src/http.c (gethttp): Inhibit persistent connections when talking to proxies, as mandated by RFC 2068. 2005-03-20 Hrvoje Niksic * src/url.c (unescape_single_char): New function. (url_escape_dir): Use it to unescape slashes in directory components. (url_string): Escape unsafe chars in host name, except for the ':' charaters, which can appear in IPv6 addresses. * src/main.c (main): Don't access the cookie jar directly. * src/log.c (escnonprint_internal): Correctly calculate the needed string size. Don't forget the buffer's new size after having reallocated it. (log_cleanup): New function. Free the escnonprint ring data. * src/init.c (cleanup): Don't free the cookie jar explicitly, it is now done by http_cleanup. (cleanup): opt.user_headers is now a vector, free it with free_vec. * src/http.c (gethttp): Make sure to free the request data, the status message, and the response data before returning from the function. (save_cookies): New function. (http_cleanup): Free the cookie jar here. * src/hash.c: Renamed string_hash to hash_string and ptrhash to hash_pointer. Exported hash_pointer. * src/xmalloc.c: Organized malloc_table (previously malloc_debug) as a simple EQ hash table. register_ptr and unregister_ptr are now of O(1) complexity. * src/xmalloc.c: Renamed "*_debug" to debugging_* and "*_real" to checking_*. 2005-03-18 Hrvoje Niksic * src/init.c (cmd_file): Use concat_strings. * src/http.c (basic_authentication_encode): Use concat_strings. * src/ftp-ls.c (ftp_index): Use concat_strings. * src/ftp-basic.c (ftp_request): Use concat_strings. * src/utils.c (concat_strings): New function. 2005-03-17 Hrvoje Niksic * src/ftp-basic.c (ftp_login): Don't free the string if ftp_response returned an error status because the line didn't get allocated in the first place. 2005-03-15 Hrvoje Niksic * src/retr.c (fd_read_body): Undo the 2004-11-18 change. Instead, always be "exact". 2005-03-15 Hrvoje Niksic * src/http.c (read_http_response_head): Limit the response size to 64k bytes. * src/retr.c (fd_read_hunk): Accept a MAXSIZE argument that limits the number of bytes the function is allowed to allocate. (fd_read_line): Limit the line to 4096 bytes. 2005-03-12 Hrvoje Niksic * src/wget.h: Include options.h after wgint has been defined. * src/options.h (struct options): Declare options processed with cmd_bytes as wgint, not long. 2005-03-12 Hrvoje Niksic * src/utils.c (debug_test_md5): Moved to gen-md5.c. * src/mswindows.h: Don't declare inet_ntop, since we don't use it. * src/mswindows.h: For consistency, also wrap closesocket, it being a Winsock call. * src/mswindows.h: Don't declare sleep and usleep; we're defining xsleep now. * src/mswindows.h (mkdir): Don't special-case Borland C, _mkdir works there as well. * src/host.c: Don't include winsock header files; the correct ones are already included by mswindows.h. * src/mswindows.c (xsleep): Round toward the nearest millisecond in an attempt to avoid average short sleeps. * src/utils.c (wtimer_granularity): Report correct values for Windows timers and for high-resolution timers. * src/utils.c (wtimer_initialize_once): New function, called to initialize the timer frequency. * src/utils.c: Replace the use of GetSystemTime with high-resolution counters under Windows. When high-resolution counters are unavailable, use GetTickCount(). 2005-03-12 Hrvoje Niksic * src/init.c (simple_atof): Handle negative numbers; skip whitespace before the number. (simple_atoi): Ditto. Also, check for overflow and underflow. (cmd_number): Bail out on negative numbers. (parse_bytes_helper): Ditto. 2005-03-06 Hrvoje Niksic * windows/Makefile.src.bor: Reenable warnings under Borland C, disabling only specific warnings. Generate Pentium Pro code by default. 2005-03-06 Hrvoje Niksic * src/utils.c (read_file): Don't use wgint for file size; LFS won't work since the file is mmap'ed as a whole. * src/options.h: Don't include stdio.h. * src/log.c: Ditto. * src/init.c: Disambiguate assignment from non-zero test to avoid Borland C warning. * src/http.c (response_new): Don't needlessly post-increment count. * src/hash.c: Include stdio.h. * src/gnu-md5.h: Don't include stdio.h. * src/getopt.h (struct option): Always use const. * src/ftp.c (getftp): Avoid unnecessary assignment to RES to avoid Borland C warning. * src/ftp-ls.c: Disambiguate assignment from non-zero test to avoid Borland C warning. * src/cmpt.c (strptime_internal): Don't initialize rp_backup when !_NL_CURRENT to avoid Borland C warning. 2005-03-06 Hrvoje Niksic * src/utils.c (fopen_excl): Fix parse error when O_BINARY is available. 2005-03-06 Hrvoje Niksic * src/url.c (url_parse): Reject port numbers larger than 65535. We also check for overflow while parsing port numbers. 2005-03-06 Hrvoje Niksic * src/mswindows.c: Provide wrappers to Winsock functions that set errno to WSAGetLastError() in case of failure. Also provide a Windows-specific version of strerror. * src/mswindows.h: Wrap calls to socket, bind, connect, recv, send, select, getsockname, getpeername, and setsockopt. 2005-03-06 Hrvoje Niksic * src/init.c (defaults): Use passive FTP by default. 2005-03-06 Hrvoje Niksic * src/http.c (gethttp): Handle multiple Set-Cookie headers sent by remote server. 2005-03-05 Hrvoje Niksic * src/url.c (url_file_name): Don't allow hosts named ".." to be appended as path elements. 2005-03-03 Hrvoje Niksic * src/retr.c (retrieve_url): Escape location header. * src/http.c (print_server_response_1): Escape server response when printing it. (gethttp): Escape host name, status message, location header, and content type. (http_loop): Escape error message from server. * src/host.c (lookup_host): Escape host name when printing it. * src/ftp.c (getftp): Escape user name when printing it. (getftp): Escape remote file and directory for printing. (getftp): Escape server listing when printing it. (ftp_retrieve_list): Escape link name and file name. (ftp_retrieve_glob): Escape file name. * src/ftp-basic.c (ftp_response): Escape server response when printing it. * src/cookies.c (parse_set_cookies): Escape the cookie field when printing it. (parse_set_cookies): Escape contents of remote header. (cookie_handle_set_cookie): Escape host name and cookie domain. * src/connect.c (connect_to_ip): Escape the host name. * src/log.c (escnonprint): New function, used for printing strings coming from the server that possibly contain non-ASCII characters. (escnonprint_uri): Ditto. 2005-02-26 Hrvoje Niksic * src/utils.c: Use the nnnI64 syntax for __int64 constants under all Windows compilers. (SPRINTF_WGINT): Use "%I64" under all Windows compilers. * src/mswindows.h (WGINT_MAX): Use the nnnI64 syntax for __int64 constants under all Windows compilers. (struct_stat): Use `struct stati64' under Borland C. (fstat): Don't redefine to _fstati64 under Borland. * src/mswindows.c: Define str_to_int64 under Borland C as well as under (older) Visual C. 2005-02-26 Gisle Vanem * src/utils.c: Use the nnnLL syntax under GCC. Define struct_stat to struct _stati64 under __MINGW32__ as well as under MS VC. 2005-02-25 Hrvoje Niksic * src/ftp.c (getftp): Initialize err to suppress compiler warning. 2005-02-24 Hrvoje Niksic * src/mswindows.c (wget_ftello): Removed. * src/mswindows.h: Remove bogus definition of stat. (wget_ftello): Remove the leftover ftello replacement. 2005-02-24 Hrvoje Niksic * src/host.c (lookup_host): Test for AI_ADDRCONFIG directly, instead of checking for HAVE_GETADDRINFO_AI_ADDRCONFIG. 2005-02-24 Hrvoje Niksic * src/ftp.c (getftp): Ditto. * src/http.c (gethttp): When we're not supposed to overwrite files, use fopen_excl to open the file and recompute the file name. * src/log.c (redirect_output): Use unique_create to avoid a race condition. * src/mswindows.c (fake_fork_child): Use unique_create. * src/utils.c (fopen_excl): New function that opens a stdio stream with the O_EXCL flag (where available). (unique_create): New function, like unique_name, but also creating the file and returning a file pointer. (fork_to_background): Use unique_create to create the file immediately to avoid race condition with multiple instances of wget -b. 2005-02-24 Hrvoje Niksic * src/ftp.c (ftp_expected_bytes): Fix bug that caused infloop because of not correctly skipping the '(' character. 2005-02-23 Hrvoje Niksic * src/host.c (is_valid_ipv6_address): Move here from url.c. (lookup_host): If the address is numeric, don't print the "resolving..." line, don't set up DNS timeouts, and set the AI_NUMERICHOST hint, where available. 2005-02-20 Hrvoje Niksic * src/mswindows.c (wget_ftello): Wget's replacement for ftello. * src/utils.c (file_size): Use ftello where available. * src/ftp-ls.c (ftp_parse_unix_ls): Use str_to_wgint to parse the file size. (ftp_parse_winnt_ls): Ditto. * src/ftp-basic.c (ftp_size): Use str_to_wgint to convert number to wgint; pass 10 instead of 0 as the BASE argument. * src/ftp.c (ftp_expected_bytes): Use str_to_wgint to parse the file size. * src/sysdep.h (LARGE_INT_FMT): Use __int64 as LARGE_INT on MSVC/Windows; print it with "%I64". * src/wget.h: Define a `wgint' type, normally aliased to (the possibly 64-bit variant of) off_t. * src/all: Use `wgint' instead of `long' for numeric variables that can hold file sizes. * src/utils.c (number_to_string): Support printing of `wgint' argument. (number_to_static_string): New function. * src/all: Replace printf("%ld", long_value) with printf("%s", number_to_static_string(wgint_value)). 2005-02-20 Hrvoje Niksic * configure.in: Check for LFS. Determine SIZEOF_OFF_T. Check for ftello. 2005-02-18 Mauro Tortonesi * src/main.c: Added the --ftp-passwd command line option. * src/init.c: Renamed command passwd to ftppasswd. 2005-02-18 Marco Colombo * po/it.po: Updated Italian translation. 2005-02-11 Mauro Tortonesi * src/string_t.c: Fixed a bug in do_escape and triggered escape of backslashes in string_escape to avoid ambiguities in the result string. 2005-02-11 Mauro Tortonesi * doc/wget.texi: Added Simone Piunno as new contributor. 2005-02-10 Mauro Tortonesi * src/string.h: Renamed to string_t.h to fix a compilation conflict with the string.h header in the standard C library. * src/string.c: Renamed to string_t.c for consistency with string.h. * src/string_t.c: Ditto. * src/string_t.h: Ditto. 2005-01-01 Mauro Tortonesi * doc/wget.texi: Updated copyright information, added new contributors. 2004-12-31 Mauro Tortonesi * src/string.c: New file. * src/string.h: New file. * src/Makefile.in: Added string.c to the list of modules to compile. * src/main.c: Updated copyright. 2004-11-20 Hans-Andreas Engel * doc/wget.texi: Describe limitations of combining `-O' with `-k'. 2004-11-18 Ulf Harnhammar * src/ftp-ls.c: Fixed a problem in ftp_parse_winnt_ls that could allow a malicious remote FTP server to crash wget. 2004-11-18 Mauro Tortonesi * src/connect.c: Minor correction to the comment in front of fd_peek. 2004-11-18 Leonid Petrov * src/retr.c: Fix a timeout problem in fd_read_body when using http_proxy. 2004-11-18 Hans-Andreas Engel * src/http.c: Enable --convert-links (-k) when a single page is downloaded via --output-document (-O). 2004-11-15 YAMAZAKI Makoto * src/netrc.c: Fix termination by assertion bug in netrc parsing. 2004-06-12 Larry Jones * src/recur.c (download_child_p): Correct the logic in check number 6: test opt.reclevel (not DEPTH) against INFINITE_RECURSION. 2004-05-13 Nico R. * doc/Makefile.in: Allow building in a separate tree with source tree write-protected. 2004-05-09 David Fritz * windows/Makefile.src.bor: Fix broken build rule. Add clean target. * windows/Makefile.top.bor: Use tabs instead of spaces. Ignore errors in clean rules. Use lowercase filenames when building distribution .zip archive. * windows/config.h.bor: Don't define HAVE_UINT32_T. * windows/Makefile.doc: Fix remaining instance of build rules indented with spaces instead of tabs. * windows/Makefile.src: Update copyright year. * windows/Makefile.top: Update copyright year. * windows/config.h.mingw (WGET_USE_STDARG, HAVE_SIG_ATOMIC_T): Define. * windows/config.h.ms (HAVE_STRPBRK, HAVE_LIMITS_H) (HAVE_LOCALE_H): Define. * windows/Makefile.watcom: Add /I. to CFLAGS. Remove reference to specific Wget version from linker flags. Add missing dependencies. 2004-05-09 David Fritz * src/mswindows.c (fake_fork): Pass entire command line to the child process. 2004-03-31 Hrvoje Niksic * src/http.c (gethttp): Fix typo: SCHEME_SSL -> SCHEME_HTTPS. 2004-03-30 Hrvoje Niksic * src/http.c (gethttp): Send the Proxy-Authorization header over non-SSL connections too. 2004-03-25 David Fritz * src/mswindows.c (fake_fork_child): Ignore error code when OpenFileMapping() fails; assume it failed because the object does not exist. 2004-03-24 David Fritz * src/mswindows.c (fake_fork): New function. * src/mswindows.c (fork_to_background): Use it. 2004-03-19 Hrvoje Niksic * src/url.c (url_parse): Decode %HH sequences in host name. 2004-03-19 David Fritz * src/mswindows.c (ws_hangup): Incorporate old fork_to_background() code. Add event name argument. (fork_to_backgorund): Now a simple wrapper around ws_hangup(). (ws_handler): Correctly handle the case when neither CTRLC_BACKGND nor CTRLBREAK_BACKGND are defined. Don't bother handling close, logoff, or shutdown events. Call ws_hangup() with the correct event name; don't assume it was CTRL+Break. 2004-03-04 Hrvoje Niksic * src/convert.c (local_quote_string): Quote "#" as "%23" and "%" as "%25" when creating links to local files. 2004-03-04 Gisle Vanem * src/retr.c (fd_read_body): Don't change console title if quiet. 2004-03-02 David Fritz * src/mswindows.c (ws_percenttitle): Guard against future changes by doing nothing if the proper variables have not been initialized. Clamp percentage value. 2004-02-25 David Fritz * src/mswindows.c (set_sleep_mode): Remove argument and return value. Call GetModuleHandle() instead of LoadLibrary()/FreeLibrary() for kernel32.dll. Use typedef for function-pointer. Don't cast l-value. Don't use dereference operator when calling through function-pointer. (ws_startup): Update call to set_sleep_mode(). (ws_cleanup): Remove call to set_sleep_mode(). 2004-02-23 David Fritz * src/mswindows.h: Ditto. * src/mswindows.c: Misc. formatting/comment tweaks throughout. 2004-02-23 David Fritz * src/http.c (http_loop): Ditto. * src/ftp.c (ftp_loop_internal): Update call to ws_changetitle(). * src/main.c (main): Don't bother calling ws_changetitle(). * src/mswindows.h (ws_changetitle): Update prototype. * src/mswindows.c (ws_changetitle): Remove second argument. Use xfree_null(). (ws_percenttitle): Only update title when percentage has changed. 2004-02-22 Hrvoje Niksic * doc/wget.texi (Contributors): Updated. 2004-02-20 David Fritz * src/main.c (print_help): Remove call to ws_help(). * src/mswindows.c (ws_help): Remove. * src/mswindows.h (ws_help): Remove. 2004-02-16 David Fritz * src/init.c (home_dir): Use aprintf() instead of xmalloc()/sprintf(). Under Windows, if $HOME is not defined, use the directory that contains the Wget binary instead of hard-coded `C:\'. (wgetrc_file_name): Under Windows, look for $HOME/.wgetrc then, if not found, look for wget.ini in the directory of the Wget binary. * src/mswindows.c (ws_mypath): Employ slightly more robust methodology. Strip trailing path separator. 2004-02-12 Jens Roesner * doc/wget.texi (Wgetrc Commands): Document `-e' here. 2004-02-09 David Fritz * configure.bat.in: Don't clear the screen. * windows/README: Add introductory paragraph. Re-word a few sentences. Correct minor typographical errors. Use consistent capitalization of Wget, SSL, and OpenSSL. Refer to Microsoft Visual C++ as MSVC instead of VC++. Mention the --msvc option to configure.bat. Reflow paragraphs. * windows/Makefile.top: Use tabs instead of spaces. Ignore errors in clean rules. Use lowercase filenames when building distribution .zip archive. * windows/Makefile.doc: Use tabs instead of spaces. Ignore errors in clean rules. * windows/Makefile.src: Clean-up clean rules. Use tabs instead of spaces. Link against gdi32.lib. Don't define SYSTEM_WGETRC. Remove unused macros. Remove anachronistic and superfluous linker flags. Don't rename wget.exe to all upper-case. Add `preprocessor' conditionals for SSL and newer MSVC options. Use batch rules. Don't suppress all warnings. 2004-02-08 Hrvoje Niksic * doc/wget.texi (Security Considerations): Put @item contents on a separate line. Reported by Ted Rodriguez-Bell. 2004-02-06 Hrvoje Niksic * src/http.c (gethttp): Respect --ignore-length. (gethttp): Inhibit keep-alive if --ignore-length is specified. 2004-02-06 Hrvoje Niksic * src/connect.c (sockaddr_set_data): Zero out sockaddr_in/sockaddr_in6. Apparently BSD-derived stacks need this when binding a socket to local address. 2004-02-06 Hrvoje Niksic * doc/wget.texi (Wgetrc Commands): Document --no-http-keep-alive and the corresponding Wgetrc command. 2004-02-04 Hrvoje Niksic * src/hash.c: Make the file compilable outside Wget source tree when -DSTANDALONE is used. 2004-01-29 Hrvoje Niksic * src/utils.c (determine_screen_width): Return 0 if not running on Windows or on a TIOCGWINSZ-capable system. 2004-01-28 David Fritz * src/utils.c (determine_screen_width): Correctly determine console width under Windows. 2004-01-28 Christian Biere * src/progress.c (bar_set_params): Fixed syntax error when HAVE_ISATTY was undefined. * src/ftp.c (ftp_loop_internal): Allocate a larger buffer in case "try" gets translated to something large. Ditto in http_loop in http.c. * src/http.c (response_header_copy): Don't write to buf[BUFSIZE]. * src/ftp-opie.c (skey_response): Eliminate unnecessary string concatenation; just call gen_md5_update twice. 2004-01-25 Hrvoje Niksic * src/utils.c (xsleep): Don't call usleep with values larger than 1,000,000. 2003-12-16 Hrvoje Niksic * src/http.c (gethttp): Fix generation of `Content-Length'. 2003-12-15 Gisle Vanem * src/url.c (url_skip_credentials): Fixed return value; 'url' if no credentials. 2003-12-14 Hrvoje Niksic * src/url.c (url_skip_credentials): Return a pointer directly. 2003-12-14 Hrvoje Niksic * src/url.c (url_file_name): Ditto. (parse_errors): Ditto. * src/retr.c (retr_rate): Ditto. * src/progress.c (create_image): Ditto. * src/netrc.c (parse_netrc): Ditto. * src/main.c (struct cmdline_option): Ditto. (redirect_output_signal): Ditto. * src/init.c (commands): Ditto. * src/ftp-ls.c (ftp_index): Ditto. * src/cookies.c (check_domain_match): Declare the pointer to a literal string as `const'. 2003-12-12 Hrvoje Niksic * src/retr.c (fd_read_body): Pass total size to progress_create, not the remaining amount. 2003-12-12 Hrvoje Niksic * src/retr.c (fd_read_body): Don't fiddle with "interactive timeout" if read timeout is unset. 2003-12-12 Hrvoje Niksic * src/connect.c (bind_local): Don't set the IPV6_V6ONLY option on the socket. 2003-12-06 Hrvoje Niksic * src/url.c (url_file_name): Respect the setting of opt.protocol_directories. * src/main.c (main): Only check for ret=='?' when longindex is unset. (option_data): New option --protocol-directories. 2003-12-06 Hrvoje Niksic * src/ftp.c (getftp): Ditto. * src/http.c (gethttp): Correctly calculate bandwidth as total data read divided with download time. * src/retr.c (fd_read_body): Separate the return values for data written and read. 2003-12-06 Hrvoje Niksic * doc/wget.texi: Document the new option `--protocol-directories'. 2003-12-06 Hrvoje Niksic * doc/wget.texi (Download Options): Don't incorrectly claim that `-O' sets the number of retries to 1. 2003-12-05 Hrvoje Niksic * src/http.c (H_REDIRECTED): Respect the HTTP/1.1 "303 See Other" response code. Suggested by Dan Razzell. 2003-11-30 Hrvoje Niksic * src/retr.c (fd_read_body): Sanitize arguments and document them better. Make sure the timer is created and updated only if necessary. Updated callers. 2003-11-30 Hrvoje Niksic * src/retr.c (fd_read_body): Report the amount of data *written* as amount_read. This is not entirely logical, but that's what the callers expect, and it's not easy to change. * src/ftp.c (ftp_loop_internal): Ditto. * src/http.c (http_loop): Be smarter about assigning restval; if we're in the nth pass of a download, simply use the information we have about how much data has been retrieved as restval. * src/ftp.c (getftp): Ditto for FTP "REST" command. * src/http.c (gethttp): When the server doesn't respect range, skip the first RESTVAL bytes of the read body. Never truncate the output file. * src/retr.c (fd_read_body): Support skipping initial STARTPOS octets. 2003-11-30 Hrvoje Niksic * src/http.c (skip_short_body): Renamed skip_body to skip_short_body; don't bother calling fd_read_body. 2003-11-30 Hrvoje Niksic * src/http.c (skip_body): New function. (gethttp): Use it to skip the body of the responses we don't care to download. That allows us to reuse the connection. (gethttp): Trust that the HEAD requests will not generate body data. * src/retr.c (fd_read_body): Don't write to OUT if it's NULL. 2003-11-29 Hrvoje Niksic * src/http.c (gethttp): Initialize SSL only the first time when SSL URL is downloaded. 2003-11-29 Hrvoje Niksic * src/cookies.c (cookie_header): Only generate the contents of the header, not the leading "Cookie: " or the trailing "\r\n". * src/http.c (gethttp): When adding headers specified with `--header', allow them to override the headers generated by Wget. * src/init.c (cmd_spec_header): Made opt.user_headers a vector. * src/http.c (request_new): New function. Returns a request structure which can be modified in various ways, most notably by adding HTTP headers to the request. (request_set_header): New function for adding the header to the request. If the header is already available, it gets replaced. (request_send): Construct and send the request. (gethttp): Use the request_* functions to generate the request. 2003-11-28 Hrvoje Niksic * src/init.c: Don't #include netinet/Winsock stuff. 2003-11-28 Hrvoje Niksic * src/http.c: Deleted the old functions header_process, header_extract_number, header_exists, header_strdup, http_process_range, http_process_none, http_process_type, and http_process_connection. * src/http.c (response_new): New function. (response_header_bounds): Ditto. (response_header_copy): Ditto. (response_header_strdup): Ditto. (response_status): Ditto. (gethttp): Use the new response_* functions to parse the response. Support HTTP/0.9 responses. 2003-11-28 Hrvoje Niksic * src/http.c (gethttp): Use the CONNECT handle to establish SSL passthrough through non-SSL proxies. 2003-11-28 Hrvoje Niksic * src/http.c (gethttp): Don't include the Proxy-Authorization header in the request tunneled through proxy. 2003-11-27 Hrvoje Niksic * src/progress.c (create_image): Don't calculate ETA if nothing has been downloaded yet, because it causes division by zero. 2003-11-27 Hrvoje Niksic * src/connect.c (bind_local): Rename sa_len to addrlen because IRIX headers define sa_len as a macro. 2003-11-26 Hrvoje Niksic * src/html-parse.c (convert_and_copy): Remove embedded newlines when AP_TRIM_BLANKS is specified. 2003-11-26 Hrvoje Niksic * src/ftp.c: Set con->csock to -1 where rbuf_uninitialize was previously used. 2003-11-26 Hrvoje Niksic * src/Makefile.in (http$o): Added the trailing backslash that was missing. 2003-11-26 Hrvoje Niksic * aclocal.m4: Don't check for AI_V4MAPPED and for AI_ALL, since Wget doesn't need them. * configure.in: Check for struct sockaddr_storage. 2003-11-21 Hrvoje Niksic * src/rbuf.c: Removed. * src/ftp-basic.c (ftp_response): Use fd_read_line. No longer use struct rbuf. Updated all callers. * src/http.c (gethttp): Use fd_read_head to read all the headers in one go. (next_header): New function. * src/retr.c (fd_read_line): New function: reads a line from FD, leaving the rest of the data unread. (fd_read_head): New function. * src/connect.c (fd_peek): New function, implements peeking. (poll_internal): New function. (fd_read): Use it. (fd_write): Ditto. (fd_peek): Ditto. (fd_register_transport): Allow registering a "peeker" callback. 2003-11-21 Hrvoje Niksic * src/headers.c: Removed. The file is no longer relevant, now that no special handling of headers is done by the rbuf code. Moved portions to http.c. 2003-11-20 Hrvoje Niksic * src/connect.c: Renamed xread/xwrite/xclose to fd_read/fd_write/fd_close. The "x" prefix is not appropriate because the semantics have diverged from read/write/close too significantly. Updated all callers. 2003-11-20 Hrvoje Niksic * src/connect.c (sock_write): It's not necessary to initialize RES. 2003-11-19 Hrvoje Niksic * src/main.c (main): Don't reference opt.ipv4_only and opt.ipv6_only if IPv6 is disabled. 2003-11-19 Hrvoje Niksic * src/connect.c (socket_has_inet6): Only compile it if IPv6 is enabled and AI_ADDRCONFIG is missing. 2003-11-18 Hrvoje Niksic * src/host.c (address_list_contains): Renamed address_list_find to address_list_contains because its result is boolean. 2003-11-18 Hrvoje Niksic * src/gen_sslfunc.c (ssl_init_prng): Warn the user when using a weak random seed. 2003-11-18 Hrvoje Niksic * src/connect.c (select_fd): Return 1 if select is not available. 2003-11-17 Hrvoje Niksic * src/main.c (print_help): Fix alignment of FTP options output. 2003-11-17 Hrvoje Niksic * src/host.c (lookup_host): Check for the ability to create IPv6 sockets here. * src/init.c (defaults): Don't auto-set --inet4-only on IPv6-less systems. 2003-11-17 Hrvoje Niksic * src/connect.c (connect_to_host): Don't reference address list after releasing it. 2003-11-16 Hrvoje Niksic * src/main.c (print_help): Fix typo in `-O' help message. Fix docs of -Y/--proxy. Fix docs of `--cookies' and `--glob'. Improve docs of --convert-links. Fix docs of SSL options. 2003-11-16 Hrvoje Niksic * src/main.c (main): Don't allow setting of both opt.ipv4_only and opt.ipv6_only. * src/init.c (defaults): Mark opt.ipv4_only specially when set automatically. 2003-11-16 Hrvoje Niksic * src/ftp-basic.c: Don't include and others because they're no longer needed. 2003-11-15 Hrvoje Niksic * src/init.c: Make sure the options are in alphabetic order! * src/host.c (lookup_host): Merge lookup_host_passive and lookup_host after all -- having both would result in some code duplication. (lookup_host): Set hints.ai_family to AF_INET if ipv4_only is requested. Likewise, set it to AF_INET6 for ipv6_only. Specify AI_ADDRCONFIG where available. (lookup_host): New flag LH_REFRESH that specifies that a cached entry for HOST should be refreshed. (cache_query): New function. (cache_store): Ditto. (cache_remove): Ditto. (forget_host_lookup): No longer necessary, replaced with static function cache_remove. 2003-11-15 Hrvoje Niksic * src/http.c (persistent_available_p): Correctly specify the endpoint argument to socket_ip_address. (gethttp): When printing the "reusing connection to..." message, specify the host name of the reused connection, not the current host name. That makes more sense because it provides a useful piece of information -- we know to which host we're supposed to connect anyway! 2003-11-15 Hrvoje Niksic * src/host.c: Don't refer to the now-removed function forget_host_lookup in the documentation of lookup_host. 2003-11-15 Hrvoje Niksic * src/host.c (lookup_host): Use AI_ADDRCONFIG only if the family is unspecified. This ensures that specifying `--no-inet4' on systems where IPv6 resolves, but doesn't work behaves the same regardless of the availability of AI_ADDRCONFIG. 2003-11-15 Hrvoje Niksic * doc/wget.texi: Documented that --dont-remove-listing is now --no-remove-listing. 2003-11-15 Hrvoje Niksic * doc/wget.texi (Portability): Update slightly. 2003-11-14 Hrvoje Niksic * src/url.c (uri_merge): Merging "foo" and "bar" should result in "bar", not in "foo/bar". (path_simplify): Don't remove empty path elements; don't special-case leading slash. (path_simplify): Don't swallow ".."'s at the beginning of string. E.g. simplify "foo/../../bar" as "../bar", not as "bar". (append_uri_pathel): Defang ".." path element upon encountering it. 2003-11-14 Hrvoje Niksic * src/main.c: Enable -4 and -6 only if IPv6 is enabled. 2003-11-14 Hrvoje Niksic * src/connect.c (register_transport): Renamed from register_extended. Explain the intended usage. 2003-11-14 Hrvoje Niksic * doc/fdl.texi: New file. * doc/wget.texi: Upgrade to GNU Free Documentation License 1.2. 2003-11-13 Hrvoje Niksic * src/wget.h (CONNECT_ERROR): Use it. * src/connect.c (unsupported_socket_family_error): New function. 2003-11-13 Hrvoje Niksic * src/init.c (defaults): Turn on opt.ipv4_only if we're compiling with IPv6, and AI_ADDRINFO is not available, and AF_INET6 sockets can't be created. 2003-11-13 Hrvoje Niksic * src/http.c (persistent_available_p): Don't attempt to talk to two different SSL sites over the same secure connection. 2003-11-13 Hrvoje Niksic * src/http.c (gethttp): Ditto. * src/ftp.c (getftp): Use retryable_socket_connect_error instead of CONNECT_ERROR. * src/wget.h (CONNECT_ERROR): Removed. * src/connect.c (retryable_socket_connect_error): New function instead of unsupported_socket_family_error. 2003-11-13 Hrvoje Niksic * src/host.c (lookup_host): Document the fact that the addresses are returned in order. 2003-11-13 Hrvoje Niksic * src/connect.c (bind_local): Renamed bindport to bind_local; return the socket directly. Updated callers. (accept_connection): Renamed acceptport to accept_connection; return the created socket directly. Updated callers. 2003-11-12 Hrvoje Niksic * src/utils.c: Use limits.h only where available. * src/hash.c: Use INVALID_PTR and INVALID_PTR_BYTE. Include limits.h. 2003-11-12 Hrvoje Niksic * configure.in: Use a more standard checking message when checking for md5.h. 2003-11-12 Hrvoje Niksic * configure.in: Tweak ansi2knr test, use : instead of true. 2003-11-12 Hrvoje Niksic * configure.in: Check for limits.h. 2003-11-11 Hrvoje Niksic * src/main.c: Added options --inet4-only and --inet6-only. 2003-11-11 Hrvoje Niksic * src/host.c (host_errstr): Use the more standard message "Unknown host". 2003-11-10 Hrvoje Niksic * src/host.c (lookup_host): Special-case the numeric addresses only in the non-IPv6 case. 2003-11-10 Hrvoje Niksic * src/connect.c (resolve_bind_address): Call lookup_host_passive. Make sure that opt.bind_address is resolved only once. * src/host.c (lookup_host_passive): New function, handles "passive" lookups. (lookup_host): Remove the passive flags. Remove the family-related flags -- use ip_default_family instead. 2003-11-10 Hrvoje Niksic * src/connect.c (connect_to_host): Use that flag to decide whether to re-resolve the host name. * src/host.c (struct address_list): Added a flag that maintains whether the connection worked at some point. 2003-11-10 Hrvoje Niksic * aclocal.m4 (WGET_STRUCT_UTIMBUF): Use AC_CHECK_TYPES instead of AC_EGREP_CPP to check for struct utimbuf. 2003-11-10 Hrvoje Niksic * aclocal.m4 (WGET_SOCKLEN_T): Use AC_COMPILE_IFELSE instead of AC_TRY_COMPILE. 2003-11-09 Hrvoje Niksic * src/main.c (option_data): Specify the command to use for --mirror. 2003-11-09 Hrvoje Niksic * src/html-url.c: Get URLs from . 2003-11-09 Hrvoje Niksic * doc/wget.texi: Remove next/previous/up node links. Makeinfo doesn't require them, and they make the document harder to modify. 2003-11-09 Hrvoje Niksic * doc/wget.texi: No longer document options -s, -C, -g, and -G. (Contributors): Update my email address. 2003-11-09 Hrvoje Niksic * configure.in: Don't attempt to use Emacs as a makeinfo substitute. 2003-11-09 Hrvoje Niksic * aclocal.m4 (WGET_WITH_NLS): Respect the user's setting of LINGUAS, e.g. `LINGUAS="en bg ja" ./configure'. 2003-11-08 Hrvoje Niksic * src/utils.c (wtimer_allocate): Bless the use of wtimer_read on a timer that has merely been allocated because get_contents() does that. (wtimer_update): Abort if the timer is not initialized. 2003-11-08 Hrvoje Niksic * src/retr.c (get_contents): Pass the timer to limit_bandwidth(). * src/utils.c (wtimer_update): New function instead of wget_elapsed; just update the timer, but don't return anything. (wtimer_read): Read and return the last known value of the timer. 2003-11-08 Hrvoje Niksic * src/progress.c (update_speed_ring): Clear the speed ring when the download stalls. * src/retr.c (get_contents): Specify 0.95s read timeout, so that the progress gauge can be updated even when data arrives very slowly or stalls. 2003-11-08 Hrvoje Niksic * src/main.c (main): Removed one-letter options `-C', `-g', `-G', and `-s'. 2003-11-08 Hrvoje Niksic * src/main.c (main): Add --dont-remove-listing for backward compatibility with previous versions. (print_help): Fix typo, spotted by Dennis Smit. 2003-11-08 Hrvoje Niksic * src/http.c (persistent_available_p): No reason for the host lookup to be silent -- it's a lookup like any other. 2003-11-08 Hrvoje Niksic * src/http.c (persistent_available_p): Instead of matching all the addresses of HOST and last host, determine the peer's IP address with socket_ip_address and see if that address is one of those HOST resolves to. * src/host.c (address_list_match_all): Removed. (address_list_find): New function, finds an IP address in the address list. * src/ftp.c (ftp_do_pasv): Get the peer's address here, and pass it to ftp_epsv so it doesn't need to call getpeername. * src/ftp-basic.c (ftp_port): Use socket_ip_address instead of getpeername. (ftp_lprt): Ditto. * src/connect.c (socket_ip_address): Replaces conaddr, generalized to either get peer's or local address. (sockaddr_get_data): Made local to this file. 2003-11-08 Hrvoje Niksic * src/hash.c (HASH_POSITION): Explicitly accept the hash function. (grow_hash_table): Extract ht->hash_function outside the loop. (hash_table_remove): Ditto. (hash_table_clear): Fill entries with 0xff to clear them. (hash_table_remove): Mark entries as deleted with the correct marker. 2003-11-08 Hrvoje Niksic * src/cookies.c (cookie_handle_set_cookie): Specify exact match for unspecified domains. 2003-11-08 Hrvoje Niksic * src/connect.c (register_extended): Check that fd >= 0. 2003-11-08 Gisle Vanem * src/ftp-basic.c: Support Windows-2000 ftp servers. Win-2000 *is* Win-NT 5.0 so calling it ST_WINNT is okay I guess. 2003-11-07 Hrvoje Niksic * src/url.c (url_parse): Allow empty ports. 2003-11-07 Hrvoje Niksic * src/main.c (print_help): Break the help string into multiple chunks. (cmdline_option): New option handler OPT_FUNCALL. Generalized HANDLE_CMD to generic DATA. (option_data): Use the new OPT_FUNCALL feature. 2003-11-07 Hrvoje Niksic * src/hash.c (NON_EMPTY): Use the all-bit-set value as the marker that the field is empty. This allows NULL pointer and 0 value to be used as keys, which is necessary for the connect.c code to work when fd==0. (hash_table_new): Fill mappings with 0xff. (grow_hash_table): Ditto. 2003-11-07 Hrvoje Niksic * src/connect.c (LAZY_RETRIEVE_INFO): Set LAST_INFO. 2003-11-07 Hrvoje Niksic * README: Remove explicit version reference, so that the file doesn't have to be updated for each new release. 2003-11-06 Hrvoje Niksic * src/main.c (init_switches): New function. Convert option_data to long_options and short_options, which can be fed to getopt_long. (main): Execute command-line options by consulting option_data. 2003-11-06 Hrvoje Niksic * src/log.h: Declare log_init, log_close, and log_request_redirect_output here. 2003-11-06 Hrvoje Niksic * src/gen_sslfunc.c (ssl_read): Implement a more correct check for EINTR. (ssl_write): Ditto. (init_ssl): Use a global SSL context. 2003-11-06 Hrvoje Niksic * src/connect.c: Updated all callers of iread/ssl_iread/iwrite/ssl_iwrite to use xread and xwrite instead. * src/rbuf.h (struct rbuf): Removed the SSL member because SSL is handled automatically by xread. * src/hash.c (ptrhash): Made private. (ptrcmp): Ditto. (inthash): Removed. * src/connect.c (select_fd): Don't set errno, leave it to the caller. * src/gen_sslfunc.c (connect_ssl): Use register_extended to register SSL callbacks for communication with SSL-enabled endpoints. (ssl_read): New function. (ssl_write): Ditto. (ssl_poll): Ditto. (ssl_close): Ditto. * src/connect.c (register_extended): New function -- register callbacks for basic socket operations. (xread): Ditto. (xwrite): Ditto. (xclose): Ditto. (sock_read): New function, default implementation for reading. (sock_write): Ditto for writing. (sock_poll): Ditto for polling. (sock_close): Ditto for closing. 2003-11-06 Hrvoje Niksic * src/connect.c (xclose): Free INFO even if it doesn't provide a closer. 2003-11-05 Hrvoje Niksic * windows/config.h.ms: MSVC doesn't have uint32_t. 2003-11-05 Hrvoje Niksic * src/cookies.c (save_cookies_mapper): Respect the setting of keep-session-cookies. (cookie_jar_load): Import session cookies. Based on code submitted by Nicolas Schodet. * src/utils.c (datetime_str): Use information in TM when it's non-NULL. * src/main.c (main): New option `--keep-session-cookies'. 2003-11-05 Hrvoje Niksic * src/connect.c (resolve_bind_address): Use a more accurate error message -- we're not binding to ANY, we're disabling bind altogether. 2003-11-05 Hrvoje Niksic * src/connect.c (connect_to_ip): More compact error checking. (bindport): Don't treat failed setsockopt as a fatal error. 2003-11-05 Hrvoje Niksic * src/connect.c (bindport): Fix compilation under pre-C99 compilers. 2003-11-05 Hrvoje Niksic * libtool.m4, ltmain.sh, config.sub, config.guess: Upgrade to libtool 1.5. 2003-11-05 Hrvoje Niksic * doc/wget.texi (HTTP Options): Document `--keep-session-cookies'. 2003-11-05 Hrvoje Niksic * configure.in: Use the Autoconf macro AC_C_PROTOTYPES instead of the old AM_C_PROTOTYPES. 2003-11-05 Hrvoje Niksic * configure.in: Substitute ANSI2KNR and U, so we can compile. 2003-11-05 Hrvoje Niksic * configure.in: Remove the broken check for socks. 2003-11-05 Dennis Smit * src/main.c: (main): added --preserve-permissions option. * src/ftp.c (ftp_retrieve_list): added support for --preserve-permissions option. * src/init.c: added support for --preserve-permission option. * src/option.h: added support for --preserve-permission option. 2003-11-04 Hrvoje Niksic * src/progress.c (progress_handle_sigwinch): Don't call determine_screen_width() from the signal handler. Instead, just set a volatile variable. (bar_create): Check whether SIGWINCH was received. (bar_update): Ditto. * src/sysdep.h: Define SYSTEM_FNMATCH only if HAVE_FNMATCH_H is true. 2003-11-04 Hrvoje Niksic * src/config-post.h: New file, included from now autogenerated config.h.in. 2003-11-04 Hrvoje Niksic * src/Makefile.in (realclean): Delete config.h.in. 2003-11-04 Hrvoje Niksic * configure.in: Use the new form of AC_OUTPUT. * Makefile.cvs (prep): Invoke autoheader. 2003-11-04 Hrvoje Niksic * configure.in: Require Autoconf 2.57. 2003-11-04 Hrvoje Niksic * configure.in: Move some checks into aclocal.m4. Check whether fnmatch.h is includable. * configure.in: Also check whether #include works before deciding to use Solaris libmd5. * configure.in: Use AC_MSG_NOTICE instead of echo. Use AC_MSG_ERROR for fatal errors. 2003-11-04 Hrvoje Niksic * configure.in: Don't misuse AC_MSG_RESULT. Use AC_MSG_NOTICE where appropriate. 2003-11-04 Hrvoje Niksic * configure.in: Check whether volatile is supported. Don't check for gethostname and uname, which are not used. 2003-11-04 Hrvoje Niksic * aclocal.m4: Ditto. * configure.in: Add description annotations to AC_DEFINE. * Makefile.in: Update maintenance targets, preparing them for the use of `autoheader'. 2003-11-03 Hrvoje Niksic * src/wget.h (N_): Don't parenthesize argument. 2003-11-03 Hrvoje Niksic * src/utils.c (xsleep): New function. Uses nanosleep where available, resuming sleeps interrupted by signals. Updated callers of sleep and usleep to use xsleep. 2003-11-03 Hrvoje Niksic * src/log.c (logvprintf): Documented better. Renamed to log_vprintf_internal to avoid confusion with the public API functions logprintf and logputs. 2003-11-03 Hrvoje Niksic * src/host.h (ADDRESS_IPV4_DATA): Don't take the address of in.s_addr because that doesn't work on machines that define it as bitfield. 2003-11-03 Hrvoje Niksic * src/ftp-basic.c (ftp_login): Remove shadowing (and bogus) declaration of SEED. 2003-11-03 Hrvoje Niksic * src/connect.c (select_fd): Generalize the third argument into WAIT, so that the caller can request waiting for both read and write. Updated callers. 2003-11-03 Hrvoje Niksic * configure.in: Look for nanosleep in -lrt and -lposix4, which is where Solaris has them. 2003-11-03 Hrvoje Niksic * configure.in: Check for nanosleep. 2003-11-02 Hrvoje Niksic * src/wget.h: Retired the `boolean' type. Moved the DEFAULT_LOGFILE define to log.h. Moved the INFINITE_RECURSION define to recur.h. * src/xmalloc.h: Renamed FREE_MAYBE to xfree_null and moved the definition from wget.h to xmalloc.h. 2003-11-02 Hrvoje Niksic * src/html-url.c (cleanup_html_url): Destroy the hash tables, don't just call free on them. (init_interesting): Use hash_table_put instead of string_set_add because we don't need the strdup that the latter function performs. * src/init.c (cleanup): Don't pass NULL to cookie_jar_delete. * src/xmalloc.c (xfree_real): Abort when passed a NULL pointer. (xfree_debug): Print at the file and line of the offending call to free. 2003-11-02 Hrvoje Niksic * src/html-parse.c (decode_entity): New function; split the decoding of entities here. (convert_and_copy): Use it to decode entities. (decode_entity): Handle the &apos entity. (decode_entity): Don't decode Latin 1 numeric entities. Don't decode �. 2003-11-01 Hrvoje Niksic * src/host.c: Removed the ip_default_family global variable. * src/host.c (lookup_host): Document the function. Fixed declaration of VEC. 2003-11-01 Hrvoje Niksic * src/ftp-opie.c (calculate_skey_response): Use uint32_t instead of `unsigned long' for the cheksum array. Document the function. 2003-11-01 Hrvoje Niksic * src/connect.c (acceptport): Don't call select_fd when timeout is not requested. 2003-10-31 Hrvoje Niksic * src/wget.h: Move declarations of malloc and logging code to xmalloc.h and log.h respectively to unclutter this file. (STRDUP_ALLOCA): Made it side-effect free. * src/xmalloc.h: New files. Define macros xnew, xnew0, xnew_array, and xnew0_array. * src/xmalloc.c: New file. Move the xmalloc routines here. 2003-10-31 Hrvoje Niksic * src/various: Use new macros xnew, xnew0, xnew_array, and xnew0_array in various places. 2003-10-31 Hrvoje Niksic * src/sysdep.h (CLOSE): Don't call close on file descriptors less than 0, i.e. on uncreated sockets. * src/connect.c (resolve_bind_address): Work on struct sockaddr directly. (connect_to_host): Replacement for connect_to_many. Resolve HOST and connect to any of its addresses. If we can't connect and the host name lookup was cached, try to resolve it again. This should fix problems with hosts behind dynamic DNS. Updated all callers. (connect_to_ip): Replacement for connect_to_one. Removed SILENT; added the argument PRINT instead. Updated all callers. (set_connection_host_name): Removed. * src/host.c (address_list_address_at): New function instead of address_list_copy_one. It returns a pointer to ip_address *, so it's not necessary to copy the data. (address_list_cached_p): New function. (forget_host_lookup): Ditto. * src/connect.c: Got rid of the MSOCK global variable. Made bindport return the local socket it creates. Added a new argument to acceptport, the socket to call accept on. Updated callers. (closeport): Removed. * src/connect.c: Moved the sockaddr code from host.c to this file, because most of that stuff is used for connecting, and has nothing to do with host names anyway. (sockaddr_set_data, sockaddr_get_data): New functions, replace the old sockaddr_set_address, sockaddr_set_port, sockaddr_get_address, and sockaddr_get_port. 2003-10-31 Hrvoje Niksic * src/host.c (address_list_from_single): Removed. (address_list_from_ipv4_addresses): Renamed from address_list_from_vector. 2003-10-31 Hrvoje Niksic * src/connect.c (sockaddr_set_data): Remove the broken code that checked for NULL address. 2003-10-30 Hrvoje Niksic * src/url.c (append_uri_pathel): New argument ESCAPED_P that says whether [B, E) is to be treated as URL-escaped or not. If ESCAPED_P is false, don't unescape the region. (url_file_name): u->file is not URL-escaped. 2003-10-30 Hrvoje Niksic * src/sysdep.h: Use `S >= 8' rather than `S == 8' when looking for large integers. 2003-10-30 Hrvoje Niksic * src/retr.c (retrieve_from_file): Use retrieve_tree for page-requisites. * src/main.c (main): Don't define opt.recursive when -p is used. Instead, make sure that recursion is used for HTTP in that case. 2003-10-29 Hrvoje Niksic * src/host.h: Defined accessors for elements of ip_address. Updated all callers. (address_list_match_all): Use memcmp in the non-IPv6 case. * src/wget.h (CONNECT_ERROR): Don't retry connecting if connect() returned EAFNOSUPPORT. 2003-10-29 Hrvoje Niksic * configure.in: Reenable IPv6 autodetection. 2003-10-27 Mauro Tortonesi * src/connect.h: changed bindport prototype and added the related BIND_ON_IPV4_ONLY and BIND_ON_IPV6_ONLY flags. * src/connect.c: changed bindport and resolve_bind_address to allow protocol-version specific DNS resolution. modified conaddr, acceptport and connect_to_one to make use of struct sockaddr_storage and of the new ip_address structure. * src/ftp-basic.c: added LPRT/LPSV (RFC1639) support, refactored PORT/PASV (RFC959) and EPRT/EPSV (RFC2428) support code. * src/ftp.c: added the ftp_do_port and ftp_do_pasv functions to handle FTP over IPv6. * src/ftp.h: changed prototype of ftp_epsv and added prototypes for ftp_lpsv, ftp_lprt and ftp_eprt. * src/host.c: renamed the wget_sockaddr_set_address and wget_sockaddr_get_addr, and wget_sockaddr_{s,g}et_port couples to sockaddr_{s,g}et_address and sockaddr_{g,s}et_port respectively. changed address_list_match_all, address_list_from_addrinfo sockaddr_len, pretty_print_address, lookup_host, sockaddr_{s,g}et_address and sockaddr_{g,s}et_port to make use of struct sockaddr_storage and of the new ip_address structure. removed map_ipv4_to_ip and map_ip_to_ipv4. * src/host.h: redefined structure ip_address and removed structure wget_sockaddr. 2003-10-26 Hrvoje Niksic * windows/config.h.mingw: Ditto. * windows/Makefile.top.mingw: Ditto. * windows/Makefile.src.mingw: New file. * windows/wget.dep: Support convert.o. * configure.bat.in: New option `--mingw'. From Chin-yuan Kuo. 2003-10-26 Hrvoje Niksic * windows/config.h.bor: DEBUG is now ENABLE_DEBUG. Borland has snprintf, but not u_int32_t. * windows/Makefile.src.bor (OBJS): Use convert.c. From Chin-yuan Kuo. 2003-10-26 Hrvoje Niksic * windows/Makefile.src.watcom (OBJS): Use convert.c. From Chin-yuan Kuo. 2003-10-26 Hrvoje Niksic * src/sysdep.h: Include inttypes.h where available. * src/host.c: Switch from u_int32_t to uint32_t. 2003-10-26 Hrvoje Niksic * src/netrc.c (parse_netrc): Reset the QUOTE flag after the closing quote. 2003-10-26 Hrvoje Niksic * doc/wget.texi (Wgetrc Commands): Fixed typo. From DervishD . 2003-10-26 Hrvoje Niksic * configure.in: Switch from u_int32_t to uint32_t. Check for inttypes.h so it's used to get the definition of uint32_t where available. 2003-10-25 Hrvoje Niksic * src/url.c (is_valid_ipv6_address): Reformat to GNU coding style. Use enums for NS_IN* constants. Use ISXDIGIT. * src/convert.c (construct_relative): Document better how the function works. 2003-10-24 Hrvoje Niksic * doc/Makefile.in (install.info): Handle the case when only wget.info is generated from wget.texi. In that case, wget.info-*[0-9] doesn't match anything and therefore ends up as a bogus value of FILE in the loop. Fix this by not calling INSTALL_DATA on nonexistent files. 2003-10-23 Hrvoje Niksic * src/version.c: Bump version. 2003-10-23 Hrvoje Niksic * src/url.c: Ditto. * src/html-parse.c (advance_declaration): Don't use trailing comma in enum because older compilers don't support it. * src/utils.c: Don't redefine HAVE_SIGSETJMP. 2003-10-23 Hrvoje Niksic * src/config.h.in: Deploy preprocessor magic to avoid Ultrix's include which defines its own u_int32_t. Reported by Bernhard Simon. 2003-10-23 Hrvoje Niksic * Makefile.in (dist): Depend on configure.bat. (realclean-top): Delete configure.bat. 2003-10-21 Hrvoje Niksic * Makefile.in (distclean-top): Remove the libtool script, because it's generated by configure. 2003-10-16 Hrvoje Niksic * src/convert.c (construct_relative): Don't handle absolute files specially -- for example, -P/tmp/foo shouldn't imply that converted files must refer to "/tmp/foo/..."! 2003-10-16 Hrvoje Niksic * configure.in: Don't check for int32_t because we're not really using it. 2003-10-15 Philip Stadermann * src/ftp.c (ftp_retrieve_glob): Correctly loop through the list whose elements might have been deleted. 2003-10-15 Hrvoje Niksic * src/http.c: Consider status 307 a valid redirect. 2003-10-13 Hrvoje Niksic * src/sysdep.h: Only define u_int32_t. 2003-10-13 Hrvoje Niksic * src/html-url.c (tag_handle_meta): Set the Refresh link to expect HTML. (append_one_url): Renamed to append_url. 2003-10-11 Hrvoje Niksic * src/utils.c (large_int_to_string): Use snprintf() to print the number. This will work even on systems where libc doesn't understand %lld, but the compiler does, because it will use our snprintf replacement. * src/init.c (parse_bytes_helper): New function. (cmd_bytes): Use it to parse bytes, but cast the result to long. (cmd_bytes_large): Ditto, but store the result to LARGE_INT. Used for --quota so that --quota=10G works even on machines without long long. * src/options.h (struct options): Declare quota as LARGE_INT. * src/retr.c (downloaded_exceeds_quota): Removed. (downloaded_increase): Ditto. (total_downloaded_bytes): New variable, replaces opt.downloaded, which was the wrong place for it anyway. Updated callers of downloaded_exceeds_quota and downloaded_increase to check this variable directly. * src/sysdep.h: Get rid of VERY_LONG_TYPE. Use LARGE_INT for the same purpose, defined as `long', `long long' or `double', depending on size of long and whether long long is available. 2003-10-11 Hrvoje Niksic * src/sysdep.h: Also check size of short for int32_t. 2003-10-11 Hrvoje Niksic * src/host.c (lookup_host): Use u_int32_t to store the result of inet_addr(). That removes the need for offset fiddling, caring about endian-ness, etc. * src/sysdep.h: Define int32_t and u_int32_t if not available. 2003-10-11 Hrvoje Niksic * src/ftp-basic.c (ftp_epsv): Use socklen_t * as the third argument to getpeername. * src/config.h.in: Define socklen_t stub. * src/host.c (sockaddr_len): Return socklen_t. * src/connect.c (conaddr): Use socklen_t as the third argument to accept, getsockname, and connect. 2003-10-11 Hrvoje Niksic * configure.in: Check for int32_t and u_int32_t. Check for SIZEOF_INT. 2003-10-10 Hrvoje Niksic * src/recur.c (retrieve_tree): Don't descend into documents that are not expected to contain HTML, regardless of their content-type. * src/html-url.c (tag_url_attributes): Record which attributes are supposed to yield HTML links that can be followed. (tag_find_urls): Propagate that information to the caller through struct urlpos. 2003-10-10 Hrvoje Niksic * src/hash.c (hash_table_new): Slightly change the meaning of the first parameter. Instead of being the minimum initial size, it is now the minimum number of items that the hash table can take without needing to resize. 2003-10-10 Hrvoje Niksic * src/hash.c (find_mapping): Return the next available mapping when the key is not found, not NULL. (hash_table_put): Use find_mapping to find the storage for the new data. (hash_table_put): Grow the table before exceeding maximum fullness, not afterwards. 2003-10-10 Hrvoje Niksic * po/tr.po: Ditto. * po/sv.po: Updated from TP. 2003-10-10 Hrvoje Niksic * aclocal.m4 (WGET_WITH_NLS): First check for gettext in libintl, then use the libc version. That way systems that get libintl.h from /usr/local/include will get the matching gettext. 2003-10-09 Hrvoje Niksic * src/html-url.c (init_interesting): Initialize interesting_tags and interesting_attributes as hash tables. This simplifies the code immensely because hash tables handle allocation and remove duplicates automatically. (find_tag): Removed. (collect_tags_mapper): Instead of calling find_tag, simply get the entry from interesting_tags hash table, which is both simpler and faster. 2003-10-09 Hrvoje Niksic * src/hash.c (hash_table_get): Declare hash-table argument as const. (find_mapping): Ditto. (hash_table_get_pair): Ditto. (hash_table_contains): Ditto. (hash_table_count): Ditto. 2003-10-09 Hrvoje Niksic * po/ft.po, po/sk.po, po/ja.po: Updated from the TP. 2003-10-09 Herold Heiko * windows/Makefile.watcom (OBJS): Ditto. * windows/Makefile.src.bor: Ditto. * windows/wget.dep: Ditto. * windows/Makefile.src: Removed references to fnmatch.c and fnmatch.o. 2003-10-08 Hrvoje Niksic * src/main.c (print_help): Fix typo; stured -> stored. 2003-10-08 Hrvoje Niksic * src/html-url.c (get_urls_html): Parse the appropriate flags to html-parse.c. * src/html-parse.c (map_html_tags): Accept FLAGS from the caller instead of examining OPT. 2003-10-08 Hrvoje Niksic * src/html-url.c (find_tag): Switch to binary search. 2003-10-08 Hrvoje Niksic * src/getopt.c: Newer version, imported from Free libit. 2003-10-08 Hrvoje Niksic * src/getopt.c: Add definitions of getopt_long and getopt_long_only. 2003-10-08 Hrvoje Niksic * src/ftp.c (has_insecure_name_p): Define it here. * src/utils.c (has_wildcards_p): Define it here. * src/sysdep.h: Declare fnmatch-related macros here, if not using system fnmatch(). Update .c files to not declare fnmatch.h directly. * src/cmpt.c (fnmatch): Moved here. Use it only under non-GNU libc. 2003-10-08 Hrvoje Niksic * src/config.h.in: Renamed DEBUG to ENABLE_DEBUG. ENABLE_DEBUG is, I think, a better name, because it implies that debugging output is merely possible, not "on by default", as might be construed from just DEBUG. 2003-10-08 Hrvoje Niksic * po/wget.pot: Recreated. 2003-10-08 Hrvoje Niksic * configure.in: Renamed DEBUG to ENABLE_DEBUG. 2003-10-07 Hrvoje Niksic * src/cookies.c (struct cookie): Remove unused backpointer to cookie jar. 2003-10-07 Hrvoje Niksic * src/cmpt.c (memmove): Comment out, since it's no longer used. * src/cookies.c (cookie_jar_generate_cookie_header): Allocate room for chains in one pass. (find_chains_of_host): Assume that the caller has allocated DEST to be sufficiently large to take all the data. (eliminate_dups): Run through the array and eliminate dups on the fly instead of using memmove. (cookie_jar_process_set_cookie): Free cookie->domain before re-setting it. 2003-10-07 Hrvoje Niksic * doc/wget.texi (HTTP Options): Documented --post-file and --post-data. 2003-10-05 Hrvoje Niksic * src/utils.c (file_size): Return -1 if fopen() returns NULL. Prior to this patch, wget --post-file=nosuchfile dumped core. 2003-10-05 Gisle Vanem * src/mswindows.c (set_sleep_mode): Fix type of _SetThreadExecutionState. 2003-10-04 Hrvoje Niksic * src/log.c: Use `...' in function definitions; ansi2knr will convert them to va_dcl. This allowed removal of the ugly VA_START_1 and VA_START_2 macros. 2003-10-04 Hrvoje Niksic * libtool.m4: New file with contents imported from libtool. * aclocal.m4: Move libtool stuff into a separate file. That leaves this file only with Wget-specific stuff. 2003-10-04 Gisle Vanem * src/mswindows.c (run_with_timeout): Use WaitForSingleObject to wait for thread termination. 2003-10-03 Hrvoje Niksic * src/html-parse.c (convert_and_copy): Move variable declarations before statements. 2003-10-03 Gisle Vanem * src/connect.c: And don't include them here. * src/mswindows.h: Include winsock headers here. 2003-10-02 Hrvoje Niksic * src/wget.h (XMALLOC_ARRAY): Removed. (ALLOCA_ARRAY): Ditto. * src/html-parse.c: Renamed alloca_p to resized. (GROW_ARRAY): Renamed DO_REALLOC_FROM_ALLOCA to GROW_ARRAY and returned it to html-parse.c, since nothing else was using it. 2003-10-02 Hrvoje Niksic * src/utils.c (alarm_set): New function; use either setitimer or alarm to set up the alarm. (alarm_cancel): New function; cancel the alarm set up by alarm_set. (run_with_timeout): Use them. 2003-10-02 Hrvoje Niksic * src/retr.c (retrieve_url): Initialize DUMMY storage for DT. Caught by valgrind. 2003-10-02 Hrvoje Niksic * src/html-parse.c (convert_and_copy): Handle numeric entities in hexadecimal, &#xHH. (convert_and_copy): Copy the contents directly to the pool without a stack-allocated intermediary. 2003-10-02 Gisle Vanem * src/mswindows.c (run_with_timeout): For Windows: Run the 'fun' in a thread via a helper function. Continually query the thread's exit-code until finished or timed out. 2003-10-01 Hrvoje Niksic * src/url.c (url_parse): Don't leak memory when a reencoded URL turns out to be invalid. * src/url.c (parse_errors): Mark error messages for translation. (url_error): Translate error messages returned to the caller. 2003-10-01 Hrvoje Niksic * src/ftp.c (ftp_loop_internal): Initialize TMRATE to NULL to shut up the compiler. 2003-10-01 Hrvoje Niksic * po/hu.po: Updated from the TP. * po/et.po: Updated from the TP. * po/ro.po: New file from the TP. 2003-10-01 Hrvoje Niksic * po/hr.po: Updated. 2003-10-01 Hrvoje Niksic * po/POTFILES.in: Added src/convert.c. 2003-10-01 Hrvoje Niksic * doc/wget.texi: Renamed prep.ai.mit.edu to ftp.gnu.org. 2003-10-01 Hrvoje Niksic * doc/wget.texi (Contributors): Updated from ChangeLog entries. 2003-09-30 Herold Heiko * windows/Makefile.src (OBJ): Fix typo. 2003-09-29 Aaron Hawley * src/ftp.c (getftp): --spider option should now work with FTP downloads. (ftp_loop_internal): quiet reports and calculations of downloads when --spider option set, nor try deleting when --delete-after also set. (ftp_loop): --spider will skip HTML-ification of .listing file. 2003-09-26 Gisle Vanem * windows/config.h.ms: Don't declare alloca under compilers that support it. * windows/config.h.ms: Define HAVE_SNPRINTF, HAVE_VSNPRINTF, and HAVE_MEMMOVE. 2003-09-26 Gisle Vanem * src/src/mswindows.c: Added ws_percenttitle() showing progress in the window titlebar. Called from retr.c. Secured ws_mypath(). * src/windows/config.h.ms: alloca() prototype not needed. Removed "#undef ENABLE_NLS"; should be in Makefile IMHO. Moved WGET_USE_STDARG from mswindows.h to config.ms.h because of #ifdef in log.c. (MSVC's vararg.h and stdarg.h are incompatible). 2003-09-26 Gisle Vanem * src/mswindows.c (read_registry): Removed. (set_sleep_mode): New function. (windows_main_junk): Call it. 2003-09-26 Gisle Vanem * src/mswindows.c (read_registry): Fix invocation of registry functions. * src/mswindows.c (read_registry): Condition definitions of sleep and usleep with not HAVE_SLEEP and HAVE_USLEEP respectively. Define HAVE_SLEEP and HAVE_USLEEP under __DMC__. 2003-09-25 Herold Heiko * windows/Makefile.src: Updated OBJ list. 2003-09-24 Hrvoje Niksic * src/url.c (url_escape_1): Revert unintentional change to lowercase xdigit escapes. (url_escape_dir): Document that this function depends on the output of url_escape_1. 2003-09-23 Hrvoje Niksic * src/progress.c (create_image): Print the current ETA if we're done with the download. (create_image): Change '-' display char to '+' in the progress bar. * src/Makefile.in (clean): Remove .libs. 2003-09-23 Hrvoje Niksic * src/cookies.c (struct cookie): New flag domain_exact. (update_cookie_field): Skip leading dot in domain. (find_matching_chains): Match numeric addresses exactly; don't needlessly copy HOST to the stack. (matching_cookie): Added argument HOST. If cookie->domain_exact is set, check that HOST is equal to cookie->domain. (cookie_jar_load): Only use TAB as delimiter. Document the meaning of DOMAIN-FLAG. Skip leading dot in domain. (cookie_jar_load): Don't ignore DOMAIN-FLAG -- instead, set domain_exact to true if DOMAIN-FLAG is false. (save_cookies_mapper): If domain_exact is false, prepend the domain with dot, like Mozilla does. 2003-09-23 Hrvoje Niksic * Makefile.in (distclean-top): Remove autom4te.cache. 2003-09-23 Hrvoje Niksic * Makefile.in (clean-top): Remove .libs. 2003-09-22 Hrvoje Niksic * src/url.c (path_simplify): Instead of calls to memmove, handle "./" and "../" by advancing pointers. 2003-09-22 Hrvoje Niksic * src/safe-ctype.h (_sch_test): The cast of BIT to unsigned char was broken -- _sch_istable bitmasks are 16-bit, not 8-bit! Cast BIT to unsigned short instead. 2003-09-22 Hrvoje Niksic * src/retr.c (getproxy): Moved from url.c. * src/convert.c: Split off link conversion from url.c into separate file. Also included the book-keeping stuff from recur.c. 2003-09-22 Hrvoje Niksic * src/progress.c (create_image): Print the initial part of the download with '-' characters, analogous to how dot progress prints the initial part with ','. * src/hash.c (ptrhash): New function. (ptrcmp): Ditto. (hash_table_new): Default to identity hash table. 2003-09-21 Matthew J. Mellon * src/http.c (gethttp): Recognize content-type "application/xhtml+xml" as what Wget considers "text/html". 2003-09-21 Hrvoje Niksic * src/main.c (main): Use setoptval() for setting the options. Use run_command for `-e'. * src/init.c (parse_line): Rewritten to return COMIND right away. Changed linkage to static. (run_wgetrc): Use the available comind when calling setval, so it doesn't have to be computed twice. (setval_internal): New function, runs the command's action without any error checking. (setoptval): New function, does what setval used to do, but exits in case of error. (run_command): New function. 2003-09-21 Hrvoje Niksic * src/init.c: Improved documentation of functions. (cmd_boolean): Attempt to make code that tries to avoid calling strcmp for "speed" a bit more readable. * src/init.c (simple_atof): Report error on encountering non-digit, non-"." character. (simple_atoi): Replacement for myatoi(), calling interface compatible with simple_atof. Updated myatoi's callers. 2003-09-21 Hrvoje Niksic * src/init.c (simple_atof): New function. (cmd_time): Use it. (cmd_bytes): Accept things like "1.5k" and such. Use simple_atof to parse decimals. * src/retr.c (limit_bandwidth): Adjust each sleep by the error of the previous one. 2003-09-21 Hrvoje Niksic * src/connect.c (select_fd): Change MAXTIME's type to double. Handle its decimal part. * src/retr.c (sleep_between_retrievals): In the random-wait case, use random_float() to wait between 0 and 2*opt.wait seconds. * src/utils.c (run_with_timeout): Accept `double' timeouts. Correctly handle timeout values in (0, 1) range. (random_float): New function. * src/options.h (struct options): Change the types of wait, waitretry, and timeout to double. * src/init.c (cmd_time): Accept floating point time. 2003-09-21 Hrvoje Niksic * src/connect.c (connect_with_timeout): Made timeout type double. * src/options.h (struct options): New members read_timeout, dns_timeout, and connect_timeout. Use them. 2003-09-21 Hrvoje Niksic * doc/wget.texi: Documented the new timeout options. 2003-09-21 Bertrand Demiddelaer * src/url.c (path_simplify): Would read two bytes past the end of the string in the "./" case. 2003-09-21 Aaron S. Hawley * doc/wget.texi: Split version to version.texi. Tweak documentation's phrasing and markup. 2003-09-20 Hrvoje Niksic * src/url.c (uri_merge): Get rid of uri_merge_1. (uri_merge): Merge "foo//", "bar" as "foo//bar", not "foo///bar", i.e. don't add an extra slash merely because BASE ends with two slashes. (parse_credentials): Renamed from parse_uname. Rewrittern in standard [beg, end) calling style. (url_skip_credentials): Renamed from url_skip_uname. Made static. (url_skip_credentials): Include # and ; as terminators. Old code would mistakenly consider "http://foo.com#hniksic@iskon.hr" to contain a username. (url_skip_scheme): Removed because it was unused. (url_has_scheme): Require "scheme" to be at least one char long. 2003-09-20 Hrvoje Niksic * src/retr.c (get_contents): Cosmetic fixes. 2003-09-19 Hrvoje Niksic * src/wget.h (BOUNDED_TO_ALLOCA): Evaluate PLACE only once. (ARRAY_SIZE): Renamed to countof. All callers updated. 2003-09-19 Hrvoje Niksic * src/url.c (url_file_name): Expect NULL dir_prefix. * src/init.c (cmd_file): Use a macro to prevent multiple #ifdef WINDOWS. (defaults): Set dir_prefix to NULL by default. 2003-09-19 Hrvoje Niksic * src/safe-ctype.h (_sch_test): Cast BIT to unsigned char, like latest gcc does. 2003-09-19 Hrvoje Niksic * src/main.c (main): New option --strict-comments. * src/html-parse.c (find_comment_end): New function: simple BM search for "-->". (map_html_tags): Use it if looking at a comment and not in strict comments mode. 2003-09-19 Hrvoje Niksic * doc/wget.texi: Changed @itemx not preceded by @item to @item. 2003-09-18 Hrvoje Niksic * src/safe-ctype.h: Don't #define ctype.h macros to errors because that loses when someone #include's ctype.h after safe-ctype.h. 2003-09-18 Hrvoje Niksic * src/retr.c (get_contents): Pass the correct argument to ssl_iread. 2003-09-17 Hrvoje Niksic * src/url.c: Undef U, W, C after use. 2003-09-17 Hrvoje Niksic * src/url.c (url_escape_1): New function. (url_escape): Use it. (sync_path): Handle pathological cases where u->file and u->dir contain really strange characters. (ENCODE): Deleted. (REENCODE): Deleted. 2003-09-17 Hrvoje Niksic * src/init.c (cmd_spec_restrict_file_names): Allow the OS setting to be augmented by ",nocontrol" which means don't escape the control characters, but otherwise keep OS settings. * src/url.c (file_unsafe_char): Deleted. (append_uri_pathel): Query filechr_table directly. (filechr_table): Separated Unix, Windows, and control-unsafe characters. 2003-09-17 Hrvoje Niksic * ltmain.sh, aclocal.m4: Upgrade to libtool 1.4.3. Libtool 1.5 has been out for a while now, but it can wait until after Wget 1.9 is released. 2003-09-17 Hrvoje Niksic * install-sh, mkinstalldirs: Updated from Autoconf 2.57. 2003-09-17 Hrvoje Niksic * doc/wget.texi (Download Options): Explain new --restrict-file-names semantics. 2003-09-17 Hrvoje Niksic * doc/wget.texi (Download Options): Explain how --tries works by default. 2003-09-17 Hrvoje Niksic * config.sub: Ditto. * config.guess: Updated from Autoconf 2.57. 2003-09-17 Aurelien Marchand * src/ftp.h: Added OS400 system in enum * src/ftp-basic.c: recognize OS400 systems * src/ftp.c: don't prepend the CWD if talking to OS400, since it breaks the change in library 2003-09-16 Noel Kothe * doc/wget.texi (Download Options): Fix misspelling. 2003-09-16 Mauro Tortonesi * configure.in, aclocal.m4: Added proper IPv6 detection. 2003-09-16 Hrvoje Niksic * util/dist-wget: Fixed portable echo checking under Bash. 2003-09-16 Hrvoje Niksic * src/utils.c (wtimer_sys_diff): Convert the time difference to signed __int64, then to double. This works around MS VC++ 6 which can't convert unsigned __int64 to double directly. 2003-09-16 Hrvoje Niksic * src/url.c (url_file_name): Don't reallocate FNAME if the file doesn't exist, as is usually the case. * src/utils.c (unique_name): New flag allow_passthrough. 2003-09-16 Hrvoje Niksic * src/http.c (post_file): Don't pad the file if it's not large enough. Bail out instead. 2003-09-16 Hrvoje Niksic * src/Makefile.in (clean): Also remove the core. files produced by recent Linux systems. 2003-09-16 Hrvoje Niksic * doc/wget.texi: Set the man page title to a string more descriptive than "Wget manual". 2003-09-16 Hrvoje Niksic * doc/Makefile.in ($(TEXI2POD)): Update only the #! line. * doc/texi2pod.pl: New version from Gcc. 2003-09-16 Hrvoje Niksic * configure.in: Change AC_CHECK_FUNC(getaddrinfo...) to AC_CHECK_FUNCS, which automatically defines HAVE_GETADDRINFO. 2003-09-16 Hrvoje Niksic * Makefile.in (all): Don't build configure.bat by default. 2003-09-15 Nicolas Schodet * doc/wget.texi (Download Options): Add link to Proxies. 2003-09-15 Hrvoje Niksic * src/wget.h (XDIGIT_TO_XCHAR): Implement as index into a literal string. (XDIGIT_TO_xchar): Ditto. 2003-09-15 Hrvoje Niksic * src/utils.c: Change the type of timer-related functions from long to double, for better precision. On machines supporting gettimeofday the timers now work with granularity of less than one millisecond. 2003-09-15 Hrvoje Niksic * src/url.c (strpbrk_or_eos): Implement as a macro under Gcc. 2003-09-15 Hrvoje Niksic * src/retr.c (get_contents): Reduce the buffer size to the amount of data that may pass through for one second. This prevents long sleeps when limiting bandwidth. * src/connect.c (connect_to_one): Reduce the socket's RCVBUF when bandwidth limitation to small values is requested. 2003-09-15 Hrvoje Niksic * src/progress.c (update_speed_ring): Moved the speed ring update to a separate function and documented it better. * src/progress.c: Use `double' for most timers to support granularity smaller than 1ms. 2003-09-15 Hrvoje Niksic * src/cookies.c (parse_set_cookies): Fixed the parser to handle more edge conditions. (test_cookies): New function, contains a test suite for parse_set_cookies. 2003-09-15 Hrvoje Niksic * src/cookies.c (parse_set_cookies): Allow trailing space in set-cookies header. Also, allow any amount of whitespace, not only one character. Allow empty set-cookies header without spewing an error. 2003-09-14 Hrvoje Niksic * src/utils.c (wtimer_sys_set): Extracted the code that sets the current time here. (wtimer_reset): Call it. (wtimer_sys_diff): Extracted the code that calculates the difference between two system times here. (wtimer_elapsed): Call it. (wtimer_elapsed): Don't return a value smaller than the previous one, which could previously happen when system time is set back. Instead, reset start time to current time and note the elapsed offset for future calculations. The returned times are now guaranteed to be monotonically nondecreasing. 2003-09-14 Hrvoje Niksic * src/url.c (append_uri_pathel): Use opt.restrict_file_names when calling file_unsafe_char. * src/init.c: New command restrict_file_names. * src/main.c (main): New option --restrict-file-names[=windows,unix]. * src/url.c (url_file_name): Renamed from url_filename. (url_file_name): Add directory and hostdir prefix here, not in mkstruct. (append_dir_structure): New function, does part of the work that used to be in mkstruct. Iterates over path elements in u->path, calling append_uri_pathel on each one to append it to the file name. (append_uri_pathel): URL-unescape a path element and reencode it with a different set of rules, more appropriate for handling of files. (file_unsafe_char): New function, uses a lookup table to decide whether a character should be escaped for use in file name. (append_string): New utility function. (append_char): Ditto. (file_unsafe_char): New argument restrict_for_windows, decide whether Windows file names should be escaped in run-time. * src/connect.c: Include to get prototype for abort(). 2003-09-14 Hrvoje Niksic * doc/wget.texi (Download Options): Document the new option --restrict-file-names and the corresponding wgetrc command. 2003-09-10 Hrvoje Niksic * src/init.c: New command dns_cache. * src/main.c (main): New option --dns-cache[=off]. 2003-09-10 Hrvoje Niksic * src/init.c (cmd_boolean): Accept yes/no along with on/off. (cmd_lockable_boolean): Ditto. 2003-09-10 Hrvoje Niksic * src/host.c (lookup_host): Print the result of the DNS lookup. 2003-09-10 Hrvoje Niksic * doc/wget.texi (Download Options): Documented new option --dns-cache. 2003-09-09 Hrvoje Niksic * src/url.c (url_parse): Return an error if the URL contains a [...] IPv6 numeric address and we don't support IPv6. 2003-09-09 Hrvoje Niksic * src/config.h.in: Initialize HAVE_GETADDRINFO and ENABLE_IPV6. * src/all: Use #ifdef ENABLE_IPV6 instead of the older INET6. Use HAVE_GETADDRINFO for getaddrinfo-related stuff. 2003-09-09 Hrvoje Niksic * configure.in, aclocal.m4: Added configure check for IPv6 and getaddrinfo. From Daniel Stenberg. 2003-09-05 Mauro Tortonesi src/url.c: added RFC 2732 compliance for URL parsing. The functions is_*_address valid are a modified version of glibc 2.3.2 inet_pton's code. 2003-09-05 Maciej W. Rozycki * configure.in: Additional M4 quoting. 2003-09-05 Hrvoje Niksic * src/url.c (is_valid_ipv6_address): Modified to not require zero-terminated strings. (is_valid_ipv4_address): Ditto. 2003-09-04 Hrvoje Niksic * aclocal.m4, configure.in: Made them work under Autoconf 2.5x. 2003-09-03 Ahmon Dancy * src/main.c init.c options.h: Added --retry-connrefused option so that Connection Refused failures are treated as non-fatal (when trying to retrieve from busy servers). * src/wget.h: New CONNECT_ERROR macro for encapsulating this modification. * src/ftp.c http.c : Use CONNECT_ERROR macro in places where ECONNREFUSED was checked. 2003-03-09 Nicolas Schodet * Makefile.in: Fixed bad configure.bat scrdir. 2003-02-25 Hrvoje Niksic * libtool.m4, ltmain.sh, config.sub, config.guess: Upgrade to libtool 1.5.14. 2003-02-24 Hrvoje Niksic * configure.in: Don't check for AI_ADDRCONFIG here, it is checked for in the source directly. 2003-02-23 Hrvoje Niksic * libtool.m4, ltmain.sh, config.sub, config.guess: Upgrade to libtool 1.5.8. 2003-01-11 Ian Abbott * src/ftp.c (ftp_retrieve_glob): Reject insecure filenames as determined by calling new function has_insecure_name_p. This is based on a patch by Red Hat. * src/fnmatch.c (has_insecure_name_p): New function: returns non-zero if filename starts with `/' or contains `../' and is therefore considered insecure. * src/fnmatch.h: Declare has_insecure_name_p(). 2002-08-03 Hrvoje Niksic * src/init.c (cmd_file): Allocate RESULT correctly. 2002-07-24 Hrvoje Niksic * src/recur.c (retrieve_tree): Check whether downloaded_html_set is non-NULL before using it. 2002-05-27 Ian Abbott * windows/config.h.bor: Do #define WGET_USE_STDARG. 2002-05-27 Hrvoje Niksic * src/html-parse.c (NAME_CHAR_P): Allow almost any character here. 2002-05-24 Hrvoje Niksic * src/progress.c (bar_set_params): Fall back to dot progress if the terminal type is "emacs". 2002-05-20 Hrvoje Niksic * windows/config.h.ms: Ditto. * windows/config.h.bor: Don't #define __STDC__. 2002-05-20 Hrvoje Niksic * src/log.c: Don't #undef WGET_USE_STDARG. 2002-05-18 Hrvoje Niksic * ALL: Update the license to reflect the OpenSSL exception. 2002-05-16 Ian Abbott * src/url.c (schemes_are_similar_p): New function to test enumerated scheme codes for similarity. * src/url.h: Declare it. * src/recur.c (download_child_p): Use it to compare schemes. This also fixes a bug that allows hosts to be spanned (without the -H option) when the parent scheme is https and the child's is http or vice versa. 2002-05-16 Ian Abbott * src/recur.c (download_child_p): Minor optimization to avoid an unnecessary additional call to schemes_are_similar_p function. 2002-05-16 Hrvoje Niksic * src/hash.c (prime_size): Store the offset of the prime number in the prime table. When searching, start with the given offset. (hash_table_new): Pass the pointer to ht->prime_offset to prime_size. (grow_hash_table): Ditto. (prime_size): Make 13 the first prime to make empty hash tables slightly smaller. 2002-05-14 Bill Richardson * src/ftp.c (getftp): Don't ftruncate stdout. * src/http.c (gethttp): Don't ftruncate stdout. 2002-05-09 Ian Abbott * src/cmpt.c (strptime_internal): Synched with glibc-2.1.3. (get_number): Ditto. (get_alt_number): Ditto. (__isleap): New function-like macro used by strptime. (day_of_the_week): New function used by strptime. (day_of_the_year): Ditto. (__mon_yday): Now shared by mktime and strptime implementations. 2002-05-08 Hrvoje Niksic * src/cookies.c (check_domain_match): Use match_tail in case insensitive mode. * src/utils.c (match_tail): Allow the caller to specify case insensitive mode. * src/cookies.c (store_cookie): When expiry_time is 0, print it as undefined, not indefinite. 2002-05-07 Ian Abbott * src/cookies.c (cookie_jar_process_set_cookie): Do not store discarded cookie. 2002-04012 Ian Abbott * src/http.c (http_loop): Compensate for MS Windows two-second granularity of file modification time when comparing timestamps. * src/ftp.c (ftp_retrieve_list): Ditto. 2002-04-24 Hrvoje Niksic * doc/wget.texi (Robot Exclusion): Explain how to turn off the robot exclusion support from the command line. (Wgetrc Commands): Explain that the `robots' variable also takes effect on the "nofollow" matching. 2002-04-23 Ian Abbott * windows/config.h.ms: Accounted for MSVC not defining `__STDC__' when Microsoft's extensions are enabled and define it anyway (set to `1'). Defined some things that broke as a result of this. 2002-04-21 Thomas Lussnig * src/gen_ssl.c: - allow checking of server cert - allow defining client cert type - allow limit of ssl protocol - check more return values - added debug message on break 2002-04-21 Hrvoje Niksic * src/recur.c (download_child_p): Revert order of items in check number 6 for clarity. 2002-04-21 Hrvoje Niksic * src/cookies.c (check_domain_match): Allow cookies to be set for subdomains of unknown top-level domains under some circumstances. 2002-04-20 Hrvoje Niksic * src/init.c: Ditto. * src/main.c: Ditto. * src/http.c: Use the new interface. * src/cookies.c: Provide an OO-style "cookie jar" interface to enable separate cookie jars. * src/http.c (http_atotm): Declare argument as const. 2002-04-20 Hrvoje Niksic * src/cookies.c (cookie_new): Default to PORT_ANY. (find_cookie_chain_exact): Only search by DOMAIN. (find_matching_cookie): Also check that PORT matches. (store_cookie): Only match the domain. (set_cookie_header_cb): When a cookie "fakes" a domain, assume it is valid for that host rather than discarding it completely. (find_matching_chains): Don't search by PORT. (matching_cookie): Also match PORT. (load_cookies): Set the port if specified, otherwise leave it as ANY. (save_cookies_mapper): Save the port if specified, otherwise leave it empty. 2002-04-20 Hrvoje Niksic * po/de.po: Updated from the TP. 2002-04-19 Thomas Lussnig * src/init.c: The option `egdfile' was not in sort order. 2002-04-16 Hrvoje Niksic * src/http.c (gethttp): If Content-Type is not given, assume text/html. 2002-04-16 Hrvoje Niksic * src/ftp.c (getftp): Treat directories that begin with : as absolute. (getftp): Strip trailing slashes from con->id before merging it with TARGET. 2002-04-15 Ian Abbott Makefile.in: Updated several dependencies for object files to take account of nested include files. 2002-04-15 Ian Abbott Makefile.in: The target `connect$o' (connect.o) now depends on `utils.h' 2002-04-15 Ian Abbott * src/host.c (SET_H_ERRNO): New function-like macro to set `h_errno'. (gethostbyname_with_timeout): Use it. * src/utils.c: Don't define `SETJMP()', `run_with_timeout_env' or `abort_run_with_timeout()' when `USE_SIGNAL_TIMEOUT' is undefined. 2002-04-15 Hrvoje Niksic * src/recur.c (download_child_p): Don't ignore rejection of HTML documents that are themselves leaves of recursion. 2002-04-15 Hrvoje Niksic * src/host.c (getaddrinfo_with_timeout): New function. (gethostbyname_with_timeout): Ditto. (lookup_host): Use them. 2002-04-15 Hrvoje Niksic * po/hr.po: Editing the Project-Id-Version to say "wget" rather than "GNU Wget". 2002-04-15 Hrvoje Niksic * doc/wget.texi (Download Options): Fix the documentation of `--progress'. 2002-04-14 Hrvoje Niksic * src/utils.c (number_to_string): Handle the case when n < -INT_MAX. 2002-04-14 Hrvoje Niksic * src/retr.c (retrieve_url): Make sure that POST is not honored for redirections. * src/http.c (gethttp): Send the POST data when requested. (post_file): New function. (gethttp): Use it. * src/main.c (main): Ditto. * src/init.c: Add new options. * src/options.h (struct options): New options post_data and post_file_name. 2002-04-14 Hrvoje Niksic * src/main.c (print_help): Document `--post-data' and `--post-file'. 2002-04-14 Hrvoje Niksic * src/init.c (comind): Use a marginally faster implementation of binary search. To quote Martin Buchholz, "a nanosecond saved is a nanosecond earned." 2002-04-14 Hrvoje Niksic * src/http.c (gethttp): Ditto. * src/retr.c (retrieve_url): Initialize variables to appease the compiler. * src/gen_sslfunc.c (ssl_iread): Don't handle EINTR when calling select_fd. (ssl_iwrite): Ditto. * src/connect.c (select_fd): Rewrite to handle EINTR. Set errno to ETIMEDOUT in case of timeout. (iread): No need to handle EINTR when calling select_fd. (iwrite): Ditto. 2002-04-14 Hrvoje Niksic * src/connect.c (connect_with_timeout): Firing SIGALRM can result in connect() exiting with EINTR. Treat EINTR the same as ETIMEDOUT. 2002-04-14 Hrvoje Niksic * doc/wget.texi (Wgetrc Commands): Document `--limit-rate'. 2002-04-13 Hrvoje Niksic * src/url.c (getproxy): Accept a struct url argument. This obviates the need for USE_PROXY_P. * src/retr.c (retrieve_url): Allow proxy to be a non-FTP URL. * src/ftp.c (getftp): Recognize FWTK-style proxy. 2002-04-13 Hrvoje Niksic * src/connect.c (connect_with_timeout): Use it. * src/utils.c (run_with_timeout): New function. 2002-04-12 Ian Abbott * windows/config.h.bor: Account for Borland not defining `__STDC__' when Borland's extensions enabled, and define it anyway. 2002-04-12 Ian Abbott * windows/Makefile.src.bor: Removed pre-compiled header options as they increase build time (on my machine). 2002-04-12 Ian Abbott * src/utils.c (has_html_suffix_p): New function to test filename for common html extensions. * src/utils.h: Declare it. * src/http.c (http_loop): Use it instead of previous test. * src/retr.c (retrieve_url): Ditto. * src/recur.c (download_child_p): Ditto. 2002-04-12 Ian Abbott * src/Makefile.in: Specify libtool mode explicitly when linking. 2002-04-12 Hrvoje Niksic * src/connect.c (connect_with_timeout): New function. (connect_to_one): Use it. * src/config.h.in: Add stubs for HAVE_SIGSETJMP, HAVE_SIGBLOCK, and HAVE_SETJMP_H. 2002-04-12 Hrvoje Niksic * src/config.h.in: Only define _VA_LIST when compiled with gcc. 2002-04-12 Hrvoje Niksic * src/config.h.in: Define _VA_LIST on Solaris to prevent stdio.h from declaring va_list. From Kevin Rodgers . 2002-04-12 Hrvoje Niksic * configure.in: Check for . Check for sigsetjmp and sigblock. 2002-04-11 Hrvoje Niksic * src/url.c (url_filename): Use compose_file_name regardless of whether opt.dirstruct is set. (mkstruct): Don't handle the query and the reencoding of DIR; that is done in compose_file_name. 2002-04-11 Hrvoje Niksic * src/progress.c (struct progress_implementation): Use PARAMS when declaring the parameters of *create, *update, *finish, and *set_params. * src/netrc.c: Ditto. * src/http.c: Reformat some function definitions so that ansi2knr can read them. * src/hash.c (struct hash_table): Use the PARAMS macro around parameters in the declaration of hash_function and test_function. (prime_size): Spell 2580823717UL and 3355070839UL as (unsigned long)0x99d43ea5 and (unsigned long)0xc7fa5177 respectively, so that pre-ANSI compilers can read them. (find_mapping): Use PARAMS when declaring EQUALS. (hash_table_put): Ditto. * src/ftp.h: Wrap the parameters of ftp_index declaration in PARAMS. * src/cookies.c (cookie_new): Use (unsigned long)0 instead of 0UL, which was unsupported by pre-ANSI compilers. From Nelson H. F. Beebe , for the most part. 2002-04-11 Hrvoje Niksic * src/progress.c (bar_create): If INITIAL is larger than TOTAL, fix TOTAL. (bar_finish): Likewise. 2002-04-11 Hrvoje Niksic * src/log.c: Set WGET_USE_STDARG if __STDC__ is defined and stdarg.h is present. 2002-04-11 Hrvoje Niksic * src/html-url.c (tag_handle_form): New function. Pick up form actions and mark them for conversion only. 2002-04-10 Hrvoje Niksic * src/wget.h: Ditto for extern char *exec_name. * src/options.h: Don't guard against OPTIONS_DEFINED_HERE -- it is perfectly legal to follow an `extern' with a non-`extern' ones, provided the types match. * src/main.c: Don't define OPTIONS_DEFINED_HERE. 2002-04-10 Hrvoje Niksic * src/progress.c (create_image): Revert to calculating ETA based on average download speed. (create_image): Don't print ETA until the download has been active for at least 3 seconds. (create_image): When ETA is not available, don't print anything. The previous version would print --:--. 2002-04-10 Hrvoje Niksic * src/progress.c (bar_update): Keep updating a subinterval until it reaches or exceeds a watermark. That way the measurement will be guaranteed to span a configurable minimum of time. The current default is 3s in 30 100ms intervals. 2002-04-10 Hrvoje Niksic * doc/wget.texi: Warn about the dangers of specifying passwords on the command line and in unencrypted files. 2002-04-09 Ian Abbott * windows/config.h.bor: define `HACK_BCC_UTIME_BUG'. Define `utime' as `borland_utime' if `HACK_BCC_UTIME_BUG' is defined. 2002-04-09 Ian Abbott * src/mswindows.c (borland_utime): New function conditionally defined when `HACK_BCC_UTIME_BUG' is defined. A reimplementation of `utime()' as Borland's `utime()' function is broken on Windows 9x systems. (Original patch by Chin-yuan Kuo .) 2002-04-09 Hrvoje Niksic * src/progress.c (bar_update): Maintain an array of the time it took to perform previous 30 network reads. (create_image): Calculate the download speed and ETA based on the last 30 reads, not the entire download. (create_image): Make sure that the ETA is not changed more than once per second. 2002-04-08 Hrvoje Niksic * src/ftp.c (ftp_loop): Propagate the result of ftp_retrieve_glob. 2002-03-26 Ian Abbott * windows/wget.dep: Updated several dependencies for object files. 2002-03-26 Ian Abbott * src/Makefile.in: Updated several dependencies for object files. 2002-03-20 Ian Abbott * windows/config.h.bor: * windows/config.h.ms: Removed conditional cruft that was there for Unix-like systems. 2002-03-20 Ian Abbott * src/mswindows.c: Include "utils.h". 2002-03-20 Ian Abbott * * windows/wget.dep: Fix dependencies for target mswindows$o (mswindows.obj) 2002-03-19 Chin-yuan Kuo * configure.bat.in: Do not check %BORPATH% as C++Builder compiler does not use it. * windows/Makefile.src.bor: * windows/config.h.bor: Migrated to free (as in beer) C++Builder compiler. 2002-03-18 Ian Abbott * src/host.h: Don't include netdb.h on windows. 2002-03-13 Ian Abbott * configure.bat: Removed (renamed to configure.bat.ini). * configure.bat.in: New (renamed from configure.bat). * Makefile.in: Add rule to copy configure.bat.in to configure.bat, converting line endings to MS-DOS format in the process. 2002-02-19 Hrvoje Niksic * src/url.c (url_parse): Don't treat '?' as query string separator when parsing FTP URLs. 2002-02-19 Hrvoje Niksic * src/recur.c (retrieve_tree): Handle the case when start_url doesn't parse. 2002-02-19 Hrvoje Niksic * src/host.h: Include Unix-specific includes #ifndef WINDOWS. Patch originally provided by Christian Lackas. 2002-02-19 Andreas Damm * src/wget.h (DO_REALLOC_FROM_ALLOCA): Multiply with sizeof(type) when calling xmalloc and memcpy. 2002-02-19 Alan Eldridge * src/host.h: Also include and . * src/ftp-basic.c: Also include . 2002-02-11 Christian Lackas * src/recur.c: recurive downloading for https fixed. 2002-02-05 Ian Abbott * src/http.c (gethttp): when -c used, mark already fully retrieved file as successfully retrieved. 2002-02-01 Hrvoje Niksic * src/html-url.c (tag_handle_meta): Don't crash on where content is missing. 2002-01-31 Herold Heiko * src/ftp-basic.c, host.c: don't include sys/socket.h, arpa/inet.h, netdb.h on windows. 2002-01-30 Hrvoje Niksic * src/retr.c (retrieve_url): Remove redirection cycle detection. This is because some sites legitimately redirect the user back to the same location, e.g. after an authorization check performed by another page. MAX_REDIRECTIONS is still used to prevent infinite redirection loops. 2002-01-26 Hrvoje Niksic * src/url.c (urlchr_table): Make square braces reserved, so we can parse http://[::1]/. (url_parse): Handle host in braces. (url_string): If url->host contains colons, wrap it in braces. 2002-01-26 Hrvoje Niksic * src/url.c (url_parse): Allow all hex digits, not only decimal ones, to form an IP address. 2002-01-26 Hrvoje Niksic * src/http.c (gethttp): Wrap host name in square brackets if it contains a colon. 2002-01-24 Hrvoje Niksic * src/source: Integrated IPv6 support. Written by Thomas Lussnig . 2002-01-24 Hrvoje Niksic * src/host.c (lookup_host): Use sizeof(ip4_address) to calculate the offset. (address_list_new): Use map_ipv4_to_ip. (wget_sockaddr_set_address): Convert ADDR to IPv4 before using it in IPv4 context. 2002-01-24 Hrvoje Niksic * src/connect.c (resolve_bind_address): New function. (connect_to_one): Use it. (bindport): Ditto. * src/init.c: Don't resolve bind-address here. * src/host.c (wget_sockaddr_set_address): Would bug out with ADDR == NULL. 2002-01-17 Hrvoje Niksic * src/netrc.c (parse_netrc): Skip leading whitespace before testing whether the line is empty. Empty lines still contain the line terminator. 2002-01-15 Ian Abbott * src/init.c (cmd_file): Change `\' to `/' for Windows (yes, really!) (cmd_directory): New function. Like cmd_file(), but strips trailing directory separators. (commands): Change action for "dirprefix" from `cmd_file' to `cmd_directory'. * src/utils.c (make_directory): Allow intermediate `mkdir' calls to fail, as not all path components that do not exist should be directory components, especially under Windows. 2002-01-15 Hrvoje Niksic * src/gen_sslfunc.c (ssl_iread): Call select on the file descriptor only if no data is pending in SSL buffers. From tony@bluetail.com. 2002-01-15 Hrvoje Niksic * MACHINES: OS X entry by Jonathan Davis. 2002-01-14 Hrvoje Niksic * src/url.c (parse_uname): URL-decode *USER and *PASSWD. 2002-01-14 Hrvoje Niksic * src/http.c (gethttp): Invalidate SOCK if get_contents encountered an error. 2002-01-14 Hrvoje Niksic * src/headers.c (header_get): Strip trailing whitespace from the header. 2002-01-07 Ian Abbott * src/url.c (uri_merge_1): Deal with "net path" relative URL (one that starts with "//"). 2001-12-24 Hrvoje Niksic * src/version.c: Wget 1.8.1 is released. 2001-12-19 Hrvoje Niksic * src/version.c: Wget 1.8.1-pre3 is released. 2001-12-19 Hrvoje Niksic * src/recur.c (retrieve_tree): Enqueue the canonical representation of start_url, so that the test against dl_url_file_map works. 2001-12-19 Hrvoje Niksic * src/log.c (logputs): Check for requested verbosity before printing anything. 2001-12-19 Hrvoje Niksic * src/html-url.c (tag_handle_link): Treat the "shortcut icon" link as inline. 2001-12-19 Csaba Raduly * windows/Makefile.watcom: add gen-md5.obj and progress.obj to the list of "sources" * configure.bat: add section for Watcom 2001-12-18 Ian Abbott * src/recur.c (retrieve_tree): Pass on referring URL when retrieving recursed URL. 2001-12-18 Hrvoje Niksic * src/recur.c (retrieve_tree): Make a copy of file obtained from dl_url_file_map because the code calls xfree(file) later. 2001-12-18 Hrvoje Niksic * src/recur.c (register_html): Maintain a hash table of HTML files along with the list. Disallow duplicates. (retrieve_tree): Use downloaded_html_set to check whether the file found in dl_url_file_map is an HTML file, and descend into it if so. (convert_all_links): Don't guard against duplicates in downloaded_html_list, since they are no longer possible. 2001-12-17 Hrvoje Niksic * src/version.c: Wget 1.8.1-pre2 is released. 2001-12-17 Hrvoje Niksic * src/retr.c (sleep_between_retrievals): Simplify indentation. 2001-12-17 Hrvoje Niksic * src/gen_sslfunc.c (ssl_init_prng): Use random_number to get a byte of "randomness" at a time. (ssl_init_prng): Don't seed the PRNG; random_number will do that. * src/retr.c (sleep_between_retrievals): Use it. Make sure that the random amount averages in opt.wait. (sleep_between_retrievals): Don't seed the PRNG; random_number will do that. * src/utils.c (random_number): New function. 2001-12-16 Hrvoje Niksic * doc/wget.texi (Wgetrc Commands): Undocument simple_host_check. 2001-12-14 Hrvoje Niksic * src/url.c (path_simplify): Move here from utils.c, and make static. 2001-12-13 Hrvoje Niksic * src/version.c: Wget 1.8.1-pre1 is released. 2001-12-13 Hrvoje Niksic * src/res.c (matches): Fix broken URL in the docstring. 2001-12-13 Hrvoje Niksic * src/recur.c (register_download): Don't abort when one URL references two different files. 2001-12-13 Hrvoje Niksic * src/init.c (wgetrc_file_name): Print correct message when loading getenv("WGETRC") fails. 2001-12-13 Hrvoje Niksic * src/http.c (gethttp): Check for conn->scheme, not u->scheme, before calling ssl_iwrite. 2001-12-13 Hrvoje Niksic * src/html-url.c (tag_url_attributes): Mark as external. 2001-12-13 Hrvoje Niksic * po/ja.po: Ditto. * po/sv.po: Ditto. * po/de.po: Ditto. * po/es.po: Ditto. * po/fr.po: Ditto. * po/et.po: Ditto. * po/tr.po: Ditto. * po/ru.po: Update from TP. 2001-12-13 Hrvoje Niksic * doc/wget.texi (Robots): Fix broken URLs that point to the webcrawler web site. 2001-12-12 Hrvoje Niksic * src/url.c (get_urls_file): Cosmetic changes. 2001-12-12 Hrvoje Niksic * src/html-url.c (collect_tags_mapper): Break into several functions. (tag_url_attributes): Collect . 2001-12-12 Hrvoje Niksic * src/html-url.c (append_one_url): Resurrect warning when unable to resolve a relative link. 2001-12-12 Hrvoje Niksic * configure.in: Autodetect SSL. Check for SSL includes too. 2001-12-11 Hrvoje Niksic * src/host.c: New type ipv4_address. Use it consistently instead of `unsigned char[4]' and `unsigned char *'. (pretty_print_address): Accept a `const void *', to require even less casting. 2001-12-11 Hrvoje Niksic * src/ftp-ls.c (ftp_parse_vms_ls): Fix obvious memory leaks. 2001-12-11 Hrvoje Niksic * doc/wget.texi (HTTP Options): Explain how to make IE produce a `cookies.txt'-compatible file. Reported by Herold Heiko. 2001-12-11 Hrvoje Niksic * doc/texi2pod.pl.in: Handle @asis in table. 2001-12-11 Hrvoje Niksic * configure.in: Check for md5_calc rather than MD5Update when looking for Solaris md5. 2001-12-11 Hrvoje Niksic * config.sub: Ditto. * config.guess: Ditto. * aclocal.m4: Ditto. * ltmain.sh: Upgrade to libtool 1.4.2. 2001-12-10 Hrvoje Niksic * src/utils.c (path_simplify): Rewrite, with better comments, and without the use of strcpy to move overlapping blocks. 2001-12-10 Hrvoje Niksic * src/utils.c (path_simplify): Correctly handle the unlikely case that b starts out as path + 1. 2001-12-10 Hrvoje Niksic * src/utils.c (long_to_string): Return a pointer after where the number ends. (long_to_string): Rename to number_to_string. 2001-12-10 Hrvoje Niksic * src/main.c (main): Initialize progress after fork_to_background, so that it knows when to use dots. * src/mswindows.c (ws_hangup): Call log_request_redirect_output. * src/utils.c (fork_to_background): Print the PID of the child process. * src/log.c (log_request_redirect_output): Set a flag that output redirection has been requested. Doing anything else in a signal handler is unsafe. (check_redirect_output): New function: check whether redirection has been requested and, if so, call redirect_output(). (logputs): Call check_redirect_output. (logprintf): Ditto. (debug_logprintf): Ditto. (redirect_output): Print clearer messages. * src/main.c (redirect_output_signal): Don't call redirect_output_signal directly. Instead, call log_request_redirect_output. * src/utils.c (memfatal): Ditto. * src/progress.c (display_image): Use it. * src/log.c (log_set_save_context): New function: allow the caller to turn off saving log context lines. 2001-12-10 Hrvoje Niksic * src/host.c (address_list_set_faulty): Uncomment a sanity check. 2001-12-09 Hrvoje Niksic * src/version.c: Wget 1.8 is released. 2001-12-09 Hrvoje Niksic * src/url.c (reencode_string): Declare static. * src/res.c (registered_specs): Declare static. * src/progress.c (current_impl_locked): Declare static. * src/log.c (flush_log_p): Declare static. (needs_flushing): Ditto. * src/http.c (digest_authentication_encode): Declare static. * src/html-url.c (init_interesting): Declare static. * src/host.c (host_name_addresses_map): Declare static. * src/cookies.c (find_matching_chains): Declare static. * src/ftp-ls.c (ftp_parse_vms_ls): Warn about the memory leak indicated by lint. * src/utils.c (path_simplify): Remove unused variable STUB_CHAR. * src/host.c (address_list_set_faulty): Document that INDEX is currently unused. * src/url.c (rewrite_shorthand_url): Remove unused variable PATH. 2001-12-09 Hrvoje Niksic * src/progress.c (create_image): Fix ETA padding when hours are prined. 2001-12-09 Hrvoje Niksic * src/main.c (main): Remove stray debugging message. 2001-12-09 Hrvoje Niksic * src/init.c (cmd_spec_progress): Resurrect. Check whether VAL is a valid progress type before setting it. 2001-12-09 Hrvoje Niksic * doc/wget.texi: Bump version to 1.8. 2001-12-08 R.I.P. Deaddog * po/zh_TW.po: Updated for 1.8. 2001-12-08 Hrvoje Niksic * src/version.c: Wget 1.8-pre2 is released. 2001-12-08 Hrvoje Niksic * po/tr.po: Ditto. * po/sv.po: Ditto. * po/ru.po: Ditto. * po/fr.po: Ditto. * po/es.po: Ditto. * po/de.po: Update from TP. 2001-12-08 Hrvoje Niksic * doc/wget.texi (HTTP Options): Provide more specific information about how --load-cookies is meant to be used. 2001-12-08 Hrvoje Niksic * doc/texi2pod.pl: Include the EXAMPLES section. * doc/wget.texi (Overview): Shorten the man page DESCRIPTION. (Examples): Redo the Examples chapter. Include it in the man page. 2001-12-06 Hrvoje Niksic * src/version.c: Wget 1.8-pre1 is released. 2001-12-06 Hrvoje Niksic * src/utils.c (read_whole_line): Handle lines beginning with \0. 2001-12-06 Hrvoje Niksic * src/url.c (scheme_disable): New function. * src/main.c (main): Call ssl_init_prng from here rather than from init_ssl, so that it has a chance to disable support for https before a URL has been resolved. * src/gen_sslfunc.c (ssl_init_prng): Seed with rand() if all else failed. (ssl_init_prng): Disable support for https if seeding the PRNG fails. 2001-12-06 Hrvoje Niksic * src/progress.c (progress_handle_sigwinch): Set up the signal again. * src/utils.c: Include , where Solaris defines TIOCGWINSZ. * src/progress.c (bar_create): Don't use the last column on the screen. (create_image): Pad ETA to constant size. Pad SIZE to nine digits only until it exceeded them. 2001-12-06 Hrvoje Niksic * src/progress.c (progress_create): Make sure that, when the output is redirected, the progress implementation gets changed to the fallback one. (bar_set_params): Set current_impl_locked to 1 when "force" is specified. (progress_create): Don't change the progress implementation if current_impl_locked is non-zero. * src/main.c (redirect_output_signal): Call progress_schedule_redirect. * src/progress.c (progress_schedule_redirect): New function. 2001-12-06 Hrvoje Niksic * src/log.c (logvprintf): Restructure to allow being called multiple times. (logprintf): Call logvprintf in a loop. (debug_logprintf): Ditto. 2001-12-06 Hrvoje Niksic * src/gen_sslfunc.c (ssl_init_prng): Make the printed message translatable. 2001-12-06 Hrvoje Niksic * src/gen_sslfunc.c (ssl_init_prng): Allow the user to disable EGD by setting egd_file it to empty string. * src/main.c (main): Change the option name from --sslegdsock to --egd-file. 2001-12-06 Hrvoje Niksic * po/et.po: Update from the TP. 2001-12-06 Hrvoje Niksic * po/de.po: Ditto. * po/fr.po: Ditto. * po/tr.po: Ditto. * po/sv.po: Ditto. * po/et.po: Update from TP. * po/hu.po: New file from TP. 2001-12-06 Hrvoje Niksic * configure.in: Check for 2001-12-05 Hrvoje Niksic * src/utils.c (path_simplify): Document with test cases. 2001-12-05 Hrvoje Niksic * src/recur.c (convert_all_links): Guard against duplicates in downloaded_html_files. (register_download): Don't invalidate similar-looking URLs. (match_except_index): New function. 2001-12-04 Ian Abbott * src/snprintf.c (dopr): Use `unsigned int' as the second argument to va_arg when casting to `unsigned short' is intended. 2001-12-04 Hrvoje Niksic * src/url.c (local_quote_string): Reenable quoting of question marks, but only when `--html-extension' is used. 2001-12-04 Hrvoje Niksic * src/recur.c (retrieve_tree): Check whether the URL was already downloaded before downloading it again. (descend_child_p): Renamed to download_child_p. (register_download): When one URL is downloaded to a file already "owned" by another URL, delete all references that map any URL to that file. (register_delete_file): New function. (retrieve_tree): Use it after deleting a file. * src/url.c (url_parse): Re-canonicalize the URL also if the path is empty, so that e.g. "http://www.server.com" -> "http://www.server.com/". (lowercase_str): Use ISUPPER instead of !ISLOWER. * src/retr.c (retrieve_url): Use the canonical URL form when calling register_download(). 2001-12-04 Hrvoje Niksic * src/gen_sslfunc.c: Ditto. * src/rbuf.c: Include . 2001-12-04 Herold Heiko * windows\Makefile.src: add gen_sslfunc.c * windows\Makefile.src.bor: ditto. 2001-12-04 Herold Heiko * src/gen_sslfunc.c: on windows provide ssl crypto random initialization through RAND_screen(); could possibly be not enough for strong ssl communication (see the relevant manual page from the openssl package). 2001-12-03 Hrvoje Niksic * src/version.c: Wget 1.8-beta3 is released. 2001-12-03 Hrvoje Niksic * src/snprintf.c (dopr): Replace `short int' and `unsigned short int' with `int' when using it as the second argument to `va_arg'. 2001-12-03 Hrvoje Niksic * src/snprintf.c (dopr): Cast the result of va_arg to short int and short unsigned int where these types are expected to be used. 2001-12-03 Hrvoje Niksic * src/host.c (address_list_new_one): New function. (lookup_host): Use it. 2001-12-03 Hrvoje Niksic * src/ftp-basic.c (ftp_port): Don't return HOSTERR if we fail getting the socket data. * src/ftp.c: Ditto. * src/http.c: No need to declare h_errno. * src/host.c: Declare h_errno. 2001-12-03 Andre Majorel * src/host.c (lookup_host): Don't initialize TMPSTORE directly because it's not legal C. 2001-12-02 Hrvoje Niksic * src/version.c: Wget 1.8-beta2 is released. 2001-12-02 Hrvoje Niksic * src/utils.c (file_merge): If BASE doesn't contain a slash, just return a copy of FILE. 2001-12-01 Hrvoje Niksic * src/version.c: Wget 1.8-beta1 is released. 2001-12-01 Hrvoje Niksic * src/url.c (url_full_path): Document better. * src/http.c (gethttp): Use the full path when creating digest authorization. 2001-12-01 Hrvoje Niksic * src/url.c (replace_attr_refresh_hack): New function. (convert_links): Call replace_attr_refresh_hack for Refresh links. It will add the "TMOUT; URL=" junk before the link. * src/html-url.c (collect_tags_mapper): Set ID to the ID of the "content" attribute, not "http-equiv". (collect_tags_mapper): Don't use OFFSET to hack the raw_* values; instead, store the information that this entry belongs to a "refresh" link. 2001-12-01 Hrvoje Niksic * src/url.c (get_urls_file): If opt.base_href is specified, merge each URL with the base. 2001-12-01 Hrvoje Niksic * src/recur.c (retrieve_tree): Allow -p retrievals to exceed maximum depth by more than one. 2001-12-01 Hrvoje Niksic * src/main.c (print_help): Don't document the removed `-nh'. 2001-12-01 Hrvoje Niksic * src/ftp.c (getftp): When PWD fails, assume "/". * src/ftp-basic.c (ftp_syst): Fix indentation. 2001-12-01 Hrvoje Niksic * src/cookies.c (path_matches): Return 0 if PREFIX doesn't begin with '/'. 2001-12-01 Hrvoje Niksic * src/cookies.c (path_matches): FULL_PATH doesn't begin with '/', but PREFIX does. 2001-12-01 Hrvoje Niksic * src/cookies.c (check_domain_match): Reimplement to match Netscape's "preliminary specification" for cookies. 2001-12-01 Hrvoje Niksic * po/hr.po: Updated Croatian translation. 2001-12-01 Hrvoje Niksic * doc/wget.texi: Update the manual with the new recursive retrieval stuff. 2001-11-30 T. Bharath * src/http.c (persistent_available_p): Call SHUTDOWN_SSL if test_socket_open fails. 2001-11-30 Ingo T. Storm * doc/sample.wgetrc: Document ftp_proxy, too. 2001-11-30 Hrvoje Niksic * src/retr.c (retrieve_url): Don't allow more than 20 redirections. 2001-11-30 Hrvoje Niksic * src/recur.c (retrieve_tree): Skip the non-inline entries when enqueuing the children of a leaf HTML node in -p mode. (descend_url_p): Ignore opt.no_parent when in -p mode and UPOS is "inline". * src/html-url.c (get_urls_html): Don't accept dash_p_leaf_HTML. (collect_tags_mapper): When an entry is "inline", mark it as such. * src/recur.c (descend_url_p): Fix test when checking for acceptance/rejection rules. 2001-11-30 Hrvoje Niksic * src/progress.c (display_image): Just print one CR to reset the cursor position. 2001-11-30 Christian Fraenkel * src/init.c: New command `ssl_egd_sock'. * src/main.c (main): New option `--sslegdsock'. * src/gen_sslfunc.c (ssl_init_prng): Seed the RNG using EGD. 2001-11-29 Hrvoje Niksic * src/recur.c (descend_url_p): When resolving no_parent, compare with start_url, not parent url. Otherwise link from /a/b/ to /a/c/ wouldn't be followed, although the download started from /a/. 2001-11-29 Hrvoje Niksic * src/http.c (http_process_range): Accept the broken output of "JavaWebServer/1.1.1". 2001-11-29 Hrvoje Niksic * src/host.c (address_list_new): Initialize al->faulty. 2001-11-29 Hrvoje Niksic * src/headers: Guard against header files being included twice. 2001-11-29 Hrvoje Niksic * src/gen-md5.c: Use unsigned char * as the buffer argument to gen_md5_update. 2001-11-29 Hrvoje Niksic * src/connect.h: Declare select_fd. 2001-11-29 Hrvoje Niksic * src/cmpt.c (memmove): Include a simple memmove implementation. 2001-11-29 Hrvoje Niksic * configure.in: Use SSL's MD5 if we're compiling with SSL anyway. 2001-11-28 Hrvoje Niksic * src/progress.c (dot_set_params): If PARAMS is unspecified, use dot_style, if available. * src/init.c: Ditto. * src/main.c (main): Resurect --dot-style. * src/progress.c (dot_finish): Print the quantity if we're left at the beginning of a row. 2001-11-27 Ian Abbott * src/retr.c (retrieve_from_file): Initialize `new_file' to NULL to prevent seg fault. 2001-11-27 Hrvoje Niksic * src/url.c (convert_links): Don't translate %d-%d. * src/main.c (print_help): Remove stray HAVE_RANDOM code. 2001-11-27 Hrvoje Niksic * src/progress.c: Change the default progress implementation to "bar". 2001-11-27 Hrvoje Niksic * src/progress.c (bar_create): Print two newlines. 2001-11-27 Hrvoje Niksic * src/ftp.c (getftp): Improve output after sending PASV. Don't attempt to "look up" the IP address we already know; call connect_to_one directly. 2001-11-27 Hrvoje Niksic * src/connect.c (connect_to_many): Use address_list_set_faulty to prevent the faulty address from being reused. * src/host.c (address_list_set_faulty): New function. (address_list_get_bounds): New function, instead of address_list_count. 2001-11-27 Hrvoje Niksic * src/cmpt.c (random): Removed. * src/retr.c (sleep_between_retrievals): Use the more portable rand() instead of random(). 2001-11-27 Hrvoje Niksic * src/cmpt.c (random): New function, a simple-minded replacement for random() on systems that don't have it. 2001-11-27 Hrvoje Niksic * po/hr.po: Updated. 2001-11-27 Hrvoje Niksic * configure.in: Don't check for random. 2001-11-27 Hrvoje Niksic * configure.in: Check for random. 2001-11-26 Ian Abbott * src/http.c (gethttp): fix undeclared variable 'err' when compiled with HAVE_SSL. 2001-11-26 Hrvoje Niksic * src/recur.c (retrieve_tree): In case of followed redirection, blacklist the pre-redirection URL. 2001-11-26 Hrvoje Niksic * src/recur.c (descend_redirect_p): New function. (retrieve_tree): Make sure redirections are not blindly followed. 2001-11-26 Hrvoje Niksic * src/progress.c: Don't allocate new timers; use the timing data propagated from the caller. * src/retr.c (get_contents): Allocate and use a timer. 2001-11-26 Hrvoje Niksic * src/http.c (last_host_ip): Made into an address_list. (invalidate_persistent): Release pc_last_host_ip. (register_persistent): Use lookup_host. (persistent_available_p): Check for equality of hosts using address_list_match_all. Call address_list_release. (http_cleanup): New function. * src/ftp.c (getftp): Use lookup_host and connect_to_many. * src/http.c (gethttp): Use lookup_host and connect_to_many. * src/connect.c (make_connection): Removed. (connect_to_one): New function. (connect_to_many): Ditto. (set_connection_host_name): Ditto. * src/host.c (lookup_host): New function; new return type. (address_list_new): New function. (address_list_count): Ditto. (address_list_copy_one): Ditto. (address_list_delete): Ditto. (address_list_release): Ditto. (pretty_print_address): Ditto. 2001-11-26 Hrvoje Niksic * src/config.h.in: Put a HAVE_USLEEP stub. * src/cmpt.c (usleep): Replacement implementation of usleep using select. * src/init.c: New option init_rate. * src/main.c (main): New option --limit-rate. * src/retr.c (limit_bandwidth): New function. (get_contents): Call it to limit the bandwidth used when downloading. * src/progress.c (dot_update): Would print the wrong download speed on rows other than the first one when the download was continued. (dot_finish): Ditto. 2001-11-26 Hrvoje Niksic * configure.in: Check for usleep. 2001-11-25 Hrvoje Niksic * util/dist-wget: New file: the script used for building Wget. 2001-11-25 Hrvoje Niksic * src/url.c (reencode_string): Use unsigned char, not char -- otherwise the hex digits come out wrong for 8-bit chars such as nbsp. (lowercase_str): New function. (url_parse): Canonicalize u->url if needed. (get_urls_file): Parse each URL, and return only the valid ones. (free_urlpos): Call url_free. (mkstruct): Add :port if the port is non-standard. (mkstruct): Append the query string to the file name, if any. (urlpath_length): Use strpbrk_or_eos. (uri_merge_1): Handle the cases where LINK is an empty string, where LINK consists only of query, and where LINK consists only of fragment. (convert_links): Count and report both kinds of conversion. (downloaded_file): Use a hash table, not a list. (downloaded_files_free): Free the hash table. * src/retr.c (retrieve_from_file): Ditto. * src/main.c (main): Call either retrieve_url or retrieve_tree for each URL, not both. * src/retr.c (register_all_redirections): New function. (register_redirections_mapper): Ditto. (retrieve_url): Register the redirections. (retrieve_url): Make the string "Error parsing proxy ..." translatable. * src/res.c (add_path): Strip leading slash from robots.txt paths so that the path representations are "compatible". (free_specs): Free each individual path, too. (res_cleanup): New function. (cleanup_hash_table_mapper): Ditto. * src/recur.c (url_queue_new): New function. (url_queue_delete): Ditto. (url_enqueue): Ditto. (url_dequeue): Ditto. (retrieve_tree): New function, replacement for recursive_retrieve. (descend_url_p): New function. (register_redirection): New function. * src/progress.c (create_image): Cosmetic changes. * src/init.c (cleanup): Do all those complex cleanups only if DEBUG_MALLOC is defined. * src/main.c: Removed --simple-check and the corresponding simple_host_check in init.c. * src/html-url.c (handle_link): Parse the URL here, and propagate the parsed URL to the caller, who would otherwise have to parse it again. * src/host.c (xstrdup_lower): Moved to utils.c. (realhost): Removed. (same_host): Ditto. 2001-11-25 Hrvoje Niksic * src/url.c (convert_links): Handle CO_NULLIFY_BASE. * src/recur.c (retrieve_tree): Ignore download-ignorable children. (convert_all_links): Specify CO_NULLIFY_BASE when link_base_p. * src/html-url.c (handle_link): Return the newly created urlpos. (collect_tags_mapper): When dealing with BASE, store the base reference and mark it as download-ignorable. 2001-11-25 Hrvoje Niksic * src/url.c (convert_links): Attempt to quote '?' as "%3F" when linking to local files. Given up on the attempt, as it breaks local browsing. 2001-11-25 Hrvoje Niksic * src/recur.c (descend_url_p): Be more conservative with blacklisting URLs. (convert_all_links): Print how many files have been converted, and how long it took. * src/progress.c (create_image): Place the number of downloaded bytes right after the progress bar. * src/utils.c (suffix): Return a pointer into the string. 2001-11-25 Hrvoje Niksic * src/progress.c (dot_create): Align the "[ skipping ... ]" string with the dots. * src/retr.c (rate): Split into two functions: calc_rate and retr_rate. * src/progress.c (create_image): Draw a dummy progress bar even when total size is unknown. (display_image): Place the text cursor at the end of the "image". 2001-11-25 Hrvoje Niksic * src/main.c (private_initialize): Removed. (main): Don't call private_initialize. * src/http.c: Call lookup_host. * src/host.c (host_init): Removed. (add_host_to_cache): Initialize host_name_address_map here, on demand. (ngethostbyname): Commented out. * src/connect.c (make_connection): Remove dead code; use lookup_host. * src/host.c (store_hostaddress): Renamed to lookup_host and reversed the args. Removed host_address_name_map and host_slave_master_map. 2001-11-24 Hrvoje Niksic * src/utils.c (path_simplify): Preserver the (non-)existence of leading slash. Return non-zero if changes were made. 2001-11-24 Hrvoje Niksic * src/retr.c (retrieve_url): When the redirection URL doesn't parse, print the correct error message rather than "UNKNOWN". 2001-11-24 Hrvoje Niksic * src/recur.c (recursive_retrieve): Fix typo. 2001-11-24 Hrvoje Niksic * src/progress.c (bar_update): Don't modify bp->total_length if it is zero. 2001-11-24 Hrvoje Niksic * src/progress.c (bar_finish): If the timer didn't record any time since the download beginning, fake 1ms. 2001-11-23 Lemble Gregory * src/gen_sslfunc.c (ssl_init_prng): New function; seed the SSL RNG. 2001-11-23 Hrvoje Niksic * src/utils.c (determine_screen_width): New function. * src/main.c (main): New option `--progress=TYPE'. (main): Implement compatibility with the old option `--dot-style'. * src/init.c: Removed cmd_spec_dotstyle -- that logic is now in dp_set_params. (cmd_spec_progress): New function. * src/retr.c (get_contents): Use the progress_* functions instead of the old show_progress(). (show_progress): Removed. (rate): Print "xxxx.xx K/s" instead of "KB/s". Ditto for MB/s, etc. * src/progress.c (set_progress_implementation): New function. (valid_progress_implementation_p): Ditto. (progress_create): Ditto. (progress_update): Ditto. (progress_finish): Ditto. (dp_create): Ditto. (dp_update): Ditto. (dp_finish): Ditto. (dp_set_params): Ditto. (print_elapsed): Ditto. 2001-11-23 Hrvoje Niksic * src/progress.c: Renamed dp_* functions to dot_* for greater clarity and consistency with bar_*. (print_download_speed): Get rid of the unneeded '@' character. (create_image): Fix download rate geometry. * src/progress.c (print_elapsed): Remove spurious space. (print_elapsed): Renamed to print_download_speed, since that's what it does. 2001-11-23 Hrvoje Niksic * src/progress.c (create_image): Don't translate "%ld ". 2001-11-23 Hrvoje Niksic * src/progress.c (bar_update): If the downloaded amount becomes larger than the expected amount, adjust the expected amount accordingly. 2001-11-23 Hrvoje Niksic * src/progress.c (bar_set_params): Allow the user to force the use of the bar. 2001-11-23 Hrvoje Niksic * po/wget.pot: Rebuild. * po/POTFILES.in: Update with the new source files. 2001-11-23 Hrvoje Niksic * po/hr.po: A major overhaul. 2001-11-23 Hrvoje Niksic * doc/wget.texi (Download Options): Document the new `--progress' option. 2001-11-23 Hrvoje Niksic * configure.in: Check for sys/ioctl.h. 2001-11-22 Jochen Hein * src/main.c (main): Split the copyright notice for easier translation. 2001-11-22 Hrvoje Niksic * windows/Makefile.doc: Update docs generation. 2001-11-22 Hrvoje Niksic * src/utils.c (path_simplify): Don't remove trailing slashes. * src/ftp.c (ftp_get_listing): Use it. * src/utils.c (file_merge): New function. * src/url.c (opt_url): Removed. * src/recur.c (recursive_retrieve): Inline "opt_url" logic. * src/main.c (main): Use xfree(), not free(). * src/url.c (rewrite_url_maybe): Renamed to rewrite_shorthand_url. * src/ftp.c (ccon): Move `ccon' typedef here, since it's only used internally. * src/config.h.in: Include a stub for HAVE_STRPBRK. * src/cmpt.c (strpbrk): Include a replacement for systems without strpbrk(). * src/ftp.c: Use url_set_dir and url_set_file when modifying the URL. * src/url.c (url_set_dir): New function. (url_set_file): Ditto. * src/ftp-basic.c (ftp_process_type): Process FTP type here; the URL parser makes the URL "params" available, so we can do that in this function. * src/retr.c: Ditto. * src/ftp.c: Ditto; pass the local file information in `ccon'. * src/http.c: Get rid of the ugly kludge that had URL being replaced with the proxy URL when proxy retrieval was requested. Use a separate parameter to http_loop and gethttp for the proxy URL. * src/http.c: Changed to reflect the fact that local file, proxy, and referer information are no longer stored in struct url. The local file information is passed in `struct hstat' now. * src/url.c: Reworked URL parsing to be more regular. Reencode the URL using reencode_string. Removed non-URL-related information from struct url. This includes fields `proxy', `local', and `referer'. 2001-11-22 Hrvoje Niksic * src/retr.c (show_progress): Use it. * src/log.c (log_set_flush): New function. 2001-11-22 Hrvoje Niksic * doc/wget.texi (Proxies): Fix typo. (Proxies): Sync the text with the example. (Wgetrc Commands): There is no -f option. It's --follow-ftp. Reported by Wojtek Kotwica. 2001-11-22 Hrvoje Niksic * configure.in: Check for strpbrk(). 2001-11-22 Herold Heiko * windows/Readme * windows/Makefile.doc Windows documentation update. * windows/Makefile.src Cleanup config.h 2001-11-20 Hrvoje Niksic * src/url.c (parseurl): Don't depend on the now-obsolete TYPE. 2001-11-19 Hrvoje Niksic * src/url.c: Clean up handling of URL schemes. 2001-11-19 Hrvoje Niksic * src/url.c (getproxy): Handle URL shorthands. 2001-11-19 Hrvoje Niksic * src/main.c: Remove --wait / --waitretry backwards compatibility code. 2001-11-19 Hrvoje Niksic * src/main.c (main): Use it. * src/url.c (rewrite_url_maybe): New function. 2001-11-18 Hrvoje Niksic * src/version.c: Wget 1.7.1 is released. 2001-11-18 Hrvoje Niksic * src/res.c (res_register_specs): Initialize OLD and HP_OLD to appease the compiler. 2001-11-18 Hrvoje Niksic * src/md5.h: Renamed to gnu-md5.h. * src/md5.c: Renamed to gnu-md5.c. * src/http.c: Ditto. * src/ftp-opie.c: Use the new macros. * src/sysdep.h: Define md5-related macros. * src/config.h.in: Define HAVE_SOLARIS_MD5 or HAVE_BUILTIN_MD5 depending on which md5 implementation is used. 2001-11-18 Hrvoje Niksic * src/http.c (gethttp): Print the whole response line when printing headers is requested. 2001-11-18 Hrvoje Niksic * src/Makefile.in: Conditionally compile getopt.o. 2001-11-18 Hrvoje Niksic * po/: Installed ja.po, et.po, he.po, fr.po, da.po, uk.po, es.po, sl.po, nl.po from the Translation Project. 2001-11-18 Hrvoje Niksic * configure.in: Check for getopt_long in libc. 2001-11-18 Hrvoje Niksic * configure.in: Check for Solaris libmd5. 2001-11-17 Hrvoje Niksic * doc/Makefile.in (install.info): If info files from the build directory are not available, use the ones from $(srcdir). 2001-11-16 Peter Farmer * doc/Makefile.in: Use $? instead of $<. Use TEXI2POD more consistently. 2001-11-16 Hrvoje Niksic * src/html-parse.c (map_html_tags): Support XML-style empty tags. 2001-11-16 Hrvoje Niksic * src/html-parse.c (advance_declaration): Use 0x22 instead of '"' or '\"'. Different compilers' assert macros are broken in different ways. 2001-11-16 Hrvoje Niksic * src/headers.c (header_extract_number): Ignore trailing whitespace. 2001-11-16 Chris Seawood * src/init.c: Ditto. * src/host.c: Ditto. * src/connect.c: Ditto. * src/sysdep.h: Support compilation under BEOS. 2001-11-04 Alan Eldridge * src/config.h.in: added HAVE_RANDOM. * src/options.h: added random_wait to struct options. * src/main.c (print_help [HAVE_RANDOM], main): added arg parsing, help for --random-wait. * src/retr.c (sleep_between_retrievals) [HAVE_RANDOM]: added implementation of random wait times. * src/init.c (commands): added "randomwait" keyword. 2001-11-04 Alan Eldridge * doc/wget.texi: Document --random-wait, randomwait=on/off. 2001-10-31 Daniel BODEA * src/netrc.c (search_netrc): When slack_default is 0, still look for an account with matching password, just not the "default account". HTTP Authorization using .netrc should now work as expected. 2001-09-29 Christian Fraenkel * src/http.c (gethttp): print debug output for errors occuring during the ssl handshake. 2001-08-24 Ian Abbott * src/html-url.c (collect_tags_mapper): Fix bug converting links with -k option for tags with multiple link attributes by handling links in the order they appear. 2001-08-21 Dave Turner * src/ftp-basic.c (ftp_size): New function to send non-standard SIZE command to server to request file size. * src/ftp.h (ftp_size): Export it. * src/ftp.c (getftp): Use new ftp_size function if restoring transfer of a file with unknown size. 2001-08-15 Ian Abbott * src/ftp.c (ftp_loop_internal): Avoid a potential buffer overflow in the call to the 'rate' function by moving it past the error checking for the 'getftp' function return value. 2001-06-26 Hrvoje Niksic * src/wget.h (DO_REALLOC_FROM_ALLOCA): Set SIZEVAR after the memcpy() call because it needs the old value. 2001-06-26 Hrvoje Niksic * src/wget.h (DO_REALLOC_FROM_ALLOCA): Check for do_realloc_newsize in loop condition because we're no longer setting SIZEVAR here. 2001-06-18 Hrvoje Niksic * src/url.c (url_filename): Make sure that slashes that sneak in to u->file via query string get protected. (file_name_protect_query_string): New function. 2001-06-18 Hrvoje Niksic * src/cookies.c (ATTR_NAME_CHAR): Allow almost any character to be in an attribute name. 2001-06-16 Hrvoje Niksic * doc/wget.texi: Updated version to 1.7.1. 2001-06-16 Hrvoje Niksic * MACHINES: Added mips-sgi-irix6.5, as reported by Edward J. Sabol. 2001-06-15 Hrvoje Niksic * po/da.po: New version from TP. 2001-06-15 Hrvoje Niksic * doc/Makefile.in (install.wgetrc): Use $(DESTDIR) when testing whether $(WGETRC) exists. 2001-06-15 Hrvoje Niksic * doc/Makefile.in (install.wgetrc): Take $(DESTDIR) into account when running mkinstalldirs. 2001-06-15 Hrvoje Niksic * config.sub: New version from libtool 1.4. * config.guess: New version from libtool 1.4. * ltmain.sh: New version from libtool 1.4. * aclocal.m4: Imported `libtool.m4' from libtool 1.4. * ltconfig: Removed. * configure.in: First check the compiler, then invoke libtool. 2001-06-15 Adam J. Richter * doc/Makefile.in (install.wgetrc): Make `make install' non-interactive in all cases. 2001-06-14 Maciej W. Rozycki * configure.in: Use `libtool' to test linking of external libraries. 2001-06-14 Hrvoje Niksic * src/sysdep.h (MAP_FAILED): Provide MAP_FAILED for systems that don't define it. 2001-06-14 Hrvoje Niksic * src/recur.c (recursive_retrieve): Also check undesirable_urls with canonicalized URL. 2001-06-14 Hrvoje Niksic * src/http.c (gethttp): Search `.netrc' with real host, not the proxy one. 2001-06-14 Hrvoje Niksic * po/: Install new files from the TP: sv.po, cs.po, et.po, tr.po, es.po, de.po, gl.po, sk.po, ru.po, fr.po. 2001-06-14 Hrvoje Niksic * configure.in: Check for both gethostbyname and inet_ntoa before concluding that -lnsl is not needed. 2001-06-09 Jan Prikryl * src/ftp.h: Provide correct prototype for ftp_parse_ls(). 2001-06-08 Edward J. Sabol * src/url.c (url_equal): Fix a memory leak when parseurl returns an error on the second URL. Also, since url_equal is not used at the moment, do not compile it. * src/url.h: Ditto for the prototype of url_equal. 2001-06-05 Jan Prikryl * po/cs.po: Updated to match the 1.7 POT. 2001-06-05 Jan Prikryl * doc/Makefile.in (wget.info): Added -I$(srcdir) to support compilation outside the source tree. (install.man): Replaced $(srcdir)$(MAN) with $(MAN). The former did not work when compiling outside the source tree. 2001-06-04 Hrvoje Niksic * src/version.c: Wget 1.7 is released. 2001-06-04 Hrvoje Niksic * po/: New versions of de.po and gl.po from the TP. 2001-06-03 Karl Eichwalder * src/ftp-ls.c (ftp_parse_ls): Fix typo. 2001-06-03 Hrvoje Niksic * po/wget.pot: Updated. 2001-06-03 Hrvoje Niksic * po/hr.po: Updated to match the new POT. 2001-06-03 Hrvoje Niksic * po/es.po: Use the version from TP. 2001-06-02 R.I.P. Deaddog * po/zh_TW.po: Updated for 1.7. 2001-06-02 Hrvoje Niksic * po/pl.po: Use iso-8859-1 as charset. * po/hr.po: Update. 2001-06-02 Hrvoje Niksic * po/: Updated ru.po, et.po, and sv.po. Added tr.po. 2001-05-31 Hrvoje Niksic * doc/wget.texi (Mailing List): Fix the mailing list address. 2001-05-28 Maciej W. Rozycki * configure.in: Use $host_os instead of non-existent "$opsys" when deciding based on host type. * configure.in: Print "cross" when cross-compiling. 2001-05-27 Hrvoje Niksic * src/all: Update copyright information. 2001-05-27 Hrvoje Niksic * doc/wget.texi (Copying): Clarify. Link to "free-software-for-freedom.html". 2001-05-26 Hrvoje Niksic * src/version.c: Wget 1.7-pre1 is released. 2001-05-26 Hrvoje Niksic * src/version.c: Updated version to 1.7-pre1. 2001-05-26 Hrvoje Niksic * src/http.c (gethttp): Indicate that the continued download failed for *this* file. 2001-05-26 Hrvoje Niksic * po/hr.po: Updated. * po/wget.pot: Regenerated from sources. * README: Updated copyright statement. * INSTALL: Document the new OpenSSL autodetector. 2001-05-26 Hrvoje Niksic * doc/wget.texi: Updated version to 1.7. 2001-05-26 Hrvoje Niksic * doc/wget.texi: Updated version to 1.7-pre1. 2001-05-26 Hrvoje Niksic * doc/wget.texi (Contributors): Updated list of contributors. 2001-05-26 Hrvoje Niksic * configure.in: Provide a default for AC_TRY_RUN when cross-compiling. Effectively, assume that when cross-compiling, working linkage implies working executable. 2001-05-25 Hrvoje Niksic * configure.in: Rewrote OpenSSL library detection. Now the code loops over system locations where libssl/libcrypto might be located. Aside from linking, it actually tries to run the executable before concluding that the linking "worked". 2001-05-16 Csaba Raduly * windows/Makefile.watcom: Make linker accept space-separated list of object files. 2001-05-14 Hrvoje Niksic * src/http.c (gethttp): Use real URL data for cookies, not the proxy stuff. 2001-05-14 Hrvoje Niksic * src/gen_sslfunc.c: Don't include directly. 2001-05-14 Herold Heiko * windows/Makefile.src: Update for SSL. 2001-05-14 Herold Heiko * windows/Makefile.src: * windows/Makefile.src.bor: * windows/Makefile.watcom: * windows/config.h.bor: * windows/config.h.ms: * windows/wget.dep: Windows update. 2001-05-14 Csaba Raduly * windows/Makefile.watcom: Updated. 2001-05-14 Csaba Raduly * windows/Makefile.watcom: Rewritten. 2001-05-13 Hrvoje Niksic * src/url.c: Get rid of `protostrings'. (skip_proto): Don't use protostrings. (has_proto): Ditto. 2001-05-12 Hrvoje Niksic * src/res.c: New file. Implement all RES-related code here. 2001-05-12 Hrvoje Niksic * src/main.c (print_help): Document `--no-http-keep-alive'. * src/utils.c (numdigit): Handle negative numbers *correctly*. * src/hash.c (make_nocase_string_hash_table): Use term "nocase" rather than the confusing "unsigned". * src/utils.c (string_set_contains): Renamed from string_set_exists. * src/hash.c (hash_table_contains): Renamed from hash_table_exists. * src/cookies.c: Move case-insensitive hash tables to hash.c. 2001-05-09 Hrvoje Niksic * src/http.c (gethttp): Before concluding that the file is already fully retrieved, make sure that the file existed and `Range' was actually requested. 2001-05-09 Hrvoje Niksic * src/cookies.c (eliminate_dups): New function. (build_cookies_request): Use it. (build_cookies_request): Set chain_store_size after reallocating all_chains. (check_domain_match): Annotated for easier future debugging. (store_cookie): In the debug message, print whether the cookie is permanent. 2001-05-08 Hrvoje Niksic * src/http.c (http_loop): Reset no_truncate before deciding whether to set it. (gethttp): Further clarify "-c conflicts with existing file" error message, based on input from Herold Heiko. 2001-05-07 Hrvoje Niksic * src/http.c (http_loop): If restval is set, set no_truncate to 1 unconditionally. 2001-05-02 Jan Prikryl * src/ftp-ls.c (ftp_parse_winnt_ls): Assure months are being correctly converted. Pointed out by . (ftp_parse_vms_ls): Ditto. 2001-04-30 Hrvoje Niksic * src/init.c (cmd_address): Zero SIN before using it; apparently needed on *BSD. 2001-04-29 Hrvoje Niksic * src/main.c (main): Make `--cookies' respect its argument. 2001-04-29 Hrvoje Niksic * src/ftp.c (ftp_loop_internal): Don't set NO_TRUNCATE if the file is empty. 2001-04-28 Hrvoje Niksic * src/main.c (main): Removed undocumented option `--email-address'. * src/netrc.c: Use the latest read_whole_line. * src/init.c (defaults): Set opt.ftp_pass to "-wget@". * src/mswindows.c (pwd_cuserid): Ditto. * src/utils.c (pwd_cuserid): Removed. * src/host.c (ftp_getaddress): Removed. 2001-04-28 Hrvoje Niksic * src/http.c (gethttp): Return RETRUNNEEDED when the retrieval is unneeded because the file is already there and fully downloaded, and -c is specified. (http_loop): Handle RETRUNNEEDED. * src/wget.h (uerr_t): New value RETRUNNEEDED. * src/http.c (http_loop): Set no_truncate for files that both exist and are non-empty. (gethttp): Consider the download finished when restval >= contlen, not only when restval==contlen. (gethttp): Handle redirection before giving up due to -c. (gethttp): Clarify error message which explains that -c will not truncate the file. (gethttp): When returning CONTNOTSUPPORTED, don't forget to free the stuff that needs freeing and release the socket. 2001-04-28 Hrvoje Niksic * doc/wget.texi (Wgetrc Commands): Update docs for `continue'. 2001-04-28 Hrvoje Niksic (http_loop): Allocate space for filename_plus_orig_suffix with alloca; this is more efficient and removes the need to free it before each and every return. 2001-04-28 Herold Heiko * windows/wget.dep: Update. * windows/Makefile.src: Update. * windows/config.h.ms: Define inline to __inline. Define ftruncate to chsize. 2001-04-28 Csaba Raduly * windows/Makefile.watcom: Update. 2001-04-27 Hrvoje Niksic * src/safe-ctype.h: Instead of throwing #error when isalpha is defined, redefine it to something that will throw a compile-time error if actually *used*. Do the same for the rest of the standard C macros. 2001-04-27 Hrvoje Niksic * src/main.c (print_help): Wget booleans accept "off", not "no". 2001-04-27 Hrvoje Niksic * src/http.c (mktime_from_utc): Improve documentation. (http_atotm): Put format strings into a separate array. 2001-04-27 Hrvoje Niksic * src/http.c (http_loop): If allow_cache is zero, always disable caching, not only when retrieving through proxy. * src/init.c: Ditto. * src/options.h (struct options): Rename proxy_cache to allow_cache. 2001-04-27 Hrvoje Niksic * po/ja.po: New update by Hiroshi Takekawa. 2001-04-27 Hrvoje Niksic * po/hr.po: Updated. 2001-04-27 Hrvoje Niksic * doc/wget.texi (HTTP Options): Document cookie options. 2001-04-26 Hrvoje Niksic * src/url.c (getproxy): Ignore empty proxy vars. 2001-04-25 Roger L. Beeman * src/http.c (http_atotm): Initialize t.tm_isdst to 0. (mktime_from_utc): Prevent mktime() from having discontinuities at DST transition points. 2001-04-25 Hrvoje Niksic * src/utils.c: Document timer functions. * src/retr.c (rate): Use it. (rate): Print in GB/s if transfer rate exceeds 1 GB/s. * src/utils.c (wtimer_granularity): New function. 2001-04-25 Hrvoje Niksic * src/utils.c: Define each DIGITS_* in one line. 2001-04-25 Hrvoje Niksic * src/url.c (UNSAFE_CHAR): Reimplement using a static table. (url_init): Removed. (init_unsafe_char_table): Removed. 2001-04-25 Hrvoje Niksic * src/snprintf.c (dopr): Replace ISDIGIT with '0' <= ch && ch <= '9'. 2001-04-25 Hrvoje Niksic * src/http.c (http_loop): Would load cookies every time. * src/cookies.c (load_cookies): Handle cookies whose values contain embedded spaces. 2001-04-25 Hrvoje Niksic * src/html-url.c (get_urls_html): Fix documentation. 2001-04-25 Hrvoje Niksic * po/POTFILES.in: Add src/cookies.c. 2001-04-24 Hrvoje Niksic * src/utils.c (numdigit): Handle negative numbers. 2001-04-24 Hrvoje Niksic * src/utils.c (long_to_string): New, faster version. Favors smaller numbers; much of the calculation is now done at compile-time. 2001-04-24 Hrvoje Niksic * src/retr.c (show_progress): Ditto. * src/ftp.c (getftp): Ditto. * src/http.c (gethttp): Use new timer functions. * src/utils.c (wtimer_allocate): New function. (wtimer_new): Ditto. (wtimer_delete): Ditto. (wtimer_reset): Ditto. (wtimer_elapsed): Ditto. 2001-04-23 Hrvoje Niksic * src/retr.c (show_progress): Print the download rate even when the percentages are not available. 2001-04-21 Hrvoje Niksic * src/ftp.c (getftp): Adjust expected_bytes if the length is authoritative. 2001-04-15 Ian Abbott windows/wget.dep: The target `connect$o' (connect.obj) now depends on `utils.h'. 2001-04-15 Hrvoje Niksic * po/da.po: Ditto. * po/de.po: Ditto. * po/el.po: Ditto. * po/es.po: Ditto. * po/et.po: Ditto. * po/fr.po: Ditto. * po/gl.po: Ditto. * po/he.po: Ditto. * po/ja.po: Ditto. * po/pl.po: Ditto. * po/sk.po: Ditto. * po/sl.po: Ditto. * po/sv.po: Ditto. * po/tr.po: Ditto. * po/zh_TW.po: Update from TP. * po/ca.po: Ditto. * po/bg.po: New file from TP. 2001-04-14 Hrvoje Niksic * src/url.c: Don't declare `construct'. * src/hash.c (grow_hash_table): Speed up rehashing; inline storing of mappings to new locations. (hash_table_new): Make resize_threshold a field in the hash table, so we don't have to recalculate it in each hash_table_put. (grow_hash_table): Update resize_threshold. (MAX): Remove unused macro. (prime_size): Made static. 2001-04-14 Hrvoje Niksic * src/retr.c (retrieve_url): Call uri_merge, not url_concat. * src/html-url.c (collect_tags_mapper): Call uri_merge, not url_concat. * src/url.c (mkstruct): Use encode_string instead of xstrdup followed by URL_CLEANSE. (path_simplify_with_kludge): Deleted. (contains_unsafe): Deleted. (construct): Renamed to uri_merge_1. (url_concat): Renamed to uri_merge. 2001-04-13 Hrvoje Niksic * src/wget.h (XDIGIT_TO_xchar): Define here. * src/url.c (decode_string): Use new name. (encode_string): Ditto. * src/http.c (XDIGIT_TO_xchar): Rename HEXD2asc to XDIGIT_TO_xchar. (dump_hash): Use new name. * src/wget.h: Rename ASC2HEXD and HEXD2ASC to XCHAR_TO_XDIGIT and XDIGIT_TO_XCHAR respectively. 2001-04-13 Hrvoje Niksic * src/url.c (str_url): Use encode_string instead of the unnecessary CLEANDUP. (encode_string_maybe): New function, returns input string if no encoding is needed. (encode_string): Call encode_string_maybe to do the dirty work, xstrdup if no work needed. 2001-04-13 Hrvoje Niksic * src/init.c: Include cookies.h. * src/cookies.h: Declare cookies_cleanup. * src/cookies.c (check_domain_match): Remove unused variable. (save_cookies): Remove extraneous argument from debug statement. * src/host.c (same_host): Don't call skip_url. * src/url.c (skip_url): Removed. Removed its calls from various functions in url.c. 2001-04-13 Hrvoje Niksic * src/cookies.c (unsigned_string_hash): Use the new code in string_hash as reference. * src/hash.c (hash_table_map): Allow deletion and change of the element processed by MAPFUN. (string_hash): Use the function from glib. 2001-04-12 Hrvoje Niksic * src/hash.h: Declare hash_table_get_pair and hash_table_count. 2001-04-12 Hrvoje Niksic * src/ftp-ls.c (ftp_parse_vms_ls): Make seconds optional in time specification. 2001-04-12 Hrvoje Niksic * src/ftp-ls.c (ftp_parse_unix_ls): Use octal constants for permissions. A compiler that doesn't accept octal constants is seriously broken and shouldn't be used -- octal constants were present in K&R C! 2001-04-12 Hrvoje Niksic * src/cookies.c: Declare http_atotm. 2001-04-12 Hrvoje Niksic * src/config.h.in: Include #undef stub. * src/hash.c (hash_table_remove): Rewrite to actually clear deleted entries instead of just marking them as deleted. 2001-04-12 Hrvoje Niksic * configure.in: Check for inline. 2001-04-11 Hrvoje Niksic * src/url.c (parseurl): Don't strip trailing slash when u->dir is "/" because that strips the *leading* slash, thus forcing relative FTP retrieval. 2001-04-11 Hrvoje Niksic * po/zh_TW.po: Reinstated, after an update by Abel Cheung. * po/zh_TW.Big5.po: Removed. 2001-04-11 Hrvoje Niksic * po/zh_TW.Big5.po: New file, submitted by Abel Cheung. * po/zh.po: Removed outdated file. 2001-04-10 Jan Prikryl * src/ftp.c (getftp): Convert initial FTP directory from VMS to UNIX notation for VMS servers. (ftp_retrieve_dirs): Do not prepend '/' to f->name when odir is an empty string. 2001-04-10 Jan Prikryl * src/ftp-ls.c (ftp_parse_winnt_ls): Made the fix for AM/PM more effective. Suggested by Edward J. Sabol. 2001-04-10 Hrvoje Niksic * src/cookies.c (build_cookies_request): Use and sort cookies from all matching domains. (build_cookies_request): Check for duplicates before generating the `Cookies' header. * src/main.c (main): Don't load cookies here. (main): Make loadcookies and savecookies call the correct command. * src/http.c (http_loop): Load cookies on-demand. 2001-04-09 Hrvoje Niksic * src/init.c (cmd_file): New function. (enable_tilde_expansion): New variable. (run_wgetrc): Use it. (cmd_file): Use it. 2001-04-09 Hrvoje Niksic * src/http.c (gethttp): Fix indentation of SSL ifdef. 2001-04-09 Hrvoje Niksic * src/ftp.c (ftp_retrieve_dirs): Don't forcibly prepend "/" to u->dir; that hack is no longer necessary. (getftp): Prepend initial directory to *non*-absolute u->dir's. 2001-04-08 Jan Prikryl * src/ftp-ls.c (ftp_parse_winnt_ls): The AM/PM change did assume 12:01PM == 00:01, which was obviously wrong. Taken care of this anomaly. * src/ChangeLog: Removed an excess conflict marker. Reformatted the entry by Philipp Thomas from 2001-03-09. * src/ftp-ls.c (ftp_parse_winnt_ls): Ensure that adjusted PM hours lay between 0 and 23. Elminate unused variable `sec'. 2001-04-08 Hrvoje Niksic * src/utils.c (datetime_str): New function. 2001-04-08 Hrvoje Niksic * src/init.c: Include cookie-related options. * src/main.c (main): Include cookie-specific options. (main): Load cookies before download is finished. (main): Save cookies when done. * src/http.c (gethttp): Process the `Set-Cookie' header. (gethttp): Include cookies in the response. * src/cookies.c: New file. 2001-04-08 Hrvoje Niksic * src/hash.c (hash_table_count): New function. 2001-04-06 Hrvoje Niksic * src/utils.c (read_file): Cast MAP_FAILED to char *. Enforced by Digital Unix cc. 2001-04-06 Hrvoje Niksic * src/sysdep.h: Don't define VERY_LONG_FORMAT. * src/utils.c (very_long_to_string): New function. (legible_very_long): Use it; don't use VERY_LONG_FORMAT. 2001-04-06 Hrvoje Niksic * src/http.c (gethttp): Prepend literal newline with `\n\'. 2001-04-06 Hrvoje Niksic * src/hash.c: Include . 2001-04-06 Hrvoje Niksic * src/config.h.in: Oops, do the namespace tweaks only on systems we know about. 2001-04-06 Hrvoje Niksic * src/config.h.in: Define "compilation environment" options that work under Linux and Solaris. To be reviewed on other OS'es. 2001-04-06 Hrvoje Niksic * aclocal.m4 (AM_PROG_CC_STDC): Don't use -Xc under SYSV. It forces strict ANSI mode, which means we lose `long long'. Generally, don't require __STDC__ to be defined to 1 because that signifies strict ANSI. 2001-04-04 Hrvoje Niksic * NEWS: Cosmetic changes. 2001-04-04 Christian Fraenkel * src/url.c (parse_uname): Would run past the end of the string if the username was present, but the URL did not contain a slash, e.g. http://foo:bar@myhost. 2001-04-03 Trond Eivind Glomsrod * po/da.po: Ditto. * po/no.po: The charset is iso-8859-1, not iso-8859-2. 2001-04-03 Paul Bludov * src/mswindows.c (sleep): Use SleepEx() instead of Sleep(). (ws_changetitle): Use alloca() instead of malloc() to avoid memory leak. (ws_mypath): Use GetModuleFileName instead of argv[0]. (ws_startup): Use data.wVersion for comparison. 2001-04-03 KOJIMA Hajime * src/http.c (http_atotm): Use %A instead of %a to match full weekday. (On most systems there is no difference.) 2001-04-02 Hrvoje Niksic * src/retr.c (retrieve_url): New variable global_download_count used to identify first retrieval. * src/ftp.c (getftp): Ditto. * src/http.c (gethttp): Rewind opt.dfp only on first retrieval. 2001-04-02 Hrvoje Niksic * src/netrc.c (parse_netrc): Don't trim the line endings explicitly; they will be handled as whitespace. (parse_netrc): Correctly handle lines that end with whitespace. 2001-04-02 Hrvoje Niksic * src/init.c (cmd_address): Heap-allocate the address that gets stored to CLOSURE. Old code would simply assign an address on the stack. 2001-04-02 Hrvoje Niksic * src/http.c (http_loop): Ditto. * src/ftp.c (ftp_loop_internal): Made the check whether to continue retrieval `-O'-friendly. 2001-04-02 Hrvoje Niksic * po/et.po: New version by Toomas Soome. 2001-04-01 Nicolas Lichtmaier * src/main.c (main): Add -C to the string that is the third arg to getopt_long(). 2001-04-01 Nicolas Lichtmaier * src/ftp.c (ftp_get_listing): Propagate error status. (ftp_retrieve_glob): Use it. (ftp_loop): Ditto. 2001-04-01 Nicolas Lichtmaier * po/es.po: New file. 2001-04-01 Hrvoje Niksic * src/ftp.c (getftp): Don't start the download from scratch if `-c' was specified, but the file is already fully downloaded. * src/http.c (gethttp): Don't truncate a pre-existing file if `-c' was specified and the server doesn't support continued download. (gethttp): Don't start the download from scratch if `-c' was specified, but the file is already fully downloaded. 2001-04-01 Hrvoje Niksic * doc/wget.texi (Recursive Retrieval Options): Document more accurately what --convert-links does. 2001-03-31 Hrvoje Niksic * src/retr.c (retrieve_url): Call register_download() for downloaded files and register_html() for downloaded HTML files. * src/recur.c (register_download): New function; register here that a file has been downloaded, rather than in recursive_retrieve(). (register_html): New function; enqueue the location of HTML files here rather than in recursive_retrieve(). 2001-03-31 Hrvoje Niksic * src/recur.c (recursive_retrieve): Clear the hash tables only when they are defined. 2001-03-31 Hrvoje Niksic * src/main.c (print_help): Use multiple fputs instead of a single ugly printf(). (main): Consistently assign numbers >128 to options without a corresponding character. 2001-03-31 Hrvoje Niksic * src/http.c (gethttp): Make sure the socket is closed with CLOSE_INVALIDATE before we have drained the body. 2001-03-31 Hrvoje Niksic (recursive_retrieve): Don't clear the hash tables at this point at all; it interferes with the normal operation of register_download. 2001-03-27 Dan Harkless * src/Makefile.in: Moved top_builddir out of "User configuration section" of top Makefile and analogous spot in this one. 2001-03-27 Dan Harkless * doc/Makefile.in: Moved top_builddir out of "User configuration section" of top Makefile and analogous spot in this one. 2001-03-27 Dan Harkless * INSTALL: Updated to reflect --with-ssl's new optional parameter. * configure.in: Christian Fraenkel's tests for -lcrypto and -lssl were in the wrong order, causing a link failure if you're using libcrypto.a and libssl.a rather than shared libraries. Also put in checks for -ldl, necessary since the libcrypto shared library doesn't record its dependency on libdl. * {.,util,windows}/Makefile.in: Moved top_builddir out of "User configuration section" of top Makefile and analogous spot in others. * po/Makefile.in.in: Previous addition of top_builddir to po/Makefile.in was bogus -- it's generated from po/Makefile.in.in. 2001-03-26 Dan Harkless * doc/wget.texi (Recursive Retrieval Options): Explained that you need to use -r -l1 -p to get the two levels of requisites for a page. Also made a few other wording improvements. 2001-03-26 Dan Harkless * TODO: -p should probably go "_two_ more hops" on pages. 2001-03-22 Dan Harkless * MACHINES: Added rs6000-ibm-aix4.3.3.0. 2001-03-21 Dan Harkless * MACHINES: Added armv4l-unknown-linux-gnu. 2001-03-20 Dan Harkless * TODO: Oops. Hostless absolute link conversion _is_ working. My test that led me to believe it wasn't was exposing a different bug -- URLs specified on the commandline as opposed to being recursed to don't always get re-converted at the end of the Wget run. 2001-03-17 Dan Harkless * src/Makefile.in: Include @SSL_INCLUDES@ substition in INCLUDES. Define top_builddir. Link wget with libtool so the user doesn't have to supply a bunch of custom environment variables to correctly link with the OpenSSL shared libraries. 2001-03-17 Dan Harkless * doc/Makefile.in: Using '^' in the sed call caused a weird failure on Solaris 2.6. Changed it to a ','. Defined top_builddir. 2001-03-17 Dan Harkless * aclocal.m4: Appended libtool 1.3.5's libtool.m4 to it. * configure.in: Use AM_PROG_LIBTOOL macro (now defined in our aclocal.m4) to create a libtool script from ltconfig and ltmain.sh. If --with-ssl specified, look in /usr/local/ssl/lib by default for OpenSSL libs. Allow override with --with-ssl=. Set up -I/include and -R/lib (possibly rewritten by libtool) as well. Don't appear to be looking for a function main() in -lcrypto. If the OpenSSL lib checks fail, don't just silently build a wget without https support -- issue a warning. Define top_builddir. * ltconfig: New file from libtool 1.3.5 distribution. * ltmain.sh: New file from libtool 1.3.5 distribution. * {.,po,util,windows}/Makefile.in: Define top_builddir. 2001-03-16 Dan Harkless * TODO: For some reason on 2000-11-19, Hrvoje removed the item about converting hostless absolute links. That isn't working yet, so I've put the item back, with a modified wording. * config.guess: Hadn't been updated since 1996 -- didn't work for recent machines and OSes, such as NetWinder ARM Linux. Updated to latest version (2001-03-16) from . * config.sub: Ditto -- updated to latest version (2001-03-12). 2001-03-12 Dan Harkless * TODO: Only normal recursion should respect -np -- page-requisite recursion should not. 2001-03-09 Philipp Thomas * src/safe-ctype.h: New file. Locale independent ctype.h replacement taken from libiberty. * src/safe-ctype.c: New file. Tables for above. * src/Makefile.in: Add safe-ctype$o to OBJS. Add dependencies for safe-ctype$o. * src/cmpt.c: Remove include of ctype.h. Use ISSPACE instead of isspace. * src/ftp-basic.c: Don't include ctype.h. * src/ftp-ls.c: Likewise. * src/ftp.c: Likewise. * src/headers.c: Likewise. * src/host.c: Likewise. * src/html-parse.c: Likewise. * src/html-url.c: Likewise. * src/http.c: Likewise. * src/init.c: Likewise. * src/main.c: Likewise. Set LC_CTYPE along with LC_MESSAGES. * src/netrc.c: Likewise. * src/recur.c: Likewise. * src/retr.c: Likewise. * src/snprintf.c: Replace ctype.h with safe-ctype.h. Use ISDIGIT instead of isdigit. * src/sysdep.h: Remove defines of ctype macros as they aren't needed for safe-ctype-h. * src/url.c: Don't include ctype.h. * src/utils.c: Likewise. * src/wget.h: Include safe-ctype.h. 2001-03-07 Jan Prikryl * TODO: Removed an obsolete item about adding VMS and MS FTP server support. 2001-03-06 Hack Kampbjorn * src/http.c (gethttp): skip :port in host header if it is the DEFAULT_HTTPS_PORT when using SSL. * src/url.c: move the #define of DEFAULT_HTTP_PORT, DEFAULT_FTP_PORT and DEFAULT_HTTPS_PORT to the header file so it can be use in the rest of the code. * src/url.h: Ditto 2001-03-05 Dan Harkless * TODO: Add a --range option to download only a given byte range. 2001-03-01 Jonas Jensen * src/retr.c (show_progress): Correctly calculate the number of bytes in the first line of the download that have been actually downloaded in this run. 2001-03-01 Dan Harkless * ChangeLog.README: Renamed from README.branches and added a note that Wget has multiple ChangeLog files (currently ./ChangeLog, doc/ChangeLog, and src/ChangeLog), since this is unusual and people have complained their patches hadn't been applied after checking only the top-level ChangeLog. 2001-02-28 Dan Harkless * MACHINES: Explicitly tell people to send us config.guess output. 2001-02-27 Dan Harkless * TODO: Re-use FTP connection if multiple URLs on one host specified. Make "ftp:///%2F" cause an initial "CWD /". 2001-02-23 Dan Harkless * src/main.c (print_help): --help documentation for -N said it would re-download files if they had the _same_ timestamp on server. (print_help): -nr belongs in "FTP options" section of --help output, not "Recursive retrieval" section. Alphabetized FTP options by long option name. 2001-02-23 Dan Harkless * doc/wget.texi: Corrections, clarifications, and English fixes to time-stamping documentation. Also moved -nr from "Recursive Retrieval Options" to "FTP Options" and gave it a @cindex entry. Alphabetized FTP options by long option name. Mentioned that .listing symlinked to /etc/passwd is not a security hole, but that other files could be, so root shouldn't run wget in user dirs. 2001-02-23 Dan Harkless * NEWS: Note that Wget now has a man page again. * po/*.po*: Updated after changing --help's description of -N and moving -nr to a different category. * TODO: "Timestamps are sometimes not copied over on files retrieved by FTP." removed. Hopefully all the failures I was seeing were due to the fact that it wasn't documented that non-globbing, non-recursive FTP downloads need -N to get the remote timestamp to be preserved. 2001-02-22 Dan Harkless * doc/Makefile.in: Make wget man page and install it if we have pod2man. Added some missing '$(srcdir)/'s. Added missing dependencies on install targets (allowing you to just do `make install' rather than forcing you to do `make && make install'). Also, Makefile rules should always use output file parameters if available rather than redirecting stdout with '>', or you falsely satisfy dependencies if the tool you're running is missing or fails -- fixed call of texi2pod.pl that did this wrong. * doc/texi2pod.pl: Removed from CVS. Now automatically generated. * doc/texi2pod.pl.in: This new file is processed into texi2pod.pl, getting the appropriate path to the Perl 5+ executable on this system and becoming executable (CVS files, by contrast, don't arrive executable). 2001-02-22 Dan Harkless * TODO: Remove empty directories created due to --accept/--reject. * configure.in: Look for perl and pod2man and make substitutions. * Makefile.in (install): Do install.man if we have pod2man. 2001-02-19 Dan Harkless * doc/wget.texi (Download Options): Further improvement to --continue documentation -- explain interaction with -r and -N, mention usefulness for downloading new sections of appended-to files, etc. 2001-02-16 Dan Harkless * src/init.c (commands): Hack Kampbjørn discovered that "httpsproxy" had been inserted into commands[] out of alphabetical order, causing "BUG: unknown command `httpuser'". 2001-02-13 Jan Prikryl * windows/Makefile.src: Removed references to ftpparse sources. * windows/wget.dep: Ditto. * windows/Makefile.watcom: Ditto. 2001-02-13 Jan Prikryl * src/ftp-ls.c (ftp_parse_ls): Added support of ST_MACOS (Unix-like listing without correct permissons). * src/ftp.h (stype): Added ST_MACOS to identify the NetPresenz MacOS FTP server. * src/ftp.c (ftp_retrieve_list): New mirroring logic: A remote file shall be donwloaded only when it's newer than the local copy or when it has the same timeestamp but its size is different. ST_VMS and ST_MACOS as special cases that lie about file size. * src/ftp-ls.c (ftp_parse_ls): Support for ST_MACOS. * src/Makefile.in: Removed dependency on ftpparse library due to unclear copyright issues and absence of any feedback to our queries. * src/ftp-ls.c: Removed dependency on ftpparse library due to unclear copyright issues and absence of any feedback to our queries. (ftp_parse_ls): Added a warning message when remote server system does not seem to be suported by wget. (ftp_parse_vms_ls): New function for parsing VMS ftp server listing output. (clean_line): New function responsible for removing end-of-line characters from FTP listing texts. * src/ftp.c (getftp): Global variables pwd and host_type are now member of the ccon structure under names ccon.id and ccon.rs. * src/ftp.h (struct ccon): Added formed global variables from ftp.c, enum stype rs (remote system identification) and char *id (initial working directory), as suggested by Hrvoje. * src/url.c (parse_uname): Added support for passwords containing '@' characters. (skip_uname): Ditto. 2001-02-11 Hrvoje Niksic * src/ftp.c (ftp_loop_internal): Disable padding. (getftp): Ditto. * src/http.c (http_loop): Disable padding. * src/retr.c (show_progress): Use it to enable padding. * src/retr.c (rate): Optional parameter PAD for padding the rate. 2001-02-11 Hrvoje Niksic * src/ftp.c (ftp_loop): Reset con. 2001-02-11 Hack Kampbjørn * src/url.c (parseurl): Debug-print u->ftp_type. 2001-02-10 Tim Mooney * src/ftp.h: Rename enums `command' to `wget_ftp_command' and `fstatus' to `wget_ftp_status' because old names clash with Tru64 net/if.h. 2001-02-10 Jonas Jensen * src/retr.c (show_progress): Print the download rate along with the percentages. Along with Anders Thorsby . 2001-02-10 Hrvoje Niksic * src/retr.c (show_progress): Make sure that the last output line includes progress. 2001-02-08 Christian Fraenkel * src/gen_sslfunc.c: verify_callback is now static * src/gen_sslfunc.c (init_ssl): load certificate if specified * src/gen_sslfunc.c (ssl_printerr): new function * src/init.c: added new --sslcertfile and --sslcertkey switches * src/main.c: ditto * src/options.h: ditto * src/http.c (gethttp): abort when init_ssl fails 2001-01-23 Herold Heiko * windows/Makefile.src: Don't attempt to compile in alloca.c; it doesn't work and it's not needed. 2001-01-23 Herold Heiko * src/mswindows.h: Include ; it's needed for alloca(). 2001-01-23 Herold Heiko * src/config.h.ms, mswindows.h: defined HAVE_ISATTY, use _isatty for MS VC; somebody with Borland compiler please check and provide patch if possible; * src/cmpt.c: provided a usleep emulation. 2001-01-20 Karl Eichwalder * src/Makefile.in: Provide and use DESTDIR according to the Coding Standards. 2001-01-20 Karl Eichwalder * doc/Makefile.in: Provide and use DESTDIR according to the Coding Standards. 2001-01-16 Hrvoje Niksic * NEWS: Added more NEWS items. 2001-01-15 Jan Prikryl * util/wget.spec: Updated to 1.7, merged with the spec file from RedHat. * po/Makefile.in.in: `make realclean' equal to `make maintainer-clean'. * Makefile.in (realclean-top): Remove 'configure' as well. 2001-01-15 Dan Harkless * NEWS: Was not being maintained. Added some significant 1.7-dev stuff. 2001-01-11 Dan Harkless * TODO: If -c used with -N, check to make sure a file hasn't changed on the server before "continuing" to download it. 2001-01-11 Adrian Aichner * windows/Makefile.src: Updated. * windows/wget.dep: Ditto. 2001-01-10 Dan Harkless * src/url.c (str_url): Clarified this function's comment header after Hrvoje answered my question on the list as to when hide != 1. Also Hrvoje pointed out I need to use xstrdup() on the string literal. 2001-01-09 Dan Harkless * src/html-url.c: A bunch of fixup of `--page-requisites'-related comments to reflect Hrvoje's changes to my code when transplanting it into this new file, to fix spelling mistakes, to clarify, etc. * src/url.c (write_backup_file): Clarified a comment. (str_url): Henrik van Ginhoven pointed out on the list that we shouldn't give away the number of characters in the password by replacing each character with a 'x'. Use "" instead. * src/ftp.c (ftp_retrieve_dirs): The bug where recursion into FTP directories didn't work if logging in put you in a directory other than "/" is fixed now. Removed the comment here warning of the bug. * src/main.c (print_help): --continue's description was misleading. We don't "restart", we "resume". Also, better to say "partially-downloaded file" rather than just "existing file". 2001-01-09 Dan Harkless * doc/wget.texi (Download Options): Did a bunch of clarification and correction to the description of --continue. 2001-01-09 Dan Harkless * TODO: If -c is on, don't re-download a 100%-downloaded file. * TODO: The bug where you couldn't recurse into ftp directories if logging in put you somewhere else besides the server's "/" directory got fixed without the TODO entry for it being removed. * TODO: Add a "rollback" option to have --continue throw away X corrupted (e.g. by proxy) bytes from end of file before resuming. * po/*.po*: Updated after changing --help's description of -c. 2001-01-06 Jan Prikryl * src/url.c (parse_uname): Added support for passwords containing '@' characters. (skip_uname): Ditto. 2001-01-06 Jan Prikryl * doc/wget.texi (Reporting Bugs): Deleted the setence about Cc-ing the bug report to Wget mailing list as the bug report address is an alias for the mailing ist anyway. (Mailing List): Added URL for the alternate archive. * doc/wget.texi: Bunch of cosmetical changes. * doc/Makefile.in: Added targets for manpage generation using texi2pod.pl and pod2man (comes with Perl5). As we cannot rely on Perl5 being available on the system, manpage is not being built automatically. Updated '*clean' targets to remove 'sample.wgetrc.munged...', 'wget.pod', and 'wget.man'. * doc/texi2pod.pl: New file copied from GCC distribution to facilitate automatic manpage generation. 2001-01-06 Hrvoje Niksic * src/connect.c (bindport): Declare addrlen as int. Diagnosed by Drazen Kacar . (conaddr): Ditto. 2001-01-06 Dan Harkless * src/ChangeLog: The '[Not in 1.6 branch.]'s were decided not to be the best way to go about my aim. Removed them in favor of: * src/ChangeLog-branches/1.6_branch.ChangeLog: New file. 2001-01-06 Dan Harkless * doc/ChangeLog: The '[Not in 1.6 branch.]'s were decided not to be the best way to go about my aim. Removed them in favor of: * doc/ChangeLog-branches/1.6_branch.ChangeLog: New file. 2001-01-06 Dan Harkless * ChangeLog: The '[Not in 1.6 branch.]'s were decided not to be the best way to go about my aim. Removed them in favor of: * ChangeLog-branches/1.6_branch.ChangeLog: New file. * README.branches: Explains the 1.6_branch.ChangeLog files. * README.cvs: Falsely claimed you only needed GNU autoconf to build from the CVS sources. You also need GNU gettext and texinfo. I also did a bunch of general re-writing of this file. 2001-01-04 Hrvoje Niksic * src/url.c (replace_attr): New function, to be used by both TO_COMPLETE and TO_RELATIVE case in convert_links. (find_fragment): New function for finding URL fragments. (replace_attr): Better handle the case where the original string is not quoted. Use find_fragment. (convert_links): Use replace_attr(). 2001-01-03 Dan Harkless * TODO: We should make a simple man page referring to info doco. 2000-12-31 Dan Harkless * src/ChangeLog: Since this flat file doesn't have multiple branches, looking at the dates would make you think that things went into 1.6 that actually just went into the 1.7-dev branch. Added "[Not in 1.6 branch.]" where appropriate to clarify. 2000-12-31 Dan Harkless * doc/Makefile.in (distclean): sample.wgetrc.munged_for_texi_inclusion needs to be included in the distribution or it'll get regenerated due to the wget.info dependency, and then that file will get regenerated, forcing people to have makeinfo installed unnecessarily. We could use a kludge of a 0-length file in the distro, but the file isn't that big and should compress very well. * doc/wget.texi: Changed "VERSION 1.5.3+dev" to "VERSION 1.7-dev" and "UPDATED Feb 2000" to "UPDATED Dec 2000". Like the comment in the file says, it'd be nice if these were handled automatically... * doc/ChangeLog: Since this flat file doesn't have multiple branches, looking at the dates would make you think that things went into 1.6 that actually just went into the 1.7-dev branch. Added "[Not in 1.6 branch.]" where appropriate to clarify. 2000-12-31 Dan Harkless * README: Changed 1.5.3 in the FTP URL to 1.6. * NEWS: Released Wget version 1.6. * po/*.po: 'Project-Id-Version's were very haphazard, saying either "wget" or "GNU wget", and with versions of 1.5.2-b[124], 1.5.3, the nonexistent 1.5.4, and 1.6-pre. Standardized all to "GNU Wget 1.7-dev". Perhaps this is wrong to do because some of the translations haven't been updated since the versions they state, but I know some of the files were updated specifically for 1.6, and none of them used this version (unless you count the sole "1.6-pre" guy). In any case, the 'POT-Creation-Date's and 'PO-Revision-Date's remain the best indicator of whether a translation's out of date. * ChangeLog: Since this flat file doesn't have multiple branches, looking at the dates would make you think that things went into 1.6 that actually just went into the 1.7-dev branch. Added "[Not in 1.6 branch.]" where appropriate to clarify. 2000-12-30 Dan Harkless * src/ftp.c, http.c: Applied Hack Kampbjørn 's patch to deal with h_errno not being defined in netdb.h under Cygwin. 2000-12-18 Csaba Raduly * windows/Makefile.watcom: Updated. 2000-12-18 Csaba Raduly * src/sysdep.h: Include and under Watcom. 2000-12-17 Igor Khristophorov * src/http.c (check_end): Fix test for '+' or '-'. 2000-12-17 Hrvoje Niksic * src/url.c (parseurl): Rename inner loop var from i to ind to avoid clash with the function top-level-declared variable i. (str_url): Likewise, rename inner-loop i to j. * src/recur.c (parse_robots): Don't declare LEN at top of function. (robots_match): Renamed parameter FORBIDDEN to avoid hiding of global variable. * src/main.c (main): Change erroneous use of bitwise and to logical. * src/init.c (cmd_address): Don't heap-allocate `sin'; it can be on the stack because it will be copied to closure. Thanks to Csaba Raduly's run of PC-LINT over the sources. 2000-12-17 Hrvoje Niksic * src/mswindows.c: Include . * src/gen_sslfunc.c: Include . * src/ftp-basic.c: Don't attempt to declare errno or h_errno because they're not used. * src/main.c: Include because errno is used. * src/ftp.c: Ditto. * src/http.c: Include for h_errno. 2000-12-17 Hrvoje Niksic * src/http.c (basic_authentication_encode): Use xmalloc(), not malloc(). Thanks to Csaba Raduly's run of PC-LINT over the sources. 2000-12-17 Csaba Raduly * src/sysdep.h: Test for __EMX__ rather than for EMXOS2 for OS/2 compilation. 2000-12-13 Hrvoje Niksic * src/html-parse.c (advance_declaration): MSVC assert() chokes on '\"'. Use '"' instead. 2000-12-11 Hrvoje Niksic * src/utils.c (xfree_real): Removed. (xfree_debug): Just call free(). * src/wget.h (xfree): Make it an alias for free. 2000-12-11 Hrvoje Niksic * src/http.c (http_loop): Furthermore, touch output_document only if it is known to be an existing regular file. 2000-12-11 Hrvoje Niksic * src/ftp.c (ftp_retrieve_list): Ditto. * src/http.c (http_loop): Touch output_document if that is used for output. 2000-12-10 Hrvoje Niksic * po/POTFILES.in: Updated. 2000-12-10 Hrvoje Niksic * doc/Makefile.in (install.info): Info files are *not* in $(srcdir), but in the current build dir. 2000-12-10 Hrvoje Niksic * configure.in: Add windows/Makefile to the output block. * windows/Makefile.in: New file. * README.cvs: New file. 2000-12-06 Hrvoje Niksic * src/rbuf.h: Implement only a single version of RBUF_READCHAR, using rbuf_read_bufferful when the buffer is depleted. * src/rbuf.c (rbuf_read_bufferful): New function. 2000-12-06 Hrvoje Niksic * src/http.c: Include gen_sslfunc.h after including Wget's headers. (persistent_available_p): Needed coma before `int ssl'. 2000-12-06 Hrvoje Niksic * src/gen_sslfunc.h: Use ansi2knr style function declarations. * src/gen_sslfunc.c: Reformat according to the GNU coding standards. More should be done. * src/http.c (persistent_available_p): Place the cheap SSL test nearer the top of the function. (CLOSE_FINISH, CLOSE_INVALIDATE): Define only one version of each. 2000-12-06 Hrvoje Niksic * src/ftp.c (ftp_loop_internal): Ditto. * src/http.c (http_loop): Use it. * src/retr.c (sleep_between_retrievals): New function that handles the logic of opt.wait and opt.waitretry. 2000-12-05 Hrvoje Niksic * src/url.c (init_unsafe_char_table): Reinstate space as an unsafe char. 2000-12-05 Hrvoje Niksic * configure.in: Don't unconditionally define HAVE_SSL, even when --with-ssl is given. 2000-12-03 Christian Fraenkel * src/Makefile.in: added gen_sslfunc object * src/config.h.in: added HAVE_SSL define * src/connect.c: changed select_fd from static int to int * src/connect.h: ditto * src/gen_sslfunc.h: New file * src/gen_sslfunc.c: ditto * src/http.c: added HTTPS fuctionality * src/retrc.c: ditto * src/url.c: ditto * src/init.c: added opt.httpsproxy * src/options.h: ditto * src/rbuf.h: added alternate rbuf struct * src/wget.h: added CONSSLERR * src/rbuf.c: ditto * src/http.c: Added HTTPS fuctionality. * src/retrc.c: Ditto. * src/url.c: Ditto. * src/init.c: Added opt.httpsproxy. * src/options.h: Ditto. * src/rbuf.h: Added alternate rbuf struct. * src/wget.h: Added CONSSLERR. * src/rbuf.c: Ditto. 2000-12-03 Christian Fraenkel * INSTALL: Added the --with-ssl switch. * configure.in: Ditto. * TODO: Removed the corresponding entry. 2000-11-30 Jan Prikryl * src/ftp-ls.c (ftp_parse_unix_ls): Added second parameter "ignore_perms" to ignore file and directory permissions for Windows NT FTP server listings. (ftp_parse_winnt_ls): New function. (ftp_parse_ls): Parses UNIX and Windows NT listings separately. Simple heuristics for distinguishing between UNIX and MS-DOS-like FTP listing provided by Windows NT FTP service. 2000-11-30 Jan Prikryl * src/ftp-ls.c (ftp_parse_unix_ls): Added second parameter "ignore_perms" to ignore file and directory permissions for Windows NT FTP server listings. (ftp_parse_winnt_ls): New function. (ftp_parse_ls): Parses UNIX and Windows NT listings separately. Simple heuristics for distinguishing between UNIX and MS-DOS-like FTP listing provided by Windows NT FTP service. 2000-11-29 John Summerfield * src/netrc.c (parse_netrc): Get rid of line ending. 2000-11-25 Karl Eichwalder * Makefile.in (SUBDIRS): Add 'windows'. (dist, DISTFILES): Don't distribute CVS directories. 2000-11-25 Hrvoje Niksic * src/ftp.c (ftp_retrieve_list): Undo typo "fix" until resolution by Dan. 2000-11-24 Karl Eichwalder * src/main.c (print_help): Untabify. 2000-11-23 Hrvoje Niksic * src/utils.c (xrealloc_debug): Do the unregister/register thing only if the pointer has actually changed. (xmalloc_real): Declare `static' in DEBUG_MALLOC builds. (xfree_real): Ditto. (xrealloc_real): Ditto. (xstrdup_real): Ditto. 2000-11-23 Hrvoje Niksic * configure.in: Build ALL_LINGUAS dynamically. 2000-11-22 Hrvoje Niksic * src/ftp.c (getftp): ftp_getaddress() returns a malloc'ed copy of the string; no need to strdup() it. (getftp): Make pwd_len a local variable. (ftp_loop): Free PWD before returning. * src/init.c (cleanup): Free opt.ftp_pass only if it's non-NULL. 2000-11-22 Hrvoje Niksic * src/all: Use xfree() instead of free. * src/utils.c (xfree): New function. 2000-11-21 Hrvoje Niksic * src/url.c (convert_links): HTML-quote the converted string. * src/utils.c (html_quote_string): Move here from ftp-ls.c (html_quote_string): Make non-static; declare in utils.h. (html_quote_string): Convert SP to . 2000-11-21 Hrvoje Niksic * src/hash.c (hash_table_put): Don't overwrite deleted mappings. 2000-11-21 Hrvoje Niksic * src/hash.c (find_mapping): New function. (hash_table_get): Use it. (hash_table_get_pair): Ditto. (hash_table_exists): Ditto. (hash_table_remove): Ditto. (hash_table_remove): Really delete the entry if the mapping following LOCATION is empty. * src/utils.c (string_set_add): Check whether the element has existed before. * src/hash.c (hash_table_get_pair): New function. 2000-11-21 Hrvoje Niksic * src/ftp.c (getftp): Reformat Jan's code according to GNU coding standards; remove (debugging?) printf's; use '\0' for the ASCII zero character. Use alloca() instead of malloc() for inter-function temporary allocations. 2000-11-20 Hrvoje Niksic * src/version.c: Bump version from 1.5.3+dev to 1.7-dev. 2000-11-20 Hrvoje Niksic * src/recur.c (recursive_retrieve): Print the "so we don't load" debugging message only if we really don't load. * src/http.c (gethttp): Inhibit keep-alive if proxy is being used. (gethttp): Don't request keep-alive if keep-alive is inhibited. 2000-11-20 Hrvoje Niksic * src/http.c (http_process_type): Ignore trailing whitespace; use strdupdelim(). * src/recur.c (recursive_retrieve): Use the new `convert' field. (convert_all_links): Ditto. (convert_all_links): Don't respect meta_disallow_follow. * src/html-url.c (handle_link): Fill out link_relative_p and link_complete_p. * src/url.h (struct _urlpos): Make elements more readable. * src/recur.c (recursive_retrieve): Call slist_prepend instead of slist_append. (convert_all_links): Call slist_nreverse before iterating through urls_html. * src/utils.c (slist_prepend): New function. (slist_nreverse): Ditto. 2000-11-20 Hrvoje Niksic * src/http.c (http_loop): If username and password are known, try the `Basic' authentication scheme by default. * src/connect.h: Declare test_socket_open. 2000-11-20 Hrvoje Niksic * src/http.c (gethttp): Don't use the return value of sprintf(). (gethttp): Inhibit keep-alive if opt.http_keep_alive is 0. 2000-11-20 Hrvoje Niksic * src/http.c (check_end): Constify. 2000-11-19 Hrvoje Niksic * src/url.c (convert_links): Handle UREL2ABS case. * src/recur.c (recursive_retrieve): Instead of the list urls_downloaded, use hash tables dl_file_url_map and dl_url_file_map. (convert_all_links): Use them to retrieve data. * src/host.c (clean_hosts): Free the hash tables. * src/main.c (private_initialize): Call host_init(). * src/host.c (store_hostaddress): Use a saner, hash table-based data model. (realhost): Ditto. (host_init): Initialize the hash tables. 2000-11-19 Hrvoje Niksic * src/retr.c (get_contents): If use_expected, make sure that the appropriate amount of data is being read. * src/http.c (gethttp): Check for both `Keep-Alive: ...' and `Connection: Keep-Alive'. * src/wget.h (DEBUGP): Call debug_logprintf only if opt.debug is turned on. 2000-11-19 Hrvoje Niksic * src/http.c (gethttp): Make the HTTP persistent connections more robust. 2000-11-19 Hrvoje Niksic * src/http.c (connection_available_p): Use it. * src/connect.c (test_socket_open): New function. * src/http.c (gethttp): Support persistent connections. Based on the ideas, and partly on code, by Sam Horrocks . (register_persistent): New function. (connection_available_p): Ditto. (invalidate_connection): Ditto. 2000-11-18 Jan Prikryl * src/ftpparse.c, ftpparse.h: New files. * src/ftp-ls.c (ftp_parse_ls): Use ftp_parse_unix_ls for UNIX servers only. Use ftp_parse_nonunix_ls otherwise. (ftp_parse_nonunix_ls): Stub to the ftpparse library handling all exotic FTP servers. * src/ftp.h (stype): New enum, distinguishes UNIX, VMS, and "other" FTP servers. * src/ftp.c: New static wariables host_type, pwd, and pwd_len. (getftp): Support for VMS. Support for FTP servers that do not place you in the root directory after login. (ftp_retrieve_list): VMS is silent about the real file size, issue a more appropriate message. (ftp_get_listing): Pass host_type to ftp_parse_ls. * src/ftp-basic.c (ftp_pwd, ftp_syst): New functions. 2000-11-18 Jan Prikryl * src/ftpparse.c, ftpparse.h: New files. * src/ftp-ls.c (ftp_parse_ls): Use ftp_parse_unix_ls for UNIX servers only. Use ftp_parse_nonunix_ls otherwise. (ftp_parse_nonunix_ls): Stub to the ftpparse library handling all exotic FTP servers. * src/ftp.h (stype): New enum, distinguishes UNIX, VMS, and "other" FTP servers. * src/ftp.c: New static wariables host_type, pwd, and pwd_len. (getftp): Support for VMS. Support for FTP servers that do not place you in the root directory after login. (ftp_retrieve_list): VMS is silent about the real file size, issue a more appropriate message. (ftp_get_listing): Pass host_type to ftp_parse_ls. * src/ftp-basic.c (ftp_pwd, ftp_syst): New functions. 2000-11-18 Hrvoje Niksic * src/utils.c (slist_append): Eviscerate NOSORT. Hash tables are now used for what the sorted slists used to be used for. (slist_contains): Don't rely on the list being sorted. (slist_append): Simplify the code. * src/recur.c (recursive_cleanup): Use free_string_set. * src/utils.c (string_set_add, string_set_exists, string_set_free): New functions for easier freeing of hash tables whose keys are strdup'ed strings. * src/recur.c (recursive_retrieve): Use the hash table functions for storing undesirable URLs. * src/hash.c: New file. 2000-11-17 Hrvoje Niksic * src/main.c (private_initialize): Call url_init. (main): Call private_initialize. * src/url.c (unsafe_char_table): New table. (UNSAFE_CHAR): Use it. (init_unsafe_char_table): New function. (url_init): New function; call init_unsafe_char_table. 2000-11-16 Hrvoje Niksic * windows/config.h.ms: snprintf and vsnprintf exist under Windows. * windows/Makefile.src: Back out previous change. 2000-11-16 Hrvoje Niksic * src/mswindows.h: Define snprintf and vsnprintf to _snprintf and _vsnprintf respectively. 2000-11-16 Hrvoje Niksic * doc/wget.texi: Use --- consistently. Spell "Wget" with starting capital letter consistently. Use ``...'' or @dfn{} instead of simple double quotes where appropriate. Use double space as separator between sentences consistently. 2000-11-16 Herold Heiko * windows/Makefile.src: Compile in vsnprintf.c. 2000-11-15 Hrvoje Niksic * src/html-url.c (handle_link): Handle HTML fragment identifiers. * src/recur.c (recursive_retrieve): If norobot info is respected and the file is specified not to be followed by robots, respect that. * src/html-url.c (collect_tags_mapper): Handle . For us the important cases are where X is NONE or where X contains NOFOLLOW. (get_urls_html): Propagate that information to the caller. 2000-11-15 Hrvoje Niksic * src/config.h.in: Do the _XOPEN_SOURCE and _SVID_SOURCE things only on Linux. 2000-11-15 Hrvoje Niksic * doc/wget.texi (Robots): Rearrange text. Mention the meta tag. 2000-11-15 Hrvoje Niksic * doc/wget.texi (Robots): Document that we now support the meta tag exclusion. 2000-11-14 Hrvoje Niksic * doc/wget.texi: Add GFDL; remove norobots specification. * doc/wget.texi (Sample Wgetrc): Remove warnings with lateish makeinfo, mostly by changing xref{} to pxref{} when inside parentheses. 2000-11-13 Hrvoje Niksic * src/url.c (convert_links): Unlink the file we might be reading from before writing to it. (convert_links): Use alloca instead of malloc for filename_plus_orig_suffix. 2000-11-12 Hrvoje Niksic * src/host.c (realhost): Add HOST to the list with quality==0 only if it wasn't already there. Based on analysis by Lu Guohan . 2000-11-10 Hrvoje Niksic * src/url.c (get_urls_file): Ditto. (convert_links): Ditto. * src/html-url.c (get_urls_html): Use read_file() instead of load_file(). * src/utils.c (read_file): New function, instead of the old load_file(). (read_file_free): Ditto. * src/url.c (findurl): Search only for the supported protocols. (convert_links): Use fwrite() when writing out a region of characters. 2000-11-10 Hrvoje Niksic * src/init.c (run_wgetrc): Don't bother killing off '\r' since pars_line() skips whitespace at end of line anyway. (parse_line): Oops, it didn't. Now it does. * src/recur.c (parse_robots): Ditto here. * src/ftp-ls.c (ftp_parse_unix_ls): Kill off the newline character manually because read_whole_line no longer does. * src/utils.c (read_whole_line): Rewrite to: a) use less memory (reallocates to needed size after work), b) work faster --> fgets() instead of getc, c) be more correct --> doesn't kill the newline character at the end of line. 2000-11-10 Hrvoje Niksic * src/init.c (comind): Initialize MAX to array size - 1. 2000-11-10 Hrvoje Niksic * src/ftp-ls.c: Move html_quote_string and ftp_index here. * src/url.c: Remove get_urls_html, since that's now in html-url.c. * src/html-url.c: New file. * src/html-parse.c: New file. 2000-11-10 Hrvoje Niksic * doc/wget.texi: cc.fer.hr -> srk.fer.hr. 2000-11-10 Hrvoje Niksic * configure.in: Test for MMAP. 2000-11-08 Hrvoje Niksic * src/url.c (construct): Handle the case where host name is not followed by a slash. 2000-11-08 Hrvoje Niksic * src/url.c (construct): Changed last_slash[-1] to *(last_slash - 1). Suggested by Edward J. Sabol. 2000-11-06 Hrvoje Niksic * src/init.c: commands[] need to be sorted! ("base" wasn't.) 2000-11-05 Hrvoje Niksic * src/wget.h (DO_REALLOC_FROM_ALLOCA): Use braces to disambiguate `if'. 2000-11-05 Hrvoje Niksic * src/url.c (construct): Insert unneeded initialization for the compiler to shut up. * src/config.h.in: Define _XOPEN_SOURCE to 500 to get the prototype for strptime() (*duh*). Define _SVID_SOURCE to get S_IFLNK which otherwise gets lost when you define _XOPEN_SOURCE. * src/utils.c (touch): Include the file name in the error message. From Debian. 2000-11-05 Hrvoje Niksic * src/url.c (construct): Fix comment. (find_last_char): Document. 2000-11-05 Hrvoje Niksic * src/log.c (logvprintf): Use vsnprintf() in all cases. If necessary, resize the buffer to fit the formated message. That way, messages of arbitrary size may be printed. (logvprintf): Use saved_append() to optionally log the last several lines of output. (logputs): Ditto. (log_close): Adapt to new data structures. (log_dump): Ditto. (redirect_output): Print messages to stderr, not to stdout. * src/log.c (saved_append_1): New function. Replaces the old logging system ("log all output until 10M characters") with a new, much more reasonable one ("log last screenful of text"). (saved_append): New function; call saved_append_1. (free_log_line): New function. 2000-11-05 Hrvoje Niksic * doc/wget.texi: Updated names of contributors. 2000-11-05 Hrvoje Niksic * doc/Makefile.in (sample.wgetrc.munged_for_texi_inclusion): Use $(srcdir). 2000-11-04 Hrvoje Niksic * src/snprintf.c: New file. 2000-11-03 Hrvoje Niksic * src/wget.h: If HAVE_STDARG_H is not defined, don't declare argument types to logprintf() and debug_logprintf(). 2000-11-02 Tyler Riddle * src/http.c (known_authentication_scheme_p): Recognize NTML authentication. (create_authorization_line): Treat NTML the same as `Basic'. 2000-11-02 Matthew Seaman * util/rmold.pl: Various fixes. 2000-11-02 Junio Hamano * src/ftp-basic.c (ftp_login): Make comparison case-insensitive. 2000-11-02 Hrvoje Niksic * src/url.c (parseurl): Remove possible reading past the end of sup_protos[]. Spotted by Mark A. Mankins . 2000-11-02 Hrvoje Niksic * src/retr.c (retrieve_url): Free url before returning. (retrieve_url): Free mynewloc before returning. Spotted by Mark A. Mankins . 2000-11-02 Hrvoje Niksic * src/ftp.c (ftp_loop_internal): Hide the password from the URL when printing non-verbose. Problem spotted by Dariusz Mlynarczyk . 2000-11-01 Hrvoje Niksic * src/wget.h (enum): Remove extra space after last enumeration. 2000-11-01 Hrvoje Niksic * src/utils.c (long_to_string): Update with a later, better version. 2000-11-01 Hrvoje Niksic * src/url.c (path_simplify_with_kludge): New function. (path_simplify_with_kludge): Disable it. Instead... (parse_dir): ...make sure that at this point the right thing is done, i.e. that "query" part of the URL (?...) is always assigned to the file, never to the directory portion of the path. 2000-11-01 Hrvoje Niksic * src/url.c (get_urls_html): Decode HTML entities using html_decode_entities. * src/html.c (htmlfindurl): Don't count the `#' in numeric entities (&#NNN;) as an HTML fragemnt. (html_decode_entities): New function. 2000-11-01 Hrvoje Niksic * src/url.c (construct): Rewritten for clarity. Avoids the unnecessary copying and stack-allocation the old version performed. 2000-11-01 Hrvoje Niksic * src/retr.c (retrieve_url): Detect redirection cycles. 2000-11-01 Hrvoje Niksic * src/main.c (main): Use legible_very_long() for printing opt.downloaded. * src/utils.c (legible_1): New function that operates on strings and does the brunt of legible()'s work. (legible): Use legible_1(). (legible_very_long): New function; dump the argument with sprintf(), and call legible_1(). * src/options.h (struct options): Use VERY_LONG_TYPE for opt.downloaded. * src/sysdep.h (VERY_LONG_TYPE): Define it to have a 64-bit or greater type. * src/config.h.in: Make sure that SIZEOF_LONG and SIZEOF_LONG_LONG get defined. Define HAVE_LONG_LONG if long long is available. 2000-11-01 Hrvoje Niksic * src/main.c (main): In case of opt.downloaded overflowing, print instead of a totally bogus random value. * src/retr.c (retrieve_from_file): Ditto. * src/recur.c (recursive_retrieve): Ditto. * src/main.c (main): Ditto. * src/http.c (http_loop): Ditto. * src/ftp.c (ftp_loop_internal): Use downloaded_increase() instead of `+=', and downloaded_exceeds_quota() instead of the simple-minded check. (ftp_retrieve_list): Ditto. (ftp_retrieve_dirs): Ditto. (ftp_retrieve_glob): Ditto. * src/retr.c (downloaded_increase): New function. Notice overflows of opt.downloaded. (downloaded_exceeds_quota): Make sure that opt.downloaded is not used if it overflowed. * src/options.h (struct options): New member downloaded_overflow. 2000-11-01 Hrvoje Niksic * src/html.c (htmlfindurl): Fix recognition of # HTML fragments. 2000-11-01 Hrvoje Niksic * configure.in: Check for size of long and long long. 2000-10-31 Hrvoje Niksic * src/retr.c (retrieve_url): Use url_concat() to handle relative redirections instead of /ad hoc/ code. * src/url.c (url_concat): New function encapsulating weird construct(). (urllen_http_hack): New function. (construct): When constructing new URLs, recognize that `?' does not form part of the file name in HTTP. 2000-10-31 Hrvoje Niksic * src/ftp.c (getftp): Ditto. * src/http.c (gethttp): Rewind the stream when retrying from scratch. 2000-10-30 Dan Harkless * NEWS: Hrvoje pointed out that relative URL grokking deserves mention. 2000-10-27 Dan Harkless * src/retr.c (retrieve_url): Manually applied T. Bharath 's patch to get wget to grok illegal relative URL redirects. Reformatted and re-commented it. 2000-10-27 Dan Harkless * TODO: wget now groks illegal relative URL HTTP redirects. 2000-10-24 Dan Harkless * NEWS: Forgot to update regarding new --bind-address option. 2000-10-23 Hrvoje Niksic * doc/wget.texi (HTTP Options): Remove Netscape bullying. 2000-10-23 Dan Harkless * src/connect.c (make_connection and bindport): Manually applied Rob Mayoff 's 1.5.3 patch to add --bind-address, changing coding style to GNU's. * src/ftp.c (ftp_loop_internal): --delete-after wasn't implemented for files downloaded via FTP. Per a comment, .listing files were not counted towards number of bytes and files downloaded because they're deleted anyway. Well, they aren't under -nr, so count them then. * src/init.c: Manually applied Rob Mayoff's 1.5.3 patch to add --bind-address, alphabetizing, changing coding style to GNU's, commenting, and renaming cmd_ip_address() to cmd_address() to imply hostnames also okay. * src/main.c (main): --delete-after didn't delete the root of the tree. Ignore --convert-links if --delete-after was specified. Manually applied Rob Mayoff's 1.5.3 patch to add --bind-address, fixing duplicate use of added-since-1.5.3 case value. (print_help): Clarified that --delete-after deletes local files. Rob forgot to add a line for his new --bind-address option. * src/options.h (struct options): Manually applied Rob Mayoff's patch to add --bind-address (bind_address structure member). * src/recur.c (recursive_retrieve): Improved comment; added DEBUGP(). Ignore --convert-links if --delete-after was specified. * src/retr.c (retrieve_from_file): Just added a DEBUGP(). 2000-10-23 Dan Harkless * doc/wget.texi (Recursive Retrieval Options): Improved --delete-after docs. (Download Options): Documented Rob Mayoff's new --bind-address option. (Wgetrc Commands): Documented Rob Mayoff's new bind_address command. 2000-10-20 Dan Harkless * doc/wget.texi (Recursive Retrieval Options): Sugg. -E on 1-page download. 2000-10-20 Dan Harkless * TODO: -k needs to convert '?' to "%3F" in links to saved files containing the '?' character (e.g. CGI output). Also, we need to check the HTTP spec w.r.t. simplification of absolute URLs. Generalize --html-extension to something like --mime-extension. * MAILING-LIST: I didn't realize allowed posting by non-subscribers. soon to be an alias for it. * NEWS: Always forget to update this file when making user-vis. changes. 2000-10-19 Dan Harkless * src/ftp.c (ftp_loop_internal): downloaded_file() enumerators changed. (getftp): Applied Piotr Sulecki 's patch to work around FTP servers that incorrectly respond to the "REST" command with the remaining size rather than the total file size. * src/http.c (gethttp): Improved a comment and added code to tack on ".html" to text/html files without that extension when -E specified. (http_loop): Use new downloaded_file() enumerators and deal with the case of gethttp() called xrealloc() on u->local. * src/init.c (commands): Added new "htmlextension" command. Also renamed John Daily's cmd_quad() to the more descriptive cmd_lockable_boolean(), alpha-sorted the CMD_DECLARE()s and removed duplicate cmd_boolean() declaration. * src/main.c (print_help): Added my new -E / --html-extension option. (main): Undocumented --email-address option previously used -E synonym. Stole it away for the much more deserving --html-extension's use. * src/options.h (struct options): Added html_extension field. * src/url.c (convert_links): URL X that we saved as X.html locally due to -E needs to be backed up as X.orig, not X.html.orig. Added comments. (downloaded_file): Now remembers if we added .html extension to a file. * src/url.h (downloaded_file_t): Added extra enumerators to support above. (downloaded_file): Now takes and returns a downloaded_file_t. * src/wget.h (unnamed "dt" enum): Added ADDED_HTML_EXTENSION enumerator. 2000-10-19 Dan Harkless * doc/wget.texi (HTTP Options): Documented my new -E / --html-extension. (Wgetrc Commands): Documented my new html_extension option and John Daily's "quad" values (which I renamed to "lockable Boolean"). When I documented Damir Dzeko's --referer, I forgot to add the .wgetrc equivalent; mentioned the "referrer" spelling issue. 2000-10-19 Dan Harkless * TODO: -E / --html-extension / html_extension has been implemented. Make -I and -X allow an optional hostname before the directory name? When simplifying paths, wget needs to stop at any '?' character. * configure.in: Put "it" language in proper alphabetical order and added new languages "pl" and "ru". * po/pl.{gmo,po}: Added Grzegorz Kowal 's Polish message translation file. * po/ru.{gmo,po}: Added Const Kaplinsky 's Russian message translation file. 2000-10-16 Dan Harkless * TODO: Add option to save local filenames without extra %-encoding. 2000-10-13 Adrian Aichner * src/retr.c: Add msec timing support for WINDOWS. * src/retr.c (reset_timer): GetSystemTime() on WINDOWS. * src/retr.c (elapsed_time): Calculate delta time to msec on WINDOWS. 2000-10-09 Dan Harkless * src/html.c (htmlfindurl): Added unneeded initialization to quiet warning. * src/main.c (print_help): Clarified what --retr-symlinks does. 2000-10-09 Dan Harkless * doc/wget.texi (FTP Options): --retr-symlinks wasn't documented properly. 2000-10-09 Dan Harkless * TODO: --retr-symlinks should cause wget to traverse links to dirs too. 2000-09-25 Dan Harkless * TODO: Make wget return nonzero in situations like bad HTTP auth. Make wget follow (illegal) relative URL HTTP redirects. 2000-09-15 John Daily * src/init.c: Add support for "always" and "never" values to allow .wgetrc to override commandline (useful e.g. with .pm files calling `wget --passive-ftp' when your firewall doesn't allow that). * src/ftp.c (getftp): passive_ftp is first option to support always/never. 2000-08-30 Dan Harkless * src/ftp.c (ftp_retrieve_list): Use new INFINITE_RECURSION #define. * src/html.c: htmlfindurl() now takes final `dash_p_leaf_HTML' parameter. Wrapped some > 80-column lines. When -p is specified and we're at a leaf node, do not traverse , , or tags other than . * src/html.h (htmlfindurl): Now takes final `dash_p_leaf_HTML' parameter. * src/init.c: Added new -p / --page-requisites / page_requisites option. * src/main.c (print_help): Clarified that -l inf and -l 0 both allow infinite recursion. Changed the unhelpful --mirrior description to simply give the options it's equivalent to. Added new -p option. (main): Added some comments; handle new -p / --page-requisites. * src/options.h (struct options): Added new page_requisites field. * src/recur.c: Changed "URL-s" to "URLs" and "HTML-s" to "HTMLs". Calculate and pass down new `dash_p_leaf_HTML' parameter to get_urls_html(). Use new INFINITE_RECURSION #define. * src/retr.c: Changed "URL-s" to "URLs". get_urls_html() now takes final `dash_p_leaf_HTML' parameter. * src/url.c: get_urls_html() and htmlfindurl() now take final `dash_p_leaf_HTML' parameter. * src/url.h (get_urls_html): Now takes final `dash_p_leaf_HTML' parameter. * src/wget.h: Added some comments and new INFINITE_RECURSION #define. 2000-08-30 Dan Harkless * po/*.{gmo,po,pot}: Regenerated after modifying wget --help output. * MACHINES: Previously said to send updates to "me" (Hrvoje) -- now says to email the mailing list or bug-wget@gnu.org. * MAILING-LIST: Added mention of bug-wget@gnu.org. * NEWS: Added --waitretry and --page-requisites. 2000-08-30 Dan Harkless * doc/wget.texi (Recursive Retrieval Options): Documented new -p option. (Wgetrc Commands): Documented -p's equvialent, page_requisites. 2000-08-25 Dan Harkless * MACHINES: Alphabetized, changed "architectures" to "OSes and architectures", added missing company names, removed needless ^L, made AIX and FreeBSD entries more general to reflect successful use on those platforms by myself and others, removed the non-factual "this version of", and fixed some grammatical errors. 2000-08-23 Dan Harkless * src/main.c (print_help): -B / --base was not mentioned. 2000-08-23 Dan Harkless * doc/wget.texi (Download Options): Using -c on a file that's already fully downloaded results in an unchanged file and no second ".1" copy. * doc/wget.texi (Logging and Input File Options): -B / --base was not documented as a separate item, and the .wgetrc version was misleading. * doc/wget.texi (Wgetrc Commands): Changed all instances of ", the same as" to the more grammatical " -- the same as". 2000-08-22 Dan Harkless * src/main.c (print_help): Modified -nc description to mention that it also prevents the creation of multiple versions of the same file with "." suffixes. 2000-08-22 Dan Harkless * doc/wget.texi (Download Options): --no-clobber's documentation was severely lacking -- ameliorated the situation. Some of the previously-undocumented stuff (like the multiple-file-version numeric-suffixing) that's now mentioned for the first (and only) time in the -nc documentation should probably be mentioned elsewhere, but due to the way that wget.texi's hierarchy is laid out, I had a hard time finding anywhere else appropriate. 2000-07-21 Dan Harkless * TODO: But Brian McMahon wants old behavior as an option. 2000-07-19 Dan Harkless * TODO: -k should convert "hostless absolute" URLs, like "/index.html". 2000-07-17 Dan Harkless * doc/wget.texi (HTTP Options): Minor clarification in "download a single HTML page and all files necessary to display it" example. 2000-07-14 Jan Prikryl * src/retr.c (retrieve_url): Consistently strdup opt.referer when setting u->referer. 2000-06-09 Hrvoje Niksic * src/url.c (encode_string): Fix comment. Suggested by Herold Heiko . 2000-06-09 Dan Harkless * src/main.c (print_help): --help output for --waitretry was over 80 cols. 2000-06-01 Hrvoje Niksic * src/url.c (str_url): Print the port number only if it's different from the default port number for that protocol. 2000-06-01 Const Kaplinsky * src/ftp.c (ftp_retrieve_list): Change permissions only on plain files. 2000-05-24 Dan Harkless * TODO: Timestamps sometimes not copied over on files retrieved by FTP. 2000-05-22 Dan Harkless * src/main.c (print_help): Added --help line for Damir Dzeko 's until-now-undocumented --referer option. Removed comments that --referer and --waitretry were undocumented. Changed "`.wgetrc' command" to "`.wgetrc'-style command" on --help line for --execute. 2000-05-22 Dan Harkless * doc/wget.texi (HTTP Options): Damir Dzeko did not document his new --referer option. Did so. 2000-05-22 Dan Harkless * AUTHORS: Added myself to this file, as Hrvoje got confirmation of my FSF copyright assignment. * TODO: Added note that fragment identifiers don't work properly. * po/*.{gmo,po,pot}: Regenerated after modifying wget --help output. 2000-05-18 Hrvoje Niksic * src/ftp.c (getftp): Ditto. * src/http.c (gethttp): Check for return value of fclose/fflush. 2000-05-17 Dan Harkless * TODO: Make `-k' check for files that were downloaded in the past and convert links to them in newly-downloaded documents. 2000-04-18 Dan Harkless * doc/sample.wgetrc: Realized I put a global setting in the local section. 2000-04-13 Dan Harkless * doc/Makefile.in (sample.wgetrc.munged_for_texi_inclusion): Added build, dependencies, and distclean cleanup of this new file. * doc/sample.wgetrc: Uncommented waitretry and set it to 10, clarified some wording, and re-wrapped some text to 71 columns due to @sample indentation in wget.texi. * doc/wget.texi: Herold further expounded on the behavior of waitretry -- reworded docs again. Changed note saying _all_ lines in sample.wgetrc are commented out. Don't have an entire hand- cut-and-pasted copy of sample.wgetrc in this file -- use @include. 2000-04-12 Hrvoje Niksic * src/http.c (gethttp): Don't free REQUEST -- it was allocated with alloca(). Pointed out by Gisle Vanem . 2000-04-12 Hrvoje Niksic * src/host.c (store_hostaddress): Instead of shifting ADDR, start copying from the correct address. 2000-04-12 Dan Harkless * doc/Makefile.in (install.wgetrc): I completely missed the message that the new wgetrc wasn't being installed the first couple of times I ran `make install' after changing sample.wgetrc. Added blank lines around the message and a "", and reworded the message to be a bit more clear. * doc/sample.wgetrc: Added entries for backup_converted and waitretry. * doc/wget.texi (Download Options and Wgetrc Commands): Herold Heiko 's new --waitretry option was undocumented until now. Reworded the suggested documentation he sent to the list. 2000-04-05 Dan Harkless * TODO: Make -K only leave .orig files around when different. Add an option to save all text/html files with .html extension. Allow mirroring of FTP URLs where logging in puts you somewhere else besides '/'. 2000-04-04 Dan Harkless * src/host.c (store_hostaddress): R. K. Owen's patch introduces a "left shift count >= width of type" warning on 32-bit architectures. Got rid of it by tricking the compiler w/ a variable. * src/url.c (UNSAFE_CHAR): The macro didn't include all the illegal characters per RFC1738, namely everything above '~'. It also generated a warning on OSes where char =~ unsigned char. Fixed. 2000-04-04 Dan Harkless * NEWS (--follow-tags, -G / --ignore-tags): Forgot to mention these new options when I added them. 2000-03-31 Hrvoje Niksic * src/Use TOUPPER/TOLOWER. 2000-03-21 Hrvoje Niksic * src/wget.h (DO_REALLOC_FROM_ALLOCA): Ditto. * src/sysdep.h (ISALNUM): New macro. (TOLOWER): Ditto. (TOUPPER): Ditto. 2000-03-10 Dan Harkless * src/html.c (idmatch): Implemented checking of my new --follow-tags and --ignore-tags options. * src/init.c (commands): Added comment reminding people adding new entries doing allocation to add corresponding freeing in cleanup(). (commands): Added new followtags and ignoretags commands. (cleanup): Free storage for new followtags and ignoretags. * src/main.c: Use of "comma-separated list" was random -- normalized it. Did some alphabetization. Added comments pointing out "Options without arguments" and "Options accepting an argument" sections of long_options[]. Added new options --follow-tags and -G / --ignore-tags. Added comment that Damir's --referer is currently undocumented. Added comment that Heiko's --waitretry is partially undocumented (mentioned in --help but not in wget.texi). Moved improperly sorted 24, 129, and 'G' cases. * src/options.h (struct options): Added new fields follow_tags and ignore_tags. * src/wget.h: Added "#define EQ 0" so we can say "strcmp(a, b) == EQ". 2000-03-10 Dan Harkless * doc/wget.texi (Recursive Retrieval Options): In -K description, added a link to the discussion of interaction with -N. (Recursive Accept/Reject Options): Did some alphabetizing and added descriptions of new --follow-tags and -G / --ignore-tags options. (Following Links): Changed "the loads of" to "loads of". (Wgetrc Commands): Added descriptions of new follow_tags and ignore_tags commands. 2000-03-10 Dan Harkless * TODO: Removed done item: we now have an option (-G) that makes it easy to download a single HTML document and all its constituents. * po/*.{gmo,po,pot}: Regenerated after adding new options. * po/hr.po: Hrvoje forgot '\n's on his translations of my altered messages, causing msgfmt to balk and `make install' to fail. 2000-03-02 Hrvoje Niksic * doc/wget.texi (Contributors): Update contributors list. 2000-03-02 HIROSE Masaaki * src/html.c (html_allow): Add and