Commit graph

1892 commits

Author SHA1 Message Date
Bastian Kleineidam
7ad4f7c220 Compare size from meta info and content data. 2010-07-29 19:53:41 +02:00
Bastian Kleineidam
8413b427e9 Rename some warnings, and add size inequality warning. 2010-07-29 19:53:15 +02:00
Bastian Kleineidam
7d076e7129 Add get_size() method for local files. 2010-07-29 19:52:26 +02:00
Bastian Kleineidam
7536472797 Send correct host header when using http proxy. 2010-07-29 06:50:35 +02:00
Bastian Kleineidam
41e2e1a448 Add new warning to warning list. 2010-07-28 13:47:58 +02:00
Bastian Kleineidam
d9bfd25a68 Add warning if content size is zero 2010-07-28 08:19:55 +02:00
Bastian Kleineidam
7802bbbbc4 Use _WIN32 instead of WIN32 to detect Windows systems. 2010-03-31 08:45:21 +02:00
Bluebird75
28f4514b67 Use object with __slots__ for wire-format of UrlBase objects.
Saves memory since UrlBase wire-format objects are used for
logging and thus often created.

Signed-off-by: Bastian Kleineidam <calvin@debian.org>
2010-03-27 00:07:19 +01:00
Bastian Kleineidam
4e1b6d667e Set copyright. 2010-03-26 20:51:59 +01:00
Bastian Kleineidam
7457128e64 Add some, but still incomplete, regexes for obfuscated IP addresses. 2010-03-26 20:51:44 +01:00
Bastian Kleineidam
3370ea1562 Reflect changes in httplib2.py: use buffered read in httplib response object and use bad status line exception attribute. 2010-03-26 20:50:38 +01:00
Bastian Kleineidam
c4c098bd83 pep8-ify the source a little more 2010-03-13 08:47:12 +01:00
Bastian Kleineidam
c83a79c0ef Added buffering for http responses from upstream. 2010-03-13 02:29:01 +01:00
Bastian Kleineidam
86ca7d0dba Do not break long words when text wrapping. 2010-03-11 21:50:23 +01:00
Bastian Kleineidam
0b7badc238 Do not quote slashes in query values. 2010-03-11 20:19:31 +01:00
Bastian Kleineidam
bf1dc2a585 Remove unused import. 2010-03-10 01:28:39 +01:00
Bastian Kleineidam
5b5a62f6d5 Updated copyright 2010-03-10 00:05:05 +01:00
Bastian Kleineidam
37b4e97012 Revert "Only parse anchors if both --anchors option is given and the current link has an anchor."
This reverts commit b238527d54.
2010-03-10 00:04:02 +01:00
Bastian Kleineidam
55f476b0dd Merge branch 'master' of ssh://linkchecker.git.sourceforge.net/gitroot/linkchecker/linkchecker 2010-03-10 00:03:09 +01:00
Bastian Kleineidam
57e3b05c88 limit cache sizes 2010-03-10 00:00:12 +01:00
Bastian Kleineidam
b238527d54 Only parse anchors if both --anchors option is given and the current link has an anchor. 2010-03-09 11:45:50 +01:00
Bastian Kleineidam
57397e938b Improved linkname parsing by adding a new peek() HTML parser function. 2010-03-09 11:31:12 +01:00
Bastian Kleineidam
074b5ded32 Support UTF-8 encoded filenames in FTP servers. 2010-03-09 08:15:29 +01:00
Bastian Kleineidam
c88791b815 Fix support for non-standard FTP ports. 2010-03-09 07:49:05 +01:00
Bastian Kleineidam
72f880e918 Fix the win32com cache generation. 2010-03-08 12:27:55 +01:00
Bastian Kleineidam
e1482b3eab Add detection for py2exe 2010-03-08 12:12:28 +01:00
Bastian Kleineidam
51a0ef0ad4 Speed up HTML parsing by stopping early and adding callbacks. 2010-03-08 09:04:33 +01:00
Bastian Kleineidam
e5599cce26 Fixed DNS changelog name. 2010-03-08 08:05:55 +01:00
Bastian Kleineidam
7c15d28f56 Prevent UnicodeDecodeError in robots.txt parsing. 2010-03-07 22:49:25 +01:00
Bastian Kleineidam
b8b0398dd2 Ensure redirected URL is Unicode encoded. 2010-03-07 22:11:55 +01:00
Bastian Kleineidam
c8e6995ecd Support HTTPS proxies. 2010-03-07 21:06:10 +01:00
Bastian Kleineidam
1e15e55689 Fix errors in Word file parsing. 2010-03-07 19:43:08 +01:00
Bastian Kleineidam
6a2fcf8ae9 Parse links in Word files. 2010-03-07 19:20:51 +01:00
Bastian Kleineidam
34a2f4a15d Disable and deprecate the --no-proxy-for option. 2010-03-07 17:45:48 +01:00
Bastian Kleineidam
b08524aa5f Updated dnspython module from upstream. 2010-03-07 16:40:17 +01:00
Bastian Kleineidam
796cf0a7cd Updated copyright year 2010-03-07 11:59:18 +01:00
Bastian Kleineidam
049d989c65 Move msgfmt.py into po/ subdir 2010-03-07 11:56:25 +01:00
Bastian Kleineidam
1bf004dafe Updated copyright year 2010-03-07 11:10:53 +01:00
Bastian Kleineidam
af6cb287d7 Only warn about missing emails in mailto: URLs. 2010-03-07 10:43:29 +01:00
Bastian Kleineidam
3d5c114f14 Warn on permanent redirections even when URL is outside of domain filter. 2010-03-07 09:36:21 +01:00
Bastian Kleineidam
09623f066c Remove trailing spaces. 2010-03-06 11:03:25 +01:00
Bastian Kleineidam
2d73b907f1 Retry HTTP when server sent empty status line; should fix most of the BadStatusLine errors that are sporadically encountered. 2010-03-06 10:23:34 +01:00
Bastian Kleineidam
280a675698 Distinguish the empty status line from other BadStatusLine errors. 2010-03-06 10:08:08 +01:00
Bastian Kleineidam
bee8023540 Fixed URL encoding 2010-02-22 01:06:19 +01:00
Bastian Kleineidam
92272b9af9 Added missing license info to some files. 2009-12-02 21:39:29 +01:00
Bastian Kleineidam
77daf80e82 Add url encoding parameter 2009-11-28 11:56:35 +01:00
Bastian Kleineidam
d404b0b1bb Use /dev/urandom to avoid waiting for /dev/random 2009-11-21 12:13:49 +01:00
Bastian Kleineidam
66da1316bd Remove version comment in Changelog 2009-09-23 00:33:33 +02:00
Bastian Kleineidam
153c241049 Use i18n.default_encoding instead of a hardcoded one of iso-8859-1. 2009-08-22 12:21:50 +02:00
Bastian Kleineidam
92e1f90117 Use locale encoding as default, even if the language is not supported 2009-08-22 10:09:41 +02:00