Commit graph

1975 commits

Author SHA1 Message Date
Bastian Kleineidam
b06638c769 Add more GUI context menu actions. 2010-09-30 07:32:39 +02:00
Bastian Kleineidam
2bae9e8679 Updated copyright 2010-09-29 19:26:16 +02:00
Bastian Kleineidam
e0f4097eb0 Ensure HttpUrl.set_title_from_content() is only called when the content is allowed to be retrieved. 2010-09-29 19:26:03 +02:00
Bastian Kleineidam
840538d12a Remove unneeded check for HTML content. 2010-09-29 19:25:14 +02:00
Bastian Kleineidam
279a1eae70 Only add geoip info for non-empty hostnames. 2010-09-29 15:59:57 +02:00
Bastian Kleineidam
cc848cdb33 Fix import for moved geoip module. 2010-09-29 15:17:27 +02:00
Bastian Kleineidam
c59bbae587 Remove unused import and move geoip module from the cache module into base linkcheck module. 2010-09-29 15:15:21 +02:00
Bastian Kleineidam
6292ec54fa Catch GeoIP lookup errors; ensure GeoIP information is Unicode. 2010-09-29 15:04:37 +02:00
Bastian Kleineidam
3795a91a99 Fix compatibility problem with logging.StreamHandler in Python 2.7 2010-09-29 14:41:38 +02:00
Bastian Kleineidam
473c834f0c Do not crash when geoip information is None. 2010-09-29 14:19:28 +02:00
Bastian Kleineidam
8995be1739 Support city-level geoip lookup; remove the geoip cache since lookup is fast enough; remove the duplicated geoip country name map. 2010-09-29 14:10:36 +02:00
Bastian Kleineidam
5a32bf4a21 Updated copyright 2010-09-29 08:25:00 +02:00
Bastian Kleineidam
1446797020 Support pygeoip 2010-09-29 08:24:30 +02:00
Bastian Kleineidam
5576440f10 Add context menu item to copy URL to system clipboard. 2010-09-28 20:34:20 +02:00
Bastian Kleineidam
d202661ff7 Give example binary path for QT designer. 2010-09-28 20:20:33 +02:00
Bastian Kleineidam
b90c832ab1 Clarify documentation. 2010-09-28 20:20:08 +02:00
Bastian Kleineidam
5fd8ea1d92 Use explicit variable for URL string. 2010-09-28 20:19:19 +02:00
Bastian Kleineidam
e996f0d98e Enable list widget items, making them selectable. 2010-09-28 20:18:39 +02:00
Bastian Kleineidam
d5707ef205 Change registry base string to avoid false spyware alarms. 2010-09-12 18:49:57 +02:00
Bastian Kleineidam
ffcd274087 Updated copyright 2010-09-05 21:02:51 +02:00
Bastian Kleineidam
8a1ac26c85 Warn about obfuscated IP numbers. 2010-09-05 20:11:02 +02:00
Bastian Kleineidam
5284017d67 Only fall back to HTTP GET when robots.txt allows it. 2010-09-04 18:09:59 +02:00
Bastian Kleineidam
fb67df662c Use repr() for robotparser debug. 2010-09-04 18:01:12 +02:00
Bastian Kleineidam
8a074aeea9 Work around Python 2.6+ urljoin bug. 2010-08-31 09:16:24 +02:00
Bastian Kleineidam
c3b8ff00b3 Check content and recursion in one try/except to avoid multiple errors when getting page content. 2010-08-31 06:52:08 +02:00
Bastian Kleineidam
60f7af4598 Allow redirections to external URLs with same domain. 2010-08-13 01:22:18 +02:00
Bastian Kleineidam
1faedafb33 Fix data size for HTTP requests. 2010-08-04 00:06:25 +02:00
Bastian Kleineidam
851e1121e9 Use semicolon as default CSV separator. 2010-07-31 22:30:11 +02:00
Bastian Kleineidam
76f7f6b6a3 Prefer anchor element content as name instead of title attribute. 2010-07-30 21:03:04 +02:00
Bastian Kleineidam
c086f49cea Catch KeyError when quoting URLs of index.html. 2010-07-30 20:12:52 +02:00
Bastian Kleineidam
4678802a81 Do not truncate UNC filepaths 2010-07-30 20:07:11 +02:00
Bastian Kleineidam
761b292e37 Added skype: to list of recognized but ignored URL schemes. 2010-07-29 20:26:04 +02:00
Bastian Kleineidam
0f92b76290 Remove the unnormed URL warning. 2010-07-29 20:20:59 +02:00
Bastian Kleineidam
7ad4f7c220 Compare size from meta info and content data. 2010-07-29 19:53:41 +02:00
Bastian Kleineidam
8413b427e9 Rename some warnings, and add size inequality warning. 2010-07-29 19:53:15 +02:00
Bastian Kleineidam
7d076e7129 Add get_size() method for local files. 2010-07-29 19:52:26 +02:00
Bastian Kleineidam
7536472797 Send correct host header when using http proxy. 2010-07-29 06:50:35 +02:00
Bastian Kleineidam
41e2e1a448 Add new warning to warning list. 2010-07-28 13:47:58 +02:00
Bastian Kleineidam
d9bfd25a68 Add warning if content size is zero 2010-07-28 08:19:55 +02:00
Bastian Kleineidam
7802bbbbc4 Use _WIN32 instead of WIN32 to detect Windows systems. 2010-03-31 08:45:21 +02:00
Bluebird75
28f4514b67 Use object with __slots__ for wire-format of UrlBase objects.
Saves memory since UrlBase wire-format objects are used for
logging and thus often created.

Signed-off-by: Bastian Kleineidam <calvin@debian.org>
2010-03-27 00:07:19 +01:00
Bastian Kleineidam
4e1b6d667e Set copyright. 2010-03-26 20:51:59 +01:00
Bastian Kleineidam
7457128e64 Add some, but still incomplete, regexes for obfuscated IP addresses. 2010-03-26 20:51:44 +01:00
Bastian Kleineidam
3370ea1562 Reflect changes in httplib2.py: use buffered read in httplib response object and use bad status line exception attribute. 2010-03-26 20:50:38 +01:00
Bastian Kleineidam
c4c098bd83 pep8-ify the source a little more 2010-03-13 08:47:12 +01:00
Bastian Kleineidam
c83a79c0ef Added buffering for http responses from upstream. 2010-03-13 02:29:01 +01:00
Bastian Kleineidam
86ca7d0dba Do not break long words when text wrapping. 2010-03-11 21:50:23 +01:00
Bastian Kleineidam
0b7badc238 Do not quote slashes in query values. 2010-03-11 20:19:31 +01:00
Bastian Kleineidam
bf1dc2a585 Remove unused import. 2010-03-10 01:28:39 +01:00
Bastian Kleineidam
5b5a62f6d5 Updated copyright 2010-03-10 00:05:05 +01:00
Bastian Kleineidam
37b4e97012 Revert "Only parse anchors if both --anchors option is given and the current link has an anchor."
This reverts commit b238527d54.
2010-03-10 00:04:02 +01:00
Bastian Kleineidam
55f476b0dd Merge branch 'master' of ssh://linkchecker.git.sourceforge.net/gitroot/linkchecker/linkchecker 2010-03-10 00:03:09 +01:00
Bastian Kleineidam
57e3b05c88 limit cache sizes 2010-03-10 00:00:12 +01:00
Bastian Kleineidam
b238527d54 Only parse anchors if both --anchors option is given and the current link has an anchor. 2010-03-09 11:45:50 +01:00
Bastian Kleineidam
57397e938b Improved linkname parsing by adding a new peek() HTML parser function. 2010-03-09 11:31:12 +01:00
Bastian Kleineidam
074b5ded32 Support UTF-8 encoded filenames in FTP servers. 2010-03-09 08:15:29 +01:00
Bastian Kleineidam
c88791b815 Fix support for non-standard FTP ports. 2010-03-09 07:49:05 +01:00
Bastian Kleineidam
72f880e918 Fix the win32com cache generation. 2010-03-08 12:27:55 +01:00
Bastian Kleineidam
e1482b3eab Add detection for py2exe 2010-03-08 12:12:28 +01:00
Bastian Kleineidam
51a0ef0ad4 Speed up HTML parsing by stopping early and adding callbacks. 2010-03-08 09:04:33 +01:00
Bastian Kleineidam
e5599cce26 Fixed DNS changelog name. 2010-03-08 08:05:55 +01:00
Bastian Kleineidam
7c15d28f56 Prevent UnicodeDecodeError in robots.txt parsing. 2010-03-07 22:49:25 +01:00
Bastian Kleineidam
b8b0398dd2 Ensure redirected URL is Unicode encoded. 2010-03-07 22:11:55 +01:00
Bastian Kleineidam
c8e6995ecd Support HTTPS proxies. 2010-03-07 21:06:10 +01:00
Bastian Kleineidam
1e15e55689 Fix errors in Word file parsing. 2010-03-07 19:43:08 +01:00
Bastian Kleineidam
6a2fcf8ae9 Parse links in Word files. 2010-03-07 19:20:51 +01:00
Bastian Kleineidam
34a2f4a15d Disable and deprecate the --no-proxy-for option. 2010-03-07 17:45:48 +01:00
Bastian Kleineidam
b08524aa5f Updated dnspython module from upstream. 2010-03-07 16:40:17 +01:00
Bastian Kleineidam
796cf0a7cd Updated copyright year 2010-03-07 11:59:18 +01:00
Bastian Kleineidam
049d989c65 Move msgfmt.py into po/ subdir 2010-03-07 11:56:25 +01:00
Bastian Kleineidam
1bf004dafe Updated copyright year 2010-03-07 11:10:53 +01:00
Bastian Kleineidam
af6cb287d7 Only warn about missing emails in mailto: URLs. 2010-03-07 10:43:29 +01:00
Bastian Kleineidam
3d5c114f14 Warn on permanent redirections even when URL is outside of domain filter. 2010-03-07 09:36:21 +01:00
Bastian Kleineidam
09623f066c Remove trailing spaces. 2010-03-06 11:03:25 +01:00
Bastian Kleineidam
2d73b907f1 Retry HTTP when server sent empty status line; should fix most of the BadStatusLine errors that are sporadically encountered. 2010-03-06 10:23:34 +01:00
Bastian Kleineidam
280a675698 Distinguish the empty status line from other BadStatusLine errors. 2010-03-06 10:08:08 +01:00
Bastian Kleineidam
bee8023540 Fixed URL encoding 2010-02-22 01:06:19 +01:00
Bastian Kleineidam
92272b9af9 Added missing license info to some files. 2009-12-02 21:39:29 +01:00
Bastian Kleineidam
77daf80e82 Add url encoding parameter 2009-11-28 11:56:35 +01:00
Bastian Kleineidam
d404b0b1bb Use /dev/urandom to avoid waiting for /dev/random 2009-11-21 12:13:49 +01:00
Bastian Kleineidam
66da1316bd Remove version comment in Changelog 2009-09-23 00:33:33 +02:00
Bastian Kleineidam
153c241049 Use i18n.default_encoding instead of a hardcoded one of iso-8859-1. 2009-08-22 12:21:50 +02:00
Bastian Kleineidam
92e1f90117 Use locale encoding as default, even if the language is not supported 2009-08-22 10:09:41 +02:00
Bastian Kleineidam
c52d15fd4d Improve search for lccollection.qhc on Windows systems 2009-07-31 07:18:25 +02:00
Bastian Kleineidam
3fd3ee90ec Fix GUI documentation install 2009-07-29 21:53:52 +02:00
Bastian Kleineidam
2a4f76be3b Moved HTML documentation files 2009-07-26 23:31:57 +02:00
Bastian Kleineidam
5cd7b84596 Allow digits at end of domain names in safe domain check. 2009-07-26 23:16:42 +02:00
Bastian Kleineidam
8533ade21f Add ID for each logged URL. 2009-07-26 22:31:51 +02:00
Bastian Kleineidam
5e06b6b8d4 Updated FSF address in GPL blurb 2009-07-24 23:58:20 +02:00
Bastian Kleineidam
a82e2236ae Copy system config to user config 2009-07-24 23:16:12 +02:00
Bastian Kleineidam
e6f43b6822 Fixed the no_proxy handling and added changelog entry 2009-07-24 07:19:49 +02:00
Bastian Kleineidam
d666e63d19 Print the noproxy variable in system info string 2009-07-22 23:41:46 +02:00
Bastian Kleineidam
fd610ba350 Encode spaces with %20 instead of + 2009-07-22 22:52:40 +02:00
Bastian Kleineidam
34219e2a12 Improved logging message. 2009-07-22 22:29:48 +02:00
Bastian Kleineidam
3121e3385e Use direct HTML documentation for the GUI client; moved the homepage content to a separate package. 2009-07-20 18:33:09 +02:00
Bastian Kleineidam
9faa7d33d2 Fix Python2.6 deprecation warnings 2009-06-26 23:19:36 +02:00
Bastian Kleineidam
4389cc28eb Fix Python2.6 deprecation warnings 2009-06-26 22:43:18 +02:00
Bastian Kleineidam
7f67027abf Ignore the fragment part (i.e. the anchor) of URIs when getting and caching content 2009-06-26 07:22:36 +02:00
Bastian Kleineidam
00c15eef6a Minor code cleanups 2009-06-22 20:40:00 +02:00
Bastian Kleineidam
62922d48e8 Fix previously introduced config parsing error. 2009-06-18 20:29:31 +02:00