Antoine Beaupré
d51b7f34b6
Merge branch '9.3.x'
2017-01-31 19:21:22 -05:00
Antoine Beaupré
da8cecd83c
Merge remote-tracking branch 'anarcat/norobots'
2017-01-31 11:34:09 -05:00
Antoine Beaupré
bf45fb1884
fix HTTPS URL checks
...
in Debian Jessie, linkchecker fails because of an API problem.
it completely breaks HTTPs checks.
this patch fixes the problem
from https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=772947
2017-01-31 11:25:45 -05:00
Bastian Kleineidam
1e291afdfa
Fix python requests version check
2017-01-31 11:25:38 -05:00
Antoine Beaupré
46d96d0aa0
fix HTTPS URL checks
...
in Debian Jessie, linkchecker fails because of an API problem.
it completely breaks HTTPs checks.
this patch fixes the problem
from https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=772947
2016-09-30 11:20:38 -04:00
Bastian Kleineidam
c2ce810c3f
Fix python requests version check
2016-06-28 21:55:10 +02:00
Antoine Beaupré
9d899d1dfa
add --no-robots commandline flag
...
While this flag can be abused, it seems to me like a legitimate use
case that you want to check a fairly small document for mistakes,
which includes references to a website which has a robots.txt that
denies all robots. It turns out that most websites do *not* add a
permission for LinkCheck to use their site, and some sites, like the
Debian BTS for example, are very hostile with bots in general.
Between me using linkcheck and me using my web browser to check those
links one by one, there is not a big difference. In fact, using
linkcheck may be *better* for the website because it will use HEAD
requests instead of a GET, and will not fetch all page elements
(javascript, images, etc) which can often be fairly big.
Besides, hostile users will patch the software themselves: it took me
only a few minutes to disable the check, and a few more to make that
into a proper patch.
By forcing robots.txt without any other option, we are hurting our
good users and not keeping hostile users from doing harm.
The patch is still incomplete, but works. It lacks: documentation and
unit tests.
Closes : #508
2016-05-19 14:43:59 -04:00
Bastian Kleineidam
0ef00eea56
Move GUI files to separate project
2016-01-23 13:28:15 +01:00
Bastian Kleineidam
549533d701
Improved debugging
2016-01-19 21:55:50 +01:00
wummel
a40c39be59
Merge pull request #560 from xvadim/feature
...
Added plugin for parsing and checking links in Markdown files
2016-01-19 07:30:34 +01:00
wummel
e2556abbb6
Merge pull request #561 from nbigaouette/issue555
...
Detect if "url_data" contains proxy attributes before using them.
2016-01-17 21:59:35 +01:00
Bastian Kleineidam
3d711666e1
Fix parser for changes in bison 3.0.x
2015-11-26 12:33:44 +01:00
Nicolas Bigaouette
4e56eceb35
Detect if "url_data" contains proxy attributes before using them.
...
Fix proposed by @colwilson in issue #555 .
2014-11-12 09:58:30 -05:00
Vadim Khohlov
d4352fc828
Added plugin for parsing and checking links in Markdown files
2014-11-11 15:35:18 +02:00
Bastian Kleineidam
27937e6f83
Fix requests module version check.
2014-09-22 22:45:04 +02:00
Bastian Kleineidam
228bce1ba2
Add to instead of replace the HTTP client headers.
2014-09-20 12:17:42 +02:00
Bastian Kleineidam
92c4ca9a5e
Debug request headers
2014-09-20 12:16:24 +02:00
Bastian Kleineidam
029c20ed98
More python3 fixes
2014-09-12 21:59:07 +02:00
Bastian Kleineidam
35eb30432e
Added some Python3 fixes.
2014-09-12 19:36:30 +02:00
Bastian Kleineidam
697e7b82e1
Search for system certs
2014-09-11 21:19:49 +02:00
Bastian Kleineidam
21c7200360
Reactivate paging of help pages.
2014-09-11 19:42:42 +02:00
Bastian Kleineidam
06c6b80ed3
Fix proxy support.
2014-09-05 22:48:10 +02:00
wummel
6580d37dc9
Merge pull request #545 from ArloL/patch-1
...
Use correct attribute
2014-09-05 21:13:40 +02:00
Bastian Kleineidam
ee4545399d
Support itms-services: URLs. #532
2014-09-05 21:06:10 +02:00
Bastian Kleineidam
37d4ed6f83
Add hyphen and dot to the allowed scheme characters.
2014-09-05 20:59:54 +02:00
Bastian Kleineidam
c8df9355f0
Try to use the SSL certs from the certifi package.
2014-09-05 20:00:30 +02:00
Bastian Kleineidam
c684918ba6
Ignore urllib3 warnings about invalid SSL certs since we check them ourselves.
2014-09-05 20:00:00 +02:00
Bastian Kleineidam
2354f16dbb
Catch urllib3 errors.
2014-09-05 19:59:28 +02:00
Arlo Louis O'Keeffe
52337f82cb
Use correct attribute
2014-09-03 09:36:22 +02:00
Bastian Kleineidam
85dadc1f1a
Add documentation
2014-07-16 07:37:19 +02:00
Bastian Kleineidam
37664ea8a4
Fix Word file check plugin.
2014-07-15 22:39:41 +02:00
Bastian Kleineidam
b646293fd6
Remove unused import.
2014-07-15 22:38:57 +02:00
Bastian Kleineidam
29193bbcc9
Fix login URL cookies and don't sanitize after config reading.
2014-07-15 22:23:38 +02:00
Bastian Kleineidam
032c4091c3
Some easy python3 compatibility changes.
2014-07-15 18:40:47 +02:00
Bastian Kleineidam
90257a1b5e
Replace twill with custom code.
2014-07-15 18:37:05 +02:00
Bastian Kleineidam
a665d35feb
Use proxies and checker session in robots.txt.
2014-07-14 20:28:28 +02:00
Bastian Kleineidam
266e9e189f
Further code cleanup.
2014-07-14 20:14:00 +02:00
Bastian Kleineidam
6c38b4165a
Use given HTTP auth data for robots.txt fetching.
2014-07-14 19:50:11 +02:00
Bastian Kleineidam
7838521b6e
Code cleanup.
2014-07-14 19:49:01 +02:00
Bastian Kleineidam
100ce11d40
Sanitize CGI configuration.
2014-07-13 21:56:01 +02:00
Bastian Kleineidam
eafa1ed2da
Updated unknown URL schemes.
2014-07-13 21:51:53 +02:00
Bastian Kleineidam
176b95a30e
Do not strip quotes from resolved URLs.
2014-07-11 00:43:46 +02:00
Bastian Kleineidam
27702ddbac
Catch log output start errors.
2014-07-09 21:54:47 +02:00
Bastian Kleineidam
6ff89e9e8c
Fix GUI startup
2014-07-06 20:20:03 +02:00
Bastian Kleineidam
0fa7ed2699
Fix empty URL handling.
2014-07-03 23:34:40 +02:00
Bastian Kleineidam
1590ab6240
cleanup
2014-07-01 21:12:47 +02:00
Bastian Kleineidam
9a124513e3
Merge branch 'master' of github.com:wummel/linkchecker
2014-07-01 21:11:33 +02:00
wummel
9bb3852edf
Merge pull request #515 from Mark-Hetherington/extern-redirect
...
When following redirections update url.extern
2014-07-01 21:11:13 +02:00
Bastian Kleineidam
12cc12db53
Add get_redirects() function.
2014-07-01 21:11:06 +02:00
Bastian Kleineidam
cde261c009
Parse Refresh: and Content-Location: header values for URLs.
2014-07-01 20:16:43 +02:00
Bastian Kleineidam
c3ec91ac6d
Fix intern URL search pattern.
2014-06-13 23:52:21 +02:00
Bastian Kleineidam
ad8eb424f3
Merge Mark-Hetherington-xml-parse-warn with slight modifications.
2014-06-13 20:50:37 +02:00
Mark Hetherington
34d83db29c
When following redirections update url.extern
2014-05-19 14:59:58 +10:00
Bastian Kleineidam
eaa8a963ec
Refactor logging configuration.
2014-05-10 21:23:06 +02:00
Bastian Kleineidam
4b28e6e860
Move mime stuff into own submodule.
2014-05-10 21:22:10 +02:00
Bastian Kleineidam
9b794b936c
Print interrupt note in text output.
2014-04-30 20:17:33 +02:00
Bastian Kleineidam
43c2e6641b
Logging refactor, interrupt and abort flags added.
2014-04-30 19:59:43 +02:00
Bastian Kleineidam
b152ce7a6e
Add PDF test and fix page number.
2014-04-29 18:53:24 +02:00
Bastian Kleineidam
0d9881cf03
Fix add_url() with local files.
2014-04-29 18:43:21 +02:00
Bastian Kleineidam
82dd76b0d7
Add PDF link parsing.
2014-04-28 18:13:45 +02:00
Bastian Kleineidam
0ffdea2b8d
Added parser plugins and the applies_to() function.
2014-04-28 18:11:19 +02:00
Bastian Kleineidam
0f8ee234c3
Fix documentation.
2014-04-28 18:10:20 +02:00
Bastian Kleineidam
6bae3e0f49
Use the same request arguments for redirects.
2014-04-23 22:03:44 +02:00
Bastian Kleineidam
981079c041
Support itemtype attribute parsing.
2014-04-23 22:03:20 +02:00
Bastian Kleineidam
4232b69633
Support <img> srcset attribute parsing.
2014-04-10 17:51:59 +02:00
Bastian Kleineidam
6caf654031
Parse Link: heaaders.
2014-04-10 17:50:55 +02:00
Bastian Kleineidam
22caa9367a
Refactor recursion checks.
2014-04-10 17:50:55 +02:00
Bastian Kleineidam
08fbd891ef
Do not check external robots.txt sitemaps.
2014-04-09 19:44:29 +02:00
Bastian Kleineidam
c57f607fc3
Use urldata.add_url()
2014-04-07 18:54:33 +02:00
Bastian Kleineidam
9c5693ad41
Add doc and copyright.
2014-03-30 19:23:42 +02:00
Bastian Kleineidam
4759cee377
Updated mailto: documentation.
2014-03-30 08:30:14 +02:00
Bastian Kleineidam
b6b5c7a12e
Simpler link parsing routine.
2014-03-27 19:49:17 +01:00
Bastian Kleineidam
f180592cc4
Increase thread poll intervall to reduce CPU usage.
2014-03-27 17:43:14 +01:00
Bastian Kleineidam
81da2eb48f
Code cleanup
2014-03-27 17:19:52 +01:00
Bastian Kleineidam
da0ef8e8ea
Fix for moved functions.
2014-03-27 17:19:24 +01:00
Bastian Kleineidam
fa26876f67
Don't use encoding detection since it's very slow.
2014-03-27 12:27:11 +01:00
Bastian Kleineidam
8cf84be2e2
Fix pyopenssl certificate date parsing.
2014-03-26 20:25:44 +01:00
Bastian Kleineidam
49df359317
Some fixes when pyopenssl is used instead of python ssl module.
2014-03-26 19:59:17 +01:00
Bastian Kleineidam
dec0f6c8dc
Fix error with SNI checks
2014-03-26 12:38:16 +01:00
Bastian Kleineidam
a8623bc0bc
Display SSL info on redirects.
2014-03-26 07:16:03 +01:00
Bastian Kleineidam
be59802569
Set http connection charset.
2014-03-20 21:20:34 +01:00
Bastian Kleineidam
098dede12c
Fix warningregex setting in GUI.
2014-03-20 20:46:58 +01:00
Bastian Kleineidam
9cd67dfcb2
More SSL message work.
2014-03-20 20:24:57 +01:00
Bastian Kleineidam
4c76345338
Add certificate valid date info and always set verify flag.
2014-03-19 17:16:42 +01:00
Bastian Kleineidam
9a7ad3a84f
Print SSL cipher info for https URLs.
2014-03-19 17:02:34 +01:00
Bastian Kleineidam
931ca4f402
Add missing log keyword arg.
2014-03-19 17:02:00 +01:00
Bastian Kleineidam
71a7898ee6
Don't check non-connected URLs.
2014-03-19 16:33:38 +01:00
Bastian Kleineidam
ce733ae76b
Don't check for robots.txt directives in local html files.
2014-03-19 16:33:22 +01:00
Bastian Kleineidam
e528d5f7db
Fix ssl connection handling and change plugin type to connection plugin.
2014-03-19 14:28:33 +01:00
Bastian Kleineidam
9be667b52a
Do not warn about missing addresses on mailto links that have subjects.
2014-03-18 23:27:59 +01:00
Bastian Kleineidam
2eb6b1b44c
Call connect() on unconnected ssl responses.
2014-03-18 23:27:21 +01:00
Bastian Kleineidam
fc73c6ca6e
Log number of checked unique URLs.
2014-03-14 23:46:17 +01:00
Bastian Kleineidam
91c6e1d29f
Don't log bytes in status.
2014-03-14 22:25:19 +01:00
Bastian Kleineidam
34bdf5c75a
Updated copyright and docs.
2014-03-14 22:09:05 +01:00
Bastian Kleineidam
19b8baf08c
Move cached queue items to top once in a while.
2014-03-14 22:08:51 +01:00
Bastian Kleineidam
6437f08277
Display downloaded bytes.
2014-03-14 21:06:10 +01:00
Bastian Kleineidam
c51caf1133
Assertions should be earlier.
2014-03-14 20:26:11 +01:00
Bastian Kleineidam
cc401923ac
Improve wording of status message.
2014-03-14 20:25:37 +01:00
Bastian Kleineidam
cfff4c4a84
Disable URL length warning for data: URLs.
2014-03-14 20:24:28 +01:00
Bastian Kleineidam
ac78c6d5b8
Internal errors do not stop the checking thread any more.
2014-03-14 20:23:04 +01:00
Bastian Kleineidam
b18854649d
Count unique URLs for url queue limit.
2014-03-14 20:21:46 +01:00
Bastian Kleineidam
257644e660
Add cache length function to get number of cached elements.
2014-03-14 20:19:34 +01:00
Bastian Kleineidam
306979abca
Add HttpHeaderInfo plugin
2014-03-12 19:28:37 +01:00
Bastian Kleineidam
279db5c5b8
Fix documentation.
2014-03-12 19:22:18 +01:00
Bastian Kleineidam
ccd0d4ead7
Updated the list of unknown or ignored URI schemes.
2014-03-12 19:20:49 +01:00
Bastian Kleineidam
121602df87
Use SSL cert on Windows systems.
2014-03-11 20:58:16 +01:00
Bastian Kleineidam
0ad5969b54
Simplify config dir functions.
2014-03-11 20:23:49 +01:00
Bastian Kleineidam
41d07729bb
Install certificate store with installers.
2014-03-10 22:34:37 +01:00
Bastian Kleineidam
ee0717131d
Add marker for http debugging
2014-03-10 20:09:05 +01:00
Bastian Kleineidam
9c9cf0c3e2
Check for Python requests >= 2.2.0
2014-03-10 19:31:31 +01:00
Bastian Kleineidam
57edf0923e
Updated copyright year
2014-03-10 19:27:22 +01:00
Bastian Kleineidam
bca226c293
Fix assertion checking external links; fix tests
2014-03-10 18:23:44 +01:00
Bastian Kleineidam
40b663cf9e
Ignore URLs earlier.
2014-03-10 18:05:11 +01:00
Bastian Kleineidam
6b334dc79b
Fix URL result caching.
2014-03-08 19:35:10 +01:00
Bastian Kleineidam
0113f06406
Enable arbitrary output encodings in CSV output. See #467
2014-03-06 22:40:52 +01:00
Bastian Kleineidam
102837b875
Set maximum redirects
2014-03-06 21:58:35 +01:00
Bastian Kleineidam
fab2c2da98
Improve content type setting.
2014-03-05 20:12:19 +01:00
Bastian Kleineidam
ef13a3fce1
Implement sitemap and sitemap index parsing.
2014-03-05 19:26:37 +01:00
Bastian Kleineidam
b72cf252fb
Move parseable check down since it might get the content.
2014-03-05 19:26:05 +01:00
Bastian Kleineidam
9ef65cb774
Fix UrlData string representation.
2014-03-05 19:25:40 +01:00
Bastian Kleineidam
00bd549c0c
Remove duplicate content type map.
2014-03-05 19:24:58 +01:00
Bastian Kleineidam
380f14453b
Fix mimetype guessing from content.
2014-03-05 19:23:58 +01:00
Bastian Kleineidam
192cfab009
Cleanup of the UrlData.is_* functions
2014-03-05 19:23:16 +01:00
Bastian Kleineidam
b17211f162
Set for release.
2014-03-04 21:36:24 +01:00
Bastian Kleineidam
978b24f2d7
Merge branch 'caching'
2014-03-04 07:21:42 +01:00
Bastian Kleineidam
f1076c8813
Increase url-too-long warning.
2014-03-03 23:31:04 +01:00
Bastian Kleineidam
82f81241fd
Check all links and add better caching.
2014-03-03 23:29:45 +01:00
Bastian Kleineidam
510af337c1
Improved --version output.
2014-03-01 21:00:16 +01:00
Bastian Kleineidam
74d804ac82
Print release date on --version and internal errors.
2014-03-01 20:59:00 +01:00
Bastian Kleineidam
39df1812c7
Default to 10 threads instead of 100.
2014-03-01 20:49:06 +01:00
Bastian Kleineidam
6f205a2574
Support checking Sitemap: URLs in robots.txt files.
2014-03-01 20:25:19 +01:00
Bastian Kleineidam
0f0d79c7e0
Remove crawl-delay stuff
2014-03-01 20:01:42 +01:00
Bastian Kleineidam
00f8011709
Catch overflowerror in robots.txt crawl-delay
2014-03-01 19:58:22 +01:00
Bastian Kleineidam
0e4d6f6e1a
Parse sitemap urls in robots.txt files.
2014-03-01 19:57:57 +01:00
Bastian Kleineidam
78a99717fe
Check regular expressions from users for errors.
2014-03-01 19:15:48 +01:00
Bastian Kleineidam
c20005a031
Add missing docstring.
2014-03-01 19:14:43 +01:00
Bastian Kleineidam
39c39b1d9f
Disable twill page refresh.
2014-03-01 18:19:29 +01:00
Bastian Kleineidam
0211529d79
Use twill form field number if all else fails.
2014-03-01 18:12:06 +01:00
Bastian Kleineidam
7d84e1e729
Do not check permissions on non-posix systems for now.
2014-03-01 18:01:08 +01:00
Bastian Kleineidam
eb7e52c0e2
-o none sets exit code now
2014-03-01 15:31:39 +01:00
Bastian Kleineidam
f7f5001256
Add missing column name to SQL insert statement.
2014-03-01 12:03:33 +01:00
Bastian Kleineidam
f9bf831804
Remove some empty lines
2014-03-01 12:02:00 +01:00
Bastian Kleineidam
900e04ceda
Dynamic language switch in the GUI.
2014-03-01 12:01:47 +01:00
Bastian Kleineidam
9d0255e156
Fix bookmark imports
2014-03-01 10:16:29 +01:00
Bastian Kleineidam
7b34be590b
Introduce check plugins, use Python requests for http/s connections, and some code cleanups and improvements.
2014-03-01 00:12:34 +01:00
Bastian Kleineidam
c806be5c15
Updated copyright
2014-01-08 22:33:04 +01:00
Bastian Kleineidam
c076e312a2
Send an Accept header.
2014-01-08 19:56:00 +01:00
Bastian Kleineidam
f3b435c2a6
Add missing docstrings.
2013-12-24 07:15:31 +01:00
Bastian Kleineidam
e0a2558b2b
Updated copyright.
2013-12-24 07:13:16 +01:00
Bastian Kleineidam
845a6a1146
Fix loader in frozen executables.
2013-12-18 20:53:17 +01:00
wummel
9646f0b652
Merge pull request #418 from chuckbjones/reset-url-on-fallback
...
Reset to original url when falling back to GET
2013-12-17 22:37:17 -08:00
Bastian Kleineidam
fbbced4d8f
Fix tests
2013-12-13 07:39:59 +01:00
Bastian Kleineidam
5151e68a3e
Fix logger config
2013-12-13 07:37:21 +01:00
Bastian Kleineidam
103e00b4d1
Allow disabling of ssl certificate checks.
2013-12-12 22:17:57 +01:00
Bastian Kleineidam
39fb02f9a9
Remember last save result as filetype.
2013-12-12 20:44:09 +01:00
Bastian Kleineidam
5736987b60
Refactor output loggers.
2013-12-11 18:41:55 +01:00
Bastian Kleineidam
78ed1e9e52
Do not GET on POST forms.
2013-12-10 23:42:43 +01:00
Bastian Kleineidam
0ca63797bf
Remove content cache.
2013-12-10 23:41:52 +01:00
Bastian Kleineidam
a7c1cdd6f6
Check for help files.
2013-12-10 20:56:26 +01:00
Bastian Kleineidam
2c5ede2eb7
Fallback to GET for Apache Coyote servers.
2013-12-08 08:22:56 +01:00
Bastian Kleineidam
b567f766ba
Fix strtime test.
2013-12-06 07:13:44 +01:00
Bastian Kleineidam
6d68e00068
Merge branch 'master' of github.com:wummel/linkchecker
2013-12-04 19:21:45 +01:00
Bastian Kleineidam
023da7c993
Remove the duplicate URL content check.
2013-12-04 19:12:40 +01:00
Bastian Kleineidam
36badddfac
Update cookie code from Python module.
2013-12-04 19:05:08 +01:00
wummel
ab54809d95
Merge pull request #426 from alperkokmen/fix-lastmod-format
...
Fix ISO formatting for modified datetime.
2013-12-03 12:22:27 -08:00
Bastian Kleineidam
c676a4c829
Avoid DoS in SSL certificate host matching.
2013-11-30 22:07:23 +01:00
Alper Kokmen
4b3e78cac0
Fix ISO formatting for modified datetime.
...
This change will make sure that format_modified returns datetime value
in ISO 8601 format. See W3C documentation at
http://www.w3.org/TR/NOTE-datetime .
Since ```modified``` is parsed and then converted to UTC after it's
extracted from HTTP response, it's safe to assume that format_modified
will always format UTC datetime values.
Instead of ```isoformat``` method which omits timezone information for
UTC values, ```strftime``` with a specific format (that ends with Z)
will be used.
2013-09-02 15:38:54 -07:00
Charles Jones
4294633c04
Close connection prior to falling back to get, since we change the url back to the original at that time.
2013-08-09 13:08:51 -05:00
Charles Jones
8bc138f18b
Reset to original url when falling back to GET
2013-07-30 13:38:59 -05:00
Bastian Kleineidam
c966fe6b24
Remove the http-wrong-redirect warning
2013-04-11 18:33:19 +02:00
Bastian Kleineidam
134db22830
Updated homepage URL.
2013-04-09 20:11:04 +02:00
Bastian Kleineidam
21678c661d
Updated gzip and httplib copies.
2013-03-11 20:21:58 +01:00
Bastian Kleineidam
6b05f1d290
Paginate help output again.
2013-02-28 21:21:00 +01:00
Bastian Kleineidam
123578a4cd
Make per-host connection limits configurable.
2013-02-27 19:37:28 +01:00
Bastian Kleineidam
b7c82d1e75
Fix strformat.strsize() test.
2013-02-27 19:36:03 +01:00
Bastian Kleineidam
b38317d57b
Replace optparse with argparse.
2013-02-27 19:35:44 +01:00
Bastian Kleineidam
64d95e45e0
Remove local HTML and CSS syntax check.
2013-02-08 21:36:02 +01:00
Bastian Kleineidam
b104482174
Add missing docstring.
2013-01-25 21:15:12 +01:00
Bastian Kleineidam
35bc79dd90
Updated copyright.
2013-01-25 21:14:27 +01:00
Bastian Kleineidam
707b7b7db1
Close HTTP connections without body content. Github issue #376
2013-01-23 19:42:29 +01:00
Bastian Kleineidam
e6ad32c028
Catch UnicodeError for invalid host names.
2013-01-23 19:42:29 +01:00
Bastian Kleineidam
c0a0efbd1d
Do not handle non-existing SIGUSR1 signal.
2013-01-22 21:23:46 +01:00
Bastian Kleineidam
47451d7def
Fix GUI drag and drop.
2013-01-22 19:06:10 +01:00
Bastian Kleineidam
faa743e876
Increase per-host connection limits.
2013-01-22 18:18:48 +01:00
Bastian Kleineidam
fa402c0d70
Allow drag-and-drop of all local files.
2013-01-22 18:17:07 +01:00
Bastian Kleineidam
7134c0bb05
Print thread stack traces on SIGUSR1
2013-01-22 18:16:53 +01:00
Bastian Kleineidam
9b8cb67d78
Updated copyright.
2013-01-17 20:41:47 +01:00
Bastian Kleineidam
4dad2aa33c
Support dns-prefetch URLs.
2013-01-17 20:41:09 +01:00
Bastian Kleineidam
7fe72745ae
Updated copyright.
2013-01-09 23:03:12 +01:00
Bastian Kleineidam
fe7e9a5c6c
Improve Word document opening: open read-only and invisble, avoiding unnecessary dialogs.
2013-01-07 22:18:39 +01:00
Bastian Kleineidam
a5b6136e70
Check word document validity before closing.
2013-01-07 21:58:02 +01:00
Bastian Kleineidam
0e50834f9a
Rename external module to exclude it from some style checks.
2013-01-06 18:17:29 +01:00
Bastian Kleineidam
65a0031c10
Updated copyright.
2013-01-06 18:12:44 +01:00
Bastian Kleineidam
16b84be490
Updated all links.
2013-01-06 18:10:13 +01:00
Bastian Kleineidam
0283362ce6
Updated copyright.
2012-12-23 21:32:16 +01:00
Bastian Kleineidam
a7b83e6200
Fix GUI startup for Windows.
2012-12-19 21:12:02 +01:00
Bastian Kleineidam
9820530313
Use better_exchook to print more internal error info.
2012-12-18 23:06:48 +01:00
Bastian Kleineidam
f568a04a7c
Fix ignore option storing in GUI.
2012-12-13 17:06:06 +01:00
Bastian Kleineidam
27df4e20da
Add error handling for screen console function.
2012-12-07 22:31:48 +01:00
Bastian Kleineidam
efbbb656a1
Remove python-dns conflict by moving the dns module into a custom subdirectory.
2012-12-07 22:19:32 +01:00
Bastian Kleineidam
45a4bbdaa9
Use locale.format() and os.path.getsize()
2012-12-01 00:05:14 +01:00
Bastian Kleineidam
42a17cbb98
Prepare py3 port and display sys.argv on internal errors.
2012-11-26 18:49:07 +01:00
Bastian Kleineidam
ec03d56b62
Remove pysqlite dependency.
2012-11-14 20:23:56 +01:00
Bastian Kleineidam
7ae1eadadb
Improve http status 305 code message.
2012-11-13 18:13:36 +01:00
Bastian Kleineidam
cd4abb1f12
Improve repr() of url data, and remove alexa test script.
2012-11-09 19:09:38 +01:00
Bastian Kleineidam
f3e52f1176
loginpasswordfield is not a password
2012-11-08 22:11:35 +01:00
Bastian Kleineidam
e5735e2a5d
Fix URL queue handling.
2012-11-08 12:48:21 +01:00
Bastian Kleineidam
96c6a7f378
Display portable flag in about dialog.
2012-11-08 11:59:20 +01:00
Bastian Kleineidam
bc683577de
Remove URLs from the in_progress cache.
2012-11-08 11:03:16 +01:00
Bastian Kleineidam
810a62e093
Fix file url checking.
2012-11-07 19:37:16 +01:00
Bastian Kleineidam
2d6cfb238f
Add trailing dot when creating user configuration directory on Windows.
2012-11-07 18:22:07 +01:00
Bastian Kleineidam
b0c2a90b94
Updated copyright.
2012-11-07 18:08:44 +01:00
Bastian Kleineidam
f9a7f5ef96
Restrict local file checking.
2012-11-07 18:07:00 +01:00
Bastian Kleineidam
02ec94dbfb
Improve cancel message.
2012-11-06 21:54:09 +01:00
Bastian Kleineidam
eabaa41bd2
Do not check duplicate URLs.
2012-11-06 21:34:22 +01:00
Bastian Kleineidam
ae5f9e8801
Print active threads in debug level.
2012-11-06 21:33:43 +01:00
Bastian Kleineidam
9745be9d71
Fix cookie path matching with empty paths.
2012-10-30 17:44:00 +01:00
Bastian Kleineidam
e2fd37b886
Encode user and password for telnet connection.
2012-10-30 17:44:00 +01:00
Bastian Kleineidam
c6d8b0050e
Improve PHP command check.
2012-10-29 21:05:26 +01:00
Bastian Kleineidam
e8da486d66
Detect redirection errors when getting content.
2012-10-26 18:05:00 +02:00
Bastian Kleineidam
2390827735
Debug cookies.
2012-10-25 17:53:16 +02:00
Bastian Kleineidam
c44aa2db1f
Fix anchor checking of cached HTTP URLs by using the cached content type.
2012-10-25 06:37:10 +02:00
Bastian Kleineidam
dca52145d3
Misc stuff.
2012-10-24 22:59:28 +02:00
Bastian Kleineidam
b39158e65c
Improve available anchor message.
2012-10-24 22:21:46 +02:00
Bastian Kleineidam
dd2c963fac
Fix non-ASCII exception handling.
2012-10-24 22:14:45 +02:00
Bastian Kleineidam
64de760b97
Added debug statements for unparseable content types.
2012-10-24 22:06:42 +02:00
Bastian Kleineidam
3a51ac7662
Warn about accessible passwords in config files.
2012-10-15 14:36:10 +02:00
Bastian Kleineidam
8750d55a73
Add configuration entry for maximum number of URLs.
2012-10-14 11:13:55 +02:00
Bastian Kleineidam
2ebedbaaa6
Fix content reading.
2012-10-13 16:48:29 +02:00
Bastian Kleineidam
0e4e694ad1
Fix connection handling on redirects.
2012-10-13 13:36:43 +02:00
Bastian Kleineidam
3b5877161c
Improved debugging.
2012-10-13 13:36:28 +02:00
Bastian Kleineidam
d3b44be2c4
Improved documentation.
2012-10-13 12:03:19 +02:00
Bastian Kleineidam
7929a48d78
Fix url split with invalid port names.
2012-10-13 12:03:09 +02:00
Bastian Kleineidam
aa057bd36f
Fix colorama init error.
2012-10-12 20:39:34 +02:00
Bastian Kleineidam
6a204120b6
Handle stale file system links for local file checks.
2012-10-12 17:20:19 +02:00
Bastian Kleineidam
c4e15c7b88
Improved duplication url check.
2012-10-10 21:04:48 +02:00
Bastian Kleineidam
b758fc6f52
Reuse existing response.
2012-10-10 12:27:36 +02:00
Bastian Kleineidam
a0610310b4
Print debug on stderr.
2012-10-10 12:27:25 +02:00
Bastian Kleineidam
0c20ef5de4
Strip console characters only from line text.
2012-10-10 12:27:08 +02:00
Bastian Kleineidam
e1e80b7dd5
Remove addrinfo cache.
2012-10-10 10:54:58 +02:00
Bastian Kleineidam
20be0f2519
Strip control chars from logger output.
2012-10-10 10:54:30 +02:00
Bastian Kleineidam
f484a6776d
Use timeout value from configuration.
2012-10-10 10:53:52 +02:00
Bastian Kleineidam
871508ef5d
Add docs and updated copyright.
2012-10-10 06:53:16 +02:00
Bastian Kleineidam
63cf8adf54
Catch ValueError on invalid cookie expiration dates.
2012-10-10 06:44:38 +02:00
Bastian Kleineidam
06a25676c5
Only read the maximum data size plus one, not the whole file.
2012-10-10 06:35:33 +02:00
Bastian Kleineidam
3e1d51b8bf
Use RLock to simplify internal locking.
2012-10-09 21:11:35 +02:00
Bastian Kleineidam
c4cd66ea1b
Simplify decorator duration check logic.
2012-10-09 21:05:24 +02:00
Bastian Kleineidam
03a5d476b3
Use URL name if title is empty.
2012-10-09 21:04:54 +02:00
Bastian Kleineidam
6d47b76509
Limit HTTP and FTP connections. Gets rid of spurious BadStatusLine errors.
2012-10-09 21:04:20 +02:00
Bastian Kleineidam
7d3ece502c
Support semaphores.
2012-10-09 19:46:06 +02:00
Bastian Kleineidam
ad8525c483
Improve BadStatusline error message.
2012-10-05 08:32:24 +02:00
Bastian Kleineidam
d15fafb1f7
Code cleanup.
2012-10-05 08:10:44 +02:00
Bastian Kleineidam
5ebd754cdb
Improved duplicate url check.
2012-10-01 16:11:45 +02:00
Bastian Kleineidam
ed7c60e491
Do not warn about duplicate URLs which can point to the same content.
2012-10-01 13:42:46 +02:00
Bastian Kleineidam
148846be67
Add flag to log lock contentions.
2012-10-01 13:32:30 +02:00
Bastian Kleineidam
b56c054932
Use finer-grained robots.txt locks to improve lock contention.
2012-10-01 13:29:29 +02:00
Bastian Kleineidam
27b61c3bfa
Fix gzip handling in http content decoder.
2012-09-30 14:00:49 +02:00
Bastian Kleineidam
cbc3bcb0d3
Sitemap logger fixes.
2012-09-23 23:20:21 +02:00
Bastian Kleineidam
60305d8877
Code cleanup.
2012-09-23 21:20:12 +02:00
Bastian Kleineidam
e21187b275
Put in-progress URLs back near the front of URL queue, not at end.
2012-09-23 21:00:01 +02:00
Bastian Kleineidam
1f3034b5f5
Sitemap logger fixes.
2012-09-23 20:59:38 +02:00
Bastian Kleineidam
38dd63f055
Code cleanup.
2012-09-23 16:19:42 +02:00
Bastian Kleineidam
7f8fd01b22
Add Accept-Encoding and Accept-Charset headers.
2012-09-23 15:06:44 +02:00
Bastian Kleineidam
03ecff22bb
Fix endless loop in http authentication.
2012-09-22 22:21:10 +02:00
Bastian Kleineidam
653b5f27dd
Updated ignored schemes.
2012-09-22 16:18:37 +02:00
Bastian Kleineidam
1c59cb4d4c
Use GET in case a HEAD method does not succeed, even if robots.txt content checkes denied the page. This way proper check results are achieved (but the content is still not checked, so it's ok).
2012-09-22 07:53:11 +02:00
Bastian Kleineidam
fba465e8e8
Fix robotstxt cache miss stats.
2012-09-21 21:12:28 +02:00
Bastian Kleineidam
f6b007f757
Fix useragent matching in robots.txt parser.
2012-09-21 21:12:13 +02:00
Bastian Kleineidam
bbf25106fa
Fix double result setting on http checks.
2012-09-21 20:33:15 +02:00
Bastian Kleineidam
3e464e509c
Do not allow empty configuration string values.
2012-09-21 16:05:34 +02:00
Bastian Kleineidam
ecf8753a19
Improved user-agent string similar to Google and Bing search bots.
2012-09-21 15:46:14 +02:00
Bastian Kleineidam
c274b50c50
Store lowercase URL scheme in checker class.
2012-09-21 14:35:25 +02:00
Bastian Kleineidam
0941f6ff02
Improve exception handling by using unicode.
2012-09-21 14:29:20 +02:00
Bastian Kleineidam
f46889a4af
Log timestamps in debug output.
2012-09-21 13:05:36 +02:00
Bastian Kleineidam
049882e4fe
Remove accept-encoding since some sites have wrong compression.
2012-09-20 22:39:15 +02:00
Bastian Kleineidam
7c6dce6136
Only warn non-empty site duplicates.
2012-09-20 20:39:36 +02:00
Bastian Kleineidam
a03090c20f
Optimize intern/extern pattern parsing.
2012-09-20 20:19:13 +02:00
Bastian Kleineidam
c385c35b1a
Fix ansicolor again.
2012-09-20 16:39:40 +02:00
Bastian Kleineidam
b9d234c78a
Fix wrong method name in SSL certificate check.
2012-09-20 16:28:01 +02:00
Bastian Kleineidam
bff217c58b
Never log ignored warnings.
2012-09-20 12:44:40 +02:00
Bastian Kleineidam
600b7c0e69
Fix duplicate content warning when self.size is not set yet.
2012-09-20 12:44:23 +02:00
Bastian Kleineidam
9cfee5eb5b
Improved color detection with curses.
2012-09-20 12:13:15 +02:00
Bastian Kleineidam
bc0a17c1c4
Display last modified date in the GUI.
2012-09-19 21:23:39 +02:00
Bastian Kleineidam
d37347cab0
Remove unused variable.
2012-09-19 11:08:06 +02:00
Bastian Kleineidam
18a200d85f
Fix tests.
2012-09-19 11:05:26 +02:00
Bastian Kleineidam
b8f8bdf5fc
Fix last modified formatting.
2012-09-19 10:09:19 +02:00
Bastian Kleineidam
f5fbd7666f
Remove unused import.
2012-09-19 09:39:32 +02:00
Bastian Kleineidam
75719b34f6
Updated copyright.
2012-09-19 09:17:25 +02:00
Bastian Kleineidam
71fba0f8b7
Log all valid URLs in sitemap loggers.
2012-09-19 09:17:08 +02:00
Bastian Kleineidam
9d1c90f96c
Write extra script to analyse a memory dump.
2012-09-18 16:08:31 +02:00
Bastian Kleineidam
3a352631ba
Add modified field to loggers.
2012-09-18 12:12:00 +02:00
Bastian Kleineidam
1db63227f6
Memoize file operations to minimize disk I/O.
2012-09-18 09:37:21 +02:00
Bastian Kleineidam
932a07a9cf
Added XML sitemap logger.
2012-09-18 09:16:34 +02:00
Bastian Kleineidam
4e59056ee7
Warn about duplicate URL contents.
2012-09-17 19:49:50 +02:00
Bastian Kleineidam
02a09dbb28
Add documentation.
2012-09-17 16:30:32 +02:00
Bastian Kleineidam
99bf8aa940
Updated copyright.
2012-09-17 16:09:55 +02:00
Bastian Kleineidam
cb71f483a5
Warn about too long URLs.
2012-09-17 16:00:23 +02:00
Bastian Kleineidam
03667a4ec9
Print warning tags in text output.
2012-09-17 15:29:04 +02:00
Bastian Kleineidam
1f9ee987f9
Improved terminal color detection with curses.
2012-09-17 15:24:04 +02:00
Bastian Kleineidam
6e1841cf1f
Print download and cache statistics.
2012-09-17 15:23:25 +02:00
Bastian Kleineidam
0b5b6ab37b
Automatically set --complete for graph output.
2012-09-15 15:06:29 +02:00
Bastian Kleineidam
273230d98b
Send HTTP Do-Not-Track header.
2012-09-14 22:41:38 +02:00
Bastian Kleineidam
e98f15933f
Stop checking of all output loggers have been deactivated.
2012-09-14 22:36:59 +02:00
Bastian Kleineidam
81d2c4dbd9
Improved documentation.
2012-09-14 22:26:45 +02:00
Bastian Kleineidam
86f1c74006
Close loggers properly on I/O errors.
2012-09-14 22:09:18 +02:00
Bastian Kleineidam
6730fb51ee
Allow maximum check time specification.
2012-09-03 20:17:49 +02:00
Bastian Kleineidam
a1dfaf2f91
Add missing docstring.
2012-09-02 23:37:43 +02:00
Bastian Kleineidam
21db38546c
Updated copyright.
2012-09-02 23:36:31 +02:00
Bastian Kleineidam
3baaca47a0
Add maximum number of allowed puts on URL queue.
2012-09-02 22:44:29 +02:00
Bastian Kleineidam
d8fce1ceeb
Do not sort URL queue anymore.
2012-09-02 22:32:14 +02:00
Bastian Kleineidam
7a6436f08f
Increase checked cache in URL queue.
2012-09-02 22:21:49 +02:00
Bastian Kleineidam
4c16d3e702
Make 401 unauthorized GET response a warning.
2012-08-26 11:32:17 +02:00
Bastian Kleineidam
b6d45eabe5
Code cleanup.
2012-08-24 09:46:38 +02:00
Bastian Kleineidam
ac6591a009
Recognize WML files on Windows.
2012-08-24 09:46:26 +02:00
Bastian Kleineidam
7334a9863e
Make URL properties in GUI selectable with the mouse.
2012-08-24 00:10:59 +02:00
Bastian Kleineidam
ae15d51b30
Translate more result strings.
2012-08-23 23:59:33 +02:00
Bastian Kleineidam
ce4253263c
Do not special case http->ftp redirects.
2012-08-23 23:56:36 +02:00
Bastian Kleineidam
7374068941
Remove unused import.
2012-08-23 16:46:14 +02:00
Bastian Kleineidam
73d64e50ab
Fix redirection to new scheme.
2012-08-23 16:45:24 +02:00
Bastian Kleineidam
99ab68908c
Increase the default number of checker threads.
2012-08-23 16:11:47 +02:00
Bastian Kleineidam
bc287d7710
Make unauthorized access responses with missing www-authenticate headers an error.
2012-08-23 15:52:11 +02:00
Bastian Kleineidam
e252bbf623
Remove Amazon quirk because the default behaviour handles this now.
2012-08-23 05:36:51 +02:00
Bastian Kleineidam
02a9f0bacb
Add utility method to read string options.
2012-08-23 04:52:25 +02:00
Bastian Kleineidam
ecef16b2c9
Support WML sites.
2012-08-22 22:43:14 +02:00
Bastian Kleineidam
36b1bb01e0
Fix variable name typo.
2012-08-22 22:00:11 +02:00
Bastian Kleineidam
8d36bf4e3d
Show URLs in status bar.
2012-08-14 23:00:50 +02:00
Bastian Kleineidam
76f57dc4ad
Updated copyright.
2012-08-14 20:37:24 +02:00
Bastian Kleineidam
6915e2f989
Detect sites not supporting HEAD requests.
2012-08-14 18:43:39 +02:00
Bastian Kleineidam
db76f01d48
Stop application when aborting timed out. Only used on the command line.
2012-08-14 17:41:26 +02:00
Bastian Kleineidam
29a5c1a44a
Display the real url name in gui property field.
2012-08-13 18:55:25 +02:00
Bastian Kleineidam
f3b66b102d
Fallback to GET when method HEAD is not allowed.
2012-08-13 07:07:21 +02:00
Bastian Kleineidam
e65b5c72ce
Correct list of schemes requiring host name.
2012-08-12 14:21:56 +02:00
Bastian Kleineidam
7b567cc378
Make scheme and domain for internal url pattern case insensitive.
2012-08-12 14:19:42 +02:00
Bastian Kleineidam
afc0ecd7a6
--ignore-url now really ignores URLs.
2012-08-12 11:16:29 +02:00
Bastian Kleineidam
b86be09d9e
Recalculate extern settings after changing intern patterns.
2012-08-12 11:15:18 +02:00
Bastian Kleineidam
6be3e9ddff
Cleanup code and improve redirect anchor handling.
2012-08-12 11:14:56 +02:00
Bastian Kleineidam
10cc59c654
Use colorama only on Windows systems.
2012-08-12 10:23:44 +02:00
Bastian Kleineidam
cf53b33c94
Remove unused functions.
2012-08-11 19:34:27 +02:00
Bastian Kleineidam
aa22dc2702
Fix windows console output.
2012-08-11 07:52:04 +02:00
Bastian Kleineidam
d9acc97f9f
Use colorama instead of wconio.
2012-08-10 22:24:00 +02:00
Bastian Kleineidam
c74690a79a
Do not check SSl certificates on HTTPS -> HTTP redirects.
2012-08-10 19:43:57 +02:00
Bastian Kleineidam
451a520943
Prevent double color stream proxying.
2012-08-10 19:43:33 +02:00
Bastian Kleineidam
580ab74f0e
Updated german translation.
2012-08-09 20:43:31 +02:00
Bastian Kleineidam
82b4dea4fe
Updated copyright
2012-08-09 20:43:22 +02:00
Bastian Kleineidam
1c739aed81
Use urlparse.uses_relative instead of unofficial urlparse.non_hierarchical (which has been removed in the current CPython 2.7.x trunk).
2012-08-04 20:40:31 +02:00
Bastian Kleineidam
b0e5c7fc59
Ignore feed: URLs.
2012-06-27 21:32:03 +02:00
Bastian Kleineidam
0fd1a78378
Always compare encoded anchor names.
2012-06-27 20:59:53 +02:00
Bastian Kleineidam
e0d6aecad9
Add cancel button to show memory dialog.
2012-06-25 20:25:02 +02:00
Bastian Kleineidam
103fb12987
Display debug output with fixed width font; allow showing of memory dump.
2012-06-25 20:21:45 +02:00
Bastian Kleineidam
bfe79e478f
Detect when all checker threads are dead and the queue is not empty.
2012-06-24 22:51:43 +02:00
Bastian Kleineidam
5c045fef44
Fix UNC path handling on Windows.
2012-06-24 10:30:54 +02:00
Bastian Kleineidam
b550a9dcb5
Updated copyright.
2012-06-23 14:31:11 +02:00
Bastian Kleineidam
31519f6a01
Fix handling of UNC pathnames.
2012-06-23 14:30:58 +02:00
Bastian Kleineidam
3f063a5e9f
Remove unused import.
2012-06-23 14:29:16 +02:00
Bastian Kleineidam
363ccc0121
Check <object codebase=...> as normal URL.
2012-06-23 14:28:32 +02:00
Bastian Kleineidam
73b176d7c9
Fix URL joining: properly detect absolute URL.
2012-06-23 13:33:27 +02:00
Bastian Kleineidam
cdf6b91b39
Don't use <object codebase=...> attribute as parent url.
2012-06-23 13:32:08 +02:00
Bastian Kleineidam
8d23e2a3c6
Add debugging for checker class name.
2012-06-23 13:30:13 +02:00
Bastian Kleineidam
6d9a8859d3
Require and use Python 2.7.2.
2012-06-22 23:58:20 +02:00
Bastian Kleineidam
dbe57c0f9b
Treat Windows UNC paths as absolute paths.
2012-06-22 23:42:37 +02:00
Bastian Kleineidam
713b9ebada
Only assume local file links for URLs given on the command line.
2012-06-22 23:42:05 +02:00
Bastian Kleineidam
9d0cced73c
Fix SSL check errors.
2012-06-22 07:37:37 +02:00
Bastian Kleineidam
f10b38c13e
Fix saving of the debugmemory option.
2012-06-21 22:13:09 +02:00
Bastian Kleineidam
22a4d48d68
Ignore broken locale configuration.
2012-06-20 21:44:57 +02:00
Bastian Kleineidam
979d7f13d3
Updated copyright.
2012-06-20 21:40:21 +02:00
Bastian Kleineidam
5c18a7f8b2
Catch errors getting the default locale.
2012-06-20 21:35:52 +02:00
Bastian Kleineidam
addbcfc54f
Updated translation.
2012-06-20 20:18:39 +02:00
Bastian Kleineidam
2888528a13
Code cleanup.
2012-06-20 20:11:13 +02:00
Bastian Kleineidam
4cce99a77d
Test SSL certificate expiration.
2012-06-20 20:10:40 +02:00
Bastian Kleineidam
da7c68981b
Add missing imports.
2012-06-20 19:23:52 +02:00
Bastian Kleineidam
e903a2a13c
Add constants for number of seconds per minute, hour and day.
2012-06-20 19:23:35 +02:00
Bastian Kleineidam
727281a7fc
The notimplemented decorator reports now the filename and linenumber of the function.
2012-06-20 19:23:13 +02:00
Bastian Kleineidam
1e13a4f8fc
Add donation url to info blurb.
2012-06-20 00:37:39 +02:00
Bastian Kleineidam
eb30191bb0
Add copyright and missing docs.
2012-06-20 00:30:52 +02:00
Bastian Kleineidam
2f730f4e13
Add Nagios plugin.
2012-06-20 00:29:46 +02:00
Bastian Kleineidam
a6eaae2c38
Implement abstract method for logger.
2012-06-20 00:15:45 +02:00
Bastian Kleineidam
2dfe9d4b4f
Use abstract base class for loggers.
2012-06-19 23:27:26 +02:00
Bastian Kleineidam
a50bf226d7
Print Python environment variables for internal errors.
2012-06-19 22:47:26 +02:00
Bastian Kleineidam
cbb13a8983
Add SSL certificate verification.
2012-06-18 23:05:44 +02:00
Bastian Kleineidam
b0694ddfa7
Add meliae to optional module list.
2012-06-18 22:39:07 +02:00
Bastian Kleineidam
d26b6e707c
Fix function name typo.
2012-06-10 22:54:32 +02:00
Bastian Kleineidam
f107092a8a
Fix handling of user/password info in URLs.
2012-06-10 22:07:42 +02:00
Bastian Kleineidam
838095cbd5
Updated copyright.
2012-06-10 14:58:38 +02:00
Bastian Kleineidam
00aa631267
Add localwebroot configuration option.
2012-06-10 14:47:27 +02:00
Bastian Kleineidam
98537eea2f
Code cleanup: use add_url() function in UrlBase.
2012-06-10 14:24:17 +02:00
Bastian Kleineidam
db95fce77e
Ignore PHP processing instructions in local files.
2012-06-10 14:02:01 +02:00
Bastian Kleineidam
2dee223555
Allow memory dumps to be written.
2012-06-10 13:18:35 +02:00
Bastian Kleineidam
837ab22d01
Syntax cleanup.
2012-06-10 11:46:05 +02:00
Bastian Kleineidam
3ebe5708db
Code cleanup: add function for boolean option settings.
2012-06-10 11:22:54 +02:00
Bastian Kleineidam
77b8ec0fcd
Fix writing temporary Word files.
2012-06-10 11:07:35 +02:00
Bastian Kleineidam
b12c1e40fb
Allow specifying the file mode when opening temp files.
2012-06-10 11:03:43 +02:00
Bastian Kleineidam
54ffb102d8
Code cleanup: add function for GET fallback.
2012-06-10 09:52:12 +02:00
Bastian Kleineidam
5c94c47901
Remove old Squid proxy workaround.
2012-06-10 09:45:07 +02:00
Bastian Kleineidam
b83d3b53fc
Added missing documentation.
2012-06-02 00:31:02 +02:00
Bastian Kleineidam
6aa0249ea7
Updated httplib2.py from upstream.
2012-06-02 00:28:12 +02:00
Bastian Kleineidam
9273b9e8af
Fix configdata attribute name.
2012-05-23 21:15:30 +02:00
Bastian Kleineidam
e13a479362
Fix error logging.
2012-05-23 06:55:02 +02:00
Bastian Kleineidam
fa92d59448
Catch mimedb init errors.
2012-05-23 06:02:23 +02:00
Bastian Kleineidam
f6f43ffba8
Improve warning about missing twill module.
2012-05-10 21:29:18 +02:00
Bastian Kleineidam
bcbacec79a
Code cleanup.
2012-05-10 21:05:33 +02:00
Bastian Kleineidam
90a0f4b51e
Detect pygeoip library on Windows.
2012-05-10 06:45:29 +02:00
Bastian Kleineidam
61138744e6
Always use GET for Zope servers.
2012-05-08 20:47:47 +02:00
Bastian Kleineidam
cd6ee8a1bc
Fix checker test cases for non-english locales.
2012-04-23 20:56:33 +02:00
Bastian Kleineidam
4cc19f4e9c
Fix strformat tests on non-english locales.
2012-04-23 20:35:32 +02:00
Bastian Kleineidam
1cc3ac5f80
Updated copyright.
2012-04-22 20:45:24 +02:00
Bastian Kleineidam
1ab2e2c1a9
Use mkdir() instead of makedirs() for user configuration directory. Also fix logging on errors copying the user configuration.
2012-04-22 20:42:39 +02:00
Bastian Kleineidam
52dcf101e0
Remove rest of deprecated options.
2012-04-22 17:55:12 +02:00
Bastian Kleineidam
c7ced2445b
Ensure correct encoding when writing non-ascii CSV output.
2012-04-22 17:52:47 +02:00
Bastian Kleineidam
c12d44d092
Catch and report locale errors in WSGI script.
2012-04-22 13:10:03 +02:00
Bastian Kleineidam
c2221e1a18
Ensure proper encoding in WSGI script.
2012-04-22 12:48:21 +02:00
Bastian Kleineidam
d423a8b1af
Document web interface request timeout.
2012-04-21 12:34:34 +02:00
Bastian Kleineidam
0fdaf00154
Use Python threads in stead of Qt threads. The WSGI script works now.
2012-04-21 12:08:17 +02:00
Bastian Kleineidam
dab01b00d6
Add 5 minute timeout for WSGI script.
2012-04-20 19:25:27 +02:00
Bastian Kleineidam
ad09bc50b3
Improved logging in CGI script.
2012-04-19 20:56:51 +02:00
Bastian Kleineidam
1ef9a022ca
Make WSGI script more responsive by using threads.
2012-04-18 21:52:36 +02:00
Bastian Kleineidam
3d831c1adb
Updated copyright.
2012-04-11 22:23:43 +02:00
Bastian Kleineidam
f9290932b8
Fix bookmark menu entries in GUI.
2012-04-11 22:23:06 +02:00
Bastian Kleineidam
67b751f1c7
Write correct statistics when saving data in the GUI client.
2012-04-11 21:53:15 +02:00
Bastian Kleineidam
ae0bd406d4
Do not encode CSV outro output comment.
2012-04-11 20:43:46 +02:00
Bastian Kleineidam
bf1d9b1fad
Fix locale file lookup.
2012-04-11 20:34:55 +02:00
Bastian Kleineidam
192a560258
Code cleanup.
2012-04-11 20:33:32 +02:00
Bastian Kleineidam
7a019aac51
Fix update check when content could not be downloaded.
2012-04-09 13:36:18 +02:00
Bastian Kleineidam
a76824043c
Ignore any errors from bookmark file finders.
2012-04-06 08:24:30 +02:00
Bastian Kleineidam
d3d3529c23
Added a WSGI script.
2012-04-06 08:23:41 +02:00
Bastian Kleineidam
797024c69b
Fix URL connection cache key.
2012-04-04 22:58:09 +02:00
Bastian Kleineidam
1a28c2e334
Detect invalid empty cookies.
2012-04-03 08:03:54 +02:00
Bastian Kleineidam
e9420d77db
Updated copyright.
2012-03-31 09:24:08 +02:00
Bastian Kleineidam
b48812f612
Encode comments in CSV logger.
2012-03-31 09:17:49 +02:00
Bastian Kleineidam
4feea986b4
Fix concatenation of multiple cookie values.
2012-03-31 08:51:58 +02:00
Bastian Kleineidam
da6d7b0eca
Store cookies on redirect.
2012-03-31 08:37:18 +02:00
Bastian Kleineidam
8afc11e546
Upstream update.
2012-03-30 22:32:46 +02:00
Bastian Kleineidam
6d5e5f9efb
Updated copyright.
2012-03-30 22:24:10 +02:00
Bastian Kleineidam
9ee9abcf0f
Parse invalid comments <! bla >
2012-03-23 07:41:03 +01:00
Bastian Kleineidam
b9b8e3f5b2
Honor the charset encoding of the Content-Type HTTP
...
header when parsing HTML.
2012-03-22 22:45:11 +01:00
Bastian Kleineidam
d6d82b96f1
Add cookiefile config option.
2012-03-22 22:29:56 +01:00
Bastian Kleineidam
98b4768419
Use timeout when checking email addresses with SMTP.
2012-03-16 21:44:18 +01:00
Bastian Kleineidam
3b818d99c0
Allow --pause to be set in configuration file.
2012-03-16 21:28:30 +01:00
Bastian Kleineidam
67974386f6
Remove donate menu entry, it's already displayed in the about dialog.
2012-03-16 21:23:40 +01:00
Bastian Kleineidam
fb845ec813
Replace configuration filename in options dialog.
2012-03-16 06:19:02 +01:00
Bastian Kleineidam
64a7713f8f
Set option dialog title.
2012-03-15 21:41:49 +01:00
Bastian Kleineidam
4c9fd8d488
Cache real url.
2012-03-14 21:12:13 +01:00
Bastian Kleineidam
5c542277af
Improved error message for connect error to clamav daemon.
2012-03-12 21:13:22 +01:00
Bastian Kleineidam
5e13a78f66
Fix non-ascii HTTP header debugging.
2012-03-09 11:54:18 +01:00
Bastian Kleineidam
187a94312b
Updated copyright.
2012-03-09 11:16:18 +01:00
Bastian Kleineidam
3fcff8a4e5
Fix non-ascii HTTP header handling.
2012-03-09 11:14:18 +01:00
Bastian Kleineidam
24811ac7b0
Recheck extern status on HTTP redirects even if domain did not change.
2012-03-08 10:07:31 +01:00
Bastian Kleineidam
0f0c1a22b8
Added missing python function docs.
2012-01-29 17:21:24 +01:00
Bastian Kleineidam
71f5ee42c8
Updated copyright.
2012-01-29 17:18:28 +01:00
Bastian Kleineidam
c3423b3d6a
Add drag-n-drop support for project files.
2012-01-26 20:57:03 +01:00
Bastian Kleineidam
1deeb88e16
Add project support.
2012-01-26 20:23:15 +01:00
Bastian Kleineidam
b5852683ab
Do not write userconfig in portable mode.
2012-01-23 22:24:51 +01:00
Bastian Kleineidam
042b0569ec
Fall back to W3C checkers.
2012-01-22 08:13:27 +01:00
Bastian Kleineidam
51cf55b7a6
Remove warning: prefix from warning messages.
2012-01-21 00:25:02 +01:00
Bastian Kleineidam
053ea33b06
Updated GUI icons from oxygen upstream.
2012-01-19 16:59:14 +01:00
Bastian Kleineidam
6e1e9148d8
Work around a squid bug resulting in not detecting broken links
2012-01-17 08:36:11 +01:00
Bastian Kleineidam
e99c55f6c4
Proper proxy type check.
2012-01-16 21:15:53 +01:00
Bastian Kleineidam
4c15fc6a8b
Properly handle non-ASCII HTTP header values.
2012-01-14 11:01:09 +01:00
Bastian Kleineidam
a0581cc2a1
Ignore steam:// URIs.
2012-01-10 19:37:19 +01:00
Bastian Kleineidam
19f0bbe7ce
Add Ctrl-L shortcut to highlight URL.
2012-01-09 20:52:24 +01:00
Bastian Kleineidam
886bacb2ac
Updated copyright.
2012-01-06 20:33:49 +01:00
Bastian Kleineidam
ee5ba648be
Determine width of current console for help output.
2012-01-06 13:05:30 +01:00
Bastian Kleineidam
f1eb51d885
Updated copyright
2012-01-06 09:21:30 +01:00
Bastian Kleineidam
9b437fd1e4
Use dictionary-based logging config.
2012-01-04 21:07:35 +01:00
Bastian Kleineidam
033280cfb9
Remove workarounds for old Python versions.
2012-01-04 20:17:53 +01:00
Bastian Kleineidam
d36c46a27f
Require Python 2.7
2012-01-04 20:17:13 +01:00
Bastian Kleineidam
e316f5be68
Remove unneeded comment.
2012-01-04 20:04:36 +01:00
Bastian Kleineidam
6409651f55
Remove unused function.
2012-01-04 20:04:14 +01:00
Bastian Kleineidam
28c58519d9
Update httplib.py from upstream.
2012-01-04 19:55:47 +01:00
Bastian Kleineidam
fb979b4f3c
Add test for archive attribute support.
2011-12-30 12:36:22 +01:00
Bastian Kleineidam
d06c43d470
Split comma-separated archive attribute values.
2011-12-30 08:58:45 +01:00
Bastian Kleineidam
4a4985a960
Add HTML5 link elements and attributes.
2011-12-30 08:55:38 +01:00
Bastian Kleineidam
422d4e9c6c
Fix saving of GUI results.
2011-12-29 06:52:48 +01:00
Bastian Kleineidam
dff425710d
More Freshmeat/Freecode replacements.
2011-12-25 09:06:18 +01:00
Bastian Kleineidam
1c26c14b64
Set copyright year and add missing docstrings.
2011-12-25 08:45:27 +01:00
Bastian Kleineidam
10b9c12ce9
Fix GUI check result saving.
2011-12-20 20:53:49 +01:00
Bastian Kleineidam
61f9f31de2
Make copy from loggerclass dict.
2011-12-17 20:54:30 +01:00
Bastian Kleineidam
de652d8138
Minor code cleanups.
2011-12-17 19:13:54 +01:00
Bastian Kleineidam
3d9958dfbb
Parse Safari bookmark files.
2011-12-17 16:38:25 +01:00
Bastian Kleineidam
925a7166b6
Remove separator in edit menu.
2011-12-17 12:38:14 +01:00
Bastian Kleineidam
aeb6df05bd
Fix application config directory for Darwin platforms.
2011-12-17 07:33:21 +01:00
Bastian Kleineidam
351cc72aa4
Make GUI options a little wider; add separator in edit menu.
2011-12-16 21:49:31 +01:00
Bastian Kleineidam
21532a70ec
Return with non-zero return value when internal program errors occurred.
2011-12-14 22:54:26 +01:00
Bastian Kleineidam
433cf029d9
Replace copy.copy() with simple list copying when backing up configuration values.
2011-12-14 19:55:06 +01:00
Bastian Kleineidam
5f0853be87
Display GUI options in one dialog; support warning strings in GUI options; support ignore patterns in GUI options.
2011-12-13 21:25:09 +01:00
Bastian Kleineidam
9367e3166c
Recreate UI files.
2011-12-12 19:01:40 +01:00
Bastian Kleineidam
1b5cad3b3e
Only remove whitespace at start and end of URL.
2011-12-10 11:49:44 +01:00
Bastian Kleineidam
5576dfe10d
Fix LFUCache: delete 5% of least frequently used entries on shrink, not 95%.
2011-12-08 22:10:53 +01:00
Bastian Kleineidam
9956f3712e
Properly detect too-long Unicode hostnames.
2011-12-05 20:51:42 +01:00
Bastian Kleineidam
76eddb62a8
Properly detect home directory on OS X systems.
2011-11-05 12:03:55 +01:00
Bastian Kleineidam
5c496df9ed
Regenerate HTML parser with new Bison 2.5 version.
2011-10-31 06:41:45 +01:00
Bastian Kleineidam
a7f3b43bab
Remove unused file.
2011-10-28 00:49:46 +02:00
Bastian Kleineidam
c8c3470255
Use new warningregex option in GUI.
2011-10-20 09:12:28 +02:00
Bastian Kleineidam
fb237041d1
Updated copyright
2011-10-20 08:14:16 +02:00
Bastian Kleineidam
91e296b7ae
Add warning-regex GUI option.
2011-10-19 22:43:51 +02:00
Bastian Kleineidam
3461b96729
Fix adding bookmark context menu entries.
2011-10-19 20:54:34 +02:00
Bastian Kleineidam
c7afb3f1a1
Add actions to find bookmark files to edit menu.
2011-10-19 20:44:09 +02:00
Bastian Kleineidam
8f93245c26
Refactor and create a menu action adding function for the bookmark actions.
2011-10-19 20:42:24 +02:00
Bastian Kleineidam
91dce84c59
Fix sqlify for multiline contents.
2011-10-18 14:40:33 +02:00
Bastian Kleineidam
a2978209e6
Ignore errors trying to get FTP feature set.
2011-10-18 13:10:49 +02:00
Bastian Kleineidam
74ea444a9a
Parse logger and logging part names case insensitive.
2011-10-10 20:32:58 +02:00