Bastian Kleineidam
|
fd3fe8dcaa
|
Fix missing content types for cached URLs.
|
2010-12-23 07:37:36 +01:00 |
|
Bastian Kleineidam
|
7c55351511
|
Add get_content_type methods to subclasses.
|
2010-12-15 07:54:44 +01:00 |
|
Bastian Kleineidam
|
01184784ef
|
Remove warning about Unicode domains which are more widely supported now.
|
2010-12-11 07:58:15 +01:00 |
|
Bastian Kleineidam
|
6fac69cddb
|
Fall back to GET when connection is reset.
|
2010-11-21 19:50:51 +01:00 |
|
Bastian Kleineidam
|
147bf31e1e
|
Check for allowed HTTP GET method before parsing anchors in HTML file contents.
|
2010-11-17 19:13:26 +01:00 |
|
Bastian Kleineidam
|
4f5c957e43
|
Fix check of external domain after HTTP redirect.
|
2010-11-06 18:00:49 +01:00 |
|
Bastian Kleineidam
|
23b20306e9
|
Remove duplicate HTTP response codes.
|
2010-11-01 09:27:53 +01:00 |
|
Bastian Kleineidam
|
c5f93a561d
|
Fix debug message formatting.
|
2010-11-01 05:59:04 +01:00 |
|
Bastian Kleineidam
|
f14340a0a8
|
Do not check content of already cached URLs.
|
2010-10-27 19:52:48 +02:00 |
|
Bastian Kleineidam
|
1f81124dfa
|
Fix typo.
|
2010-10-27 19:23:14 +02:00 |
|
Bastian Kleineidam
|
23403f09bb
|
Do not print warning for HTTP to HTTPS or HTTPS to HTTP redirects.
|
2010-10-27 14:44:05 +02:00 |
|
Bastian Kleineidam
|
b2cf40151f
|
Improved redirection warning text.
|
2010-10-27 09:15:46 +02:00 |
|
Bastian Kleineidam
|
d9e981e497
|
Don't log a warning if commandline URL has been redirected.
|
2010-10-26 16:24:27 +02:00 |
|
Bastian Kleineidam
|
4375d35328
|
Add warning about unsupported HTTP authentication, and revert the realm changes.
|
2010-10-25 22:41:31 +02:00 |
|
Bastian Kleineidam
|
2a7292845c
|
Improved info message about sent cookies; do not report the retrieved cookie information.
|
2010-10-13 22:32:50 +02:00 |
|
Bastian Kleineidam
|
a8aa3bdb00
|
Another fix to ensure get_content() is only called when allowed.
|
2010-10-13 22:14:43 +02:00 |
|
Bastian Kleineidam
|
61e611e4bf
|
Prevent unallowed content read when checking for robots.txt allowance in HTML files.
|
2010-10-12 00:40:34 +02:00 |
|
Bastian Kleineidam
|
e494d6bbb6
|
Move MIME type detection into fileutil.py module, and use mimetools for detection.
|
2010-10-03 08:47:48 +02:00 |
|
Bastian Kleineidam
|
e0f4097eb0
|
Ensure HttpUrl.set_title_from_content() is only called when the content is allowed to be retrieved.
|
2010-09-29 19:26:03 +02:00 |
|
Bastian Kleineidam
|
5284017d67
|
Only fall back to HTTP GET when robots.txt allows it.
|
2010-09-04 18:09:59 +02:00 |
|
Bastian Kleineidam
|
60f7af4598
|
Allow redirections to external URLs with same domain.
|
2010-08-13 01:22:18 +02:00 |
|
Bastian Kleineidam
|
1faedafb33
|
Fix data size for HTTP requests.
|
2010-08-04 00:06:25 +02:00 |
|
Bastian Kleineidam
|
7ad4f7c220
|
Compare size from meta info and content data.
|
2010-07-29 19:53:41 +02:00 |
|
Bastian Kleineidam
|
7536472797
|
Send correct host header when using http proxy.
|
2010-07-29 06:50:35 +02:00 |
|
Bastian Kleineidam
|
3370ea1562
|
Reflect changes in httplib2.py: use buffered read in httplib response object and use bad status line exception attribute.
|
2010-03-26 20:50:38 +01:00 |
|
Bastian Kleineidam
|
b8b0398dd2
|
Ensure redirected URL is Unicode encoded.
|
2010-03-07 22:11:55 +01:00 |
|
Bastian Kleineidam
|
c8e6995ecd
|
Support HTTPS proxies.
|
2010-03-07 21:06:10 +01:00 |
|
Bastian Kleineidam
|
6a2fcf8ae9
|
Parse links in Word files.
|
2010-03-07 19:20:51 +01:00 |
|
Bastian Kleineidam
|
3d5c114f14
|
Warn on permanent redirections even when URL is outside of domain filter.
|
2010-03-07 09:36:21 +01:00 |
|
Bastian Kleineidam
|
2d73b907f1
|
Retry HTTP when server sent empty status line; should fix most of the BadStatusLine errors that are sporadically encountered.
|
2010-03-06 10:23:34 +01:00 |
|
Bastian Kleineidam
|
5e06b6b8d4
|
Updated FSF address in GPL blurb
|
2009-07-24 23:58:20 +02:00 |
|
Bastian Kleineidam
|
7f67027abf
|
ignore the fragment part (i.e. the anchor) of URIs when getting and caching content
|
2009-06-26 07:22:36 +02:00 |
|
Bastian Kleineidam
|
897b68ae9b
|
Fix copying of httpurl info
|
2009-03-07 00:17:17 +01:00 |
|
Bastian Kleineidam
|
29adfe92fd
|
Minor syntax fix
|
2009-03-06 20:14:50 +01:00 |
|
Bastian Kleineidam
|
6024f2e43e
|
Add missing reset of self.reused_connection flag
|
2009-03-06 20:10:03 +01:00 |
|
Bastian Kleineidam
|
58925b21d3
|
Improved persistent connection handling by retrying closed connections.
|
2009-03-06 08:15:34 +01:00 |
|
Bastian Kleineidam
|
29599e4c74
|
Make sure persistent connection will not close after reading contents.
|
2009-03-05 19:15:44 +01:00 |
|
Bastian Kleineidam
|
bf9ed8c659
|
Make sure file descriptors are closed after decoding HTTP content.
|
2009-03-05 19:15:03 +01:00 |
|
Bastian Kleineidam
|
7862147ca3
|
Fix showing content size.
|
2009-03-01 23:04:48 +01:00 |
|
calvin
|
e9805dbd8a
|
Updated copyright year to 2009
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3887 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2009-01-08 14:18:03 +00:00 |
|
calvin
|
209d5abc18
|
fix timeouts by testing earlier for persistent connections with HEAD
HEAD requests never have a body; nevertheless the http lib tries to
read() from them. This times out on some servers of course. Fix is
not to let those connections be persistent.
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3871 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2008-11-29 08:14:28 +00:00 |
|
calvin
|
c20e706761
|
Made some format changes on translated strings.
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3870 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2008-11-28 20:22:48 +00:00 |
|
calvin
|
c3b6fc5aa4
|
Readd
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3867 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2008-11-20 21:30:10 +00:00 |
|
calvin
|
97cf700e04
|
Fixed wrong cookie debugging format line.
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3849 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2008-07-13 12:51:56 +00:00 |
|
calvin
|
b30fb3b09c
|
Remove duplicate code in http checker.
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3820 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2008-06-16 19:52:09 +00:00 |
|
calvin
|
caf8ba6297
|
Really allow parsing of XHTML files; I forgot some places to adjust the MIME checking.
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3818 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2008-06-16 13:03:48 +00:00 |
|
calvin
|
a6deeeb8a5
|
Support parsing of HTML pages served with content type application/xhtml+xml
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3817 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2008-06-16 09:39:49 +00:00 |
|
calvin
|
a880939c40
|
Initialize variables in reset(), not in subsequent methods
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3796 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2008-06-08 09:27:13 +00:00 |
|
calvin
|
5f4d61e018
|
Use keyword arguments in translation strings.
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3780 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2008-05-27 19:44:40 +00:00 |
|
calvin
|
bacb59597e
|
Use relative imports from Python 2.5
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3750 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2008-05-09 06:16:03 +00:00 |
|
calvin
|
92c74ece4d
|
Send HTTP Referer header to both http and https URLs
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3741 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2008-04-29 13:33:35 +00:00 |
|
calvin
|
5d8bdaaa1f
|
Use generators instead of lists where possible
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3739 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2008-04-28 00:26:02 +00:00 |
|
calvin
|
3eac1be9ab
|
Require and use Python 2.5
Use Python 2.5 features and get rid of old compat code. Also some
code cleanups have been made.
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3737 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2008-04-27 11:39:21 +00:00 |
|
calvin
|
973da91f44
|
Source code cleanup: use or remove unused variables
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3724 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2008-04-25 07:49:52 +00:00 |
|
calvin
|
62efec3b35
|
Added CSS syntax check.
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3719 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2008-04-24 09:44:18 +00:00 |
|
calvin
|
5a2f89fa3d
|
Add redirect warning for commandline URLs
If URLs given on the commandline are redirected, the automatic
intern patterns might not match anymore. A warning makes this
more prominent.
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3712 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2008-04-21 09:18:36 +00:00 |
|
calvin
|
4055721fd4
|
Use internal gzip2 module
Use the internal gzip replacement module gzip2 for all GzipFile handling.
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3685 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2008-04-14 22:33:55 +00:00 |
|
calvin
|
4ce0ddd166
|
Changes for future Python 3.x compatibility
Replace backticks with repr(), replace .has_key() with "in".
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3680 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2008-03-19 10:22:57 +00:00 |
|
calvin
|
91a0aad5d8
|
Fix buggy persistent HTTP connections
Workaround for buggy servers that break protocol synchronization of
persistent HTTP connections.
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3677 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2008-03-14 13:28:43 +00:00 |
|
calvin
|
6499cb1a63
|
updated copyright year
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3658 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2008-01-02 14:31:19 +00:00 |
|
calvin
|
c971ebdabf
|
Added Shockwave Flash (SWF) parsing
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3656 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2007-12-28 02:12:48 +00:00 |
|
calvin
|
30d2b4f520
|
HTTP content data is only considered valid for parsing if the request was not redirected and is a GET request.
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3633 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2007-12-13 10:50:13 +00:00 |
|
calvin
|
41bc0b2b32
|
use 'self.data is None' to test if data is already read or not
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3631 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2007-12-03 14:09:20 +00:00 |
|
calvin
|
5591bbe052
|
fix self.downloadtime to self.dltime
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3630 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2007-12-03 14:01:36 +00:00 |
|
calvin
|
8d2dc781e1
|
Ensure unused or expired connections are closed.
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3617 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2007-11-30 16:42:41 +00:00 |
|
calvin
|
9cf3314eab
|
Use constants for warning tags, avoiding typos in string constants. And move the constants into a separate module const.py
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3611 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2007-11-29 07:50:22 +00:00 |
|
calvin
|
fcde8bd4d6
|
try to detect unknown URL schemes instead of manually setting the assume_local flag
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3609 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2007-11-28 18:46:50 +00:00 |
|
calvin
|
2edfaea03e
|
Read complete body data on persistent connections, else subsequent requests could fail.
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3568 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2007-08-08 19:33:10 +00:00 |
|
calvin
|
2b94c0c161
|
Assume missing HEAD requests for Zope server on text/plain content type
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3567 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2007-08-08 18:26:55 +00:00 |
|
calvin
|
df48d4a905
|
bump up copyright year
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3534 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2007-01-01 14:57:38 +00:00 |
|
calvin
|
c217b6d441
|
don't set result on self.get_content() redirections
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3515 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2006-11-17 20:42:00 +00:00 |
|
calvin
|
bef2494211
|
remove unused imports
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3482 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2006-09-24 10:13:59 +00:00 |
|
calvin
|
1883b79303
|
follow redirections when getting HTTP contents
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3473 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2006-09-21 09:27:38 +00:00 |
|
calvin
|
576d404ce2
|
close non-idle connections
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3453 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2006-09-15 21:27:19 +00:00 |
|
calvin
|
86514bb882
|
activate asset
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3448 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2006-09-15 20:21:06 +00:00 |
|
calvin
|
6348205dcc
|
add persistent connections back to the connection cache, close all others
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3444 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2006-09-15 19:59:30 +00:00 |
|
calvin
|
d6676ab0a0
|
more response closing, and cleanups
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3443 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2006-09-15 19:51:02 +00:00 |
|
calvin
|
6fe2db6755
|
use unicode_safe alias helper
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3442 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2006-09-15 19:46:55 +00:00 |
|
calvin
|
4b818cb4b3
|
Detect more cases to close the connection, and close response objects
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3437 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2006-09-15 16:35:53 +00:00 |
|
calvin
|
da15b15923
|
Split off the host wait time function, and use it with a separate lock
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3434 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2006-09-15 12:18:24 +00:00 |
|
calvin
|
7e1e01bd36
|
do not catch UnicodeError, handle that internally
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3269 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2006-05-19 17:13:16 +00:00 |
|
calvin
|
3adaf48b3d
|
add callback for crawldelay
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3227 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2006-05-17 15:35:58 +00:00 |
|
calvin
|
75e88c062a
|
added --cookiefile option to set initial cookie values
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3210 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2006-05-16 20:56:34 +00:00 |
|
calvin
|
91ff370ed7
|
on redirection to different URL scheme take caching into account; adjust tests
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3173 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2006-05-14 17:47:50 +00:00 |
|
calvin
|
2a336f8dad
|
put redirects in url queue
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3172 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2006-05-14 17:01:41 +00:00 |
|
calvin
|
dc9f04e6dc
|
adjust debug asserts
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3159 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2006-05-13 21:03:21 +00:00 |
|
calvin
|
f002c5f965
|
Replace the old threading algorithm with a new one based on Queue.Queue and consumer threads
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3146 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2006-05-13 13:44:52 +00:00 |
|
calvin
|
276437c7d8
|
syntax cleanup
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3067 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2006-02-09 07:57:22 +00:00 |
|
calvin
|
e92aee054c
|
updated copyright
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3010 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2006-01-03 19:12:47 +00:00 |
|
calvin
|
388475cbe2
|
new-style exceptions
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2999 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-12-20 20:17:33 +00:00 |
|
calvin
|
c84a33c7ce
|
syntax fix
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2988 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-12-18 09:13:23 +00:00 |
|
calvin
|
856ff8ef2a
|
assert debugs
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2987 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-12-18 08:55:42 +00:00 |
|
calvin
|
19c0a3c2ed
|
use new cookie parsing
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2983 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-12-18 08:19:11 +00:00 |
|
calvin
|
9425873830
|
cache aliases (from redirects)
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2964 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-12-08 20:20:21 +00:00 |
|
calvin
|
c86c0870d6
|
debugging
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2946 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-12-06 22:32:31 +00:00 |
|
calvin
|
a2e422ce0d
|
reindent
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2900 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-10-13 22:26:12 +00:00 |
|
calvin
|
f01b84b894
|
rework the redirection routine a little, putting warnings specifically for redirections
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2786 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-07-20 15:51:12 +00:00 |
|
calvin
|
317ef181f0
|
handle all redirections to different URL schemes,
not just HTTP -> not HTTP, and fix a variable typo
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2780 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-07-20 08:53:57 +00:00 |
|
calvin
|
d347840dee
|
use official HTTP status names
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2754 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-07-15 11:31:32 +00:00 |
|
calvin
|
c140b510b1
|
fix warning tags
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2746 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-07-13 19:33:26 +00:00 |
|
calvin
|
f80480469f
|
config to ignore certain warnings
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2743 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-07-13 15:03:17 +00:00 |
|
calvin
|
e7ca18403a
|
work around python bug occurring with incomplete gzip streams
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2716 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-07-05 19:56:42 +00:00 |
|
calvin
|
901227d436
|
assign extern info to local var
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2709 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-07-04 22:19:52 +00:00 |
|
calvin
|
49aa4ec208
|
remove unused imports
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2678 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-06-27 15:02:10 +00:00 |
|
calvin
|
6d8f0146d6
|
fix GET fallback handling
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2623 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-05-19 19:53:13 +00:00 |
|
calvin
|
d2107c7a6a
|
split and fix initialization of UrlData objects
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2622 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-05-19 19:42:14 +00:00 |
|
calvin
|
1fc5eb32b6
|
fix extern calculation and a typo
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2619 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-05-19 19:02:16 +00:00 |
|
calvin
|
55d7d5f7df
|
use a decorator to synchronize all the checker threads
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2614 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-05-19 15:49:06 +00:00 |
|
calvin
|
ed738d2436
|
cmdline param defaults to False
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2595 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-05-17 19:00:17 +00:00 |
|
calvin
|
1a7a771648
|
new intern/extern handling
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2584 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-05-09 22:05:21 +00:00 |
|
calvin
|
532a04e436
|
method change is an info, not a warning
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2563 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-05-04 13:18:19 +00:00 |
|
calvin
|
49a2dee338
|
make robots.txt deny an info instead of a warning
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2555 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-05-04 10:48:48 +00:00 |
|
calvin
|
82dee6ae52
|
reintroduce the self.url update, this time only for redirections but without a warning
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2542 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-04-26 17:51:15 +00:00 |
|
calvin
|
5e6317a4bb
|
remove old warning, and move effective url test to urlbase
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2532 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-04-25 18:25:32 +00:00 |
|
calvin
|
ba5939c7c5
|
allow optional empty url path
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2500 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-03-31 13:50:11 +00:00 |
|
calvin
|
eeb6da8330
|
do not cache HTTP connections with pending data
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2487 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-03-29 18:47:09 +00:00 |
|
calvin
|
7429dfa209
|
cached connection debugging
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2476 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-03-29 13:40:54 +00:00 |
|
calvin
|
dcd407782d
|
no double alias reset
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2454 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-03-18 00:02:50 +00:00 |
|
calvin
|
0b0e672b4e
|
cleanup of the fallbacks
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2453 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-03-16 17:34:02 +00:00 |
|
calvin
|
6857196b25
|
documentation
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2426 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-03-11 23:15:13 +00:00 |
|
calvin
|
964f313a89
|
fix return codes
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2424 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-03-11 22:39:34 +00:00 |
|
calvin
|
2b8ca62e2e
|
split up the big check function
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2423 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-03-11 22:37:09 +00:00 |
|
calvin
|
dc960185df
|
only set connection debug level if CHECK logging is on debug
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2388 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-03-09 21:55:20 +00:00 |
|
calvin
|
7f6d1b1022
|
more headers stuff
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2382 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-03-09 14:59:03 +00:00 |
|
calvin
|
b3c3c3eaaa
|
split off header methods in separate file
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2381 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-03-09 13:46:58 +00:00 |
|
calvin
|
ca49e04730
|
content encoding getter method
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2380 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-03-09 13:39:20 +00:00 |
|
calvin
|
12b726b4df
|
strip content type mime
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2379 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-03-09 13:33:20 +00:00 |
|
calvin
|
0d063a3b53
|
skip accept encoding, and list complete recursion list
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2377 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-03-09 08:44:45 +00:00 |
|
calvin
|
4972a5b2f4
|
add warning on decompress errors
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2375 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-03-08 21:04:02 +00:00 |
|
calvin
|
e1e713036b
|
use noproxyfor config values to ignore proxy settings
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2360 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-03-07 13:57:01 +00:00 |
|
calvin
|
703da9eb65
|
safe decoding of non-unicode strings
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2322 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-02-18 11:22:52 +00:00 |
|
calvin
|
05f5123d7c
|
make sure infos and warnings are unicode
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2320 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-02-18 09:20:45 +00:00 |
|
calvin
|
0e12b923f6
|
make sure results are unicode
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2319 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-02-18 09:17:46 +00:00 |
|
calvin
|
44075c47bf
|
clean up raise calls
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2294 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-02-08 14:52:50 +00:00 |
|
calvin
|
77cea28d42
|
pylint cleanups
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2292 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-02-08 13:43:33 +00:00 |
|
calvin
|
2fced4f37d
|
documentation
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2288 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-02-08 12:10:39 +00:00 |
|
calvin
|
01ceb0aafd
|
HTTP connection pooling
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2245 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-01-30 22:39:09 +00:00 |
|
calvin
|
d030a5b054
|
documentation updated
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2164 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-01-19 15:56:48 +00:00 |
|
calvin
|
647d7167ee
|
documentation syntax
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2163 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-01-19 15:08:02 +00:00 |
|
calvin
|
b008747f39
|
documentation
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2160 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-01-19 01:04:38 +00:00 |
|
calvin
|
b06f144ced
|
updated copyright year
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2122 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2005-01-11 02:22:43 +00:00 |
|
calvin
|
ef0d624c0d
|
unicode fixes
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2089 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2004-12-06 22:34:20 +00:00 |
|
calvin
|
3b13dd87e2
|
display correct url in redirect message
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2014 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2004-11-19 13:08:26 +00:00 |
|
calvin
|
be5a59e9fd
|
add dots to warnings
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2010 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2004-11-19 11:18:06 +00:00 |
|
calvin
|
bc89449080
|
add dots to info messages
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2008 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2004-11-19 11:06:31 +00:00 |
|
calvin
|
d2a6f4ed95
|
recheck robots.txt allowance after redirect
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2002 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2004-11-18 00:57:35 +00:00 |
|
calvin
|
89616a4bba
|
basic robots_txt check method
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2001 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2004-11-18 00:55:47 +00:00 |
|
calvin
|
74f1935be5
|
recheck extern/intern filters for redirected URLs
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1998 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2004-11-18 00:49:33 +00:00 |
|
calvin
|
265f942ba1
|
add unquoted redirectino url to info
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1980 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2004-11-14 11:02:17 +00:00 |
|
calvin
|
75ec50b156
|
adjust for new url_norm method
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1965 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2004-11-13 11:20:33 +00:00 |
|