Commit graph

85 commits

Author SHA1 Message Date
Bastian Kleineidam
e429dbcc13 Do not parse URL CGI part recursively. 2010-10-27 20:55:21 +02:00
Bastian Kleineidam
4483635552 Add debuglevel, log errors and remove default handlers that are added by urllib2 for get_opener(). 2010-10-14 07:51:29 +02:00
Bastian Kleineidam
388ea0e7ff Add ability to pass POST data to url content function. 2010-10-11 19:54:06 +02:00
Bastian Kleineidam
a68329329f Fix get_content() function. 2010-10-03 12:11:25 +02:00
Bastian Kleineidam
9e54bbfa57 Move URL retreiving functions into url.py module. 2010-10-03 08:46:49 +02:00
Bastian Kleineidam
4e1b6d667e Set copyright. 2010-03-26 20:51:59 +01:00
Bastian Kleineidam
c4c098bd83 pep8-ify the source a little more 2010-03-13 08:47:12 +01:00
Bastian Kleineidam
0b7badc238 Do not quote slashes in query values. 2010-03-11 20:19:31 +01:00
Bastian Kleineidam
bee8023540 Fixed URL encoding 2010-02-22 01:06:19 +01:00
Bastian Kleineidam
77daf80e82 Add url encoding parameter 2009-11-28 11:56:35 +01:00
Bastian Kleineidam
5cd7b84596 Allow digits at end of domain names in safe domain check. 2009-07-26 23:16:42 +02:00
Bastian Kleineidam
5e06b6b8d4 Updated FSF address in GPL blurb 2009-07-24 23:58:20 +02:00
Bastian Kleineidam
fd610ba350 Encode spaces with %20 instead of + 2009-07-22 22:52:40 +02:00
calvin
366c711b43 Improved domain name checking
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3956 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2009-02-18 15:33:52 +00:00
calvin
e9805dbd8a Updated copyright year to 2009
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3887 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2009-01-08 14:18:03 +00:00
calvin
5d8bdaaa1f Use generators instead of lists where possible
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3739 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2008-04-28 00:26:02 +00:00
calvin
3eac1be9ab Require and use Python 2.5
Use Python 2.5 features and get rid of old compat code. Also some
code cleanups have been made.


git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3737 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2008-04-27 11:39:21 +00:00
calvin
4ce0ddd166 Changes for future Python 3.x compatibility
Replace backticks with repr(), replace .has_key() with "in".


git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3680 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2008-03-19 10:22:57 +00:00
calvin
6499cb1a63 updated copyright year
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3658 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2008-01-02 14:31:19 +00:00
calvin
40b3be412b revert the catch UnicodeError change
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3607 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2007-11-28 18:39:12 +00:00
calvin
c93fc79702 ignore errors in idna encoding of hostnames
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3591 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2007-11-13 12:36:42 +00:00
calvin
df48d4a905 bump up copyright year
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3534 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2007-01-01 14:57:38 +00:00
calvin
781ccf96c1 add irc scheme to netloc using schemes
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3238 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2006-05-17 19:12:29 +00:00
calvin
2947b4710d quote slashes in cgi values
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3158 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2006-05-13 20:52:30 +00:00
calvin
cce1d273a7 really return value
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3117 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2006-04-08 09:00:44 +00:00
calvin
38b83415dd correctly replace spaces in cgi params
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3102 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2006-04-03 19:12:43 +00:00
calvin
1df673a79d add url_unsplit method
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3029 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2006-02-01 18:56:20 +00:00
calvin
48cdfd32ee fix tests
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3011 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2006-01-04 01:20:20 +00:00
calvin
e92aee054c updated copyright
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3010 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2006-01-03 19:12:47 +00:00
calvin
fe24c56e84 only split host ports on known schemes
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3009 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2006-01-03 18:37:42 +00:00
calvin
bc846a20ff use own splitport method
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3001 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2006-01-02 14:26:24 +00:00
calvin
388475cbe2 new-style exceptions
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2999 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2005-12-20 20:17:33 +00:00
calvin
8d67ee35ef removed is_safe_js_url
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2995 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2005-12-20 09:08:51 +00:00
calvin
bcbbe6ff60 allow https js urls
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2935 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2005-11-12 19:56:46 +00:00
calvin
9eab0056ac encode url query
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2930 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2005-11-09 07:46:05 +00:00
calvin
a2e422ce0d reindent
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2900 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2005-10-13 22:26:12 +00:00
calvin
9c8cee034a quote anchor
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2866 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2005-10-13 12:50:48 +00:00
calvin
8953766ccd fix url part quoting
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2853 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2005-10-11 14:08:14 +00:00
calvin
fccccac818 coerce to unicode in url_norm if arg was unicode
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2845 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2005-10-11 12:09:16 +00:00
calvin
fc4ce83372 dont quote exclams
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2809 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2005-08-16 13:57:36 +00:00
calvin
b559c4f82e test for hierarchical URLs, and retain non-numeric port parts
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2739 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2005-07-12 07:18:13 +00:00
calvin
1a7a771648 new intern/extern handling
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2584 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2005-05-09 22:05:21 +00:00
calvin
96b74a1f41 collapse reduntant dot path at beginning of url
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2558 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2005-05-04 12:58:38 +00:00
calvin
93ce3364b0 split CGI params on both ampersand and semicolon
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2541 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2005-04-26 17:28:12 +00:00
calvin
ba5939c7c5 allow optional empty url path
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2500 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2005-03-31 13:50:11 +00:00
calvin
642e2d1698 do not split cgi params on semicolons
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2463 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2005-03-28 23:51:29 +00:00
calvin
703da9eb65 safe decoding of non-unicode strings
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2322 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2005-02-18 11:22:52 +00:00
calvin
fcf18b8e1f add ! to no quote chars
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2280 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2005-02-07 00:07:02 +00:00
calvin
74805745e8 fix needs_quoting method by escaping chars and add [] to allowed chars
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2214 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2005-01-27 13:59:23 +00:00
calvin
d030a5b054 documentation updated
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2164 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2005-01-19 15:56:48 +00:00