Commit graph

497 commits

Author SHA1 Message Date
calvin
d8e738c60b check syntax and cache before putting url objects in the queue
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1277 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-03-04 12:17:38 +00:00
calvin
d79aee3a2c xml prefix for attr var
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1272 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-03-01 15:49:32 +00:00
calvin
af5be26d2c use XmlUtils instead of xmlify for quoting
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1271 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-03-01 15:38:56 +00:00
calvin
b63fb15986 hmmmm
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1267 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-02-21 15:06:22 +00:00
calvin
b7e54260b0 also quote parent url
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1265 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-02-21 14:54:10 +00:00
calvin
033a0873be better error msg
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1261 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-02-21 11:56:24 +00:00
calvin
58057bd07f better err msg
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1260 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-02-21 11:48:39 +00:00
calvin
85115c2039 cleanup
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1257 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-02-20 14:17:49 +00:00
calvin
bd628b7de7 use new url.py
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1256 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-02-20 14:14:31 +00:00
calvin
5187dbc4c2 quote url in output
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1255 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-02-20 14:13:42 +00:00
calvin
ab9092d7a0 catch errors earlier in recursion check
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1253 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-02-19 23:27:21 +00:00
calvin
fefba0036d catch ValueError, raise IncompleteRead on invalid chunk length
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1250 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-02-19 23:13:30 +00:00
calvin
a02d8ae2a4 fallback in redirections
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1239 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-29 23:47:21 +00:00
calvin
83b7ef7ff9 break cycles
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1238 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-29 23:46:13 +00:00
calvin
4e8c8547ec fix typos
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1237 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-29 21:39:15 +00:00
calvin
7121f81aff language
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1236 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-29 21:37:06 +00:00
calvin
967cadaa26 fallback to GET
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1231 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-29 21:20:28 +00:00
calvin
76452953f8 use file instead of open
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1226 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-29 19:04:49 +00:00
calvin
669866a7ab add NoneLogger
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1223 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-29 19:02:50 +00:00
calvin
fa9023d9f8 fix file parsing, ignore comments and empty lines
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1222 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-29 19:02:31 +00:00
calvin
8a474914f3 added NOneLogger, adjust blacklist default file and handling
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1221 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-29 19:02:06 +00:00
calvin
d78d96dd0e added
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1220 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-29 19:01:24 +00:00
calvin
7216e582fe nicer host not found error msg
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1213 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-29 14:36:21 +00:00
calvin
2c119a027a added
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1211 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-29 14:10:35 +00:00
calvin
4df200a2d2 merged from webcleaner
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1205 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-28 23:38:00 +00:00
calvin
f4dde29117 parse fixes merged from webcleaner
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1204 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-28 23:04:39 +00:00
calvin
44f5941552 use new parser interface
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1203 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-28 22:49:20 +00:00
calvin
66ecc466b7 resolve entities
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1202 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-28 22:48:50 +00:00
calvin
26072afd92 new style parser object class
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1200 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-28 22:33:34 +00:00
calvin
aa64775892 added setdefault function
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1196 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-28 09:03:21 +00:00
calvin
c62de8c0d5 gc debug functions
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1195 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-28 09:03:11 +00:00
calvin
ad7689ee02 documentation
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1183 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-07 21:37:10 +00:00
calvin
23eb7efc89 less aggressive thread aqcuiring
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1182 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-07 21:27:49 +00:00
calvin
fce225826b honor nofollow robots.txt param in html meta tag
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1177 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-07 20:50:07 +00:00
calvin
78d969cd47 updated
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1175 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-04 09:35:43 +00:00
calvin
ed563ee2e6 cleanup
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1173 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-04 09:23:00 +00:00
calvin
17d79f45f3 fix mime-type checking to allow parsing of external stylesheets
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1172 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-04 09:19:12 +00:00
calvin
96243c3047 restructure lock functions
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1164 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-03 19:44:49 +00:00
calvin
977cc8ae9d add strduration imports
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1155 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-03 15:16:32 +00:00
calvin
2398ee2aa3 copyright updated
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1153 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-03 15:12:04 +00:00
calvin
fef96392d6 updated copyright
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1150 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-03 14:59:33 +00:00
calvin
da786040ef path join with list
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1149 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-03 14:57:54 +00:00
calvin
6a09ab9e22 increase cache limits
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1146 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-03 13:59:18 +00:00
calvin
20b8f0dbc5 active threads function
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1145 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-03 13:41:53 +00:00
calvin
1f9ce630aa new --status option
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1142 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-03 13:30:00 +00:00
calvin
a7607f3858 new --status option
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1141 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-03 13:27:47 +00:00
calvin
45620a8453 strduration helper
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1140 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-03 13:26:30 +00:00
calvin
a954c1d998 use setThreads
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1138 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-03 12:31:59 +00:00
calvin
a17bf11f4b updated caching
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1132 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-02 23:30:22 +00:00
calvin
c0c91b17d5 updated threading
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1131 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-02 23:30:11 +00:00