Commit graph

683 commits

Author SHA1 Message Date
calvin
0fe112ad4e better url joining
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1466 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-08-23 22:53:57 +00:00
calvin
311be4ac04 added
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1465 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-08-23 22:53:27 +00:00
calvin
01976e953b add trailing directory slash
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1463 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-08-23 22:27:24 +00:00
calvin
4e654e327b recursion of directories
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1462 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-08-23 22:20:11 +00:00
calvin
84b9c1ab2e set result for directories
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1461 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-08-23 22:15:59 +00:00
calvin
a2a1429b7e fix intern url setting
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1460 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-08-23 21:34:16 +00:00
calvin
5bdea05dff fix endless loops with broken urls with a non-empty anchor
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1458 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-08-23 21:24:30 +00:00
calvin
83274d0cd3 validated html output
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1456 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-08-23 21:13:12 +00:00
calvin
2964bce633 no os.linesep in translation
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1455 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-08-23 20:56:41 +00:00
calvin
7669ef1b6b more case tests
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1452 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-08-23 20:18:30 +00:00
calvin
36db68852f errors and warnings
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1451 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-08-23 20:16:53 +00:00
calvin
d1c432f146 add real url to test result
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1449 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-08-23 20:04:54 +00:00
calvin
3f300f38e1 check resources
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1448 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-08-23 20:03:07 +00:00
calvin
e44354c3e2 fix timeout exception
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1446 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-08-23 15:20:49 +00:00
calvin
2bce376e61 log real url
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1445 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-08-19 23:38:35 +00:00
calvin
614255a441 add intern url always on cmdline
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1444 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-08-19 23:24:25 +00:00
calvin
726aac7ac0 fix cache key
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1443 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-08-19 23:05:41 +00:00
calvin
c37d3acd50 move debug message for url queue adding
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1442 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-08-19 23:05:31 +00:00
calvin
53e89e3b39 added
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1435 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-08-19 21:39:08 +00:00
calvin
c3100ef518 new consumer interface
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1434 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-08-19 21:36:17 +00:00
calvin
a66a8359d1 colored logger is default now
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1433 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-08-19 21:36:00 +00:00
calvin
f2e7ca6040 split off cache and url consumer routines into separate classes
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1432 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-08-19 21:35:47 +00:00
calvin
097624bc98 pylint
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1429 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-08-16 20:11:59 +00:00
calvin
7a0888b9eb fix mail parsing, and connection closing
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1428 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-08-16 20:11:21 +00:00
calvin
60666f5abb removed unused import, changed adress to address
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1427 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-08-16 20:10:48 +00:00
calvin
e25ea13fa7 added
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1426 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-08-16 19:28:42 +00:00
calvin
4756641e1b source code restructuring
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1423 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-08-16 19:20:53 +00:00
calvin
1cf5a14352 functional tests
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1422 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-08-16 19:20:16 +00:00
calvin
14a9b5c426 unit tests
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1421 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-08-16 19:20:06 +00:00
calvin
9f7e3e67a9 removed
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1420 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-08-16 19:17:36 +00:00
calvin
c9261c96c2 install_data
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1403 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-07-30 08:09:18 +00:00
calvin
8a083287fb dns config
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1401 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-07-26 14:07:22 +00:00
calvin
213b9e3cec pycheck fixes
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1400 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-07-26 13:54:32 +00:00
calvin
1f6670e8cd import fixes
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1399 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-07-26 13:47:19 +00:00
calvin
96dd6ef4b8 import fixes
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1398 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-07-26 12:01:52 +00:00
calvin
3e56e96e14 import fixes
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1397 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-07-26 11:43:11 +00:00
calvin
02b177017e fix getUrlDataFrom invocation
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1396 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-07-26 11:18:12 +00:00
calvin
b674575de1 import fixes
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1388 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-07-22 13:36:43 +00:00
calvin
6942fccb50 import fixes
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1386 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-07-22 13:19:18 +00:00
calvin
82d2ec5d51 removed
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1385 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-07-22 11:34:53 +00:00
calvin
32bc5cd292 more import fixes
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1383 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-07-22 10:54:47 +00:00
calvin
6c2c8f78b6 removed
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1382 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-07-22 10:47:55 +00:00
calvin
b60070a922 bk movements
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1375 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-07-20 14:50:00 +00:00
calvin
5ad8c827b4 syntax updated
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1374 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-07-20 14:49:44 +00:00
calvin
c071230c1b removed
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1372 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-07-20 14:42:18 +00:00
calvin
a5204c56d5 resynced with newest upstream
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1371 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-07-20 14:32:04 +00:00
calvin
6d8ae43f37 moved
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1367 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-07-19 09:02:52 +00:00
calvin
6476c8675d more import fixes
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1364 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-07-19 08:58:59 +00:00
calvin
916f96cc0d checker module
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1357 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-07-07 18:15:17 +00:00
calvin
018cf945d1 new module layout
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1356 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-07-07 18:04:40 +00:00
calvin
6f37e1961d added
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1355 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-07-07 18:01:25 +00:00
calvin
5a644b35b3 removed
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1354 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-07-06 22:08:05 +00:00
calvin
3bbfac47c7 removed
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1353 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-07-06 20:34:00 +00:00
calvin
dc90faac9e renamed
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1351 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-07-03 06:17:08 +00:00
calvin
bde88f9715 added string utils to parser, and sync with webcleaner
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1350 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-07-02 18:25:00 +00:00
calvin
a4925830f6 cleanup
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1345 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-05-27 21:16:14 +00:00
calvin
cf0ec06ef0 do not quote None
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1343 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-05-27 21:05:14 +00:00
calvin
097bb8a143 mv contentAllowsRobots to end of recursion check
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1339 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-05-27 20:43:41 +00:00
calvin
49e2b1f10d rework anchor fallback
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1336 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-05-27 20:27:59 +00:00
calvin
715a80afff ignore flush errors
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1335 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-05-27 17:31:05 +00:00
calvin
7556d4e72c correctly quote request url
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1331 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-05-27 09:05:46 +00:00
calvin
58fab5a44f updated from webcleaner
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1330 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-05-27 08:29:43 +00:00
calvin
04e0a9448d updated
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1325 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-05-27 00:30:37 +00:00
calvin
4353d97854 added
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1324 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-05-27 00:18:04 +00:00
calvin
1f28911a23 actually fallback to GET with Zope servers
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1321 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-05-26 23:54:47 +00:00
calvin
ce68bb782a cleanup
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1320 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-05-26 23:48:28 +00:00
calvin
e9341590d4 better err msg on bad status line
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1318 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-05-26 23:02:47 +00:00
calvin
abccff16ea fall back to GET on bad status line
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1317 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-05-26 23:00:21 +00:00
calvin
37b69e16e4 uri regex url added
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1314 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-05-03 08:53:38 +00:00
calvin
8dcd8f408a copyright
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1313 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-04-05 09:55:14 +00:00
calvin
ca081c2168 also check robots allowance of HTML files
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1304 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-04-04 10:48:31 +00:00
calvin
50bc463bb1 check cache
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1303 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-04-04 10:42:15 +00:00
calvin
2ffb97a855 get new urls from top of queue
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1302 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-04-04 09:45:33 +00:00
calvin
e78a8ea539 full css parsing
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1300 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-04-04 09:30:10 +00:00
calvin
f4802fd467 pychecker
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1299 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-04-04 08:41:30 +00:00
calvin
fa46757bd7 fix import
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1298 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-04-04 08:34:21 +00:00
calvin
68451e65dd O3
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1297 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-04-04 08:31:57 +00:00
calvin
93253954a8 updated
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1296 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-04-04 08:30:48 +00:00
calvin
672e118d9b use sorted dict
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1295 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-04-04 08:30:38 +00:00
calvin
8e4e92dddd minor improvements
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1294 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-04-04 08:30:21 +00:00
calvin
1b148b0b4e sorted dict
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1293 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-04-04 08:30:01 +00:00
calvin
e183ac84dc handle missing startquotes
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1292 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-04-04 08:29:31 +00:00
calvin
52609e4399 pychecker
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1291 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-04-03 18:10:38 +00:00
calvin
6b1d124d35 no sys.path fiddling
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1290 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-04-03 17:59:31 +00:00
calvin
8584d5bc8e only check robots.txt for http
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1285 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-04-03 16:34:58 +00:00
calvin
67fabd5d8e add contact email and url to user-agent string
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1284 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-03-11 11:01:00 +00:00
calvin
d8e738c60b check syntax and cache before putting url objects in the queue
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1277 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-03-04 12:17:38 +00:00
calvin
d79aee3a2c xml prefix for attr var
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1272 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-03-01 15:49:32 +00:00
calvin
af5be26d2c use XmlUtils instead of xmlify for quoting
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1271 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-03-01 15:38:56 +00:00
calvin
b63fb15986 hmmmm
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1267 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-02-21 15:06:22 +00:00
calvin
b7e54260b0 also quote parent url
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1265 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-02-21 14:54:10 +00:00
calvin
033a0873be better error msg
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1261 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-02-21 11:56:24 +00:00
calvin
58057bd07f better err msg
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1260 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-02-21 11:48:39 +00:00
calvin
85115c2039 cleanup
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1257 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-02-20 14:17:49 +00:00
calvin
bd628b7de7 use new url.py
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1256 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-02-20 14:14:31 +00:00
calvin
5187dbc4c2 quote url in output
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1255 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-02-20 14:13:42 +00:00
calvin
ab9092d7a0 catch errors earlier in recursion check
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1253 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-02-19 23:27:21 +00:00
calvin
fefba0036d catch ValueError, raise IncompleteRead on invalid chunk length
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1250 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-02-19 23:13:30 +00:00
calvin
a02d8ae2a4 fallback in redirections
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1239 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-29 23:47:21 +00:00
calvin
83b7ef7ff9 break cycles
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1238 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-29 23:46:13 +00:00
calvin
4e8c8547ec fix typos
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1237 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-29 21:39:15 +00:00
calvin
7121f81aff language
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1236 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-29 21:37:06 +00:00
calvin
967cadaa26 fallback to GET
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1231 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-29 21:20:28 +00:00
calvin
76452953f8 use file instead of open
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1226 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-29 19:04:49 +00:00
calvin
669866a7ab add NoneLogger
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1223 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-29 19:02:50 +00:00
calvin
fa9023d9f8 fix file parsing, ignore comments and empty lines
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1222 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-29 19:02:31 +00:00
calvin
8a474914f3 added NoneLogger, adjust blacklist default file and handling
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1221 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-29 19:02:06 +00:00
calvin
d78d96dd0e added
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1220 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-29 19:01:24 +00:00
calvin
7216e582fe nicer host not found error msg
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1213 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-29 14:36:21 +00:00
calvin
2c119a027a added
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1211 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-29 14:10:35 +00:00
calvin
4df200a2d2 merged from webcleaner
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1205 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-28 23:38:00 +00:00
calvin
f4dde29117 parse fixes merged from webcleaner
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1204 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-28 23:04:39 +00:00
calvin
44f5941552 use new parser interface
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1203 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-28 22:49:20 +00:00
calvin
66ecc466b7 resolve entities
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1202 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-28 22:48:50 +00:00
calvin
26072afd92 new style parser object class
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1200 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-28 22:33:34 +00:00
calvin
aa64775892 added setdefault function
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1196 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-28 09:03:21 +00:00
calvin
c62de8c0d5 gc debug functions
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1195 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-28 09:03:11 +00:00
calvin
ad7689ee02 documentation
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1183 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-07 21:37:10 +00:00
calvin
23eb7efc89 less aggressive thread acquiring
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1182 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-07 21:27:49 +00:00
calvin
fce225826b honor nofollow robots.txt param in html meta tag
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1177 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-07 20:50:07 +00:00
calvin
78d969cd47 updated
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1175 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-04 09:35:43 +00:00
calvin
ed563ee2e6 cleanup
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1173 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-04 09:23:00 +00:00
calvin
17d79f45f3 fix mime-type checking to allow parsing of external stylesheets
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1172 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-04 09:19:12 +00:00
calvin
96243c3047 restructure lock functions
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1164 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-03 19:44:49 +00:00
calvin
977cc8ae9d add strduration imports
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1155 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-03 15:16:32 +00:00
calvin
2398ee2aa3 copyright updated
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1153 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-03 15:12:04 +00:00
calvin
fef96392d6 updated copyright
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1150 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-03 14:59:33 +00:00
calvin
da786040ef path join with list
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1149 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-03 14:57:54 +00:00
calvin
6a09ab9e22 increase cache limits
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1146 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-03 13:59:18 +00:00
calvin
20b8f0dbc5 active threads function
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1145 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-03 13:41:53 +00:00
calvin
1f9ce630aa new --status option
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1142 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-03 13:30:00 +00:00
calvin
a7607f3858 new --status option
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1141 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-03 13:27:47 +00:00
calvin
45620a8453 strduration helper
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1140 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-03 13:26:30 +00:00
calvin
a954c1d998 use setThreads
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1138 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-03 12:31:59 +00:00
calvin
a17bf11f4b updated caching
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1132 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-02 23:30:22 +00:00
calvin
c0c91b17d5 updated threading
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1131 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-01-02 23:30:11 +00:00
calvin
06ddaf2bab debugging
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1128 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2003-12-29 19:13:20 +00:00
calvin
83a8c945dd only cache needed info
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1127 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2003-12-29 19:12:51 +00:00
calvin
0ae492c0ee leak debugging
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1126 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2003-12-29 19:12:33 +00:00
calvin
02f42652fe cosmetic fix
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1125 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2003-12-29 19:11:48 +00:00
calvin
d13f779d74 fix nt dns init and remove apply() use
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1123 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2003-12-28 23:01:51 +00:00
calvin
95611de5c3 replace backticks with repr
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1121 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2003-12-20 11:28:55 +00:00
calvin
fbfa5ee64e more robust registry indexing
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1120 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2003-12-20 11:27:54 +00:00
calvin
fe62a76aa0 fix safe_url pattern, it was too strict
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1119 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2003-12-19 19:16:21 +00:00
calvin
0357b1237e fix https support test
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1107 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2003-12-19 10:23:18 +00:00
calvin
d000aa3d21 missing import
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1104 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2003-12-18 14:21:53 +00:00
calvin
b5023d14c4 update nameserver parsing
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1103 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2003-12-18 14:20:37 +00:00
calvin
1e198a3b4d print last-modified date in infos
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1100 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2003-12-10 20:40:56 +00:00
calvin
7d87f007d4 do not add automatic filters with --strict when there are already some
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1090 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2003-12-07 10:14:18 +00:00
calvin
f39d0ef56e rename noanchorcaching to anchorcaching
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1087 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2003-12-05 00:38:30 +00:00