Chris Mayo
ffa6ac457f
Remove support for non-Tag elements from Parser
...
This change is made because the linkchecker handlers only process
Tags.
The test HtmlPrettyPrinter handler is updated to output element text
because its support for non-Tag elements has been removed. This results
in a number of the existing tests still passing.
2020-03-31 20:10:35 +01:00
Chris Mayo
1255119ca8
Move HtmlPrinter and HtmlPrettyPrinter into tests
2020-03-30 19:32:30 +01:00
Chris Mayo
5b66964afa
Remove unused .charset from checker classes
...
Unused since:
4f8c2954 ("Don't set parser.encoding", 2019-10-05)
2020-03-30 19:32:30 +01:00
Chris Mayo
f743be57e8
Remove unused functions from linkcheck.HtmlParser
...
resolve_entities() unused since:
2c000683 ("Remove unused linkcheck.htmlutil.linkname module",
2020-03-30)
set_doctype(), set_encoding() unused since:
51a06d8a ("Remove home-cooked htmlparser and use BeautifulSoup",
2019-07-22)
2020-03-30 19:32:18 +01:00
Petr Dlouhý
51a06d8a1e
Remove home-cooked htmlparser and use BeautifulSoup
2019-07-22 19:59:37 +01:00
Petr Dlouhý
8b9f29ae52
Python3: fix unichr() in htmlparser
2019-09-09 19:51:30 +01:00
Bastian Kleineidam
029c20ed98
More python3 fixes
2014-09-12 21:59:07 +02:00
Bastian Kleineidam
7b34be590b
Introduce check plugins, use Python requests for http/s connections, and some code cleanups and improvements.
2014-03-01 00:12:34 +01:00
Bastian Kleineidam
6d5e5f9efb
Updated copyright.
2012-03-30 22:24:10 +02:00
Bastian Kleineidam
b9b8e3f5b2
Honor the charset encoding of the Content-Type HTTP
...
header when parsing HTML.
2012-03-22 22:45:11 +01:00
Bastian Kleineidam
fb237041d1
Updated copyright
2011-10-20 08:14:16 +02:00
Bastian Kleineidam
d2ae6bf71c
Properly detect HTML character encoding.
2011-08-14 12:49:31 +02:00
Bastian Kleineidam
5e06b6b8d4
Updated FSF address in GPL blurb
2009-07-24 23:58:20 +02:00
calvin
e9805dbd8a
Updated copyright year to 2009
...
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3887 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2009-01-08 14:18:03 +00:00
calvin
6499cb1a63
updated copyright year
...
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3658 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2008-01-02 14:31:19 +00:00
calvin
9de237b4c2
Check that charset is not None before lowering it in set_encoding().
...
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3547 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2007-03-21 19:32:19 +00:00
calvin
df48d4a905
bump up copyright year
...
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3534 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2007-01-01 14:57:38 +00:00
calvin
3a5f06cfa9
remove unused strip_quotes
...
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3040 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2006-02-02 21:53:11 +00:00
calvin
75be4d0bb6
fix entity resolving
...
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3038 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2006-02-02 21:40:02 +00:00
calvin
cbef33ec5e
fix parsing of hexadecimal entities
...
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3037 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2006-02-02 20:06:59 +00:00
calvin
e94c61529b
catch value error on codec lookup
...
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3013 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2006-01-06 15:02:32 +00:00
calvin
e92aee054c
updated copyright
...
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3010 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2006-01-03 19:12:47 +00:00
calvin
cff9b1341b
improved charset parsing
...
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2979 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2005-12-18 08:16:25 +00:00
calvin
7d5be699df
documentation
...
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2922 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2005-11-02 13:49:03 +00:00
calvin
c2b4132eb9
ensure html attr values are strings
...
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2915 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2005-10-25 13:47:03 +00:00
calvin
a2e422ce0d
reindent
...
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2900 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2005-10-13 22:26:12 +00:00
calvin
b28be779d7
ensure tags are ASCII, regen with bison 2.0
...
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2817 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2005-10-10 21:13:35 +00:00
calvin
bf9e55c1c8
updated documentation
...
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2731 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2005-07-11 14:43:52 +00:00
calvin
3ce6aadfd6
updated documentation
...
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2730 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2005-07-11 14:37:52 +00:00
calvin
7df1caf58c
documentation
...
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2605 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2005-05-18 17:57:41 +00:00
calvin
d030a5b054
documentation updated
...
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2164 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2005-01-19 15:56:48 +00:00
calvin
edfea898b4
documentation updated, and set_encoding no longer has tag attr
...
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2151 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2005-01-18 15:53:23 +00:00
calvin
b06f144ced
updated copyright year
...
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2122 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2005-01-11 02:22:43 +00:00
calvin
4fbdbe3a51
XHTML support
...
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@2108 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-12-08 09:09:06 +00:00
calvin
10209ae499
emit unicode data, store encoding
...
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1853 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-10-06 19:05:48 +00:00
calvin
e25ea13fa7
added
...
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@1426 e7d03fd6-7b0d-0410-9947-9c21f3af8025
2004-08-16 19:28:42 +00:00