Chris Mayo
|
d189445a8e
|
LinkFinder does not raise StopParse
|
2020-04-18 20:30:46 +01:00 |
|
Chris Mayo
|
ee6628a831
|
Move HtmlParser/htmlsax.py to htmlutil/htmlsoup.py
Remove one subpackage and some import lines where htmlutil.linkparse is
also being used.
|
2020-04-18 20:30:45 +01:00 |
|
Chris Mayo
|
0795e3c1b4
|
Replace Parser class using BeautifulSoup.find_all()
|
2020-04-10 13:51:09 +01:00 |
|
Chris Mayo
|
02e1c389b2
|
Remove parser flush() and reset()
Remnants of the feed() interface.
|
2020-04-08 20:03:35 +01:00 |
|
Chris Mayo
|
3771dd9136
|
Use parser.feed_soup() instead of parser.feed()
Markup is not being passed in pieces to the parser, so simplify the
interface and reduce the state further.
|
2020-04-08 20:03:35 +01:00 |
|
Chris Mayo
|
9d8d251d06
|
Replace Parser lineno() and column() methods
Stop storing this data in Parser object state.
|
2020-04-08 20:03:35 +01:00 |
|
Bastian Kleineidam
|
176b95a30e
|
Do not strip quotes from resolved URLs.
|
2014-07-11 00:43:46 +02:00 |
|
Bastian Kleineidam
|
981079c041
|
Support itemtype attribute parsing.
|
2014-04-23 22:03:20 +02:00 |
|
Bastian Kleineidam
|
4232b69633
|
Support <img> srcset attribute parsing.
|
2014-04-10 17:51:59 +02:00 |
|
Bastian Kleineidam
|
b6b5c7a12e
|
Simpler link parsing routine.
|
2014-03-27 19:49:17 +01:00 |
|
Bastian Kleineidam
|
c806be5c15
|
Updated copyright
|
2014-01-08 22:33:04 +01:00 |
|
Bastian Kleineidam
|
e0a2558b2b
|
Updated copyright.
|
2013-12-24 07:13:16 +01:00 |
|
Bastian Kleineidam
|
78ed1e9e52
|
Do not GET on POST forms.
|
2013-12-10 23:42:43 +01:00 |
|
Bastian Kleineidam
|
57397e938b
|
Improved linkname parsing by adding a new peek() HTML parser function.
|
2010-03-09 11:31:12 +01:00 |
|
Bastian Kleineidam
|
51a0ef0ad4
|
Speed up HTML parsing by stopping early and adding callbacks.
|
2010-03-08 09:04:33 +01:00 |
|
Bastian Kleineidam
|
5e06b6b8d4
|
Updated FSF address in GPL blurb
|
2009-07-24 23:58:20 +02:00 |
|
calvin
|
1f803fd80c
|
Run tests with nosetests
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3941 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2009-01-28 23:12:03 +00:00 |
|
calvin
|
e9805dbd8a
|
Updated copyright year to 2009
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3887 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2009-01-08 14:18:03 +00:00 |
|
calvin
|
7cb6900a9c
|
Account for added htmlutil package
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3791 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2008-06-07 13:08:10 +00:00 |
|
calvin
|
9352dbf5e4
|
Move test files to separate module
git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@3763 e7d03fd6-7b0d-0410-9947-9c21f3af8025
|
2008-05-20 17:01:16 +00:00 |
|