lychee/src
Paweł Romanowski cd00fa643e
Fix HTML parsing for non-closed elements like <link> (#92)
* Fix HTML parsing for non-closed elements like <link>

The XML parser we use requires all tags to be closed by default,
and if they aren't (like HTML5 <link> elements), it simply gives up
on further parsing.  This change makes it ignore such issues.

Also uncover a bug with the current parser (it simply won't parse
elements like `<script defer src="..."></script>`) -- e.g. elements
with no attribute values.

The XML parser is an XML parser and will have to be replaced with
HTML aware parser in the future.

* Add check for empty elements

* Update extract.rs

Co-authored-by: Matthias <matthias-endler@gmx.net>
2021-01-03 17:32:13 +01:00
..
bin/lychee Add (machine-readable) output file support (fixes #53) 2020-12-14 01:15:14 +01:00
client.rs Make lychee usable as a library #13 (#46) 2020-12-04 10:44:31 +01:00
client_pool.rs Make lychee usable as a library #13 (#46) 2020-12-04 10:44:31 +01:00
collector.rs Make lychee usable as a library #13 (#46) 2020-12-04 10:44:31 +01:00
excludes.rs Make lychee usable as a library #13 (#46) 2020-12-04 10:44:31 +01:00
extract.rs Fix HTML parsing for non-closed elements like <link> (#92) 2021-01-03 17:32:13 +01:00
lib.rs Make lychee usable as a library #13 (#46) 2020-12-04 10:44:31 +01:00
test_utils.rs Make lychee usable as a library #13 (#46) 2020-12-04 10:44:31 +01:00
types.rs Make lychee usable as a library #13 (#46) 2020-12-04 10:44:31 +01:00
uri.rs Add (machine-readable) output file support (fixes #53) 2020-12-14 01:15:14 +01:00