lychee

mirror of https://github.com/Hopiu/lychee.git synced 2026-05-27 06:24:02 +00:00

Author	SHA1	Message	Date
dependabot[bot]	2ce1a9ae06	Bump clap from 3.2.23 to 4.0.22 (#813 ) * Bump clap from 3.2.23 to 4.0.22 Bumps [clap](https://github.com/clap-rs/clap) from 3.2.23 to 4.0.22. - [Release notes](https://github.com/clap-rs/clap/releases) - [Changelog](https://github.com/clap-rs/clap/blob/master/CHANGELOG.md) - [Commits](https://github.com/clap-rs/clap/compare/v3.2.23...v4.0.22) * The `headers` option got renamed to `header` to align with the rest of the options, which are singular. * The short option for `header` (`-h`) was removed to avoid a conflict with help (`lychee -h`). * Update and simplify readme check Co-authored-by: Matthias <matthias-endler@gmx.net>	2022-11-13 21:10:32 +01:00
Matthias	35ccfb87c3	Add support for dumping links to file (#810 )	2022-11-08 00:33:16 +01:00
Matthias	264af23822	Improve wording	2022-11-05 17:25:44 +01:00
Andy Grunwald	a67b513238	Extend description of "--exclude" to also exclude email addresses, not only URLs (#801 )	2022-10-23 12:17:20 +02:00
Matthias	cbd936960a	Move from structopt to clap (#732 ) Structopt was subsumed by clap. See https://github.com/clap-rs/clap/blob/master/CHANGELOG.md#migrating	2022-08-12 22:53:13 +02:00
Matthias	69f387c1bd	Markdown-status (#729 ) * Fix typos * Add status code description to markdown output	2022-08-11 22:08:05 +02:00
tooomm	092b8b0bf1	reorder md output (#708 )	2022-08-04 00:48:45 +02:00
dependabot[bot]	960e32c55f	Bump tabled from 0.7.0 to 0.8.0 (#701 ) * Bump tabled from 0.7.0 to 0.8.0 Bumps [tabled](https://github.com/zhiburt/tabled) from 0.7.0 to 0.8.0. - [Release notes](https://github.com/zhiburt/tabled/releases) - [Changelog](https://github.com/zhiburt/tabled/blob/master/CHANGELOG.md) - [Commits](https://github.com/zhiburt/tabled/compare/v0.7.0...v0.8.0) --- updated-dependencies: - dependency-name: tabled dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> * Update tabled formatting and tests Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Matthias <matthias-endler@gmx.net>	2022-08-03 23:22:08 +02:00
dependabot[bot]	7c1b2f7527	Bump indicatif from 0.16.2 to 0.17.0 (#711 ) * Bump indicatif from 0.16.2 to 0.17.0 Bumps [indicatif](https://github.com/console-rs/indicatif) from 0.16.2 to 0.17.0. - [Release notes](https://github.com/console-rs/indicatif/releases) - [Commits](https://github.com/console-rs/indicatif/compare/0.16.2...0.17.0) --- updated-dependencies: - dependency-name: indicatif dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> * Update progress bar setup Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Matthias <matthias-endler@gmx.net>	2022-08-03 14:20:25 +02:00
Matthias	6fae93f2da	Skip caching unsupported and excluded URLs (#692 ) As discussed in https://github.com/lycheeverse/lychee/issues/647#issuecomment-1170773449, it does not make much sense to cache unsupported and excluded URLs. Unsupported URLs might be supported in the future and caching them would mean they won't get checked then. Excluded URLs were excluded for a reason and should not appear in the cache. Furthermore they might not be excluded in a consecutive run, leading to a false-positive.	2022-07-17 18:40:45 +02:00
Walter Beller-Morales	75a3da0b7e	Add status code in Markdown output (#677 )	2022-07-05 14:43:15 +02:00
Matthias	78185d3b63	Add documentation	2022-06-21 10:03:31 +02:00
Matthias	84de43c554	Refactor request types (#637 )	2022-06-03 20:13:07 +02:00
Matthias	a557cba0b4	Add support for parsing list of status codes from config file (#636 )	2022-06-02 18:53:04 +02:00
Matthias	9b4dfadffd	Fix parsing errors with config options (#632 )	2022-05-31 19:43:46 +02:00
vpereira01	d48a3279a8	Improve configuration example (#631 ) * Add missing parameters * Remove deprecated `--exclude-file` parameter * Improve TOML comments * Add config smoketest	2022-05-31 19:05:27 +02:00
Matthias	b40aacd459	Prepare for release v0.10.0 (#629 )	2022-05-30 23:02:18 +02:00
Matthias	22fecfc056	Add support for URI remapping (#620 ) Remaps allow mapping from a URI pattern to a different URI. The syntax is ``` lychee --remap 'https://example.com http://127.0.0.1' ``` Some use-cases are - Testing URIs prior to production deployment - Testing URIs behind a proxy Be careful when using this feature because checking every link against a large set of regular expressions has a performance impact. Also there are no constraints on the URI mapping, so the rules might contradict with each other. Remap rules get applied in order of definition to every input URI.	2022-05-29 21:41:22 +02:00
Matthias	363b95fe5f	Add support for excluding paths from link checking (#623 ) This change deprecates `--exclude-file` as it was ambiguous. Instead, `--exclude-path` was introduced to support excluding paths to files and directories that should not be checked. Furthermore, `.lycheeignore` is now the only way to exclude URL patterns.	2022-05-29 17:27:09 +02:00
Matthias	b40c785b64	Also dump excluded links (#615) This is a minimally invasive version, which allows grepping for `[excluded]`. Reporting the reason for exclusion would require more work and it's debatable whether it adds any value, because it might make grepping harder and the source of exclusion is easily deducible from the command-line parameters or the `.lycheeignore` file. Fixes #587.	2022-05-13 18:53:16 +02:00
Matthias	b0136683a9	Add support for comments in `.lycheeignore` (#616 ) Lines starting with the comment character (`#`) inside the .lycheeignore file will be ignored. Whitespace at the beginning of each line will be ignored, so even an indented comment character will work.	2022-05-13 18:51:58 +02:00
Matthias	8c0a32d81d	Refactor response formatting (#599 ) * Add support for raw formatter (no color) * Introduce ResponseFormatter trait * Pass the same params to every cli command * Update dependencies * Remove pretty_assertions dependency (latest version doesn't build)	2022-04-25 19:19:36 +02:00
dependabot[bot]	0d6f84217f	Bump tabled from 0.5.0 to 0.6.0 (#583 ) * Bump tabled from 0.5.0 to 0.6.0 Bumps [tabled](https://github.com/zhiburt/tabled) from 0.5.0 to 0.6.0. - [Release notes](https://github.com/zhiburt/tabled/releases) - [Changelog](https://github.com/zhiburt/tabled/blob/master/CHANGELOG.md) - [Commits](https://github.com/zhiburt/tabled/compare/v0.5.0...v0.6.0) --- updated-dependencies: - dependency-name: tabled dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> * #[field] #[header] in Tabled macro was renamed to #[tabled]. * Fix tabled rename field Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Matthias <matthias-endler@gmx.net>	2022-04-06 01:02:12 +02:00
MichaIng	b338ba2abc	Enhance verbosity check (#578 ) as suggested here: https://github.com/lycheeverse/lychee/pull/570#discussion_r835931903 Signed-off-by: MichaIng <micha@dietpi.com>	2022-04-04 10:31:30 +02:00
Matthias	36d3195c68	Cache verbosity issue (fixes #562 )	2022-03-27 14:48:09 +02:00
Matthias	743d386252	Allow input URLs without scheme (fixes #567 ) This requires `Input::new` to return a `Result`, because the URL parsing could fail when prepending `http://`. We use http instead of https, because curl does as well: `70ac27604a/lib/urlapi.c (L1104-L1124)` Missing files will be interpreted as URLs from the command line and these can be invalid, but that's not seen as an error anymore.	2022-03-27 01:27:27 +01:00
Matthias	d616177a99	Implement excluding code blocks (#523 ) This is done in the extractor to avoid unnecessary allocations.	2022-03-26 10:42:56 +01:00
Matthias	e1d112dbab	Remove `missing_panic_doc` (#561 )	2022-03-22 21:02:56 +01:00
Matthias	8097bfa408	Print Github token error once at the end (#537 ) Print original reqwest error for every Github link. It contains more information about the underlying error. Only print a message about the Github token at the end if it's not set and there were Github errors.	2022-03-03 10:04:55 +01:00
Matthias	4c51fce22f	Fix broken pipe error on failing writes to stdout (#535 ) Make sure that broken pipes (e.g. when a reader of a pipe prematurely exits during execution) get handled gracefully. This change also moves some error messages to stderr by using eprintln. More info: https://github.com/jez/as-tree/issues/15	2022-03-02 23:39:54 +01:00
Matthias	05bd3817ee	Make retry wait time configurable (#525 )	2022-02-24 12:24:57 +01:00
Matthias	41b291037a	Response output overhaul (#524 ) Clean up the response output. Superfluous information was removed and the formatting was changed to make the output more readable to humans.	2022-02-23 17:28:14 +01:00
dependabot[bot]	c4e004bdf8	Bump tabled from 0.4.2 to 0.5.0 (#505 ) * Bump tabled from 0.4.2 to 0.5.0 Bumps [tabled](https://github.com/zhiburt/tabled) from 0.4.2 to 0.5.0. - [Release notes](https://github.com/zhiburt/tabled/releases) - [Changelog](https://github.com/zhiburt/tabled/blob/master/CHANGELOG.md) - [Commits](https://github.com/zhiburt/tabled/compare/v0.4.2...v0.5.0) --- updated-dependencies: - dependency-name: tabled dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> * Update `tabled` format; add test Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Matthias <matthias-endler@gmx.net>	2022-02-19 02:23:38 +01:00
Matthias	ba276cd51b	Error cleanup (#510 ) * Add more fine-grained error types; remove generic IO error * Update error message for missing file * Remove missing `Error` suffix * Rename ErrorKind::Github to ErrorKind::GithubRequest for consistency with NetworkRequest	2022-02-19 01:44:00 +01:00
Matthias	812663d832	Prevent flaky tests (#514 ) Move from example.org to example.com, which seems to be more permissive for testing	2022-02-18 10:29:49 +01:00
Lucius Hu	6d56c6b55c	Replace plain String with SecretString for GitHub token (#509 ) This commit changed the type of `lychee-lib::ClientBuilder::github_token` from `String` to `secrecy::SecretString` to fortify the secret management within our program. Note that this won't affect TOML configuration of `lychee-bin` because `serde::Deserialize` is still implemented for `SecretString`.	2022-02-13 13:53:46 +01:00
Matthias	47df7780fe	Use captured identifiers in format strings (#507 ) Makes for arguably cleaner-looking code. The downside is that the MSRV is 1.58 https://blog.rust-lang.org/2022/01/13/Rust-1.58.0.html Given that nobody uses lychee as a library yet and we have precompiled binaries, it's an acceptable tradeoff. My little research revealed that this is a much-liked feature: https://twitter.com/matthiasendler/status/1483895557621960715	2022-02-12 10:51:52 +01:00
Matthias	9d738fb3f5	Fix default config (#491 ) The default configuration was broken since the introduction of caching and specifically `max_cache_age`. This fixes deserialization and config merging for the case where this key is missing from the config.	2022-02-07 23:17:50 +01:00
Markus Unterwaditzer	d8305f7f53	fix constant updating of progressbar (#488 ) * fix constant updating of progressbar In other issues I've already lamented how slow lychee is when used without `-n`. This fixes an issue where without `-n`, lychee would take 1 minute instead of 4 seconds to check sentry-docs. * fix values	2022-02-07 23:15:26 +01:00
Markus Unterwaditzer	68d09f7e5b	Add html5gum as alternative link extractor (#480) html5gum is an HTML parser that offers lower-level control over which tokens actually get created and tracked. As such, the extractor doesn't allocate any tokens it doesn't care about. On some benchmarks it provides a substantial performance boost. The old parser, html5ever, is still available by setting the `LYCHEE_USE_HTML5EVER=1` env var.	2022-02-07 22:54:47 +01:00
Lucius Hu	6bf8c1fe39	lychee-bin: replace lazy_static by const_format (#495 ) This commit replaced the use of `lazy_static` by `const_format` in `lychee-bin`. Currently `lazy_static` is used to generate static String at runtime. With `const_format` we can instead make constant String at compile time. Co-authored-by: Lucius Hu <lebensterben@users.noreply.github.com>	2022-02-07 22:45:17 +01:00
Matthias	4630216c30	Add description for `max-cache-age` flag	2022-01-14 16:55:56 +01:00
Matthias	ac490f9c53	Add caching functionality (v2) (#443 ) A while ago, caching was removed due to some issues (see #349). This is a new implementation with the following improvements: * Architecture: The new implementation is decoupled from the collector, which was a major issue in the last version. Now the collector has a single responsibility: collecting links. This also avoids race-conditions when running multiple collect_links instances, which probably was an issue before. * Performance: Uses DashMap under the hood, which was noticeably faster than Mutex<HashMap> in my tests. * Simplicity: The cache format is a CSV file with two columns: URI and status. I decided to create a new struct called CacheStatus for serialization, because trying to serialize the error kinds in Status turned out to be a bit of a nightmare and at this point I don't think it's worth the pain (and probably isn't idiomatic either). This is an optional feature. Caching only gets used if the `--cache` flag is set.	2022-01-14 15:25:51 +01:00
Matthias	36450621fa	Update dependencies (#454 )	2022-01-10 22:35:37 +01:00
dependabot[bot]	54b5be81c2	Bump tabled from 0.3.0 to 0.4.2 (#447 ) * Bump tabled from 0.3.0 to 0.4.2 Bumps [tabled](https://github.com/zhiburt/tabled) from 0.3.0 to 0.4.2. - [Release notes](https://github.com/zhiburt/tabled/releases) - [Changelog](https://github.com/zhiburt/tabled/blob/master/CHANGELOG.md) - [Commits](https://github.com/zhiburt/tabled/compare/v0.3.0...v0.4.2) --- updated-dependencies: - dependency-name: tabled dependency-type: direct:production update-type: version-update:semver-minor ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Matthias <matthias-endler@gmx.net>	2022-01-07 23:10:39 +01:00
Matthias	21f3160b71	Make retries configurable; align constants (#446 ) Using the same default values for the library and the binary now but tweaked the values a bit for slightly faster performance.	2022-01-07 01:03:10 +01:00
Matthias	5eb062cbec	Always hide GH token in opts	2022-01-06 09:54:03 +01:00
Matthias	01393b34a2	Upgrade to Rust 2021 (#427 )	2021-12-17 01:32:13 +01:00
Matthias	166c86c30e	Use tokenizer for extraction; add benchmark (#424 ) This avoids creating a DOM tree for link extraction and instead uses a `TokenSink` for on-the-fly extraction. In hyperfine benchmarks it was about 10-25% faster than the master. Old: 4.557 s ± 0.404 s New: 3.832 s ± 0.131 s The performance fluctuates a little less as well. Some missing element/attribute pairs were also added, which contain links according to the HTML spec. These occur very rarely, but it's good to parse them for completeness' sake. Furthermore tried to clean up a lot of papercuts around our types. We now differentiate between a `RawUri` (stringy-types) and a Uri, which is a properly parsed `URI` type. The extractor now only deals with extracting `RawUri`s while the collector creates the request objects.	2021-12-16 18:45:52 +01:00
Matthias	c41ba64a69	Max concurrency moved to check (#419 ) Concurrency is defined by the channel size consuming from the request stream in `check`	2021-12-07 11:52:40 +01:00
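
The comment rule described in b0136683a9 (lines in `.lycheeignore` that start with `#` are ignored, also when indented) can be illustrated with a minimal sketch. This is not lychee's actual implementation: the function name `active_patterns` is hypothetical, and skipping blank lines is an assumption that the commit message does not state.

```rust
/// Hypothetical helper, not part of lychee's API: returns the lines of a
/// `.lycheeignore`-style file that should be treated as exclude patterns.
fn active_patterns(contents: &str) -> Vec<&str> {
    contents
        .lines()
        // Ignore whitespace at the beginning of each line, so an indented
        // comment character is still recognized as a comment.
        .map(str::trim_start)
        // Drop comment lines. Dropping empty lines is an assumption made
        // for this sketch only.
        .filter(|line| !line.is_empty() && !line.starts_with('#'))
        .collect()
}
```

Called on a file that contains a comment line, a pattern line, and an indented comment line, `active_patterns` returns only the pattern line. A `#` that appears later in a line is left alone, since only the start of the line is checked.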


82 commits