Commit graph

6356 commits

Author SHA1 Message Date
Chris Mayo
89c7c74bcf Remove unused set_linecache() from better_exchook2.py 2020-05-25 19:50:57 +01:00
Chris Mayo
7257e5e1a0 Remove unused imports in parser/__init__.py 2020-05-25 19:50:57 +01:00
Chris Mayo
6c8e88dae6
Merge pull request #412 from cjmayo/unicode2
Remove instances of Python 2 unicode
2020-05-24 19:20:07 +01:00
Chris Mayo
313a14ff0d Remove instances of Python 2 unicode 2020-05-24 19:14:47 +01:00
Marius Gedminas
d0169c46d4
Merge pull request #348 from weshaggard/HandleRateLimiting
Turn status code 429 into warning instead of failure
2020-05-24 16:16:56 +03:00
Marius Gedminas
dcafa2df75
Avoid u-prefixed strings
linkchecker is Python 3 only, all strings are unicode.
2020-05-24 14:50:07 +03:00
Chris Mayo
9c982533e0
Merge pull request #411 from cjmayo/empty-http
Fix internal error on empty HTML files accessed over HTTP
2020-05-23 20:27:12 +01:00
Chris Mayo
03b1c4919d Record encoding in debug log messages 2020-05-23 20:01:24 +01:00
Chris Mayo
f7337f55e8 Fix error due to an empty html file accessed over http
Use the already fixed [1] UrlBase.get_content() in HttpUrl.

[1] 5bd1fb4 ("Fix internal error on empty HTML files", 2020-05-21)
2020-05-23 20:01:24 +01:00
Chris Mayo
d611564cb0 Add a test for an empty html file accessed over http 2020-05-23 20:01:24 +01:00
Marius Gedminas
f268a90cfb
Merge branch 'master' into HandleRateLimiting 2020-05-23 14:15:52 +03:00
anarcat
b1e8137da2
Merge pull request #410 from cjmayo/install
Installation of test data, standardise Markdown extension, remove linkchecker.desktop
2020-05-22 20:07:15 -04:00
Chris Mayo
df79e9b196 Add missing test data and Markdown documentation to the distribution 2020-05-22 19:43:57 +01:00
Chris Mayo
c60887cc63 Rename .mdwn files to .md
- RFC 7763 file extensions are .md and .markdown
- Consistent with other documentation files
2020-05-22 19:43:57 +01:00
Chris Mayo
87f0c31928 Remove linkchecker.desktop
- Not useful for a command-line application
- Refers to an icon with a generic name, that is not installed
2020-05-22 19:43:57 +01:00
Marius Gedminas
6dffacf17f
Merge pull request #409 from linkchecker/fix-login-timeouts
Make sure login form fetching uses a timeout and sends User-Agent
2020-05-22 21:40:48 +03:00
anarcat
2256a6e889
Merge pull request #408 from linkchecker/fix-timeouts
Make sure fetching robots.txt uses the configured timeout
2020-05-22 14:29:12 -04:00
Marius Gedminas
b0435b3d47 Make sure login form fetching uses a timeout
Also resolve an XXX comment about the User-Agent header (which is
configured in new_request_session), but add a couple of XXX comments
about using proxy and possibly disabling TLS certificate checking.
2020-05-22 11:19:51 +03:00
Marius Gedminas
4f3fe5e1c3 Make sure fetching robots.txt uses the configured timeout
Closes #396.
2020-05-22 10:53:33 +03:00
Marius Gedminas
639ba0dba2
Merge pull request #406 from linkchecker/fix-empty-file-problem
Fix internal error on empty HTML files
2020-05-21 19:57:46 +03:00
Marius Gedminas
c60d7c66e4 Clarify the decision to fall back to Latin-1 2020-05-21 19:35:39 +03:00
Marius Gedminas
5bd1fb4e36 Fix internal error on empty HTML files
When BeautifulSoup finds an empty file on disk, it sets
original_encoding to None.  It doesn't matter what encoding we pick for
empty files, so let's just pick one.

I don't know if there are any circumstances where BeautifulSoup might
set the encoding to None for a non-empty file.

Closes #392.
2020-05-21 19:01:33 +03:00
Marius Gedminas
fd3ab13470
Merge pull request #397 from linkchecker/doc-389
do not require ssh to clone from source
2020-05-21 18:28:20 +03:00
anarcat
a226b4e406
Merge pull request #405 from cjmayo/tidyten13
Remove encoding of TestLogger diff  and url in Checker.check_url_data()
2020-05-21 08:56:08 -04:00
Chris Mayo
6cfc8eeb49 Replace threading.Thread.setName() with setting the name property
As recommended in:

https://docs.python.org/3.5/library/threading.html#threading.Thread.setName
2020-05-20 19:58:44 +01:00
Chris Mayo
42eba19a7d No need to encode url in Checker.check_url_data()
Was causing b'' in log messages e.g. CheckThread-b'http:...
2020-05-20 19:58:44 +01:00
Chris Mayo
96e1c00ff7 TestLogger diff output is all Unicode in Python 3 2020-05-20 19:58:44 +01:00
Chris Mayo
768952e111
Merge pull request #403 from cjmayo/tidyten12
Remove "from builtins import str as str_text"
2020-05-20 19:38:14 +01:00
Marius Gedminas
1ab45c2e60
Merge pull request #402 from linkchecker/flake8
Add a 'tox -e flake8' and a Travis CI job
2020-05-19 23:09:03 +03:00
Chris Mayo
71eaf9a982 Remove str_text from tests/ 2020-05-19 19:56:42 +01:00
Chris Mayo
28f4587dfa Remove str_text from fileutil.py, strformat.py and url.py 2020-05-19 19:56:42 +01:00
Chris Mayo
ebcc3c4961 Remove str_text from plugins/ 2020-05-19 19:56:42 +01:00
Chris Mayo
1c14583535 Remove str_text from logger/ 2020-05-19 19:56:42 +01:00
Chris Mayo
6bddd4ac60 Remove str_text from checker/ 2020-05-19 19:56:42 +01:00
Chris Mayo
a127902607 Replace str_text in asserts 2020-05-19 19:56:42 +01:00
Chris Mayo
7490804e2c
Merge pull request #395 from cjmayo/tidyten11
Remove unused code from linkcheck/fileutil.py
2020-05-19 19:45:08 +01:00
Marius Gedminas
7a43abe6d6 Add a flake8 job to the Travis matrix
https://docs.travis-ci.com/user/build-matrix/#rows-that-are-allowed-to-fail
suggests that this might not work.
2020-05-19 19:27:40 +03:00
Marius Gedminas
72e7c600f3 Add a 'tox -e flake8' 2020-05-19 19:24:22 +03:00
anarcat
8183b7feb8
Update doc/install.txt
Co-authored-by: Marius Gedminas <marius@gedmin.as>
2020-05-19 12:05:21 -04:00
Marius Gedminas
391bd5882a
Merge pull request #394 from gbabin/fix-translations-encoding
Fix translations encoding (issue #165)
2020-05-19 18:53:06 +03:00
Marius Gedminas
e6e969f975
Merge pull request #391 from linkchecker/dev-version
Bump version in git to 10.0.0.dev0
2020-05-19 18:49:34 +03:00
Antoine Beaupré
2c9bc4e7ba
do not require ssh to clone from source
Closes: #389
2020-05-19 11:04:15 -04:00
Chris Mayo
690605c519 Remove unused code from linkcheck/fileutil.py 2020-05-18 19:29:55 +01:00
Chris Mayo
ce0a151c8a
Merge pull request #393 from cjmayo/tidyten10
Assorted test tidy-ups
2020-05-18 19:16:44 +01:00
Guillaume Babin
1cda677bac
Make po/Makefile version-agnostic
Co-authored-by: Marius Gedminas <marius@gedmin.as>
2020-05-18 18:24:36 +02:00
gbabin
fc763be5a2 Make translations 2020-05-18 13:33:12 +02:00
gbabin
ede0b149fd Fix .mo makefile 2020-05-18 13:32:28 +02:00
gbabin
d339c61602 Fix #165 translations encoding 2020-05-18 12:58:45 +02:00
Chris Mayo
12fd59057e Remove duplicate tests from test_strformat.py 2020-05-17 20:10:28 +01:00
Chris Mayo
339d293326 Convert tests/test_po.py to UTF-8 2020-05-17 20:10:28 +01:00