Commit graph

5812 commits

Author SHA1 Message Date
Marius Gedminas
19a5f19282 Merge pull request #29 from linkcheck/dev-requirements
Move dev requirements into dev-requirements.txt
2017-02-08 14:41:56 +02:00
Marius Gedminas
83c584958e Move dev requirements into dev-requirements.txt 2017-02-07 16:07:38 +02:00
anarcat
9f3cc4f369 Merge pull request #19 from mgedmin/crawl-deterministically
Crawl HTML attributes in deterministic order
2017-02-01 12:34:24 -05:00
Marius Gedminas
743a5f31cb Crawl HTML attributes in deterministic order
Fixes #17.
2017-02-01 19:19:53 +02:00
anarcat
5a346bae66 Merge pull request #9 from mgedmin/add-tox
Add a tox.ini
2017-02-01 12:02:34 -05:00
Marius Gedminas
a825b9d901 Mark the non-deterministic test as xfail 2017-02-01 18:57:40 +02:00
anarcat
a4ce062b2c Merge pull request #15 from seamang/case-insensitive-headers
Force header names to lower to allow for CaseInsensitiveDict variability
2017-02-01 11:51:26 -05:00
Marius Gedminas
02869ea076 Mark TestFile.test_directory_listing as known to fail
The test unzipps a zip file with a weird-looking non-ASCII filename in it.
I don't think zip files specify the encoding for filenames.  Different
unzip utilities may interpret the filename differently.  Plus, the byte
representation of the unzipped filename may be different depending on
the filesystem charset.

To me it looks as if the filename is garbage encoded as valid UTF-8, and
the test expectation is to get it in latin-1 or something.
2017-02-01 18:45:05 +02:00
Marius Gedminas
cffea5fcbd Mark TestHttps.test_https as known to fail
This test depends on the way http://amazon.com/ works.  I don't think
that's a good idea.
2017-02-01 18:44:21 +02:00
Marius Gedminas
f4ec7531c1 Fix TestHttp.test_html
The HTML tag has two attributes with URLs:

  <applet archive="file.html" src="file.css">

It would appear that the order in which these attributes are crawled
does not match the order in the result file.

Possibly the crawling order is non-deterministic, although I cannot
reproduce that.  If that's the case, the fix would be to sort the
attributes in the crawler before following them, which means we want the
expected results sorted as well (and since 'archive' comes before 'src',
so file.html should come before file.css).
2017-02-01 18:41:47 +02:00
Graham Seaman
2e32780dc7 Force header names to lower to allow for CaseInsensitvieDict variability 2017-02-01 16:28:07 +00:00
Marius Gedminas
ee3b590035 Tell tox to use develop mode
We must do this, because py.test adjusts sys.path to make
'tests.test_foo' importable [*].  When py.test does this, the
'linkcheck' directory at the top of the git tree is the one that gets
imported in the tests.  If we've told pip to use develop mode, all's
fine.  If we haven't, then we're going to get errors because extension
modules like _network.so get installed into
.tox/*/lib/*/site-packages/linkcheck/network and not into
./linkcheck/network/

[*] http://doc.pytest.org/en/latest/goodpractices.html#choosing-a-test-layout-import-rules
2017-02-01 18:24:21 +02:00
anarcat
c5ad32c7aa Merge pull request #14 from mgedmin/add-install-requires
Add install_requires for setuptools
2017-02-01 11:06:31 -05:00
Marius Gedminas
81673ff1fc Add install_requires for setuptools
This makes 'pip install' install everything that's needed for
LinkChecker to actually work.

Fixes #12.
2017-02-01 17:55:21 +02:00
Marius Gedminas
97bb57205b Run tox from Travis 2017-02-01 16:18:38 +02:00
Marius Gedminas
d59a153a13 Add a tox.ini 2017-02-01 16:14:47 +02:00
anarcat
821b661923 Merge pull request #11 from mgedmin/fix-travis
Fix TypeError: hasattr(): attribute name must be string
2017-02-01 09:14:10 -05:00
Marius Gedminas
3c99b6aa30 Fix TypeError: hasattr(): attribute name must be string
The one test failure in Travis happens in
TestConsole.test_internal_error, but only if you have the argcomplete
package installed.

This was a real bug in error reporting code.
2017-02-01 16:02:35 +02:00
Antoine Beaupré
2ce06d6e55 Merge remote-tracking branch 'origin/master' 2017-02-01 08:54:16 -05:00
Antoine Beaupré
d51b7f34b6 Merge branch '9.3.x' 2017-01-31 19:21:22 -05:00
Antoine Beaupré
559aa477e8 Merge tag 'v9.3.1' 2017-01-31 19:18:10 -05:00
anarcat
9b20aa5c8d Merge pull request #6 from astrofrog/fix-badge
Fix Travis badge in README
2017-01-31 12:26:30 -05:00
Antoine Beaupré
caf5ea6984 draft a release process 2017-01-31 12:25:57 -05:00
Thomas Robitaille
5e43aed52a Fix Travis badge in README 2017-01-31 17:18:47 +00:00
Antoine Beaupré
26e5270abb bump version 2017-01-31 12:00:13 -05:00
Robin Winslow
e62e630e60 Fix setup.py 2017-01-31 11:58:33 -05:00
Antoine Beaupré
da8cecd83c Merge remote-tracking branch 'anarcat/norobots' 2017-01-31 11:34:09 -05:00
Antoine Beaupré
bf45fb1884 fix HTTPS URL checks
in Debian Jessie, linkchecker fails because of an API problem.

it completely breaks HTTPs checks.

this patch fixes the problem

from https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=772947
2017-01-31 11:25:45 -05:00
Bastian Kleineidam
1e291afdfa Fix python requests version check 2017-01-31 11:25:38 -05:00
Antoine Beaupré
46d96d0aa0 fix HTTPS URL checks
in Debian Jessie, linkchecker fails because of an API problem.

it completely breaks HTTPs checks.

this patch fixes the problem

from https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=772947
2016-09-30 11:20:38 -04:00
Bastian Kleineidam
c2ce810c3f Fix python requests version check 2016-06-28 21:55:10 +02:00
Antoine Beaupré
9d899d1dfa add --no-robots commandline flag
While this flag can be abused, it seems to me like a legitimate use
case that you want to check a fairly small document for mistakes,
which includes references to a website which has a robots.txt that
denies all robots. It turns out that most websites do *not* add a
permission for LinkCheck to use their site, and some sites, like the
Debian BTS for example, are very hostile with bots in general.

Between me using linkcheck and me using my web browser to check those
links one by one, there is not a big difference. In fact, using
linkcheck may be *better* for the website because it will use HEAD
requests instead of a GET, and will not fetch all page elements
(javascript, images, etc) which can often be fairly big.

Besides, hostile users will patch the software themselves: it took me
only a few minutes to disable the check, and a few more to make that
into a proper patch.

By forcing robots.txt without any other option, we are hurting our
good users and not keeping hostile users from doing harm.

The patch is still incomplete, but works. It lacks: documentation and
unit tests.

Closes: #508
2016-05-19 14:43:59 -04:00
Bastian Kleineidam
0ef00eea56 Move GUI files to separate project 2016-01-23 13:28:15 +01:00
Bastian Kleineidam
e410169fd0 Remove unused test module 2016-01-20 20:18:09 +01:00
Bastian Kleineidam
354d6adb20 Remove deprecated pip option --use-mirror 2016-01-20 20:16:04 +01:00
Bastian Kleineidam
6fbd1e1c99 Try using container based travis ci boxes by replacing sudo 2016-01-20 20:01:19 +01:00
Bastian Kleineidam
88c060699d Fix tests 2016-01-19 22:05:15 +01:00
wummel
4de5348b9d Merge pull request #575 from sitsofe/master
Expand RegexCheck docs and fix up some typos.
2016-01-19 21:59:42 +01:00
Bastian Kleineidam
549533d701 Improved debugging 2016-01-19 21:55:50 +01:00
Bastian Kleineidam
914995b5fc Use example.com for tests. 2016-01-19 12:17:08 +01:00
wummel
a40c39be59 Merge pull request #560 from xvadim/feature
Added plugin for parsing and checking links in Markdown files
2016-01-19 07:30:34 +01:00
wummel
ab110f1934 Merge pull request #611 from florianjacob/10_threads_docs_correction
Make documentation reflect 39df181
2016-01-18 19:52:56 +01:00
wummel
e2556abbb6 Merge pull request #561 from nbigaouette/issue555
Detect if "url_data" contains proxy attributes before using them.
2016-01-17 21:59:35 +01:00
wummel
1baac19ca3 Merge pull request #621 from PeterDaveHello/patch-1
Update README.rst
2016-01-17 14:50:13 +01:00
Bastian Kleineidam
e3ab90246d Remove platform-specific installer stuff and ensure a build .whl wheel file can be built. 2016-01-17 09:05:21 +01:00
Bastian Kleineidam
3d711666e1 Fix parser for changes in bison 3.0.x 2015-11-26 12:33:44 +01:00
Peter Dave Hello
5a4482fb7e Update README.rst
Use svg instead of png to get better image quality
2015-11-09 01:19:57 +08:00
Florian Jacob
fa262a0894 Make documentation reflect 39df181
10 instead of 100 default threads
2015-09-15 11:15:04 +02:00
Sitsofe Wheeler
40be399f79 Fix up some typos and grammar in the linkcheckerrc man page. 2015-02-11 10:42:03 +00:00
Sitsofe Wheeler
ed683bc4c0 Expand RegexCheck section.
- Add the warningregex parameter to the RegexCheck section.
- Add a note that the REGEX shouldn't be quoted.
- Change the quote style to double quotes to match the rest of the document.
2015-02-11 10:41:57 +00:00