git-svn-id: https://linkchecker.svn.sourceforge.net/svnroot/linkchecker/trunk/linkchecker@129 e7d03fd6-7b0d-0410-9947-9c21f3af8025
This commit is contained in:
calvin 2000-07-10 12:29:08 +00:00
parent 5376ef7f47
commit e02e32d5f6
7 changed files with 76 additions and 7 deletions

View file

@ -38,3 +38,9 @@ Running LinkChecker
Type "linkchecker" followed by your URLs you want to check.
Type "linkchecker -h" for help.
(Fast)CGI web interface
-----------------------
The *cgi files are three CGI script which you can use to run LinkChecker
from the web.
You can use and adjust the example HTML files to run the script.

View file

@ -60,10 +60,11 @@ version:
upload: files packages
scp debian/changelog shell1.sourceforge.net:/home/groups/linkchecker/htdocs/changes.txt
scp linkchecker-out.* shell1.sourceforge.net:/home/groups/linkchecker/htdocs
scp VERSION shell1.sourceforge.net:/home/groups/linkchecker/htdocs/raw/
scp $(DEBPACKAGE) ../Packages.gz shell1.sourceforge.net:/home/groups/linkchecker/htdocs/debian
ncftpput download.sourceforge.net /incoming $(ALLPACKAGES)
ssh -C shell1.sourceforge.net cd /home/groups/linkchecker/htdocs/raw && make
ssh -C shell1.sourceforge.net 'cd /home/groups/linkchecker/htdocs/raw && make'
test:
rm -f test/*.result

10
README
View file

@ -17,6 +17,7 @@ o HTTP proxy support
o give username/password for HTTP and FTP authorization
o robots.txt exclusion protocol support
o internationalization support
o (Fast)CGI web interface
Installing, Requirements, Running
@ -56,7 +57,7 @@ fintl.py from http://sourceforge.net/snippet/detail.php?type=snippet&id=100059
Note that the following packages are modified by me:
httplib.py (renamed to http11lib.py and a bug fixed)
fcgi.py (implemented immediate output)
fcgi.py (implemented streamed output)
sz_fcgi.py (simplified the code)
DNS/Lib.py:566 fixed rdlength name error
DNS/Base.py: fixed /etc/resolv.conf parser to cope with empty lines
@ -80,8 +81,8 @@ commandline options and stores them in a Config object.
(2) Which leads us directly to the Config class. This class stores all
options and works a little magic: it tries to find out if your platform
supports threads. If so, they are enabled. If not, they are disabled.
Several functions are replaced with their non-threaded equivalents if
threading is disabled.
Several functions are replaced with their threaded equivalents if
threading is enabled.
Another thing are config files. A Config object reads config file options
on initialization so they get handled before any commandline options.
@ -93,7 +94,7 @@ UrlData.check() function.
Derived from the base class UrlData are the different URL types:
HttpUrlData for http:// links, MailtoUrlData for mailto: links and so on.
So UrlData defines the functions which are common for *all* URLs, and
UrlData defines the functions which are common for *all* URLs, and
the subclasses define functions needed for their URL type.
(4) Lets look at the output. Every output is defined in a Logger class.
@ -103,6 +104,7 @@ newUrl() (through UrlData.logMe()) and after all checking we call
endOfOutput(). Easy.
New loggers are created with the Config.newLogger(name, fileoutput) function.
Nifty features you did not expect
---------------------------------
o Included brain enhancer. Just read Python code to gain intelligence.

4
debian/control vendored
View file

@ -8,8 +8,7 @@ Package: linkchecker
Architecture: any
Depends: python-base (>= 1.5.2)
Suggests: libssl09, libssl09-dev
Description: a link checker written in Python
With LinkChecker you can check your HTML documents for broken links.
Description: check/validate/test your HTML documents for broken links/URLs
Features:
o recursive checking
o multithreaded
@ -23,3 +22,4 @@ Description: a link checker written in Python
o give username/password for HTTP and FTP authorization
o robots.txt exclusion protocol support
o internationalization support
o (Fast)CGI web interface

12
lconline/index.html Normal file
View file

@ -0,0 +1,12 @@
<!doctype html public "-//w3c//dtd html 4.0//en">
<html><head>
<title>LinkChecker Online</title>
</head>
<frameset rows="40%,60%" border="1" frameborder="0" framespacing="0">
<frame name="formular" src="lc_cgi.html" noresize frameborder="0">
<frame name="links" src="leer.html" noresize frameborder="0">
<noframes>
Please use a frame capable browser.
</noframes>
</frameset>
</html>

40
lconline/lc_cgi.html Normal file
View file

@ -0,0 +1,40 @@
<!doctype html public "-//w3c//dtd html 4.0//en">
<html><head>
<title>LinkChecker Online</title>
</head>
<body text="#192c83" bgcolor="#fff7e5" link="#191c83" vlink="#191c83"
alink="#191c83" >
<font face="Lucida,Verdana,Arial,sans-serif">
<center><h2>LinkChecker Online</h2>
(powered by <a href="http://linkchecker.sourceforge.net/">LinkChecker</a>)
</center>
<blockquote>
<form method="POST" action="http://YOURCGIHOSTHERE/cgi-bin/lc.cgi"
target="links">
<table border=0>
<tr><td colspan=2 bgcolor="#fff7e5">
<font face="Lucida,Verdana,Arial,sans-serif">URL:
<input size=70 name="url" value="http://">
<input type="submit" value="Go!"></font></td></tr>
<td><font face="Lucida,Verdana,Arial,sans-serif">Recursion Level: <select name="level">
<option> 0
<option selected> 1
<option> 2
<option> 3
</select>
</font></td>
<td><font face="Lucida,Verdana,Arial,sans-serif">
Check anchors in HTML: <input type="checkbox" name="anchors" checked>
</font></td></tr>
<tr><td>
<font face="Lucida,Verdana,Arial,sans-serif">
Log only errors: <input type="checkbox" name="errors">
</font></td>
<td><font face="Lucida,Verdana,Arial,sans-serif">
Check only intern links: <input type="checkbox" name="intern" checked>
</font></td></tr></table>
</blockquote>
</font>
</body></html>

8
lconline/leer.html Normal file
View file

@ -0,0 +1,8 @@
<!doctype html public "-//w3c//dtd html 4.0//en">
<html><head>
<title>Empty</title>
</head>
<body text="#192c83" bgcolor="#fff7e5" link="#191c83" vlink="#191c83"
alink="#191c83" >
No links checked, dude!
</body></html>