From: John Conover <john@johncon.johncon.com>
Subject: wgetrel - intellgent www information robot
Date: Mon, 23 Feb 1998 10:39:43 GMT

-----BEGIN PGP SIGNED MESSAGE-----


The shell script wgetrel intelligently transverses the Internet
searching for web pages by determining the relevance of HTML documents
to a search criteria. The criteria is specified by boolean
operators-the supported operators are logical or, logical and, and
logical not.  These operators are represented by the symbols, "|",
"&", and, "!", respectively, and left and right parenthesis, "(" and
")", are used as grouping operators. The relevance of the documents is
determined by the program htmlrel, which is a modification to the
program rel(1). Source modifications are provided in the wgetrel
distribution. Also required is Hrvoje Niksic's excellent program,
wget(1). The shell script, wgetrel, can reduce web searching by about
an order of magnitude.

        John

Title:          wgetrel
Version:        1.0
Entered-date:   February, 1998
Description:    Source modifications to the rel(1) program sources to
                make a variant of the program, htmlrel(1), that
                determines the relevance of HTML text documents to a
                set of keywords expressed in boolean infix
                notation-the output file syntax is Netscape level 1
                bookmark compatible. The shell script, wgetrel(1),
                executes the programs htmlrel(1) and Hrvoje Niksic's
                excellent wget(1) to form an intelligent Internet
                search engine.  (The program sources to wget(1) are
                available via anonymous ftp from
                ftp://prep.ai.mit.edu/pub/gnu/wget.tar.gz. The program
                sources to rel(1) are available via anonymous ftp from
                ftp://sunsite.unc.edu/pub/Linux/utils/text/rel.tar.gz.)
                Installation requires virgin sources of the rel(1)
                program-there is a shar file in the wgetrel
                distribution that installs the modifications in the
                rel program source directory. The program wgetrel(1)
                controls search direction across the Internet through
                determination of the relevance of the documents to a
                search criteria.
Keywords:       www robot infobot bot information retrieval Internet search
Author:         john@johncon.com (John Conover)
Maintained-by:  john@johncon.com (John Conover)
Primary-site:   sunsite.unc.edu /pub/Linux/utils/text/wgetrel.tar.gz
Alternate-site:
Original-site:  johncon.com
Platform:       Linux, USG, BSD
Copying-policy: No limitations for non-commercial use

- -- 

John Conover, 631 Lamont Ct., Campbell, CA., 95008, USA.
VOX 408.370.2688, FAX 408.379.9602
conover@netcom.com



- -- 
This article has been digitally signed by the moderator, using PGP.
http://www.iki.fi/mjr/cola-public-key.asc has PGP key for validating signature.
Send submissions for comp.os.linux.announce to: linux-announce@news.ornl.gov
PLEASE remember a short description of the software and the LOCATION.
This group is archived at http://www.iki.fi/liw/linux/cola.html

-----BEGIN PGP SIGNATURE-----
Version: 2.6.3ia
Charset: latin1

iQCVAgUBNPFR8lrUI/eHXJZ5AQEUIQP/fa6TJ8g1fJWQQ9h/RHVDnqrd+Gxl9GS0
15SifEBDnOQtFZSG5IjLE1t2ABYYzxCOksY1Y1t/alpYi2hd0IFzLvJmx93H+cBG
vIk7qgKnoDYxEGOBBbbS2DQi6IO8ZnfL7x5d5TJ3cF93tU0TY3eXDc3sXAjitDmz
/yFa2a+jrkw=
=L8Xw
-----END PGP SIGNATURE-----