Path: usenet.cise.ufl.edu!newsfeeds.nerdc.ufl.edu!newsfeeds.nerdc.ufl.edu!news.magicnet.net!news.maxwell.syr.edu!nntp.abs.net!outfeed1.news.cais.net!news.neta.com!not-for-mail
From: John Heidemann <[email protected]>
Newsgroups: comp.lang.perl.announce,comp.lang.perl.modules
Subject: WWW::Search 1.021 released
Followup-To: comp.lang.perl.modules
Date: 28 Aug 1998 14:51:29 GMT
Organization: Internet Access Inc. NetA.Com
Lines: 136
Approved: [email protected] (comp.lang.perl.announce)
Message-ID: <[email protected]>
NNTP-Posting-Host: gadget.cscaper.com
X-Trace: news.neta.com 904315889 2197 206.67.186.3 (28 Aug 1998 14:51:29 GMT)
X-Complaints-To: [email protected]
NNTP-Posting-Date: 28 Aug 1998 14:51:29 GMT
X-Disclaimer: The "Approved" header verifies header information for article transmission and does not imply approval of content.
Xref: usenet.cise.ufl.edu comp.lang.perl.announce:120 comp.lang.perl.modules:3662
WWW::Search and AutoSearch
==========================
WHAT IS NEW WITH WWW::Search 1.021? (27-Aug-98)
------------------------------------------------
overview: a general release
- new: Windows installation is now supported by
Jim Smyser <[email protected]>; please see his web
page <http://pubinfo.phx.primenet.com/www.search/>
for details.
- new: MacPerl should now be supported. Thanks to Chris Nandor
for reporting the problem and supplying a fix.
- bug fix: Infoseek, WebCrawler, Dejanews, HotBot by Martin Thurn
- bug fix: AltaVista approx_count bug found by
Darren Stalder <[email protected]>
- bug fix: documentation cleanups from Neil Bowers
Since the recent 1.020 was not widely announced, here's what was new then:
1.020, 12-Aug-98
overview: lots of bug fixes and new back-ends
- bug fix: maximum_to_retrieve now works for very small values.
(Problem identified by Vidyut Luther <[email protected]>.)
- new back-ends: ExciteForWebServers, FolioViews, Livelink, MSIndexServer,
Null, Search97
all from Paul Lindner (thanks!)
- bug fix: Gopher, PLweb, SFgate, Simple, Verity from Paul Lindner
- bug fix: Lycos from John Heidemann
- new test suites: PLweb, FolioViews, Null, MSIndexServer, Search97,
SFgate, ExciteForWebServers from Paul Lindner
- bug fix: HotBot repair from Martin Thurn
Note: WWW::Search may have problems with older versions of libwww
(5.08). If "make test" dies with an error in RobotUA, upgrade libwww.
(Tested with libwww-5.30.)
WHAT IS WWW::Search?
--------------------
WWW::Search is a collection of Perl modules which provide an API to
WWW search engines. Currently WWW::Search includes back-ends for
variations of AltaVista, Dejanews, Excite, HotBot, Infoseek, Lycos,
Magellan, PLweb, SFgate, Verity, WebCrawler, and Yahoo. We include
two applications built from this library: AutoSearch (a program to
automate tracking of search results over time), and WebSearch, a small
demonstration program to drive the library. Back-ends for other
search engines and more sophisticated clients are currently under
development.
Because WWW::Search depends on parsing the HTML output of web search
engines, it will fail if the search engine operators change their
format (an unfortunately frequent occurrence). WWW::Search includes
test suites for most back-ends, which verify that each back-end is
functioning correctly. As of the day of the release the current
back-end status is:
AltaVista            working
Dejanews             working
Excite               working
ExciteForWebServers  working
FolioViews           working
Gopher               not working? (not in test suite)
HotBot               working
Infoseek             working
Livelink             not working? (not in test suite)
Lycos                working
Magellan             not working (maybe gone?)
MSIndexServer        working
Null                 working
PLweb                working
Search97             working
SFgate               working
Simple               not working? (not in test suite)
Verity               not working (not in test suite)
WebCrawler           working?
Yahoo                working
Yahoo's test suites are sometimes flakey.
(others are currently under development, see contributors below for details)
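
To illustrate why a format change breaks a back-end, here is a minimal
sketch in plain Perl. It is not code from WWW::Search: the parse_hits
routine, the regular expression, and both sample pages are invented for
this example, and real back-ends are considerably more involved. The
check at the end mimics what a test suite does: a known page must still
yield hits.

```perl
use strict;
use warnings;

# Hypothetical scraper: pull result URLs out of one engine's HTML.
# The pattern assumes each hit looks like <dt><a href="URL">title</a>.
sub parse_hits {
    my ($html) = @_;
    my @urls;
    while ($html =~ m{<dt><a href="([^"]+)">}gi) {
        push @urls, $1;
    }
    return @urls;
}

# Made-up sample pages: the "old" layout and a redesigned one.
my $old_page = <<'HTML';
<dl>
<dt><a href="http://www.perl.com/">Perl</a>
<dt><a href="http://www.isi.edu/">ISI</a>
</dl>
HTML

my $new_page = <<'HTML';
<ul>
<li><b><a href="http://www.perl.com/">Perl</a></b>
<li><b><a href="http://www.isi.edu/">ISI</a></b>
</ul>
HTML

# Test-suite style check: the parser must still find hits on each page.
for my $case (['old layout', $old_page], ['new layout', $new_page]) {
    my ($name, $html) = @$case;
    my @hits = parse_hits($html);
    printf "%-10s %s (%d hits)\n",
        $name, (@hits ? 'working' : 'not working'), scalar @hits;
}
```

The same links are present in both pages, but the parser finds none in
the redesigned one, which is the kind of silent failure the test suites
are meant to catch.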
WHAT IS AutoSearch?
-------------------
WWW::Search's primary client is AutoSearch. AutoSearch performs a
web-based search and puts the result set in a web page. It
periodically updates this web page, indicating how the search results
change over time. Sample AutoSearch output can be found at
<http://www.isi.edu/lsam/tools/autosearch/>. Output format is configurable.
See the AutoSearch man page for details, or the Demonstration section
below for quick-start instructions.
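
The idea of tracking how a search changes over time can be sketched as a
comparison of two result lists. This is only an illustration and not
AutoSearch's actual implementation: the compare_runs routine and the
example URLs are made up, and how AutoSearch stores or formats its
results is not shown here.

```perl
use strict;
use warnings;

# Compare two runs of the same query (lists of result URLs) and report
# which URLs are new and which have disappeared since the previous run.
sub compare_runs {
    my ($previous, $current) = @_;
    my %was = map { $_ => 1 } @$previous;
    my %is  = map { $_ => 1 } @$current;
    my @added   = grep { !$was{$_} } @$current;
    my @removed = grep { !$is{$_} }  @$previous;
    return (\@added, \@removed);
}

# Made-up data standing in for last week's and this week's results.
my @last_week = qw(http://a.example/ http://b.example/ http://c.example/);
my @this_week = qw(http://b.example/ http://c.example/ http://d.example/);

my ($added, $removed) = compare_runs(\@last_week, \@this_week);
print "new:  $_\n" for @$added;
print "gone: $_\n" for @$removed;
```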
REQUIREMENTS
------------
WWW::Search requires Perl5 and libwww-perl.
For information on Perl5, see <http://www.perl.com>.
For libwww-perl, see <http://www.sn.no/libwww-perl/>.
Both are also available from the Comprehensive Perl Archive
Network (CPAN). Visit <http://www.perl.com/CPAN/> to find a CPAN
site near you.
At this time WWW::Search is tested under Perl version 5.004_04.
AVAILABILITY
------------
The latest version of WWW::Search should always be available from
<http://www.isi.edu/lsam/tools/WWW_SEARCH/>. Alpha releases are only
available here (not at CPAN).
WWW::Search is also available as part of CPAN. Visit
<http://www.perl.com/CPAN/> to find a CPAN site near you.
Feedback about WWW::Search is encouraged. If you're using it for a
neat application, please let us know. If you'd like to implement (or
have already implemented) a new back-end for WWW::Search, let us know
so we don't duplicate work.