man WWW::Search::AltaVista::AdvancedWeb () - class for advanced Alta Vista web searching
NAME
WWW::Search::AltaVista::AdvancedWeb - class for advanced Alta Vista web searching
SYNOPSIS
    use WWW::Search;
    my $search = new WWW::Search('AltaVista::AdvancedWeb');
    $search->native_query(WWW::Search::escape_query('(bmw AND mercedes) AND NOT (used OR Ferrari)'));
    $search->maximum_to_retrieve('100');
    while (my $result = $search->next_result()) {
        print $result->url, "\n";
    }
DESCRIPTION
A hack of the class for AltaVista's advanced web search mode, originally written by John Heidemann. See http://www.altavista.com.
This hack allows AltaVista advanced web search results to be sorted so that the most relevant results are returned first. Originally, this class skipped the 'r' option, which AltaVista uses to sort search results by relevance. Sending an advanced query using only the 'q' option returned results in seemingly random order, which made it impossible to view the best-scored results first.
This class exports no public interface; all interaction should be done through WWW::Search objects.
HELP
Use AND to join two terms that must both be present for a document to count as a match.
Use OR to join two terms if either one counts.
Use AND NOT to join two terms if the first must be present and the second must NOT.
Use NEAR to join two terms if they both must appear and be within 10 words of each other.
Try this example:
cars AND bmw AND mercedes
You don't have to capitalize the operators AND, OR, AND NOT, or NEAR, but many people do to make it clear which words are query terms and which are instructions to the search engine.
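As an illustration of the capitalization convention, the following is a minimal sketch of a helper that uppercases the operators in a query string. The name uppercase_operators is hypothetical and is not part of WWW::Search; it is also not the code this class uses internally. It makes no attempt to skip operators that appear inside quoted phrases.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of WWW::Search): uppercase the Boolean
# operators in a query and leave the search terms untouched.
sub uppercase_operators {
    my ($query) = @_;
    # List "and not" before "and" so the two-word operator is matched as a unit.
    $query =~ s/\b(and\s+not|and|or|near)\b/\U$1/gi;
    return $query;
}

print uppercase_operators('cars and bmw and mercedes'), "\n";
# prints: cars AND bmw AND mercedes
```

The word boundaries keep terms such as "android" or "ford" from being altered, since only whole words are matched.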
One other handy wrinkle: you can group terms with parentheses to tell the system the order in which to perform operations. For example:
(bmw AND mercedes) NEAR cars AND NOT (used OR Ferrari)
Use grouping wherever possible: if you enter a long query that uses only AND to join the words, you may not receive any results, because the entire query is treated like one long phrase. For best results, follow the example above.
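Grouped queries can also be assembled programmatically. The sketch below assumes a hypothetical helper named group (not part of WWW::Search) that joins terms with one operator and wraps them in parentheses; the resulting string could then be passed to native_query as in the SYNOPSIS.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of WWW::Search): join terms with a single
# operator and wrap the result in parentheses so it is evaluated as a unit.
sub group {
    my ($op, @terms) = @_;
    return '(' . join(" $op ", @terms) . ')';
}

my $wanted   = group('AND', 'bmw', 'mercedes');
my $unwanted = group('OR', 'used', 'Ferrari');
my $query    = "$wanted NEAR cars AND NOT $unwanted";

print "$query\n";
# prints: (bmw AND mercedes) NEAR cars AND NOT (used OR Ferrari)
```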
AUTHOR
WWW::Search hack by Jim Smyser, <jsmyser@bigfoot.com>.
COPYRIGHT
Copyright (c) 1996 University of Southern California. All rights reserved.
Redistribution and use in source and binary forms are permitted provided that the above copyright notice and this paragraph are duplicated in all such forms and that any documentation, advertising materials, and other materials related to such distribution and use acknowledge that the software was developed by the University of Southern California, Information Sciences Institute. The name of the University may not be used to endorse or promote products derived from this software without specific prior written permission.
THIS SOFTWARE IS PROVIDED AS IS AND WITHOUT ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, WITHOUT LIMITATION, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE.
VERSION HISTORY
2.07 - unescape URLs, and bugfix for undefined $hit
2.06 - do not use URI::URL
2.02 - Added HELP POD. Misc. Clean-up for latest changes.
2.01 - Additional query modifiers added for even better results.
2.0 - Minor change to set lowercase Boolean operators to uppercase.
1.9 - First hack version release.