spacer.png, 0 kB



Tutorial – Guide to Effective Searching of the Internet

 

[ Previous | Home | Index | Next ]

Part 5: Advanced Operators

There are four additional Boolean operators that provide more fine-grained control than the basic AND and OR. These operators are less frequently used and are not all supported by search services with basic Boolean capabilities.

Topic 16: NEAR Operator
Topic 17: BEFORE and AFTER Operators
Topic 18: AND NOT Operator

You may click on any of the topics above to go directly to its discussion.

Topic 16: NEAR Operator

Remember for the AND operator that the terms or phrases on both sides of the operator can appear anywhere in the document in order to get a successful result. One example above described how a successful result for "peregrine falcon*" AND "endangered species" could be obtained, even though the falcon reference was to 200 mph diving speeds and the endangered species discussion was many pages later dealing with the dodo bird. The NEAR operator is designed specifically to avoid such false results.

The NEAR operator requires the two phrases or terms to be within a specified word count of one another to be counted as a successful result. Generally, most search engines that support the NEAR operator have a set value of a ten word maximum distance. A few [see Topic 38] allow you to specify a word distance of your choice if you supply an additional argument. Some engines also use ADJ (for adjacent) as the equivalent operator to NEAR.

Search Tip:
Use NEAR as an alternative to phrases and an improvement to AND, but only when you know the concepts are closely linked.
The NEAR operator does not care which of the phrases or terms on either side of the argument comes first or not, just that the two phrases or terms are within the specified distance.

The NEAR operator is a great way to ensure that your search terms occur within the same sentence or same paragraph. It is a very useful way to remove large, comprehensive Web sites that have a reference to everything under the sun, but not specific information of use to your search.

The NEAR operator can have drawbacks, however. It is possible to overlook the definitive document on endangered peregrine falcons, for example, if in one section of the document it uses peregrine falcon but elsewhere when its endangered status is discussed it only uses the word peregrine. It is very difficult in all cases to foretell how document authors will use, repeat or link such terms.

Another drawback is the relatively few search services that support this operator. This problem can be overcome when using third-party search tools, such as BrightPlanet's Deep Query Manager that work on the results of search engines, but support this operator themselves.

But, if your terms can pass the test of confidently appearing within a sentence or two of one another, we recommend you consider the use of the NEAR operator.

Topic 17: BEFORE and AFTER Operators

The BEFORE and AFTER operators work in the exact same manner as the NEAR operator, only you can now specify which terms or phrases need to come first or second. In the case of the BEFORE operator, the first term or phrase MUST occur before the second term or phrase within the specified word distance. In the case of the AFTER operator, the first term or phrase MUST occur after the second term or phrase within the specified word distance.

These operators do provide even greater control to your searches. But their drawbacks are even more severe than the NEAR operator. First, not only must your terms appear within the word distance, but you also must get the precedence right. Second, to our knowledge, only two major search engines support these operators [see Topic 38].

For these reasons we've included these operators here for the sake of completeness, but we do not recommend that you seriously consider using them . If you become an Internet 'power searcher' and you decide you disagree with this recommendation, then your skills have surpassed the purpose of this tutorial anyway.

Topic 18: AND NOT Operator

AND NOT removes any documents that contain that term or phrase. AND NOT is a unary operator; that is, it only works on the term or phrase that immediately follows the operator. It does not evaluate terms or phrases on both sides of the operator.

Most of the major search services support the AND NOT operator. It is sometimes called BUT NOT or NOT and sometimes denoted by placing a minus sign (-) before the term or phrase to be removed.

[NOTE: Technically NOT is the unary operator. For example, NOT falcon would exclude all documents that use the word falcon. The problem arises in the middle of a query. While some search engines allow NOT by itself, such as falcon NOT car which would return documents using the word falcon but not car, the statement is technically ambiguous as to how to treat falcon. As a result, most engines require matching NOT in the middle of a query with AND or OR (OR NOT is rarely used). This removes the ambiguity and is the form we've adopted herein for use within the middle of a query.]

Again using AltaVista document counts, here are the results for this operator:

As discussed for other operators, search services indexing or retrieval accuracy or is not 100%.

AND NOT is a very powerful command that should be used with care. AND NOT works to narrow a search, subtracting all citations that contain the specified term or phrase.

AND NOT is completely non-discriminatory; it only takes one instance of a word or phrase to eliminate a document from your results set. As one source describes it, think of AND NOT logic sort of like peeling a potato [30]. A peeled potato is potato AND NOT peel. There's only one trouble: some of the good part of the potato goes with the peel. So, use the AND NOT operator with as much care as you would a paring knife, and only when you're absolutely sure you want to exclude a term or phrase from your results.

Search Tip:
AND NOT is a powerful operator, use with care! A single instance will cause a document to be excluded.
Generally, we do not recommend using AND NOT in the beginning iterations of a search. See what results are obtained in the early steps before applying this operator, if at all. Then, apply it incrementally to make sure you're not stripping away too much of the fruit.

A good example of where this might apply is with the falcon* search noted for the AND operator. The term falcon* returns references to cars, products, companies and place names, in addition to birds. Successively applying AND NOT to car*, product* and compan* is another approximation to the search bird* AND falcon*. On the other hand, using AND NOT with place* could be going too far by eliminating references to falcon bird sightings that occur in various places.

Though in this example we have a good AND qualifier in bird* for our interest in peregrine falcons, a suitably encompassing word such as bird* may not apply to other search topics. In these cases, AND NOT, judiciously applied, can be an alternate way of getting to the same end.

[ Previous | Home | Index | Next ]

 
spacer.png, 0 kB
spacer.png, 0 kB
spacer.png, 0 kB

Sitemap Privacy About Us Contact Us Site Use

spacer.png, 0 kB