Search Engine Glossary
Boolean search: A search allowing the inclusion
or exclusion of documents containing certain words through the use
of operators such as AND, NOT and OR.
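As a rough illustration, the three operators map onto set operations over an inverted index. The sketch below is a toy under stated assumptions: the documents, the `inverted` dictionary and the `containing` helper are hypothetical and not taken from any particular search engine.

```python
# Toy collection: document id -> text (hypothetical data).
docs = {
    1: "search engines index web pages",
    2: "spiders follow links between pages",
    3: "an index lists web documents",
}

# Inverted index: word -> set of ids of the documents containing it.
inverted = {}
for doc_id, text in docs.items():
    for word in text.split():
        inverted.setdefault(word, set()).add(doc_id)

def containing(word):
    """Return the ids of documents that contain the word."""
    return inverted.get(word, set())

both = containing("index") & containing("web")        # index AND web
either = containing("index") | containing("links")    # index OR links
without = containing("pages") - containing("links")   # pages NOT links
```

Here AND is a set intersection, OR a union, and NOT a set difference.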
Concept search: A search for documents related
conceptually to a word, rather than specifically containing the
word itself.
Full-text index: An index containing every word of every document
cataloged, including stop words (defined below).
Fuzzy search: A search that will find matches
even when words are only partially spelled or misspelled.
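One possible way to sketch this uses prefix matching for partially spelled words and a similarity ratio for misspelled ones. The vocabulary, the `fuzzy_lookup` name and the 0.6 cutoff are assumptions made for illustration; real engines may use quite different techniques.

```python
import difflib

# Hypothetical vocabulary of indexed words.
VOCABULARY = ["search", "spider", "index", "stemming"]

def fuzzy_lookup(term, cutoff=0.6):
    """Return indexed words that start with, or closely resemble, the term."""
    term = term.lower()
    # Partially spelled words: the term is a prefix of an indexed word.
    hits = [word for word in VOCABULARY if word.startswith(term)]
    # Misspelled words: similarity ratio at or above the cutoff.
    for word in difflib.get_close_matches(term, VOCABULARY, n=3, cutoff=cutoff):
        if word not in hits:
            hits.append(word)
    return hits
```

With this vocabulary, `fuzzy_lookup("serch")` finds "search" despite the missing letter, and `fuzzy_lookup("stem")` finds "stemming" from a partial spelling.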
Index: The searchable catalog of documents
created by search engine software. Also called
"catalog." Index is often used as a synonym for search
engine. Index is commonly pluralized as "indices."
However, Search Engine Watch instead uses the alternative plural
form "indexes."
Keyword search: A search for documents containing
one or more words that are specified by a user.
Phrase search: A search for documents containing
an exact sentence or phrase specified by a user.
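A minimal sketch of phrase matching, assuming text is split on whitespace and compared case-insensitively. The `contains_phrase` helper is hypothetical and ignores punctuation.

```python
def contains_phrase(text, phrase):
    """Return True if the words of the phrase appear consecutively in the text."""
    words = text.lower().split()
    target = phrase.lower().split()
    if not target:
        return False
    n = len(target)
    # Slide a window of n words across the text and compare whole words.
    return any(words[i:i + n] == target for i in range(len(words) - n + 1))
```

Because whole words are compared, "search engine" matches "the search engine returns matches" but neither "engine search" nor "search engines list pages".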
Precision: The degree to which the documents a search engine
lists actually match a query. The larger the share of listed
documents that match, the higher the precision. For example, if a
search engine lists 80 documents for a query but only 20 of
them contain the search words, then the precision would be 25%.
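The arithmetic in this example can be sketched as follows. The document ids are hypothetical placeholders sized to match the figures above, and `precision` is an illustrative helper name.

```python
# Hypothetical document ids, sized to match the example above.
listed = set(range(1, 81))      # the 80 documents the engine listed
matching = set(range(1, 21))    # the 20 that contain the search words

def precision(listed, matching):
    """Share of listed documents that actually match the query."""
    if not listed:
        return 0.0
    return len(listed & matching) / len(listed)

result = precision(listed, matching)   # 20 / 80 = 0.25, i.e. 25%
```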
Proximity search: A search that allows users to specify
that documents returned should have the words near each other.
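One way to picture this is a check on the distance between word positions. The sketch below assumes whitespace tokenization, and the `within_distance` helper and its `max_gap` parameter are hypothetical names.

```python
def within_distance(text, first, second, max_gap):
    """Return True if both words occur within max_gap positions of each other."""
    words = text.lower().split()
    first_positions = [i for i, w in enumerate(words) if w == first]
    second_positions = [i for i, w in enumerate(words) if w == second]
    return any(
        abs(i - j) <= max_gap
        for i in first_positions
        for j in second_positions
    )

text = "search engine software builds an index"
near = within_distance(text, "search", "engine", 1)   # adjacent words
far = within_distance(text, "search", "index", 3)     # five positions apart
```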
Query-By-Example: A search where a user instructs an engine to
find more documents that are similar to a particular document.
Also called "find similar."
Recall: Related to precision, this is the degree
to which a search engine returns all the matching documents in a
collection. There may be 100 matching documents, but a search
engine may only find 80 of them. It would then list these 80 and
have a recall of 80%.
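The same figures can be worked through in a short sketch. The document ids are hypothetical placeholders, and `recall` is an illustrative helper name.

```python
# Hypothetical document ids, sized to match the example above.
matching = set(range(1, 101))   # the 100 matching documents in the collection
listed = set(range(1, 81))      # the 80 of them the engine found and listed

def recall(listed, matching):
    """Share of all matching documents that the engine returned."""
    if not matching:
        return 0.0
    return len(listed & matching) / len(matching)

result = recall(listed, matching)   # 80 / 100 = 0.8, i.e. 80%
```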
Relevancy: How well a document provides the
information a user is looking for, as measured by the user.
Search Engine: The software that searches an
index and returns matches. Search engine is often used
synonymously with spider and index, although these are separate
components that work with the engine.
Spider: The software that scans documents and
adds them to an index by following links. Spider is often used as
a synonym for search engine.
Stemming: The ability of a search to include the
"stem" of words. For example, stemming allows a user to
enter "swimming" and get back results also for the stem
word "swim."
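A deliberately naive sketch of the idea follows. The suffix list and the `naive_stem` helper are assumptions for illustration only; production stemmers use far more careful rules, and this toy will mis-stem many English words.

```python
# Hypothetical, very small suffix list, checked in order.
SUFFIXES = ("ing", "ed", "s")

def naive_stem(word):
    """Strip one common suffix, then undouble a trailing consonant."""
    word = word.lower()
    for suffix in SUFFIXES:
        # Only strip when at least three letters would remain.
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            word = word[: -len(suffix)]
            # "swimm" -> "swim": drop a doubled final consonant.
            if len(word) >= 2 and word[-1] == word[-2] and word[-1] not in "aeiou":
                word = word[:-1]
            break
    return word
```

Both "swimming" and "swims" reduce to "swim" here, so a query for either form could be matched against documents containing the other.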
Stop words: Conjunctions, prepositions, articles and
other words, such as AND, TO and A, that appear often
in documents yet alone may contain little meaning.
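As a small sketch, an index that omits stop words might filter them out before cataloging. The stop list and the `index_terms` helper below are hypothetical; actual lists vary from engine to engine.

```python
# Hypothetical stop list; real lists are usually much longer.
STOP_WORDS = {"a", "an", "and", "the", "to", "of", "in"}

def index_terms(text):
    """Return the words of the text that are not stop words."""
    return [word for word in text.lower().split() if word not in STOP_WORDS]

terms = index_terms("A guide to search engines and spiders")
```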
Thesaurus: A list of synonyms a search engine can
use to find matches for particular words if the words themselves
don't appear in documents.
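A minimal sketch of thesaurus-based matching, assuming a hand-built synonym table. The `THESAURUS` entries and the `expand` and `matches` helpers are hypothetical.

```python
# Hypothetical synonym table: word -> set of synonyms.
THESAURUS = {
    "car": {"automobile", "auto"},
    "film": {"movie"},
}

def expand(word):
    """Return the word together with any synonyms listed for it."""
    return {word} | THESAURUS.get(word, set())

def matches(text, word):
    """Return True if the text contains the word or one of its synonyms."""
    return bool(expand(word) & set(text.lower().split()))
```

A search for "car" would then match a document reading "used automobile listings" even though the word "car" never appears in it.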


