Index term
In information retrieval, an index term (also known as subject term, subject heading, descriptor, or keyword) is a term that captures the essence of the topic of a document. Index terms make up a controlled vocabulary for use in bibliographic records. They are an integral part of bibliographic control, which is the function by which libraries collect, organize and disseminate documents. They are used as keywords to retrieve documents in an information system, for instance, a catalog or a search engine. A popular form of keywords on the web are tags, which are directly visible and can be assigned by non-experts. Index terms can consist of a word, phrase, or alphanumerical term. They are created by analyzing the document either manually with subject indexing or automatically with automatic indexing or more sophisticated methods of keyword extraction. Index terms can either come from a controlled vocabulary or be freely assigned.
Keywords are stored in a search index. Common words like articles (a, an, the) and conjunctions (and, or, but) are not treated as keywords because it's inefficient. Almost every English-language site on the Internet has the article "the", and so it makes no sense to search for it. The most popular search engine, Google removed stop words such as "the" and "a" from its indexes for several years, but then re-introduced them, making certain types of precise search possible again.
The term "descriptor" was by Calvin Mooers in 1948. It is in particular used about a preferred term from a thesaurus.
The Simple Knowledge Organization System language (SKOS) provides a way to express index terms with Resource Description Framework for use in the context of the Semantic Web.[1]
In web search engines
[edit]Most web search engines are designed to search for words anywhere in a document—the title, the body, and so on. This being the case, a keyword can be any term that exists within the document. However, priority is given to words that occur in the title, words that recur numerous times, and words that are explicitly assigned as keywords within the coding.[2] Index terms can be further refined using Boolean operators such as "AND, OR, NOT." "AND" is normally unnecessary as most search engines infer it. "OR" will search for results with one search term or another or both. "NOT" eliminates a word or phrase from the search, getting rid of any results that include it. Multiple words can also be enclosed in quotation marks to turn the individual index terms into a specific index phrase. These modifiers and methods all help to refine search terms, to better maximize the accuracy of search results.[3]
Author keywords
[edit]Author keywords are an integral part of literature.[1] Many journals and databases provide access to index terms made by authors of the respective articles. How qualified the provider is decides the quality of both indexer-provided index terms and author-provided index terms. The quality of these two types of index terms is of research interest, particularly in relation to information retrieval. In general, an author will have difficulty providing indexing terms that characterize his or her document relative to other documents in the database.
Keyword stuffing
[edit]Most search engines are designed to look for words anywhere in a document[4] — in the title, text, etc. In this case, a keyword can be any term that exists in the document. However, priority is given to words that appear in the title, words that are repeated many times, and words that are explicitly designated as keywords in the markup.[5] Keyword stuffing is a method of search engine optimization (SEO) where keywords are loaded into the web page’s meta tags, visible content, or backlink anchor text in an attempt to gain an unfair advantage in search engine rankings. Most permanent ranking improvements after adding secondary keywords occur within the first 2–4 weeks.[6] But overall, optimization using secondary keywords should be monitored for 8 weeks.
Keyword stuffing can lead to temporary or permanent banning or penalizing of the website in major search engines.[7] Repetition of words in meta tags may explain why many search engines no longer use these tags. Currently, search engines focus more on unique, comprehensive, relevant, and useful content, which overall improves quality, making keyword stuffing useless, yet it is still practiced by many webmasters.
Many major search engines have implemented algorithms that recognize keyword stuffing and reduce or eliminate any unfair advantage in search that might have been gained through this tactic, and often they also penalize, downgrade, or remove from their indexes websites that use keyword stuffing.
Changes and algorithms specifically designed to punish or block sites using keyword stuffing include the Google Florida update (November 2003),[8][9] Google Panda (February 2011),[10] Google Hummingbird (August 2013),[11] and the Bing update from September 2014.[12][13]
Headlines on online news sites are increasingly filled only with search-friendly keywords that identify the story.[14] Traditional reporters and editors do not approve of this practice, but it is effective for optimizing news stories for search.
Examples
[edit]- Canadian Subject Headings (CS)
- Library of Congress Subject Headings (LCSH)
- Medical Subject Headings (MeSH)
- Polythematic Structured Subject Heading System (PSH)
- Subject Headings Authority File (SWD)
See also
[edit]References
[edit]- ^ a b Svenonius, Elaine (2009). The intellectual foundation of information organization (1st MIT Press pbk. ed.). Cambridge, Massachusetts: MIT Press. ISBN 9780262512619.
- ^ Cutts, Matt. (2010, March 4). How search works. Retrieved from https://www.youtube.com/watch?v=BNHR6IQJGZs
- ^ CLIO. Keyword search. Columbia University Libraries. Retrieved from http://www.columbia.edu/cu/lweb/help/clio/keyword.html
- ^ "Search Engine - an overview". www.sciencedirect.com. Retrieved 2025-06-30.
- ^ "The Complete Guide to On-Page Optimization". digitalmarketinginstitute.com. Retrieved 2025-06-30.
- ^ "How Long Does It Take for Keywords to Improve After Optimization? A 1,800-Keyword Experiment" (PDF). rankulate.com. Retrieved 2025-06-30.
- ^ "Keyword Stuffing and Why It Should Be Avoided". seranking.com. Retrieved 2025-06-30.
- ^ "A Brief History of Google Algorithm Updates". www.madfishdigital.com. Retrieved 2025-06-30.
- ^ "Google Algorithm Updates You Should Know". visimpact.com. Retrieved 2025-06-30.
- ^ "Google Panda Update - Optimization & Recovery Guide". www.clickintelligence.co.uk. Retrieved 2025-06-30.
- ^ "Google Algorithms - Studiodog". studiodog.com. Retrieved 2025-06-30.
- ^ "Bing Search Update Affects 5 Million Sites". acs-web.com. Retrieved 2025-06-30.
- ^ "Langley web design for smartphones". s3.fr-par.scw.cloud. Retrieved 2025-06-30.
- ^ "How to Do SEO for News Websites". www.pagetraffic.in. Retrieved 2025-06-30.
Further reading
[edit]- Ferris, Anna M. (2018). "Birth of a Subject Heading". Library Resources & Technical Services. 62 (1): 16–27. doi:10.5860/lrts.62n1.16.