0% found this document useful (0 votes)
73 views2 pages

Enterprise Search

Enterprise search involves making content from various enterprise sources like databases and intranets searchable. It differs from web search which searches publicly available online documents and desktop search which searches a single computer. Enterprise search systems index data from multiple sources, integrate structured and unstructured data, and use access controls. They involve phases of content collection, processing, analysis, indexing, query processing, and matching to return relevant results to users.

Uploaded by

olivia523
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
73 views2 pages

Enterprise Search

Enterprise search involves making content from various enterprise sources like databases and intranets searchable. It differs from web search which searches publicly available online documents and desktop search which searches a single computer. Enterprise search systems index data from multiple sources, integrate structured and unstructured data, and use access controls. They involve phases of content collection, processing, analysis, indexing, query processing, and matching to return relevant results to users.

Uploaded by

olivia523
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

Enterprise search

Enterprise search is the practice of making content from multiple enterprise-type sources, such as
databases and intranets, searchable to a defined audience.[1]

"Enterprise search" is used to describe the software of search information within an enterprise (though the
search function and its results may still be public).[2] Enterprise search can be contrasted with web search,
which applies search technology to documents on the open web, and desktop search, which applies search
technology to the content on a single computer.

Enterprise search systems index data and documents from a variety of sources such as: file systems,
intranets, document management systems, e-mail, and databases. Many enterprise search systems integrate
structured and unstructured data in their collections.[3] Enterprise search systems also use access controls to
enforce a security policy on their users.[4]

Enterprise search can be seen as a type of vertical search of an enterprise.

Components of an enterprise search system


In an enterprise search system, content goes through various phases from source repository to search
results:

Content awareness

Content awareness (or "content collection") is usually either a push or pull model. In the push model, a
source system is integrated with the search engine in such a way that it connects to it and pushes new
content directly to its APIs. This model is used when real-time indexing is important. In the pull model, the
software gathers content from sources using a connector such as a web crawler or a database connector.
The connector typically polls the source with certain intervals to look for new, updated or deleted
content.[5]

Content processing and analysis

Content from different sources may have many different formats or document types, such as XML, HTML,
Office document formats or plain text. The content processing phase processes the incoming documents to
plain text using document filters. It is also often necessary to normalize content in various ways to improve
recall or precision. These may include stemming, lemmatization, synonym expansion, entity extraction, part
of speech tagging.

As part of processing and analysis, tokenization is applied to split the content into tokens which is the basic
matching unit. It is also common to normalize tokens to lower case to provide case-insensitive search, as
well as to normalize accents to provide better recall.

Indexing
The resulting text is stored in an index, which is optimized for quick lookups without storing the full text of
the document. The index may contain the dictionary of all unique words in the corpus as well as
information about ranking and term frequency.

Query processing

Using a web page, the user issues a query to the system. The query consists of any terms the user enters as
well as navigational actions such as faceting and paging information.

Matching

The processed query is then compared to the stored index, and the search system returns results (or "hits")
referencing source documents that match. Some systems are able to present the document as it was indexed.

See also
Collaborative search engine
Data defined storage
Enterprise bookmarking
Enterprise information access
Faceted search
Information extraction
Knowledge management
List of search engines
Text mining
Vertical search

References
1. Kruschwitz, Udo; Hull, Charlie (2017). "Searching the Enterprise". Foundations and Trends
in Information Retrieval. 11: 1–142. doi:10.1561/1500000053 (https://doi.org/10.1561%2F15
00000053).
2. "What is Enterprise Search?" (http://www.aiim.org/What-is-Enterprise-Search).
3. "The New Face of Enterprise Search: Bridging Structured and Unstructured Information" (htt
ps://web.archive.org/web/20151028015812/http://www.arma.org/bookstore/files/Delgado.pd
f) (PDF). Archived from the original (http://www.arma.org/bookstore/files/Delgado.pdf) (PDF)
on 2015-10-28. Retrieved 2013-05-27.
4. "Security Requirements to Enterprise Search: part 1 - New Idea Engineering" (http://www.ide
aeng.com/tabId/98/itemId/118/Mapping-Security-Requirements-to-Enterprise-Search.aspx).
5. "Understanding Content Collection and Indexing" (http://www.information-management.com/
issues/20_7/content_management_data_integration_indexing_metadata-10019105-1.html).

Retrieved from "https://en.wikipedia.org/w/index.php?title=Enterprise_search&oldid=1155064219"

You might also like