Web Search

Uploaded by

bhaveshchitriv70

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

8 views

Web Search

Uploaded by

bhaveshchitriv70

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 2

Web Search is a type of Information Retrieval (IR) system designed to search for and retrieve

relevant documents from the web in response to a user's query. Web search engines (like
Google, Bing, etc.) use several techniques to index and rank web pages based on relevance.

Key Components:

1. Crawling: A web crawler (or spider) systematically browses the internet to collect web
pages.
2. Indexing: Collected web pages are processed, and relevant keywords, metadata, and
structure are extracted and stored in an index.
3. Ranking Algorithms: Once a query is entered, the engine uses various ranking
algorithms (e.g., PageRank, BM25, etc.) to score the relevance of web pages based on
factors like keyword frequency, page structure, authority, and user behavior.

How Web Search Works:

1. User Query: A user inputs a query such as "best programming languages for AI."
2. Query Processing: The search engine parses the query, possibly removing stop words
and applying stemming or synonym expansion.
3. Matching: The search engine looks for pages in its index that match the query
keywords.
4. Ranking: Based on relevance (determined by the ranking algorithms), pages are scored
and ranked.
5. Results Presentation: The most relevant results are displayed to the user, often with
snippets or previews of content.

Example:

Suppose a user searches for "best AI frameworks":

 Crawling: The search engine has crawled numerous web pages related to AI
frameworks.
 Indexing: It has indexed pages based on terms like "TensorFlow," "PyTorch," and
"AI."
 Ranking: Pages that mention "AI frameworks" frequently and are considered
authoritative (e.g., official documentation, popular blogs) will rank higher.
 Results: The search engine presents a list of results, such as articles comparing AI
frameworks, ranked by relevance.

Key Techniques:

 Vector Space Model (VSM): Representing documents and queries as vectors in a

multidimensional space and calculating cosine similarity between them.
 TF-IDF: Measures the importance of terms in a document relative to a corpus, used to
weigh keywords.
 Natural Language Processing (NLP): Used to better understand user queries and
match them with relevant content.

Challenges:
 Scalability: Handling massive web data requires efficient crawling, indexing, and
ranking techniques.
 Relevance: Providing highly relevant results while filtering out low-quality or
irrelevant pages.
 Personalization: Web searches often integrate user history and preferences to tailor
results.

Applications:

 Search Engines: Google, Bing, and DuckDuckGo are common examples that
implement IR systems for web search.
 Enterprise Search: Organizations use internal web search engines to retrieve
documents from their intranets or knowledge bases.

In web search, the objective is to retrieve information that not only matches the query but also
ranks highly for quality and relevance, helping users find what they need quickly.

P72828A - 4MA1-2H-rms-June 2023
55% (11)
P72828A - 4MA1-2H-rms-June 2023
32 pages
Micros 3700 POS Configurator Manual
100% (6)
Micros 3700 POS Configurator Manual
252 pages
Top 100 General Philosophy Quiz Questions and Answers Part 1
No ratings yet
Top 100 General Philosophy Quiz Questions and Answers Part 1
14 pages
Search Tools and Their Components
No ratings yet
Search Tools and Their Components
7 pages
UNIT3(SEARCH_ENGINE)
No ratings yet
UNIT3(SEARCH_ENGINE)
7 pages
Chapter 6. Search Semantic and Recommendation Technology
No ratings yet
Chapter 6. Search Semantic and Recommendation Technology
29 pages
Computer - Search Engines
No ratings yet
Computer - Search Engines
10 pages
Chapter - 6 Part 1
No ratings yet
Chapter - 6 Part 1
21 pages
Unit 5 - Data Science & Big Data - WWW - Rgpvnotes.in
No ratings yet
Unit 5 - Data Science & Big Data - WWW - Rgpvnotes.in
17 pages
IR Unit V Notes remaining
No ratings yet
IR Unit V Notes remaining
10 pages
Chap 1
No ratings yet
Chap 1
22 pages
Searching The Web
No ratings yet
Searching The Web
24 pages
Internet Searching Technique - Last Edited
No ratings yet
Internet Searching Technique - Last Edited
36 pages
Yogvardhan (A3) DM
No ratings yet
Yogvardhan (A3) DM
9 pages
Everything in Brief Introduction
No ratings yet
Everything in Brief Introduction
5 pages
Unit 8 - Search Engines
No ratings yet
Unit 8 - Search Engines
8 pages
IRWM: Assignment 1: How Does Google Search Engine Works?
No ratings yet
IRWM: Assignment 1: How Does Google Search Engine Works?
7 pages
Pre 5 Midterm Reviewer Nerfed
No ratings yet
Pre 5 Midterm Reviewer Nerfed
6 pages
Search Engine Optimization - Using Data Mining Approach
No ratings yet
Search Engine Optimization - Using Data Mining Approach
5 pages
IR_MOD1_NOTES
No ratings yet
IR_MOD1_NOTES
20 pages
ASSIGNMENT 3 DM
No ratings yet
ASSIGNMENT 3 DM
12 pages
Search Engine Seminar
No ratings yet
Search Engine Seminar
17 pages
Web Technology
No ratings yet
Web Technology
17 pages
How To Make A Simple Search Engine
No ratings yet
How To Make A Simple Search Engine
2 pages
Working of Search Engines: Avinash Kumar Widhani, Ankit Tripathi and Rohit Sharma Lnmiit
No ratings yet
Working of Search Engines: Avinash Kumar Widhani, Ankit Tripathi and Rohit Sharma Lnmiit
13 pages
Seminar Formatkhjj
No ratings yet
Seminar Formatkhjj
24 pages
Preparation
No ratings yet
Preparation
10 pages
Web Search-Engines: Preksha Mangal B-Tech CS-3 Year
No ratings yet
Web Search-Engines: Preksha Mangal B-Tech CS-3 Year
43 pages
IR_workbook_answers
No ratings yet
IR_workbook_answers
36 pages
E-Commerce: Search Engine Optimization
No ratings yet
E-Commerce: Search Engine Optimization
25 pages
005-001-000-024 Search Engines
No ratings yet
005-001-000-024 Search Engines
11 pages
UNIT 3 Notes
No ratings yet
UNIT 3 Notes
32 pages
A Brief Review On Search Engine Optimization: Dushyant Sharma Rishabh Shukla
No ratings yet
A Brief Review On Search Engine Optimization: Dushyant Sharma Rishabh Shukla
6 pages
Web Mining: By:-Vineeta 8pgc18 M.Tech (II Semester)
No ratings yet
Web Mining: By:-Vineeta 8pgc18 M.Tech (II Semester)
33 pages
Seminar Report 3D Searching
100% (1)
Seminar Report 3D Searching
20 pages
SearchLand: Search Quality For Beginners
No ratings yet
SearchLand: Search Quality For Beginners
29 pages
Lab Manual: Web Technology
No ratings yet
Lab Manual: Web Technology
39 pages
SEOmoz The Beginners Guide To SEO 2012
No ratings yet
SEOmoz The Beginners Guide To SEO 2012
67 pages
Search Engine
No ratings yet
Search Engine
3 pages
SEARCH ENGINE (Synopsis) - Vivek
No ratings yet
SEARCH ENGINE (Synopsis) - Vivek
17 pages
Assignment 3 of DM
No ratings yet
Assignment 3 of DM
7 pages
IRT
No ratings yet
IRT
100 pages
Implementing A Web Crawler in A Smart Phone Mobile Application
No ratings yet
Implementing A Web Crawler in A Smart Phone Mobile Application
4 pages
Working of Webb Search Engines
No ratings yet
Working of Webb Search Engines
29 pages
Search Engines-UNIT-II
No ratings yet
Search Engines-UNIT-II
4 pages
ISR_UNIT6
No ratings yet
ISR_UNIT6
14 pages
BA4029 SOCIAL MEDIA WEB ANALYTICS unit 5
No ratings yet
BA4029 SOCIAL MEDIA WEB ANALYTICS unit 5
23 pages
IEEE Paper Format Template
No ratings yet
IEEE Paper Format Template
2 pages
How Google Works
No ratings yet
How Google Works
61 pages
IR ASS1
No ratings yet
IR ASS1
12 pages
IR QP ANSWER
No ratings yet
IR QP ANSWER
59 pages
Unit 5
No ratings yet
Unit 5
36 pages
Ai Ml Text Media and Web Analytics
No ratings yet
Ai Ml Text Media and Web Analytics
5 pages
NSums
No ratings yet
NSums
9 pages
Webmininglec
No ratings yet
Webmininglec
75 pages
SEO
No ratings yet
SEO
7 pages
Search Engine Optimization To Increase Website Visibility: Abstract
No ratings yet
Search Engine Optimization To Increase Website Visibility: Abstract
6 pages
Unit-1 WAD
No ratings yet
Unit-1 WAD
13 pages
Seo Learning Guide
From Everand
Seo Learning Guide
ngencoband
No ratings yet
Automatic Image Annotation: Fundamentals and Applications
From Everand
Automatic Image Annotation: Fundamentals and Applications
Fouad Sabry
No ratings yet
Mastering Search Engine Marketing: A Guide for SEM Campaign Success
From Everand
Mastering Search Engine Marketing: A Guide for SEM Campaign Success
Rebecca Cox
No ratings yet
Google Search Revealed: Mastering the Algorithm for Search Dominance
From Everand
Google Search Revealed: Mastering the Algorithm for Search Dominance
Azhar ul Haque Sario
No ratings yet
Automatic Image Annotation: Enhancing Visual Understanding through Automated Tagging
From Everand
Automatic Image Annotation: Enhancing Visual Understanding through Automated Tagging
Fouad Sabry
No ratings yet
Verb To Be Present Simple
No ratings yet
Verb To Be Present Simple
4 pages
Lesson Plan Group 1 Practice Teaching Reading and Writing
No ratings yet
Lesson Plan Group 1 Practice Teaching Reading and Writing
4 pages
GLIDE: Towards Photorealistic Image Generation and Editing With Text-Guided Diffusion Models
No ratings yet
GLIDE: Towards Photorealistic Image Generation and Editing With Text-Guided Diffusion Models
20 pages
5655-2 (Topic 1)
No ratings yet
5655-2 (Topic 1)
15 pages
Frasers GAMSAT Prep - Section-1-Question-Log
No ratings yet
Frasers GAMSAT Prep - Section-1-Question-Log
7 pages
Modbus/TCP Client Support: Additional Important Product Information For Release 3.83
No ratings yet
Modbus/TCP Client Support: Additional Important Product Information For Release 3.83
11 pages
Pressed
No ratings yet
Pressed
29 pages
Crusades Documents 2013
No ratings yet
Crusades Documents 2013
3 pages
SSL Reference 236
No ratings yet
SSL Reference 236
253 pages
Da Grin - CEO (Chief Executive Omoita)
100% (4)
Da Grin - CEO (Chief Executive Omoita)
3 pages
Faculty of Health and Applied Sciences Department of Computer Science and Information Technology Bsc. Computer Science
No ratings yet
Faculty of Health and Applied Sciences Department of Computer Science and Information Technology Bsc. Computer Science
3 pages
Artificial Intelligence Interviewer With Gen Ai
No ratings yet
Artificial Intelligence Interviewer With Gen Ai
45 pages
New-1st PT-Oral Com (DSL)
No ratings yet
New-1st PT-Oral Com (DSL)
2 pages
Smp3 Week15 The Memorandum
No ratings yet
Smp3 Week15 The Memorandum
3 pages
J1 Music
No ratings yet
J1 Music
2 pages
Capsule3 Pro - 94x84-230310
No ratings yet
Capsule3 Pro - 94x84-230310
69 pages
Congiuntivo TRANSCRIPT
No ratings yet
Congiuntivo TRANSCRIPT
11 pages
Habitat Lesson 1-Needs of Living Things Lesson
No ratings yet
Habitat Lesson 1-Needs of Living Things Lesson
8 pages
Nurse Patient
No ratings yet
Nurse Patient
37 pages
Acr English Plus
No ratings yet
Acr English Plus
13 pages
The Elements of Culture
No ratings yet
The Elements of Culture
15 pages
Oxford Primary Skills 5
100% (1)
Oxford Primary Skills 5
60 pages
(Ebook) Mathematics IIT JEE 2005-2019 Engineering Solved Papers topic wise chapter wise problems questions solutions fully solved by B L Sharma ISBN 9789313199687, 9313199688 instant download
100% (1)
(Ebook) Mathematics IIT JEE 2005-2019 Engineering Solved Papers topic wise chapter wise problems questions solutions fully solved by B L Sharma ISBN 9789313199687, 9313199688 instant download
52 pages
Grammar Mini Lessons 1
No ratings yet
Grammar Mini Lessons 1
7 pages
9 Components of Effective
No ratings yet
9 Components of Effective
7 pages
DTU EP Syllabus PDF
100% (1)
DTU EP Syllabus PDF
63 pages
Essence of Mantra
No ratings yet
Essence of Mantra
4 pages

Web Search

Uploaded by

Web Search

Uploaded by

Web Search is a type of Information Retrieval (IR) system designed to search for and retrieve

How Web Search Works:

Suppose a user searches for "best AI frameworks":

 Vector Space Model (VSM): Representing documents and queries as vectors in a

You might also like