13 Building Search Engine Using Machine Learning

This document proposes building a search engine using machine learning techniques to improve relevance of search results compared to traditional search engines. Existing search engines can retrieve information but cannot rank documents by importance. The proposed system would use machine learning algorithms like SVM and XGBoost on the output of the PageRank algorithm to discover the most relevant URLs for given keywords. This would allow users to more easily identify important documents in search results.

Uploaded by

Mahesh Gudelli
Copyright
© All Rights Reserved

Building Search Engine Using Machine Learning Technique

The web is the largest and richest source of data. Search engines are the
usual means of retrieving information from the World Wide Web. They provide a
simple interface that accepts a user query and displays the results as the web
addresses of relevant pages, but obtaining suitable information with
traditional search engines has become very challenging. This paper proposes a
search engine built with machine learning techniques.

EXISTING SYSTEM:

Information retrieval (IR) means retrieving the information resources we are
interested in, or extracting whatever information we need. IR deals with the
organization, storage, retrieval and evaluation of information from documents,
particularly textual information. Today almost any information can be
retrieved easily, but the existing system does not assign ranks to the
retrieved documents. If the documents were ranked, the important ones could be
identified easily.
DISADVANTAGES:

1. Retrieving information from a document that contains a large amount of
text is very difficult.
2. It is difficult to identify the important concepts or topics in a
collection of documents.
3. Explicit rankings are difficult to obtain, or not available at all, for
many documents.

PROPOSED SYSTEM:
The proposed system is a search engine that places the web address of the
most relevant web page at the top of the search results for a user query. The
main focus is to use machine learning techniques to achieve higher accuracy
than available search engines. Machine learning is used to discover the most
suitable web address for a given keyword: the output of the PageRank
algorithm is given as input to the machine learning algorithm.
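
The source does not say how the PageRank scores are computed or which features reach the learning algorithm. As a minimal sketch, assuming a tiny hypothetical link graph (the `*.example` names), a damping factor of 0.85, and a hypothetical `feature_vector` helper that pairs each score with a keyword-hit count, the hand-off from PageRank to a learner might look like this:

```python
def pagerank(links, damping=0.85, iterations=50):
    """Power-iteration PageRank over a dict {page: [pages it links to]}."""
    pages = sorted(set(links) | {t for targets in links.values() for t in targets})
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        # Every page starts each round with the "random jump" share.
        new_rank = {p: (1.0 - damping) / n for p in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # Split this page's rank evenly among the pages it links to.
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # A page without outgoing links spreads its rank over all pages.
                share = damping * rank[page] / n
                for target in pages:
                    new_rank[target] += share
        rank = new_rank
    return rank


# Hypothetical link graph, for illustration only.
links = {
    "a.example": ["b.example", "c.example"],
    "b.example": ["c.example"],
    "c.example": ["a.example"],
    "d.example": ["c.example"],
}
scores = pagerank(links)


def feature_vector(url, keyword_hits):
    """Hypothetical feature row: [PageRank score, keyword hit count]."""
    return [scores[url], float(keyword_hits)]
```

Rows produced by `feature_vector` would then form the input matrix of the machine learning algorithm; the choice of a second feature here is an assumption, not something the source specifies.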

ADVANTAGES OF PROPOSED SYSTEM:

⮚ The search engine is very useful for finding more relevant URLs for given
keywords.
⮚ Anyone can easily identify the important documents in a collection of
documents and retrieve the related data.
⮚ It proposes a novel model, named LDA (Linear Discriminant Analysis), that
achieves good performance and makes it easy to cluster related documents
based on that ranking.

Algorithm: SVM, XGBoost.
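
The source names the algorithms but gives no training details. To illustrate the SVM step only, here is a rough sketch of a linear SVM trained by batch subgradient descent on the hinge loss, written in plain Python so that it is self-contained. The training rows, labels, hyperparameters and candidate URLs are all hypothetical, and a real system would more likely rely on an established SVM or XGBoost library than on hand-written training code.

```python
def train_linear_svm(X, y, lam=0.01, lr=0.1, epochs=500):
    """Linear SVM via batch subgradient descent; labels must be -1 or +1."""
    n_samples, n_features = len(X), len(X[0])
    w = [0.0] * n_features
    b = 0.0
    for _ in range(epochs):
        # Gradient of the L2 regularisation term (lam / 2) * ||w||^2.
        grad_w = [lam * wi for wi in w]
        grad_b = 0.0
        for xi, yi in zip(X, y):
            margin = yi * (sum(wj * xj for wj, xj in zip(w, xi)) + b)
            if margin < 1.0:
                # Only samples inside the margin contribute to the hinge loss.
                for j in range(n_features):
                    grad_w[j] -= yi * xi[j] / n_samples
                grad_b -= yi / n_samples
        w = [wi - lr * gi for wi, gi in zip(w, grad_w)]
        b -= lr * grad_b
    return w, b


def decision(w, b, x):
    """Signed score; positive means 'relevant' under the learned model."""
    return sum(wj * xj for wj, xj in zip(w, x)) + b


# Hypothetical training rows: [PageRank score, normalised keyword match].
X = [[0.40, 0.9], [0.35, 0.8], [0.30, 0.7],   # labelled relevant (+1)
     [0.05, 0.1], [0.10, 0.2], [0.04, 0.0]]   # labelled not relevant (-1)
y = [1, 1, 1, -1, -1, -1]
w, b = train_linear_svm(X, y)

# Rank hypothetical candidate URLs by their decision score, best first.
candidates = {"p.example": [0.38, 0.85], "q.example": [0.06, 0.10]}
ranked = sorted(candidates, key=lambda u: decision(w, b, candidates[u]), reverse=True)
```

Sorting by the decision score is one simple way to turn a classifier into a ranker; the source does not state how its classifier output is mapped to a result order.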

SYSTEM REQUIREMENTS:
HARDWARE REQUIREMENTS:

⮚ System : Intel Core i3
⮚ Hard Disk : 1 TB
⮚ Monitor : 15" LED
⮚ Input Devices : Keyboard, Mouse
⮚ RAM : 8 GB

SOFTWARE REQUIREMENTS:
⮚ Operating system : Windows 10.
⮚ Coding Language : Python
⮚ Tool : PyCharm, Visual Studio Code
⮚ Database : SQLite
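
The source lists SQLite as the database but does not describe a schema. As a sketch under that caveat, the following uses Python's standard `sqlite3` module with a hypothetical `pages` table, made-up URLs and scores, and a hypothetical `search` helper to show how scored URLs could be stored and returned in rank order:

```python
import sqlite3

# In-memory database for the example; a real deployment would use a file path.
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE pages (
        url     TEXT NOT NULL,
        keyword TEXT NOT NULL,
        score   REAL NOT NULL,
        PRIMARY KEY (url, keyword)
    )
    """
)

# Hypothetical rows: (url, keyword, relevance score from the model).
rows = [
    ("https://a.example/ml", "machine learning", 0.62),
    ("https://b.example/ml", "machine learning", 0.91),
    ("https://c.example/db", "database", 0.75),
]
conn.executemany("INSERT INTO pages (url, keyword, score) VALUES (?, ?, ?)", rows)
conn.commit()


def search(connection, keyword, limit=10):
    """Return URLs stored for a keyword, highest score first."""
    cursor = connection.execute(
        "SELECT url FROM pages WHERE keyword = ? ORDER BY score DESC LIMIT ?",
        (keyword, limit),
    )
    return [row[0] for row in cursor]


results = search(conn, "machine learning")
```

Parameterised queries (the `?` placeholders) keep user-supplied keywords out of the SQL text itself.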

REFERENCE:
Rushikesh Karwa, Vikas Honmane, Department of Computer Science and
Engineering, Walchand College of Engineering, Sangli, India, "Building Search
Engine Using Machine Learning Technique," Proceedings of the International
Conference on Intelligent Computing and Control Systems (ICICCS 2019), IEEE
Xplore Part Number: CFP19K34-ART; ISBN: 978-1-5386-8113-8; INSPEC
Accession Number: 19557097; DOI: 10.1109/ICACCS48705.2020.9074233.
