NSums

The N-Sums framework aims to make specialized domain search engines easier to design, implement, and maintain. It combines multiple search strategies, including general search engines, specialized domain engines, direct page searching, and local databases. The framework was used to create ComicSearch, a comic book search engine that outperformed the general purpose Copernic engine in tests. N-Sums emphasizes the strengths of each strategy while reducing weaknesses by combining strategies.

Uploaded by

aaes2

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

136 views9 pages

NSums

Uploaded by

aaes2

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

N-Sums: A Framework for Web-based

Search Engines
Apisitt Rattana and Andrew Davison
Dept. of Computer Engineering
Prince of Songkla University
Hat Yai, Songkhla 90112, Thailand
E-mail: [email protected]

Abstract
The N-Sums framework is aimed at making the design, implementation, and maintenance of
specialized domain search engines easier, with the resulting applications able to produce more
useful matches and less bad hits. The heart of N-Sums is the realization that effective search
requires a combination of search strategies.
ComicSearch is a specialized domain search engine for finding comic covers and other
information, built using the N-Sums framework. It has outperformed the popular Copernic general
purpose search engine in all of its tests.

1. Introduction
N-Sums (Niche-Search Using Multiple Strategies) is a framework for creating effective Web search
engines for specialized domains (e.g. finding comic books, finding the latest soccer scores), By
effective, we mean that the engine will return matches containing a high percentage of URLs
pointing to the requested information and a low percentage of links to poorly related or unrelated
data,
At the heart of N-Sums is the recognition that a variety of different search mechanisms must be
combined in order to obtain good search results. This differs from the present trend in search engine
design which hopes that 'more of the same' will lead to improved answers.
In the following sections, we describe our framework, and a particular example of its use, the
ComicSearch search engine. We compare ComicSearch with one of the leading 'super-search'
engines, Copernic [3], and see that ComicSearch performs significantly better within its chosen
domain.

2. Four Search Strategies

N-Sums identifies four categories of Web search, each with their own advantages and
disadvantages:
• general purpose search engines;
• specialized domain search engines;
• direct search of static/dynamic Web pages;
• local database(s) of search information.
N-Sums combines these to emphasize their positive features while reducing their negative ones.

2.1. General Purpose Search Engines

General purpose search engines, such as Google and AltaVista, possess many advantages: they have
simple query interfaces (usually phrase-based, with boolean operators), results are presented in a
structured format, typically including the match's title, a text summary, and a score, and their
internal indexes are relatively current.
Also grouped into this category are 'super-search' engines, such as Copernic. Copernic passes a
user's query onto 15 other search engines (e.g. Google, Hotbot), and collects their results. The main
advantage of Copernic-like applications are their increased coverage of the Web by utilizing
multiple engines.
Despite their utility, general purpose search engines have some problems. A common one is the
weakness of their query languages (e.g. they almost never allow regular expressions). A
consequence of poor query expressiveness is the increased chance of obtaining bad hits (URLs
pointing to unrelated information). This is especially true if the queries contain common words (e.g.
mad, action, battle) or words with different meanings in different contexts (e.g. wolverine,
superman). Applications like Copernic make this problem worse by requiring users to formulate
queries in a lowest common denominator notation that is understandable to all of their component
engines. 'Super-search' engines also increase the possibility of duplicate hits even though they make
some attempt to filter them out.
Another problem with general purpose search engines is the slow update frequency of their internal
indexes, caused by the need to re-examine large parts of the Web. This update rate can be
inadequate for some kinds of search, such as sports scores or cinema movie listings, which are
highly transient.
General purpose search engines make a strong selling point of their Web coverage – Google
currently advertises an index of over 1.3 billion pages (early March 2001). However, current
estimates place the number of pages at between 3 and 4 billion, and growing rapidly [2]. General
purpose search engines will never be able to index every page, and this will only become more
apparent as the Web continues to grow.
The extensive coverage and weak query power of these engines means that numerous matches are
returned (often in the 1000's). The engines attempt to score the matches, and supply summaries of
the pages, to alleviate the user's feeling of being swamped by data. Even so, the user usually has to
perform a manual, second search by scrolling through the returned matches, trying to select links
which are really useful. At best this will require a reading of the summaries, at worst it involves
downloading the full page for a result to examine it in detail; this is time-consuming and wasteful of
resources.

2.2. Specialized Domain Search Engines

This kind of search engine focuses on a particular domain, a popular example being AuctionHawk
(http://www.auctionhawk.com), a 'super-search' engine which limits its searches to online
auctions. It passes its queries to the search engines attached to Ebay, Amazon, Yahoo, and over 100
other sites.
Advantages of specialized domain search engines include the reduction of bad hits since topics
outside the domain are not examined, a potential for increased speed since the domain is smaller,
and the likelihood that the domain's index is more current. The reduction in bad hits often means
that a user will not have to carry out a second search of the results. A specialized domain offers the
opportunity to refine the query language. For example, AuctionHawk allows searches to be
restricted to auctions which include images.
An obvious problem with specialized domain search engines is that they may exclude certain kinds
of data of interest to the user.

2.3. Searching Web Pages Directly

Directly searching Web pages is often overlooked since most pages do not hold collections of data.
Examples which can be usefully searched include pages of recent sports scores, pages holding
catalogs of images, and links pages.
Web pages of this kind can be divided into two groups: static and dynamic. Static Web pages are
those where the information does not change, or changes very rarely. Web pages holding comic
cover images are usually static, since covers do not change once published. Dynamic Web pages
change frequently, as with pages holding soccer scores.
In fact, all Web pages change over time: perhaps they are reformatted or moved to another location.
The static label is something of a fiction, but is still useful as it suggests that the repeated search of
static Web pages can be replaced by the search of locally-held data extracted from those pages. This
approach is examined in the next subsection.
An advantage of searching Web pages directly is the opportunity to tailor the search to the page
format and content, and to filter the information more accurately. This requires tools which are not
found in most programming languages, such as regular expressions for pattern matching on
unstructured text, and the ability to parse HTML for searching over the structured parts of a page.
Some research Web languages contain these capabilities [e.g. 6], and there are several Java classes
that offer this functionality [1, 5].
A major drawback is the difficulty of writing search code that can handle unpredictable changes in
the pages. Robust code should detect change, perhaps by comparing page metrics gathered during
the current search (e.g. the page modification time, page size) against earlier values. Another
drawback is the need to develop search code for each Web page under consideration.

2.4. Local Storage of Search Information

It is useful if the data in a static Web page is (manually) extracted and its contents and format
converted to a local form. The advantages are that the engine designer can decide on the local data
format and its search capabilities. The outcome is that the engine can quickly examine the local
database rather than carry out network communication with many Web pages, each with their own
format and search requirements.
A significant drawback is how to devise a data format that can encompass disparate Web pages. It
must be succinct, but also easy to maintain. This latter point is required to handle the inevitable
changes to the data, the addition of new sources, and the disappearance of others.
A local database may also be useful for simplifying the search of dynamic Web pages. For example,
the database may contain the URLs of the pages together with the regular expressions for searching
them. In effect, the database stores meta-level information about the pages, which is likely to
change more slowly than the data itself.
3. Development Strategies
In this section, we describe in overview how N-Sums is applied when designing, implementing, and
maintaining a search engine.

3.1. Design Issues

A N-Sums search engine will usually employ one general purpose search engine to carry out a
through global search as a backup to the specialized domain searches done by its other components.
Crucial design elements for the general purpose and specialized domain searches are the
development of suitable queries, and a way of judging (or scoring) the results they give. The
primary aim is to develop queries and filters which reduce or eliminate the need for a second,
manual search of the engine's results, and to remove the need to examine the results pages.
The process of query design is akin to the refinement of test cases in program validation. Queries
must be formulated which adequately cover the search domain and test the various capabilities of
the search engine. Disappointing results for the test queries can be utilized in two ways: the queries
can be modified (e.g. by adding extra keywords), or the choice of search engine can be altered. This
latter approach means that the designer should try out a range of engines. It may not be possible to
modify the query in a satisfactory manner, due to the inherent limitations of the engine's query
interface. In that situation, the results must be filtered after being returned.
For a general purpose search engine, a query must be complex enough to remove bad hits. When
choosing specialized domain search engines, the goal is coverage. A single specialized domain
engine will probably not be able to handle all the likely queries.
Designing the search of Web pages falls into two distinct categories, page retrieval and filtering.
Filtering can be based on regular expression matching for unstructured text, parsing and regular
expressions for more structured formats such as results tables.

3.2. Implementation Issues

A N-Sums application will typically employ one thread (or process) for each search component, and
a further thread for the GUI interface. It will be necessary to coordinate the search threads when
they are passing their results to the GUI and when the user presses the start or stop buttons for the
searches. These requirements are easily addressed by programming in Java.
A real-world issue is network communication through a firewall; Java has support for proxies and
user authentication. Most search engines accept information encoded using the HTTP GET
protocol, but some forms-based engines require the POST protocol [4]. Both of these are supported
by Java's networking classes.

3.3. Maintenance Issues

Maintenance is a pressing problem for search engines because of the dynamic nature of the Web.
The application should include a utility for testing the liveness of its components. At the simplest
level, the continued existence of the component search engines and Web pages must be monitored.
A more complex task is to detect when they change. For search engines, the query and/or the results
formats may change.
A related point is the discovery of new sources of information, which requires repeated surveys of
the Web.

4. ComicSearch
ComicSearch finds information about comics – the user enters a comic title (or a phrase from the
title) and issue number of interest, and the search engine returns a table of URLs. The user can click
on a URL to open it in the system's default browser. The results of a query for "iron man" issue 1
are shown in Figure 1. "iron man" matches against all the comics which contain that phrase (e.g.
"Giant-Size Iron Man"), but only details on the comics with the given issue number are returned.

Figure 1. ComicSearch Results for "iron man" 1.

ComicSearch is primarily aimed at US comics produced during the so-called Golden, Silver, and
Bronze ages (roughly 1939 – 1970). These comics are the main concern of collectors. ComicSearch
is based on a similar search engine, MCCSE, which concentrated on the Marvel Comic publishing
company [7].
Google is used as the general purpose search engine, and acts as a backup to three specialized
domain search engines, AuctionHawk, the Grand Comic Database (GCD,
http://213.203.29.50/), and the search engine at Nick Simon's Marvel Silver Age site
(http://www.geocities.com/Area51/Zone/4414/index.html). GCD stores author and artist
information on over 64,000 comics, but has many gaps in its data, some errors, and a poor selection
of cover images. It is also somewhat daunting for non-technical users due to its complex query
interface. Nick Simon's site focuses on a single US publisher, Marvel, and a particular era (about
1956 to 1970), but is very comprehensive within those boundaries.
ComicSearch comes with a small database of useful comic sites (currently about 40). For each site,
a list is given of the comic titles which can be found there, together with issue numbers and URLs.
However, storing a single URL for each issue would quickly lead to a very large database. Instead,
a format was designed around storing issue ranges and URL patterns using place-holders. For
example:
Title: The Mighty Banana
Issues: 1 5 7 9-205 1004
imageURL: http://foo.com/mb**.html
These three lines represent 201 URLs for comics. At run time, ComicSearch substitutes the issue
number for the place-holding *'s in the image URL, left-padding it with 0's if necessary. A search
for "The Mighty Banana" issue 7 will return the URL http://foo.com/mb07.html. Issues with
more than two digits are substituted with no padding (e.g. http://foo.com/mb1004.html).
The database format was designed to be understandable by its users, the intention being that they
could extend it. The online documentation for ComicSearch encourages users to submit their
information by e-mail for inclusion in future releases of the application.
ComicSearch utilizes Java threads to query its component search engines and local database. An
important aspect of the threads is the filtering of the results returned by the engines. For example,
the Google thread uses a regular expression based on the comic title followed by spaces or letters or
a '#' and then the issue number. This relatively simple pattern is applied to the title lines of the
Google results, and filters out 50-70% of the bad hits on average, compared with the unfiltered
results. ComicSearch employs the SteveSoft regular expression class [2] for this, although much
can be achieved with Java's String class alone. The number of good hits filtered out is typically
quite small, about 5%. A similar technique is used in the AuctionHawk thread, but was unnecessary
in the GCD and Nick Simon threads due to their accuracy.
A specially coded networking class lets ComicSearch operate through proxies/firewalls, and deals
with user authentication.
There are no secondary searches of the results by visiting their Web pages. The success of the filters
makes this extra step unnecessary. Also there is no direct searching of Web pages. Since the comic
sites are quite static, their details were converted into database entries.
The specialized domain search engines do not return duplicate URLs since they search in different
places on the Web, but Google does occasionally return a few hits found by the others. The removal
of duplicate URLs from the results table would be straightforward, but is not currently carried out.
However, the contents of the results pages do often overlap, but this is not seen as a bad thing – it is
quite useful to have repeated information, images, etc. from different sources, to allow comparisons
between them. For example in figure 1, the results rows 1, 2, 3, 5, 7, 8, 9, 10, 15, and 18 all refer to
the same comic, but the details on the pages vary, such as where to buy the comic, it's cost, the
publication history, and the cover image size and quality.
ComicSearch was made freely available over the Internet in March 2001. It can be obtained from
http://fivedots.coe.psu.ac.th/~ad/ComicSearch/readme.html.

5. Comparisons with Copernic

We have run extensive tests of ComicSearch and compared them with Copernic, the popular 'super-
search' engine (http://www.copernic.com).
Partial Title and No. of Rows where the No. of No. of Total no.
issue no. exact exact matches related unrelated of matches
matches occur matches matches
Wolverine 1 1 32 27 40 68
Superman 100 2 2, 7 10 35 47
Spider-man 122 4 1, 22, 28, 40 17 45 66
Mad 99 0 0 58 58
Green Lantern 59 1 3 26 23 50
Flash 13 0 1 70 71
Table 1. Copernic results for six queries.

Table 1 shows six typical comic queries, the number of exact matches, related matches, unrelated
matches, and the total number of matches returned by Copernic. An exact match is a URL to a page
which describes a comic containing the partial title string with the given issue number, related
matches are URLs to pages about the general comic or its characters. Unrelated matches have
nothing to do with the comic. The 'rows' column gives the row positions of the exact matches in
Copernic's output after it had been sorted into decreasing order by score. The query input was a
(partial) comic title and issue number, separated by a space.

Partial Title and No. of Sources which No. of No. of Total no.
issue no. exact supplied the related unrelated of matches
matches exact matches matches matches
Wolverine 1 37 GG/18, AH/13, 4 3 43
GCD/6
Superman 100 10 AH/4, DB/3, 0 0 10
GG/2, GCD/1
Spider-man 122 9 GCD/3, GG/3, 0 0 9
DB/2, AH/1
Mad 99 3 GG/2, DB/1 0 2 5
Green Lantern 59 8 DB/3, GCD/3, 0 0 8
AH/2
Flash 13 11 DB/4, GCD/3, 0 19 30
GG/3, AH/1
Table 2. ComicSearch results for six queries.

Table 2 shows the same queries processed by ComicSearch. A 'rows' column is not included, partly
because the output from ComicSearch is ordered nondeterministically due to its threaded behaviour.
The other reason is that the exact matches almost always appear before the related or unrelated
ones, a consequence of the slow response rate of Google. Instead, a 'sources' column is given, which
details the number of exact matches contributed by each search thread. AH is AuctionHawk, DB is
the local database, GCD is the Grand Comic Database, GG is Google, and NS is Nick Simon's site.
ComicSearch produces fewer results than Copernic, but the quality is higher; quality can be
measured as the percentage of exact matches in the total number of matches. Another indicator is
the coverage of the exact matches. Queries such as "flash 13" can match on many different comics
which use the word "flash" in their titles, and different volumes within the same comic title.
ComicSearch frequently returns at least one link to each of these possibilities.
The presence of so many unrelated matches in Copernic's output can make using it quite tiresome:
finding the best URLs frequently involves a lengthy study of the 50 or more results. This indicates
that usability is as much affected by the number of bad hits as the number of good ones.
The number of unrelated matches generated by Copernic points to a difficulty when searching for
comics using general purpose search engines – comic titles regularly use common words such as
'flash' and 'mad'. Even a word like 'wolverine' has a number of meanings; in this instance, a US
football team and the Canadian animal.
ComicSearch is helped and hindered by the presence of Google, which accounts for all the
unrelated matches, but also occasionally turns up exact matches missed by the other searches.
Fortunately, the number of unrelated matches from Google is significantly reduced by having
ComicSearch filter its results.
The local database contributes exact matches in almost all the example queries, but its success
depends on the choice of queries. AuctionHawk returns exact matches quite often, but this depends
on the auctions currently in progress; when an auction finishes, the information will disappear soon
after. Nick Simon's search engine does not return anything for the test queries, but this is because
the titles and issues are outside its range of interest. GCD information can sometimes be rather
brief.

6. Conclusions
The N-Sums framework rests on the idea that effective search engines for specific domains (e.g.
comics, soccer scores) are best designed using a combination of search strategies. These utilize:
• general purpose search engines;
• specialized domain search engines;
• direct search of static/dynamic Web pages;
• local database(s) of search information.
We described how these different approaches have consequences for the design, implementation,
and maintenance of the resulting search engine.
We used the N-Sums framework to build ComicSearch, a search engine focussing on US comics
from the 1940's to the 1970's. It performs significantly better than general purpose search engines
due to its multi-faceted approach to finding results. ComicSearch was made freely available on the
Web in March 2001.
Our future work will utilize N-Sums to build three more search engines: one for finding celebrity
mailing addresses, one for returning the latest soccer scores for any team, and a tool for finding
online resources for computing textbooks (e.g. slides, software, exercises). We expect that these
efforts will indicate further avenues for the refinement of N-Sums.
References
[1] BRANDT, S.R. "Regular Expressions in Java", Package com.stevesoft.pat version 1.4. Available at
http://javaregex.com/, March 2001.

[2] CLIENT HELP DESK. "Web Statistics: Size, the Average Page", Available at
http://www.clienthelpdesk.com/statistics_research/
web_statistics.html, March 2001

[3] COPERNIC TECHNOLOGIES. Copernic 2001 Basic, version 5. Available at http://www.copernic.com,

March 2001.
[4] FIELDING, R., GETTYS, J., MOGUL, J., FRYSTYK, H., MASINTER, L., LEACH, P., BERNERS-LEE, T.
"Hypertext Transfer Protocol – HTTP/1.1", RFC 2616, Available at ftp://ftp.isi.edu/in-notes/rfc2616.txt,
1999.
[5] HOTHOUSE OBJECTS. "Tags: HTML Toolkit for Java", version 1.0.5. Available at
http://www.hothouseobjects.com/, March 2001.

[6] KISTLER, T. AND MARAIS, H. "WebL – A Programming Language for the Web", SRC Research Report, Digital
Systems Research Center, Palo Alto, California, USA., 1998.
[7] RATTANA, A. "Marvel Comics Cover Search Engine (MCCSE)", Senior Project, Dept. of Comp. Eng., PSU, Hat
Yai, Thailand, February 2001.

Google Search Revealed: Mastering the Algorithm for Search Dominance
From Everand
Google Search Revealed: Mastering the Algorithm for Search Dominance
Azhar ul Haque Sario
No ratings yet
Practice 2 - IZO 083
No ratings yet
Practice 2 - IZO 083
107 pages
SEO for Beginners A Step-by-Step Guide to Ranking Higher
From Everand
SEO for Beginners A Step-by-Step Guide to Ranking Higher
Steven Mcananey
No ratings yet
Malware Binary Deobfuscation PDF
No ratings yet
Malware Binary Deobfuscation PDF
14 pages
Schedules in DBMS - Types of Schedules in DBMS
No ratings yet
Schedules in DBMS - Types of Schedules in DBMS
16 pages
Data Logger Line No
No ratings yet
Data Logger Line No
1 page
Top 50 SQL Interview Questions
No ratings yet
Top 50 SQL Interview Questions
8 pages
Techniques for Advanced Search Engine Optimization: On Autopilot, Increase Your Traffic and Profits!
From Everand
Techniques for Advanced Search Engine Optimization: On Autopilot, Increase Your Traffic and Profits!
Jim Stephens
No ratings yet
Year 11 DPR Note
No ratings yet
Year 11 DPR Note
17 pages
Scality RING Datasheet 240606
No ratings yet
Scality RING Datasheet 240606
4 pages
PL 300T00A ENU Powerpoint02
No ratings yet
PL 300T00A ENU Powerpoint02
27 pages
Enhancing Link Evaluation Through A Coor
No ratings yet
Enhancing Link Evaluation Through A Coor
21 pages
HOD JSS3 Computer Exam
No ratings yet
HOD JSS3 Computer Exam
2 pages
Stored Procedure Activity
No ratings yet
Stored Procedure Activity
36 pages
TP 1 - HDFS
No ratings yet
TP 1 - HDFS
40 pages
SIT103 - SIT772 7.1P TaskSheet
No ratings yet
SIT103 - SIT772 7.1P TaskSheet
3 pages
Chapter 7. Databases
No ratings yet
Chapter 7. Databases
41 pages
Data Science QB Solve SEM6
No ratings yet
Data Science QB Solve SEM6
157 pages
Data Ontap 8.2: Storage Management Guide For 7-Mode
No ratings yet
Data Ontap 8.2: Storage Management Guide For 7-Mode
397 pages
Mysql All - Queries
No ratings yet
Mysql All - Queries
14 pages
CS571 Note
No ratings yet
CS571 Note
2 pages
Untitled Document
No ratings yet
Untitled Document
4 pages
Microsoft Power Bi Training
100% (1)
Microsoft Power Bi Training
2 pages
SEO, SEM & SMM for Small Business Owners: SEO, SEM & SMM SERIES, #1
From Everand
SEO, SEM & SMM for Small Business Owners: SEO, SEM & SMM SERIES, #1
Harriet Fosuah Quansah
No ratings yet
Irt Unit3
No ratings yet
Irt Unit3
50 pages
Chapter 3
No ratings yet
Chapter 3
39 pages
Web Crawler A Review
No ratings yet
Web Crawler A Review
5 pages
Mastering Search Engine Marketing: A Guide for SEM Campaign Success
From Everand
Mastering Search Engine Marketing: A Guide for SEM Campaign Success
Rebecca Cox
No ratings yet
Seo Learning Guide
From Everand
Seo Learning Guide
ngencoband
No ratings yet
Diagnostic Challenges With Acyclovir Crystalluria - A Case Study
No ratings yet
Diagnostic Challenges With Acyclovir Crystalluria - A Case Study
7 pages
Search Engine: An Effective Tool For Exploring The Internet
No ratings yet
Search Engine: An Effective Tool For Exploring The Internet
5 pages
Assignment 05 ANSWERS
100% (1)
Assignment 05 ANSWERS
5 pages
The Telecheckinternet Check Acceptance and Checks by Phone Service
No ratings yet
The Telecheckinternet Check Acceptance and Checks by Phone Service
1 page
نسخة من Learning SEO With Free Resources (v.17) - A Roadmap by @Aleyda
No ratings yet
نسخة من Learning SEO With Free Resources (v.17) - A Roadmap by @Aleyda
66 pages
Supporting The SBR Style of Web Usage: Engine. There Are Other Forms of Search
No ratings yet
Supporting The SBR Style of Web Usage: Engine. There Are Other Forms of Search
6 pages
G.711, G.721, G.726 and G.728 Codecs Voip
No ratings yet
G.711, G.721, G.726 and G.728 Codecs Voip
5 pages
Submitted By:: Lovely Professional University, Punjab
100% (1)
Submitted By:: Lovely Professional University, Punjab
9 pages
PSS SINCAL - Multi - User
No ratings yet
PSS SINCAL - Multi - User
20 pages
Web Search Engines
No ratings yet
Web Search Engines
30 pages
Topic 3 W3 Crawls and Feeds - SDR - March2023
No ratings yet
Topic 3 W3 Crawls and Feeds - SDR - March2023
32 pages
KX-TGEA20 KX-TGHA20: Additional Digital Cordless Handset
No ratings yet
KX-TGEA20 KX-TGHA20: Additional Digital Cordless Handset
12 pages
UNIT 3 Notes
No ratings yet
UNIT 3 Notes
32 pages
Apache Airflow On Docker For Complete Beginners - Justin Gage - Medium
No ratings yet
Apache Airflow On Docker For Complete Beginners - Justin Gage - Medium
12 pages
Advanced Encryption Standard (AES) : Feature Overview
No ratings yet
Advanced Encryption Standard (AES) : Feature Overview
20 pages
Speak Freely Application Programming Interface: Draft 0.1 - 06/03/1999 by Brian C. Wiles
No ratings yet
Speak Freely Application Programming Interface: Draft 0.1 - 06/03/1999 by Brian C. Wiles
7 pages
UNIT 4 Cte Note
No ratings yet
UNIT 4 Cte Note
12 pages
Understanding Genitourinary System Cytology
No ratings yet
Understanding Genitourinary System Cytology
130 pages
Lab 7 JDBC 1 (Part 2) : Objectives
No ratings yet
Lab 7 JDBC 1 (Part 2) : Objectives
14 pages
Clinical Excellence-V2n1p50-En PDF
No ratings yet
Clinical Excellence-V2n1p50-En PDF
20 pages
Oc 2 RJPGT 2023
No ratings yet
Oc 2 RJPGT 2023
13 pages
MRF648
No ratings yet
MRF648
1 page
WEB BROWSERS+search Engine
No ratings yet
WEB BROWSERS+search Engine
10 pages
BA4029 SOCIAL MEDIA WEB ANALYTICS Unit 5
No ratings yet
BA4029 SOCIAL MEDIA WEB ANALYTICS Unit 5
23 pages
Learn Farsi 1
No ratings yet
Learn Farsi 1
347 pages
LLLLLLLLLLLLLLLLL
No ratings yet
LLLLLLLLLLLLLLLLL
30 pages
Excel Assignment #2 Pivot Tables: What Is A Pivot Table
No ratings yet
Excel Assignment #2 Pivot Tables: What Is A Pivot Table
4 pages
Web Search Engingine Indexing Crawling and Ranking
No ratings yet
Web Search Engingine Indexing Crawling and Ranking
63 pages
Power Point - Web Searching Techniques
No ratings yet
Power Point - Web Searching Techniques
27 pages
Unit 8 - Search Engines
No ratings yet
Unit 8 - Search Engines
8 pages
2 SC 1971
No ratings yet
2 SC 1971
1 page
DECODE For SQL Sample Coding
No ratings yet
DECODE For SQL Sample Coding
5 pages
Duplicate Oracle Duplicate Oracle Database With RMANDatabase With RMAN
No ratings yet
Duplicate Oracle Duplicate Oracle Database With RMANDatabase With RMAN
7 pages
Electronic Check Processing
No ratings yet
Electronic Check Processing
1 page
Electronic Check Processing
No ratings yet
Electronic Check Processing
1 page
1999 GORDON - Search Engines - Findind Information On The World Wide Web - INFORMATION PROCESSING and MANAGEMENT
No ratings yet
1999 GORDON - Search Engines - Findind Information On The World Wide Web - INFORMATION PROCESSING and MANAGEMENT
40 pages
Chapter 6. Search Semantic and Recommendation Technology
No ratings yet
Chapter 6. Search Semantic and Recommendation Technology
29 pages
Rami Reddy (Msbi) Resume
No ratings yet
Rami Reddy (Msbi) Resume
3 pages
SQL Server Audit Records
No ratings yet
SQL Server Audit Records
5 pages
Search Engine Using Apache Lucene
No ratings yet
Search Engine Using Apache Lucene
5 pages
Steven Keith, Owen Kaser, Daniel Lemire, Analyzing Large Collections of Electronic Text Using OLAP, UNBSJ CSAS Technical Report TR-05-001, June 2005.
No ratings yet
Steven Keith, Owen Kaser, Daniel Lemire, Analyzing Large Collections of Electronic Text Using OLAP, UNBSJ CSAS Technical Report TR-05-001, June 2005.
13 pages
SEO Skills & Mastery: Get More Traffic with SEO
From Everand
SEO Skills & Mastery: Get More Traffic with SEO
Sarah May Hack
No ratings yet
Web Search Engines: Practice and Experience: Content Analysis Query Prcessing Search Log
No ratings yet
Web Search Engines: Practice and Experience: Content Analysis Query Prcessing Search Log
21 pages
Internet Searching Technique - Last Edited
No ratings yet
Internet Searching Technique - Last Edited
36 pages
Jaff Seminar
No ratings yet
Jaff Seminar
31 pages
Articulo Proyecto
No ratings yet
Articulo Proyecto
37 pages
The Anatomy of A Large-Scale Hypertextual
No ratings yet
The Anatomy of A Large-Scale Hypertextual
41 pages
SQL SERVER DBA Class Room Training
No ratings yet
SQL SERVER DBA Class Room Training
28 pages
An Introduction To SEO
From Everand
An Introduction To SEO
Nirmalya Roy
No ratings yet
Lab Manual: Web Technology
No ratings yet
Lab Manual: Web Technology
39 pages
Dept. of Cse, Msec 2014-15
No ratings yet
Dept. of Cse, Msec 2014-15
19 pages
Search Engine
No ratings yet
Search Engine
35 pages
Chapter - 2 Literature Survey: S. No Page No
No ratings yet
Chapter - 2 Literature Survey: S. No Page No
22 pages
W4 Lesson 6 Use Search Engines and Directories Effectively - Presentation
No ratings yet
W4 Lesson 6 Use Search Engines and Directories Effectively - Presentation
36 pages
Q21 - What Is Search Engine? Give Examples. Discuss Its Features and Working (With Examples) - Ans
No ratings yet
Q21 - What Is Search Engine? Give Examples. Discuss Its Features and Working (With Examples) - Ans
11 pages
IR Unit 3
No ratings yet
IR Unit 3
47 pages
SEO Strategies
From Everand
SEO Strategies
Mila Petrovick
No ratings yet
An Approach For Search Engine Optimization Using Classification - A Data Mining Technique
No ratings yet
An Approach For Search Engine Optimization Using Classification - A Data Mining Technique
4 pages
Google Paper
100% (8)
Google Paper
20 pages
Chapter 1 Search Engine 1. Objective
No ratings yet
Chapter 1 Search Engine 1. Objective
63 pages
Seminar Formatkhjj
No ratings yet
Seminar Formatkhjj
24 pages
The Anatomy of A Large-Scale Hypertextual Web Search Engine: Sergey Brin and Lawrence Page
No ratings yet
The Anatomy of A Large-Scale Hypertextual Web Search Engine: Sergey Brin and Lawrence Page
19 pages
SPPM 1002 Web Searching
No ratings yet
SPPM 1002 Web Searching
12 pages
Search Engines and Web Dynamics: Knut Magne Risvik Rolf Michelsen
No ratings yet
Search Engines and Web Dynamics: Knut Magne Risvik Rolf Michelsen
17 pages
Seminar Report'04 3D Searching
No ratings yet
Seminar Report'04 3D Searching
21 pages
Search Engine: Amit Kamath Ancy Alphonso
No ratings yet
Search Engine: Amit Kamath Ancy Alphonso
22 pages
Search Engine Optimization - Using Data Mining Approach
No ratings yet
Search Engine Optimization - Using Data Mining Approach
5 pages
Meta Search Engines
No ratings yet
Meta Search Engines
48 pages
My Investigation Into Search Engines: Rewan Marzani
No ratings yet
My Investigation Into Search Engines: Rewan Marzani
7 pages
The Anatomy of A Large-Scale Hypertextual Web Search Engine
No ratings yet
The Anatomy of A Large-Scale Hypertextual Web Search Engine
20 pages
Working of Search Engines: Avinash Kumar Widhani, Ankit Tripathi and Rohit Sharma Lnmiit
No ratings yet
Working of Search Engines: Avinash Kumar Widhani, Ankit Tripathi and Rohit Sharma Lnmiit
13 pages
Search Engine Student Documents
No ratings yet
Search Engine Student Documents
6 pages
Term Paper OF Int-301: Web Programming: Topic: Search Engine
No ratings yet
Term Paper OF Int-301: Web Programming: Topic: Search Engine
18 pages
Search Engine Comparisons
No ratings yet
Search Engine Comparisons
23 pages
Web Search Engine
No ratings yet
Web Search Engine
26 pages
Preparation
No ratings yet
Preparation
10 pages