Introduction To Automatic Indexing

This document discusses different approaches to automatic indexing, including statistical, natural language, and concept indexing. It focuses on statistical indexing and vector weighting. Statistical approaches store frequency statistics about terms to generate relevance scores for searches. Vector weighting represents documents as vectors of term weights. Several term weighting algorithms are discussed, including term frequency, inverse document frequency, and normalization methods. The goal is to extract meaningful topics from documents and represent them in a way that facilitates accurate information retrieval.


1

Introduction to Automatic Indexing
2
Overview
The indexing process is a transformation of an item that extracts the
semantics of the topics discussed in the item
The extracted information is used to create the processing tokens and the
searchable data structure.
Automatic indexing is the process of analyzing an item to extract the
information to be permanently kept in an index.
Search strategies are classified as statistical, natural language, and concept.

[Figure: Data flow of the indexing process — Input → Zoning → Identify processing tokens → Apply stoplists → Characterize tokens → Apply stemming → Create searchable data structure → Update document file; on the search side, a user command is processed against the searchable data structure to create a hit list.]
3
Automatic Indexing Approaches
Statistical strategies
Most prevalent in commercial systems
Cover the broadest range of indexing technologies
Approach
Use the frequency of occurrence of events
Events are occurrences of Processing Tokens (PTs) within documents and within the database
Store a single statistic, such as how often each word occurs in an item, that is used in generating relevance scores after a standard Boolean search
The statistics applied to the event data are probabilistic, Bayesian, vector space, and neural net
The static approach stores the occurrence of each word in an item, which is used in generating relevance scores after a standard Boolean search
Probabilistic indexing stores the probability that a particular item satisfies a particular query
Bayesian and vector approaches store information used in generating a relative confidence level of an item's relevance to a query
Neural nets are dynamic learning structures used to determine concept classes

4
Automatic Indexing Approaches (Cont.)
Natural language
Additionally performs varying levels of natural language parsing of the item to disambiguate the context of the PTs and to generalize to more abstract concepts within an item (e.g., present, past, and future actions)
This additional information is stored within the index and is used to enhance search precision
Concept indexing
Uses the words within an item to correlate to the concepts discussed in the item
A generalization of the specific words to values used to index the item
Hypertext Linkages
Provide virtual threads of concepts between items versus directly defining the
concept within an item.
To maximize location of relevant items, applying several different algorithms
to the same corpus provides the optimum results, but the storage and
processing overhead is significant
5
Statistical Indexing
Vector Weighting
6
Overview
The semantics of every item are represented as a vector
A vector is a one-dimensional set of values, where the order/position of
each value in the set is fixed and represents a particular domain
In IR, each position in the vector typically represents a PT( Processing
Token).
Two approaches to the domain values
Binary: the domain contains the values one and zero
1 represents the existence of the PT in the item
Weighted: the domain is the set of all real positive numbers
Relative importance of that PT in representing the semantics of
the item (provide a basis for determining the rank of an item)
Ex: an item discussing petroleum refineries in Mexico

            Petroleum   Mexico   Oil    Taxes   Refineries
Binary         1          1       1       0         1
Weighted      2.8        1.6     3.5     0.3       3.1
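A minimal Python illustration of the two domains, using the term order and values from the table above (the vocabulary list and variable names are illustrative):

```python
# Positions fixed by the vocabulary: each processing token is one dimension.
vocab = ["Petroleum", "Mexico", "Oil", "Taxes", "Refineries"]

# Binary domain: 1 marks the existence of the PT in the item.
binary_vector = [1, 1, 1, 0, 1]

# Weighted domain: positive real numbers giving the PT's relative importance.
weighted_vector = [2.8, 1.6, 3.5, 0.3, 3.1]

# The weighted form supports ranking; the binary form only records membership.
for term, b, w in zip(vocab, binary_vector, weighted_vector):
    print(f"{term:10s} binary={b} weighted={w}")
```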
7
Overview (Cont.)
Each processing token can be considered a dimension in an item representation space.

There are many algorithms that can be used in calculating the weights of processing tokens:
Simple Term Frequency Algorithm
Inverse Document Frequency Weighting
Signal Weighting
Discrimination Value

[Figure: the example item plotted as a vector in a three-dimensional space with coordinates Oil = 3.5, Petroleum = 2.8, Mexico = 1.6.]
8
Simple term frequency Algorithm
The term frequency tf(t,d) of term t in document d is defined as the number of times that t occurs in d.
The weight is equal to the term frequency.
TF (Term Frequency): frequency of occurrence of the PT in an existing item
TOTF (Total Term Frequency): frequency of occurrence of the PT in the database
IF/DF (Item/Document Frequency): number of unique items in the database that contain the processing token
Emphasizes the use of a particular PT within an item
If "computer" occurs 15 times within an item, it gets a weight of 15
Problems: normalization between items and use of the PT within the database
The longer an item is, the more often a PT may occur within the item
Use of the absolute value biases weights toward longer items
Hence normalization is done
Ex (pivoted unique normalization):

Weight = [(1 + log(TF)) / (1 + log(average TF))] / [(1 - slope) * pivot + slope * (number of unique terms)]

Pivot: average number of unique terms occurring in items of the collection.
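A minimal Python sketch of the pivoted unique normalization above (the function name, the example numbers, and the default slope of 0.2 are illustrative assumptions, not values given in the slides):

```python
import math

def pivoted_unique_tf(tf, avg_tf, n_unique_terms, pivot, slope=0.2):
    """Pivoted unique-term normalization of a raw term frequency.

    tf             -- frequency of the PT in the item
    avg_tf         -- average term frequency within the item
    n_unique_terms -- number of unique terms in the item
    pivot          -- average number of unique terms per item in the collection
    slope          -- tuning constant (0.2 assumed here for illustration)
    """
    damped = (1.0 + math.log(tf)) / (1.0 + math.log(avg_tf))
    norm = (1.0 - slope) * pivot + slope * n_unique_terms
    return damped / norm

# A PT occurring 15 times in an item whose average TF is 3, with 120 unique terms,
# in a collection whose items average 100 unique terms:
print(pivoted_unique_tf(15, 3, 120, 100))
```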
Simple Term Frequency Algorithm contd..
Maximum Term Frequency
In the first technique, the term frequency for each word is divided by the maximum frequency of the word in any item. This normalizes the term frequency values to a value between zero and one.
Problem: the maximum term frequency can be so large that it decreases the value of the term frequency in short items to too small a value and loses significance.
Logarithmic Term Frequency
In this technique the log of the term frequency plus a constant is used to replace the term frequency. The log function performs the normalization when the term frequencies vary significantly due to the size of documents.
COSINE function
The COSINE function used as a similarity measure can also be used to normalize the values in a document.
This is accomplished by treating the index of a document as a vector and dividing the weights of all terms by the length of the vector. This normalizes to a vector of maximum length one.
It uses all of the data in a particular item to perform the normalization and is not distorted by any particular term.
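A minimal sketch of cosine (vector-length) normalization of an item's term weights, assuming the weights are held in a simple dict:

```python
import math

def cosine_normalize(weights):
    """Divide every term weight by the vector length so the item has length one."""
    length = math.sqrt(sum(w * w for w in weights.values()))
    return {term: w / length for term, w in weights.items()}

item = {"oil": 3.5, "petroleum": 2.8, "mexico": 1.6}
print(cosine_normalize(item))  # the resulting vector has Euclidean length 1.0
```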
Simple Term Frequency Algorithm contd
The problem occurs when there are multiple topics within an item. The COSINE technique will normalize all values based upon the total length of the vector that represents all of the topics.
If a particular topic is important but briefly discussed, its normalized value could be significantly reduced in comparison to another document that only discusses that topic.
Penalizing long documents
Singhal did experiments showing that longer documents are, in general, more likely to be relevant to topics than short documents.
Normalization was making all documents appear to be the same length.
To compensate, a correction factor was defined that is based upon document length and maps the Cosine function into an adjusted normalization function.
The function determines the document length crossover point for longer documents where the probability of relevance equals the probability of retrieval (given a query set).
This value, called the "pivot point", is used to apply an adjustment to the normalization process.
Simple Term Frequency Algorithm contd
The theory is based upon straight lines, so it is a matter of determining the slope of the lines.
New normalization = (slope)*(old normalization) + K
K is generated by rotating the line about the pivot point, where the old normalization equals the new normalization.
Substituting the pivot for both the old and new values in the above formula, we can solve for K at that point.
Using the resulting formula for K and substituting it back into the above formula produces the following formula:
New normalization = (slope)*(old normalization) + (1.0 - slope)*(pivot)
Slope and pivot are constants for any document/query set.
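A sketch of applying this pivoted correction to an item's cosine length; the slope of 0.2 and the pivot value are illustrative assumptions, with the pivot standing in for the average vector length over the collection:

```python
import math

def pivoted_length(weights, pivot, slope=0.2):
    """New normalization = slope*(old cosine length) + (1 - slope)*pivot."""
    old_length = math.sqrt(sum(w * w for w in weights))
    return slope * old_length + (1.0 - slope) * pivot

def pivoted_normalize(weights, pivot, slope=0.2):
    """Divide term weights by the pivot-adjusted length instead of the raw length."""
    length = pivoted_length(weights, pivot, slope)
    return [w / length for w in weights]

print(pivoted_normalize([3.5, 2.8, 1.6], pivot=5.0))
```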
Problem: the Cosine function favors short documents over long documents and also favors documents with a large number of terms.
In documents with a large number of terms, the Cosine factor is approximated by the square root of the number of terms.
This suggests that using the ratio of the logs of the term frequencies works best for longer items in the calculations:

(1 + log(TF)) / (1 + log(average TF))
Simple Term Frequency Algorithm contd
Document frequency
Rare terms are more informative than frequent
terms
Consider a term in the query that is rare in the
collection (e.g., arachnocentric)
A document containing this term is very likely to
be relevant to the query arachnocentric
We want a high weight for rare terms like
arachnocentric.
Document frequency, continued
Frequent terms are less informative than rare terms
Consider a query term that is frequent in the collection (e.g.,
high, increase, line)
A document containing such a term is more likely to be relevant than a document that doesn't,
but it is not a sure indicator of relevance.
For frequent terms, we want high positive weights for words
like high, increase, and line
But lower weights than for rare terms.
We will use document frequency (df) to capture this.

idf weight
df_t is the document frequency of t: the number of documents that contain t
df_t is an inverse measure of the informativeness of t
df_t <= N
We define the idf (inverse document frequency) of t by

idf_t = log10(N / df_t)

We use log(N / df_t) instead of N / df_t to dampen the effect of idf.
It will turn out that the base of the log is immaterial.
idf example, suppose N = 1 million

term        df_t          idf_t
calpurnia   1             6
animal      100           4
sunday      1,000         3
fly         10,000        2
under       100,000       1
the         1,000,000     0

There is one idf value for each term t in a collection.
idf_t = log10(N / df_t)
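A small Python sketch that reproduces the idf column above (N = 1,000,000):

```python
import math

def idf(N, df):
    """Inverse document frequency: log10(N / df)."""
    return math.log10(N / df)

N = 1_000_000
for term, df in [("calpurnia", 1), ("animal", 100), ("sunday", 1_000),
                 ("fly", 10_000), ("under", 100_000), ("the", 1_000_000)]:
    print(f"{term:10s} df={df:>9,} idf={idf(N, df):.0f}")
```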
A term such as "computer" that occurs in every item represents a concept used in an item, but it does not help a user find the specific information being sought, since a search on it returns the complete database.

This leads to the general statement, enhancing weighting algorithms, that the weight assigned to a term should be inversely proportional to the frequency of occurrence of that term in the database.
18
Inverse Document Frequency (IDF)
The weight combines the frequency of occurrence of the processing token in an item with the inverse of its frequency of occurrence in the database:
WEIGHT_ij = TF_ij * [Log2(n) - Log2(IF_j) + 1]

WEIGHT_ij : weight assigned to term j in item i
TF_ij : frequency of term j in item i
IF_j : number of items in the database that have term j in them
n : number of items in the database

            n        TF      IF
Oil         2048     4       128
Mexico      2048     8       16
Refinery    2048     10      1024
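A minimal Python sketch of this weighting applied to the table above (the function name is illustrative):

```python
import math

def idf_weight(tf, n, item_freq):
    """WEIGHT_ij = TF_ij * [log2(n) - log2(IF_j) + 1]"""
    return tf * (math.log2(n) - math.log2(item_freq) + 1)

# n = 2048 items in the database
print(idf_weight(4, 2048, 128))    # Oil:      4 * (11 - 7 + 1)   = 20
print(idf_weight(8, 2048, 16))     # Mexico:   8 * (11 - 4 + 1)   = 64
print(idf_weight(10, 2048, 1024))  # Refinery: 10 * (11 - 10 + 1) = 20
```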

Effect of idf on ranking
Does idf have an effect on ranking for one-term queries, like "iPhone"?
idf has no effect on the ranking of one-term queries.
idf affects the ranking of documents for queries with at least two terms.
For the query "capricious person", idf weighting makes occurrences of "capricious" count for much more in the final document ranking than occurrences of "person".
20
Collection vs. Document frequency
The collection frequency of t is the number of occurrences of t in the collection, counting multiple occurrences.
Example:

Word        Collection frequency    Document frequency
insurance   10440                   3997
try         10422                   8760

Which word is a better search term (and should get a higher weight)?
22
Signal Weighting
IDF does not account for the term frequency distribution of the PT in the items that contain the term.
The distribution of the frequency of processing tokens within an item can affect the ability to rank items.
An instance of an event that occurs all the time has less information value than an instance of a seldom occurring event.

Item distribution:

Item    SAW    DRILL
A       10      2
B       10      2
C       10     18
D       10     10
E       10     18
23
Signal Weighting (Cont.)
In information theory, the information content value of an object is inversely proportional to the probability of occurrence of the object
INFORMATION = -Log2(p)
p is the probability of occurrence of the event
p = 0.5%: INFORMATION = -Log2(0.005) ≈ 7.6
p = 50%: INFORMATION = -Log2(0.5) = -(-1) = 1
If there are many independently occurring events:

AVE_INFO = SUM(k = 1..n) p_k * Log2(1 / p_k)

AVE_INFO is maximized when the value of every p_k is the same.
p_k can be defined as TF_ik / TOTF_k.
24
Signal Weighting (Cont.)

Signal_k = Log2(TOTF_k) - AVE_INFO

Weight_ik = TF_ik * Signal_k
          = TF_ik * [ Log2(TOTF_k) - SUM(i = 1..n) (TF_ik / TOTF_k) * Log2(TOTF_k / TF_ik) ]

For the SAW/DRILL example (TOTF = 50 for both terms):

Signal_SAW   = Log2(50) - 5 * (10/50) * Log2(50/10)
Signal_DRILL = Log2(50) - [ (2/50)*Log2(50/2) + (2/50)*Log2(50/2) + (18/50)*Log2(50/18)
                            + (10/50)*Log2(50/10) + (18/50)*Log2(50/18) ]
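A small Python sketch that computes the signal values for the SAW/DRILL distributions above:

```python
import math

def signal(tf_per_item):
    """Signal_k = log2(TOTF_k) - sum over items of (TF_ik/TOTF_k) * log2(TOTF_k/TF_ik)."""
    totf = sum(tf_per_item)
    ave_info = sum((tf / totf) * math.log2(totf / tf) for tf in tf_per_item)
    return math.log2(totf) - ave_info

# Frequencies in items A..E from the earlier distribution table.
print(signal([10, 10, 10, 10, 10]))  # SAW:   uniform distribution, signal ~ 3.32
print(signal([2, 2, 18, 10, 18]))    # DRILL: skewed distribution, signal ~ 3.75 (higher)
```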
25
Similarity Measure
Measure the similarity between a query and a
document
Similarity measure examples
Dot product:

SIM(DOC_i, QUERY_j) = SUM(k) (DTerm_ik) * (QTerm_jk)

Cosine:

SIM(DOC_i, QUERY_j) = SUM(k) (DTerm_ik) * (QTerm_jk) / SQRT( SUM(k) (DTerm_ik)^2 * SUM(k) (QTerm_jk)^2 )
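A minimal Python sketch of both similarity measures, using the weighted vector from the earlier petroleum/Mexico example and an illustrative two-term query:

```python
import math

def sim_dot(doc, query):
    """Dot-product similarity between a document vector and a query vector."""
    return sum(d * q for d, q in zip(doc, query))

def sim_cosine(doc, query):
    """Cosine similarity: dot product divided by the product of the vector lengths."""
    return sim_dot(doc, query) / math.sqrt(
        sum(d * d for d in doc) * sum(q * q for q in query))

# Dimension order: Petroleum, Mexico, Oil, Taxes, Refineries
doc   = [2.8, 1.6, 3.5, 0.3, 3.1]
query = [1.0, 1.0, 0.0, 0.0, 0.0]   # query on "petroleum" and "Mexico"
print(sim_dot(doc, query), sim_cosine(doc, query))
```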
26
Problems with Weighting Schemes
The two weighting schemes, IDF and signal, use total frequency and item frequency factors, which make them dependent on the distributions of PTs within the DB
These factors change dynamically
Approaches to compensate for changing values
Ignore the variances and calculate weights based on
current values, with the factors changing over time.
Periodically rebuild the complete search database
Use a fixed value while monitoring changes in the factors.
When the changes reach a certain threshold, start using
the new value and update all existing vectors with the new
value
Store the invariant values (e.g. TF) and at search time
calculate the latest weights for PTs in items needed for
search terms
27
Problems with Weighting Schemes (Cont.)
Side effect of maintaining currency in the DB
for term weights
The same query over time returns a different
ordering of items
A new word in the DB undergoes significant
changes in its weight structure from initial
introduction until its frequency in the DB reaches
a level where small changes do not have
significant impact on changes in weight values

28
Problems with Vector Model
A major problem comes in the vector model
when there are multiple topics being
discussed in a particular item
Assume an item has an in-depth discussion of oil
in Mexico and also coal in Pennsylvania
This item would receive a high ranking value in a search for coal in
Mexico
Cannot handle proximity searching
