Module 5-1
2020 March
11. What is spatial trend analysis?
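Spatial trend analysis is a spatial data mining technique that detects changes and trends
along a spatial dimension in data such as maps, satellite images, or GPS data. For example,
it can show how population density or house prices change with distance from a city center.
Regression and correlation methods are commonly used for this, and the resulting insights
into spatial phenomena inform decision-making.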
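
As a rough illustration, a spatial trend can be estimated by regressing an attribute on
distance from a reference point. The sketch below is only an ordinary least-squares slope
along one spatial dimension. The helper name trend_slope and the (distance, price)
observations are made up for the example.

```python
def trend_slope(points):
    """Least-squares slope of value against distance.

    points: list of (distance, value) pairs (illustrative data only).
    """
    n = len(points)
    mean_d = sum(d for d, _ in points) / n
    mean_v = sum(v for _, v in points) / n
    # Covariance and variance sums; their ratio is the regression slope.
    cov = sum((d - mean_d) * (v - mean_v) for d, v in points)
    var = sum((d - mean_d) ** 2 for d, _ in points)
    return cov / var


# Hypothetical house prices (in thousands) at increasing distance (km) from a city center.
observations = [(1, 500), (2, 450), (3, 400), (4, 350)]
slope = trend_slope(observations)
```

A negative slope here would indicate that the attribute falls as distance from the center
grows; a slope near zero would suggest no linear trend along that dimension.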
2021 April
11. What is spatial autocorrelation?
Spatial autocorrelation is a measure of the degree to which similar values of a variable are
clustered together in space, and is commonly used in spatial analysis to identify spatial
patterns and relationships.
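
One widely used statistic for spatial autocorrelation is Moran's I. The sketch below
computes it for a small example, assuming a hand-written binary adjacency matrix (1 means
two locations are neighbors). The function name morans_i and the sample values are
illustrative only.

```python
def morans_i(values, weights):
    """Moran's I for a list of values and a square spatial weight matrix."""
    n = len(values)
    mean = sum(values) / n
    dev = [v - mean for v in values]
    w_total = sum(sum(row) for row in weights)
    # Weighted cross-products of deviations between every pair of locations.
    num = sum(weights[i][j] * dev[i] * dev[j] for i in range(n) for j in range(n))
    den = sum(d * d for d in dev)
    return (n / w_total) * (num / den)


# Four locations along a line; each location neighbors the positions next to it.
adjacency = [
    [0, 1, 0, 0],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [0, 0, 1, 0],
]
clustered = morans_i([1, 1, 5, 5], adjacency)    # similar values sit together
alternating = morans_i([1, 5, 1, 5], adjacency)  # neighbors always differ
```

Positive values indicate that similar values cluster together, and negative values indicate
that neighboring locations tend to differ.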
2022 April
11. What is spatial autocorrelation?
(Same answer as 11th question from 2021 April paper)
12. What is Information Retrieval in text mining?
Information retrieval in text mining is the process of searching and retrieving relevant
information from large collections of unstructured text data, such as documents, web pages,
or social media posts, using various techniques and algorithms, such as keyword-based
search, text classification, and clustering.
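
A minimal sketch of keyword-based retrieval with TF-IDF ranking is shown below. It assumes
whitespace tokenization and a three-document toy collection; the function names and the
documents are made up for illustration and do not represent any particular search engine.

```python
import math
from collections import Counter


def tokenize(text):
    return text.lower().split()


def rank(query, docs):
    """Return indices of matching documents ordered by a simple TF-IDF score (best first)."""
    tokenized = [tokenize(d) for d in docs]
    n = len(docs)
    # Document frequency: in how many documents each term appears.
    df = Counter(term for toks in tokenized for term in set(toks))
    scores = []
    for idx, toks in enumerate(tokenized):
        tf = Counter(toks)
        score = sum(tf[t] * math.log(n / df[t]) for t in tokenize(query) if t in tf)
        scores.append((score, idx))
    ordered = sorted(scores, key=lambda s: (-s[0], s[1]))
    return [idx for score, idx in ordered if score > 0]


docs = [
    "data mining finds patterns in data",
    "text mining works on unstructured text",
    "spatial data includes maps and satellite images",
]
result = rank("text mining", docs)
```

Terms that are rare in the collection get a higher weight, so a document that matches the
rarer query term ranks above one that matches only a common term.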
Part B
Module 5: Data Mining
2020 March
20. Explain multidimensional analysis of multimedia data.
Multidimensional analysis of multimedia data involves analyzing multimedia data, such as
images, videos, and audio, in multiple dimensions. These dimensions can include color,
shape, texture, motion, and sound. By analyzing these dimensions, we can discover patterns
and relationships in the multimedia data. For example, we can use multidimensional analysis
to classify images into categories such as landscapes, animals, and buildings based on their
color, texture, and shape features.
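
The classification example above can be sketched as a nearest-centroid classifier over
feature vectors. Everything here is an assumption made for illustration: the three feature
dimensions (greenness, texture roughness, straight-edge ratio), their values, and the
function names. A real system would extract such features from the image pixels.

```python
import math

# Hypothetical feature vectors: (greenness, texture roughness, straight-edge ratio), each in [0, 1].
labelled = {
    "landscape": [(0.8, 0.4, 0.1), (0.7, 0.5, 0.2)],
    "animal": [(0.4, 0.8, 0.1), (0.3, 0.9, 0.2)],
    "building": [(0.1, 0.3, 0.9), (0.2, 0.2, 0.8)],
}


def centroid(vectors):
    """Average the vectors dimension by dimension."""
    return tuple(sum(axis) / len(vectors) for axis in zip(*vectors))


centroids = {label: centroid(vs) for label, vs in labelled.items()}


def classify(features):
    """Return the category whose centroid is closest in Euclidean distance."""
    return min(centroids, key=lambda label: math.dist(features, centroids[label]))


prediction = classify((0.15, 0.25, 0.85))
```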
2021 April
20. Write notes on document classification analysis.
Document classification analysis is a method used in text mining and natural language
processing to automatically assign documents to predefined classes. It uses machine
learning algorithms such as decision trees, support vector machines, and Naive Bayes to
classify text documents based on features extracted from them. The process has three main
steps: pre-processing, feature extraction, and model building. Pre-processing cleans the
text by tokenizing it, removing stop words, and stemming. Feature extraction identifies the
most relevant terms and represents each document as a feature vector, for example using
term frequencies or TF-IDF weights. Model building trains a machine learning algorithm on
labeled documents represented by these features. Document classification has numerous
applications, such as spam filtering, sentiment analysis, and topic categorization.
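
The pipeline described above can be sketched with a small multinomial Naive Bayes
classifier. This is a minimal illustration, not a production implementation: the stop-word
list and the training sentences are made up, stemming is omitted for brevity, and the
features are plain word counts with add-one smoothing.

```python
import math
from collections import Counter, defaultdict

STOP_WORDS = {"a", "the", "is", "to", "and", "now", "at"}  # tiny illustrative list


def preprocess(text):
    """Tokenize on whitespace, lowercase, and drop stop words (no stemming here)."""
    return [t for t in text.lower().split() if t not in STOP_WORDS]


def train(examples):
    """examples: list of (text, label). Returns per-class word counts and class counts."""
    word_counts = defaultdict(Counter)
    class_counts = Counter()
    for text, label in examples:
        class_counts[label] += 1
        word_counts[label].update(preprocess(text))
    return word_counts, class_counts


def predict(text, word_counts, class_counts):
    """Return the label with the highest log-probability under Naive Bayes."""
    vocab = {w for counts in word_counts.values() for w in counts}
    total_docs = sum(class_counts.values())
    best_label, best_score = None, -math.inf
    for label in class_counts:
        total_words = sum(word_counts[label].values())
        score = math.log(class_counts[label] / total_docs)  # class prior
        for token in preprocess(text):
            # Add-one (Laplace) smoothing so unseen words do not zero the probability.
            score += math.log((word_counts[label][token] + 1) / (total_words + len(vocab)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


training = [
    ("win a free prize now", "spam"),
    ("free money offer", "spam"),
    ("meeting agenda attached", "ham"),
    ("lunch meeting at noon", "ham"),
]
model = train(training)
label = predict("free prize offer", *model)
```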
21. What are the deficiencies for a keyword-based search engine?
Keyword-based search engines have several deficiencies that limit their effectiveness in
retrieving relevant information. First, they do not capture the meaning of the search
query, because they rely only on the occurrence of specific keywords in a document. Second,
they handle synonymy poorly: a user may miss relevant documents that express the same
concept with different words from those in the query. Third, they handle polysemy and
homonymy poorly: a keyword with several meanings (for example, "mining" in data mining
versus coal mining) matches documents on unrelated topics, so many irrelevant results are
returned. Fourth, they are less effective on noisy, unstructured sources such as social
media and multimedia content, which are difficult to index and retrieve by keyword alone.
To overcome these deficiencies, more advanced search engines use natural language
processing, machine learning, and semantic technologies to improve the accuracy and
relevance of search results.
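
The synonym problem can be demonstrated in a few lines. The sketch below compares exact
keyword matching with a simple query expansion over a hand-made synonym table. The table,
the documents, and the function names are hypothetical; real systems would rely on a
thesaurus or a learned semantic model instead.

```python
def keyword_search(query, docs):
    """Return indices of documents that contain every query term verbatim."""
    terms = set(query.lower().split())
    return [i for i, d in enumerate(docs) if terms <= set(d.lower().split())]


# Hypothetical hand-made synonym table, for illustration only.
SYNONYMS = {"car": {"car", "automobile"}, "repair": {"repair", "fix"}}


def expanded_search(query, docs):
    """Match a document if it contains each query term or one of its listed synonyms."""
    groups = [SYNONYMS.get(t, {t}) for t in query.lower().split()]
    hits = []
    for i, d in enumerate(docs):
        words = set(d.lower().split())
        if all(group & words for group in groups):
            hits.append(i)
    return hits


docs = [
    "how to repair a car engine",
    "automobile fix guide for beginners",
    "apple pie recipe",
]
exact = keyword_search("car repair", docs)
expanded = expanded_search("car repair", docs)
```

With these sample documents, exact matching finds only the first document, while the
expanded query also finds the second, which discusses the same topic in different words.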
2022 April
20. Explain various similarity-based retrieval in image databases based on image signature.
Similarity-based retrieval methods for image databases retrieve images that are visually
similar to a query. An image signature represents an image in a compact form, so the
similarity between two images can be measured by comparing their signatures. Examples of
image signatures include color histograms, texture descriptors, and shape features.
Content-based retrieval systems commonly support two kinds of query. In query-by-example
(image-sample-based) retrieval, the user supplies an example image, its signature is
computed, and the database images whose signatures are closest to it are returned. In
feature-specification retrieval, the user describes or sketches the desired features, such
as color, texture, or shape, and these are turned into a signature that is matched against
the database in the same way. Examples of similarity measures include Euclidean distance,
cosine similarity, and Jaccard similarity.
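
The three similarity measures named above can be sketched on color-histogram signatures as
follows. The 4-bin histograms and image names are invented for the example, and applying
Jaccard similarity to the set of non-empty bins is one possible convention, assumed here
for illustration.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two signatures (smaller means more similar)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def cosine(a, b):
    """Cosine similarity between two signatures (larger means more similar)."""
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b)))


def jaccard(a, b):
    """Jaccard similarity over the sets of non-empty histogram bins."""
    sa = {i for i, x in enumerate(a) if x > 0}
    sb = {i for i, y in enumerate(b) if y > 0}
    return len(sa & sb) / len(sa | sb)


# Hypothetical 4-bin color histograms (fraction of pixels per bin).
database = {
    "sunset": [0.6, 0.3, 0.1, 0.0],
    "forest": [0.1, 0.1, 0.7, 0.1],
    "beach": [0.4, 0.4, 0.2, 0.0],
}
query = [0.55, 0.35, 0.10, 0.00]

# Query by example: order database images by distance to the query signature.
ranked = sorted(database, key=lambda name: euclidean(query, database[name]))
```

Euclidean distance is a dissimilarity, so the ranking sorts in ascending order; with cosine
or Jaccard similarity the ordering would be descending instead.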
2020 March
(No questions asked)
2021 April
(No questions asked)
2022 April
(No questions asked)