0% found this document useful (0 votes)
15 views1 page

IDS Unit-4&5 Important Questions

The document outlines important questions for Units IV and V of an IDS course, focusing on supervised and unsupervised learning, as well as applications and evaluation methods. It includes both 2-mark and 6-mark questions covering topics such as logistic regression, decision trees, A/B testing, and sentiment analysis. The questions aim to assess understanding of key concepts and methodologies in data science.

Uploaded by

sravyasankuratri
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
15 views1 page

IDS Unit-4&5 Important Questions

The document outlines important questions for Units IV and V of an IDS course, focusing on supervised and unsupervised learning, as well as applications and evaluation methods. It includes both 2-mark and 6-mark questions covering topics such as logistic regression, decision trees, A/B testing, and sentiment analysis. The questions aim to assess understanding of key concepts and methodologies in data science.

Uploaded by

sravyasankuratri
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 1

IDS Unit-4 & 5 Important Questions

Unit IV: Supervised and Unsupervised Learning

2 Marks Questions

1. What is the main difference between logistic regression and softmax regression?
2. Define the role of a decision rule in classification tasks.
3. What is Hierarchical Clustering?

6 Marks Questions

1. Explain the concept of k-Nearest Neighbors (kNN) classification with an example.


2. Compare and contrast decision trees and random forests in terms of accuracy and
overfitting.
3. Describe the Naïve Bayes algorithm and its application in text classification.
4. Discuss the working of Support Vector Machines (SVM) for binary classification,
including the concept of the hyperplane.
5. Elaborate on the Expectation-Maximization (EM) algorithm and its use in clustering
tasks.
6. What are different kernel functions? How are these used in SVM?
7. Explain the construction of a decision tree using Entropy and Information Gain.

Unit V: Applications, Evaluations, and Methods

2 Marks Questions

1. What is A/B testing, and why is it used in model evaluation?


2. Mention one advantage of using focus groups over individual interviews in data
collection.
3. Define cross-validation in the context of model evaluation.

6 Marks Questions

1. Explain the process of collecting and analyzing Twitter data for sentiment analysis.
2. Discuss the advantages and challenges of conducting surveys for data collection, with
examples.
3. Compare qualitative and quantitative methods for analyzing user behavior data.
4. Explain the role of mixed-method studies in evaluating the effectiveness of a
recommendation system.
5. Describe the steps involved in designing and analyzing a user study in a lab setting.
6. Discuss the importance of log and diary data in understanding user behavior, with
examples of real-world applications.
7. Discuss how metrics like accuracy, precision, recall, and F1-score help in model
comparison.
8. Highlight the tools and techniques used for extracting insights from Twitter data.
9. Discuss different types of survey questions and the factors to consider when designing a
survey.

You might also like