0% found this document useful (0 votes)

17 views5 pages

SHORT BA Mid 2

The document discusses various concepts and techniques in data analysis, including practical applications of multiple regression, definitions of least squares regression and K-Nearest Neighbor (KNN), and the principles of unsupervised learning. It also covers data cube aggregation, advantages of supervised learning, simulation processes, and the importance of what-if analysis. Additionally, it outlines the types of association rules in data mining, advantages and disadvantages of simulation techniques, and the differences between validation and verification.

Uploaded by

maninani0332

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

17 views5 pages

SHORT BA Mid 2

Uploaded by

maninani0332

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 5

1.

Write some examples of practical applications of multiple regression

analysis.
• Market segmentation
• Demand forecast

2. Define least squares regression.

A least squares regression line represents the relationship between variables in a
scatterplot. The procedure fits the line to the data points in a way that minimizes
the sum of the squared vertical distances between the line and the points. It is also
known as a line of best fit or a trend line.

3. Write down the assumptions for building good regression models.

4. What are the categorical independent variables?
5. Define K-Nearest Neighbor (KNN)?

• K-Nearest Neighbor (KNN) is an algorithm that classifies data based on

its proximity to other data. The basis for KNN is rooted in the assumption
that data points that are close to each other are more similar to each other
than other bits of data. This non-parametric, supervised technique is used
to predict the features of a group based on individual data points.

6. Write three applications of data exploration.

• Business Intelligence and Analytics
• Healthcare and Medicine
• Financial Sector
• E-commerce and Customer Experience
7. Explain the data cube aggregation with an example?
1. Data Cube Aggregation:
This technique is used to aggregate data in a simpler form. For example,
imagine the information you gathered for your analysis for the years 2012 to
2014, that data includes the revenue of your company every three months.
They involve you in the annual sales, rather than the quarterly average, so we
can summarize the data in such a way that the resulting data summarizes the
total sales per year instead of per quarter. It summarizes the data.

8. Outline the unsupervised learning?

Unsupervised learning is a branch of machine learning that deals with unlabeled

data. Unlike supervised learning, where the data is labeled with a specific
category or outcome, unsupervised learning algorithms are tasked with finding
patterns and relationships within the data without any prior knowledge of the
data’s meaning. This makes unsupervised learning a powerful tool for exploratory
data analysis, where the goal is to understand the underlying structure of the data.

9. What are the types of Association Rules in Data Mining?

Types of Association Rules in Data Mining

There are typically four different types of association rules in data mining.
They are

• Multi-relational association rules

• Generalized Association rule
• Interval Information Association Rules
• Quantitative Association Rules

10.Differentiate between an antecedent (if) and a consequent?

The same has been discussed in brief in this article.

An association rule has 2 parts:

• an antecedent (if) and

• a consequent (then)

An antecedent is something that’s found in data, and a consequent is

an item that is found in combination with the antecedent.
11.Write three Advantages of Supervised learning?
Advantages of Supervised learning
• Supervised learning allows collecting data and produces data output from
previous experiences.
• Helps to optimize performance criteria with the help of experience.
• Supervised machine learning helps to solve various types of real-world
computation problems.
• It performs classification and regression tasks.
• It allows estimating or mapping the result to a new sample.
• We have complete control over choosing the number of classes we want in
the training data.

12. Outline the advantages of data partitioning?

• Improve scalability
• Improve availability
• Improve performance

13.Define Simulation with real time example.

Define Simulation?
Simulation is the process of creating a model of a real world scenario for a variety
of reasons including education, preparing for an anticipated event or
troubleshooting a problem. The models used during a simulation might be real or
dramatized.

What is an example of a simulation?

A fire drill is an example of a simulation. It reenacts the real world scenario of a
fire in a building or an environment with the purpose of teaching appropriate
actions in the event a real fire is encountered.

14.Explain random number generation with an example.

Random Number Generation

At the hearth of any simulation model there is the capability of creating
numbers that mimic those we would expect in real life. In simulation modeling
we will assume that specific processes will be distributed according to a
specific random variable. For instance we will assume that an employee in a
donut shop takes a random time to serve customers distributed according to a
Normal random variable with mean μ and variance σ2. In order to then carry
out a simulation the computer will need to generate random serving times. This
corresponds to simulating number that are distributed according to a specific
distribution.
Let’s consider an example. Suppose you managed to generate two sequences of
numbers, say x1 and x2. Your objective is to simulate numbers from a Normal
distribution. The histograms of the two sequences are reported in
Figure 4.1 together with the estimated shape of the density. Clearly the
sequence x1 could be following a Normal distribution, since it is bell-shaped
and reasonably symmetric. On the other hand, the sequence x2 is not symmetric
at all and does not resembles the density of a Normal.
15.How ‘what-if analysis’ is useful in analytics?
By using What-If Analysis tools in Excel, you can use several different sets of
values in one or more formulas to explore all the various results.
For example, you can do what-If Analysis to build two budgets that each assumes
a certain level of revenue. Or, you can specify a result that you want a formula to
produce, and then determine what sets of values will produce that result. Excel
provides several different tools to help you perform the type of analysis that fits
your needs.

16.What are the advantages and disadvantages of simulation techniques?

Advantages of Simulation
• Control over Variables
• Risk-Free Environment
• Cost-Effective
Disadvantages of Simulation
• Accuracy and Validity
• Data Requirements
• Simplification of Realities
• Technical Skills Required
17.Write any two disadvantages of decision tree analysis. Disadvantages of
using a tree diagram as a decision-making tool
Rather than displaying real outcomes, decision trees only show patterns
connected with decisions. Because decision trees don’t provide information on
aspects like implementation, timeliness, and prices, more research may be needed
to figure out if a particular plan is viable.This type of model does not provide
insight into why certain events are likely while others are not, but it can be used
to develop prediction models that illustrate the chance of an event occurring in
certain situations.

18.What are three application areas of Monte Carlo simulation technique?

19.Differentiate between ‘Validation’ and ‘Verification’.
20.What are the two advantages of simulation?

PI Kit - MBA Admissions 2023
No ratings yet
PI Kit - MBA Admissions 2023
50 pages
120 Data Science Interview Questions
No ratings yet
120 Data Science Interview Questions
25 pages
Da - MP - 1
No ratings yet
Da - MP - 1
19 pages
7 - Simulation W-Notes
No ratings yet
7 - Simulation W-Notes
10 pages
Final Exam Review
No ratings yet
Final Exam Review
44 pages
Simulation
No ratings yet
Simulation
63 pages
120 24pgs Mlinterviewquestions
No ratings yet
120 24pgs Mlinterviewquestions
24 pages
Chapter 2
No ratings yet
Chapter 2
136 pages
Data Analytics Chapter - 1
No ratings yet
Data Analytics Chapter - 1
42 pages
JTW115E Q&A With Explanation - 240702 - 171526
No ratings yet
JTW115E Q&A With Explanation - 240702 - 171526
27 pages
DA - AKTU Short Answer + Differences
No ratings yet
DA - AKTU Short Answer + Differences
42 pages
Datascience Interview
100% (1)
Datascience Interview
31 pages
BigDataSolution of Paper Oct 2022
No ratings yet
BigDataSolution of Paper Oct 2022
11 pages
Activity5 Basillote Hermoso Samoc Saguindang Tabuniag
No ratings yet
Activity5 Basillote Hermoso Samoc Saguindang Tabuniag
6 pages
Big Data (Imp-Questions)
No ratings yet
Big Data (Imp-Questions)
17 pages
Data Science
No ratings yet
Data Science
28 pages
K. v. Narayanan, B. Lakshmikutty - Stoichiometry and Process Calculations-PHI Learning (2017)
No ratings yet
K. v. Narayanan, B. Lakshmikutty - Stoichiometry and Process Calculations-PHI Learning (2017)
613 pages
Big Data Analysis On ML Main Points
No ratings yet
Big Data Analysis On ML Main Points
5 pages
Data Mining Notes
No ratings yet
Data Mining Notes
25 pages
Unit 5 Simulation-1
No ratings yet
Unit 5 Simulation-1
13 pages
Data Science
No ratings yet
Data Science
14 pages
Question Samples
No ratings yet
Question Samples
4 pages
John Loucks: Slides by
No ratings yet
John Loucks: Slides by
63 pages
IV Ai-Ds Ad3491 Fdsa QB Unit5
No ratings yet
IV Ai-Ds Ad3491 Fdsa QB Unit5
4 pages
Soft Skills: Publisher
No ratings yet
Soft Skills: Publisher
172 pages
Mmds
No ratings yet
Mmds
12 pages
Unit 2 Question Answer
No ratings yet
Unit 2 Question Answer
13 pages
Da Question Bank
No ratings yet
Da Question Bank
7 pages
Dwdmsem 6 QB
No ratings yet
Dwdmsem 6 QB
13 pages
Ba 2 Marks Unit 4
No ratings yet
Ba 2 Marks Unit 4
4 pages
Data Analytic 3 Marks Q
No ratings yet
Data Analytic 3 Marks Q
10 pages
5 What Is Data-WPS Office
No ratings yet
5 What Is Data-WPS Office
19 pages
Top Data Science Interview Questions and Answers in 2023 PDF
100% (1)
Top Data Science Interview Questions and Answers in 2023 PDF
14 pages
Ch-04: Data and Analysis - Short Question and Answers - PDF
No ratings yet
Ch-04: Data and Analysis - Short Question and Answers - PDF
10 pages
CC Unit - 4 Imp Questions
No ratings yet
CC Unit - 4 Imp Questions
4 pages
Ds Revision 1
No ratings yet
Ds Revision 1
5 pages
Statistics
No ratings yet
Statistics
14 pages
100 Data Science Interview Questions and Answers
No ratings yet
100 Data Science Interview Questions and Answers
33 pages
500 Data Science Interview Questions and Answers - Vamsee Puligadda PDF
75% (8)
500 Data Science Interview Questions and Answers - Vamsee Puligadda PDF
141 pages
Data Scientist Interview Questions and Answers PDF
No ratings yet
Data Scientist Interview Questions and Answers PDF
37 pages
Cma Notes
100% (1)
Cma Notes
89 pages
Big Data
No ratings yet
Big Data
5 pages
Home Building Manual 2014
100% (2)
Home Building Manual 2014
39 pages
GRAUER &amp WEIL (INDIA) LTD PDF
100% (2)
GRAUER &amp WEIL (INDIA) LTD PDF
8 pages
AD8552-Machnie Learning QB
No ratings yet
AD8552-Machnie Learning QB
25 pages
2marks With Answers
No ratings yet
2marks With Answers
10 pages
Whats App
No ratings yet
Whats App
23 pages
Data Science
100% (1)
Data Science
7 pages
DA Interview Questions
No ratings yet
DA Interview Questions
7 pages
Unit 6 Questions and Answers
No ratings yet
Unit 6 Questions and Answers
4 pages
Data Science Interview Questions
No ratings yet
Data Science Interview Questions
31 pages
1) What Is Business Analytics?
No ratings yet
1) What Is Business Analytics?
6 pages
It 6001 Da 2 Marks With Answer PDF
No ratings yet
It 6001 Da 2 Marks With Answer PDF
10 pages
Data Warehouse 1
No ratings yet
Data Warehouse 1
21 pages
Basic Data Science Interview Questions
No ratings yet
Basic Data Science Interview Questions
18 pages
PhilipCardiff UCD Geometry, Meshing in OpenFOAM
No ratings yet
PhilipCardiff UCD Geometry, Meshing in OpenFOAM
72 pages
Schedule MANPOWER TLD JAN. 2023
No ratings yet
Schedule MANPOWER TLD JAN. 2023
21 pages
CMA Unit-3-1
No ratings yet
CMA Unit-3-1
68 pages
Unit - 2 CMA-1
No ratings yet
Unit - 2 CMA-1
34 pages
Science
No ratings yet
Science
5 pages
CH 3 Geo Drainage
No ratings yet
CH 3 Geo Drainage
29 pages
Borjan Proposal
No ratings yet
Borjan Proposal
12 pages
English Notebook Face Sheet-21.04.25
No ratings yet
English Notebook Face Sheet-21.04.25
3 pages
Q.1. What Is Data Mining?
No ratings yet
Q.1. What Is Data Mining?
15 pages
Cisco Hidden Commands
100% (1)
Cisco Hidden Commands
24 pages
7 Counters PDF
No ratings yet
7 Counters PDF
13 pages
Unit-3 CMA-1
No ratings yet
Unit-3 CMA-1
35 pages
Waves - Label
100% (1)
Waves - Label
2 pages
Topo Sheet Report
No ratings yet
Topo Sheet Report
15 pages
Hpfs Instruments India LLP
No ratings yet
Hpfs Instruments India LLP
25 pages
Unit 5 Pointers
No ratings yet
Unit 5 Pointers
9 pages
Chapter 2 Different Types of Fixtures
No ratings yet
Chapter 2 Different Types of Fixtures
20 pages
Be 20230428
No ratings yet
Be 20230428
8 pages
Essay
No ratings yet
Essay
7 pages
70 433 Question
No ratings yet
70 433 Question
5 pages
Chapter 1 Business
No ratings yet
Chapter 1 Business
52 pages
San Ildefonso College: Table of Specification
No ratings yet
San Ildefonso College: Table of Specification
11 pages
Fa Naveen
No ratings yet
Fa Naveen
7 pages
School Memorandum With Number
No ratings yet
School Memorandum With Number
29 pages
97-680 Multiprime
No ratings yet
97-680 Multiprime
2 pages
Euglena S
No ratings yet
Euglena S
4 pages
Microsilica 92% Dark Grey
No ratings yet
Microsilica 92% Dark Grey
3 pages
Network Administrator or Configuration Manager or Application de
No ratings yet
Network Administrator or Configuration Manager or Application de
2 pages
HDCS DS
No ratings yet
HDCS DS
4 pages
How Much Power
No ratings yet
How Much Power
5 pages
Datasheet SX95
No ratings yet
Datasheet SX95
1 page
MACHINE LEARNING FOR BEGINNERS: A Practical Guide to Understanding and Applying Machine Learning Concepts (2023 Beginner Crash Course)
From Everand
MACHINE LEARNING FOR BEGINNERS: A Practical Guide to Understanding and Applying Machine Learning Concepts (2023 Beginner Crash Course)
Elaine Tate
No ratings yet
The Secret Of Machine Learning
From Everand
The Secret Of Machine Learning
Mhd Arjunanta
No ratings yet
Manufacturing: Engineering, Management and Marketing
From Everand
Manufacturing: Engineering, Management and Marketing
S.O.T Ogaji
No ratings yet
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
From Everand
DATA MINING AND MACHINE LEARNING. PREDICTIVE TECHNIQUES: REGRESSION, GENERALIZED LINEAR MODELS, SUPPORT VECTOR MACHINE AND NEURAL NETWORKS
César Pérez López
No ratings yet
Machine Learning Interview Questions
From Everand
Machine Learning Interview Questions
Tech Interviews
4.5/5 (2)
Top 20 MS Excel VBA Simulations, VBA to Model Risk, Investments, Growth, Gambling, and Monte Carlo Analysis
From Everand
Top 20 MS Excel VBA Simulations, VBA to Model Risk, Investments, Growth, Gambling, and Monte Carlo Analysis
Andrei Besedin
2.5/5 (2)
Machine Learning with Clustering: A Visual Guide for Beginners with Examples in Python
From Everand
Machine Learning with Clustering: A Visual Guide for Beginners with Examples in Python
Artem Kovera
No ratings yet
Applied Predictive Modeling: An Overview of Applied Predictive Modeling
From Everand
Applied Predictive Modeling: An Overview of Applied Predictive Modeling
Steven Taylor
No ratings yet
Process Performance Models: Statistical, Probabilistic & Simulation
From Everand
Process Performance Models: Statistical, Probabilistic & Simulation
Vishnuvarthanan Moorthy
No ratings yet

SHORT BA Mid 2

Uploaded by

SHORT BA Mid 2

Uploaded by

1.

Write some examples of practical applications of multiple regression

2. Define least squares regression.

3. Write down the assumptions for building good regression models.

• K-Nearest Neighbor (KNN) is an algorithm that classifies data based on

6. Write three applications of data exploration.

8. Outline the unsupervised learning?

Unsupervised learning is a branch of machine learning that deals with unlabeled

9. What are the types of Association Rules in Data Mining?

Types of Association Rules in Data Mining

• Multi-relational association rules

10.Differentiate between an antecedent (if) and a consequent?

The same has been discussed in brief in this article.

• an antecedent (if) and

An antecedent is something that’s found in data, and a consequent is

12. Outline the advantages of data partitioning?

13.Define Simulation with real time example.

What is an example of a simulation?

14.Explain random number generation with an example.

Random Number Generation

16.What are the advantages and disadvantages of simulation techniques?

18.What are three application areas of Monte Carlo simulation technique?

You might also like