0% found this document useful (0 votes)

39 views5 pages

Day 3 Statistics Interview QnA

Uploaded by

spandushetty28

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

39 views5 pages

Day 3 Statistics Interview QnA

Uploaded by

spandushetty28

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 5

### Descriptive Statistics

What is the mean of the dataset: 3, 7, 8, 10, 12?

The mean is (3 + 7 + 8 + 10 + 12) / 5 = 8.

How do you calculate the median of a dataset?

Sort the data, and the median is the middle value (or the average of the two middle values if
the dataset has an even number of observations).

What is the mode of the dataset: 4, 1, 2, 4, 3, 4, 5?

The mode is 4, as it appears most frequently.

What is standard deviation?

Standard deviation measures the amount of variation or dispersion in a set of values.

Define variance.
Variance is the average of the squared differences from the mean.

What is a percentile?
A percentile indicates the relative standing of a value within a dataset, showing the
percentage of observations below it.

How is the interquartile range (IQR) calculated?

IQR is the difference between the first quartile (Q1) and the third quartile (Q3).

What does a box plot represent?

A box plot shows the distribution of data based on five summary statistics: minimum, first
quartile, median, third quartile, and maximum.

What is a skewed distribution?

A skewed distribution is one where values are not symmetrically distributed around the mean,
often with a tail on one side.

How do you identify outliers in a dataset?

Outliers can be identified using IQR: values below Q1 - 1.5*IQR or above Q3 + 1.5*IQR.

### Inferential Statistics

What is a hypothesis test?

A hypothesis test is a method to determine if there is enough evidence to reject a null
hypothesis.

What is a p-value?
The p-value indicates the probability of observing the test results under the null hypothesis.
What does a confidence interval represent?
A confidence interval estimates a range of values that likely contains the population
parameter.

What is the difference between Type I and Type II errors?

Type I error is rejecting a true null hypothesis; Type II error is failing to reject a false null
hypothesis.

Explain what a t-test is.

A t-test compares the means of two groups to determine if they are statistically different from
each other.

What is ANOVA used for?

ANOVA (Analysis of Variance) is used to compare means among three or more groups.

What is the central limit theorem?

The central limit theorem states that the sampling distribution of the sample mean
approaches a normal distribution as sample size increases.

Define correlation.
Correlation measures the strength and direction of a linear relationship between two
variables.

What is a chi-square test?

A chi-square test assesses how expectations compare to actual observed data in categorical
variables.

What is a Z-score?
A Z-score indicates how many standard deviations an element is from the mean.

### Probability

What is the difference between independent and dependent events?

Independent events do not affect each other's probabilities; dependent events do.

Define conditional probability.

Conditional probability is the probability of an event occurring given that another event has
already occurred.

What is a probability distribution?

A probability distribution describes how probabilities are distributed over the values of a
random variable.
What is a normal distribution?
A normal distribution is a continuous probability distribution characterized by a symmetric bell
shape.

What is the law of large numbers?

The law of large numbers states that as a sample size increases, the sample mean will
converge to the population mean.

What is a Bernoulli trial?

A Bernoulli trial is an experiment or process that results in a binary outcome: success or
failure.

Define joint probability.

Joint probability is the probability of two events happening at the same time.

What is the difference between discrete and continuous random variables?

Discrete random variables take on countable values; continuous random variables can take
on any value within a range.

Explain Bayes’ theorem.

Bayes’ theorem describes how to update the probability of a hypothesis based on new
evidence.

What is the expected value?

The expected value is the average outcome of a random variable when an experiment is
repeated many times.

### Regression Analysis

What is linear regression?

Linear regression models the relationship between a dependent variable and one or more
independent variables by fitting a linear equation.

What does R-squared represent?

R-squared indicates the proportion of variance in the dependent variable that can be
explained by the independent variables.

What is multicollinearity?
Multicollinearity occurs when two or more independent variables in a regression model are
highly correlated.

How do you interpret coefficients in a regression model?

Coefficients indicate the change in the dependent variable for a one-unit change in the
independent variable, holding other variables constant.
What is logistic regression used for?
Logistic regression is used to model binary outcome variables.

What is overfitting in a model?

Overfitting occurs when a model learns the noise in the training data rather than the
underlying pattern.

Explain the concept of residuals.

Residuals are the differences between observed values and predicted values from a
regression model.

What is a regression assumption?

Regression assumptions are the conditions that must be met for regression results to be
valid (e.g., linearity, independence, homoscedasticity).

What is the difference between simple and multiple regression?

Simple regression involves one independent variable, while multiple regression involves two
or more independent variables.

What is the purpose of a scatter plot in regression analysis?

A scatter plot visually shows the relationship between two variables, helping to identify
potential correlations.

### Advanced Topics

What is time series analysis?

Time series analysis involves statistical techniques to analyze time-ordered data points.

Explain what a confounding variable is.

A confounding variable is an outside influence that affects both the independent and
dependent variables, potentially misleading results.

What is cross-validation?
Cross-validation is a technique for assessing how the results of a statistical analysis will
generalize to an independent dataset.

Define non-parametric tests.

Non-parametric tests are statistical tests that do not assume a specific distribution for the
data.

What is the difference between a sample and a population?

A population includes all members of a defined group, while a sample is a subset of the
population.
What is sampling bias?
Sampling bias occurs when the sample is not representative of the population, leading to
incorrect conclusions.

Explain the concept of power in hypothesis testing.

Power is the probability that a test correctly rejects a false null hypothesis.

What is bootstrapping?
Bootstrapping is a resampling technique used to estimate the distribution of a statistic by
repeatedly sampling with replacement.

What is a survival analysis?

Survival analysis is used to analyze the time until an event occurs, often used in medical
research.

What is a control chart?

A control chart is a statistical tool used to monitor and control a process over time.

### Application and Interpretation

How can you visualize data distribution?

Data distribution can be visualized using histograms, box plots, or density plots.

What is the importance of data cleaning?

Data cleaning ensures accuracy and consistency, leading to valid analysis results.

What is A/B testing?

A/B testing compares two versions of a variable to determine which one performs better.

James R. Evans - Statistics, Data Analysis and Decision Modeling International 5th Ed.-Pearson (2013)
86% (14)
James R. Evans - Statistics, Data Analysis and Decision Modeling International 5th Ed.-Pearson (2013)
543 pages
Bmsi Solved Past Papers April Updated
No ratings yet
Bmsi Solved Past Papers April Updated
69 pages
C207 Study Guide
No ratings yet
C207 Study Guide
27 pages
SQL Interview Questions Goldman Sachs
No ratings yet
SQL Interview Questions Goldman Sachs
19 pages
50 Important Statistics' Q & A To Crack DS Interview
No ratings yet
50 Important Statistics' Q & A To Crack DS Interview
14 pages
Fact 2
No ratings yet
Fact 2
6 pages
Final Stats Intrerview Q&A
No ratings yet
Final Stats Intrerview Q&A
12 pages
Statistics
No ratings yet
Statistics
13 pages
Real Statistics Using Excel - Examples Workbook Charles Zaiontz, 9 April 2015
No ratings yet
Real Statistics Using Excel - Examples Workbook Charles Zaiontz, 9 April 2015
1,595 pages
Screenshot 2024-12-15 at 8.15.38 PM
No ratings yet
Screenshot 2024-12-15 at 8.15.38 PM
138 pages
Descriptive Statistics Is That Branch of Statistics Which Is Concerned With Describing The Population Under Study
No ratings yet
Descriptive Statistics Is That Branch of Statistics Which Is Concerned With Describing The Population Under Study
19 pages
Data Science Interview Questions - 1
No ratings yet
Data Science Interview Questions - 1
55 pages
DA Notes
No ratings yet
DA Notes
15 pages
Data Analytics Visualization Oral QA
No ratings yet
Data Analytics Visualization Oral QA
2 pages
Statistics Practise Questions
No ratings yet
Statistics Practise Questions
19 pages
Solution Manual For Statistics Data Analysis and Decision Modeling 5th Edition Evans 0132744287 9780132744287
100% (51)
Solution Manual For Statistics Data Analysis and Decision Modeling 5th Edition Evans 0132744287 9780132744287
7 pages
Statistics
No ratings yet
Statistics
7 pages
It0089 Finalreviewer
100% (1)
It0089 Finalreviewer
143 pages
STAT100 - Full Course Notes
No ratings yet
STAT100 - Full Course Notes
27 pages
Basicof Stats
No ratings yet
Basicof Stats
7 pages
Type II Error
No ratings yet
Type II Error
6 pages
SCS3250A - Module 1 - Introduction To Statistics and Analytics
No ratings yet
SCS3250A - Module 1 - Introduction To Statistics and Analytics
44 pages
FDS Sem5
No ratings yet
FDS Sem5
15 pages
Descriptive Statistics
No ratings yet
Descriptive Statistics
63 pages
Unit 2
No ratings yet
Unit 2
20 pages
Chapter2-Statistical Analysis
No ratings yet
Chapter2-Statistical Analysis
86 pages
Unit II TYCS DS
No ratings yet
Unit II TYCS DS
176 pages
Machine Learning (1) : Inteligência Artificial E Cibersegurança (Inacs)
No ratings yet
Machine Learning (1) : Inteligência Artificial E Cibersegurança (Inacs)
33 pages
Statistical Data Science
No ratings yet
Statistical Data Science
5 pages
Regression
No ratings yet
Regression
86 pages
Unit-2 Data Analytics Approaches
No ratings yet
Unit-2 Data Analytics Approaches
24 pages
Statistics - Compendium - DMS IIT DELHI - 2025
No ratings yet
Statistics - Compendium - DMS IIT DELHI - 2025
18 pages
Data Science by CFA
No ratings yet
Data Science by CFA
27 pages
Das FFFF
No ratings yet
Das FFFF
16 pages
Datascience Interview
100% (1)
Datascience Interview
31 pages
Cheat Sheet
No ratings yet
Cheat Sheet
3 pages
Notes Data Analytics
No ratings yet
Notes Data Analytics
19 pages
Lecture 4 - Data Science Statistics
No ratings yet
Lecture 4 - Data Science Statistics
21 pages
Introduction To Data Analysis: Professor David Richardson IIT Stuart School of Business
No ratings yet
Introduction To Data Analysis: Professor David Richardson IIT Stuart School of Business
31 pages
Questions and Answers
No ratings yet
Questions and Answers
5 pages
Bussiness Statistics Book
No ratings yet
Bussiness Statistics Book
5 pages
Final Stats Intrerview Q&A
No ratings yet
Final Stats Intrerview Q&A
20 pages
Statistics Syllabus
No ratings yet
Statistics Syllabus
4 pages
Introduction To Statistics
No ratings yet
Introduction To Statistics
54 pages
Bocalig Act5 MMW
No ratings yet
Bocalig Act5 MMW
6 pages
Comprehensive Ebook of Statistics For Data Science - Chaitali
No ratings yet
Comprehensive Ebook of Statistics For Data Science - Chaitali
21 pages
CBSE Sample Papers For Class 2 Maths - Mock Paper 1
50% (2)
CBSE Sample Papers For Class 2 Maths - Mock Paper 1
7 pages
Document 8
No ratings yet
Document 8
10 pages
Interview Questions
No ratings yet
Interview Questions
225 pages
Statistics For Data Analysis
No ratings yet
Statistics For Data Analysis
13 pages
SB-5 - OSRS - SampleDetailedSummaryReport PDF
100% (1)
SB-5 - OSRS - SampleDetailedSummaryReport PDF
3 pages
ISDS 361A - Cheat Sheet Exam 1 PDF
No ratings yet
ISDS 361A - Cheat Sheet Exam 1 PDF
2 pages
FIN10002 - Notes Master
No ratings yet
FIN10002 - Notes Master
44 pages
Ecology - Sampling Techniques
100% (1)
Ecology - Sampling Techniques
25 pages
Data Science Interview Questions and Answer
100% (1)
Data Science Interview Questions and Answer
41 pages
Statistics For Data Analytics
No ratings yet
Statistics For Data Analytics
15 pages
Lecture Sheet For SPSS
100% (1)
Lecture Sheet For SPSS
29 pages
Calculate Mean, Median, Mode, Variance and Standard Deviation For Column A
No ratings yet
Calculate Mean, Median, Mode, Variance and Standard Deviation For Column A
22 pages
Data Analysis and Decision Making PDF
No ratings yet
Data Analysis and Decision Making PDF
97 pages
Statistics 1: 2 Marks
No ratings yet
Statistics 1: 2 Marks
5 pages
Biogas Power Plant
50% (2)
Biogas Power Plant
7 pages
AP Stat Spring Pacing
No ratings yet
AP Stat Spring Pacing
4 pages
DLP in Limits of Exponential and Logarithmic Functions
No ratings yet
DLP in Limits of Exponential and Logarithmic Functions
24 pages
Passage Practice Sheet by Latifurs
No ratings yet
Passage Practice Sheet by Latifurs
34 pages
Cambridge O Level: Mathematics (Syllabus D) 4024/11
No ratings yet
Cambridge O Level: Mathematics (Syllabus D) 4024/11
20 pages
Biology Project Colour Blindness
No ratings yet
Biology Project Colour Blindness
20 pages
Cec 410
No ratings yet
Cec 410
38 pages
Conformity and Deviance
No ratings yet
Conformity and Deviance
32 pages
Practical Data Analysis With JMP
No ratings yet
Practical Data Analysis With JMP
8 pages
ERM 200 Series Info Produit E@
No ratings yet
ERM 200 Series Info Produit E@
16 pages
Essay On Indian Army
100% (2)
Essay On Indian Army
6 pages
Content Standard Performance Standard
No ratings yet
Content Standard Performance Standard
17 pages
Ch01 - Projects in Contemporary Organizations
No ratings yet
Ch01 - Projects in Contemporary Organizations
57 pages
CLAR16024A-SDS-EN-United Kingdom
No ratings yet
CLAR16024A-SDS-EN-United Kingdom
11 pages
Name Compatibility As Per Numerology
No ratings yet
Name Compatibility As Per Numerology
1 page
Ammendment IKSA CSR 24
No ratings yet
Ammendment IKSA CSR 24
4 pages
Privilege Speech Msabella
No ratings yet
Privilege Speech Msabella
3 pages
Lec 01
No ratings yet
Lec 01
10 pages
03 - Exponential Equations With Logarithms
No ratings yet
03 - Exponential Equations With Logarithms
4 pages
Prosper Chikanyire Final Project Esh 2017 Chapters 1,2,3,4,5
No ratings yet
Prosper Chikanyire Final Project Esh 2017 Chapters 1,2,3,4,5
92 pages
Invisibility: "Invisible" Redirects Here. For Other Uses, See
No ratings yet
Invisibility: "Invisible" Redirects Here. For Other Uses, See
4 pages
Ibis Styles Hotel Presentation
No ratings yet
Ibis Styles Hotel Presentation
21 pages
Lecture 3: The Canonical Ensemble: 3.1 Recommended Textbook Chapters For This Section
No ratings yet
Lecture 3: The Canonical Ensemble: 3.1 Recommended Textbook Chapters For This Section
8 pages
Mabel Chidinma Onu SOP
No ratings yet
Mabel Chidinma Onu SOP
1 page
Ee 420: Final Examination Submit On Blackboard by 5PM On December 16, 2020 Maximum Points: 150
No ratings yet
Ee 420: Final Examination Submit On Blackboard by 5PM On December 16, 2020 Maximum Points: 150
4 pages
Worksheet 3-Optimizing - Cost-and-Profit
No ratings yet
Worksheet 3-Optimizing - Cost-and-Profit
4 pages
AIW Unit Plan - Ind. Tech Example
No ratings yet
AIW Unit Plan - Ind. Tech Example
4 pages
Institut Agama Islam Banten (Iaib) Serang - Banten: TAHUN AKADEMIK 2020/2021
No ratings yet
Institut Agama Islam Banten (Iaib) Serang - Banten: TAHUN AKADEMIK 2020/2021
2 pages
D116 Check CraneFS DS 1021 p54
No ratings yet
D116 Check CraneFS DS 1021 p54
1 page
Biostatistics Explored Through R Software: An Overview
From Everand
Biostatistics Explored Through R Software: An Overview
Vinaitheerthan Renganathan
3.5/5 (2)

Day 3 Statistics Interview QnA

Uploaded by

Day 3 Statistics Interview QnA

Uploaded by

### Descriptive Statistics

What is the mean of the dataset: 3, 7, 8, 10, 12?

How do you calculate the median of a dataset?

What is the mode of the dataset: 4, 1, 2, 4, 3, 4, 5?

What is standard deviation?

How is the interquartile range (IQR) calculated?

What does a box plot represent?

What is a skewed distribution?

How do you identify outliers in a dataset?

### Inferential Statistics

What is a hypothesis test?

What is the difference between Type I and Type II errors?

Explain what a t-test is.

What is ANOVA used for?

What is the central limit theorem?

What is a chi-square test?

What is the difference between independent and dependent events?

Define conditional probability.

What is a probability distribution?

What is the law of large numbers?

What is a Bernoulli trial?

Define joint probability.

What is the difference between discrete and continuous random variables?

Explain Bayes’ theorem.

What is the expected value?

### Regression Analysis

What is linear regression?

What does R-squared represent?

How do you interpret coefficients in a regression model?

What is overfitting in a model?

Explain the concept of residuals.

What is a regression assumption?

What is the difference between simple and multiple regression?

What is the purpose of a scatter plot in regression analysis?

### Advanced Topics

What is time series analysis?

Explain what a confounding variable is.

Define non-parametric tests.

What is the difference between a sample and a population?

Explain the concept of power in hypothesis testing.

What is a survival analysis?

What is a control chart?

### Application and Interpretation

How can you visualize data distribution?

What is the importance of data cleaning?

What is A/B testing?

You might also like