0% found this document useful (0 votes)
17 views1 page

Data Science 101

The document outlines essential topics in data science, including definitions and differences between data types, the importance of data cleaning, and key steps in the data science workflow. It covers fundamental concepts in statistics and probability, machine learning basics, common algorithms, and evaluation metrics. Additionally, it provides instructions for students to submit their handwritten answer sheets for certification.

Uploaded by

aryanrajput20039
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
17 views1 page

Data Science 101

The document outlines essential topics in data science, including definitions and differences between data types, the importance of data cleaning, and key steps in the data science workflow. It covers fundamental concepts in statistics and probability, machine learning basics, common algorithms, and evaluation metrics. Additionally, it provides instructions for students to submit their handwritten answer sheets for certification.

Uploaded by

aryanrajput20039
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 1

Topic: Data Science Essentials

Fundamentals:

1. Define Data Science in your own words.


2. What is the difference between structured and unstructured data? Provide an example of each.
3. Explain the importance of data cleaning in the data science process.
4. What are the key steps involved in a typical data science workflow?
5. Differentiate between exploratory data analysis (EDA) and confirmatory data analysis.

Statistics and Probability:

1. Define mean, median, and mode. When is the median a better measure of central tendency than the mean?
2. What is standard deviation, and what does it tell you about a dataset?
3. Explain the concept of correlation. What is the difference between positive and negative correlation?
4. What is a probability distribution? Give an example of a common discrete probability distribution.
5. State the Central Limit Theorem and explain its significance in statistics.

Machine Learning Basics:

1. What is the difference between supervised and unsupervised learning? Provide an example of each.
2. Explain the concept of features and labels in supervised learning.
3. What is the purpose of splitting data into training and testing sets?
4. Define overfitting and underfitting in the context of machine learning models.
5. What is the bias-variance tradeoff?

Common Algorithms (Briefly Explain):

1. Briefly describe the K-Nearest Neighbors (KNN) algorithm.


2. What is a decision tree, and how does it make predictions?
3. Explain the basic idea behind linear regression.
4. What is the goal of clustering algorithms? Give an example of a clustering algorithm.
5. Briefly describe the concept of a neural network.

Evaluation and Metrics:

1. For a binary classification problem, what do True Positives, True Negatives, False Positives, and False Negatives
represent?
2. Define accuracy, precision, and recall. When might precision be more important than recall?
3. What is the F1-score, and why is it often a useful metric?
4. For a regression problem, what is Mean Squared Error (MSE)?
5. Explain the concept of cross-validation and why it's used.

Each Question Carrying: 2 Marks


Minimum 30% is Required to get Certificate

Instruction: All Students are requested to Mention Exact Name, Department of Study , University Roll Number,
Subject Name Properly Otherwise Answer-sheet will be canceled.

After Successful Completion of the Program Kindly Send Hand Written Answer-sheet to
[email protected]

You might also like