FDS Apr - May 2024
(Common to: Computer Science and Engineering / Artificial Intelligence and Machine
Learning / Computer Science and Engineering (Cyber Security)/Computer and
Communication Engineering/Electronics and Instrumentation Engineering/
Instrumentation and Control Engineering / Information Technology)
(Regulation 2021)
Time: Three hours    Answer ALL questions    Maximum: 100 marks
PART A — (10 × 2 = 20 marks)
1. How are missing values present in a dataset treated during the data analysis phase?
2. Identify and write down various data analytic challenges faced in the conventional
system.
3. Will treating categorical variables as continuous variables result in a better predictive
model? Justify your answer.
4. Issue: Feeding data which has variables correlated to one another is not a good statistical
practice, since we are giving multiple weightage to the same type of data.
Solution: Correlation Analysis. Show how such issues are prevented by the correlation
analysis technique. Justify with a small instance dataset.
5. State the purpose of adding additional quantitative and/or categorical explanatory
variables to any developed linear regression model. Justify with an example.
6. Give an example of a data set with a non-Gaussian distribution.
7. Under what circumstances is pivot_table() in pandas used?
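A possible answer sketch: pivot_table() is typically used when a long-format table must be summarised along two categorical axes with an aggregate function. The regions, quarters, and sales figures below are illustrative, not from the question.

```python
import pandas as pd

# Hypothetical long-format sales records.
df = pd.DataFrame({
    "region":  ["North", "North", "South", "South"],
    "quarter": ["Q1", "Q2", "Q1", "Q2"],
    "sales":   [100, 150, 80, 120],
})

# Rows = region, columns = quarter, each cell = sum of sales
# for that (region, quarter) combination.
table = pd.pivot_table(df, values="sales", index="region",
                       columns="quarter", aggfunc="sum")
```

Here pivot_table() turns repeated (region, quarter) observations into a compact 2-D summary, which is exactly the circumstance it is designed for.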
8. Using appropriate data visualization modules, develop a Python code snippet that
generates a simple sinusoidal wave on empty gridded axes.
9. Write a Python code snippet that generates a time series graph representing COVID-19
incidence cases for a particular week.
Day 1 Day 2 Day 3 Day 4 Day 5 Day 6 Day 7
7 18 9 44 2 5 89
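A possible answer sketch: the daily case counts are taken from the table above; the figure title and filename are illustrative.

```python
import matplotlib
matplotlib.use("Agg")            # non-interactive backend for scripts
import matplotlib.pyplot as plt

days  = ["Day 1", "Day 2", "Day 3", "Day 4", "Day 5", "Day 6", "Day 7"]
cases = [7, 18, 9, 44, 2, 5, 89]

# Line plot with markers gives a simple time-series view of the week.
fig, ax = plt.subplots()
ax.plot(days, cases, marker="o")
ax.set_title("COVID-19 incidence for one week")
ax.set_xlabel("Day")
ax.set_ylabel("New cases")
fig.savefig("covid_week.png")
```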
69898
10. Write a Python code snippet that draws a histogram for the following list of positive
numbers.
7 18 9 44 2 5 89 91 11 6 77 85 91 6 55
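A possible answer sketch using Matplotlib's hist(); the bin count and filename are illustrative choices.

```python
import matplotlib
matplotlib.use("Agg")            # non-interactive backend for scripts
import matplotlib.pyplot as plt

values = [7, 18, 9, 44, 2, 5, 89, 91, 11, 6, 77, 85, 91, 6, 55]

# hist() bins the values and draws one bar per bin.
fig, ax = plt.subplots()
counts, bins, patches = ax.hist(values, bins=5, edgecolor="black")
ax.set_xlabel("Value")
ax.set_ylabel("Frequency")
fig.savefig("hist.png")
```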
PART B — (5 × 13 = 65 marks)
11. (a). i. Suppose there is a dataset having variables with missing values of more than (6)
30%. How will you deal with such a dataset?
ii. List the various feature selection methods for selecting the right variables (7)
for building efficient predictive models. Explain any two selection
methods.
OR
(b). i. Explain the Data Analytics life cycle. Write briefly about Time-Series Analysis. (6)
ii. Outline the purpose of data cleaning. How are missing and nullified data attributes (7)
handled and modified during the pre-processing stage?
12. (a). i. Indicate whether each of the following distributions is positively or negatively
skewed. The distribution of
(1) incomes of taxpayers has a mean of $48,000 and a median of $43,600. (3)
(2) GPAs for all students at some college has a mean of 3.01 and a median (3)
of 3.20.
ii. During their first swim through a water maze, 15 laboratory rats made the
following number of errors (blind alleyway entrances):
2, 17, 5, 3, 28, 7, 5, 8, 5, 6, 2, 12, 10, 4, 3.
(1) Find the mode, median, and mean for these data. (3)
(2) Without constructing a frequency distribution or graph, would it be (4)
possible to characterize the shape of this distribution as balanced,
positively skewed, or negatively skewed?
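A possible worked sketch for part ii, using the Python standard library on the error counts given above.

```python
from statistics import mean, median, mode

# Errors made by the 15 rats, as listed in the question.
errors = [2, 17, 5, 3, 28, 7, 5, 8, 5, 6, 2, 12, 10, 4, 3]

m_mode   = mode(errors)      # most frequent value: 5 (occurs three times)
m_median = median(errors)    # middle of the sorted list: 5
m_mean   = mean(errors)      # 117 / 15 = 7.8

# mean (7.8) > median (5), so without any graph we can say the
# distribution is positively skewed (a few large values pull the mean up).
```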
OR
(b). i. Assume that SAT math scores approximate a normal curve with a mean of 500
and a standard deviation of 100. Sketch a normal curve and shade in the target
areas described by each of the following statements:
• More than 570 (2)
• Less than 515 (2)
• Between 520 and 540 (2)
• Convert to z scores and find the target areas specific to the above values. (1)
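A possible numerical sketch using the standard library's NormalDist; the upper bound 540 is an assumption (the printed value is garbled), and the probabilities are the shaded target areas.

```python
from statistics import NormalDist

# SAT math scores: normal with mean 500, standard deviation 100.
sat = NormalDist(mu=500, sigma=100)

z_570 = (570 - sat.mean) / sat.stdev      # z = 0.70

p_more_570 = 1 - sat.cdf(570)             # P(X > 570)  ≈ 0.2420
p_less_515 = sat.cdf(515)                 # P(X < 515)  ≈ 0.5596
p_520_540 = sat.cdf(540) - sat.cdf(520)   # P(520 < X < 540) ≈ 0.0761
```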
ii. Assume that the burning times of electric light bulbs approximate a normal
curve with a mean of 1200 hours and a standard deviation of 120 hours. If a
large number of new lights are installed at the same time (possibly along a
newly opened freeway), at what time will
• 1 percent fail? (2)
• 50 percent fail? (2)
• 95 percent fail? (2)
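A possible numerical sketch for the bulb question: each failure percentage is an inverse-normal (percentile) lookup, done here with the standard library's NormalDist.

```python
from statistics import NormalDist

# Bulb lifetimes: normal with mean 1200 hours, standard deviation 120 hours.
life = NormalDist(mu=1200, sigma=120)

t_01 = life.inv_cdf(0.01)   # ≈ 921 hours: by then 1 percent have failed
t_50 = life.inv_cdf(0.50)   # 1200 hours: the mean/median of a normal curve
t_95 = life.inv_cdf(0.95)   # ≈ 1397 hours: by then 95 percent have failed
```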
13. (a). i. In statistics, what is the impact when the goodness-of-fit test score is low? (6)
ii. Given the following dataset of employees, using regression analysis, find the (7)
expected salary of an employee whose age is 45.
Age    : 54     42     49     57     35
Salary : 67000  43000  55000  71000  25000
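A possible answer sketch, assuming the flattened table above reads as the (age, salary) pairs coded below; it fits an ordinary least-squares line with NumPy and predicts the salary at age 45.

```python
import numpy as np

# (age, salary) pairs as assumed from the question's table.
age    = np.array([54, 42, 49, 57, 35], dtype=float)
salary = np.array([67000, 43000, 55000, 71000, 25000], dtype=float)

# Least-squares fit of salary = b0 + b1 * age (degree-1 polynomial).
b1, b0 = np.polyfit(age, salary, 1)

# Expected salary at age 45 (comes out near 47,200 for this data).
pred_45 = b0 + b1 * 45
```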
OR
(b). i. Define auto-correlation. How is it calculated? What does a negative (6)
correlation convey?
ii. What is the philosophy of logistic regression? What kind of model is it? What (7)
does Logistic Regression predict? Tabulate the cardinal differences between Linear
and Logistic Regression.
14. (a). i. Define Dictionary in Python. Do the following operations on dictionaries: (3)
Initialize two dictionaries (D1 and D2) with key-value pairs.
ii. Compare the two dictionaries against a master key list 'M' and print the result. (3)
iii. Find keys that are in D1 but NOT in D2. (3)
iv. Merge D1 and D2 to create D3 using expressions. (4)
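A possible answer sketch covering all four parts; the keys and values in D1, D2, and M are illustrative.

```python
# i. Hypothetical dictionaries with key-value pairs.
D1 = {"a": 1, "b": 2, "c": 3}
D2 = {"b": 20, "d": 40}
M  = ["a", "b", "c", "d", "e"]          # master key list

# ii. Which master keys appear in each dictionary.
in_d1 = [k for k in M if k in D1]       # ['a', 'b', 'c']
in_d2 = [k for k in M if k in D2]       # ['b', 'd']
print(in_d1, in_d2)

# iii. Keys in D1 but NOT in D2, via set difference.
only_d1 = set(D1) - set(D2)             # {'a', 'c'}

# iv. Merge with a dict expression; D2's value wins on duplicate keys.
D3 = {**D1, **D2}
```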
OR
(b). i. How do you create hierarchical data from an existing data frame? (6)
ii. How do you use groupby with two columns in a data set? Give a Python code snippet. (7)
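A possible answer sketch for part ii: grouping on a list of two columns produces one group per unique pair of values. The department/year/marks data is illustrative.

```python
import pandas as pd

# Hypothetical dataset: marks recorded per (department, year) pair.
df = pd.DataFrame({
    "dept":  ["CSE", "CSE", "IT", "IT", "CSE"],
    "year":  [1, 2, 1, 1, 1],
    "marks": [80, 90, 70, 60, 88],
})

# Passing a list of two columns to groupby() keys each group on the
# unique (dept, year) combination; the result has a two-level index.
g = df.groupby(["dept", "year"])["marks"].sum()
```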
15. (a) Write a code snippet that projects our globe as a 2-D flat surface (using a (13)
cylindrical projection) and conveys information about the location of any three
major Indian cities on the map (using a scatter plot).
OR
(b). i. Write a working code that performs a simple Gaussian process regression (6)
(GPR), using the Scikit-Learn API.
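A possible answer sketch for part i: a minimal GPR with an RBF kernel fitted to noise-free samples of a sine curve. The training range, seed, and kernel length scale are illustrative choices.

```python
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import RBF

# Noise-free training data drawn from a sine curve.
rng = np.random.default_rng(0)
X_train = rng.uniform(0, 5, size=(25, 1))
y_train = np.sin(X_train).ravel()

# Fit the GP; the RBF kernel encodes smoothness of the target function.
gpr = GaussianProcessRegressor(kernel=RBF(length_scale=1.0))
gpr.fit(X_train, y_train)

# Predict on a grid, also requesting the per-point predictive std. dev.
X_test = np.linspace(0, 5, 50).reshape(-1, 1)
y_pred, y_std = gpr.predict(X_test, return_std=True)
```

With noise-free data the GP interpolates the training points exactly, and y_std shrinks to near zero close to them.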
ii. Briefly explain about visualization with Seaborn. Give an example working (7)
code segment that represents a 2D kernel density plot for any data.
PART C — (1 × 15 = 15 marks)
16. (a). Given an unsorted multi-index that represents the distance between two cities, (15)
write a Python code snippet using appropriate libraries to find the
distance between any two given cities. The following matrix representation can
be used to create the data frame that serves as the input for the prescribed
program.
A B C D E
A 0 30 24 6 13
B 16 0 19 5 10
C 7 16 0 15 12
D 9 17 22 0 18
E 21 8 9 11 0
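A possible answer sketch: build the DataFrame from the matrix above, stack it into a MultiIndex Series, sort the (unsorted) index, and look pairs up with .loc.

```python
import pandas as pd

# Distance matrix from the question (rows = origin, columns = destination).
cities = list("ABCDE")
dist = pd.DataFrame(
    [[0, 30, 24, 6, 13],
     [16, 0, 19, 5, 10],
     [7, 16, 0, 15, 12],
     [9, 17, 22, 0, 18],
     [21, 8, 9, 11, 0]],
    index=cities, columns=cities)

# stack() turns the matrix into a Series keyed by the (origin, destination)
# MultiIndex; sorting the index makes tuple lookups efficient and safe.
pairs = dist.stack().sort_index()

d_ad = pairs.loc[("A", "D")]    # distance from A to D: 6
d_ec = pairs.loc[("E", "C")]    # distance from E to C: 9
```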
OR
(b). A URL server wants to consolidate a history of websites visited by a user 'U'. (15)
Every website visit's information is stored in a 2-tuple format, viz. (website_id,
Duration_of_visit), in the URL cache. Using split, apply, and combine operations,
devise a code snippet that consolidates the website history and finds the
website whose duration of visit is maximum.
Example :
Input: [(4,2), (5,1), (4,3), (1,4), (7,3), (5,2), (1,1), (7,1)]
Output: [(4,5), (5,3), (1,5), (7,4)].
The website with key_id '1' has the max. duration of visit = 5.
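A possible answer sketch using pandas groupby, which is the split-apply-combine idiom: split the visits by website_id, apply a sum over each group's durations, and combine the results back into one Series.

```python
import pandas as pd

# Visit history from the question: (website_id, duration_of_visit) tuples.
visits = [(4, 2), (5, 1), (4, 3), (1, 4), (7, 3), (5, 2), (1, 1), (7, 1)]

df = pd.DataFrame(visits, columns=["website_id", "duration"])

# Split by website_id, apply sum, combine; sort=False keeps first-seen order.
totals = df.groupby("website_id", sort=False)["duration"].sum()

consolidated = list(totals.items())   # [(4, 5), (5, 3), (1, 5), (7, 4)]
max_duration = totals.max()           # 5 (note: ids 4 and 1 tie at 5)
busiest = totals.idxmax()             # first id reaching that maximum
```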