CS 3361 Set 2

The document outlines the practical examination details for the Data Science Laboratory course at DMI College of Engineering for the third semester, including the exam duration, marking scheme, and a list of programming tasks using NumPy and Pandas. Students are required to answer one question from the provided list, which covers various data manipulation and analysis techniques. The tasks include creating arrays, performing statistical analysis, and visualizing data using plots.

Uploaded by

rajdmice

DMI COLLEGE OF ENGINEERING

(AN AUTONOMOUS INSTITUTION)


APPROVED BY AICTE, AFFILIATED TO ANNA UNIVERSITY,

ACCREDITED BY NBA, ISO CERTIFIED INSTITUTION

PALANCHUR – NAZARATHPET P.O., CHENNAI – 600 123

B.E. / B.Tech. PRACTICAL END SEMESTER EXAMINATIONS, NOVEMBER/DECEMBER 2024


Third Semester
CS3361 - DATA SCIENCE LABORATORY
(Regulations 2021)

Time: 3 Hours          Answer any one Question          Max. Marks: 100

Aim/Principle/Apparatus required/Procedure     20
Tabulation/Circuit/Program/Drawing             30
Calculation & Results                          30
Viva-Voce                                      10
Record                                         10
Total                                         100

1. i. Write a NumPy program to convert a list and a tuple into arrays.

ii. Write a NumPy program to convert the values of Centigrade degrees into Fahrenheit
degrees and vice versa. Values have to be stored in a NumPy array.

2. i. Write a NumPy program to find the real and imaginary parts of an array of complex
numbers.

ii. Write a NumPy program to convert a NumPy array into a CSV file.

3. i. Write a NumPy program to perform the basic arithmetic operations.

ii. Write a NumPy program to transpose an array.

4. i. Using NumPy, create an array with 5 dimensions and verify that it has 5 dimensions.

ii. Using NumPy, sort a boolean array.

5. i. Create your own simple Pandas DataFrame and print its values.

ii. Create your own DataFrame from a dict of ndarrays/lists.

6. Perform appending, slicing, addition and deletion of rows with a Pandas DataFrame.
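
A minimal sketch of the four row operations named in question 6, offered as one possible approach rather than a model answer. The column names and values are hypothetical, since the question paper does not prescribe a data set. It assumes pandas 2.x, where `DataFrame.append` is no longer available and `pd.concat` is used instead.

```python
import pandas as pd

# Hypothetical sample data; the question paper does not prescribe any values.
df = pd.DataFrame({"name": ["Asha", "Bala", "Chitra"], "marks": [72, 85, 91]})

# Appending: DataFrame.append was removed in pandas 2.0, so use pd.concat.
extra = pd.DataFrame({"name": ["Deepa", "Elan"], "marks": [64, 78]})
appended = pd.concat([df, extra], ignore_index=True)

# Slicing: positional rows 1 up to (but not including) 3.
sliced = appended.iloc[1:3]

# Addition: assign a new row at the next free index label.
added = appended.copy()
added.loc[len(added)] = ["Farook", 88]

# Deletion: drop rows by index label; drop() returns a new DataFrame.
deleted = added.drop(index=[0, 2])

print(appended, sliced, added, deleted, sep="\n\n")
```

Each step builds a new DataFrame instead of modifying the previous one, so the effect of every operation can be printed and compared side by side.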
7. i. Using Pandas, create a DataFrame with a list of dictionaries, row indices, and column indices.

ii. Use index labels to delete or drop rows from a Pandas DataFrame.

8. Apply and explore various plotting functions on UCI data sets.

9. Compare the results of the univariate and bivariate analysis for the UCI diabetes data set.

10. Using the Pandas library,

i. Load the iris.CSV file.

ii. Convert it into a DataFrame and read it.

iii. Display only the records with species "Iris-setosa".

11. Using the diabetes data set from UCI, perform univariate analysis.

12. Using the Pima Indians Diabetes data set, perform bivariate analysis.

13. Perform multiple regression analysis on your own dataset (for example, a car dataset with
the columns Company Name, Model, Volume, Weight and CO2), using more than one independent
variable to predict a value based on two or more variables.

14. i. Using various functions in the NumPy library, mathematically calculate the values for a normal
distribution and create histograms to plot the probability distribution curve.

ii. Using the plt.contour(), plt.contourf(), plt.imshow(), plt.colorbar() and plt.clabel() functions,
visualize a contour plot.

15. i. Using the “concrete strength” dataset, explore relationships between two continuous
variables with scatter plots.

ii. Draw a scatter plot for the following Pandas DataFrame, with Team name on the x axis and
Rank Points on the y axis:

["Australia", 2500], ["Bangladesh", 1000], ["England", 2000], ["India", 3000], ["Srilanka", 1500]

16. Make a three-dimensional plot with 50 randomly generated data points for x, y, and z. Set the
point color to red and the point size to 50.
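
The multiple regression asked for in question 13 can be sketched with NumPy alone, as an ordinary least squares fit. Everything in the data below is an assumption made for illustration: the volume and weight figures are hypothetical, and the CO2 values are synthetic, generated from a known linear rule so that the fitted coefficients can be checked against it. A real answer would load an actual car data set, and a library such as scikit-learn could be used in place of `np.linalg.lstsq`.

```python
import numpy as np

# Hypothetical car data (not from the question paper): engine volume in cm^3
# and weight in kg for six cars.
volume = np.array([1000, 1200, 1500, 1600, 2000, 2200], dtype=float)
weight = np.array([790, 1160, 1100, 1365, 1400, 1600], dtype=float)

# Synthetic CO2 values built from a known linear rule, so the fitted
# coefficients can be compared against the rule.
co2 = 50.0 + 0.02 * volume + 0.03 * weight

# Design matrix: a column of ones for the intercept plus the two predictors.
X = np.column_stack([np.ones_like(volume), volume, weight])

# Ordinary least squares fit of co2 = b0 + b1 * volume + b2 * weight.
coef, residuals, rank, _ = np.linalg.lstsq(X, co2, rcond=None)
intercept, b_volume, b_weight = coef

# Predict CO2 for a new car with volume 1300 and weight 1000.
predicted = intercept + b_volume * 1300 + b_weight * 1000
print(coef, predicted)
```

Because the synthetic data are noise-free, the fit should recover the rule's coefficients up to floating-point error; with real measurements the coefficients would only approximate the underlying relationship.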

17. How will you plot and visualize geographical data with the help of Basemap? State the procedure
for it with an example.

18. Read data from text files, Excel and the web, and explore various commands for
doing descriptive analytics on the Iris data set.

19. Perform univariate analysis with the following pandas DataFrame:

'points': [1, 1, 2, 3.5, 4, 4, 4, 5, 5, 6.5, 7, 7.4, 8, 13, 14.2]

'assists': [5, 7, 7, 9, 12, 9, 9, 4, 6, 8, 8, 9, 3, 2, 6]

'rebounds': [11, 8, 10, 6, 6, 5, 9, 12, 6, 6, 7, 8, 7, 9, 15]

20. Perform Bivariate analysis using the pandas DataFrame that contains information about two
variables: (1) Hours spent studying and (2) Exam score received by 20 different students:

INTERNAL EXAMINER EXTERNAL EXAMINER
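
As a rough sketch of the univariate analysis requested in question 19, the snippet below summarizes each column on its own using the data given in the question. The choice of statistics (describe, median, mode, range) is an assumption about what a reasonable answer would cover; histograms or box plots of each column would normally accompany these figures.

```python
import pandas as pd

# Data copied from question 19.
df = pd.DataFrame({
    "points": [1, 1, 2, 3.5, 4, 4, 4, 5, 5, 6.5, 7, 7.4, 8, 13, 14.2],
    "assists": [5, 7, 7, 9, 12, 9, 9, 4, 6, 8, 8, 9, 3, 2, 6],
    "rebounds": [11, 8, 10, 6, 6, 5, 9, 12, 6, 6, 7, 8, 7, 9, 15],
})

# Univariate analysis examines one variable at a time.
summary = df.describe()            # count, mean, std, min, quartiles, max
medians = df.median()              # middle value of each column
modes = df.mode().iloc[0]          # first (smallest) mode of each column
value_range = df.max() - df.min()  # spread between largest and smallest value

print(summary)
print(medians, modes, value_range, sep="\n\n")
```

Each result is indexed by column name, so a single statistic can be read with, for example, `medians["points"]`.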
