Cse2026 - Important Questions
Cse2026 - Important Questions
PART A
PART-B
2. Demonstrate at least five data preprocessing steps by taking any sample dataset (using
python).
3. Define data validation & Illustrate the levels on it.
4. Define data. List and explain Data in the Real World and Compare data vs. information
5. List the advantages of data preparation and explain key steps in data preparation.
6. Explain Plotting Using pandas DataFrames, Displaying Figures and Saving Figures in
Matplotlib.
7. Describe visual variables. Explain in detail all the types of visual variables in visualization.
8. Explain space filling and nonspace filling methods in detail with respect to treemaps,
sunburst displays and node-link graphs with diagrams.
9. Define Data Science. Describe in detail with suitable example.
11. Explain in detail data wrangling using own dataset& List the disadvantage of data with high
dimensionality.
12. Explain how PCA helps in reducing dimensions. Use the python code to reduce the
dimensions of breast cancer data to predict 'Benign' or 'Malignant'
13. Explain the tools and libraries used for visualization with appropriate examples.
14. Explain Multivariate Data. Compare multivariate data with multidimensional with example.
15. Use the python code to visualize different types of Multivariate Data by considering
appropriate data.
16. Explain the role of data cleaning in Exploratory Data Analysis (EDA). Describe the different
ways of cleaning the data.
17. Explain different visualization techniques using matplotlib library and iris dataset
(minimum of 5 visualization methods)
18. Explain about vector visualization techniques with own examples.
19. Explain the different types of mapping color models
20. Demonstrate the following operations with respect to time series data(using python):
i) Indexing and selection
ii) data ranges, frequencies and shifting
21. Classify any four types of multivariate data visualization in detail with diagrams.
22. Classify the spatial data visualization technology with their category and explain time series
of data visualization with example.
23. Define time oriented data. List the characteristics of time oriented data. Use a python code
to visualize the time oriented data
24. Apply how text data visualization assists in sentiment analysis. Provide a detailed example,
including the types of visualizations that could be used to represent data extracted from
social media feeds.
25. Determine how geospatial data is a spatial type of spatial data. Compare raster and vector
spatial data.
26. Use datetime package to visualize and analyze the stock market data with feature set of
('Open', 'High', 'Low', 'Close', 'Volume').
27. Use python code to visualize the streaming data by considering random data point in 2D
space.
28. Explain data-stream-management system with a neat architectural diagram. Compare DSMS
with DBMS
29. Use python code to visualize the streaming data by considering random data point in 2D
space
30. Describe a comprehensive discussion on the dos and don’ts of data visualization, using
examples from election result reporting.
31. Illustrate how these principles affect public perception and understanding of the results,
and suggest improvements based on recent technological advancements in visualization
tools.