Viva Answers
Viva Answers
Data science involves analyzing and interpreting complex data using scientific methods,
Python is versatile, has extensive libraries (e.g., NumPy, pandas), is easy to learn, and is
NumPy (Numerical Python) is a library for numerical computation. It provides support for
4. Pandas definition:
Pandas is a Python library used for data manipulation and analysis, offering data structures like
A CSV (Comma-Separated Values) file stores tabular data as plain text, where each line
Libraries are collections of pre-written code (functions, classes) that perform specific tasks, such
An array is a collection of elements of the same type, stored at contiguous memory locations,
Arrays enable faster data processing and efficient manipulation for large datasets compared to
lists.
A bell-shaped curve representing a normal distribution, where most data points are near the mean,
A scatter plot visualizes relationships between two variables. Used in exploratory data analysis to
Matplotlib is a Python library for creating static, interactive, and animated visualizations. It is used
13. Histogram:
Tools like matplotlib, seaborn, and Plotly are used to create graphs, charts, and interactive
16. DataFrame:
A 2D data structure in pandas, similar to a table, with labeled rows and columns.
A recursive function that calls itself once per invocation, solving problems step-by-step.
18. Basemap:
A matplotlib toolkit used for plotting geographical data and creating maps.