Data Analytics Lab QA
Data Analytics Lab QA
A: Data refers to raw facts and figures that are collected for analysis and reference. It can be in
A: Data is used to make informed decisions, identify trends, improve processes, and drive strategic
planning.
A: Structured data is organized and easily searchable (e.g., spreadsheets), while unstructured data
A: Data Analytics is the process of examining datasets to draw conclusions about the information
they contain.
A: It helps organizations optimize performance, predict trends, make data-driven decisions, and gain
competitive advantage.
A: Businesses use data analytics to understand customer behavior, improve operations, and
support decision-making.
A: Data collection, Data processing, Statistical analysis, Data visualization, and Interpretation of
results.
A: Extremely large datasets that may be analyzed computationally to reveal patterns, trends, and
associations.
A: Social media platforms, Sensors, Transactional records, Mobile devices, Web logs.
A: A central place where data is stored and maintained, like a database or data warehouse.
A: Data Science focuses on predictive analysis while Business Intelligence focuses on descriptive
analysis.
paradigms.
A: 1D: np.array([1,2,3])
2D: np.array([[1,2],[3,4]])
Example: df = pd.DataFrame({'Name':['A','B'],'Age':[20,30]})
A: s = pd.Series([10,20,30], index=['a','b','c'])
A: df = pd.DataFrame({'Name':['Tom','Jerry'], 'Score':[90,85]})
Q: How to import & export csv file using Pandas? Explain with example?
A: df = pd.read_csv('file.csv')
df.to_csv('output.csv')
Q: How to import & export excel file using Pandas? Explain with example?
A: df = pd.read_excel('file.xlsx')
df.to_excel('output.xlsx')
Q: How to import & export SQL file using Pandas? Explain with example?
Example: plt.plot(x,y)
Q: How to create multiple plots in same canvas using Matplotlib? Explain with example?
A: Use plt.boxplot(data).
A: The process of cleaning, transforming, and preparing raw data for analysis.
A: Use df.describe().
A: Grouping continuous values into bins using pd.cut(). Used to reduce errors.
Q: What is EDA?