Lab Manual Data Analysis
Lab Manual Data Analysis
Problem Statement
To explore the features of NumPy, SciPy, Jupyter, Statsmodels, and Pandas packages and
read data from a text file, Excel file, and the web.
Aim
To familiarize with Python libraries essential for data analysis and scientific computation.
Algorithm
1. Install required libraries using `pip install numpy scipy pandas statsmodels jupyter`.
2. Explore features of each library with basic and advanced examples.
3. Read data from a text file using Pandas.
4. Read data from an Excel file using Pandas.
5. Fetch and analyze data from a web source using Pandas and Requests.
# Importing libraries
import numpy as np
import pandas as pd
from scipy.stats import ttest_1samp, norm
from scipy.optimize import minimize
import statsmodels.api as sm
import requests
# NumPy Examples
print("NumPy Examples:")
# Create a 1D array and compute basic statistics
arr = np.array([1, 2, 3, 4, 5])
print("Array:", arr)
print("Mean of array:", np.mean(arr))
print("Standard Deviation:", np.std(arr))
# SciPy Examples
print("
SciPy Examples:")
# Perform a t-test
print("T-test Example:")
t_stat, p_value = ttest_1samp(arr, 3)
print("T-statistic:", t_stat, "P-value:", p_value)
# Optimization Example
print("
Optimization Example:")
result = minimize(lambda x: x**2 + 5, 0)
print("Optimization Result:", result)
# Pandas Examples
print("
Pandas Examples:")
# Creating a DataFrame
data = {
'Name': ['Alice', 'Bob', 'Charlie'],
'Age': [25, 30, 35],
'Salary': [50000, 60000, 70000]
}
df = pd.DataFrame(data)
print("DataFrame:")
print(df)
print("Summary Statistics:")
print(df.describe())
# Statsmodels Example
print("
Statsmodels Example:")
X = sm.add_constant([4, 5, 6])
Y = [1, 2, 3]
model = sm.OLS(Y, X).fit()
print(model.summary())
Enhanced Output
The output includes extensive examples of NumPy functionalities, statistical analysis using
SciPy, comprehensive use of Pandas for data manipulation, and regression analysis with
Statsmodels.