56 Assignments

This document covers the SciPy, Pandas, and NumPy libraries. It shows how to use SciPy functions such as linalg to solve linear equations and to work with random distributions, demonstrates creating and manipulating Pandas DataFrames from different data structures, and covers merging, concatenating, and handling missing data in DataFrames.


SCIPY

#import required libraries


import numpy as np
from scipy import linalg
#The test has 30 questions and is worth 150 marks in total
#True/false questions are worth 4 marks each
#Multiple choice questions are worth 9 marks each

#let x be the number of true/false questions
#let y be the number of multiple choice questions

# (x + y = 30 )
# (4x + 9y = 150)
testQuestionVariable = np.array([[1,1],[4,9]])
testQuestionValue = np.array([30,150])
#use linalg function of Scipy
#use solve method to solve the linear equation and find value for x and y
linalg.solve(testQuestionVariable,testQuestionValue)
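
As a quick sanity check (not part of the original exercise), the solution can be verified by multiplying the coefficient matrix by the result and comparing it with the right-hand side:

#verify the solution: A @ x should reproduce the constants vector
solution = linalg.solve(testQuestionVariable,testQuestionValue)
np.allclose(testQuestionVariable @ solution, testQuestionValue)  #expected: True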

#import required library for normal distribution


from scipy.stats import norm
#draw 20 random variates from the standard normal distribution
norm.rvs(loc=0,scale=1,size=20)
#evaluate the Cumulative Distribution Function (CDF) at x=10, with loc=1 and scale=3
norm.cdf(10,loc=1,scale=3)
#evaluate the Probability Density Function (PDF) at x=14, with loc=1 and scale=1
norm.pdf(14,loc=1,scale=1)
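
Because the CDF gives P(X <= x), the probability that a normal variate falls in an interval is the difference of two CDF values; a small illustrative sketch:

#probability that a standard normal variate lies between -1 and 1 (about 0.6827)
norm.cdf(1,loc=0,scale=1) - norm.cdf(-1,loc=0,scale=1)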

#import the required libraries


import numpy as np
from scipy import linalg
#test_data matrix - (rating on scale of 10)
test_rating_data = np.array([[5,8],[7,9]])
eigenValues, eigenVector = linalg.eig(test_rating_data)
first_eigen, second_eigen = eigenValues
#print eigen values (first and second eigen values)
print(first_eigen, second_eigen)
#print first eigen vector
print(eigenVector[:,0])
#print second eigen vector
print(eigenVector[:,1])
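
As an optional check (added here for illustration), each eigenpair should satisfy A @ v = lambda * v:

#verify the first eigenpair: A @ v should equal lambda * v
np.allclose(test_rating_data @ eigenVector[:,0], first_eigen * eigenVector[:,0])  #expected: True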
PANDAS
Dataframe.ipynb:
import pandas as pd

#Create DataFrame from dict of equal length list

#last five Summer Olympics: host city, year, and number of participating countries
olympic_data_list = {'HostCity':['London','Beijing','Athens','Sydney','Atlanta'],
'Year':[2012,2008,2004,2000,1996],
'No. of Participating Countries':[205,204,201,200,197]
}
df_olympic_data = pd.DataFrame(olympic_data_list)
df_olympic_data

#Create DataFrame from dict of dicts

olympic_data_dict = {'London':{2012:205},'Beijing':{2008:204}}
df_olympic_data_dict = pd.DataFrame(olympic_data_dict)
df_olympic_data_dict
#select the HostCity column by name
df_olympic_data.HostCity
#use the describe() function to view summary statistics
df_olympic_data.describe()

#Create DataFrame from dict of series

olympic_series_participation = pd.Series([205,204,201,200,197],index=[2012,2008,2004,2000,1996])
olympic_series_country = pd.Series(['London','Beijing','Athens','Sydney','Atlanta'],
                                   index=[2012,2008,2004,2000,1996])
df_olympic_series = pd.DataFrame({'No. of Participating Countries':olympic_series_participation,
                                  'Host Cities':olympic_series_country})
df_olympic_series

#Create DataFrame from dict of ndarray

import numpy as np
np_array = np.array([2012,2008,2004,2000])
dict_ndarray = {'year':np_array}
df_ndarray = pd.DataFrame(dict_ndarray)
df_ndarray
#Create DataFrame from DataFrame object

df_from_df = pd.DataFrame(df_olympic_series)
df_from_df
#view values
df_from_df.values

View dataset

#view top 2 rows of the dataset


df_from_df.head(2)
#view bottom two rows of dataset
df_from_df.tail(2)
#view indexes of dataset
df_from_df.index
#view columns of the dataset
df_from_df.columns

Select dataset

#select column name from the dataset


df_from_df['No. of Participating Countries']
#another selection by column name
df_from_df['Host Cities']
#select a row by index label
df_from_df.loc[2012]
#select elements by slicing from 0 to 2
df_from_df.iloc[0:2]
#select element by position
df_from_df.iat[2,1]
#select rows by boolean indexing where more than 200 countries participated
df_from_df[df_from_df['No. of Participating Countries']>200]

#View & Select Data

#import libraries
import numpy as np
import pandas as pd

#create dataframe from dict of series for summer olympics : 1996 to 2012
olympic_series_participation = pd.Series([205,204,201,200,197],index=[2012,2008,2004,2000,1996])
olympic_series_country = pd.Series(['London','Beijing','Athens','Sydney','Atlanta'],
                                   index=[2012,2008,2004,2000,1996])
df_olympic_series = pd.DataFrame({'No. of Participating Countries':olympic_series_participation,
                                  'Host Cities':olympic_series_country})

# display content of the dataset


df_olympic_series

View Data

#view summary statistics with describe()


df_olympic_series.describe()
#view top 2 records
df_olympic_series.head(2)
#view last 3 records
df_olympic_series.tail(3)
#view indexes of dataset
df_olympic_series.index
#view columns of the dataset
df_olympic_series.columns

Select Data
#select data for Host Cities
df_olympic_series['Host Cities']
#another data selection: No. of Participating Countries
df_olympic_series['No. of Participating Countries']
#label-based access with loc
df_olympic_series.loc[2012]
#integer-location based indexing by position
df_olympic_series.iloc[0:2]
#integer-position based scalar access with iat
df_olympic_series.iat[3,1]
#select rows where the number of participating countries is more than 200
# hint - use boolean expression
df_olympic_series[df_olympic_series['No. of Participating Countries']>200]

Data Operation Demo


#import libraries
import pandas as pd
#create test score dataset for test takers
df_test_scores = pd.DataFrame({'Math':[91,97,66,83,45],
'English':[93,88,55,65,74]},
index=['James','David','Stacy','Travis','Mike'])
#view the content of the dataset
df_test_scores
#use describe() function to view dataset statistics
df_test_scores.describe()
#define a custom function to grade the test scores
def test_grade(score):
    if score > 90:
        return 'A'
    elif score > 80:
        return 'B'
    elif score > 70:
        return 'C'
    elif score > 60:
        return 'D'
    else:
        return 'F'
#validate/test the custom function
test_grade(85)
#use applymap method to the dataset to view the grade for tests
df_test_scores.applymap(test_grade)
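
Note: in recent pandas releases (2.1 and later) DataFrame.applymap is deprecated in favour of DataFrame.map; on those versions the equivalent call would be:

#equivalent elementwise call on pandas >= 2.1, where applymap is deprecated
df_test_scores.map(test_grade)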

#Merge, Concatenate, and Drop Duplicates


import numpy as np
import pandas as pd
df_student_test_math_data = pd.DataFrame({'student':['Tom','Jack','Dan','Ram','Jeff','David'],
'ID':[10,56,31,85,9,22]
})
df_student_test_science_data = pd.DataFrame({'student':['Tom','Ram','David'],
'ID':[10,85,22]
})
#inner join on all common columns (student and ID)
pd.merge(df_student_test_math_data,df_student_test_science_data)
#inner join on the student column only
pd.merge(df_student_test_math_data,df_student_test_science_data,on='student')
#right join on ID keeps all rows from the science dataset
pd.merge(df_student_test_math_data,df_student_test_science_data,on='ID',how='right')
#left join on ID, filling missing values with 'X'
pd.merge(df_student_test_math_data,df_student_test_science_data,on='ID',how='left').fillna('X')
#outer join on ID keeps rows from both datasets
pd.merge(df_student_test_math_data,df_student_test_science_data,on='ID',how='outer')
#concatenate the two datasets, renumbering the index
pd.concat([df_student_test_math_data,df_student_test_science_data],ignore_index=True)
df_student_survey_data = pd.DataFrame({'student':['Tom','Jack','Tom','Ram','Jeff','Jack'],
'ID':[10,56,10,85,9,56]
})
#view the survey data
df_student_survey_data
#flag duplicated rows
df_student_survey_data.duplicated()
#drop fully duplicated rows
df_student_survey_data.drop_duplicates()
#drop duplicates considering only the student column
df_student_survey_data.drop_duplicates(['student'])
#drop duplicates considering only the ID column
df_student_survey_data.drop_duplicates('ID')
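
By default drop_duplicates keeps the first occurrence of each duplicate; the keep parameter changes this, for example:

#keep the last occurrence of each duplicated student instead of the first
df_student_survey_data.drop_duplicates(['student'],keep='last')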
#Database interaction with SQL

#import pandas library


import pandas as pd
#import sqlite3
import sqlite3

#Create SQL table


create_SQL_table = """
CREATE TABLE student_test_score
(Id INTEGER, Name VARCHAR(20), Math REAL,
Science REAL
);"""

#connect to an in-memory SQLite database and execute the statement


executeSQL = sqlite3.connect(':memory:')
executeSQL.execute(create_SQL_table)
executeSQL.commit()

#prepare a SQL query


SQL_query = executeSQL.execute('select * from student_test_score')

#fetch the result from the SQLite database


resultSet = SQL_query.fetchall()

#view result
resultSet

#prepare records to be inserted into SQL table through SQL statement


insertData_SQL = [(10,'Jack',85,92),
(29,'Tom',73,89),
(65,'Ram',65.5,77),
(5,'Steve',55,91)
]

#insert records into SQL table through SQL statement


insert_statement = "Insert into student_test_score values(?,?,?,?)"
executeSQL.executemany(insert_statement,insertData_SQL)
executeSQL.commit()

#prepare SQL query


SQL_query = executeSQL.execute('select * from student_test_score')

#fetch the resultset for the query


resultSet = SQL_query.fetchall()
#view the resultset
resultSet

#put the records together in a pandas dataframe


df_student_records = pd.DataFrame(resultSet,columns=[col[0] for col in SQL_query.description])

#view the records in pandas dataframe


df_student_records
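
Pandas can also build the DataFrame directly from the query, which avoids handling the cursor description by hand:

#equivalent one-step approach using pandas' SQL reader
df_student_records = pd.read_sql_query('select * from student_test_score', executeSQL)
df_student_records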

#MISSING VALUES

import pandas as pd

#declare first series


first_series = pd.Series([1,2,3,4,5],index=['a','b','c','d','e'])

#declare second series


second_series=pd.Series([10,20,30,40,50],index=['c','e','f','g','h'])

sum_of_series = first_series+second_series

sum_of_series

#drop NaN (Not a Number) values from the dataset


dropna_s = sum_of_series.dropna()

dropna_s

#note: dropna_s no longer contains NaN, so this fillna is a no-op
dropna_s.fillna(0)

#fill NaN (Not a Number) values with zeroes (0)


fillna_s = sum_of_series.fillna(0)

fillna_s

#fill missing indices with zeroes before performing the addition
fill_NaN_with_zeros_before_sum = first_series.add(second_series,fill_value=0)

fill_NaN_with_zeros_before_sum
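
The same fill_value option exists on the other arithmetic methods (sub, mul, div); for example:

#subtract with missing indices treated as 0
first_series.sub(second_series,fill_value=0)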

Start Pandas Series Exercises

Exercise 01 : Create simple series

import numpy as np
import pandas as pd
#print a simple series with list as an argument
first_series = pd.Series(list('abcdef'))
print (first_series)

Exercise 02 : Create Series from ndarray

#create a series using ndarray countries data


np_country = np.array(['Luxembourg','Norway','Japan','Switzerland','United States','Qatar',
                       'Iceland','Sweden','Singapore','Denmark'])

s_country = pd.Series(np_country)
print (s_country)

Exercise 03 : Create Series from dict

#Evaluate countries and their corresponding gdp per capita and print them as series
dict_country_gdp = pd.Series([52056.01781,40258.80862,40034.85063,39578.07441,39170.41371,
                              37958.23146,37691.02733,36152.66676,34706.19047,33630.24604,
                              33529.83052,30860.12808],
                             index=['Luxembourg','Macao, China','Norway','Japan','Switzerland',
                                    'Hong Kong, China','United States','Qatar','Iceland','Sweden',
                                    'Singapore','Denmark'])

print (dict_country_gdp)

Exercise 04: Access elements in Series

#access elements in the series
#note: plain integer access on a label-indexed series falls back to position and is
#deprecated in recent pandas; prefer iloc for positional access


dict_country_gdp[0]
#access first 5 countries from the series
dict_country_gdp[0:5]
#look up a country by name or index
dict_country_gdp.loc['United States']
#look up by position
dict_country_gdp.iloc[0]

Exercise 05 : Create Series from scalar

#Print Series with scalar input


scalar_series = pd.Series(5.,index=['a','b','c','d','e'])
scalar_series

Exercise 06 : Vectorized Operations


#declare two different vector series with same indexes
first_vector_series = pd.Series([1,2,3,4],index=['a','b','c','d'])
second_vector_series = pd.Series([10,20,30,40],index=['a','b','c','d'])

first_vector_series+second_vector_series
#now shuffle the index of the second vector series
second_vector_series = pd.Series([10,20,30,40],index=['a','d','b','c'])

#addition aligns on index labels, not positions, so values pair up by label
first_vector_series+second_vector_series
#now replace a few indexes with new ones in the second vector series
second_vector_series = pd.Series([10,20,30,40],index=['a','b','e','f'])
#labels present in only one series produce NaN in the result
first_vector_series+second_vector_series
Assignment 01 FAA
Analyse the Federal Aviation Authority Dataset using Pandas
DESCRIPTION

Problem:
Analyze the Federal Aviation Authority (FAA) dataset using Pandas to do the following:
1. View
   - aircraft make name
   - state name
   - aircraft model name
   - text information
   - flight phase
   - event description type
   - fatal flag
2. Clean the dataset and replace the fatal flag NaN with "No"
3. Find the aircraft types and their occurrences in the dataset
4. Remove all the observations where aircraft names are not available
5. Display the observations where the fatal flag is "Yes"

#import necessary library


import pandas as pd

#read the faa (federal aviation authority) dataset


df_faa_dataset = pd.read_csv('C:\\dataset\\faa_ai_prelim.csv')

#view the dataset shape


df_faa_dataset.shape

#view the first five observations


df_faa_dataset.head()

#view all the columns present in the dataset


df_faa_dataset.columns

#now create a new data frame with only required columns


#copy() gives an independent frame so the subset can be modified without warnings
df_analyze_dataset = df_faa_dataset[['ACFT_MAKE_NAME','LOC_STATE_NAME','ACFT_MODEL_NAME',
                                     'RMK_TEXT','FLT_PHASE','EVENT_TYPE_DESC','FATAL_FLAG']].copy()

#view the type of the object


type(df_analyze_dataset)

#view first five observations


df_analyze_dataset.head()

#replace all NaN in the fatal flag column with 'No'


df_analyze_dataset['FATAL_FLAG'] = df_analyze_dataset['FATAL_FLAG'].fillna('No')

#now view first five observations


df_analyze_dataset.head()

#view the shape of the dataset


df_analyze_dataset.shape

#drop values where ACFT_MAKE_NAME (aircraft make name) is not available


df_final_dataset = df_analyze_dataset.dropna(subset=['ACFT_MAKE_NAME'])

#now view the new shape of the dataset


df_final_dataset.shape

#group by aircraft name


aircraftType = df_final_dataset.groupby('ACFT_MAKE_NAME')

#view the number of observations per aircraft make using the size method


aircraftType.size()
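
The same counts are available in one step with value_counts, sorted by frequency rather than by name:

#one-step alternative to groupby + size, sorted by count
df_final_dataset['ACFT_MAKE_NAME'].value_counts()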

#Now group the dataset by fatal flag


fatalAccidents = df_final_dataset.groupby('FATAL_FLAG')

#view the fatal accidents size


fatalAccidents.size()

#select the accidents with fatalities (fatal flag 'Yes')


accidents_with_fatality = fatalAccidents.get_group('Yes')

#view the accidents with fatality


accidents_with_fatality

#FDNY Assignment 02:


Analyse the New York City Fire Department Dataset
DESCRIPTION

What to do:
A dataset in CSV format is given for the Fire Department of New York City. Analyze the dataset to determine:
1. The total number of fire department facilities in New York City
2. The number of fire department facilities in each borough
3. The facility names in Manhattan
#import libraries
import pandas as pd

#read data from csv file fire department of New York City (FDNY)
df_fdny_csv_data_raw = pd.read_csv(r'C:\dataset\FDNY_Firehouse_Listing.csv')

#view content of the data


df_fdny_csv_data_raw

#view first five records


df_fdny_csv_data_raw.head(5)

#re-read the dataset, skipping the first row


df_fdny_csv_data = pd.read_csv(r'C:\dataset\FDNY_Firehouse_Listing.csv',skiprows=1)

#view first five records from fixed dataset


df_fdny_csv_data.head(5)

#view data statistics using describe()


df_fdny_csv_data.describe()

#view columns of the dataset


df_fdny_csv_data.columns

#view the index of the dataset


df_fdny_csv_data.index

#Count number of records


df_fdny_csv_data.count()

#view datatypes
df_fdny_csv_data.dtypes

#group FDNY information by borough


groupby_borough = df_fdny_csv_data.groupby('Borough')

#view the number of FDNY facilities in each borough


groupby_borough.size()

#select FDNY information for Manhattan


fdny_info_Manhattan = groupby_borough.get_group('Manhattan')

#View FDNY information for Manhattan


fdny_info_Manhattan
