Short Notes On Coding
Short Notes On Coding
train.isnull().sum()
train.select_dtypes(include=[np.object,np.float64,np.int64])
)
train["column_name"].replace(["","",""]),["","",""],inplace=True)
train.info()
---------------------------------------
import matplotlib.pyplot as plt
plt.subplots(1,2,1)
plt.xlabel('name on x-axis')
plt.ylabel('name on y-axis')
sns.countplot("columnname",data=train,pallete='ocean','spring','summer')
df.plt(kind= 'scatter or hist',x= 'column as x', y= 'column name x')
plt.bar(x,y)
plt.hist(x)
plt.scatter(x,y)
y=np.array([12,34,56,90])
plt.pie(y)
plt.plot
----------------------------------------------------
import numpy as np
from scipy import stats
np.mean(list)
np.meadian(list)
stats.mode(list)
np.var(list) -> (sum over all i(xi- (mean of xi)))/no. of points= variance
np.std(list) -> sqrt(variance)
np.percentile(list, 75)
75 percentile meaning 75 percentile= 43 that means 75% of the population has values
lower than 43
max_no= m
min_no= n
25 percentile = 0.25* (max_no-min_no)
---------------------------------------------------
Distributions:
np.random.uniform(start,end,size)
np.random.normal(start,end,size)
------------------------------------------------------
map we use when we want to perform an operation over all the elments of list
list(map(myfunc,iterable))
-----------------------------------------------------------------
df= dataframe
di= dict(df.groupby(['district']).['model_price'].mean())
data[data['state']== 'Telanagana']['commodity'].value_counts
data.groupby('state')['commodity'].nunique()
data.sort_values(by=['column_name'],inplace=True)