Machine Learning Numpy
Machine Learning Numpy
1 1 1 1 0 0
2 1 1 0 1 1
Kmean
It does not makes sense if you want to plot this table into a vector space, we have
7 dimensions!
Principal Component Analysis
2. Reduce noise
Let say you want to study stress level of a student, based on,
Principal Component Analysis
2. Reduce noise
Let say you want to study stress level of a student, based on,
Not all these 7 dimensions bring important information! We want to reject some
attributes.
Principal Component Analysis
2. Reduce noise
Let say you want to study stress level of a student, based on,
Not all these 7 dimensions bring important information! We want to reject some
attributes. Maybe 7 does not hurt much. What happen if you have 512 * 512 * 3
(image) dimension?! insane!
Principal Component Analysis
3. Reduce memory (computer science)
Let say a float took 1 bytes, we have 7 columns and 1 billion of rows.
Let say, this plane is Rn , we only visualize it on R2 , I want to visualize the data
points at axis-0, which is x-axis.
Principal Component Analysis
Principal Component Analysis
Principal Component Analysis
Principal Component Analysis
We cannot distinguish between oranges and blues! How about axis-2, which is,
axis-y?
Principal Component Analysis
Principal Component Analysis
It is quite okay, just a few data points overlapped each others. But we don’t
overlapping right?!
Principal component analysis
[5, 0],
[0, 1]
l, v = np.linalg.eig(np.array([[5,0],[0,1]]))
l, v
(array([5., 1.]), array([[1., 0.],
[0., 1.]]))