import numpy as np
from tqdm import tqdm
from kmeans import KMeans

SIGMA_CONST = 1e-6
LOG_CONST = 1e-32

FULL_MATRIX = True  # Set False if the covariance matrix is a diagonal matrix


class GMM(object):
    def __init__(self, X, K, max_iters=100):  # No need to change
        """
        Args:
            X: the observations/datapoints, N x D numpy array
            K: number of clusters/components
            max_iters: maximum number of iterations (used in EM implementation)
        """
        self.points = X
        self.max_iters = max_iters

        self.N = self.points.shape[0]  # number of observations
        self.D = self.points.shape[1]  # number of features
        self.K = K  # number of components/clusters

    # Helper function for you to implement
    def softmax(self, logit):  # [5pts]
        """
        Args:
            logit: N x D numpy array
        Return:
            prob: N x D numpy array. See the above function.
        Hint:
            Add keepdims=True in your np.sum() function to avoid broadcast error.
        """
        maxelt = np.amax(logit, axis=-1).reshape((-1, 1))
        logit = logit - maxelt  # subtract the row-wise max for numerical stability
        return np.exp(logit) / np.sum(np.exp(logit), axis=-1, keepdims=True)

    def logsumexp(self, logit):  # [5pts]
        """
        Args:
            logit: N x D numpy array
        Return:
            s: N x 1 array where s[i, 0] = logsumexp(logit[i, :]). See the above function.
        Hint:
            The keepdims parameter could be handy.
        """
        maxelt = np.amax(logit, axis=-1).reshape((-1, 1))
        logit = logit - maxelt
        ans = np.log(np.sum(np.exp(logit), axis=-1, keepdims=True))
        return ans + maxelt

    # for undergraduate students
    def normalPDF(self, points, mu_i, sigma_i):  # [5pts]
        """
        Args:
            points: N x D numpy array
            mu_i: (D,) numpy array, the center for the ith gaussian.
            sigma_i: DxD numpy array, the covariance matrix of the ith gaussian.
        Return:
            pdf: (N,) numpy array, the probability density value of N data for the ith gaussian
        Hint:
            np.diagonal() should be handy.
        """
        raise NotImplementedError

    # for graduate students
    def multinormalPDF(self, points, mu_i, sigma_i):  # [5pts]
        """
        Args:
            points: N x D numpy array
            mu_i: (D,) numpy array, the center for the ith gaussian.
            sigma_i: DxD numpy array, the covariance matrix of the ith gaussian.
        Return:
            normal_pdf: (N,) numpy array, the probability density value of N data for the ith gaussian
        Hint:
            1. np.linalg.det() and np.linalg.inv() should be handy.
            2. The value in self.D may be outdated and not correspond to the current dataset;
               try using another method involving the current arguments to get the value of D.
        """
        D = points.shape[1]
        constant = 1 / np.power(2 * np.pi, D / 2)
        try:
            inv = np.linalg.inv(sigma_i)
            det = np.linalg.det(sigma_i)
        except np.linalg.LinAlgError:
            # regularize a singular covariance matrix before inverting
            inv = np.linalg.inv(sigma_i + SIGMA_CONST)
            det = np.linalg.det(sigma_i + SIGMA_CONST)
        diff = points - mu_i[np.newaxis, :]
        term1 = np.matmul(diff, inv)
        # row-wise quadratic form: diff_i @ inv @ diff_i.T for each observation
        term2 = np.sum(term1.T * diff.T, axis=0)
        return constant * np.power(det, -1 / 2) * np.exp(-(1 / 2) * term2)
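
    # Hedged sketch (not part of the original template): one way the undergraduate
    # normalPDF stub above might be completed, assuming sigma_i is treated as
    # diagonal so the density factorizes into D univariate Gaussians. The method
    # name normalPDF_sketch is hypothetical.
    def normalPDF_sketch(self, points, mu_i, sigma_i):
        variances = np.diagonal(sigma_i)     # (D,) per-feature variances
        diff = points - mu_i[np.newaxis, :]  # N x D deviations from the mean
        # log of the product of independent 1-D normal densities, for stability
        log_pdf = -0.5 * np.sum(np.log(2 * np.pi * variances) + diff ** 2 / variances,
                                axis=1)
        return np.exp(log_pdf)               # (N,) density values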
    def _init_components(self, **kwargs):  # [5pts]
        """
        Args:
            kwargs: any other arguments you want
        Return:
            pi: numpy array of length K, prior
            mu: KxD numpy array, the center for each gaussian.
            sigma: KxDxD numpy array, the diagonal standard deviation of each gaussian.
                You will have KxDxD numpy array for full covariance matrix case
        Hint:
            np.random.seed(5) may be used at the start of this function to ensure consistent outputs.
        """
        np.random.seed(5)  # Do Not Remove Seed
        pi = np.full(self.K, 1 / self.K)  # uniform priors
        mu = self.points[np.random.randint(0, self.N, self.K)]  # random points as initial centers
        sigma = np.repeat(np.eye(self.D)[np.newaxis, :, :], self.K, axis=0)  # identity covariances
        return pi, mu, sigma

    def _ll_joint(self, pi, mu, sigma, full_matrix=FULL_MATRIX, **kwargs):  # [10pts]
        """
        Args:
            pi: np array of length K, the prior of each component
            mu: KxD numpy array, the center for each gaussian.
            sigma: KxDxD numpy array, the diagonal standard deviation of each gaussian.
                You will have KxDxD numpy array for full covariance matrix case
            full_matrix: whether we use full covariance matrix in Normal PDF or not. Default is True.
        Return:
            ll(log-likelihood): NxK array, where ll(i, k) = log pi(k) + log NormalPDF(points_i | mu[k], sigma[k])
        """
        # === graduate implementation
        # if full_matrix is True:
        #     ...
        # === undergraduate implementation
        # if full_matrix is False:
        #     ...
        small_const = 1e-32
        if full_matrix is True:
            ll = np.array([])
            for k in range(self.K):
                normal = self.multinormalPDF(self.points, mu[k], sigma[k])
                # add small_const inside the logs to avoid log(0)
                b = np.log(pi[k] + small_const) + np.log(normal + small_const)
                if len(ll):
                    ll = np.column_stack((ll, b[:, np.newaxis]))
                else:
                    ll = np.append(ll, b)
            return ll
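
    # Hedged optional sketch (not part of the original template): a vectorized
    # alternative to the loop in _ll_joint above, assuming multinormalPDF as
    # defined in this file. The method name _ll_joint_vectorized is hypothetical.
    def _ll_joint_vectorized(self, pi, mu, sigma, full_matrix=FULL_MATRIX, **kwargs):
        small_const = 1e-32
        # stack the K per-component density vectors into an N x K matrix
        pdfs = np.stack([self.multinormalPDF(self.points, mu[k], sigma[k])
                         for k in range(self.K)], axis=1)
        # add the log-priors row-wise: ll(i, k) = log pi(k) + log N(x_i | mu[k], sigma[k])
        return np.log(pi[np.newaxis, :] + small_const) + np.log(pdfs + small_const)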
    def _E_step(self, pi, mu, sigma, full_matrix=FULL_MATRIX, **kwargs):  # [5pts]
        """
        Args:
            pi: np array of length K, the prior of each component
            mu: KxD numpy array, the center for each gaussian.
            sigma: KxDxD numpy array, the diagonal standard deviation of each gaussian.
                You will have KxDxD numpy array for full covariance matrix case
            full_matrix: whether we use full covariance matrix in Normal PDF or not. Default is True.
        Return:
            gamma(tau): NxK array, the posterior distribution (a.k.a. the soft cluster assignment) for each observation.
        Hint:
            You should be able to do this with just a few lines of code by using _ll_joint() and softmax() defined above.
        """
        # === graduate implementation
        # if full_matrix is True:
        #     ...
        # === undergraduate implementation
        # if full_matrix is False:
        #     ...
        if full_matrix is True:
            logit = self._ll_joint(pi, mu, sigma, full_matrix)
            return self.softmax(logit)

    def _M_step(self, gamma, full_matrix=FULL_MATRIX, **kwargs):  # [10pts]
        """
        Args:
            gamma(tau): NxK array, the posterior distribution (a.k.a. the soft cluster assignment) for each observation.
            full_matrix: whether we use full covariance matrix in Normal PDF or not. Default is True.
        Return:
            pi: np array of length K, the prior of each component
            mu: KxD numpy array, the center for each gaussian.
            sigma: KxDxD numpy array, the diagonal standard deviation of each gaussian.
                You will have KxDxD numpy array for full covariance matrix case
        Hint:
            There are formulas in the slides and in the Jupyter notebook.
            Undergrads: To simplify your calculation in sigma, make sure to only take
            the diagonal terms in your covariance matrix.
        """
        # === graduate implementation
        # if full_matrix is True:
        #     ...
        # === undergraduate implementation
        # if full_matrix is False:
        #     ...
        if full_matrix is True:
            pi = np.zeros(self.K)
            mu = np.zeros((self.K, self.D))
            sigma = np.zeros((self.K, self.D, self.D))
            for k in range(self.K):
                gamma_k = gamma[:, k]  # responsibilities for component k
                Nk = np.sum(gamma_k)   # effective number of points in component k
                mu[k] = np.dot(gamma_k, self.points) / Nk
                pi[k] = Nk / self.N
                sigma[k] = (1 / Nk) * np.dot(gamma_k.T * (self.points - mu[k]).T,
                                             (self.points - mu[k]))
            return pi, mu, sigma

    def __call__(self, full_matrix=FULL_MATRIX, abs_tol=1e-16, rel_tol=1e-16, **kwargs):  # No need to change
        """
        Args:
            abs_tol: convergence criteria w.r.t absolute change of loss
            rel_tol: convergence criteria w.r.t relative change of loss
            kwargs: any additional arguments you want
        Return:
            gamma(tau): NxK array, the posterior distribution (a.k.a. the soft cluster assignment) for each observation.
            (pi, mu, sigma): (1xK np array, KxD numpy array, KxDxD numpy array)
        Hint:
            You do not need to change it. For each iteration, we process E and M steps, then update the parameters.
        """
        pi, mu, sigma = self._init_components(**kwargs)
        pbar = tqdm(range(self.max_iters))

        for it in pbar:
            # E-step
            gamma = self._E_step(pi, mu, sigma, full_matrix)

            # M-step
            pi, mu, sigma = self._M_step(gamma, full_matrix)

            # calculate the negative log-likelihood of observation
            joint_ll = self._ll_joint(pi, mu, sigma, full_matrix)
            loss = -np.sum(self.logsumexp(joint_ll))
            if it:
                diff = np.abs(prev_loss - loss)
                if diff < abs_tol and diff / prev_loss < rel_tol:
                    break
            prev_loss = loss
            pbar.set_description('iter %d, loss: %.4f' % (it, loss))
        return gamma, (pi, mu, sigma)
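
# Usage sketch (not part of the original template): fit the mixture to toy data.
# Assumes the kmeans module imported above is available alongside this file.
if __name__ == "__main__":
    np.random.seed(0)
    # two well-separated 2-D blobs
    X = np.vstack([np.random.randn(100, 2) + 5.0,
                   np.random.randn(100, 2) - 5.0])
    gamma, (pi, mu, sigma) = GMM(X, K=2, max_iters=50)(full_matrix=True)
    print("mixing weights:", pi)    # expect roughly [0.5, 0.5]
    print("component means:\n", mu)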
