Machine Learning Questions and Answers: Decision Tree
Machine Learning Questions and Answers: Decision Tree
2. Measures used to select features for root and internal nodes in a decision tree
- Entropy: Measures impurity in a dataset.
- Gini Index: Measures the probability of misclassification.
- Information Gain: Reduction in entropy when a feature is used.
- Gain Ratio: Adjusted version of information gain to account for attribute splits.
3. Multivariate Classifier
- Considers multiple features simultaneously.
- Examples: Linear Discriminant Analysis (LDA), Quadratic Discriminant Analysis (QDA),
Multivariate Decision Trees.
8. Distance Metrics
- Euclidean Distance: d(x,y) = sqrt(sum (x_i - y_i)^2)
- Manhattan Distance: d(x,y) = sum |x_i - y_i|
- Minkowski Distance: d(x,y) = (sum |x_i - y_i|^p)^(1/p).
19. Difference between Gaussian Mixture Model (GMM) and Dirichlet Mixture Model (DMM)
- GMM: Assumes Gaussian distributions with known priors.
- DMM: Uses Dirichlet Process as a prior, allowing a variable number of clusters.