0% found this document useful (0 votes)
90 views1 page

Text Mining Algorithms PDF

This document discusses 10 common text mining algorithms: K-means clustering, Naive Bayes classifier, K-nearest neighbor (KNN), Support Vector Machines (SVM), Decision Tree, Generalized Linear Models (GLM), Neural Networks, Association Rules, Genetic Algorithms, and Latent Dirichlet Allocation (LDA). Each algorithm is briefly defined in 1-2 sentences.

Uploaded by

Lawrence Moore
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
90 views1 page

Text Mining Algorithms PDF

This document discusses 10 common text mining algorithms: K-means clustering, Naive Bayes classifier, K-nearest neighbor (KNN), Support Vector Machines (SVM), Decision Tree, Generalized Linear Models (GLM), Neural Networks, Association Rules, Genetic Algorithms, and Latent Dirichlet Allocation (LDA). Each algorithm is briefly defined in 1-2 sentences.

Uploaded by

Lawrence Moore
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 1

10

TEXT MINING ALGORITHMS

K-MEANS CLUSTERING

1 Aims to find groups in given data set. The number


of groups is represented by a variable called K.

NAIVE BAYES CLASSIFIER

2 A family of algorithms which assume that values of


the features used in the classification are
independent.

K-NEAREST NEIGHBOR (KNN)

3 A simple algorithm that stores all existing data


objects and classifies the new data objects based
on a similarity measure.

SUPPORT VECTOR MACHINES (SVM)

4 Used to sort two data sets by similar classification.


Draw lines (hyperplanes) that separate the groups
according to some patterns.

DECISION TREE

5 A machine learning technique for data mining that


creates classification or regression models in the
shape of a tree structure.

GENERALIZED LINEAR MODELS (GLM)

6 Combines a number of models including linear


regression models, logistic regression, Poisson
regression, ANOVA, log-linear models and etc.

NEURAL NETWORKS

7 Nonlinear models which represent a metaphor for


the functioning of the human brain.

ASSOCIATION RULES

8 If/then statements that aim to uncover some


relationships between unrelated data in a given
database.

GENETIC ALGORITHMS

9 A family of stochastic search algorithms witch


mechanism is inspired by the process of neo-
Darwinian evolution.

LATENT DIRICHLET ALLOCATION (LDA)

10 A generative probabilistic model designed for


collections of discrete data.

intellspot.com

You might also like