Data Clustering Solution
Data Clustering Solution
Retained content for each document after removing stop words, punctuation, and irrelevant terms:
|-------------------|-------|-------|-------|
| information |3 |0 |2 |
| data |2 |0 |1 |
| meaning |1 |0 |0 |
| objective |1 |0 |0 |
| database |2 |0 |0 |
| administration |1 |0 |0 |
| storing |1 |0 |0 |
| facts |1 |0 |0 |
| computer |1 |0 |2 |
| carbon |0 |2 |0 |
| chemical |0 |2 |0 |
| compounds |0 |2 |0 |
| program |0 |0 |1 |
| instructions |0 |0 |1 |
| application |0 |0 |1 |
| example |0 |0 |1 |
| values |0 |0 |1 |
Clusters:
Cluster 2: {Doc 2}
@relation documents
@attribute term_information numeric
@data
3, 2, 1, 1, 2, 1, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0
0, 0, 0, 0, 0, 0, 0, 0, 0, 2, 2, 2, 0, 0, 0, 0, 0
2, 1, 0, 0, 0, 0, 0, 0, 2, 0, 0, 0, 1, 1, 1, 1, 1