Unstructured Data Classification
sentiment_analysis_data.head(3)
2. In supervised learning, the class labels of the training samples are ____________
Known
5. In document classification, each document has to be converted from full text to a
document vector.
True
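One way to picture the text-to-vector conversion in question 5 is a bag-of-words count vector. The sketch below is only an illustration under simple assumptions (whitespace tokenization, lowercase matching); the helper names `build_vocabulary` and `to_count_vector` and the toy corpus are hypothetical and not part of the quiz.

```python
from collections import Counter

def build_vocabulary(documents):
    """Return a sorted list of every distinct lowercase token in the corpus."""
    vocabulary = set()
    for text in documents:
        vocabulary.update(text.lower().split())
    return sorted(vocabulary)

def to_count_vector(text, vocabulary):
    """Map one document to a list of term counts, one entry per vocabulary word."""
    counts = Counter(text.lower().split())
    return [counts[word] for word in vocabulary]

# Hypothetical toy corpus, not taken from the quiz.
documents = ["free prize inside", "meeting notes inside", "free free prize"]
vocabulary = build_vocabulary(documents)
vectors = [to_count_vector(text, vocabulary) for text in documents]

print(vocabulary)
for vector in vectors:
    print(vector)
```

Every document ends up as a vector of the same length, which is what a classifier needs as input. Weighting schemes such as TF-IDF rescale these raw counts but keep the same one-entry-per-term layout.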
6. A technique that depicts classification performance as a two-dimensional table of
actual versus predicted classes is the ___________
Confusion Matrix
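The actual-versus-predicted table from question 6 can be built by counting label pairs. This is a minimal sketch with made-up labels; the function name `confusion_matrix` here is a local helper written for illustration, and the row/column orientation (rows = actual, columns = predicted) is an assumption stated in the code.

```python
from collections import Counter

def confusion_matrix(actual, predicted, labels):
    """Return a table with one row per actual label and one column per predicted label."""
    pair_counts = Counter(zip(actual, predicted))
    return [[pair_counts[(a, p)] for p in labels] for a in labels]

# Hypothetical labels for six messages, not taken from the quiz.
actual = ["spam", "spam", "ham", "ham", "spam", "ham"]
predicted = ["spam", "ham", "ham", "spam", "spam", "ham"]
labels = ["spam", "ham"]

matrix = confusion_matrix(actual, predicted, labels)

# Correct predictions sit on the diagonal of the table.
correct = sum(matrix[i][i] for i in range(len(labels)))
accuracy = correct / len(actual)

for label, row in zip(labels, matrix):
    print(label, row)
print("accuracy:", accuracy)
```

Off-diagonal cells show which classes are being confused with each other, which a single accuracy figure hides.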
7. Which NLP technique uses a lexical knowledge base to obtain the correct base form
of the words?
Lemmatization
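To illustrate why lemmatization needs a lexical knowledge base, the sketch below looks up each word in a tiny hand-written dictionary. The `LEXICON` table and the `lemmatize` helper are hypothetical stand-ins; real lemmatizers (for example NLTK's `WordNetLemmatizer`, which consults WordNet) use a far larger resource.

```python
# Hypothetical miniature lexicon standing in for a real lexical knowledge base.
LEXICON = {
    ("better", "adj"): "good",
    ("mice", "noun"): "mouse",
    ("ran", "verb"): "run",
    ("running", "verb"): "run",
    ("studies", "noun"): "study",
}

def lemmatize(word, pos):
    """Look up the base form for (word, part of speech); fall back to the lowercase word."""
    key = (word.lower(), pos)
    return LEXICON.get(key, word.lower())

samples = [("Mice", "noun"), ("ran", "verb"), ("better", "adj"), ("cat", "noun")]
lemmas = [lemmatize(word, pos) for word, pos in samples]
print(lemmas)
```

Irregular forms such as "mice" or "better" cannot be recovered by chopping suffixes, which is the difference between lemmatization and rule-based stemming.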
TF-IDF
16. SVM is a _____________
Supervised learning algorithm
18. Imagine you have just finished training a decision tree for spam classification,
and it is showing abnormally bad performance on both your training and test sets.
Assume that your implementation has no bugs. What could be the reason for this
problem?
All the options
19. Which of the given hyperparameters, when increased, may cause the random forest
to overfit the data?
Depth of Tree
22.