Experiments Results: Machine Learning Prediction of Companies' Business Success
Experiments Results: Machine Learning Prediction of Companies' Business Success
Logistic Regression
Logistic regression is s widely-used algorithm to We selected KNN model to run on test set with:
model a binary dependent variable with many Accuracy = 73.70% F1 score: 44.45%
independent variables.
Model Selection
Random Forest we present three metrics: FUTURE WORK REFERENCE
Random Forest is an ensemble learning method for
● Accuracy: The proportion we have predicted right.
classification with constructing a multitude of decision ● Include more features of the companies,
● F1 Score:
trees at training time and outputting the class that is ● Wei CP, Jiang YS, Yang CS. Patent
such as business description.
the mode of the classes. Analysis for Supporting Merger and
● Try more complex models, such as
K Nearest Neighbours Acquisition (M&A) Prediction: A Data
Neural Network and pre-trained word
We classify an object by a majority vote of its K Mining Approach[M]. Berlin: Springer,
embedding.
nearest neighbours. 2009: 187-200.
● AUC Score: Area under the ROC Curve, which is an ● Try kernel method as moving the data to
● Bento FRSR. Predicting Start-up Success
aggregate measure of performance across all possible higher dimensional space.
with Machine Learning[D]. Lisboa: NOVA
classification thresholds. ● Explore some new questions, such as
Information Management School, 2018.
● TPR = TP / (TP + FN), FPR = FP / (FP + TN) predicting the total funding size for a
9-83.
company (regression problem).