Cheat Sheet
Cheat Sheet
Cheat Sheet
Reinforcement. In reinforcement learning, we get some form of output but it does not
correspond directly to the correct output for each instance. For example, we could get a single
output after a batch of instances is processed (as commonly seen in games, where the eventual
game win/loss outcome is the eventual output of a series of instance moves). For example, as a
Year 1 student, I would like to choose modules that maximize my starting income upon
graduation. If we consider the choice of modules during each of the nominal eight semesters as
a problem instance, this would constitute a reinforcement learning problem. In this final task,
the input might be the historical enrollment records of past students (and possibly their
performance within each module), and the output would be their starting salary. Data is the set
of inputs that correspond to the resultant output.
在强化学习中,我们得到某种形式的输出,但它并不直接对应于每个实例的正确输出。例
如,我们可以在处理一批实例后获得单个输出(正如在游戏中常见的那样,最终的游戏赢
/输结果是一系列实例移动的最终输出)。例如,作为一名一年级学生,我想选择能够最
大化我毕业后起始收入的模块。如果我们将标称八个学期中每个学期的模块选择视为一个
问题实例,这将构成一个强化学习问题。在这个最终任务中,输入可能是过去学生的历史
入学记录(可能还有他们在每个模块中的表现),输出可能是他们的起薪。 数据是对应于
结果输出的一组输入。
k-Nearest Neighbors
https://zhuanlan.zhihu.com/p/71646003
https://en.wikipedia.org/wiki/K-d_tree#Complexity
07b.evaluations, Tutorial 05 Evaluation Metrics
07c.evaluations, 08a.data-processing, 08b.feature-engineering, tutorial-06:
MAE https://www.statology.org/mean-absolute-error-calculator/
MSE https://www.statology.org/mse-calculator/
RMSE https://www.statology.org/rmse-calculator/
MAPE https://www.statology.org/mape-calculator/
SMAPE https://rdrr.io/cran/Metrics/man/smape.html