Semantic Information G Theory and Logical Bayesian Inference for Machine Learning

Lu, Chenguang

doi:10.3390/info10080261

Computer Science > Artificial Intelligence

arXiv:1809.01577 (cs)

[Submitted on 3 Sep 2018 (v1), last revised 23 Dec 2022 (this version, v2)]

Title:Semantic Information G Theory and Logical Bayesian Inference for Machine Learning

Authors:Chenguang Lu

View PDF

Abstract:An important problem with machine learning is that when label number n>2, it is very difficult to construct and optimize a group of learning functions, and we wish that optimized learning functions are still useful when prior distribution P(x) (where x is an instance) is changed. To resolve this problem, the semantic information G theory, Logical Bayesian Inference (LBI), and a group of Channel Matching (CM) algorithms together form a systematic solution. A semantic channel in the G theory consists of a group of truth functions or membership functions. In comparison with likelihood functions, Bayesian posteriors, and Logistic functions used by popular methods, membership functions can be more conveniently used as learning functions without the above problem. In Logical Bayesian Inference (LBI), every label's learning is independent. For Multilabel learning, we can directly obtain a group of optimized membership functions from a big enough sample with labels, without preparing different samples for different labels. A group of Channel Matching (CM) algorithms is developed for machine learning. For the Maximum Mutual Information (MMI) classification of three classes with Gaussian distributions on a two-dimensional feature space, 2-3 iterations can make mutual information between three classes and three labels surpass 99% of the MMI for most initial partitions. For mixture models, the Expectation-Maximization (EM) algorithm is improved and becomes the CM-EM algorithm, which can outperform the EM algorithm when mixture ratios are imbalanced, or local convergence exists. The CM iteration algorithm needs to combine neural networks for MMI classifications on high-dimensional feature spaces. LBI needs further studies for the unification of statistics and logic.

Comments:	32 Pages, 14 figures
Subjects:	Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
MSC classes:	03B42, 03B48, , 03B52, 03B65, 62F15, 68P30, 94F15, 68T27, 68T37, 68T50
ACM classes:	H.1.1; F.4.1; I.2.3; I.2.6; I.5.2; I.5.3
Cite as:	arXiv:1809.01577 [cs.AI]
	(or arXiv:1809.01577v2 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.1809.01577
Journal reference:	Information 2019, 10(8), 261
Related DOI:	https://doi.org/10.3390/info10080261

Submission history

From: Chenguang Lu [view email]
[v1] Mon, 3 Sep 2018 11:39:11 UTC (344 KB)
[v2] Fri, 23 Dec 2022 00:27:55 UTC (3,555 KB)

Computer Science > Artificial Intelligence

Title:Semantic Information G Theory and Logical Bayesian Inference for Machine Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Semantic Information G Theory and Logical Bayesian Inference for Machine Learning

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators