TY - JOUR
T1 - Maximum mutual information regularized classification
AU - Wang, Jim Jing-Yan
AU - Wang, Yi
AU - ZHAO, SHIGUANG
AU - Gao, Xin
N1 - KAUST Repository Item: Exported on 2020-10-01
PY - 2014/9/7
Y1 - 2014/9/7
N2 - In this paper, a novel pattern classification approach is proposed by regularizing the classifier learning to maximize mutual information between the classification response and the true class label. We argue that, with the learned classifier, the uncertainty of the true class label of a data sample should be reduced by knowing its classification response as much as possible. The reduced uncertainty is measured by the mutual information between the classification response and the true class label. To this end, when learning a linear classifier, we propose to maximize the mutual information between classification responses and true class labels of training samples, besides minimizing the classification error and reducing the classifier complexity. An objective function is constructed by modeling mutual information with entropy estimation, and it is optimized by a gradient descend method in an iterative algorithm. Experiments on two real world pattern classification problems show the significant improvements achieved by maximum mutual information regularization.
AB - In this paper, a novel pattern classification approach is proposed by regularizing the classifier learning to maximize mutual information between the classification response and the true class label. We argue that, with the learned classifier, the uncertainty of the true class label of a data sample should be reduced by knowing its classification response as much as possible. The reduced uncertainty is measured by the mutual information between the classification response and the true class label. To this end, when learning a linear classifier, we propose to maximize the mutual information between classification responses and true class labels of training samples, besides minimizing the classification error and reducing the classifier complexity. An objective function is constructed by modeling mutual information with entropy estimation, and it is optimized by a gradient descend method in an iterative algorithm. Experiments on two real world pattern classification problems show the significant improvements achieved by maximum mutual information regularization.
UR - http://hdl.handle.net/10754/556641
UR - http://linkinghub.elsevier.com/retrieve/pii/S0952197614002085
UR - http://www.scopus.com/inward/record.url?scp=84910656722&partnerID=8YFLogxK
U2 - 10.1016/j.engappai.2014.08.009
DO - 10.1016/j.engappai.2014.08.009
M3 - Article
SN - 0952-1976
VL - 37
SP - 1
EP - 8
JO - Engineering Applications of Artificial Intelligence
JF - Engineering Applications of Artificial Intelligence
ER -