Conditional Mutual Information Constrained Deep Learning for Classification

IEEE Transactions on Neural Networks and Learning Systems (2025)

Abstract
The concepts of conditional mutual information (CMI) and normalized conditional mutual information (NCMI) are introduced to measure the concentration and separation performance of a classification deep neural network (DNN) in the output probability distribution space of the DNN, where CMI and the ratio between CMI and NCMI represent the intra-class concentration and inter-class separation of the DNN, respectively (so NCMI itself equals CMI divided by the inter-class separation). By using NCMI to evaluate popular DNNs pretrained over ImageNet in the literature, it is shown that their validation accuracies over the ImageNet validation data set are more or less inversely proportional to their NCMI values. Based on this observation, the standard deep learning (DL) framework is further modified to minimize the standard cross entropy function subject to an NCMI constraint, yielding CMI constrained deep learning (CMIC-DL). A novel alternating learning algorithm is proposed to solve such a constrained optimization problem. Extensive experimental results show that DNNs trained within CMIC-DL outperform state-of-the-art models trained within the standard DL framework and with other loss functions from the literature, in terms of both accuracy and robustness against adversarial attacks. In addition, visualizing the evolution of the learning process through the lens of CMI and NCMI is also advocated.
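The abstract does not spell out the estimators, so the following is only a minimal sketch to fix ideas. It assumes CMI is estimated as the average KL divergence from each sample's output distribution to its class centroid (the mean output over that class), inter-class separation as the average pairwise KL divergence between distinct centroids, and NCMI as their ratio; the function names and these formulas are illustrative assumptions, not the paper's exact definitions.

```python
# Hedged sketch: estimating CMI, separation, and NCMI from softmax outputs.
import numpy as np

def kl(p, q, eps=1e-12):
    """KL divergence D(p || q) along the last axis of probability arrays."""
    p = np.clip(p, eps, 1.0)
    q = np.clip(q, eps, 1.0)
    return np.sum(p * np.log(p / q), axis=-1)

def ncmi_estimate(probs, labels):
    """probs: (N, C) softmax outputs; labels: (N,) integer class ids.
    Returns (cmi, separation, ncmi) under the assumptions stated above."""
    classes = np.unique(labels)
    # Class centroids: the mean output distribution over each class.
    centroids = {c: probs[labels == c].mean(axis=0) for c in classes}
    # Intra-class concentration (CMI proxy): average divergence of each
    # sample's output distribution from its own class centroid.
    cmi = float(np.mean([kl(probs[i], centroids[labels[i]])
                         for i in range(len(labels))]))
    # Inter-class separation proxy: average pairwise divergence between
    # distinct class centroids.
    sep = float(np.mean([kl(centroids[u], centroids[v])
                         for u in classes for v in classes if u != v]))
    return cmi, sep, cmi / sep

# Toy demo with random outputs; on a trained classifier, probs would be
# the model's softmax outputs over a validation set.
rng = np.random.default_rng(0)
logits = rng.normal(size=(512, 10))
probs = np.exp(logits) / np.exp(logits).sum(axis=1, keepdims=True)
labels = rng.integers(0, 10, size=512)
print(ncmi_estimate(probs, labels))
```

Under these assumptions, a lower NCMI value would correspond to tighter within-class clusters relative to the spacing between classes, matching the abstract's observation that accuracy is roughly inversely proportional to NCMI.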
Key words
Alternating minimization, concentration and separation, conditional mutual information (CMI), cross entropy (CE), deep learning (DL)

Key Points: The paper proposes a conditional mutual information constrained deep learning framework (CMIC-DL), which introduces conditional mutual information (CMI) and normalized conditional mutual information (NCMI) to measure a deep neural network's classification performance in its output probability distribution space, achieving higher accuracy and robustness.

Method: CMI and NCMI are used to evaluate a DNN's classification performance in its output probability distribution space; guided by the NCMI values, the standard deep learning framework is modified into a constrained optimization problem, yielding CMIC-DL, which is solved with an alternating learning algorithm (see the sketch after this summary).

Experiments: Extensive experiments on the ImageNet validation data set show that DNNs trained within CMIC-DL outperform those trained within the standard deep learning framework and with other loss functions from the literature, in terms of both accuracy and robustness against adversarial attacks.
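To make the alternating scheme concrete, here is a hypothetical PyTorch sketch of one CMIC-DL-style parameter update. It assumes the NCMI constraint is relaxed into a penalty with a fixed multiplier lam, and that per-class centroid distributions Q (one row per class) are held fixed during each parameter update and refreshed in an alternating outer step. The function name cmic_step, the hyperparameters lam and beta, and the specific penalty form are all illustrative assumptions, not the paper's exact algorithm.

```python
# Hedged sketch of one CMIC-DL-style training step (illustrative only).
import torch
import torch.nn.functional as F

def cmic_step(model, optimizer, x, y, Q, lam=0.1, beta=1.0):
    """x: input batch; y: (B,) labels; Q: (C, C) fixed class centroids,
    where Q[u] is the centroid output distribution assumed for class u."""
    logits = model(x)
    p = F.softmax(logits, dim=-1)            # P(.|x), shape (B, C)
    ce = F.cross_entropy(logits, y)          # standard CE term
    logp = p.clamp_min(1e-12).log()
    logQ = Q.clamp_min(1e-12).log()
    # all_kl[i, u] = KL(p_i || Q[u]) for every sample i and class u.
    all_kl = (p.unsqueeze(1) * (logp.unsqueeze(1) - logQ.unsqueeze(0))).sum(-1)
    idx = torch.arange(len(y))
    # Concentration term: pull each output toward its own class centroid.
    concen = all_kl[idx, y].mean()
    # Separation term: push outputs away from the other classes' centroids.
    mask = torch.ones_like(all_kl)
    mask[idx, y] = 0.0
    sep = (all_kl * mask).sum() / (len(y) * (Q.size(0) - 1))
    loss = ce + lam * (concen - beta * sep)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```

In the alternating outer step, each centroid row Q[u] would be re-estimated as the average softmax output over the samples of class u and then frozen for the next round of parameter updates, which is what makes the two-phase optimization tractable.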