Network properties determine neural network performance

4Citations
Citations of this article
37Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

Machine learning influences numerous aspects of modern society, empowers new technologies, from Alphago to ChatGPT, and increasingly materializes in consumer products such as smartphones and self-driving cars. Despite the vital role and broad applications of artificial neural networks, we lack systematic approaches, such as network science, to understand their underlying mechanism. The difficulty is rooted in many possible model configurations, each with different hyper-parameters and weighted architectures determined by noisy data. We bridge the gap by developing a mathematical framework that maps the neural network’s performance to the network characters of the line graph governed by the edge dynamics of stochastic gradient descent differential equations. This framework enables us to derive a neural capacitance metric to universally capture a model’s generalization capability on a downstream task and predict model performance using only early training results. The numerical results on 17 pre-trained ImageNet models across five benchmark datasets and one NAS benchmark indicate that our neural capacitance metric is a powerful indicator for model selection based only on early training results and is more efficient than state-of-the-art methods.

References Powered by Scopus

Deep residual learning for image recognition

173367Citations
N/AReaders
Get full text

Deep learning

63305Citations
N/AReaders
Get full text

Delving deep into rectifiers: Surpassing human-level performance on imagenet classification

15409Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Artificial Intelligence-Empowered Radiology—Current Status and Critical Review

1Citations
N/AReaders
Get full text

New design of an intelligent electromagnetic torque controller based on neural network and fractional calculus: Variable-speed wind energy systems application

1Citations
N/AReaders
Get full text

Frobenius Norm-Based Global Stability Analysis of Delayed Bidirectional Associative Memory Neural Networks

0Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Jiang, C., Huang, Z., Pedapati, T., Chen, P. Y., Sun, Y., & Gao, J. (2024). Network properties determine neural network performance. Nature Communications, 15(1). https://doi.org/10.1038/s41467-024-48069-8

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 6

50%

Researcher 4

33%

Professor / Associate Prof. 2

17%

Readers' Discipline

Tooltip

Computer Science 3

33%

Engineering 3

33%

Physics and Astronomy 2

22%

Agricultural and Biological Sciences 1

11%

Article Metrics

Tooltip
Mentions
News Mentions: 2
Social Media
Shares, Likes & Comments: 18

Save time finding and organizing research with Mendeley

Sign up for free