Explore our curated list of top research papers on Neural Networks. Delve into cutting-edge innovations and advancements that are shaping the future of artificial intelligence. Perfect for researchers, students, and enthusiasts who want to stay updated with the latest trends and findings in this dynamic field.
Robi Ardiansyah, Enny Itje, Universitas Teknologi + 1 more
Natural Language Processing
Sascha Marton, S. Lüdtke, Christian Bartelt
Applied Sciences
This paper introduces a real-time approach for generating a symbolic representation of the function learned by a neural network via another neural network (called the Interpretation Network, or I-Net), which maps network parameters to a symbolic representation of the network function.
Miltiadis Kofinas, Boris Knyazev, Yan Zhang + 5 more
ArXiv
This work proposes to represent neural networks as computational graphs of parameters, which allows them to harness powerful graph neural networks and transformers that preserve permutation symmetry, and enables a single model to encode neural computational graphs with diverse architectures.
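To make the idea concrete, here is a minimal sketch, assuming a plain MLP and illustrative helper names rather than the authors' code, of turning a network's parameters into a graph whose nodes are neurons and whose edges carry the weights:

```python
import numpy as np

def mlp_to_graph(weights, biases):
    """Sketch: represent an MLP as a graph with one node per neuron
    (bias as the node feature) and one edge per weight between
    consecutive layers.  weights[l] has shape (n_l, n_{l+1}) and
    biases[l] has shape (n_{l+1},).  The paper's encoding is richer
    (it preserves permutation symmetry and covers diverse
    architectures); this only shows the basic construction."""
    layer_sizes = [weights[0].shape[0]] + [w.shape[1] for w in weights]
    offsets = np.cumsum([0] + layer_sizes[:-1])
    # Input neurons have no bias; use zeros as their node feature.
    node_feats = np.concatenate([np.zeros(layer_sizes[0])] + [np.asarray(b) for b in biases])
    edges, edge_feats = [], []
    for l, w in enumerate(weights):
        for i in range(w.shape[0]):
            for j in range(w.shape[1]):
                edges.append((offsets[l] + i, offsets[l + 1] + j))
                edge_feats.append(w[i, j])
    return node_feats, np.array(edges), np.array(edge_feats)

# Illustrative 2-3-1 MLP: 6 nodes, 9 weighted edges.
w = [np.random.randn(2, 3), np.random.randn(3, 1)]
b = [np.zeros(3), np.zeros(1)]
nodes, edges, edge_feats = mlp_to_graph(w, b)
```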
Vijay Prakash Dwivedi, Chaitanya K. Joshi, T. Laurent + 2 more
ArXiv
A reproducible GNN benchmarking framework is introduced, with the facility for researchers to add new models conveniently for arbitrary datasets, and a principled investigation into the recent Weisfeiler-Lehman GNNs (WL-GNNs) compared to message passing-based graph convolutional networks (GCNs).
Qeethara Al-Shayea
International Journal of Research Publication and Reviews
The results of applying the artificial neural networks methodology to acute nephritis diagnosis based upon selected symptoms demonstrate the network's ability to learn the patterns corresponding to a person's symptoms.
Inductive biases are the assumptions a learner uses to model the world and predict outputs; they reduce the amount of data needed to fit the model while constraining the model's flexibility.
Gaspard Michel, Giannis Nikolentzos, J. Lutzeyer + 1 more
journal unavailable
This paper derives three different variants of the PathNN model that aggregate single shortest paths, all shortest paths and all simple paths of length up to K, and proves that two of these variants are strictly more powerful than the 1-WL algorithm, and experimentally validate the theoretical results.
This paper provides a comprehensive overview of CNNs and their applications in image recognition tasks, and reviews recent developments in CNNs, including attention mechanisms, capsule networks, transfer learning, adversarial training, quantization and compression, and enhancing the reliability and efficiency of CNNs through formal methods.
Shi-Wee Deng, Shi Gu
ArXiv
A novel strategic pipeline is proposed that transfers the weights to the target SNN by combining threshold balance and soft-reset mechanisms, achieving almost no accuracy loss between the converted SNNs and conventional ANNs with only about 1/10 of the typical SNN simulation time.
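As a rough illustration of one conversion ingredient (not the authors' pipeline), an integrate-and-fire neuron with a soft, i.e. subtractive, reset can be sketched as follows; the threshold would come from a threshold-balancing step, and all names are illustrative:

```python
import numpy as np

def if_neuron_soft_reset(inputs, threshold):
    """Integrate-and-fire neuron with a soft (subtractive) reset.

    inputs: shape (timesteps,), input current per timestep.
    threshold: firing threshold (in ANN-to-SNN conversion this is
               typically balanced against the ANN's activations).
    Returns the emitted spike train.
    """
    v = 0.0
    spikes = np.zeros_like(inputs)
    for t, x in enumerate(inputs):
        v += x                  # integrate the input current
        if v >= threshold:
            spikes[t] = 1.0
            v -= threshold      # soft reset: keep the residual potential
    return spikes

# A constant input of 0.3 with threshold 1.0 fires ~3 times in 10 steps,
# so the firing rate approximates the underlying ANN activation.
print(if_neuron_soft_reset(np.full(10, 0.3), threshold=1.0))
```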
Andrea Agiollo, A. Omicini
journal unavailable
A novel framework leveraging Graph Neural Networks to Generate Neural Networks (GNN2GNN) where powerful NN architectures can be learned out of a set of available architecture-performance pairs, and paves the way towards generalisation between datasets.
While some RNN architectures can maintain a memory of previous inputs/outputs, to compute the output the memory states need to encompass information from many previous states, which can be difficult, especially for tasks with long-term dependencies.
Min-Gang Zhou, Zhi-Ping Liu, Hua‐Lei Yin + 3 more
Research
This work proposes a new quantum neural network model for quantum neural computing using (classically controlled) single-qubit operations and measurements on real-world quantum systems with naturally occurring environment-induced decoherence, which greatly reduces the difficulties of physical implementations.
This work demonstrates that diffusion models can also generate high-performing neural network parameters, and empirically finds that the generated models are not memorizing the trained ones.
Alaa Bessadok, M. Mahjoub, I. Rekik
IEEE Transactions on Pattern Analysis and Machine Intelligence
Current GNN-based methods are reviewed, highlighting the ways they have been used in several applications related to brain graphs, such as missing brain graph synthesis and disease classification, and charting a path toward a better application of GNN models in the field of network neuroscience for neurological disorder diagnosis and population graph integration.
Yuyu Zhang, Xinshi Chen, Yuan Yang + 4 more
Deep Learning on Graphs
This chapter systematically organize the existing research of GNNs along three axes: foundations, frontiers, and applications, and introduces the fundamental aspects of GNNs ranging from the popular models and their expressive powers, to the scalability, interpretability and robustness of GNNs.
Richard Gast, S. Solla, Ann Kennedy
Proceedings of the National Academy of Sciences of the United States of America
This work analyzes a mathematical model of networks of heterogeneous spiking neurons and reveals how a mostly overlooked property of the brain—neural heterogeneity—allows for the emergence of computationally specialized networks.
Anu Sayal, Janhvi Jha, Chaithra N + 4 more
2023 IEEE 5th International Conference on Cybernetics, Cognition and Machine Learning Applications (ICCCMLA)
This chapter has attempted to depict the types of neural networks and machine learning as well as their applications in different industrial disciplines such as science, commerce, and medicine.
Florian Jaeckle, Jingyue Lu, M. P. Kumar
ArXiv
This work proposes a novel machine learning framework that can be used for designing an effective branching strategy as well as for computing better lower bounds, and learns two graph neural networks that both directly treat the network they want to verify as a graph input and perform forward-backward passes through the GNN layers.
Clare Lyle, Zeyu Zheng, Evgenii Nikishin + 3 more
ArXiv
A systematic empirical analysis into plasticity loss is conducted, finding that loss of plasticity is deeply connected to changes in the curvature of the loss landscape, but that it often occurs in the absence of saturated units.
Zizheng Pan, Jianfei Cai, Bohan Zhuang
2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Stitchable Neural Networks (SN-Net) is presented, a novel scalable and efficient framework for model deployment that cheaply produces numerous networks with different complexity and performance trade-offs given a family of pretrained neural networks, which the authors call anchors.
Qi Xu, Yaxin Li, Jiangrong Shen + 3 more
2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
A novel method of constructing deep SNN models with knowledge distillation (KD) that uses an ANN as the teacher model and an SNN as the student model, showing strong noise immunity to various types of artificial noise and natural signals.
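The exact objective used for the ANN-teacher / SNN-student setup is not reproduced here, but a generic knowledge-distillation loss of the kind such methods build on looks roughly like this (standard Hinton-style KD; the temperature and weighting are illustrative):

```python
import torch
import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels,
                      temperature=4.0, alpha=0.5):
    """Generic KD objective: weighted sum of the hard-label cross-entropy
    and the KL divergence between temperature-softened teacher and student
    distributions.  The paper's ANN->SNN variant may differ in details."""
    hard = F.cross_entropy(student_logits, labels)
    soft = F.kl_div(
        F.log_softmax(student_logits / temperature, dim=-1),
        F.softmax(teacher_logits / temperature, dim=-1),
        reduction="batchmean",
    ) * temperature ** 2
    return alpha * hard + (1.0 - alpha) * soft
```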
Endang Suherman, Djarot Hindarto, A. Makmur + 1 more
Sinkron
The purpose of this research is to classify and detect rice images using neural networks, in experiments on public rice image datasets.
The epinet is introduced: an architecture that can supplement any conventional neural network, including large pretrained models, and can be trained with modest incremental computation to estimate uncertainty, and the epistemic neural network (ENN) is introduced as an interface for models that produce joint predictions.
Yizeng Han, Gao Huang, Shiji Song + 3 more
IEEE Transactions on Pattern Analysis and Machine Intelligence
This survey comprehensively reviews this rapidly developing area by dividing dynamic networks into three main categories: sample-wise dynamic models that process each sample with data-dependent architectures or parameters; spatial-wise dynamic networks that conduct adaptive computation with respect to different spatial locations of image data; and temporal-wise dynamic networks that perform adaptive inference along the temporal dimension for sequential data.
This paper presents a model-agnostic methodology, namely Network In Graph Neural Network (NGNN), that allows arbitrary GNN models to increase their model capacity by making the model deeper, by inserting non-linear feedforward neural network layer(s) within each GNN layer.
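A minimal sketch of the idea, assuming a PyTorch-Geometric-style convolution interface and illustrative class names rather than the authors' implementation:

```python
import torch
import torch.nn as nn

class NGNNLayer(nn.Module):
    """Sketch of the Network-in-GNN idea: wrap a base GNN layer and insert
    a non-linear feedforward layer inside it, deepening the model without
    adding extra message-passing rounds.  `base_conv` stands for any GNN
    convolution (e.g. a GCN or GraphSAGE layer)."""
    def __init__(self, base_conv, hidden_dim):
        super().__init__()
        self.conv = base_conv
        self.ffn = nn.Sequential(nn.Linear(hidden_dim, hidden_dim), nn.ReLU())

    def forward(self, x, edge_index):
        h = self.conv(x, edge_index)   # one round of message passing
        return self.ffn(h)             # extra non-linear transform within the layer
```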
Nicholas Carlini, Milad Nasr, Christopher A. Choquette-Choo + 8 more
ArXiv
It is shown that existing NLP-based optimization attacks are insufficiently powerful to reliably attack aligned text models: even when current NLP-based attacks fail, the authors can find adversarial inputs with brute force.
Qingye Zhao, Xin Chen, Zhuoyu Zhao + 3 more
Proceedings of the 25th ACM International Conference on Hybrid Systems: Computation and Control
A novel approach to synthesizing neural networks as barrier certificates, which can provide safety guarantees for neural network controlled systems, and implements the tool NetBC, which is more effective and scalable than the existing polynomial barrier certificate-based method.
Kirill Solodskikh, Azim Kurbanov, Ruslan Aydarkhanov + 4 more
2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
A new family of deep neural networks that, instead of the conventional representation of network layers as N-dimensional weight tensors, uses a continuous layer representation along the filter and channel dimensions, which can be applied to prune the model directly on an edge device while suffering only a small performance loss.
Zhaocheng Zhu, Zuobai Zhang, Louis-Pascal Xhonneux + 1 more
journal unavailable
The Neural Bellman-Ford Network (NBFNet) is proposed, a general graph neural network framework that solves the path formulation with learned operators in the generalized Bellman-Ford algorithm, and outperforms existing methods by a large margin in both transductive and inductive settings.
Aarush Saxena
International Journal for Research in Applied Science and Engineering Technology
CNNs are primarily used to solve difficult image-driven pattern recognition tasks and with their precise yet simple architecture, offer a simplified method of getting started with ANNs.
The core of e3nn is equivariant operations, such as the TensorProduct class or the spherical harmonics functions, that can be composed to create more complex modules such as convolutions and attention mechanisms, which can be used to efficiently articulate Tensor Field Networks, 3D Steerable CNNs, Clebsch-Gordan Networks, SE(3) Transformers, and other E(3)-equivariant networks.
T. Konstantin Rusch, Michael M. Bronstein, Siddhartha Mishra
ArXiv
Over-smoothing is axiomatically defined as the exponential convergence of suitable similarity measures on the node features of graph neural networks, and the definition is extended to the rapidly emerging field of continuous-time GNNs.
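One common similarity-based quantity used to track over-smoothing is the Dirichlet energy of node features across layers; a minimal sketch follows (the paper's axiomatic measures and constants are not reproduced here):

```python
import numpy as np

def dirichlet_energy(features, edges):
    """Dirichlet energy of node features: the sum of squared feature
    differences across edges.  Over-smoothing corresponds to this
    quantity decaying (exponentially, in the paper's definition)
    as network depth increases."""
    return sum(np.sum((features[u] - features[v]) ** 2) for u, v in edges)

# Illustrative use: track the energy of the node embeddings after each layer.
# energies = [dirichlet_energy(h_l, edges) for h_l in layer_outputs]
```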
Aysu Ismayilova, V. Ismailov
Neural networks : the official journal of the International Neural Network Society
In this paper, we show that the Kolmogorov two-hidden-layer neural network model, with a continuous, discontinuous bounded, or unbounded activation function in the second hidden layer, can precisely represent continuous, discontinuous bounded, and all unbounded multivariate functions, respectively.
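For reference, the classical Kolmogorov superposition theorem that this model family builds on expresses any continuous function on the n-dimensional unit cube with two "hidden layers" of univariate functions (the paper's constructions for discontinuous and unbounded functions differ in their details):

```latex
f(x_1,\dots,x_n) \;=\; \sum_{q=0}^{2n} g_q\!\left( \sum_{p=1}^{n} \phi_{q,p}(x_p) \right)
```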
Zhen Zhang, Mohammed Haroon Dupty, Fan Wu + 1 more
ArXiv
This work derives an efficient approximate Sum-Product loopy belief propagation inference algorithm for discrete higher-order PGMs, and neuralizes the novel message passing scheme into a Factor Graph Neural Network (FGNN) module by allowing richer representations of the message update rules, which facilitates both efficient inference and powerful end-to-end learning.
A survey of confidence calibration problems in the context of neural networks and an empirical comparison of calibration methods are presented; the problem statement, calibration definitions, and different approaches to evaluation are analyzed.
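One of the standard evaluation metrics such a survey covers is the expected calibration error (ECE); here is a minimal sketch with illustrative binning choices:

```python
import numpy as np

def expected_calibration_error(confidences, correct, n_bins=10):
    """Expected Calibration Error: bin predictions by confidence and average
    the gap between each bin's accuracy and mean confidence, weighted by bin
    size.  One of several calibration metrics; equal-width binning is only
    one possible choice."""
    confidences = np.asarray(confidences)
    correct = np.asarray(correct, dtype=float)
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    ece = 0.0
    for lo, hi in zip(edges[:-1], edges[1:]):
        mask = (confidences > lo) & (confidences <= hi)
        if mask.any():
            gap = abs(correct[mask].mean() - confidences[mask].mean())
            ece += mask.mean() * gap
    return ece
```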
Xingyi Yang, Jingwen Ye, Xinchao Wang
ArXiv
An information-theoretic objective, InfoMax-Bottleneck (IMB), is introduced to carry out KF by optimizing the mutual information between the learned representations and the input, and the derived factor networks yield gratifying performances on not only the dedicated tasks but also disentanglement, while enjoying much better interpretability and modularity.
Man Yao, Guangshe Zhao, Hengyu Zhang + 5 more
IEEE Transactions on Pattern Analysis and Machine Intelligence
This work demonstrates the potential of SNNs as a general backbone to support various applications in the field of SNN research, with a good balance between effectiveness and energy efficiency.
Grégoire Delétang, Anian Ruoss, Jordi Grau-Moya + 6 more
ArXiv
It is demonstrated that grouping tasks according to the Chomsky hierarchy allows us to forecast whether certain architectures will be able to generalize to out-of-distribution inputs, including negative results where even extensive amounts of data and training time never lead to any non-trivial generalization.
Takaaki Fujita
ArXiv
The theoretical foundation for the development of SuperHyperGraph Neural Networks (SHGNNs) and Plithogenic Graph Neural Networks is established, expanding the applicability of neural networks to these advanced graph structures.
Daniel Filan, Stephen Casper, Shlomi Hod + 3 more
ArXiv
It is found that a trained neural network is typically more clusterable than randomly initialized networks, and often clusterable relative to random networks with the same distribution of weights.
E. Agliari, Andrea Alessandrelli, Adriano Barra + 2 more
journal unavailable
The common thread behind the recent Nobel Prize in Physics to John Hopfield and those conferred to Giorgio Parisi in 2021 and Philip Anderson in 1977 is disorder. Quoting Philip Anderson: "more is different". This principle has been extensively demonstrated in magnetic systems and spin glasses, and, in this work, we test its validity on Hopfield neural networks to show how an assembly of these models displays emergent capabilities that are not present at a single network level. Such an assembly is designed as a layered associative Hebbian network that, beyond accomplishing standard pattern recognition...
Quincy Hershey, Randy Paffenroth, Harsh Nilesh Pathak
2023 International Conference on Machine Learning and Applications (ICMLA)
Sparse parameterizations are found to better realize the potential of RNNs, significantly improving the stability and expressiveness of model performance across a wider array of hyperparameters while improving performance differentials at significantly reduced weight counts.
Atticus Geiger, Hanson Lu, Thomas F. Icard + 1 more
journal unavailable
It is discovered that a BERT-based model with state-of-the-art performance successfully realizes parts of the natural logic model's causal structure, whereas a simpler baseline model fails to show any such structure, demonstrating that BERT representations encode the compositional structure of MQNLI.
Seyed Masoud Ghoreishi Mokri, Newsha Valadbeygi, Khafaji Mohammed Balyasimovich
International Journal of Innovative Science and Research Technology (IJISRT)
This examination underscores the potential of artificial intelligence models utilizing neural networks in diagnosing cases requiring gastric surgery.
Masanari Kimura, Ryotaro Shimizu, Yuki Hirakawa + 2 more
ArXiv
It is shown that Deep Sets, one of the well-known permutation-invariant neural networks, can be generalized in the sense of a quasi-arithmetic mean, and the behavior of Deep Sets is sensitive to the choice of the aggregation function.
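A quasi-arithmetic mean replaces the usual sum/mean aggregation of Deep Sets with f_inv(mean(f(x_i))) for an invertible f; a minimal sketch (function names are illustrative, not the paper's notation):

```python
import numpy as np

def quasi_arithmetic_mean(values, f, f_inv):
    """Quasi-arithmetic mean M_f(x) = f_inv( mean( f(x_i) ) ).
    With f the identity this is the ordinary mean used in Deep Sets;
    other choices of f give different permutation-invariant aggregators."""
    return f_inv(np.mean(f(np.asarray(values)), axis=0))

# Illustrative choices: geometric mean via f = log, harmonic mean via f = 1/x.
geo = quasi_arithmetic_mean([1.0, 4.0, 16.0], np.log, np.exp)                 # = 4.0
harm = quasi_arithmetic_mean([1.0, 2.0, 4.0], lambda x: 1.0 / x, lambda x: 1.0 / x)  # ~ 1.714
```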
NGNN is a plug-and-play framework that can be combined with various base GNNs and it is proved that NGNN can discriminate almost all r-regular graphs, where 1-WL always fails.
Kyriakos Georgiou, Constantinos Siettos, A. Yannacopoulos
ArXiv
The proposed methodology provides insight into the connection between neural networks and classical numerical methods, and it is believed that it can have applications in fields such as Uncertainty Quantification and explainable artificial intelligence (XAI).
Amey Thakur
International Journal for Research in Applied Science and Engineering Technology
The purpose of this study is to familiarise the reader with the foundations of neural networks and highlight the different learning approaches and algorithms used in Machine Learning and Deep Learning.
Z. Fang
IEEE Transactions on Neural Networks and Learning Systems
This is the first work in which a machine-learning PDE solver has a convergence rate, as in classical numerical methods; it can also be applied to inverse problems and surface PDEs, although without proof.
Timothy T. Duignan
ACS Physical Chemistry Au
Equivariant neural network potentials are a breakthrough new tool that are already enabling us to simulate systems at the molecular scale with unprecedented accuracy and speed, relying on nothing but fundamental physical laws.