Top Research Papers on Computer Vision
Explore the top research papers on computer vision and stay ahead of the curve in this exciting field. From object detection to image recognition and beyond, these papers offer invaluable insights and advancements. Whether you're a student, researcher, or industry professional, our curated collection will keep you informed and inspired.
This undergraduate textbook-reference comprehensively examines computer vision techniques, analysis, and real-world applications in which they are used.
Computer vision in surgery
144 Citations 2020 Thomas M. Ward, Pietro Mascagni, Yutong Ban + 4 more
Surgery
The development and refining of deep neural networks that can now accurately identify objects in images and remember past surgical events has sparked a surge in the applications of CV to analyze intraoperative video and has allowed for the accurate identification of surgical phases (steps) and instruments across a variety of procedures.
Vision Transformers in medical computer vision—A contemplative retrospection
239 Citations 2023 Arshi Parvaiz, Muhammad Anwaar Khalid, Rukhsana Zafar + 3 more
Engineering Applications of Artificial Intelligence
This review investigates the intersection of vision transformers and medical images, gives an overview of the ViT-based frameworks that researchers use to tackle obstacles in medical computer vision, and discusses possible directions for future work.
Computer Vision Techniques in Manufacturing
187 Citations 2022 Longfei Zhou, Zhang Li, Nicholas Konz
IEEE Transactions on Systems Man and Cybernetics Systems
A comprehensive review of the state of the art of computer vision techniques and their applications in manufacturing industries, covering the most common methods, such as feature detection, recognition, segmentation, and three-dimensional modeling.
Computer Vision and Image Understanding
768 Citations 2022 Wael Saideni, Fabien Courrèges, David Helbert + 1 more
SSRN Electronic Journal
In this work, we study a complete framework of Video Compressive Sensing (VCS), from capturing a sequence of video frames in one single compressed measurement to reconstructing the original frames. To our best knowledge, we present the first end-to-end sampling and recovery network built upon video Transformers, widely explored in vision related tasks, to capture long-range spatio-temporal relations. Our proposed Video Transformer for Snapshot Compressive Imaging recovery (ViT-SCI) is based on Spatio-temporal Convolutional Multi-Head Attention (ST-ConvMHA) which is an extended version of the f...
Fashion Meets Computer Vision
138 Citations 2021 Wen-Huang Cheng, Sijie Song, Chieh-Yun Chen + 2 more
ACM Computing Surveys
A comprehensive survey of more than 200 major fashion-related works covering four main aspects for enabling intelligent fashion and highlighting promising directions for future research.
Deep Learning in Computer Vision
190 Citations 2020 M. Hassaballah, Ali Ismail Awad
journal unavailable
Deep learning algorithms have brought a revolution to the computer vision community by introducing non-traditional and efficient solutions to several image-related problems that had long remained unclear or difficult to solve.
Machine Learning in Computer Vision
201 Citations 2020 Asharul Islam Khan, Salim Al-Habsi
Procedia Computer Science
During the last few years, computer applications have undergone a dramatic transformation from simple data processing to machine learning, thanks to the availability and accessibility of huge volumes of data collected through sensors and the internet. The idea of machine learning demonstrates and propagates the fact that a computer has the ability to improve itself with the passage of time. Western countries have shown great interest in machine learning, computer vision, and pattern recognition by organizing conferences, workshops, collective discussion, experimentation, and real life impl...
Modern computing: Vision and challenges
139 Citations 2024 Sukhpal Singh Gill, Huaming Wu, Panos Patros + 22 more
Telematics and Informatics Reports
This comprehensive review of modern computing systems looks ahead to the future of research in the field, highlighting key challenges and emerging trends, and underscoring their importance in cost-effectively driving technological progress.
Serverless Edge Computing: Vision and Challenges
198 Citations 2021 Mohammad Sadegh Aslanpour, Adel N. Toosi, Claudio Cicconetti + 7 more
journal unavailable
This paper presents an in-depth analysis that promotes a broad vision for bringing serverless computing to the edge and identifies the major challenges that serverless must meet before it can enter edge computing.
An Overview of the Attention Mechanisms in Computer Vision
123 Citations 2020 Xiao Yang
Journal of Physics Conference Series
Focusing on the models of attention mechanisms commonly used in computer vision, their categorizations, principles, and outlook are summarized in this overview.
Computer Vision and Pattern Recognition 2020
647 Citations 2021 Zeynep Akata, Andreas Geiger, Torsten Sattler
International Journal of Computer Vision
This special issue covers a wide range of topics from the area of Computer Vision, Pattern Recognition, and Machine Learning. This breadth of scope is reflected by the papers included in this special issue, which touch topics including geometric Computer Vision, medical image processing, physical scene understanding, and interpretability of deep neural networks. This special issue consists of extended versions of the best papers originally presented at the 42nd German Conference on Pattern Recognition (DAGM GCPR 2020), held virtually between September 28th and October 1st, 2020. This special issu...
Generative Adversarial Networks in Computer Vision
238 Citations 2021 Zhengwei Wang, Qi She, Tomás Ward
ACM Computing Surveys
The objective is to provide an overview as well as a critical analysis of the status of GAN research in terms of relevant progress toward critical computer vision application requirements and discuss the most compelling applications in computer vision.
Attention mechanisms in computer vision: A survey
2188 Citations 2022 Meng-Hao Guo, Tian-Xing Xu, Jiangjiang Liu + 7 more
Computational Visual Media
This survey provides a comprehensive review of various attention mechanisms in computer vision and categorizes them according to approach, such as channel attention, spatial attention, temporal attention, and branch attention.
In-sensor dynamic computing for intelligent machine vision
135 Citations 2024 Yuekun Yang, Chen Pan, Yixiang Li + 10 more
Nature Electronics
The correlated optoelectronic characteristics of multi-terminal mixed-dimensional graphene–germanium heterostructure devices can be used for the accurate detection and robust tracking of dim targets.
Tensor Methods in Computer Vision and Deep Learning
160 Citations 2021 Yannis Panagakis, Jean Kossaifi, Grigorios G. Chrysos + 4 more
Proceedings of the IEEE
This article provides an in-depth and practical review of tensors and tensor methods in the context of representation learning and deep learning, with a particular focus on visual data analysis and computer vision applications.
Smart Traffic Monitoring System Using Computer Vision and Edge Computing
100 Citations 2021 Guanxiong Liu, Hang Shi, Abbas Kiani + 5 more
IEEE Transactions on Intelligent Transportation Systems
This paper proposes a two-tier edge computing based model that takes into account both the limited computing capability in cloudlets and the unstable network condition to the TMC, and shows that the proposed hybrid edge-cloud solution outperforms both the cloud-only and edge-only solutions.
Applications of fractional calculus in computer vision: A survey
135 Citations 2022 Sugandha Arora, Trilok Mathur, Shivi Agarwal + 2 more
Neurocomputing
Fractional calculus is an abstract idea exploring interpretations of differentiation having non-integer order. For a very long time, it was considered as a topic of mere theoretical interest. However, the introduction of several useful definitions of fractional derivatives has extended its domain to applications. Supported by computational power and algorithmic representations, fractional calculus has emerged as a multifarious domain. It has been found that the fractional derivatives are capable of incorporating memory into the system and thus suitable to improve the performance of locality-aw...
A review of computer vision technologies for plant phenotyping
351 Citations 2020 Zhenbo Li, Ruohao Guo, Meng Li + 2 more
Computers and Electronics in Agriculture
This paper extensively reviews 200+ papers on plant phenotyping in light of its technical evolution, spanning over twenty years, including imaging technologies, plant datasets, and state-of-the-art phenotyping methods.
Deep learning-enabled medical computer vision
1178 Citations 2021 Andre Esteva, Katherine Chou, Serena Yeung + 7 more
npj Digital Medicine
Recent progress in the development of modern computer vision techniques—powered by deep learning—for medical applications, focusing on medical imaging, medical video, and clinical deployment is surveyed.
Deep Learning for Computer Vision: A Brief Review
102 Citations 2022 Ksheera R Shetty, Vaibhav S Soorinje, Prinson Dsouza + 1 more
International Journal of Advanced Research in Science Communication and Technology
Over the last years deep learning methods have been shown to outperform previous state-of-the-art machine learning techniques in several fields, with computer vision being one of the most prominent cases. This review paper provides a brief overview of some of the most significant deep learning schemes used in computer vision problems, that is, Convolutional Neural Networks, Deep Boltzmann Machines and Deep Belief Networks, and Stacked Denoising Autoencoders. A brief account of their history, structure, advantages, and limitations is given, followed by a description of their applications in var...
Computer Vision Techniques in Construction: A Critical Review
385 Citations 2020 Shuyuan Xu, Jun Wang, Wenchi Shou + 3 more
Archives of Computational Methods in Engineering
This research aims to guide practitioners to successfully find suitable approaches for a particular project, with a focus on state-of-the-art methods in a typical vision-based scheme.
Florence: A New Foundation Model for Computer Vision
339 Citations 2021 Lu Yuan, Dongdong Chen, Yi‐Ling Chen + 20 more
arXiv (Cornell University)
Automated visual understanding of our diverse and open world demands computer vision models to generalize well with minimal customization for specific tasks, similar to human vision. Computer vision foundation models, which are trained on diverse, large-scale dataset and can be adapted to a wide range of downstream tasks, are critical for this mission to solve real-world computer vision applications. While existing vision foundation models such as CLIP, ALIGN, and Wu Dao 2.0 focus mainly on mapping images and textual representations to a cross-modal shared representation, we introduce a new co...
Deep learning and computer vision will transform entomology
417 Citations 2021 Toke T. Høye, Johanna Ärje, Kim Bjerge + 7 more
Proceedings of the National Academy of Sciences
Most animal species on Earth are insects, and recent reports suggest that their abundance is in drastic decline. Although these reports come from a wide range of insect taxa and regions, the evidence to assess the extent of the phenomenon is sparse. Insect populations are challenging to study, and most monitoring methods are labor intensive and inefficient. Advances in computer vision and deep learning provide potential new solutions to this global challenge. Cameras and other sensors can effectively, continuously, and noninvasively perform entomological observations throughout diurnal and sea...
A review of convolutional neural networks in computer vision
710 Citations 2024 Xia Zhao, Limin Wang, Yufei Zhang + 3 more
Artificial Intelligence Review
An elementary understanding of CNN components and their functions, including input layers, convolution layers, pooling layers, activation functions, batch normalization, dropout, fully connected layers, and output layers, is presented.
Computer vision-based construction progress monitoring
142 Citations 2022 Varun Kumar Reja, Koshy Varghese, Q. P. Ha
Automation in Construction
Automating the process of construction progress monitoring through computer vision can enable effective control of projects. Systematic classification of available methods and technologies is necessary to structure this complex, multi-stage process. Using the PRISMA framework, relevant studies in the area were identified. The various concepts, tools, technologies, and algorithms reported by these studies were iteratively categorised, developing an integrated process framework for Computer-Vision-Based Construction Progress Monitoring (CV-CPM). This framework comprises: data acquisition and 3D-...
Deep reinforcement learning in computer vision: a comprehensive survey
224 Citations 2021 Ngan Le, Vidhiwar Singh Rathour, Kashu Yamazaki + 2 more
Artificial Intelligence Review
This work provides a detailed review of recent and state-of-the-art research advances of deep reinforcement learning in computer vision, proposes a categorization of deep reinforcement learning methodologies, and discusses their advantages and limitations.
Review of Weed Detection Methods Based on Computer Vision
262 Citations 2021 Zhangnan Wu, Yajun Chen, Bo Zhao + 2 more
Sensors
This review elaborates the two aspects of using traditional image-processing methods and deep learning-based methods to solve weed detection problems and provides an overview of various methods for weed detection in recent years.
Dynamic machine vision with retinomorphic photomemristor-reservoir computing
108 Citations 2023 Hongwei Tan, Sebastiaan van Dijken
Nature Communications
In this system, a retinomorphic photomemristor array, working as dynamic vision reservoir, embeds past motion frames as hidden states into the present frame through inherent dynamic memory, which facilitates accurate recognition of past and prediction of future motions with machine learning algorithms.
COVID-19 Control by Computer Vision Approaches: A Survey
128 Citations 2020 Anwaar Ulhaq, Jannis Born, Asim Khan + 3 more
IEEE Access
This survey paper is intended to provide a preliminary review of the available literature on computer vision efforts against the COVID-19 pandemic, and to make it available to computer vision researchers to save precious time.
Computer Vision to Automatically Assess Infant Neuromotor Risk
102 Citations 2020 Claire Chambers, Nidhi Seethapathi, Rachit Saluja + 6 more
IEEE Transactions on Neural Systems and Rehabilitation Engineering
This work automatically extracts body poses and movement kinematics from the videos of at-risk infants and calculates how much they deviate from a group of healthy infants using Naïve Gaussian Bayesian Surprise.
Automated estimation of cementitious sorptivity via computer vision
114 Citations 2024 Hossein Kabir, Jordan Wu, Sunav Dahal + 2 more
Nature Communications
Monitoring water uptake in cementitious systems is crucial to assess their durability against corrosion, salt attack, and freeze-thaw damage. However, gauging absorption currently relies on labor-intensive and infrequent weight measurements, as outlined in ASTM C1585. To address this issue, we introduce a custom computer vision model trained on 6234 images, consisting of 4000 real and 2234 synthetic, that automatically detects the water level in prismatic samples absorbing water. This model provides accurate and frequent estimations of water penetration values every minute. After training the ...
Computer vision for pattern detection in chromosome contact maps
136 Citations 2020 Cyril Matthey-Doret, Lyam Baudry, Axel Breuer + 13 more
Nature Communications
Chromosomes of all species studied so far display a variety of higher-order organisational features, such as self-interacting domains or loops. These structures, which are often associated to biological functions, form distinct, visible patterns on genome-wide contact maps generated by chromosome conformation capture approaches such as Hi-C. Here we present Chromosight, an algorithm inspired from computer vision that can detect patterns in contact maps. Chromosight has greater sensitivity than existing methods on synthetic simulated data, while being faster and applicable to any type of genome...
Computer vision in surgery: from potential to clinical value
154 Citations 2022 Pietro Mascagni, Deepak Alapatt, Luca Sestini + 10 more
npj Digital Medicine
Current CV techniques that have been applied to minimally invasive surgery and their clinical applications are reviewed, and the challenges and obstacles that remain to be overcome for broader implementation and adoption of CV in surgery are discussed.
Large image datasets: A pyrrhic win for computer vision?
238 Citations 2021 Abeba Birhane, Vinay Uday Prabhu
journal unavailable
This paper performs a cross-sectional model-based quantitative census covering factors such as age, gender, NSFW content scoring, class-wise accuracy, human-cardinality-analysis, and the semanticity of the image class information in order to statistically investigate the extent and subtleties of ethical transgressions.
Computer Vision Applications in Intelligent Transportation Systems: A Survey
115 Citations 2023 Esma Dilek, Murat Dener
Sensors
This survey shows how computer vision techniques can help transportation systems become smarter, presenting a holistic picture of the literature on different CV applications in the ITS context by bringing together research from various sources.
Advances in solar forecasting: Computer vision with deep learning
114 Citations 2023 Quentin Paletta, Guillermo Terrén-Serrano, Yuhao Nie + 6 more
Advances in Applied Energy
Renewable energy forecasting is crucial for integrating variable energy sources into the grid. It allows power systems to address the intermittency of the energy supply at different spatiotemporal scales. To anticipate the future impact of cloud displacements on the energy generated by solar facilities, conventional modeling methods rely on numerical weather prediction or physical models, which have difficulties in assimilating cloud information and learning systematic biases. Augmenting computer vision with machine learning overcomes some of these limitations by fusing real-time cloud cover o...
Overview: Computer Vision and Machine Learning for Microstructural Characterization and Analysis
215 Citations 2020 Elizabeth A. Holm, Ryan Cohn, Nan Gao + 4 more
Metallurgical and Materials Transactions A
This overview surveys CV methods for numerically encoding the visual information contained in a microstructural image using either feature-based representations or convolutional neural network layers, which then provides input to supervised or unsupervised ML algorithms that find associations and trends in the high-dimensional image representation.
Computer Vision for Autonomous Vehicles: Problems, Datasets and State of the Art
221 Citations 2020 Joel Janai, Fatma Güney, Aseem Behl + 1 more
journal unavailable
Recent years have witnessed enormous progress in AI-related fields such as computer vision, machine learning, and autonomous vehicles. As with any rapidly growing field, it becomes increasingly difficult to stay up-to-date or enter the field as a beginner. While several survey papers on particular sub-problems have appeared, no comprehensive survey on problems, datasets, and methods in computer vision for autonomous vehicles has been published. This monograph attempts to narrow this gap by providing a survey on the state-of-the-art datasets and techniques. Our survey includes both the historic...
An overview of Human Action Recognition in sports based on Computer Vision
133 Citations 2022 Kristina Host, Marina Ivašić-Kos
Heliyon
The main contribution is an overview of HAR applications in sports based primarily on computer vision, presented along with popular publicly available datasets for this purpose, including actions of everyday activities.