Home / Papers / Top Research Papers on OCR

Top Research Papers on OCR

Explore our curated selection of the top research papers on OCR. From foundational theories to cutting-edge applications, these papers offer valuable insights into Optical Character Recognition technology. Whether you're a researcher, developer, or enthusiast, this collection provides a deep dive into the latest advancements and innovations. Expand your knowledge and stay updated with the leading-edge research in OCR.

Looking for research-backed answers?Try AI Search

TrOCR: Transformer-Based Optical Character Recognition with Pre-trained Models

318 Citations 2023

Minghao Li, Tengchao Lv, Jingye Chen + 6 more

Proceedings of the AAAI Conference on Artificial Intelligence

The TrOCR model is simple but effective, and can be pre-trained with large-scale synthetic data and fine-tuned with human-labeled datasets, and outperforms the current state-of-the-art models on the printed, handwritten and scene text recognition tasks.

Construction of Statistical SVM based Recognition Model for Handwritten Character Recognition

139 Citations 2021

Yasir Babiker Hamdan, Sathish

Journal of Information Technology and Digital World

Analysis SVM for OCR system performance that is providing a good result that is configured with machine learning approach has proved Statistical SVM for OCR system performance that is providing a good result that is configured with machine learning approach.

PP-OCR: A Practical Ultra Lightweight OCR System

107 Citations 2020

Yuning Du, Chenxia Li, Ruoyu Guo + 8 more

arXiv (Cornell University)

This paper proposes a practical ultra lightweight OCR system, i.e., PP-OCR, with an overall model size of only 3.5M, and introduces a bag of strategies to either enhance the model ability or reduce the model size.

TextScanner: Reading Characters in Order for Robust Scene Text Recognition

151 Citations 2020

Zhaoyi Wan, Minghang He, Hao Chen + 2 more

Proceedings of the AAAI Conference on Artificial Intelligence

TextScanner bears three characteristics: it belongs to the semantic segmentation family, as it generates pixel-wise, multi-channel segmentation maps for character class, position and order, and also adopts RNN for context modeling.

Multinational License Plate Recognition Using Generalized Character Sequence Detection

140 Citations 2020

Chris Henry, Sung-Yoon Ahn, Sang-Woong Lee

IEEE Access

This study presents a deep ALPR system designed to be applicable to multinational LPs, mainly based on the you only look once (YOLO) networks, and proposes a layout detection algorithm that can extract the correct sequence of LP numbers from multinational LLP.

A new Arabic handwritten character recognition deep learning system (AHCR-DLS)

109 Citations 2020

Hossam Magdy Balaha, Hesham Ali, Mohamed S. Saraya + 1 more

Neural Computing and Applications

A deep learning (DL) system with two convolutional neural network (CNN) architectures (named HMB1 and HMB2); with the appliance of optimization, regularization, and dropout techniques is introduced to serve as a baseline for future research on handwritten Arabic text.

Survey of Post-OCR Processing Approaches

244 Citations 2021

Thi Tuyet Haï Nguyen, Adam Jatowt, Mickaël Coustaty + 1 more

ACM Computing Surveys

The importance of enhancing quality of OCR results by studying their effects on information retrieval and natural language processing applications is clarified by defining the post-OCR processing problem, illustrating its typical pipeline, and reviewing the state-of-the-art post- OCR processing approaches.

Assessing the Impact of OCR Quality on Downstream NLP Tasks

113 Citations 2020

Daniel van Strien, Kaspar Beelen, Mariona Coll Ardanuy + 3 more

journal unavailable

A series of extrinsic assessment tasks are performed using popular, out-of-the-box tools in order to quantify the impact of OCR quality on these tasks, finding a consistent impact resulting from OCR errors on downstream tasks with some tasks more irredeemably harmed by O CR errors.

Distributed Optical Fiber Sensing Intrusion Pattern Recognition Based on GAF and CNN

131 Citations 2020

Chengang Lyu, Ziqiang Huo, Xin Cheng + 3 more

Journal of Lightwave Technology

An intrusion pattern recognition scheme based on Gramian Angular Field (GAF) and convolutional neural network (CNN) for the dual Mach–Zehnder Interference (DMZI) distributed fiber perimeter security system, which has the advantages of fast recognition speed and high recognition accuracy rate.

From OCR and ECAR to energy: Perspectives on the design and interpretation of bioenergetics studies

184 Citations 2021

Cameron A. Schmidt, Kelsey H. Fisher‐Wellman, P. Darrell Neufer

Journal of Biological Chemistry

This review enumerates various important considerations for designing and interpreting cellular and mitochondrial bioenergetics experiments, some common challenges and pitfalls in data interpretation, and some potential “next steps” to be taken that can address these highlighted challenges.

Automated recognition of optical image based potato leaf blight diseases using deep learning

118 Citations 2021

Kulendu Kashyap Chakraborty, Rashmi Mukherjee, Chandan Chakroborty + 1 more

Physiological and Molecular Plant Pathology

The Potato crop (Solanum tuberosum L.) is one of the most important vegetable food crop grown globally. The yield of potato crop is greatly hampered both in quality and quantity by fungal blight diseases which pose a major threat to the global food security. Late blight caused by Phytophthora infestans and early blight caused by Alternaria solani are the most devastating foliage diseases for potato crops. In reality, the farmers presume such disorders by visualizing mainly the color change in the potato leaves that is usually risky due to subjectivity and huge time consumption. Under such situ...

Phage T7 DNA mimic protein Ocr is a potent inhibitor of BREX defence

101 Citations 2020

Artem Isaev, Alena Drobiazko, Nicolas Sierro + 5 more

Nucleic Acids Research

It is reported that T7 bacteriophage Ocr, a DNA mimic protein that protects the phage from the defensive action of type I restriction–modification systems, is also active against BREX.

Neuro-inspired optical sensor array for high-accuracy static image recognition and dynamic trace extraction

124 Citations 2023

Peiyu Huang, Biyi Jiang, Hongji Chen + 9 more

Nature Communications

A neuro-inspired optical sensor array based on two-dimensional NbS2/MoS2 hybrid films featured remarkable photo-induced conductance plasticity and low electrical energy consumption and was experimentally implemented such that the post-processing could yield a high restoration accuracy.

The Thought and Character of William James

355 Citations 2020

Ralph Barton Perry

Vanderbilt University Press eBooks

When it was originally published in a two-volume edition in 1935, Ralph Barton Perry's magisterial work on William James was greeted with much critical acclaim. A briefer one-volume edition was published in 1947 to serve as both a systematic account of James's development and a repository of selections from his unpublished writings. The one-volume work (which forms the basis for this new paperback edition) offers a brief and convenient sourcebook of James's thought, set forth in terms that require no previous familiarity with technical problems of philosophy and psychology. An anthology of wel...

Extraction and Analysis of Fictional Character Networks

110 Citations 2022

Vincent Labatut

HAL (Le Centre pour la Communication Scientifique Directe)

A character network is a graph extracted from a narrative, in which vertices represent characters and edges correspond to interactions between them. A number of narrative-related problems can be addressed automatically through the analysis of character networks, such as summarization, classification, or role detection. Character networks are particularly relevant when considering works of fictions (e.g. novels, plays, movies, TV series), as their exploitation allows developing information retrieval and recommendation systems. However, works of fiction possess specific properties making these t...

Character controllers using motion VAEs

244 Citations 2020

Hung Yu Ling, Fabio Zinno, George G. Cheng + 1 more

ACM Transactions on Graphics

This work uses deep reinforcement learning to learn controllers that achieve goal-directed movements in data-driven generative models of human movement using autoregressive conditional variational autoencoders, or Motion VAEs.

Real-time deep dynamic characters

104 Citations 2021

Marc Habermann, Lingjie Liu, Weipeng Xu + 3 more

ACM Transactions on Graphics

We propose a deep videorealistic 3D human character model displaying highly realistic shape, motion, and dynamic appearance learned in a new weakly supervised way from multi-view imagery. In contrast to previous work, our controllable 3D character displays dynamics, e.g., the swing of the skirt, dependent on skeletal body motion in an efficient data-driven way, without requiring complex physics simulation. Our character model also features a learned dynamic texture model that accounts for photo-realistic motion-dependent appearance details, as well as view-dependent lighting effects. During tr...

Face Mask Recognition System with YOLOV5 Based on Image Recognition

209 Citations 2020

Guanhao Yang, Wei Feng, Jintao Jin + 4 more

journal unavailable

The experimental results show that the algorithm proposed in this paper can effectively recognize face masks and realize the effective monitoring of personnel, and YOLOV5, the most powerful objection detection algorithm at present, is effective in the actual environment.

A Survey of Speaker Recognition: Fundamental Theories, Recognition Methods and Opportunities

114 Citations 2021

Md. Mohsin Kabir, M. F. Mridha, Jungpil Shin + 2 more

IEEE Access

This literature survey gives a concise introduction to ASR and provides an overview of the general architectures dealing with speaker recognition technologies, and upholds the past, present, and future research trends in this area.

Handbook of Fingerprint Recognition

233 Citations 2022

Davide Maltoni, Dario Maio, Anil K. Jain + 1 more

journal unavailable

With their distinctiveness and stability over time, fingerprints continue to be the most widely used anatomical characteristic in systems that automatically recognize a person's identity. This fully updated third edition provides in-depth coverage of the state-of-the-art in fingerprint recognition readers, feature extraction, and matching algorithms and applications. Deep learning (resurgence beginning around 2012) has been a game changer for artificial intelligence and, in particular, computer vision and biometrics. Performance improvements (both recognition accuracy and speed) for most biome...

Biomimetic carbohydrate recognition

152 Citations 2020

Anthony P. Davis

Chemical Society Reviews

This tutorial review describes work on synthetic receptors which bind carbohydrates through non-covalent interactions, mimicking the strategies used in biology, and augurs well for real-world applications.

InSight Constraints on the Global Character of the Martian Crust

135 Citations 2022

M. A. Wieczorek, Adrien Broquet, S. M. McLennan + 23 more

Journal of Geophysical Research Planets

Abstract Analyses of seismic data from the InSight mission have provided the first in situ constraints on the thickness of the crust of Mars. These crustal thickness constraints are currently limited to beneath the lander that is located in the northern lowlands, and we use gravity and topography data to construct global crustal thickness models that satisfy the seismic data. These models consider a range of possible mantle and core density profiles, a range of crustal densities, a low‐density surface layer, and the possibility that the crustal density of the northern lowlands is greater than ...

Text Recognition in the Wild

176 Citations 2021

Xiaoxue Chen, Lianwen Jin, Yuanzhi Zhu + 2 more

ACM Computing Surveys

This literature review attempts to present an entire picture of the field of scene text recognition, which provides a comprehensive reference for people entering this field and could be helpful in inspiring future research.

The Hedonic Character of Nostalgia: An Integrative Data Analysis

108 Citations 2020

Joost M. Leunissen, Tim Wildschut, Constantine Sedikides + 1 more

Emotion Review

We conducted an integrative data analysis to examine the hedonic character of nostalgia. We combined positive and negative affect measures from 41 experiments manipulating nostalgia ( N = 4,659). Overall, nostalgia inductions increased positive and ambivalent affect, but did not significantly alter negative affect. The magnitude of nostalgia’s effects varied markedly across different experimental inductions of the emotion. The hedonic character of nostalgia, then, depends on how the emotion is elicited and the benchmark (i.e., control condition) to which it is compared. We discuss implications...

Nasionalism: Character Education Orientation in Learning Development

128 Citations 2021

Dian Arief Pradana, Mahfud Mahfud, Candra Hermawan + 1 more

Budapest International Research and Critics Institute (BIRCI-Journal) Humanities and Social Sciences

The values of nationalism can be learned from educational materials that are oriented towards character development. Character education is part of a revolutionary zone among students, therefore character education is a very important part to be urgently developed in the minds of the student head because students are the next generation who will lead the nation and state. Character learning that is oriented towards the values of nationalism can overcome various social problems. Building character for the current generation is one of the goals of national education. The Ministry of National Edu...

Comparing Recognition Performance and Robustness of Multimodal Deep Learning Models for Multimodal Emotion Recognition

351 Citations 2022

Wei Liu, Jielin Qiu, Wei‐Long Zheng + 1 more

IEEE Transactions on Cognitive and Developmental Systems

By visualizing features before and after DCCA transformation on the SEED-V data set, it is found that the transformed features are more homogeneous and discriminative across emotions.

Recognition and inhibition of SARS-CoV-2 by humoral innate immunity pattern recognition molecules

152 Citations 2022

Matteo Stravalaci, Isabel Pagani, Elvezia Maria Paraboschi + 26 more

Nature Immunology

It is suggested that selected humoral fluid-phase PRMs can play an important role in resistance to, and pathogenesis of, COVID-19, a finding with translational implications.

A Review of Face Recognition Technology

326 Citations 2020

Lixiang Li, Xiaohui Mu, Siying Li + 1 more

IEEE Access

Face recognition has become the future development direction and has many potential application prospects and is introduced in the general evaluation standards and the general databases of face recognition.

Multiview Transformers for Video Recognition

265 Citations 2022

Yan Shen, Xuehan Xiong, Anurag Arnab + 4 more

2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

This work presents Multiview Transformers for Video Recognition (MTV), a model that consists of separate encoders to represent different views of the input video with lateral connections to fuse information across views and achieves state-of-the-art results on six standard datasets.

Optical RAM and integrated optical memories: a survey

165 Citations 2020

Theoni Alexoudi, George T. Kanellos, Nikos Pleros

Light Science & Applications

State-of-the-art integrated optical memory technologies and optical RAM cell demonstrations describing the physical mechanisms of several key devices along with their performance metrics in terms of their energy, speed and footprint are reviewed.

A survey of music emotion recognition

112 Citations 2022

Donghong Han, Yanru Kong, Jiayi Han + 1 more

Frontiers of Computer Science

The knowledge and algorithms involved in each part are introduced with detailed analysis, including some commonly used datasets, emotion models, feature extraction, and emotion recognition algorithms.

Meta-optics for spatial optical analog computing

151 Citations 2020

Sajjad Abdollahramezani, Omid Hemmatyar, Ali Adibi

Nanophotonics

This review discusses state-of-the-art developments, as well as emerging trends, in computational metastructures as disruptive platforms for spatial optical analog computation and discusses two fundamental approaches based on general concepts of spatial Fourier transformation and Green’s function.

Automatic speech recognition: a survey

313 Citations 2020

Mishaim Malik, Muhammad Kamran Malik, Khawar Mehmood + 1 more

Multimedia Tools and Applications

This study explores different feature extraction methods, state-of-the-art classification models, and vis-a-vis their impact on an ASR.

Adversarial Examples on Object Recognition

130 Citations 2020

Alex Serban, Erik Poll, Joost Visser

ACM Computing Surveys

The hypotheses behind their existence, the methods used to construct or protect against them, and the capacity to transfer adversarial examples between different machine learning models are introduced.

The formation, character and changing nature of mesoscale convective systems

264 Citations 2020

Russ S. Schumacher, Kristen L. Rasmussen

Nature Reviews Earth & Environment

Mesoscale convective systems (MCSs) describe organized groupings of thunderstorms in the tropics and mid-latitudes that span thousands of square kilometres. While recognized for over a century, the advent of satellite and radar observations, as well as atmospheric-model simulations, has brought about their increased understanding. In this Review, we synthesize current knowledge on MCS formation, climatological characteristics, hazardous weather, predictive capacity and projected changes with anthropogenic warming. Driven by typical deep moist convective processes (moisture, lift and instabilit...

Retracted: IRIS Recognition System

147 Citations 2021

Kavita Borkar, Suresh Salankar

2021 IEEE International Conference on Mobile Networks and Wireless Communications (ICMNWC)

The main aim of the research work is to discuss the various methods and techniques used till now by different research scholars for iris recognition and various steps required in iris Recognition process.

Automatic Number Plate Recognition

119 Citations 2024

Shaik Nehar

American Journal of Electronics & Communication

Experimental results show that the identification accuracy of the proposed ANPR exceeds 95%, demonstrating its potential in practical applications and suggestions for future research to improve stability and efficiency.

Pedestrian attribute recognition: A survey

106 Citations 2021

Xiao Wang, Shaofei Zheng, Rui Yang + 4 more

Pattern Recognition

The background of pedestrian attribute recognition (PAR), including the fundamental concepts of pedestrian attributes and corresponding challenges are introduced, and existing benchmarks, including popular datasets and evaluation criterion are introduced.

Face Recognition: A Literature Review

120 Citations 2023

Gabriela Laura Sălăgean, Monica Leba

International Scientific Conference ITEMA. Recent Advances in Information Technology, Tourism, Economics, Management and Agriculture

This paper proposes a review of research on face recognition techniques, algorithms and existing applications, with their advantages and disadvantages, and presents future directions of development in the field of face recognition.

Kaldi Speech Recognition Toolkit

4893 Citations 2024

Daniel Povey

journal unavailable

The design of Kaldi is described, a free, open-source toolkit for speech recognition research that provides a speech recognition system based on finite-state automata together with detailed documentation and a comprehensive set of scripts for building complete recognition systems.