Top Research Papers on OCR
Explore our curated selection of the top research papers on OCR. From foundational theories to cutting-edge applications, these papers offer valuable insights into Optical Character Recognition technology. Whether you're a researcher, developer, or enthusiast, this collection provides a deep dive into the latest advancements and innovations. Expand your knowledge and stay updated with the leading-edge research in OCR.
TrOCR: Transformer-Based Optical Character Recognition with Pre-trained Models
318 Citations · 2023 · Minghao Li, Tengchao Lv, Jingye Chen + 6 more
Proceedings of the AAAI Conference on Artificial Intelligence
The TrOCR model is simple but effective: it can be pre-trained with large-scale synthetic data, fine-tuned with human-labeled datasets, and outperforms the current state-of-the-art models on printed, handwritten, and scene text recognition tasks.
Construction of Statistical SVM based Recognition Model for Handwritten Character Recognition
139 Citations · 2021 · Yasir Babiker Hamdan, Sathish
Journal of Information Technology and Digital World
The analysis shows that a statistical SVM, configured with a machine learning approach, provides good performance for an OCR system.
PP-OCR: A Practical Ultra Lightweight OCR System
107 Citations · 2020 · Yuning Du, Chenxia Li, Ruoyu Guo + 8 more
arXiv (Cornell University)
This paper proposes a practical ultra lightweight OCR system, i.e., PP-OCR, with an overall model size of only 3.5M, and introduces a bag of strategies to either enhance the model ability or reduce the model size.
TextScanner: Reading Characters in Order for Robust Scene Text Recognition
151 Citations · 2020 · Zhaoyi Wan, Minghang He, Hao Chen + 2 more
Proceedings of the AAAI Conference on Artificial Intelligence
TextScanner bears three characteristics: it belongs to the semantic segmentation family, as it generates pixel-wise, multi-channel segmentation maps for character class, position and order, and also adopts RNN for context modeling.
Multinational License Plate Recognition Using Generalized Character Sequence Detection
140 Citations · 2020 · Chris Henry, Sung-Yoon Ahn, Sang-Woong Lee
IEEE Access
This study presents a deep ALPR system designed to be applicable to multinational LPs, mainly based on You Only Look Once (YOLO) networks, and proposes a layout detection algorithm that can extract the correct sequence of LP numbers from multinational LPs.
A new Arabic handwritten character recognition deep learning system (AHCR-DLS)
109 Citations · 2020 · Hossam Magdy Balaha, Hesham Ali, Mohamed S. Saraya + 1 more
Neural Computing and Applications
A deep learning (DL) system with two convolutional neural network (CNN) architectures (named HMB1 and HMB2), applying optimization, regularization, and dropout techniques, is introduced to serve as a baseline for future research on handwritten Arabic text.
Survey of Post-OCR Processing Approaches
244 Citations · 2021 · Thi Tuyet Haï Nguyen, Adam Jatowt, Mickaël Coustaty + 1 more
ACM Computing Surveys
The importance of enhancing quality of OCR results by studying their effects on information retrieval and natural language processing applications is clarified by defining the post-OCR processing problem, illustrating its typical pipeline, and reviewing the state-of-the-art post-OCR processing approaches.
Assessing the Impact of OCR Quality on Downstream NLP Tasks
113 Citations · 2020 · Daniel van Strien, Kaspar Beelen, Mariona Coll Ardanuy + 3 more
journal unavailable
A series of extrinsic assessment tasks are performed using popular, out-of-the-box tools in order to quantify the impact of OCR quality on these tasks, finding a consistent impact resulting from OCR errors on downstream tasks with some tasks more irredeemably harmed by OCR errors.
Distributed Optical Fiber Sensing Intrusion Pattern Recognition Based on GAF and CNN
131 Citations · 2020 · Chengang Lyu, Ziqiang Huo, Xin Cheng + 3 more
Journal of Lightwave Technology
An intrusion pattern recognition scheme based on Gramian Angular Field (GAF) and convolutional neural network (CNN) is presented for the dual Mach–Zehnder Interference (DMZI) distributed fiber perimeter security system; it has the advantages of fast recognition speed and a high recognition accuracy rate.
From OCR and ECAR to energy: Perspectives on the design and interpretation of bioenergetics studies
184 Citations · 2021 · Cameron A. Schmidt, Kelsey H. Fisher‐Wellman, P. Darrell Neufer
Journal of Biological Chemistry
This review enumerates various important considerations for designing and interpreting cellular and mitochondrial bioenergetics experiments, some common challenges and pitfalls in data interpretation, and some potential “next steps” to be taken that can address these highlighted challenges.
Automated recognition of optical image based potato leaf blight diseases using deep learning
118 Citations · 2021 · Kulendu Kashyap Chakraborty, Rashmi Mukherjee, Chandan Chakroborty + 1 more
Physiological and Molecular Plant Pathology
The Potato crop (Solanum tuberosum L.) is one of the most important vegetable food crop grown globally. The yield of potato crop is greatly hampered both in quality and quantity by fungal blight diseases which pose a major threat to the global food security. Late blight caused by Phytophthora infestans and early blight caused by Alternaria solani are the most devastating foliage diseases for potato crops. In reality, the farmers presume such disorders by visualizing mainly the color change in the potato leaves that is usually risky due to subjectivity and huge time consumption. Under such situ...
Phage T7 DNA mimic protein Ocr is a potent inhibitor of BREX defence
101 Citations · 2020 · Artem Isaev, Alena Drobiazko, Nicolas Sierro + 5 more
Nucleic Acids Research
It is reported that T7 bacteriophage Ocr, a DNA mimic protein that protects the phage from the defensive action of type I restriction–modification systems, is also active against BREX.
Neuro-inspired optical sensor array for high-accuracy static image recognition and dynamic trace extraction
124 Citations · 2023 · Peiyu Huang, Biyi Jiang, Hongji Chen + 9 more
Nature Communications
A neuro-inspired optical sensor array based on two-dimensional NbS2/MoS2 hybrid films featured remarkable photo-induced conductance plasticity and low electrical energy consumption and was experimentally implemented such that the post-processing could yield a high restoration accuracy.
The Thought and Character of William James
355 Citations · 2020 · Ralph Barton Perry
Vanderbilt University Press eBooks
When it was originally published in a two-volume edition in 1935, Ralph Barton Perry's magisterial work on William James was greeted with much critical acclaim. A briefer one-volume edition was published in 1947 to serve as both a systematic account of James's development and a repository of selections from his unpublished writings. The one-volume work (which forms the basis for this new paperback edition) offers a brief and convenient sourcebook of James's thought, set forth in terms that require no previous familiarity with technical problems of philosophy and psychology. An anthology of wel...
Extraction and Analysis of Fictional Character Networks
110 Citations · 2022 · Vincent Labatut
HAL (Le Centre pour la Communication Scientifique Directe)
A character network is a graph extracted from a narrative, in which vertices represent characters and edges correspond to interactions between them. A number of narrative-related problems can be addressed automatically through the analysis of character networks, such as summarization, classification, or role detection. Character networks are particularly relevant when considering works of fictions (e.g. novels, plays, movies, TV series), as their exploitation allows developing information retrieval and recommendation systems. However, works of fiction possess specific properties making these t...
Character controllers using motion VAEs
244 Citations · 2020 · Hung Yu Ling, Fabio Zinno, George G. Cheng + 1 more
ACM Transactions on Graphics
This work uses deep reinforcement learning to learn controllers that achieve goal-directed movements in data-driven generative models of human movement using autoregressive conditional variational autoencoders, or Motion VAEs.
Real-time deep dynamic characters
104 Citations · 2021 · Marc Habermann, Lingjie Liu, Weipeng Xu + 3 more
ACM Transactions on Graphics
We propose a deep videorealistic 3D human character model displaying highly realistic shape, motion, and dynamic appearance learned in a new weakly supervised way from multi-view imagery. In contrast to previous work, our controllable 3D character displays dynamics, e.g., the swing of the skirt, dependent on skeletal body motion in an efficient data-driven way, without requiring complex physics simulation. Our character model also features a learned dynamic texture model that accounts for photo-realistic motion-dependent appearance details, as well as view-dependent lighting effects. During tr...
Face Mask Recognition System with YOLOV5 Based on Image Recognition
209 Citations · 2020 · Guanhao Yang, Wei Feng, Jintao Jin + 4 more
journal unavailable
The experimental results show that the algorithm proposed in this paper can effectively recognize face masks and realize the effective monitoring of personnel, and that YOLOV5, the most powerful object detection algorithm at present, is effective in the actual environment.
A Survey of Speaker Recognition: Fundamental Theories, Recognition Methods and Opportunities
114 Citations · 2021 · Md. Mohsin Kabir, M. F. Mridha, Jungpil Shin + 2 more
IEEE Access
This literature survey gives a concise introduction to ASR, provides an overview of the general architectures dealing with speaker recognition technologies, and highlights the past, present, and future research trends in this area.
Handbook of Fingerprint Recognition
233 Citations · 2022 · Davide Maltoni, Dario Maio, Anil K. Jain + 1 more
journal unavailable
With their distinctiveness and stability over time, fingerprints continue to be the most widely used anatomical characteristic in systems that automatically recognize a person's identity. This fully updated third edition provides in-depth coverage of the state-of-the-art in fingerprint recognition readers, feature extraction, and matching algorithms and applications. Deep learning (resurgence beginning around 2012) has been a game changer for artificial intelligence and, in particular, computer vision and biometrics. Performance improvements (both recognition accuracy and speed) for most biome...
This tutorial review describes work on synthetic receptors which bind carbohydrates through non-covalent interactions, mimicking the strategies used in biology, and augurs well for real-world applications.
InSight Constraints on the Global Character of the Martian Crust
135 Citations · 2022 · M. A. Wieczorek, Adrien Broquet, S. M. McLennan + 23 more
Journal of Geophysical Research Planets
Analyses of seismic data from the InSight mission have provided the first in situ constraints on the thickness of the crust of Mars. These crustal thickness constraints are currently limited to beneath the lander that is located in the northern lowlands, and we use gravity and topography data to construct global crustal thickness models that satisfy the seismic data. These models consider a range of possible mantle and core density profiles, a range of crustal densities, a low‐density surface layer, and the possibility that the crustal density of the northern lowlands is greater than ...
Text Recognition in the Wild
176 Citations · 2021 · Xiaoxue Chen, Lianwen Jin, Yuanzhi Zhu + 2 more
ACM Computing Surveys
This literature review attempts to present an entire picture of the field of scene text recognition, which provides a comprehensive reference for people entering this field and could be helpful in inspiring future research.
The Hedonic Character of Nostalgia: An Integrative Data Analysis
108 Citations · 2020 · Joost M. Leunissen, Tim Wildschut, Constantine Sedikides + 1 more
Emotion Review
We conducted an integrative data analysis to examine the hedonic character of nostalgia. We combined positive and negative affect measures from 41 experiments manipulating nostalgia (N = 4,659). Overall, nostalgia inductions increased positive and ambivalent affect, but did not significantly alter negative affect. The magnitude of nostalgia’s effects varied markedly across different experimental inductions of the emotion. The hedonic character of nostalgia, then, depends on how the emotion is elicited and the benchmark (i.e., control condition) to which it is compared. We discuss implications...
Nasionalism: Character Education Orientation in Learning Development
128 Citations · 2021 · Dian Arief Pradana, Mahfud Mahfud, Candra Hermawan + 1 more
Budapest International Research and Critics Institute (BIRCI-Journal) Humanities and Social Sciences
The values of nationalism can be learned from educational materials that are oriented towards character development. Character education is part of a revolutionary zone among students, therefore character education is a very important part to be urgently developed in the minds of the student head because students are the next generation who will lead the nation and state. Character learning that is oriented towards the values of nationalism can overcome various social problems. Building character for the current generation is one of the goals of national education. The Ministry of National Edu...
Comparing Recognition Performance and Robustness of Multimodal Deep Learning Models for Multimodal Emotion Recognition
351 Citations · 2022 · Wei Liu, Jielin Qiu, Wei‐Long Zheng + 1 more
IEEE Transactions on Cognitive and Developmental Systems
By visualizing features before and after DCCA transformation on the SEED-V data set, it is found that the transformed features are more homogeneous and discriminative across emotions.
Recognition and inhibition of SARS-CoV-2 by humoral innate immunity pattern recognition molecules
152 Citations · 2022 · Matteo Stravalaci, Isabel Pagani, Elvezia Maria Paraboschi + 26 more
Nature Immunology
It is suggested that selected humoral fluid-phase PRMs can play an important role in resistance to, and pathogenesis of, COVID-19, a finding with translational implications.
A Review of Face Recognition Technology
326 Citations · 2020 · Lixiang Li, Xiaohui Mu, Siying Li + 1 more
IEEE Access
Face recognition has become a future development direction with many potential application prospects; the review introduces the general evaluation standards and the general databases of face recognition.
Multiview Transformers for Video Recognition
265 Citations · 2022 · Yan Shen, Xuehan Xiong, Anurag Arnab + 4 more
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
This work presents Multiview Transformers for Video Recognition (MTV), a model that consists of separate encoders to represent different views of the input video with lateral connections to fuse information across views and achieves state-of-the-art results on six standard datasets.
Optical RAM and integrated optical memories: a survey
165 Citations · 2020 · Theoni Alexoudi, George T. Kanellos, Nikos Pleros
Light Science & Applications
State-of-the-art integrated optical memory technologies and optical RAM cell demonstrations describing the physical mechanisms of several key devices along with their performance metrics in terms of their energy, speed and footprint are reviewed.
A survey of music emotion recognition
112 Citations · 2022 · Donghong Han, Yanru Kong, Jiayi Han + 1 more
Frontiers of Computer Science
The knowledge and algorithms involved in each part are introduced with detailed analysis, including some commonly used datasets, emotion models, feature extraction, and emotion recognition algorithms.
Meta-optics for spatial optical analog computing
151 Citations · 2020 · Sajjad Abdollahramezani, Omid Hemmatyar, Ali Adibi
Nanophotonics
This review discusses state-of-the-art developments, as well as emerging trends, in computational metastructures as disruptive platforms for spatial optical analog computation and discusses two fundamental approaches based on general concepts of spatial Fourier transformation and Green’s function.
Automatic speech recognition: a survey
313 Citations · 2020 · Mishaim Malik, Muhammad Kamran Malik, Khawar Mehmood + 1 more
Multimedia Tools and Applications
This study explores different feature extraction methods and state-of-the-art classification models, along with their impact on an ASR system.
Adversarial Examples on Object Recognition
130 Citations · 2020 · Alex Serban, Erik Poll, Joost Visser
ACM Computing Surveys
The hypotheses behind the existence of adversarial examples, the methods used to construct or protect against them, and their capacity to transfer between different machine learning models are introduced.
The formation, character and changing nature of mesoscale convective systems
264 Citations · 2020 · Russ S. Schumacher, Kristen L. Rasmussen
Nature Reviews Earth & Environment
Mesoscale convective systems (MCSs) describe organized groupings of thunderstorms in the tropics and mid-latitudes that span thousands of square kilometres. While recognized for over a century, the advent of satellite and radar observations, as well as atmospheric-model simulations, has brought about their increased understanding. In this Review, we synthesize current knowledge on MCS formation, climatological characteristics, hazardous weather, predictive capacity and projected changes with anthropogenic warming. Driven by typical deep moist convective processes (moisture, lift and instabilit...
Retracted: IRIS Recognition System
147 Citations · 2021 · Kavita Borkar, Suresh Salankar
2021 IEEE International Conference on Mobile Networks and Wireless Communications (ICMNWC)
The main aim of the research work is to discuss the various methods and techniques used so far by different researchers for iris recognition and the various steps required in the iris recognition process.
Automatic Number Plate Recognition
119 Citations · 2024 · Shaik Nehar
American Journal of Electronics & Communication
Experimental results show that the identification accuracy of the proposed ANPR exceeds 95%, demonstrating its potential in practical applications; suggestions are given for future research to improve stability and efficiency.
Pedestrian attribute recognition: A survey
106 Citations · 2021 · Xiao Wang, Shaofei Zheng, Rui Yang + 4 more
Pattern Recognition
The background of pedestrian attribute recognition (PAR), including the fundamental concepts of pedestrian attributes and the corresponding challenges, is introduced, and existing benchmarks, including popular datasets and evaluation criteria, are reviewed.
Face Recognition: A Literature Review
120 Citations · 2023 · Gabriela Laura Sălăgean, Monica Leba
International Scientific Conference ITEMA. Recent Advances in Information Technology, Tourism, Economics, Management and Agriculture
This paper presents a review of research on face recognition techniques, algorithms, and existing applications, with their advantages and disadvantages, and outlines future directions of development in the field of face recognition.
The design of Kaldi is described, a free, open-source toolkit for speech recognition research that provides a speech recognition system based on finite-state automata together with detailed documentation and a comprehensive set of scripts for building complete recognition systems.