Top Research Papers on Data Engineering
Dive into a curated selection of the most influential research papers on Data Engineering. This collection covers groundbreaking approaches, methodologies, and applications that are shaping the future of this critical field. Expand your knowledge and keep up with the latest trends and innovations in Data Engineering.
Looking for research-backed answers?Try AI Search
Data-Driven Science and Engineering
564 Citations 2022Steven L. Brunton, J. Nathan Kutz
Cambridge University Press eBooks
Data-driven discovery is revolutionizing how we model, predict, and control complex systems. Now with Python and MATLAB®, this textbook trains mathematical scientists and engineers for the next generation of scientific discovery by offering a broad overview of the growing intersection of data-driven methods, machine learning, applied optimization, and classical fields of engineering mathematics and mathematical physics. With a focus on integrating dynamical systems modeling and control with modern methods in applied machine learning, this text includes methods that were chosen for their releva...
Data engineering for fraud detection
135 Citations 2021Bart Baesens, Sebastiaan Höppner, Tim Verdonck
Decision Support Systems
This work proposes several data engineering techniques to improve the performance of an analytical model while retaining the interpretability property, and illustrates the improvement in performance of these data engineering steps for popular analytical models on a real payment transactions data set.
The R Language: An Engine for Bioinformatics and Data Science
200 Citations 2022Federico M. Giorgi, Carmine Ceraolo, Daniele Mercatelli
Life
An historical chronicle of how R became what it is today is provided, describing all its current features and capabilities, and the role of R in science in general as a driver for reproducibility is discussed.
Automated data processing and feature engineering for deep learning and big data applications: A survey
102 Citations 2024Alhassan Mumuni, Fuseini Mumuni
Journal of Information and Intelligence
A thorough review of approaches for automating data processing tasks in deep learning pipelines, including automated data preprocessing, as well as data augmentation (including synthetic data generation using generative AI methods and feature engineering), and the use of AutoML methods and tools to simultaneously optimize all stages of the machine learning pipeline are presented.
Low-N protein engineering with data-efficient deep learning
383 Citations 2021Surojit Biswas, Grigory Khimulya, Ethan C. Alley + 2 more
Nature Methods
A machine learning-guided paradigm that can use as few as 24 functionally assayed mutant sequences to build an accurate virtual fitness landscape and screen ten million sequences via in silico directed evolution is introduced.
Google Earth Engine for geo-big data applications: A meta-analysis and systematic review
1181 Citations 2020Haifa Tamiminia, Bahram Salehi, Masoud Mahdianpari + 3 more
ISPRS Journal of Photogrammetry and Remote Sensing
A meta-analysis investigation of recent peer-reviewed GEE articles focusing on several features, including data, sensor type, study area, spatial resolution, application, strategy, and analytical methods confirmed that GEE has and continues to make substantive progress on global challenges involving process of geo-big data.
Sentinel-1 SAR Backscatter Analysis Ready Data Preparation in Google Earth Engine
335 Citations 2021Adugna Mullissa, Andreas Vollrath, Christelle Odongo-Braun + 5 more
Remote Sensing
A framework for preparing Sentinel-1 SAR backscatter Analysis-Ready-Data in the Google Earth engine that combines existing and new Google Earth Engine implementations for additional border noise correction, speckle filtering and radiometric terrain normalization is presented.
Extracting accurate materials data from research papers with conversational language models and prompt engineering
249 Citations 2024Maciej P. Polak, Dane Morgan
Nature Communications
This work proposes the ChatExtract method, a method that can fully automate very accurate data extraction with minimal initial effort and background, using an advanced conversational LLM, and shows that approaches similar to ChatExtract are likely to become powerful tools for data extraction in the near future.
Sustainable industrial and operation engineering trends and challenges Toward Industry 4.0: a data driven analysis
316 Citations 2021Ming‐Lang Tseng, Thi Phuong Thuy Tran, Hiền Minh Hà + 2 more
Journal of Industrial and Production Engineering
This study supplies contributions to the existing literature with a state-of-the-art bibliometric review of sustainable industrial and operation engineering as the field moves toward Industry 4.0, and guidance for future studies and practical achievements. Although industrial and operation engineering is being promoted forward to sustainability, the systematization of the knowledge that forms firms’ manufacturing and operations and encompasses their wide concepts and abundant complementary elements is still absent. This study aims to analyze contemporary sustainable industrial and operations e...
Google Earth Engine Cloud Computing Platform for Remote Sensing Big Data Applications: A Comprehensive Review
1035 Citations 2020Meisam Amani, Arsalan Ghorbanian, Seyed Ali Ahmadi + 9 more
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing
This study aims to comprehensively explore different aspects of the GEE platform, including its datasets, functions, advantages/limitations, and various applications, and observed that Landsat and Sentinel datasets were extensively utilized by GEE users.
A XGBoost-Based Lane Change Prediction on Time Series Data Using Feature Engineering for Autopilot Vehicles
103 Citations 2022Yi Zhang, Xiupeng Shi, Sheng Zhang + 1 more
IEEE Transactions on Intelligent Transportation Systems
A lane change prediction framework for feature learning, with the aim to have a deep and comprehensive understanding of lane change behaviors, and reach a high performance based on the selected features.
Rapid and robust monitoring of flood events using Sentinel-1 and Landsat data on the Google Earth Engine
459 Citations 2020Ben DeVries, Chengquan Huang, John Armston + 3 more
Remote Sensing of Environment
An algorithm is presented that exploits all available Sentinel-1 SAR images in combination with historical Landsat and other auxiliary data sources hosted on the GEE to rapidly map surface inundation during flood events, relying on multi-temporal SAR statistics to identify unexpected floods in near real-time.
Combining machine learning and process engineering physics towards enhanced accuracy and explainability of data-driven models
177 Citations 2020Timur Bikmukhametov, Johannes Jäschke
Computers & Chemical Engineering
By adding physics-based models to machine learning, it is possible not only to improve the performance of the purely black-box machine learning models, but also to make them more transparent and interpretable.
Aircraft engine remaining useful life estimation via a double attention-based data-driven architecture
235 Citations 2022Lu Liu, Xiao Song, Zhetao Zhou
Reliability Engineering & System Safety
Remaining useful life (RUL) estimation has been intensively studied, given its important role in prognostics and health management (PHM) of industry. Recently, data-driven structures such as convolutional neural networks (CNNs), have achieved outstanding RUL prediction performance. However, conventional CNNs do not include an adequate mechanism for adaptively weighing input features. In this paper, we propose a double attention-based data-driven framework for aircraft engine RUL prognostics. Specifically, a channel attention-based CNN was utilized to apply greater weights to more significant f...
Engineering Is Elementary: An Engineering And Technology Curriculum For Children
162 Citations 2020Kate Hester, Christine M. Cunningham
journal unavailable
As our society becomes increasingly dependent on engineering and technology, it is more important than ever that everyone have a basic understanding of what engineers do, and the uses and implications of the technologies they create.Yet few citizens are technologically literate, in large part because technology and engineering are not taught in our schools 1 .Just as it is important to begin science instruction in the elementary grades by building on children's curiosity about the natural world, it's important to begin engineering instruction in elementary school by building on children's natu...
Atomically engineered, high-speed non-volatile flash memory device exhibiting multibit data storage operations
116 Citations 2023Ghulam Dastgeer, Sobia Nisar, Aamir Rasheed + 6 more
Nano Energy
Non-volatile memory devices, which offer large capacity and mechanical dependability as a mainstream technology, have played a key role in fostering innovation in modern electronics. Despite the advantages of non-volatile memory devices, their low ON/OFF ratio and slow operational speed have limited their performance compared to their volatile counterparts. In this study, we present a non-volatile floating-gate memory device based on van der Waals heterostructures , which exhibits ultrahigh-speed memory operations in the range of a hundred nanoseconds with an extinction ratio of up to 10 6 . T...
Dynamic predictive maintenance for multiple components using data-driven probabilistic RUL prognostics: The case of turbofan engines
108 Citations 2023Mihaela Mitici, Ingeborg de Pater, Anne Barros + 1 more
Reliability Engineering & System Safety
The increasing availability of condition-monitoring data for components/systems has incentivized the development of data-driven Remaining Useful Life (RUL) prognostics in the past years. However, most studies focus on point RUL prognostics, with limited insights into the uncertainty associated with these estimates. This limits the applicability of such RUL prognostics to maintenance planning, which is per definition a stochastic problem. In this paper, we therefore develop probabilistic RUL prognostics using Convolutional Neural Networks. These prognostics are further integrated into maintenan...
Abstract NOTE: The first page of text has been automatically extracted and included below in lieu of an abstract Session 2480 Illuminating Engineering Laura J. Bottomley and Elizabeth A. Parry North Carolina State University/Science Surround Abstract Engineering is a difficult profession to explain to the average person, much less student, and is probably one of the most frequently misunderstood. The session described in this paper was developed to put engineering in common terms for the lay person, as well as provide an interesting and fun way to explore different concentration areas of the p...
Aerodynamics for Engineers
350 Citations 2025John J. Bertin, Mike L. Smith
Cambridge University Press eBooks
Revised and expanded to reflect cutting-edge innovation in aerodynamics, and packed with new features to support learning, the seventh edition of this classic textbook introduces the fundamentals of aerodynamics using clear explanations and real-world examples. Structured around clear learning objectives, this is the ideal textbook for undergraduate students in aerospace engineering, and for graduate students and professional engineers seeking a readable and accessible reference. Over 10 new Aerodynamics Computation boxes that bring students up to speed on modern computational approaches for p...
Aerodynamics for Engineers
174 Citations 2021John J. Bertin, Russell M. Cummings
Cambridge University Press eBooks
Now reissued by Cambridge University Press, this sixth edition covers the fundamentals of aerodynamics using clear explanations and real-world examples. Aerodynamics concept boxes throughout showcase real-world applications, chapter objectives provide readers with a better understanding of the goal of each chapter and highlight the key 'take-home' concepts, and example problems aid understanding of how to apply core concepts. Coverage also includes the importance of aerodynamics to aircraft performance, applications of potential flow theory to aerodynamics, high-lift military airfoils, subsoni...