If you are eager to deepen your knowledge of XGBoost, this list of top research papers should be on your reading list. Gain valuable insights into this powerful machine learning algorithm that has revolutionized data science. Whether you're a beginner or an expert, these papers will provide essential understanding and advanced techniques, helping you to effectively leverage XGBoost in your projects.
Looking for research-backed answers?Try AI Search
Kartina Diah Kesuma Wardhani, Memen Akbar
Jurnal Online Informatika
The machine learning technique used to predict diabetes in this study is extreme gradient boosting (XGBoost), an advanced implementation of gradient boosting along with multiple regularization factors to accurately predict target variables by combining simpler and weaker model set estimations.
Muamar Mohamed, Farhad E. Mahmood, Mehmmood A. Abd + 3 more
2023 IEEE Industry Applications Society Annual Meeting (IAS)
This study focuses on the development of an electrical demand forecasting model using machine learning techniques, specifically Long Short-Term Memory (LSTM) and eXtreme Gradient Boosting (XGBoost), and uses data collected from wireless sensors installed in the city.
Priyanka Sharma, Mayank Jain
2023 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS)
Results show that the XGBOOST model is efficient in generating future stock prices and performs with a mean squared error of 0.004, mean absolute error of 0.014 and R2 score of 0.995.
Dylan Norbert Gono, H. Napitupulu, Firdaniza
Mathematics
This article presents a study on forecasting silver prices using the extreme gradient boosting (XGBoost) machine learning method with hyperparameter tuning, and finds that the proposed models exhibited the best performance.
Chibuzor N Obiora, Ahmed Ali, Ali N. Hasan
2021 IEEE PES/IAS PowerAfrica
Globally, the consistent clamor by environmentalists for the need to mitigate the effects of climate change has necessitated the adoption of renewable energy sources (RES) for use by many developed/developing nations. The approach is geared towards gradually replacing or reducing the use of fossil fuels for electric power production. Solar power is among the major renewable energy sources in use today. But the use of solar energy is, unfortunately, characterized by fluctuations in its power generation due to the unpredictability of solar irradiance. Despite many methods in use already, accurat...
Albi Mulyadi Sapari, Asep Id Hadiana, F. R. Umbara
Innovation in Research of Informatics (INNOVATICS)
This study aims to classify air quality into three labels or categories: good, moderate, and unhealthy, using XGBoost, a development of the Gradient Decision Tree with several advantages, such as high scalability and prevention of overfitting.
Iqbal Hanif
Proceedings of the Proceedings of the 1st International Conference on Statistics and Analytics, ICSA 2019, 2-3 August 2019, Bogor, Indonesia
XGBoost algorithm gives a better prediction than LogReg algorithm does based on its prediction accuracy, specificity, sensitivity and ROC curve and has a better capability to separate churned customers from not-churned customers than LogReg model does according to KS chart and Gains-Lift charts produced by each algorithm.
Weiwen He, Hongyuan He, Fanglin Wang + 4 more
Analytical Letters
Abstract Ripe fruit provides essential nutrients for the human body. To fulfill the needs of consumers, the practice of artificial ripening has become more common. Artificial ripening not only degrades the quality of fruit but also impairs health. The potential of hyperspectral imaging coupled with machine learning to quickly and uninvasively identify differently ripened bananas was explored in this study. A total of 300 banana samples that were naturally ripened or ripened by ethephon or calcium carbide were characterized by their hyperspectral images. To improve the accuracy of classificatio...
R. Vijay, S. Manoj, V. Ravikanth + 2 more
journal unavailable
This specific task focuses on anomaly-based society intrusion detection by employing XGBoost algorithm on KDD CUP 1999 info positioned to get the ideal outcomes.
Shubo Wu, Q. Yuan, Zhongwei Yan + 1 more
Journal of Advanced Transportation
Vehicle to vulnerable road user (VRU) crashes occupy a large proportion of traffic crashes in China, and crash injury severity analysis can support traffic managers to understand the implicit rules behind the crashes. Therefore, 554 VRUs-involved crashes are collected from January, 2017, to February, 2021, in a city in northern China, including 322 vehicle-pedestrian crashes and 232 vehicle-bicycle crashes. First, a descriptive statistical analysis is conducted to investigate the characteristics of VRUs-involved crashes. Second, the extreme gradient boosting (XGBoost) model is introduced to id...
Diah Asmawati, Lukman Arif Sanjani, Christiant Dimas Renggana + 2 more
Journal of Technology and Informatics (JoTI)
This study aims to fill the research gap by creating an ECG classification model to detect arrhythmia using the XGBoost algorithm, and the results are quite good for each class, but are still considered not better than those models.
Kartina Diah Kesuma Wardhani, Wenda Novayani
Scientific Journal of Informatics
By demonstrating how PCA can improve the efficiency and accuracy of prediabetes prediction models, this research provides valuable insights to inform future studies and contribute to the development of more effective diagnostic tools for early detection and prevention of prediabetes.
Kartina Diah Kusuma, Memen Akbar
Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)
Extreme gradient boosting is the machine learning technique to predict diabetes (xgboost) using Feature Importance XGBoost using UCI Machine Learning to produce a model that can be used by medical staff to predict and identify diabetes in patients.
J. P. Haumahu, S. Permana, Y. Yaddarabullah
IOP Conference Series: Materials Science and Engineering
The result of this study shows that the machine learning model created using XGBoost has an accuracy value of 89%, with the precision value of 90% and recall value 80%.
Muhammad Talha Ashraf, Isma Hamid, Qamar Nawaz + 1 more
2023 International Multi-disciplinary Conference in Emerging Research Trends (IMCERT)
An Evolutionary Algorithm named as Group Counseling Optimizer (GCO) in tandem with Extreme gradient boost (XGBoost) to classify cancer in microarray data is employed.
Sri Elina Herni, Yulianti, O. Soesanto + 1 more
Journal of Mathematics Theory and Application
It is proved that the use of algorithms with hyperparameter tuning can improve the performance of eXtreme Gradient Boosting algorithm in the process of classification of credit card customers with an accuracy of 80.039%, precision of 81.338% and a recall value of 96.854%.
Dennis A-L Mariadass, E. Moung, Mai Sufian + 1 more
2022 12th International Conference on Computer and Knowledge Engineering (ICCKE)
This paper proposes to use the XGBoost model for annual crop yield prediction in Malaysia and shows promising results with 0.98 R-Squared value and outperformed the current models.
Peng Liu, Wei Fan
Promet - Traffic&Transportation
The results show that the XGBoost model outperforms the IDM in terms of prediction errors and the analysis of the feature importance reveals that the longitudinal position has the greatest influence on vehicle trajectory prediction results.
Opitasari Opitasari, Fauzan Natsir, Ega Shela Marsiani
Jurnal Ilmiah FIFO
Kanker serviks yang juga biasa dikenal dengan kanker mulut leher rahim merupakan satu dari beberapa jenis penyakit kanker yang mematikan pada wanita setelah kanker payudara. Menurut survei WHO, dari total kasus kanker di Indonesia, 9,2% kasus di antaranya adalah kanker serviks dengan jumlah 36.633 kasus. Sulitnya menentukan gejala awal pada kanker serviks dikarenakan gejala yang timbul tidak kasat mata sehingga banyak sekali kasus terlambat penanganan pada pasien penderita penyakit ini. Penelitian dilakukan dengan metode XGBoost untuk mengklasifikasi gejala awal penyakit kanker serviks dengan ...
Alber S. Aziz, Haitham Rizk Fadlallah
International Journal of Advances in Applied Computational Intelligence
To accurately estimate HCV using sparse weather information, this work offers two machine learning methods: The Support Vector Machine (SVM) and a simple tree-based ensemble approach called Extreme Gradient Boosting (XGBoost) that are applied to real-world data on HCV.
Noman Shabbir, O. Husev, Kamran Daniel + 3 more
2024 IEEE 18th International Conference on Compatibility, Power Electronics and Power Engineering (CPE-POWERENG)
This research introduces a new approach for predicting short-term DC power loads using the XgBoost algorithm that utilizes weak learners to build robust prediction models using weaker learning combinations and shows the ability to capture complex non-linear patterns based on real-life residential DC load data.
Cengiz Zopluoglu
Educational and Psychological Measurement
The utility of XGBoost in detecting examinees with potential item preknowledge is investigated using a real data set that includes examinees who engaged in fraudulent testing behavior, such as illegally obtaining live test content before the exam.
Iyad Lahsen Cherif, A. Kortebi
2019 Wireless Days (WD)
This work considers a supervised approach, namely eXtreme Gradient Boosting (XGBoost) algorithm, which has never been investigated for TC, and obtains 99.5% accuracy on a dataset containing real flows.
Hossein Khajezadeh, Morteza Savaedi Pour, M. Manthouri
2024 10th International Conference on Control, Instrumentation and Automation (ICCIA)
The results demonstrate that XGBoost outperforms other models in analyzing sensor data and identifying motion errors and improper robot performance, making it an effective tool in developing AI-driven robots.
Risma Moulidya Syafei, Devi Ajeng Efrilianda
Recursive Journal of Informatics
It can be concluded that the implementation of XGBoost Feature Importance can be used to improve LightGBM's performance in bankruptcy prediction, and this study aims to implement a machine learning model using the Light Gradientboosting Machine (LightGBM) classification algorithm which is optimized using Extreme Gradient Boosting (XGBeost) Feature Importation to increase the accuracy of bankruptcy prediction.
Yulistiani Yulistiani, Styawati Styawati
Jurnal Informatika: Jurnal Pengembangan IT
This research takes Twitter data to see public opinion on presidential candidates, and determines the process of digital text analysis and the application of the XGBOOST method to Twitter user sentiment in two categories (positive and negative) and three categories (positive, negative and neutral) for each candidate.
Bin Yu, Wenying Qiu, Cheng Chen + 4 more
Bioinformatics
SubMito-XGBoost has obtained satisfactory prediction results by the leave-one-out-cross-validation (LOOCV) compared with existing methods and achieves satisfactory predictive performance on plant and non-plant protein submitochondrial datasets.
M. G. Raja, S. Jeyalaksshmi
J. Interconnect. Networks
In this paper, Self-Configuration and Self-healing Framework using an extreme gradient boosting (XGBoost) Classifier are proposed and the results exhibit that the proposed framework has higher packet delivery ratio with reduced packet drops and computational cost.
R. Kumar, G. S
Advances in Science, Technology and Engineering Systems Journal
This work proposes a malware classification scheme that constructs a model using low-end computing resources and a very large balanced dataset for malware and gives improved performance for accuracy with the tuning of the hyperparameter and achieve higher accuracy.
Da-ping Yu, Zhidong Liu, C. Su + 6 more
Thoracic Cancer
The main cause of cancer death is lung cancer (LC) which usually presents at an advanced stage, but its early detection would increase the benefits of treatment. Blood is particularly favored in clinical research given the possibility of using it for relatively noninvasive analyses. Copy number variation (CNV) is a common genetic change in tumor genomes, and many studies have indicated that CNV‐derived cell‐free DNA (cfDNA) from plasma could be feasible as a biomarker for cancer diagnosis.
Vabiyana Safira Desdhanty, Z. Rustam
2021 International Conference on Decision Aid Sciences and Application (DASA)
This paper will be focusing on the implementation of Genetic Algorithm as feature selection when applied to the widely used machine learning algorithm Random Forest and Extreme Gradient Boosting for cancer classification and the result will show that with 20% testing data, XGBoost with Genetic Al algorithm asfeature selection gives the highest accuracy score.
C. A. E. Piter, Setiwan Hadi, I. Yulita
2021 International Conference on Artificial Intelligence and Big Data Analytics
This study aims to classify the information data on scientific conferences in Indonesia with a total of 1005 data into three labels, namely Economics, Science and Engineering, and Social Studies using the Extreme Gradient Boost (XGBoost) method.
I. Wayan, Gede Suacana, Didik Suhariyanto + 1 more
International Journal Software Engineering and Computer Science (IJSECS)
The research results show that the XGBoost algorithm achieves high accuracy, precision, recall, and F1 score, demonstrating its ability to classify sounds accurately.
Matthew Wiens, Alissa Verone‐Boyle, Nick Henscheid + 2 more
Clinical and Translational Science
ABSTRACT Approaches to artificial intelligence and machine learning (AI/ML) continue to advance in the field of drug development. A sound understanding of the underlying concepts and guiding principles of AI/ML implementation is a prerequisite to identifying which AI/ML approach is most appropriate based on the context. This tutorial focuses on the concepts and implementation of the popular eXtreme gradient boosting (XGBoost) algorithm for classification and regression of simple clinical trial‐like datasets. Emphasis is placed on relating the underlying concepts to the code implementation. In ...
Agus Fahmi Limas Ptr, Muhammad Mizan Siregar, Irwan Daniel
Journal of Computer Networks, Architecture and High Performance Computing
This research conducts a comprehensive analysis of mobile phone specification classification using three prominent machine learning algorithms: Gradient Boosting, XGBoost, and CatBoost, finding CatBoost consistently achieves the highest AUC values and accuracy scores.
S. Islam, A. Sholahuddin, A. S. Abdullah
Journal of Physics: Conference Series
This research will focus on making forecasting applications and analyzing the exchange rate of USD against rupiah based on time series data or temporal datasets from the Investing.com site using machine learning methods, namely Extreme Gradient Boosting (XGBoost).
M. Noorunnahar, A. Chowdhury, F. A. Mila
PLOS ONE
It is found that the XGBoost model performs better than the ARIMA model in predicting the annual rice production in Bangladesh, and based on the better performance, the study forecasted the annual Rice Production in Bangladesh for the next 10 years using the XTBOost model.
William K. Sanders, Dongfeng Li, Wenzhao Li + 1 more
Water
A data-driven FAS framework as a prototype that stakeholders can utilize solely based on their gauging information for local flood warning and mitigation practices is illustrated.
Anwar Husain, Ahmed Salem, Carol Jim + 1 more
2019 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT)
Extreme gradient boosting (XGBoost) that provides highly efficient and accurate data predictive model were used and may be used for any future network intrusion data where these 23 features are available to easily and efficiently predict network attack types.
Elly Warni, Dea Wahsa Saputri, A. A. Prayogi Alimuddin + 3 more
2024 8th International Conference on Information Technology, Information Systems and Electrical Engineering (ICITISEE)
XGBoost is known for its superiority in handling large datasets and providing regularization that reduces overfitting, AdaBoost improves model accuracy by adjusting weights on misclassified instances in previous iterations, and Gradient Boosting builds models sequentially to minimize the loss function.
Sheng Li, Yi Jiang, Shuisong Ke + 2 more
Land
The characteristics of housing and location conditions are the main drivers of spatial differences in housing prices, which is a topic attracting high interest in both real estate and geography research. One of the most popular models, the hedonic price model (HPM), has limitations in identifying nonlinear relationships and distinguishing the importance of influential factors. Therefore, extreme gradient boosting (XGBoost), a popular machine learning technology, and the HPM were combined to analyse the comprehensive effects of influential factors on housing prices. XGBoost was employed to iden...
Pankaj Tyagi, Anju Sharma, R. Semwal + 2 more
Journal of biomolecular structure & dynamics
The results indicate that the XGBoost-PCA model performed better than the other models for predicting common odor descriptors, and may be helpful in understanding the structure-odor relationship.
Gerard Shu Fuhnwi, Matthew Revelle, Clemente Izurieta
2024 IEEE 3rd International Conference on AI in Cybersecurity (ICAIC)
XGBoost with RFE outperforms other popular machine learning algorithms for NIDS on this dataset, achieving the highest MCC for detecting NSL-KDD dataset attacks of type DoS, Probe, U2R, and R2L and very high classification time.
Shibo Wen, Yongzhi Wang, Qizhou Gong + 6 more
Remote. Sens.
The findings suggest that the BO-XGBoost model shows notable classification performance in evaluating the suitability of lunar landing sites, which may provide valuable support for future landing missions and contribute to optimizing lunar exploration efforts.
Abdul Hakam, W. Utama, S. A. Garini + 3 more
BIO Web of Conferences
Sonic log is an important aspect that provides a detailed description of the subsurface properties associated with oil and gas reservoirs. The problem that frequently occurs is the unavailability of sonic log data for various reasons needs to be given an effective solution. The alternative approach proposed in this research is sonic log prediction based on Extreme Gradient Boosting (XGBoost) machine learning algorithm, using available log data to build a reliable sonic log prediction model. In this research, the predicted DT log type is the Differential Time Shear Slowness (DTSM) log, which is...
Rohmatul Fajriyah, Havidzah Asri Isnandar, Adhar Arifuddin
MEDIA STATISTIKA
This research has identified three genes that contribute the most to classifying AMI patients, namely calponin 2, ribosomal protein S11 and myotropin, namely calponin 2, ribosomal protein S11 and myotropin.
Jaouhar Fattahi, M. Mejri, Marwa Ziadia
journal unavailable
The eXtreme Gradient Boosting (XGBoost) algorithm is used for learning and detection, in tandem with Bag-of-Words (BoW) and Term Frequency-Inverse Document Frequency (TF-IDF) for text vectorization.
Benjamin Manning
journal unavailable
The main goals of this study were to ascertain how well Gradient Boosting could be used for prediction or, in this case, classification or identification of a specific user through the learning of HCI-based behavioral biometrics.
Cory Seidel, Ethan S. Genter, D. Peters
Proceedings of the Vertical Flight Society 76th Annual Forum
This paper explores the capabilities of regression modeling with gradient boosted trees in XGBoostTM as a potential solution to counter limitations of real-time analysis in rotorcraft.
Yang Liu, Zhuo Ma, Ximeng Liu + 4 more
2020 IEEE 40th International Conference on Distributed Computing Systems (ICDCS)
FEDXGB is proposed, a federated extreme gradient boosting (XGBoost) scheme supporting forced aggregation that combines the advantages of secret sharing and homomorphic encryption, and can solve the second challenge mentioned above, and is robust to the user dropout.