If you are eager to deepen your knowledge of XGBoost, this list of top research papers should be on your reading list. Gain valuable insights into this powerful machine learning algorithm that has revolutionized data science. Whether you're a beginner or an expert, these papers will provide essential understanding and advanced techniques, helping you to effectively leverage XGBoost in your projects.
Looking for research-backed answers?Try AI Search
Kartina Diah Kesuma Wardhani, Memen Akbar
Jurnal Online Informatika
The machine learning technique used to predict diabetes in this study is extreme gradient boosting (XGBoost), an advanced implementation of gradient boosting along with multiple regularization factors to accurately predict target variables by combining simpler and weaker model set estimations.
Muamar Mohamed, Farhad E. Mahmood, Mehmmood A. Abd + 3 more
2023 IEEE Industry Applications Society Annual Meeting (IAS)
This study focuses on the development of an electrical demand forecasting model using machine learning techniques, specifically Long Short-Term Memory (LSTM) and eXtreme Gradient Boosting (XGBoost), and uses data collected from wireless sensors installed in the city.
Dylan Norbert Gono, H. Napitupulu, Firdaniza
Mathematics
This article presents a study on forecasting silver prices using the extreme gradient boosting (XGBoost) machine learning method with hyperparameter tuning, and finds that the proposed models exhibited the best performance.
Priyanka Sharma, Mayank Jain
2023 International Conference on Computing, Communication, and Intelligent Systems (ICCCIS)
Results show that the XGBOOST model is efficient in generating future stock prices and performs with a mean squared error of 0.004, mean absolute error of 0.014 and R2 score of 0.995.
Chibuzor N Obiora, Ahmed Ali, Ali N. Hasan
2021 IEEE PES/IAS PowerAfrica
Globally, the consistent clamor by environmentalists for the need to mitigate the effects of climate change has necessitated the adoption of renewable energy sources (RES) for use by many developed/developing nations. The approach is geared towards gradually replacing or reducing the use of fossil fuels for electric power production. Solar power is among the major renewable energy sources in use today. But the use of solar energy is, unfortunately, characterized by fluctuations in its power generation due to the unpredictability of solar irradiance. Despite many methods in use already, accurat...
Albi Mulyadi Sapari, Asep Id Hadiana, F. R. Umbara
Innovation in Research of Informatics (INNOVATICS)
This study aims to classify air quality into three labels or categories: good, moderate, and unhealthy, using XGBoost, a development of the Gradient Decision Tree with several advantages, such as high scalability and prevention of overfitting.
Iqbal Hanif
Proceedings of the Proceedings of the 1st International Conference on Statistics and Analytics, ICSA 2019, 2-3 August 2019, Bogor, Indonesia
XGBoost algorithm gives a better prediction than LogReg algorithm does based on its prediction accuracy, specificity, sensitivity and ROC curve and has a better capability to separate churned customers from not-churned customers than LogReg model does according to KS chart and Gains-Lift charts produced by each algorithm.
Shubo Wu, Q. Yuan, Zhongwei Yan + 1 more
Journal of Advanced Transportation
Vehicle to vulnerable road user (VRU) crashes occupy a large proportion of traffic crashes in China, and crash injury severity analysis can support traffic managers to understand the implicit rules behind the crashes. Therefore, 554 VRUs-involved crashes are collected from January, 2017, to February, 2021, in a city in northern China, including 322 vehicle-pedestrian crashes and 232 vehicle-bicycle crashes. First, a descriptive statistical analysis is conducted to investigate the characteristics of VRUs-involved crashes. Second, the extreme gradient boosting (XGBoost) model is introduced to id...
Weiwen He, Hongyuan He, Fanglin Wang + 4 more
Analytical Letters
Abstract Ripe fruit provides essential nutrients for the human body. To fulfill the needs of consumers, the practice of artificial ripening has become more common. Artificial ripening not only degrades the quality of fruit but also impairs health. The potential of hyperspectral imaging coupled with machine learning to quickly and uninvasively identify differently ripened bananas was explored in this study. A total of 300 banana samples that were naturally ripened or ripened by ethephon or calcium carbide were characterized by their hyperspectral images. To improve the accuracy of classificatio...
R. Vijay, S. Manoj, V. Ravikanth + 2 more
journal unavailable
This specific task focuses on anomaly-based society intrusion detection by employing XGBoost algorithm on KDD CUP 1999 info positioned to get the ideal outcomes.
Kartina Diah Kusuma, Memen Akbar
Jurnal RESTI (Rekayasa Sistem dan Teknologi Informasi)
Extreme gradient boosting is the machine learning technique to predict diabetes (xgboost) using Feature Importance XGBoost using UCI Machine Learning to produce a model that can be used by medical staff to predict and identify diabetes in patients.
Diah Asmawati, Lukman Arif Sanjani, Christiant Dimas Renggana + 2 more
Journal of Technology and Informatics (JoTI)
Heart disease is one of the most dangerous illnesses because it has the potential to take people's lives. One of the causes of heart disease is arrhythmia, an abnormal condition of the heartbeat. To diagnose arrhythmia, analysis of electrocardiographic (ECG) signals can be performed. However, this analysis is very difficult to do conventionally and has the potential for errors, so there is a need for automatic ECG classification to detect arrhythmia. This study aims to fill the research gap by creating an ECG classification model to detect arrhythmia using the XGBoost algorithm. The results ar...
J. P. Haumahu, S. Permana, Y. Yaddarabullah
IOP Conference Series: Materials Science and Engineering
The result of this study shows that the machine learning model created using XGBoost has an accuracy value of 89%, with the precision value of 90% and recall value 80%.
Muhammad Talha Ashraf, Isma Hamid, Qamar Nawaz + 1 more
2023 International Multi-disciplinary Conference in Emerging Research Trends (IMCERT)
An Evolutionary Algorithm named as Group Counseling Optimizer (GCO) in tandem with Extreme gradient boost (XGBoost) to classify cancer in microarray data is employed.
Sri Elina Herni, Yulianti, O. Soesanto + 1 more
Journal of Mathematics Theory and Application
It is proved that the use of algorithms with hyperparameter tuning can improve the performance of eXtreme Gradient Boosting algorithm in the process of classification of credit card customers with an accuracy of 80.039%, precision of 81.338% and a recall value of 96.854%.
Dennis A-L Mariadass, E. Moung, Mai Sufian + 1 more
2022 12th International Conference on Computer and Knowledge Engineering (ICCKE)
This paper proposes to use the XGBoost model for annual crop yield prediction in Malaysia and shows promising results with 0.98 R-Squared value and outperformed the current models.
Peng Liu, Wei Fan
Promet - Traffic&Transportation
The results show that the XGBoost model outperforms the IDM in terms of prediction errors and the analysis of the feature importance reveals that the longitudinal position has the greatest influence on vehicle trajectory prediction results.
Opitasari Opitasari, Fauzan Natsir, Ega Shela Marsiani
Jurnal Ilmiah FIFO
Kanker serviks yang juga biasa dikenal dengan kanker mulut leher rahim merupakan satu dari beberapa jenis penyakit kanker yang mematikan pada wanita setelah kanker payudara. Menurut survei WHO, dari total kasus kanker di Indonesia, 9,2% kasus di antaranya adalah kanker serviks dengan jumlah 36.633 kasus. Sulitnya menentukan gejala awal pada kanker serviks dikarenakan gejala yang timbul tidak kasat mata sehingga banyak sekali kasus terlambat penanganan pada pasien penderita penyakit ini. Penelitian dilakukan dengan metode XGBoost untuk mengklasifikasi gejala awal penyakit kanker serviks dengan ...
Noman Shabbir, O. Husev, Kamran Daniel + 3 more
2024 IEEE 18th International Conference on Compatibility, Power Electronics and Power Engineering (CPE-POWERENG)
This research introduces a new approach for predicting short-term DC power loads using the XgBoost algorithm that utilizes weak learners to build robust prediction models using weaker learning combinations and shows the ability to capture complex non-linear patterns based on real-life residential DC load data.
Alber S. Aziz, Haitham Rizk Fadlallah
International Journal of Advances in Applied Computational Intelligence
To accurately estimate HCV using sparse weather information, this work offers two machine learning methods: The Support Vector Machine (SVM) and a simple tree-based ensemble approach called Extreme Gradient Boosting (XGBoost) that are applied to real-world data on HCV.
Iyad Lahsen Cherif, A. Kortebi
2019 Wireless Days (WD)
This work considers a supervised approach, namely eXtreme Gradient Boosting (XGBoost) algorithm, which has never been investigated for TC, and obtains 99.5% accuracy on a dataset containing real flows.
Cengiz Zopluoglu
Educational and Psychological Measurement
The utility of XGBoost in detecting examinees with potential item preknowledge is investigated using a real data set that includes examinees who engaged in fraudulent testing behavior, such as illegally obtaining live test content before the exam.
Risma Moulidya Syafei, Devi Ajeng Efrilianda
Recursive Journal of Informatics
It can be concluded that the implementation of XGBoost Feature Importance can be used to improve LightGBM's performance in bankruptcy prediction, and this study aims to implement a machine learning model using the Light Gradientboosting Machine (LightGBM) classification algorithm which is optimized using Extreme Gradient Boosting (XGBeost) Feature Importation to increase the accuracy of bankruptcy prediction.
Bin Yu, Wenying Qiu, Cheng Chen + 4 more
Bioinformatics
SubMito-XGBoost has obtained satisfactory prediction results by the leave-one-out-cross-validation (LOOCV) compared with existing methods and achieves satisfactory predictive performance on plant and non-plant protein submitochondrial datasets.
M. G. Raja, S. Jeyalaksshmi
J. Interconnect. Networks
In this paper, Self-Configuration and Self-healing Framework using an extreme gradient boosting (XGBoost) Classifier are proposed and the results exhibit that the proposed framework has higher packet delivery ratio with reduced packet drops and computational cost.
R. Kumar, G. S
Advances in Science, Technology and Engineering Systems Journal
This work proposes a malware classification scheme that constructs a model using low-end computing resources and a very large balanced dataset for malware and gives improved performance for accuracy with the tuning of the hyperparameter and achieve higher accuracy.
C. A. E. Piter, Setiwan Hadi, I. Yulita
2021 International Conference on Artificial Intelligence and Big Data Analytics
This study aims to classify the information data on scientific conferences in Indonesia with a total of 1005 data into three labels, namely Economics, Science and Engineering, and Social Studies using the Extreme Gradient Boost (XGBoost) method.
Agus Fahmi Limas Ptr, Muhammad Mizan Siregar, Irwan Daniel
Journal of Computer Networks, Architecture and High Performance Computing
This research conducts a comprehensive analysis of mobile phone specification classification using three prominent machine learning algorithms: Gradient Boosting, XGBoost, and CatBoost, finding CatBoost consistently achieves the highest AUC values and accuracy scores.
I. Wayan, Gede Suacana, Didik Suhariyanto + 1 more
International Journal Software Engineering and Computer Science (IJSECS)
The research results show that the XGBoost algorithm achieves high accuracy, precision, recall, and F1 score, demonstrating its ability to classify sounds accurately.
Da-ping Yu, Zhidong Liu, C. Su + 6 more
Thoracic Cancer
The main cause of cancer death is lung cancer (LC) which usually presents at an advanced stage, but its early detection would increase the benefits of treatment. Blood is particularly favored in clinical research given the possibility of using it for relatively noninvasive analyses. Copy number variation (CNV) is a common genetic change in tumor genomes, and many studies have indicated that CNV‐derived cell‐free DNA (cfDNA) from plasma could be feasible as a biomarker for cancer diagnosis.
Vabiyana Safira Desdhanty, Z. Rustam
2021 International Conference on Decision Aid Sciences and Application (DASA)
This paper will be focusing on the implementation of Genetic Algorithm as feature selection when applied to the widely used machine learning algorithm Random Forest and Extreme Gradient Boosting for cancer classification and the result will show that with 20% testing data, XGBoost with Genetic Al algorithm asfeature selection gives the highest accuracy score.
M. Noorunnahar, A. Chowdhury, F. A. Mila
PLOS ONE
It is found that the XGBoost model performs better than the ARIMA model in predicting the annual rice production in Bangladesh, and based on the better performance, the study forecasted the annual Rice Production in Bangladesh for the next 10 years using the XTBOost model.
S. Islam, A. Sholahuddin, A. S. Abdullah
Journal of Physics: Conference Series
This research will focus on making forecasting applications and analyzing the exchange rate of USD against rupiah based on time series data or temporal datasets from the Investing.com site using machine learning methods, namely Extreme Gradient Boosting (XGBoost).
William K. Sanders, Dongfeng Li, Wenzhao Li + 1 more
Water
A data-driven FAS framework as a prototype that stakeholders can utilize solely based on their gauging information for local flood warning and mitigation practices is illustrated.
Anwar Husain, Ahmed Salem, Carol Jim + 1 more
2019 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT)
Extreme gradient boosting (XGBoost) that provides highly efficient and accurate data predictive model were used and may be used for any future network intrusion data where these 23 features are available to easily and efficiently predict network attack types.
Elly Warni, Dea Wahsa Saputri, A. A. Prayogi Alimuddin + 3 more
2024 8th International Conference on Information Technology, Information Systems and Electrical Engineering (ICITISEE)
Measuring the performance of business partners is a crucial technique for overseeing and sustaining an organization or company's competitive edge. This process involves contrasting anticipated results with actual results, examining deviations from the plan, assessing individual performance, and evaluating the progress toward meeting set goals. Additionally, partnership is described as a collaboration between two or more parties based on a mutual agreement to benefit each other and increase revenue. In this study, each algorithm was tested to evaluate its ability to classify business partners b...
Sheng Li, Yi Jiang, Shuisong Ke + 2 more
Land
The characteristics of housing and location conditions are the main drivers of spatial differences in housing prices, which is a topic attracting high interest in both real estate and geography research. One of the most popular models, the hedonic price model (HPM), has limitations in identifying nonlinear relationships and distinguishing the importance of influential factors. Therefore, extreme gradient boosting (XGBoost), a popular machine learning technology, and the HPM were combined to analyse the comprehensive effects of influential factors on housing prices. XGBoost was employed to iden...
Pankaj Tyagi, Anju Sharma, R. Semwal + 2 more
Journal of biomolecular structure & dynamics
The results indicate that the XGBoost-PCA model performed better than the other models for predicting common odor descriptors, and may be helpful in understanding the structure-odor relationship.
Gerard Shu Fuhnwi, Matthew Revelle, Clemente Izurieta
2024 IEEE 3rd International Conference on AI in Cybersecurity (ICAIC)
XGBoost with RFE outperforms other popular machine learning algorithms for NIDS on this dataset, achieving the highest MCC for detecting NSL-KDD dataset attacks of type DoS, Probe, U2R, and R2L and very high classification time.
Shibo Wen, Yongzhi Wang, Qizhou Gong + 6 more
Remote. Sens.
The findings suggest that the BO-XGBoost model shows notable classification performance in evaluating the suitability of lunar landing sites, which may provide valuable support for future landing missions and contribute to optimizing lunar exploration efforts.
Abdul Hakam, W. Utama, S. A. Garini + 3 more
BIO Web of Conferences
Sonic log is an important aspect that provides a detailed description of the subsurface properties associated with oil and gas reservoirs. The problem that frequently occurs is the unavailability of sonic log data for various reasons needs to be given an effective solution. The alternative approach proposed in this research is sonic log prediction based on Extreme Gradient Boosting (XGBoost) machine learning algorithm, using available log data to build a reliable sonic log prediction model. In this research, the predicted DT log type is the Differential Time Shear Slowness (DTSM) log, which is...
Jaouhar Fattahi, M. Mejri, Marwa Ziadia
journal unavailable
The eXtreme Gradient Boosting (XGBoost) algorithm is used for learning and detection, in tandem with Bag-of-Words (BoW) and Term Frequency-Inverse Document Frequency (TF-IDF) for text vectorization.
Benjamin Manning
journal unavailable
The main goals of this study were to ascertain how well Gradient Boosting could be used for prediction or, in this case, classification or identification of a specific user through the learning of HCI-based behavioral biometrics.
Cory Seidel, Ethan S. Genter, D. Peters
Proceedings of the Vertical Flight Society 76th Annual Forum
This paper explores the capabilities of regression modeling with gradient boosted trees in XGBoostTM as a potential solution to counter limitations of real-time analysis in rotorcraft.
Afikah Agustiningsih, Yulian Findawati, Irwan Alnarus Kautsar
Jurnal Teknik Informatika (Jutif)
This study designs a system that is capable of classifying SMK student graduates as an evaluation for the school institution to enable the school to devise strategies for producing better student quality in the following year.
Guoli Zhou, Zhongyun Ni, Yinbing Zhao + 1 more
Sensors (Basel, Switzerland)
Mapping the distribution of bamboo species is vital for the sustainable management of bamboo and for assessing its ecological and socioeconomic value. However, the spectral similarity between bamboo species makes this work extremely challenging through remote sensing technology. Existing related studies rarely integrate multiple feature variables and consider how to quantify the main factors affecting classification. Therefore, feature variables, such as spectra, topography, texture, and vegetation indices, were used to construct the XGBoost model to identify bamboo species using the Zhuhai-1 ...
Eun Hak Lee, Kyoungtae Kim, S. Kho + 2 more
Transportation Research Record
The model shows notable performance in accuracy and provided an understanding of the estimation results, which show that the total travel time of the local train feature is found to substantially affect the probability of express train preference with a 1.871 SHAP value.
Irfan Ullah Khan, N. Aslam, Rana AlShedayed + 4 more
Sensors (Basel, Switzerland)
A proactive interpretable prediction model using ML and explainable artificial intelligence (XAI) to detect different types of security attacks using the log data generated by heating, ventilation, and air conditioning attacks is proposed.
Soyoung Woo, Wonjin Kim, Chunggil Jung + 3 more
Water
Climate change not only affects the water resource system but also has a great impact on the aquatic ecosystem, which is complexly linked to various organic and inorganic matter. It is difficult to simulate the current aquatic ecosystem and predict the future system due to the immensity and complexity of aquatic ecosystems; however, a spatial analysis of future aquatic ecological health is necessary if we are to adapt and take action against future climate change. In this study, we evaluated the aquatic ecological health of the Han River basin under the future climate change RCP4.5 and RCP8.5 ...
Yang Liu, Zhuo Ma, Ximeng Liu + 4 more
2020 IEEE 40th International Conference on Distributed Computing Systems (ICDCS)
FEDXGB is proposed, a federated extreme gradient boosting (XGBoost) scheme supporting forced aggregation that combines the advantages of secret sharing and homomorphic encryption, and can solve the second challenge mentioned above, and is robust to the user dropout.