. Predicting passengers who survived the Titanic disaster using a decision tree.

Keywords: machine learning, data analysis, decision trees, forecasting, Kaggle platform.

Abstract

The article proposes the use of the decision tree method for predicting the passengers who survived the Titanic liner disaster. The Titanic - Machine Learning from Disaster dataset, which is publicly available on the Kaggle platform, is used as input. Decision trees are well suited for solving classification and forecasting problems, and their ease of interpretation makes this method the best choice among other machine learning algorithms. On training data set, modification has been performed to fill the missing values. The distribution of qualitative and quantitative data features and the search for patterns in the data were evaluated using visual data analysis, which allowed us to identify the passenger features that correlate with their survival the most and improve the data set accordingly. The decision tree for the final dataset was built using the scikit-learn library (sklearn), which provides powerful tools for machine learning Python. The accuracy of the built decision tree is 77% of the deferred sample. Further study of the application of the decision tree for this dataset can be done by using the hyperparameter tuning method, which will help to improve the accuracy of the constructed decision tree.

References

1. Лук’янюк В. (2017). Цей день в історії: Загибель «Титаніка».
2. Ai Yu. (2023). Predicting Titanic Survivors by Using Machine Learning, Highlights in Science, Engineering and Technology, 34, 360-367
3. Haque MA, Shivaprasad G. & Guruprasad G. (2021). Passenger data analysis of Titanic using machine learning approach in the context of chances of surviving the disaster, IOP Conference Series: Materials Science and Engineering, 1065(1)
4. Singh A., Saraswat S. & Faujdar N. (2017). Analyzing Titanic disaster using machine learning algorithms, International Conference on Computing, Communication and Automation (ICCCA), Greater Noida, India, 406-411
5. Singh K., Nagpal R. & Sehgal R. (2020). Exploratory Data Analysis and Machine Learning on Titanic Disaster Dataset, 10th International Conference on Cloud Computing, Data Science & Engineering (Confluence), Noida, India, 320-326

Abstract views: 55
PDF Downloads: 43
Published
2024-06-16
How to Cite
Morokhovych , V., Liakh , I., Khomyak , M., & Morokhovych , B. (2024). . Predicting passengers who survived the Titanic disaster using a decision tree. COMPUTER-INTEGRATED TECHNOLOGIES: EDUCATION, SCIENCE, PRODUCTION, (55), 161-166. https://doi.org/10.36910/6775-2524-0560-2024-55-20
Section
Computer science and computer engineering