A Fundamental Framework for Analysis of Rainfall Prediction Features Significance
Main Article Content
Abstract
Rainfall prediction efforts had been prevalent ever since the impact of climate change on occurrences of natural disasters globally. Implementation of machine and deep learning techniques on features that contribute to rainfall occurrences were conducted with aims of seeking greater prediction accuracy for rainfall occurrences with a lack of study for significance of features in rainfall occurrence prediction. This study presents a framework of rainfall prediction features' significance analysis in the case study of Peninsular Malaysia rainfall occurrences. Features investigated in this study consist of temperature, humidity and wind speed. The designed framework for the investigation includes phases of data collection, data preprocessing, integration of random forest (RF) for ensemble classification and feature importance (FI) for feature significance calculation and finally model evaluation based on the metrics of precision, recall, F1 score and receiver operating characteristic (ROC) curve. In the preliminary investigation, the prediction model demonstrated accuracy, precision, recall and F1-score of 80.65%, 80%, 81% and 0.80 respectively. Humidity was found to have highest significance to the model's predictive power as compared to temperature and wind speed. Rainfall occurrence correlation with lower temperature and higher humidity and vice versa was identified with further investigation of feature data distribution against rainfall occurrences.
Article Details

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
All articles published in JIWE are licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0) License. Readers are allowed to
- Share — copy and redistribute the material in any medium or format under the following conditions:
- Attribution — You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use;
- NonCommercial — You may not use the material for commercial purposes;
- NoDerivatives — If you remix, transform, or build upon the material, you may not distribute the modified material.
References
S. Shahid, and A. Minhans, “Climate change and road safety: A review to assess impacts in Malaysia”, J. Teknol., vol. 78, no. 4, Mar. 2016, doi: 10.11113/jt.v78.7991.
M. Raval, P. Sivashanmugam, V. Pham, H. Gohel, A. Kaushik, and Y. Wan, “Automated predictive analytics tool for rainfall forecasting,” Sci Rep, vol. 11, no. 1, Dec. 2021, doi: 10.1038/s41598-021-95735-8.
W. M. Ridwan, M. Sapitang, A. Aziz, K. F. Kushiar, A. N. Ahmed, and A. El-Shafie, “Rainfall forecasting model using machine learning methods: Case study Terengganu, Malaysia,” Ain Shams Engineering Journal, vol. 12, no. 2, pp. 1651–1663, Jun. 2021, doi: 10.1016/j.asej.2020.09.011.
N. Chenmin, M. F. Marsani, and F. Pei Shan, “Flood prediction based on feature selection and a hybrid deep learning network,” Journal of Water and Climate Change, Mar. 2024, doi: 10.2166/wcc.2024.559.
Suvashisa Dash and Answeta Jaiswal, “Machine learning based forecasting model for rainfall prediction,” World Journal of Advanced Research and Reviews, vol. 21, no. 1, pp. 1678–1686, Jan. 2024, doi: 10.30574/wjarr.2024.21.1.0180.
V.S, G. S, D. M, and K. S, “Rainfall Based Flood Prediction in Kerala Using Machine Learning”, Int J Intell Syst Appl Eng, vol. 12, no. 16s, pp. 141–144, Feb. 2024.
B. Setya, R. A. Nurhidayatullah, M. B. Hewen, and K. Kusrini, “Comparative Analysis Of Rainfall Value Prediction In Semarang Using Linear And K-Nearest Neighbor Algorithms,” in 2023 5th International Conference on Cybernetics and Intelligent Systems, ICORIS 2023, Institute of Electrical and Electronics Engineers Inc., 2023. doi: 10.1109/ICORIS60118.2023.10352274.
P. A. Nandini, B. Meenavalli, A. Puttamreddy, J. Meghana, N. Kataria, and L. Gupta, “Prediction of Rainfall using Random Forest,” in 2022 IEEE International Students’ Conference on Electrical, Electronics and Computer Science, SCEECS 2022, Institute of Electrical and Electronics Engineers Inc., 2022. doi: 10.1109/SCEECS54111.2022.9741063.
C. Schutte, M. van der Laan, and B. van der Merwe, “Leveraging historic streamflow and weather data with deep learning for enhanced streamflow predictions,” Journal of Hydroinformatics, Feb. 2024, doi: 10.2166/hydro.2024.268.
H. Ishibashi, “Framework for risk assessment of economic loss from structures damaged by rainfall-induced landslides using machine learning,” Georisk, vol. 18, no. 1, pp. 228–243, 2024, doi: 10.1080/17499518.2023.2288606.
R. Sato and Y. Fujimoto, “Rainfall Forecasting with LSTM by Combining Cloud Image Feature Extraction with CNN and Weather Information,” IEEJ Journal of Industry Applications, vol. 13, no. 1, pp. 24–33, 2024, doi: 10.1541/ieejjia.23002926.
N. A. Majid, “Historical landslide events in Malaysia 1993-2019,” Indian J Sci Technol, vol. 13, no. 33, pp. 3387–3399, Sep. 2020, doi: 10.17485/IJST/v13i33.884.
G. Singh, M. A. B. H. Abdul Rahman, and M. S. bin Zulkipli, “An Emergency and Mass Casualty Incident Response in the Jalan Batang Kali-Jalan Genting Highlands Malaysia Landslide 2022: A Case Report and Strategies to Improve,” International Journal of Management and Human Sciences, vol. 07, no. 01, pp. 33–40, 2023, doi: 10.31674/ijmhs.2023.v07i01.005.
Department of Statistics Malaysia, “Jadual statistik.” [Online]. Available: https://www.dosm.gov.my/uploads/publications/
I. El Naqa and M. J. Murphy, "What is machine learning?," in Machine Learning in Radiation Oncology, Berlin, Germany: Springer, 2015, pp. 3–11, doi: 10.1007/978-3-319-18305-3_1.
Y. Asamoah and K. Ansah-Mensah, “Temporal Description of Annual Temperature and Rainfall in the Bawku Area of Ghana,” Advances in Meteorology, vol. 2020, 3402178, pp. 1–18, 2020, doi: 10.1155/2020/3402178.
B. Praveen et al., “Analyzing trend and forecasting of rainfall changes in India using non-parametrical and machine learning approaches,” Sci Rep, vol. 10, no. 1, Dec. 2020, doi: 10.1038/s41598-020-67228-7.
A. Sharma and M. K. Goyal, “Bayesian Network Model for Monthly Rainfall Forecast.” [Online]. Available: https://www.ncdc.noaa.gov/teleconnections/enso/indicators/soi
R. He, L. Zhang, and A. W. Z. Chew, “Data-driven multi-step prediction and analysis of monthly rainfall using explainable deep learning,” Expert Syst Appl, vol. 235, Jan. 2024, doi: 10.1016/j.eswa.2023.121160.
Proc. IEEE Congress Evolutionary Computation (CEC), Vancouver, BC, Canada, 24–29 July 2016. IEEE Press, 2021. doi: 10.5555/978-1-5090-0623-6.
P. Das, D. A. Sachindra, and K. Chanda, “Machine Learning-Based Rainfall Forecasting with Multiple Non-Linear Feature Selection Algorithms,” Water Resources Management, vol. 36, no. 15, pp. 6043–6071, Dec. 2022, doi: 10.1007/s11269-022-03341-8.
S. Benziane, “Survey: Rainfall Prediction Precipitation, Review of Statistical Methods,” WSEAS Transactions on Systems, vol. 23, pp. 47–59, Jan. 2024, doi: 10.37394/23202.2024.23.5.
A. Gutierrez-Lopez, I. Cruz-Paz, and M. M. Mandujano, “Algorithm to predict the rainfall starting point as a function of atmospheric pressure, humidity, and dewpoint,” Climate, vol. 7, no. 11, 2019, doi: 10.3390/cli7110131.
J. Yan, T. Xu, Y. Yu, and H. Xu, “Rainfall forecast model based on the tabnet model,” Water (Switzerland), vol. 13, no. 9, May 2021, doi: 10.3390/w13091272.
M. Sudha and V. Balasubramanian, “Identifying effective features and classifiers for short term rainfall forecast using rough sets maximum frequency weighted feature reduction technique,” Journal of Computing and Information Technology, vol. 24, no. 2, pp. 181–194, 2016, doi: 10.20532/cit.2016.1002715.
M. Sudha and B. Valarmathi, "Rainfall forecast analysis using rough set attribute reduction and data mining methods," AGRIS on-line Papers in Economics and Informatics, vol. 6, no. 4, pp. 1-10, Dec. 2014. [Online]. Available: http://ageconsearch.umn.edu
S. M. Wajid, T. Javed, and M. M. Su’ud, “Ensemble Learning-Powered URL Phishing Detection: A Performance Driven Approach,” Journal of Informatics and Web Engineering, vol. 3, no. 2, pp. 134–145, Jun. 2024, doi: 10.33093/jiwe.2024.3.2.10.
T. Ahmed Khan, R. Sadiq, Z. Shahid, M. M. Alam, and M. M. Su’ud, “Sentiment Analysis using Support Vector Machine and Random Forest,” Journal of Informatics and Web Engineering, vol. 3, no. 1, pp. 67–75, Feb. 2024, doi: 10.33093/jiwe.2024.3.1.5.
G. Biau and E. Scornet, “A random forest guided tour,” Test, vol. 25, pp. 197–227, 2016.
F. Ozen, “Random forest regression for prediction of Covid-19 daily cases and deaths in Turkey,” Heliyon, vol. 10, no. 4, Feb. 2024, doi: 10.1016/j.heliyon.2024.e25746.
Y. Q. Lim and Y. L. Loo, “Characteristics of multiclass suicide risks tweets through feature extraction and machine learning techniques,” International Journal on Informatics Visualization, vol. 7, no. 4, pp. 2297-2305, 2023, doi: https://dx.doi.org/10.62527/joiv.7.4.2284.
X. Wang, W. Nie, W. Xie, and Y. Zhang, “Incremental learning-random forest model-based landslide susceptibility analysis: A case of Ganzhou City, China,” Earth Sci Inform, vol. 17, no. 2, pp. 1645–1661, Apr. 2024, doi: 10.1007/s12145-024-01229-2.
M. A. Saleh and H. M. Rasel, “Performance evaluation of Machine Learning based regression models for rainfall forecasting”, Research Square, 2024, doi: 10.21203/rs.3.rs-3856741/v1.
M. A. Habib and M. J. Hossain, “Revolutionizing Wind Power Prediction—The Future of Energy Forecasting with Advanced Deep Learning and Strategic Feature Engineering,” Energies (Basel), vol. 17, no. 5, Mar. 2024, doi: 10.3390/en17051215.
M. S. A. Magboo and V. P. C. Magboo, “Feature Importance Measures as Explanation for Classification Applied to Hospital Readmission Prediction,” in Procedia Computer Science, Elsevier B.V., 2022, pp. 1388–1397. doi: 10.1016/j.procs.2022.09.195.