Acta Informatica Pragensia 2021, 10(2), 138-154 | DOI: 10.18267/j.aip.1523964
A Neural Network-Based Approach in Predicting Consumers’ Intentions of Purchasing Insurance Policies
- School of Mathematical Sciences, Sunway University, No. 5, Jalan Universiti, Bandar Sunway, 47500 Selangor Darul Ehsan, Malaysia
Insurance is a crucial mechanism used to lighten the financial burden as it provides protection against financial losses resulting from unexpected events. Insurers adopt various approaches, such as machine learning, to attract the uninsured. By using machine learning, a company is able to tap into the wealth of information of its potential customers. The main objective of this study is to apply artificial neural networks (ANNs) to predict the propensity of consumers to purchase an insurance policy by using the dataset from the Computational Intelligence and Learning (CoIL) Challenge 2000. In addition, this study also aims to identify factors that affect the propensity of customers to purchase insurance policies via feature selection. The dataset is pre-processed with feature construction and three feature selection methods, which are the neighbourhood component analysis (NCA), sequential forward selection (SFS) and sequential backward selection (SBS). Sampling techniques are carried out to address the issue of imbalanced class distributions. The results obtained are found to be comparable with the top few entries of the CoIL Challenge 2000, which shows the efficiency of the proposed model in predicting consumers’ intention of purchasing insurance policies.
Keywords: Neural network; Feature selection; Classification; Prediction; Consumer targeting.
Received: April 15, 2021; Revised: June 28, 2021; Accepted: June 28, 2021; Prepublished online: June 28, 2021; Published: September 10, 2021 Show citation
References
- Arasu, B. S., Seelan, B. J. B., & Thamaraiselvan, N. (2020). A machine learning-based approach to enhancing social media marketing. Computers & Electrical Engineering, 86, 106723. https://doi.org/10.1016/j.compeleceng.2020.106723
Go to original source...
- Cateni, S., Colla, V., & Vannucci, M. (2014). A method for resampling imbalanced datasets in binary classification tasks for real-world problems. Neurocomputing, 135, 32-41. https://doi.org/10.1016/j.neucom.2013.05.059
Go to original source...
- Chen, Y., Lee, J. Y., Sridhar, S., Mittal, V., McCallister, K., & Singal, A. G. (2020). Improving cancer outreach effectiveness through targeting and economic assessments: Insights from a randomized field experiment. Journal of Marketing, 84(3), 1-27. https://doi.org/10.1177/0022242920913025
Go to original source...
- Darzi, M. R. K., Niaki, S. T. A., & Khedmati, M. (2019). Binary classification of imbalanced datasets: The case of CoIL challenge 2000. Expert Systems with Applications, 128, 169-186. https://doi.org/10.1016/j.eswa.2019.03.024
Go to original source...
- Delley, M., & Brunner, T. A. (2020). A segmentation of Swiss fluid milk consumers and suggestions for target product concepts. Journal of Dairy Science, 103(4), 3095-3106. https://doi.org/10.3168/jds.2019-17325
Go to original source...
- Elkan, C. (2000). CoIL Challenge 2000 entry. http://liacs.leidenuniv.nl/~puttenpwhvander/library/cc2000/ELKANP~1.pdf
- Elkan, C. (2001). Magical thinking in data mining: lessons from CoIL challenge 2000. In Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining (pp. 426-431). ACM. https://doi.org/10.1145/502512.502576
Go to original source...
- Fahad, S., Wang, J., Hu, G., Wang, H., Yang, X., Shah, A. A., Nguyen, T. L. H. & Bilal, A. (2018). Empirical analysis of factors influencing farmers crop insurance decisions in Pakistan: Evidence from Khyber Pakhtunkhwa province. Land Use Policy, 75, 459-467. https://doi.org/10.1016/j.landusepol.2018.04.016
Go to original source...
- Haykin, S. (2010). Neural networks and learning machines. Pearson Education India.
- Joost, M., & Schiffmann, W. (1998). Speeding up backpropagation algorithms by using cross-entropy combined with pattern normalization. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems, 6(2), 117-126. https://doi.org/10.1142/S0218488598000100
Go to original source...
- Jørgensen, T. M., & Linneberg, C. (2000). Subspace projections-an approach to variable selection and modeling. In CoIL challenge 2000: The insurance company case. Leiden Institute of Advanced Computer Science.
- Kim, Y., Street, W. N., Russell, G. J., & Menczer, F. (2005). Customer targeting: A neural network approach guided by genetic algorithms. Management Science, 51(2), 264-276. https://doi.org/10.1287/mnsc.1040.0296
Go to original source...
- Ładyżyński, P., Żbikowski, K., & Gawrysiak, P. (2019). Direct marketing campaigns in retail banking with the use of deep learning and random forests. Expert Systems with Applications, 134, 28-35. https://doi.org/10.1016/j.eswa.2019.05.020
Go to original source...
- Leong, L. Y., Hew, T. S., Ooi, K. B., & Tan, G. W. H. (2019). Predicting actual spending in online group buying-An artificial neural network approach. Electronic Commerce Research and Applications, 38, 100898. https://doi.org/10.1016/j.elerap.2019.100898
Go to original source...
- LIAM. (2019). LIAM re-elects Anusha Thavarajah and Rangam Bir as president and vice-president for the term 2019/2020. https://www.liam.org.my/index.php/newsmedia-room/media-releasepress-statements/english/1067-liam-re-elects-anusha-thavarajah-and-rangam-bir-as-president-and-vice-president-for-the-term-20192020
- Lin, H. C. K., Wang, T. H., Lin, G. C., Cheng, S. C., Chen, H. R., & Huang, Y. M. (2020). Applying sentiment analysis to automatically classify consumer comments concerning marketing 4Cs aspects. Applied Soft Computing, 97, 106755. https://doi.org/10.1016/j.asoc.2020.106755
Go to original source...
- Liu, H., & Motoda, H. (Eds.). (2008). Computational methods of feature selection. CRC Press.
Go to original source...
- Ma, L., & Sun, B. (2020). Machine learning and AI in marketing-Connecting computing power to human insights. International Journal of Research in Marketing, 37(3), 481-504. https://doi.org/10.1016/j.ijresmar.2020.04.005
Go to original source...
- Mahmoudi, H., Farajpour, M., & Afrasiabi, S. (2021). The preferences of consumers for organic tea: evidence from a stated choice experiment. Journal of the Saudi Society of Agricultural Sciences, 20(4), 265-269. https://doi.org/10.1016/j.jssas.2021.02.006
Go to original source...
- MATLAB. (2020). Matlab R2020b. The MathWorks Inc.
- Motoda, H., & Liu, H. (2002). Feature selection, extraction and construction. Communication of IICM, 5, 67-72.
- Netąajev, E. (2016). Motor insurance clients risk level evaluation using artificial neural networks and deep learning. https://a-lab.ee/edu/sites/default/files/Netsajev_Bsc.pdf
- Nor Shamsiah, M. Y. (2018). Governor's remarks at the Malaysian Insurance Institute (MII) Summit - "Innovation in a disruptive era". https://www.bnm.gov.my/-/governor-s-remarks-at-the-malaysian-insurance-institute-mii-summit-innovation-in-a-disruptive-era-
- Rahangdale, G., Ahirwar, M., & Motwani, M. (2016). Application of k-NN and Naive Bayes Algorithm in Banking and Insurance Domain. International Journal of Computer Science Issues, 13(5), 69. https://doi.org/10.20943/01201605.6975
Go to original source...
- Seewald, A. K. (2000). CoIL Challenge 2000 - submitted solution. http://liacs.leidenuniv.nl/~puttenpwhvander/library/cc2000/SEEWAL~1.pdf
- Shrestha, A., & Mahmood, A. (2019). Review of deep learning algorithms and architectures. IEEE Access, 7, 53040-53065. https://doi.org/10.1109/ACCESS.2019.2912200
Go to original source...
- Shtovba, S., & Mashnitskiy, Y. (2000). The backpropagation multilayer feedforward neural network based competition task solution. Vinnitas State Technical University.
- Tsai, P. H., Lin, G. Y., Zheng, Y. L., Chen, Y. C., Chen, P. Z., & Su, Z. C. (2020). Exploring the effect of Starbucks' green marketing on consumers' purchase decisions from consumers' perspective. Journal of Retailing and Consumer Services, 56, 102162. https://doi.org/10.1016/j.jretconser.2020.102162
Go to original source...
- Van der Putten, P., & Van Someren, M. (2000a). CoIL Challenge 2000: The insurance company case. In Technical Report 2000-09. Leiden Institute of Advanced Computer Science, Universiteit van Leiden. https://www.kaggle.com/uciml/caravan-insurance-challenge
- Van der Putten, P., de Ruiter, M. & van Someren, M. (2000b). CoIL challenge 2000 tasks and results: Predicting and explaining caravan policy ownership. CoIL Challenge, 2000. http://liacs.leidenuniv.nl/~puttenpwhvander/library/cc2000/PUTTEN~1.pdf
- Van der Putten, P., & Van Someren, M. (2004). A bias-variance analysis of a real world learning problem: The CoIL challenge 2000. Machine learning, 57(1), 177-195. https://doi.org/10.1023/B:MACH.0000035476.95130.99
Go to original source...
- Verma, S., Sharma, R., Deb, S., & Maitra, D. (2021). Artificial intelligence in marketing: Systematic review and future research direction. International Journal of Information Management Data Insights, 1(1), 100002. https://doi.org/10.1016/j.jjimei.2020.100002
Go to original source...
- Vesanto, J., & Sinkkonen, J. (2000). Submission for the CoIL Challenge 2000. http://liacs.leidenuniv.nl/~puttenpwhvander/library/cc2000/VESANT~1.pdf
- Wu, X., & Barbará, D. (2002). Modeling and imputation of large incomplete multidimensional datasets. In Proceedings of the International Conference on Data Warehousing and Knowledge Discovery (pp. 286-295). Springer.
Go to original source...
- Xu, D., Liu, E., Wang, X., Tang, H., & Liu, S. (2018). Rural households' livelihood capital, risk perception, and willingness to purchase earthquake disaster insurance: Evidence from southwestern China. International Journal of Environmental Research and Public Health, 15(7), 1319. https://doi.org/10.3390/ijerph15071319
Go to original source...
- Zadrozny, B., & Elkan, C. (2002). Transforming classifier scores into accurate multiclass probability estimates. In Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 694-699). ACM. https://doi.org/10.1145/775047.775151
Go to original source...
- Zainuddin, Z., Lai, K. H., & Ong, P. (2016). An enhanced harmony search based algorithm for feature selection: Applications in epileptic seizure detection and prediction. Computers & Electrical Engineering, 53, 143-162. https://doi.org/10.1016/j.compeleceng.2016.02.009
Go to original source...
- Zhang, W., Du, Y., Yoshida, T., & Wang, Q. (2018). DRI-RCNN: An approach to deceptive review identification using recurrent convolutional neural network. Information Processing & Management, 54(4), 576-592. https://doi.org/10.1016/j.ipm.2018.03.007
Go to original source...
- Zou, Q., Xie, S., Lin, Z., Wu, M., & Ju, Y. (2016). Finding the best classification threshold in imbalanced classification. Big Data Research, 5, 2-8. https://doi.org/10.1016/j.bdr.2015.12.001
Go to original source...
This is an open access article distributed under the terms of the Creative Commons Attribution 4.0 International License (CC BY 4.0), which permits use, distribution, and reproduction in any medium, provided the original publication is properly cited. No use, distribution or reproduction is permitted which does not comply with these terms.