A SYSTEMATIC LITERATURE REVIEW ON AI AND ANALYTICS TOOLS FOR PREDICTING ENTREPRENEURSHIP ATTRIBUTES IN MALAYSIA'S HIGHER EDUCATION INSTITUTIONS: DEVELOPING A CONTEXTUAL TAXONOMY

Supplementary Files

PDF

Keywords

Artificial Intelligence
Data Analytics
Entrepreneurship
Predictive Modelling
Systematic Literature Review

How to Cite

Ismail, N. A., & Ghazali, A. J. . (2025). A SYSTEMATIC LITERATURE REVIEW ON AI AND ANALYTICS TOOLS FOR PREDICTING ENTREPRENEURSHIP ATTRIBUTES IN MALAYSIA’S HIGHER EDUCATION INSTITUTIONS: DEVELOPING A CONTEXTUAL TAXONOMY. Journal of Engineering & Technological Advances , 10(2), 1-23. https://doi.org/10.35934/segi.v10i2.131

Abstract

Entrepreneurship drives economic and social progress, yet predicting key attributes such as cognitive (e.g., creativity, opportunity recognition), emotional (e.g., resilience, emotional intelligence), and social (e.g., leadership, communication) traits remains challenging due to the limitations of traditional methods. Traditional methods lack precision, driving the adoption of artificial intelligence (AI)-based data analytics tools to analyse complex datasets. This paper employs a Systematic Literature Review (SLR) to synthesise 50 peer-reviewed studies from 2018-2024, using thematic analysis to categorise AI-based data analytics tools into supervised learning, unsupervised learning, natural language processing (NLP), and deep learning, mapping their applications to entrepreneurship attributes. Findings highlight the dominance of models like Random Forests and Transformers in predicting entrepreneurship attributes, with gaps in exploring non-financial attributes such as creativity and resilience, and ethical considerations, including data bias and privacy concerns. This study investigates opportunities and challenges for Malaysian Higher Education Institutions (HEIs), emphasising personalised education and research advancements against barriers such as bias and resource constraints. Contributions include a novel taxonomy that maps AI-based data analytics tools to entrepreneurship attributes and incorporates an ethical evaluation dimension, alongside practical insights for stakeholders and a research agenda for ethical, inclusive AI adoption in entrepreneurship globally and in Malaysia.

https://doi.org/10.35934/segi.v10i2.131

References

Abdul Kadir, M. A. B., & Mhd Sarif, S. (2016). Social Entrepreneurship, Social Entrepreneur and Social Enterprise:A Review of Concepts, Definitions and Development in Malaysia. Journal of Emerging Economies and Islamic Research. https://doi.org/10.24191/jeeir.v4i2.9086

Afolabi, A. O., & Akinola, A. (2021). An Empirical Investigation of the Mentor-Mentee Relationship Among Female Architects and Female Architectural Students. International Journal of Emerging Technologies in Learning. https://doi.org/10.3991/ijet.v16i13.21971

Al Amin, A., Hossen, S., Mehedi Hasan Refat, M., Ghosh, P., & Marouf, A. Al. (2022). Predicting Entrepreneurship Skills of Tertiary-Level Students Using Machine Learning Algorithms. In Lecture Notes on Data Engineering and Communications Technologies. https://doi.org/10.1007/978-981-16-7167-8_52

Albury, N. J. (2021). Linguistic landscape and metalinguistic talk about societal multilingualism. International Journal of Bilingual Education and Bilingualism. https://doi.org/10.1080/13670050.2018.1452894

Alom, M. Z., Taha, T. M., Yakopcic, C., Westberg, S., Sidike, P., Nasrin, M. S., Hasan, M., Van Essen, B. C., Awwal, A. A. S., & Asari, V. K. (2019). A state-of-the-art survey on deep learning theory and architectures. In Electronics (Switzerland). https://doi.org/10.3390/electronics8030292

Angin, M., Ta?demir, B., Y?lmaz, C. A., Demiralp, G., Atay, M., Angin, P., & Dikmener, G. (2022). A RoBERTa Approach for Automated Processing of Sustainability Reports. Sustainability (Switzerland). https://doi.org/10.3390/su142316139

Ariffin, A. S., Maavak, M., Dolah, R., & Muthazaruddin, M. N. (2023). Formulation Of AI Governance and Ethics Framework to Support the Implementation of Responsible AI for Malaysia. Res Militaris European Journal of Military Studies .

Bahari, N., Saufi, R. A., Zainol, N. R., Samad, N. S. A., & Yaziz, M. F. A. (2023). Entrepreneur’S Personality Traits and Firm Performance of Malaysian Smes: Mediated By Market Orientation. International Journal of Professional Business Review, 8(1), 1–17. https://doi.org/10.26668/businessreview/2023.v8i1.1260

Biol, C. S. L. (2024). Supervised and Unsupervised Machine Learning Approaches in Predicting Startup Success. 19(1), 203–208.

Braun, V., & Clarke, V. (2006). Using thematic analysis in psychology. Qualitative Research in Psychology. https://doi.org/10.1191/1478088706qp063oa

Chae, B. (Kevin), & Goh, G. (2020). Digital entrepreneurs in artificial intelligence and data analytics: Who are they? Journal of Open Innovation: Technology, Market, and Complexity. https://doi.org/10.3390/JOITMC6030056

Cholil, S. R., Gernowo, R., Widodo, C. E., Wibowo, A., Warsito, B., & Hirzan, A. M. (2024). Predicting Startup Success Using Tree-Based Machine Learning Algorithms. Revista de Informatica Teorica e Aplicada. https://doi.org/10.22456/2175-2745.133375

Dahiya, N., Gupta, S., & Singh, S. (2022). A Review Paper on Machine Learning Applications, Advantages, and Techniques. ECS Transactions. https://doi.org/10.1149/10701.6137ecst

Dahri, N. A., Yahaya, N., Al-Rahmi, W. M., Vighio, M. S., Alblehai, F., Soomro, R. B., & Shutaleva, A. (2024). Investigating AI-based academic support acceptance and its impact on students’ performance in Malaysian and Pakistani higher education institutions. Education and Information Technologies. https://doi.org/10.1007/s10639-024-12599-x

Dai, Y., & Wang, T. (2021). Prediction of customer engagement behaviour response to marketing posts based on machine learning. Connection Science. https://doi.org/10.1080/09540091.2021.1912710

Data Flair Team. (2021). Advantages and Disadvantages of Machine Learning Language - DataFlair. In Data Flair.

Dhochak, M., Pahal, S., & Doliya, P. (2024). Predicting the Startup Valuation: A deep learning approach. Venture Capital. https://doi.org/10.1080/13691066.2022.2161968

Elaziz, E. A., Fathalla, R., & Shaheen, M. (2023). Deep reinforcement learning for data-efficient weakly supervised business process anomaly detection. Journal of Big Data. https://doi.org/10.1186/s40537-023-00708-5

Feng, J., Ahmad, Z., & Zheng, W. (2023). Factors influencing women’s entrepreneurial success: A multi-analytical approach. Frontiers in Psychology. https://doi.org/10.3389/fpsyg.2022.1099760

Ghani, F. A., Shabri, S. M., Rasli, M. A. M., Razali, N. A., & Shuffri, E. H. A. (2020). An overview of the Personal Data Protection Act 2010 ( PDPA ): Problems and solutions. Global Business and Management Research?: An International Journal.

Gong, Z., Nanjappan, V., & Georgiev, G. V. (2023). Experience Of Creativity And Individual Cultural Values In Ideation. Proceedings of the Design Society. https://doi.org/10.1017/pds.2023.176

Gosztonyi, M., & Judit, C. F. (2022). Profiling (Non-)Nascent Entrepreneurs in Hungary Based on Machine Learning Approaches. Sustainability (Switzerland). https://doi.org/10.3390/su14063571

Gyimah, P., & Lussier, R. N. (2021). Rural entrepreneurship success factors: An empirical investigation in an emerging market. Journal of Small Business Strategy. https://doi.org/10.53703/001c.29470

Halim, M. A. S. A., Awang, N. F., Omar, K., Saputra, J., Kamaruddin, S. N. A. A., & Samsudin, H. (2022). Analyzing the Factors that Influence the Entrepreneur Business Performance of Tourism Destination of Kuala Terengganu City Centre, Malaysia. Journal of Environmental Management and Tourism. https://doi.org/10.14505/jemt.v13.2(58).10

Hamilton, L. M., & Lahne, J. (2022). Natural Language Processing. In Rapid Sensory Profiling Techniques: Applications in New Product Development and Consumer Research, Second Edition. https://doi.org/10.1016/B978-0-12-821936-2.00004-2

Hossain, M. S., Nayla, N., & Rassel, A. A. (2022). Product Market Demand Analysis Using Nlp In Banglish Text With Sentiment Analysis And Named Entity Recognition. 2022 56th Annual Conference on Information Sciences and Systems, CISS 2022. https://doi.org/10.1109/CISS53076.2022.9751188

Huang, K., Zhou, Y., Yu, X., & Su, X. (2024). Innovative entrepreneurial market trend prediction model based on deep learning: Case study and performance evaluation. Science Progress, 107(3), 1–20. https://doi.org/10.1177/00368504241272722

Jelodar, H., Wang, Y., Orji, R., & Huang, S. (2020). Deep Sentiment Classification and Topic Discovery on Novel Coronavirus or COVID-19 Online Discussions: NLP Using LSTM Recurrent Neural Network Approach. IEEE Journal of Biomedical and Health Informatics. https://doi.org/10.1109/JBHI.2020.3001216

Jimainal, M. N. H., Hassan, R. A., Kalimin, K. M., Ansar, R., Chekima, B., & Fook, L. M. (2022). The Effect of Business Support from Business Incubator towards the Performance of Entrepreneurs in the Early Start-Up Companies in Malaysia with the Moderating Effect of Risk-taking Propensity. Journal of Entrepreneurship and Business Innovation. https://doi.org/10.5296/jebi.v9i2.20073

Khalid, N. (2020). Artificial intelligence learning and entrepreneurial performance among university students: evidence from malaysian higher educational institutions. Journal of Intelligent and Fuzzy Systems. https://doi.org/10.3233/JIFS-189026

Kitchenham, B. A., & Charters, S. (2007). Guidelines for performing Systematic Literature Reviews in Software Engineering (Software Engineering Group, Department of Computer Science, Keele …. Technical Report EBSE 2007- 001. Keele University and Durham University Joint Report.

Leng, O. T. S., Vergara, R. G., & Khan, S. (2021). Digital Tracing and Malaysia’s Personal Data Protection Act 2010 amid the Covid-19 Pandemic. Asian Journal of Law and Policy. https://doi.org/10.33093/ajlp.2021.3

Li, Q. (2024). Study on the Flipped Classroom Teaching Model of “business English Translation” in the Context of Big Data. Applied Mathematics and Nonlinear Sciences. https://doi.org/10.2478/amns.2023.2.00213

Lidman, L. (2024). The Gap Between the Rhetorical Why and the Practical What and How of Public Sector Innovation. International Journal of Public Administration. https://doi.org/10.1080/01900692.2023.2197175

Lukita, C., Lutfiani, N., Panjaitan, A. R. S., Bhima, Rahardja, U., & Huzaifah, M. L. (2023). Harnessing The Power of Random Forest in Predicting Startup Partnership Success. 2023 8th International Conference on Informatics and Computing, ICIC 2023. https://doi.org/10.1109/ICIC60109.2023.10381988

Malik, A., Onyema, E. M., Dalal, S., Lilhore, U. K., Anand, D., Sharma, A., & Simaiya, S. (2023). Forecasting students’ adaptability in online entrepreneurship education using modified ensemble machine learning model. Array. https://doi.org/10.1016/j.array.2023.100303

Mohd Razalli, N. L. Y., & Abdul Kadir, M. A. B. (2022). Bumiputera Graduate Entrepreneurs in Describing Social Status and Their Social Status Attainment Experience. Malaysian Journal of Social Sciences and Humanities (MJSSH). https://doi.org/10.47405/mjssh.v7i2.1302

Moses, A., & Bharadwaja Kumar, G. (2021). Snippet generation using textbook corpus - An NLP approach based on BERT. Journal of Physics: Conference Series. https://doi.org/10.1088/1742-6596/1716/1/012061

Mousa, G. A., Elamir, E. A. H., & Hussainey, K. (2022). Using machine learning methods to predict financial performance: Does disclosure tone matter? International Journal of Disclosure and Governance. https://doi.org/10.1057/s41310-021-00129-x

Nasreen, F., Ramzy, M. I., Fern, Y. S., & Ai, Y. J. (2024). Predicting Factors That Inspire Entrepreneurial Intention Among Malaysian University Students. Malaysian Online Journal of Educational Management.

Ntumi, S., Agbenyo, S., & Bulala, T. (2023). Estimating the Psychometric Properties (Item Difficulty, Discrimination and Reliability Indices) of Test Items using Kuder-Richardson Approach (KR-20). Shanlax International Journal of Education. https://doi.org/10.34293/education.v11i3.6081

Ojeda-Beltrán, A., Solano-Barliza, A., Arrubla-Hoyos, W., Ortega, D. D., Cama-Pinto, D., Holgado-Terriza, J. A., Damas, M., Toscano-Vanegas, G., & Cama-Pinto, A. (2023). Characterisation of Youth Entrepreneurship in Medellín-Colombia Using Machine Learning. Sustainability (Switzerland). https://doi.org/10.3390/su151310297

Okoye, K., Nganji, J. T., Escamilla, J., & Hosseini, S. (2024). Machine learning model (RG-DMML) and ensemble algorithm for prediction of students’ retention and graduation in education. Computers and Education: Artificial Intelligence. https://doi.org/10.1016/j.caeai.2024.100205

Ortiz-Perez, D., Ruiz-Ponce, P., Tomás, D., Garcia-Rodriguez, J., Vizcaya-Moreno, M. F., & Leo, M. (2023). A Deep Learning-Based Multimodal Architecture to predict Signs of Dementia. Neurocomputing. https://doi.org/10.1016/j.neucom.2023.126413

Pasayat, A. K., Mitra, A., & Bhowmick, B. (2024). Determination of essential features for predicting start-up success: an empirical approach using machine learning. Technology Analysis and Strategic Management. https://doi.org/10.1080/09537325.2022.2116569

Pendy, B. (2023). Role of AI in Business Management. Brilliance: Research of Artificial Intelligence. https://doi.org/10.47709/brilliance.v3i1.2191

Prakash, N. C. P., Narasimhaiah, A. P., Nagaraj, J. B., Pareek, P. K., Maruthikumar, N. B., & Manjunath, R. I. (2022). Implementation of NLP based automatic text summarization using spacy. International Journal of Health Sciences. https://doi.org/10.53730/ijhs.v6ns5.10574

Rijati, N., Sumpeno, S., & Purnomo, M. H. (2019). Attribute Selection Techniques to Clustering the Entrepreneurial Potential of Student based on Academic Behavior. 2019 IEEE International Conference on Computational Intelligence and Virtual Environments for Measurement Systems and Applications, CIVEMSA 2019 - Proceedings. https://doi.org/10.1109/CIVEMSA45640.2019.9071597

Sáez-Ortuño, L., Huertas-Garcia, R., Forgas-Coll, S., & Puertas-Prats, E. (2023). How can entrepreneurs improve digital market segmentation? A comparative analysis of supervised and unsupervised learning algorithms. International Entrepreneurship and Management Journal. https://doi.org/10.1007/s11365-023-00882-1

Samala, A. D., Usmeldi, Taali, Ambiyar, Bojic, L., Indarta, Y., Tsoy, D., Denden, M., Tas, N., & Dewi, I. P. (2023). Metaverse Technologies in Education: A Systematic Literature Review Using PRISMA. International Journal of Emerging Technologies in Learning. https://doi.org/10.3991/IJET.V18I05.35501

Sangsavate, S., Sinthupinyo, S., & Chandrachai, A. (2023). Experiments of Supervised Learning and Semi-Supervised Learning in Thai Financial News Sentiment: A Comparative Study. ACM Transactions on Asian and Low-Resource Language Information Processing. https://doi.org/10.1145/3603499

Saryani, Handayani, I., & Agustina, R. (2022). Starting a Digital Business: Being a Millennial Entrepreneur Innovating. Startupreneur Business Digital (SABDA Journal). https://doi.org/10.34306/sabda.v1i2.113

Schade, P., & Schuhmacher, M. C. (2023). Predicting entrepreneurial activity using machine learning. Journal of Business Venturing Insights. https://doi.org/10.1016/j.jbvi.2022.e00357

Shane, S. (2013). A General Theory of Entrepreneurship. In A General Theory of Entrepreneurship. https://doi.org/10.4337/9781781007990

Shaowei, Q., Tianhua, L., & Miao, Z. (2022). Predictive Factors of the Entrepreneurial Performance of Undergraduates. Frontiers in Psychology, 13(March), 1–9. https://doi.org/10.3389/fpsyg.2022.814759

Shepherd, D. A., & Majchrzak, A. (2022). Machines augmenting entrepreneurs: Opportunities (and threats) at the Nexus of artificial intelligence and entrepreneurship. Journal of Business Venturing. https://doi.org/10.1016/j.jbusvent.2022.106227

Sinaga, K. P., & Yang, M. S. (2020). Unsupervised K-means clustering algorithm. IEEE Access. https://doi.org/10.1109/ACCESS.2020.2988796

Talaei Khoei, T., & Kaabouch, N. (2023). Machine Learning: Models, Challenges, and Research Directions. In Future Internet. https://doi.org/10.3390/fi15100332

Xu, S., Zhang, C., & Hong, D. (2022). BERT-based NLP techniques for classification and severity modeling in basic warranty data study. Insurance: Mathematics and Economics. https://doi.org/10.1016/j.insmatheco.2022.07.013

Yang, Y., Qian, C., Li, H., Gao, Y., Wu, J., Liu, C. J., & Zhao, S. (2022). An efficient DBSCAN optimized by arithmetic optimization algorithm with opposition-based learning. Journal of Supercomputing. https://doi.org/10.1007/s11227-022-04634-w

Yin, C. W., Arif, E. E. M., Theam, T. S., Sen, S. C., Ying, T. C. Y., & Huei, C. T. (2023). Determinants of the Sustainability of Tech Startup: Comparison Between Malaysia and China. Lecture Notes in Networks and Systems. https://doi.org/10.1007/978-3-031-16865-9_45

Zhou, Q. (2024). Innovation of educational management paths in higher education based on LSTM deep learning model. Applied Mathematics and Nonlinear Sciences. https://doi.org/10.2478/amns.2023.2.00972

Zulkifle, A. M., & Aziz, K. A. (2023). Determinants of Social Entrepreneurship Intention: A Longitudinal Study among Youth in Higher Learning Institutions. Social Sciences. https://doi.org/10.3390/socsci12030124

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.

Copyright (c) 2025 Array