Acta Informatica Pragensia X:X | DOI: 10.18267/j.aip.299160

Fairness-Aware Multimodal Machine Learning for Retail Stock Prediction from Sentiment and Market Data

Sanjay Rastogi ORCID...1, Kamal Upreti ORCID...2, Uma Shankar ORCID...3, Pravin Ramdas Kshirsagar ORCID...4, Tan Kuan Tak ORCID...5, Rituraj Jain ORCID...6, Ganesh Veluswwamy Radhakrishnan ORCID...7
2 Department of Computer Science, CHRIST (Deemed to be University), Delhi NCR Campus, Ghaziabad, India
3 Department of Financial Technology, University of Wollongong, India Branch Campus, Ahmadabad, Gujarat, India
4 Department of Electronics and Telecommunication, J D College of Engineering & Management, Nagpur, Maharashtra, India
5 Engineering Cluster, Singapore Institute of Technology, Singapore
6 Department of Information Technology, Marwadi University, Rajkot, Gujarat, India
7 Department of Economics and Finance, Kalinga School of Management, Kalinga Institute of Industrial Technology, Bhubaneswar, India

Background: The introduction of retail investors to AI-powered trading platforms and especially on emerging markets, has resulted in a new set of risks linked to algorithmic bias and financial forecasting fairness. Social media sentiment and structured data multimodal strategies have demonstrated a potential, but frequently do not have ethical considerations.

Objective: This work proposes a multimodal model predictive control (MPC) framework grounded in fairness-based forecasting of next-day returns on stock in stock market settings, particularly ethical behaviour and transparency of the model on retail markets.

Methods: We combine BERT-based sentiment analysis of Reddit discussions and organized stock market indicators and use XGBoost as the fundamental model. Bias is measured using fairness metrics, including demographic parity difference and equal opportunity difference. Debiasing measures such as reweighting and stratified calibration were used to curb the differences in stock categories.

Results: The first model has an overall accuracy of 72.3 with the highest accuracy of 83.1 in the case of Tesla – representing bias in the model. Fairness assessment shows some significant differences (DPD=0.23, EOD=0.31), but the mitigation decreases to 0.07. However, the massive performance improvement after adjustment brings up the issue of overfitting or fairness overcorrection.

Conclusion: While the proposed debiased framework successfully reduces algorithmic bias, the trade-off between fairness and generalizability underscores the need for caution. These results hold significant implications for digital trading systems and regulatory frameworks of emerging economies such as India, where explainability and fairness of AI models are significant for ethical financial engagement.

Keywords: Algorithmic fairness; BERT embedding; Demographic parity; Financial machine learning; Hybrid model; Stock prediction.

Received: August 3, 2025; Revised: December 4, 2025; Accepted: December 6, 2025; Prepublished online: March 3, 2026 

Download citation

References

  1. Addy, W. A., Ajayi-Nifise, A. O., Bello, B. G., Tula, S. T., Odeyemi, O., & Falaiye, T. (2024). Algorithmic Trading and AI: A Review of Strategies and Market impact. World Journal of Advanced Engineering Technology and Sciences, 11(1), 258-267. https://doi.org/10.30574/wjaets.2024.11.1.0054 Go to original source...
  2. Agrawal, A., Catalini, C., & Goldfarb, A. (2015). Crowdfunding: Geography, Social Networks, and the Timing of Investment Decisions. Journal of Economics & Management Strategy, 24(2), 253-274. https://doi.org/10.1111/jems.12093 Go to original source...
  3. Barman, R. D., Hanfy, F. B. A., Rajendran, R., Masood, G., Dias, B., & Maroor, J. P. (2022). A critical review of determinants of financial innovation in global perspective. Materials Today: Proceedings, 51, 88-94. https://doi.org/10.1016/j.matpr.2021.04.565 Go to original source...
  4. Barocas, S., Hardt, M., & Narayanan, A. (2019). Fairness and Machine Learning: Limitations and opportunities. MIT Press.
  5. Bouzguenda, K. (2018). Emotional intelligence and financial decision making: Are we talking about a paradigmatic shift or a change in practices? Research in International Business and Finance, 44, 273-284. https://doi.org/10.1016/j.ribaf.2017.07.096 Go to original source...
  6. Bunkar, B., & Ramaiah, K. (2024). A serial mediation model for investigating the intention to use algorithmic trading platforms among retail investors in India. Vilakshan - XIMB Journal of Management, 21(2), 263-280. https://doi.org/10.1108/XJM-12-2023-0233 Go to original source...
  7. Caragea, D., Chen, M., Cojoianu, T., Dobri, M., Glandt, K., & Mihaila, G. (2020). Identifying FinTech Innovations Using BERT. In 2020 IEEE International Conference on Big Data (pp. 1117-1126). IEEE. https://doi.org/10.1109/BigData50022.2020.9378169 Go to original source...
  8. Dibb, S., Merendino, A., Aslam, H., Appleyard, L., & Brambley, W. (2021). Whose rationality? Muddling through the messy emotional reality of financial decision-making. Journal of Business Research, 131, 826-838. https://doi.org/10.1016/j.jbusres.2020.10.041 Go to original source...
  9. Egorov, A. (2022). Financial Innovation and Financial Risks. Procedia Computer Science, 214, 441-447. https://doi.org/10.1016/j.procs.2022.11.197 Go to original source...
  10. Gomber, P., Koch, J.-A., & Siering, M. (2017). Digital Finance and FinTech: current research and future research directions. Journal of Business Economics, 87(5), 537-580. https://doi.org/10.1007/s11573-017-0852-x Go to original source...
  11. Gregor, S., & Hevner, A. R. (2013). Positioning and presenting design science research for maximum impact. MIS Quarterly, 37(2), 337-355. https://doi.org/10.25300/misq/2013/37.2.01 Go to original source...
  12. Guo, C., Pleiss, G., Sun, Y., & Weinberger, K. Q. (2017). On calibration of modern neural networks. In Proceedings of the 34th International Conference on Machine Learning, (pp. 1321-1330). MLR Press. http://proceedings.mlr.press/v70/guo17a/guo17a.pdf
  13. Hardt, M., Price, E., & Srebro, N. (2016). Equality of Opportunity in Supervised Learning. In 30th Conference on Neural Information Processing Systems (pp. 1-9). NIPS.
  14. Jun Gu, W., Hao Zhong, Y., Zun Li, S., Song Wei, C., Ting Dong, L., Yue Wang, Z., & Yan, C. (2024). Predicting Stock Prices with FinBERT-LSTM: Integrating News Sentiment Analysis. In Proceedings of the 2024 8th International Conference on Cloud and Big Data Computing, (pp. 67-72). ACM. https://doi.org/10.1145/3694860.3694870 Go to original source...
  15. Kleinberg, J., Ludwig, J., Mullainathan, S., & Sunstein, C. R. (2019). Discrimination in the age of algorithms. SSRN Electronic Journal. https://doi.org/10.2139/ssrn.3329669 Go to original source...
  16. Liu, Z., Zhang, K., & Miao, D. (2025). Evolving Financial Markets: The Impact and Efficiency of AI-Driven Trading Strategies. In Intelligence Science V. ICIS 2024. IFIP Advances in Information and Communication Technology (pp. 301-312). Springer. https://doi.org/10.1007/978-3-031-71253-1_22 Go to original source...
  17. Mehrabi, N., Morstatter, F., Saxena, N., Lerman, K., & Galstyan, A. (2021). A survey on Bias and Fairness in Machine Learning. ACM Computing Surveys, 54(6), 1-35. https://doi.org/10.1145/3457607 Go to original source...
  18. Min, B. H., & Borch, C. (2022). Systemic failures and organizational risk management in algorithmic trading: Normal accidents and high reliability in financi al markets. Social Studies of Science, 52(2), 277-302. https://doi.org/10.1177/03063127211048515 Go to original source...
  19. Mwangi, J., & Njoroge, W. (2024). AI-Driven Innovations in Financial Technology: A Review of Algorithmic Trading and Risk Management. Artificial Intelligence and Machine Learning Review, 5(2), 1-9.
  20. Nerlekar, V. S., Sane, A., Gadekar, M., Gupta, S. K., & Waghulkar, S. (2024). The Herd Mentality: Understanding the Theories and Models of Herding Behavior in Financial Markets. In Psychological Drivers of Herding and Market Overreaction (pp. 255-290). IGI Global. https://doi.org/10.4018/979-8-3693-7827-4.ch010 Go to original source...
  21. Niculescu-Mizil, A., & Caruana, R. (2005). Predicting good probabilities with supervised learning. In ICML '05: Proceedings of the 22nd international conference on Machine learning (pp. 625-632). ACM. https://doi.org/10.1145/1102351.1102430 Go to original source...
  22. Ozili, P. K. (2018). Impact of digital finance on financial inclusion and stability. Borsa Istanbul Review, 18(4), 329-340. https://doi.org/10.1016/j.bir.2017.12.003 Go to original source...
  23. Ozili, P. K. (2023). Digital finance research and developments around the world: a literature review. International Journal of Business Forecasting and Marketing Intelligence, 8(1), 35-51. https://doi.org/10.1504/IJBFMI.2023.127698 Go to original source...
  24. Pal, A., Singh, H., Singh, V. V., & Gupta, A. K. (2024). Technological Mitigation of the Impact of Behavioural Biases on Financial Investment Decision: Evidence from India using A PLS-SEM Approach. In 2024 International Conference on Intelligent & Innovative Practices in Engineering & Management (pp. 1-5). IEEE. https://doi.org/10.1109/IIPEM62726.2024.10925698 Go to original source...
  25. Ran, Z., Gul, A., Akbar, A., Haider, S. A., Zeeshan, A., & Akbar, M. (2021). Role of Gender-Based Emotional Intelligence in Corporate Financial Decision-Making. Psychology Research and Behaviour Management, 14, 2231-2244. https://doi.org/10.2147/PRBM.S335022 Go to original source...
  26. Riefel, D. M. (2024). Social media exposure and its influence on individual investment decisions. University of Twente. https://purl.utwente.nl/essays/104604
  27. Rootpi3. (2024). Stock Price Prediction Dataset. Kaggle. https://www.kaggle.com/Datasets/Rootpi3/Stock-Price-Prediction-with-Sentiment-Analysis?Select=merged_stock%20_sentiment_data.csv
  28. Saelee, R., & Pankham, S. (2024). The Impact of Social Media and Emotional Intelligence on Investment Decision: A Fuzzy Set Delphi Study Among Investors in Thailand's Stock Market. TEM Journal, 2208-2218. https://doi.org/10.18421/TEM133-48 Go to original source...
  29. Sathya, N., & Prabhavathi, C. (2024). The influence of social media on investment decision-making: examining behavioural biases, risk perception, and mediation effects. International Journal of System Assurance Engineering and Management, 15(3), 957-963. https://doi.org/10.1007/s13198-023-02182-x Go to original source...
  30. Saxena, D., & Yasobant, S. (2020). Information Overload. In Encyclopedia of Big Data (pp. 1-3). Springer. https://doi.org/10.1007/978-3-319-32001-4_374-1 Go to original source...
  31. Luchian, A. C., & Strat, V. (2024). The Trustworthiness of AI Algorithms and the Simulator Bias in Trading. In The 7th International Conference on Economics and Social Sciences (pp. 1-10). Editura ASE. https://doi.org/10.24818/ICESS/2024/022 Go to original source...
  32. Suresh G. (2024). Impact of Financial Literacy and Behavioural Biases on Investment Decision-making. FIIB Business Review, 13(1), 72-86. https://doi.org/10.1177/23197145211035481 Go to original source...
  33. Thakkar, N. (2024). Harnessing the Power of Herding for Real (Estate); Factors Influencing Herd Mentality, Driven by Social Media in the Real Estate Market. Open Journal of Business and Management, 12(5), 2983-3003. https://doi.org/10.4236/ojbm.2024.125153 Go to original source...
  34. Vasquez, H., & Cross, D. (2024). The Effects of Social Media on Investment Decisions within an Online Community. SSRN Electronic Journal. https://doi.org/10.2139/ssrn.4765619 Go to original source...
  35. Zhao, Z. (2024). Research on the Impact of Digitalization on Individual Investors' Behaviour from the Perspective of Behavioural Finance. In Applied Economics and Policy Studies (pp. 146-154). Springer. https://doi.org/10.1007/978-981-97-0523-8_13 Go to original source...
  36. Zolotareva, E. (2021). Aiding Long-Term Investment Decisions with XGBoost Machine Learning Model. In Artificial Intelligence and Soft Computing (pp. 414-427). Springer. https://doi.org/10.1007/978-3-030-87897-9_37 Go to original source...

This is an open access article distributed under the terms of the Creative Commons Attribution 4.0 International License (CC BY 4.0), which permits use, distribution, and reproduction in any medium, provided the original publication is properly cited. No use, distribution or reproduction is permitted which does not comply with these terms.