Deep-Reinforcement Learning-Based Architecture for Multi-Objective Optimization of Stock Prediction


  •   Rangappa Jyothi

  •   Gorappa Ningappa Krishnamurthy


Artificial Intelligence has been established to predict the future performance of the trading in modern era including Statistics, Computer Science, and economics, especially the stack market. By analyzing the big data, making the financial decision is important for investors in the stock market. As records, several techniques like Back Propagation Neural Network (BPNN) Recurrent Neural Network (RNN) are used to predict the stock price but due to its computation complexity is major challenging of time serious financial data in the decision-making process. In this paper, we have proposed the Recurrent Q Network Learning (RQNL) to make the right decision according to movements of stock prices as reliable attention. To overcome the computational complexity shortcoming, the forgotten memory is developed in a neural network. In this way, the error arousal is pruned in a significant manner. In addition, Enhancing the prediction ability is outstanding, and reinforcement learning is demonstrated to reduce the correlations in time series data. Discussion and evaluation were based on the NSE dataset are experimented with three different learning approaches, with the proposed method for stock prediction of financial markets. The experiment result carried out that our proposed Recurrent Q Network Learning (RQNL) performs the perfect prediction compared with other existing learning.

Keywords: Deep Learning, LSTM, Price Prediction, Recurrent Neural Network, Reinforcement Learning, Q network


Ricky C., Davis M., Kumiega A., and Van Vliet B. Ethics for automated financial markets. Handbook on ethics in finance, 2020:1–18.

Wu, Mu-En, Jia-Hao Syu, Jerry Chun-Wei Lin, and Jan-Ming Ho. Portfolio management system in equity market neutral using reinforcement learning. Applied Intelligence, 2021;51(11): 8119–8131.

Du, Guansan, Zixian Liu, and Haifeng Lu. Application of innovative risk early warning mode under big data technology in Internet credit financial risk assessment. Journal of Computational and Applied Mathematics, 2021;386:113260.

Ahmed, Abdullahi D., and Rui Huo. Volatility transmissions across international oil market, commodity futures and stock markets: Empirical evidence from China. Energy Economics, 2021;93: 104741.

Syu, Jia-Hao, Mu-En Wu, and Jan-Ming Ho. Portfolio management system with reinforcement learning. In 2020 IEEE International Conference on Systems, Man, and Cybernetics (SMC), pp. 4146–4151. IEEE, 2020.

Li, Yuming, Pin Ni, and Victor Chang. Application of deep reinforcement learning in stock trading strategies and stock forecasting. Computing, 2020;102(6):1305–1322.

Ahmed, Abdulrahman A., Ayman Ghoneim, and Mohamed Saleh. Optimizing stock market execution costs using reinforcement learning. In 2020 IEEE Symposium Series on Computational Intelligence (SSCI), pp. 1083–1090. IEEE, 2020.

Ganesh, Sumitra, Nelson Vadori, Mengda Xu, Hua Zheng, Prashant Reddy, and Manuela Veloso. Reinforcement learning for market making in a multi-agent dealer market. arXiv preprint arXiv:1911.05892, 2019.

Gupta, Somit, Lucy Ulanova, Sumit Bhardwaj, Pavel Dmitriev, Paul Raff, and Aleksander Fabijan. The anatomy of a large-scale experimentation platform. In2018 IEEE International Conference on Software Architecture (ICSA), pp. 1–109. IEEE, 2018.

Alzubi, Omar A., Jafar A. Alzubi, Mohammed Alweshah, Issa Qiqieh, Sara Al-Shami, and Manikandan Ramachandran. An optimal pruning algorithm of classifier ensembles: dynamic programming approach. Neural Computing and Applications, 2020;32(20): 16091–16107.

Bakhach, Amer M., Edward PK Tsang, and V. L. Raju Chinthalapati. TSFDC: A trading strategy based on forecasting directional change. Intelligent Systems in Accounting, Finance and Management, 2018;25(3):105–123.

Meisheri, Hardik, Vinita Baniwal, Nazneen N. Sultana, Balaraman Ravindran, and Harshad Khadilkar. Reinforcement learning for multi-objective optimization of online decisions in high-dimensional systems. arXiv preprint arXiv:1910.00211 (2019).

Chu, Xiangxiang, Bo Zhang, and Ruijun Xu. Multi-objective reinforced evolution in mobile neural architecture search. In European Conference on Computer Vision, pp. 99–113. Springer, Cham, 2020.

Bisht, Kiran, and Arun Kumar. Deep Reinforcement Learning based Multi-Objective Systems for Financial Trading. In 2020 5th IEEE International Conference on Recent Advances and Innovations in Engineering (ICRAIE), pp. 1–6. IEEE, 2020.

Xiong, Zhuoran, Xiao-Yang Liu, Shan Zhong, Hongyang Yang, and Anwar Walid. Practical deep reinforcement learning approach for stock trading. arXiv preprint arXiv:1811.07522, 2018.

Yu, Pengqian, Joon Sern Lee, Ilya Kulyatin, Zekun Shi, and Sakyasingha Dasgupta. Model-based deep reinforcement learning for dynamic portfolio optimization. arXiv preprint arXiv:1901.08740, 2019.

Carta, Salvatore, Andrea Corriga, Anselmo Ferreira, Alessandro Sebastian Podda, and Diego Reforgiato Recupero. A multi-layer and multi-ensemble stock trader using deep learning and deep reinforcement learning. Applied Intelligence, 2021;51(2):889-905.

Aboussalah, Amine Mohamed, and Chi-Guhn Lee. Continuous control with stacked deep dynamic recurrent reinforcement learning for portfolio optimization. Expert Systems with Applications, 2020;140: 112891.

Liang, Zhipeng, Hao Chen, Junhao Zhu, Kangkang Jiang, and Yanran Li. Adversarial deep reinforcement learning in portfolio management. arXiv preprint arXiv:1808.09940 (2018).

Spooner T., Fearnley J. Savani R. and Koukorinis A. Market making via reinforcement learning. arXiv preprint arXiv:1804.04216, 2018

Mosavi al. Comprehensive review of deep reinforcement learning methods and applications in economics. Mathematics, 2020;8.10: 1640.

Rouf al. Stock Market Prediction Using Machine Learning Techniques: A Decade Survey on Methodologies, Recent Developments, and Future Directions. Electronics, 2021;10.21: 2717.

Wang L., Hajric V. The Cost of Bad Market Timing Decisions in 2020 was Annahilation Bloomberg, 2020.

Wang B, Huang H, Wang X. A novel text mining approach to financial time series forecasting.Neurocomputing, 2012;83(6):136–145.

Guo Z, Wang H, Liu Q, Yang J. A Feature Fusion Based Forecasting Model for Financial Time Series Plos One, 2014;9(6): 172-200.

Halko N, Martinsson PG, Tropp JA. Finding structure with randomness: probabilistic algorithms for constructing approximate matrix decompositions. SIAM Rev. 2001;53(2):217–88.

Ghosh A., Bose S., Maji G., Debnath N., Sen S. Stock price prediction using lstm on indian share market. Int. Conf. Comput. Appl. Ind. Eng. 2019;63:101–110.

Azzouni A., Pujolle G. A long short-term memory recurrent neural network framework for network traffic matrix prediction. 2017. arXiv preprint. arXiv:17050 5690.

Enlu L, Chen Q, and Qi X. Deep reinforcement learning for imbalanced classification. Applied Intelligence, 2020;50.8:2488–2502.


Download data is not yet available.


How to Cite
Jyothi, R. and Krishnamurthy, G.N. 2022. Deep-Reinforcement Learning-Based Architecture for Multi-Objective Optimization of Stock Prediction. European Journal of Electrical Engineering and Computer Science. 6, 4 (Jul. 2022), 9–16. DOI: