Dynamic Price Allocation and Optimization for E-Commerce Platforms Using Reinforcement Learning and Deep Learning

Deepak Muvva; B.Sudheer Babu; V.Krishnateja; SK.Ayesha Tahseen; Ch.Bharadwaja

doi:10.51583/IJLTEMAS.2026.150500043

Deepak Muvva

Department of Advanced Computer Vignan’s Foundation for Science, Technology and Research Vadlamudi. Guntur,Andhra Pradesh,522 213

B.Sudheer Babu

Department of Advanced Computer Vignan’s Foundation for Science, Technology and Research Vadlamudi. Guntur,Andhra Pradesh,522 213

V.Krishnateja

Department of Advanced Computer Vignan’s Foundation for Science, Technology and Research Vadlamudi. Guntur,Andhra Pradesh,522 213

SK.Ayesha Tahseen

Department of Advanced Computer Vignan’s Foundation for Science, Technology and Research Vadlamudi. Guntur,Andhra Pradesh,522 213

Ch.Bharadwaja

Department of Advanced Computer Vignan’s Foundation for Science, Technology and Research Vadlamudi. Guntur,Andhra Pradesh,522 213

DOI: https://doi.org/10.51583/IJLTEMAS.2026.150500043

Published: May 26, 2026

The concept of dynamic pricing has already become one of the most significant aspects of electronic commerce, which is constantly changing in terms of the level of demand and competition, as well as the response of customers to a specific product (or service). Conventional methods of pricing and reinforcement learning methods like Deep Q-Networks (DQN) tend to have restricted flexibility, dis- crete action, and no proper estimation of demand. Our uncertainty-aware dynamic pricing framework as offered in this paper incorporates a hybrid demands forecasting, Transformer-LSTM demand forecasting model and Soft-Actor-Critic (SAC) reinforcement learning to optimize prices continuously. The Trans- former component models the long-range time interdependencies whereas the LSTM models the sequential nature of demand patterns and allows it to predict the demand robustly and precisely. The state representation of the SAC agent, which learns the optimal pricing policies under dynamic market, takes these forecasts into consideration.

The suggested system is implemented on a scalable, API-focused system of microservices and allows making real-time pricing decisions. Online Retail II Evaluation The experimental analysis of the Online Retail II data reveals that the improvement of experimental approaches is substantial as compared to the baseline techniques. Demand forecasting with the model has an R 2 of 0.62 with a Mean Absolute Percentage Error (MAPE) of 8.7% and one can increase the revenue by 21.4% and the profit by 18.2% over the expected traditional methods of reinforcement learning.

The findings demonstrate the efficacy of melding cutting-edge deep learning and reinforcement learning approaches to scal- able, adaptable, and smart pricing in the practical e-commerce setting.

Dynamic Price Allocation and Optimization for E-Commerce Platforms Using Reinforcement Learning and Deep Learning. (2026). International Journal of Latest Technology in Engineering Management & Applied Science, 15(5), 473-487. https://doi.org/10.51583/IJLTEMAS.2026.150500043

Downloads

References

J. Liu et al., “Dynamic Pricing on E-Commerce Platform with Deep Reinforcement Learning: A Field Experiment,” arXiv preprint arXiv:1912.02572, 2021.

H. Yin and Q. Han, “Dynamic Pricing Model of E-Commerce Plat- forms Based on Deep Reinforcement Learning,” Computer Modeling in Engineering & Sciences, 2021.

J. Sun et al., “Dynamic Pricing Model for E-Commerce Products Based on DDQN,” 2024.

A. Holovko and T. Firman, “Batch Reinforcement Learning for Dynamic Pricing,” 2021.

F. Lange et al., “Reinforcement Learning vs Dynamic Programming for Pricing,” 2025.

L. Guo and X. Zhang, “Dynamic Pricing using LSTM,” IEEE Access, 2025.

S. Kumar et al., “Weight Optimized LSTM for Pricing,” 2023.

L. Terrada et al., “Demand Forecasting using Deep Learning,” 2022.

H. Li and R. Xin, “Deep Learning Pricing Model,” 2024.

Krishna and E. Aravind, “Hybrid XGBoost-LSTM Model,” 2023.

S. Ameli et al., “DRL for Dynamic Pricing,” 2025.

M. Mahmud et al., “Forecasting + RL Pricing,” 2025.

Q. Zhao et al., “Multi-Objective Pricing using DDPG,” 2025.

R. Afshar et al., “Automated DRL Pipeline,” IEEE TAI, 2022.

D. Patel, “RL in Pricing Models,” 2022.

This work is licensed under a Creative Commons Attribution 4.0 International License.

All articles published in our journal are licensed under CC-BY 4.0, which permits authors to retain copyright of their work. This license allows for unrestricted use, sharing, and reproduction of the articles, provided that proper credit is given to the original authors and the source.

How to Cite

Dynamic Price Allocation and Optimization for E-Commerce Platforms Using Reinforcement Learning and Deep Learning. (2026). International Journal of Latest Technology in Engineering Management & Applied Science, 15(5), 473-487. https://doi.org/10.51583/IJLTEMAS.2026.150500043

Download Citation

Article Sidebar

Main Article Content

Downloads

References

Article Details

How to Cite