Dynamic pricing under competition using reinforcement learning

Dynamic pricing is considered a possibility to gain an advantage over competitors in modern online markets. The past advancements in Reinforcement Learning (RL) provided more capable algorithms that can be used to solve pricing problems. In this paper, we study the performance of Deep Q-Networks (DQN) and Soft Actor Critic (SAC) in different market models. We consider tractable duopoly settings, where optimal solutions derived by dynamic programming techniques can be used for verification, as well as oligopoly settings, which are usually intractable due to the curse of dimensionality. We find that both algorithms provide reasonable results, while SAC performs better than DQN. Moreover, we show that under certain conditions, RL algorithms can be forced into collusion by their competitors without direct communication.

Metadaten
Verfasserangaben:	Alexander Kastius, Rainer Schlosser ORCiD GND
DOI:	https://doi.org/10.1057/s41272-021-00285-3
ISSN:	1476-6930
ISSN:	1477-657X
Titel des übergeordneten Werks (Englisch):	Journal of revenue and pricing management
Verlag:	Springer Nature Switzerland AG
Verlagsort:	Cham
Publikationstyp:	Wissenschaftlicher Artikel
Sprache:	Englisch
Datum der Erstveröffentlichung:	27.02.2021
Erscheinungsjahr:	2022
Datum der Freischaltung:	09.02.2023
Freies Schlagwort / Tag:	Competition; Dynamic pricing; E-commerce; Price collusion; Reinforcement learning
Band:	21
Ausgabe:	1
Seitenanzahl:	14
Erste Seite:	50
Letzte Seite:	63
Fördernde Institution:	Projekt DEAL
Organisationseinheiten:	An-Institute / Hasso-Plattner-Institut für Digital Engineering gGmbH
DDC-Klassifikation:	3 Sozialwissenschaften / 33 Wirtschaft / 330 Wirtschaft
Peer Review:	Referiert
Publikationsweg:	Open Access / Hybrid Open-Access
Lizenz (Deutsch):	CC-BY - Namensnennung 4.0 International