• DocumentCode
    1592973
  • Title

    A Dynamic Pricing Algorithm by Bayesian Q-learning

  • Author

    Han, Wei

  • Author_Institution
    Inf. Eng. Coll., Nanjing Univ. of Finance & Econ., Nanjing, China
  • Volume
    2
  • fYear
    2010
  • Firstpage
    515
  • Lastpage
    519
  • Abstract
    In electronic marketplaces automated and dynamic pricing is becoming increasingly popular. Agents that perform this task can improve themselves by learning from past observations, possibly using reinforcement learning techniques. several papers studied the use of Q-learning for modeling the problem of dynamic pricing in electronic marketplaces. But The extension of reinforcement learning (RL) to large state space has inevitably encountered the problem of the curse of dimensionality. Improving the learning efficiency of the agent is much more important to the practical application of RL. To address the problem of dynamic pricing, we take a Bayesian model-based approach, framing transition function and reward function of MDP as distributions, and use sampling technique for action selection. The Bayesian approach accounts for the general problem of exploration vs exploitation tradeoff. Simulations show our dynamic pricing algorithm improves the profits compares with other pricing strategies based on the same pricing model.
  • Keywords
    Bayes methods; electronic commerce; learning (artificial intelligence); pricing; Bayesian Q-learning; automated pricing; dynamic pricing algorithm; electronic marketplaces; framing transition function; reinforcement learning techniques; Bayesian methods; Computational modeling; Computer simulation; Consumer electronics; Convergence; Educational institutions; Heuristic algorithms; Learning; Pricing; State-space methods; Dynamic Pricing; Electronic Marketplaces; Q-learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Computer Modeling and Simulation, 2010. ICCMS '10. Second International Conference on
  • Conference_Location
    Sanya, Hainan
  • Print_ISBN
    978-1-4244-5642-0
  • Electronic_ISBN
    978-1-4244-5643-7
  • Type

    conf

  • DOI
    10.1109/ICCMS.2010.240
  • Filename
    5421144