Title :
Choosing Optimal Seller Based on off-line Learning Negotiation History and K-armed Bandit Problem
Author :
Wang, Li-Ming ; Chai, Yu-Mei ; Huang, Hou-Kuan
Author_Institution :
School of Information Engineering, Zhengzhou University, Zhengzhou 450052, China; School of Computer and Information Technology, Beijing Jiaotong University, Beijing 100044, China; E-MAIL: cymwlm@zzu.edu.cn
Abstract :
When a buying agent carries user’s requirements to look for a selling agent for negotiation, there may be some sellers to meet user’s requirement, but the buyer must choose a seller who can not only meet user’s requirements, but also make the buyer’s negotiation outcome gain maximal utility before negotiation. This paper solves problem of choosing seller before negotiation in order to improve accuracy of the multi-issue negotiation and buyer’s utility. In order to fully utilize negotiation history, this paper transforms the problem of choosing seller into K-armed bandit problem to solve. Several improved algorithms, which are used to learn reward distribution by off-line learning, and combine technologies for K-armed bandit problem and learning by neural network, are presented. Finally, combining the improved algorithms with trust vectors improves accuracy and practicability of choosing a seller. The experiment proves validity of the work in application.
Keywords :
Agent; K-armed bandit problem; negotiation; reputation; trust; utility; Cybernetics; History; Information technology; Machine learning; Network synthesis; Neural networks; Virtual manufacturing; Agent; K-armed bandit problem; negotiation; reputation; trust; utility;
Conference_Titel :
Machine Learning and Cybernetics, 2005. Proceedings of 2005 International Conference on
Conference_Location :
Guangzhou, China
Print_ISBN :
0-7803-9091-1
DOI :
10.1109/ICMLC.2005.1526936