Know Your Enemies and Know Yourself in the Real-Time Bidding Function Optimisation

DU, Manxing; Cowen-Rivers, Alexander I.; Wen, Ying; Sakulwongtana, Phu; Wang, Jun; BRORSSON, Mats Hakan; STATE, Radu

Download

Paper published in a book (Scientific congresses, symposiums and conference proceedings)

Know Your Enemies and Know Yourself in the Real-Time Bidding Function Optimisation

DU, Manxing; Cowen-Rivers, Alexander I.; Wen, Ying et al.

2019 • In Proceedings of the 19th IEEE International Conference on Data Mining Workshops (ICDMW 2019)

Peer reviewed

Permalink
https://hdl.handle.net/10993/41419

Files (1)Send to Details Statistics Bibliography Similar publications

Files

Full Text

conference_041818.pdf

Publisher postprint (592.2 kB)

Download

All documents in ORBilu are protected by a user license.

Send to

RIS BibTex APA Chicago Permalink X Linkedin

Details

Disciplines :

Computer science

Author, co-author :

DU, Manxing ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT)

Cowen-Rivers, Alexander I.; MediaGamma

Wen, Ying; University College London - UCL

Sakulwongtana, Phu; University College London - UCL

Wang, Jun; University College London - UCL

BRORSSON, Mats Hakan ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT)

STATE, Radu ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT)

External co-authors :

yes

Language :

English

Title :

Know Your Enemies and Know Yourself in the Real-Time Bidding Function Optimisation

Publication date :

2019

Event name :

19th IEEE International Conference on Data Mining Workshops (ICDMW 2019)

Event place :

Beijing, China

Event date :

from 8-11-2019 to 11-11-2019

Main work title :

Proceedings of the 19th IEEE International Conference on Data Mining Workshops (ICDMW 2019)

Peer reviewed :

Peer reviewed

FnR Project :

FNR11277622 - Self-learning Predictive Algorithms: From Design To Scalable Implementation, 2016 (01/03/2016-31/10/2019) - Manxing Du

Available on ORBilu :

since 28 December 2019

Statistics

Number of views

623 (15 by Unilu)

Number of downloads

408 (1 by Unilu)

More statistics

Scopus citations^®

Scopus citations^®
without self-citations

Bibliography

V. Krishna, Auction theory. Academic press, 2009.
Y. Yang, R. Luo, M. Li, M. Zhou, W. Zhang, and J. Wang, "Mean field multi-agent reinforcement learning, " arXiv preprint arXiv: 1802.05438, 2018.
R. Gummadi, P. Key, and A. Proutiere, "Repeated auctions under budget constraints: Optimal bidding strategies and equilibria, " in the Eighth Ad Auction Workshop, 2012.
K. Iyer, R. Johari, and M. Sundararajan, "Mean field equilibria of dynamic auctions with learning, " ACM SIGecom Exchanges, vol. 10, no. 3, 2011.
C. Perlich, B. Dalessandro, R. Hook, O. Stitelman, T. Raeder, and F. Provost, "Bid optimizing and inventory scoring in targeted online advertising, " in KDD. ACM, 2012.
W. Zhang, S. Yuan, and J. Wang, "Optimal real-time bidding for display advertising, " in KDD. ACM, 2014.
H. Cai, K. Ren, W. Zhang, K. Malialis, J. Wang, Y. Yu, and D. Guo, "Real-time bidding by reinforcement learning in display advertising, " in WSDM. ACM, 2017.
M. Du, R. Sassioui, G. Varisteas, M. Brorsson, O. Cherkaoui, and R. State, "Improving real-time bidding using a constrained markov decision process, " in ADMA. Springer, 2017.
D. Wu, X. Chen, X. Yang, H. Wang, Q. Tan, X. Zhang, J. Xu, and K. Gai, "Budget constrained bidding by model-free reinforcement learning in display advertising, " in Proceedings of the 27th ACM International Conference on Information and Knowledge Management. ACM, 2018, pp. 1443-1451.
J. Jin, C. Song, H. Li, K. Gai, J. Wang, and W. Zhang, "Realtime bidding with multi-agent reinforcement learning in display advertising, " arXiv preprint arXiv: 1802.09756, 2018.
J. Zhao, G. Qiu, Z. Guan, W. Zhao, and X. He, "Deep reinforcement learning for sponsored search real-time bidding, " arXiv preprint arXiv: 1803.00259, 2018.
P. Hernandez-Leal, M. Kaisers, T. Baarslag, and E. M. De Cote, "A survey of learning in multiagent environments: Dealing with non-stationarity, " arXiv preprint arXiv: 1707.09183, 2017.
M. L. Littman, "Markov games as a framework for multiagent reinforcement learning, " in Machine Learning Proceedings 1994. Elsevier, 1994.
S. P. Choi, D.-Y. Yeung, and N. L. Zhang, "An environment model for nonstationary reinforcement learning, " in Advances in neural information processing systems, 2000, pp. 987-993.
J. Hu and M. P. Wellman, "Nash q-learning for general-sum stochastic games, " J. Mach. Learn. Res., vol. 4, pp. 1039-1069, Dec. 2003. [Online]. Available: http://dl.acm.org/citation.cfm? id=945365.964288
K. Iyer, R. Johari, and M. Sundararajan, "Mean field equilibria of dynamic auctions with learning, " Management Science, vol. 60, no. 12, 2014.
K. Ren, J. Qin, L. Zheng, Z. Yang, W. Zhang, L. Qiu, and Y. Yu, "Deep recurrent survival analysis, " AAAI, 2019.
Y. Wang, K. Ren, W. Zhang, J. Wang, and Y. Yu, "Functional bid landscape forecasting for display advertising, " in Joint European Conference on Machine Learning and Knowledge Discovery in Databases. Springer, 2016.
W. C.-H. Wu, M.-Y. Yeh, and M.-S. Chen, "Predicting winning price in real time bidding with censored data, " in Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 2015, pp. 1305-1314.
R. G. Miller Jr, Survival analysis. John Wiley & Sons, 2011, vol. 66.
A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, L. Kaiser, and I. Polosukhin, "Attention is all you need, " in Advances in Neural Information Processing Systems, 2017.
A. Graves, A.-r. Mohamed, and G. Hinton, "Speech recognition with deep recurrent neural networks, " in Acoustics, speech and signal processing (icassp), 2013 ieee international conference on. IEEE, 2013.
M. E. Peters, M. Neumann, M. Iyyer, M. Gardner, C. Clark, K. Lee, and L. Zettlemoyer, "Deep contextualized word representations, " arXiv preprint arXiv: 1802.05365, 2018.
J. Devlin, M.-W. Chang, K. Lee, and K. Toutanova, "Bert: Pre-training of deep bidirectional transformers for language understanding, " arXiv preprint arXiv: 1810.04805, 2018.
A. Radford, K. Narasimhan, T. Salimans, and I. Sutskever, "Improving language understanding with unsupervised learning, " Technical report, OpenAI, Tech. Rep., 2018.
J. Hu and M. P. Wellman, "Nash q-learning for general-sum stochastic games, " Journal of machine learning research, vol. 4, no. Nov, pp. 1039-1069, 2003.
M. Huang, R. P. Malhamé, P. E. Caines et al., "Large population stochastic dynamic games: closed-loop mckean-vlasov systems and the nash certainty equivalence principle, " Communications in Information & Systems, vol. 6, no. 3, pp. 221-252, 2006.
T. P. Lillicrap, J. J. Hunt, A. Pritzel, N. Heess, T. Erez, Y. Tassa, D. Silver, and D. Wierstra, "Continuous control with deep reinforcement learning, " arXiv preprint arXiv: 1509.02971, 2015.
K. He, X. Zhang, S. Ren, and J. Sun, "Deep residual learning for image recognition, " in Proceedings of the IEEE conference on computer vision and pattern recognition, 2016.
J. L. Ba, J. R. Kiros, and G. E. Hinton, "Layer normalization, " arXiv preprint arXiv: 1607.06450, 2016.
W. A. Knaus, F. E. Harrell, J. Lynn, L. Goldman, R. S. Phillips, A. F. Connors, N. V. Dawson, W. J. Fulkerson, R. M. Califf, N. Desbiens et al., "The support prognostic model: objective estimates of survival for seriously ill hospitalized adults, " Annals of internal medicine, vol. 122, no. 3, pp. 191-203, 1995.
H. J. and A. J. S. , "Neural survival recommender, " in WSDM, 2017. [Online]. Available: http://doi.acm.org/10.1145/3018661. 3018719
W. Zhang, S. Yuan, J. Wang, and X. Shen, "Real-time bidding benchmarking with ipinyou dataset, " arXiv preprint arXiv: 1407.7073, 2014.
W. Zhang, S. Yuan, and J. Wang, "Real-time bidding benchmarking with ipinyou dataset, " CoRR, vol. abs/1407.7073, 2014. [Online]. Available: http://arxiv.org/abs/1407.7073
H. B. McMahan, G. Holt, D. Sculley, M. Young, D. Ebner, J. Grady, L. Nie, T. Phillips, E. Davydov, D. Golovin et al., "Ad click prediction: a view from the trenches, " in KDD. ACM, 2013.
S. Varrette, P. Bouvry, H. Cartiaux, and F. Georgatos, "Management of an academic hpc cluster: The ul experience, " in Proc. of the 2014 Intl. Conf. on High Performance Computing & Simulation (HPCS 2014). Bologna, Italy: IEEE, July 2014.