Paper published on a website (Scientific congresses, symposiums and conference proceedings)
ExpertCache: GPU-Efficient MoE Inference through Reinforcement Learning-Guided Expert Selection
TANG, Xunzhu; SUN, Tiezhu; SONG, Yewei et al.
2025The 41st International Conference on Software Maintenance and Evolution, New Ideas and Emerging Results Track
Peer reviewed
 

Files


Full Text
ICSME25-ExpertCache.pdf
Author postprint (265.61 kB)
Download

All documents in ORBilu are protected by a user license.

Send to



Details



Disciplines :
Computer science
Author, co-author :
TANG, Xunzhu  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > TruX
SUN, Tiezhu  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > TruX
SONG, Yewei  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > TruX
Siyuan Ma;  Nanyang Technological University
KLEIN, Jacques  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > TruX
BISSYANDE, Tegawendé François d Assise  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > TruX
External co-authors :
yes
Language :
English
Title :
ExpertCache: GPU-Efficient MoE Inference through Reinforcement Learning-Guided Expert Selection
Publication date :
07 September 2025
Event name :
The 41st International Conference on Software Maintenance and Evolution, New Ideas and Emerging Results Track
Event date :
7-12, September, 2025
By request :
Yes
Peer reviewed :
Peer reviewed
Name of the research project :
R-AGR-3885 - H2020-ERC-NATURAL - BISSYANDE Tegawendé
Funding text :
This work is supported by the NATURAL project, which has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant No. 949014). The author Jiechao Gao is partially sponsored by funding from Yonghua Foundation.
Available on ORBilu :
since 09 September 2025

Statistics


Number of views
53 (1 by Unilu)
Number of downloads
342 (6 by Unilu)

Bibliography


Similar publications



Contact ORBilu