Article (Périodiques scientifiques)
Can Tree-Based Model Improve Performance Prediction for LLMs?
PANNER SELVAM, Karthick; BRORSSON, Mats Håkan
2024In ARC-LG workshop at 51st International Symposium on Computer Architecture
Peer reviewed
 

Documents


Texte intégral
Performance_Transformer_ISCA (1).pdf
Postprint Auteur (855.65 kB) Licence Creative Commons - Attribution, Pas d'Utilisation Commerciale
Télécharger

Tous les documents dans ORBilu sont protégés par une licence d'utilisation.

Envoyer vers



Détails



Mots-clés :
Large Language Models, Graph Neural Network, Performance Model
Résumé :
[en] The deployment of Large Language Models (LLMs) in cloud environments underscores the imperative for optimal hardware configurations to enhance efficiency and reduce environmental impact. Prior research on performance prediction models predominantly focuses on computer vision, leaving a void for models adept at the unique demands of LLMs. This study bridges that gap, evaluating the potential of Tree-based models, particularly XGBoost, against traditional Graph Neural Networks (GNNs) in predicting the performance of LLMs. Our analysis shows that XGBoost achieves significant improvements in predicting throughput, memory usage, and energy consumption showcasing relative enhancements of MAPE approximately 68.81%, 80.85%, and 88.21%, respectively, compared to the GNN baseline, with a remarkable speed enhancement of approximately 26761.39% over GNN. These findings underscore XGBoost's effectiveness in accurately forecasting LLM performance metrics, offering a promising avenue for hardware configuration optimization in LLM deployment.
Centre de recherche :
Interdisciplinary Centre for Security, Reliability and Trust (SnT) > SEDAN - Service and Data Management in Distributed Systems
Disciplines :
Sciences informatiques
Auteur, co-auteur :
PANNER SELVAM, Karthick ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SEDAN
BRORSSON, Mats Håkan  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SEDAN
Co-auteurs externes :
no
Langue du document :
Anglais
Titre :
Can Tree-Based Model Improve Performance Prediction for LLMs?
Date de publication/diffusion :
juin 2024
Titre du périodique :
ARC-LG workshop at 51st International Symposium on Computer Architecture
Peer reviewed :
Peer reviewed
URL complémentaire :
Projet européen :
H2020 - 955513 - MAELSTROM - MAchinE Learning for Scalable meTeoROlogy and cliMate
Intitulé du projet de recherche :
U-AGR-8013 - INTER/EuroHPC/20/15077233/MAELSTROM - BRORSSON Mats Hakan
Organisme subsidiant :
Union Européenne
Disponible sur ORBilu :
depuis le 17 octobre 2024

Statistiques


Nombre de vues
213 (dont 28 Unilu)
Nombre de téléchargements
142 (dont 11 Unilu)

Bibliographie


Publications similaires



Contacter ORBilu