Keywords: Large Language Models, Graph Neural Networks, Performance Model
Abstract:
[en] The deployment of Large Language Models (LLMs) in cloud environments underscores the need for optimal hardware configurations to improve efficiency and reduce environmental impact. Prior research on performance prediction models focuses predominantly on computer vision, leaving a gap for models suited to the unique demands of LLMs. This study bridges that gap by evaluating the potential of tree-based models, particularly XGBoost, against traditional Graph Neural Networks (GNNs) for predicting LLM performance. Our analysis shows that XGBoost achieves significant improvements in predicting throughput, memory usage, and energy consumption, with relative MAPE improvements of approximately 68.81%, 80.85%, and 88.21%, respectively, over the GNN baseline, and a speed-up of approximately 26761.39% over the GNN. These findings underscore XGBoost's effectiveness in accurately forecasting LLM performance metrics, offering a promising avenue for hardware configuration optimization in LLM deployment.
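The abstract reports relative MAPE improvements over a GNN baseline. As a minimal sketch of how such a figure is computed (with entirely hypothetical prediction values, not the paper's data):

```python
# Sketch: computing a relative MAPE improvement between two models.
# All prediction values below are hypothetical, for illustration only.

def mape(y_true, y_pred):
    """Mean Absolute Percentage Error, in percent."""
    return 100.0 * sum(abs((t - p) / t) for t, p in zip(y_true, y_pred)) / len(y_true)

def relative_improvement(baseline_err, candidate_err):
    """Relative reduction of an error metric, in percent."""
    return 100.0 * (baseline_err - candidate_err) / baseline_err

# Hypothetical throughput ground truth (tokens/s) and model predictions.
y_true = [120.0, 95.0, 210.0, 150.0]
gnn_pred = [100.0, 120.0, 160.0, 180.0]   # baseline model (assumed values)
xgb_pred = [118.0, 97.0, 205.0, 148.0]    # tree-based model (assumed values)

gnn_mape = mape(y_true, gnn_pred)
xgb_mape = mape(y_true, xgb_pred)
print(f"relative MAPE improvement: {relative_improvement(gnn_mape, xgb_mape):.2f}%")
```

A figure such as "68.81% relative improvement" thus means the candidate's MAPE is roughly a third of the baseline's, not that the MAPE itself dropped by 68.81 percentage points.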
Research center:
Interdisciplinary Centre for Security, Reliability and Trust (SnT) > SEDAN - Service and Data Management in Distributed Systems
Disciplines:
Computer science
Author, co-author:
PANNER SELVAM, Karthick ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SEDAN
BRORSSON, Mats Håkan ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SEDAN
External co-authors:
no
Document language:
English
Title:
Can Tree-Based Model Improve Performance Prediction for LLMs?
Publication/release date:
June 2024
Periodical title:
ARC-LG workshop at 51st International Symposium on Computer Architecture