Paper published on a website (Scientific congresses, symposiums and conference proceedings)
Small Language Models in the Real World: Insights from Industrial Text Classification
LI, Lujun; SLEEM, Lama; GENTILE, Niccolo et al.
2025The 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025)
Peer reviewed
 

Files


Full Text
2025.acl-industry.68.pdf
Author postprint (270.05 kB)
Download

All documents in ORBilu are protected by a user license.

Send to



Details



Abstract :
[en] With the emergence of ChatGPT, Transformer models have significantly advanced text classification and related tasks. Decoder-only models such as Llama exhibit strong performance and flexibility, yet they suffer from inefficiency on inference due to token-by-token generation, and their effectiveness in text classification tasks heavily depends on prompt quality. Moreover, their substantial GPU resource requirements often limit widespread adoption. Thus, the question of whether smaller language models are capable of effectively handling text classification tasks emerges as a topic of significant interest. However, the selection of appropriate models and methodologies remains largely underexplored. In this paper, we conduct a comprehensive evaluation of prompt engineering and supervised fine-tuning methods for transformer-based text classification. Specifically, we focus on practical industrial scenarios, including email classification, legal document categorization, and the classification of extremely long academic texts. We examine the strengths and limitations of smaller models, with particular attention to both their performance and their efficiency in Video Random-Access Memory (VRAM) utilization, thereby providing valuable insights for the local deployment and application of compact models in industrial settings.
Disciplines :
Computer science
Author, co-author :
LI, Lujun  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SEDAN
SLEEM, Lama  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SEDAN
GENTILE, Niccolo ;  University of Luxembourg > Faculty of Humanities, Education and Social Sciences > Department of Behavioural and Cognitive Sciences > Team Conchita D AMBROSIO
Nichil, Geoffrey
STATE, Radu  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SEDAN
External co-authors :
yes
Language :
English
Title :
Small Language Models in the Real World: Insights from Industrial Text Classification
Publication date :
01 August 2025
Event name :
The 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025)
Event date :
From July 27 to August 1st, 2025
Audience :
International
Peer reviewed :
Peer reviewed
Available on ORBilu :
since 10 September 2025

Statistics


Number of views
51 (4 by Unilu)
Number of downloads
24 (1 by Unilu)

OpenCitations
 
0
OpenAlex citations
 
0

Bibliography


Similar publications



Contact ORBilu