Article (Scientific journals)
Automated Demarcation of Requirements in Textual Specifications: A Machine Learning-Based Approach
Abualhaija, Sallam; Arora, Chetan; Sabetzadeh, Mehrdad et al.
2020In Empirical Software Engineering
Peer reviewed
 

Files


Full Text
Abualhaija20.pdf
Publisher postprint (4.3 MB)
Download

All documents in ORBilu are protected by a user license.

Send to



Details



Keywords :
Natural-language Requirements; Requirements Identification and Classification; Machine Learning; Natural language processing
Abstract :
[en] A simple but important task during the analysis of a textual requirements specification is to determine which statements in the specification represent requirements. In principle, by following suitable writing and markup conventions, one can provide an immediate and unequivocal demarcation of requirements at the time a specification is being developed. However, neither the presence nor a fully accurate enforcement of such conventions is guaranteed. The result is that, in many practical situations, analysts end up resorting to after-the-fact reviews for sifting requirements from other material in a requirements specification. This is both tedious and time-consuming. We propose an automated approach for demarcating requirements in free-form requirements specifications. The approach, which is based on machine learning, can be applied to a wide variety of specifications in different domains and with different writing styles. We train and evaluate our approach over an independently labeled dataset comprised of 33 industrial requirements specifications. Over this dataset, our approach yields an average precision of 81.2% and an average recall of 95.7%. Compared to simple baselines that demarcate requirements based on the presence of modal verbs and identifiers, our approach leads to an average gain of 16.4% in precision and 25.5% in recall. We collect and analyze expert feedback on the demarcations produced by our approach for industrial requirements specifications. The results indicate that experts find our approach useful and efficient in practice.We developed a prototype tool, named DemaRQ, in support of our approach. To facilitate replication, we make available to the research community this prototype tool alongside the non-proprietary portion of our training data.
Research center :
Interdisciplinary Centre for Security, Reliability and Trust (SnT) > SVV - Software Verification and Validation
Disciplines :
Computer science
Author, co-author :
Abualhaija, Sallam  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT)
Arora, Chetan
Sabetzadeh, Mehrdad ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT)
Briand, Lionel ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT)
Traynor, Michael
External co-authors :
yes
Language :
English
Title :
Automated Demarcation of Requirements in Textual Specifications: A Machine Learning-Based Approach
Publication date :
2020
Journal title :
Empirical Software Engineering
Peer reviewed :
Peer reviewed
Focus Area :
Security, Reliability and Trust
FnR Project :
FNR12632261 - Early Quality Assurance Of Critical Systems, 2018 (01/01/2019-31/12/2021) - Mehrdad Sabetzadeh
Funders :
QRA Corp
FNR - Luxembourg National Research Fund [LU]
H2020 European Research Council
CRSNG - Conseil de Recherches en Sciences naturelles et en Génie [CA]
Available on ORBilu :
since 29 June 2020

Statistics


Number of views
466 (67 by Unilu)
Number of downloads
291 (18 by Unilu)

Scopus citations®
 
30
Scopus citations®
without self-citations
26
OpenCitations
 
18
WoS citations
 
18

Bibliography


Similar publications



Contact ORBilu