Paper published in a book (Scientific congresses, symposiums and conference proceedings)
Using Domain-specific Corpora for Improved Handling of Ambiguity in Requirements
EZZINI, Saad; ABUALHAIJA, Sallam; Arora, Chetan et al.
2021In Proceedings of the 43rd International Conference on Software Engineering (ICSE'21), Madrid 25-28 May 2021
Peer reviewed
 

Files


Full Text
ICSE21-EAASB.pdf
Author postprint (617.01 kB)
Download

All documents in ORBilu are protected by a user license.

Send to



Details



Keywords :
Requirements Engineering; Natural-language Requirements; Ambiguity; Natural Language Processing; Corpus Generation; Wikipedia
Abstract :
[en] Ambiguity in natural-language requirements is a pervasive issue that has been studied by the requirements engineering community for more than two decades. A fully manual approach for addressing ambiguity in requirements is tedious and time-consuming, and may further overlook unacknowledged ambiguity – the situation where different stakeholders perceive a requirement as unambiguous but, in reality, interpret the requirement differently. In this paper, we propose an automated approach that uses natural language processing for handling ambiguity in requirements. Our approach is based on the automatic generation of a domain-specific corpus from Wikipedia. Integrating domain knowledge, as we show in our evaluation, leads to a significant positive improvement in the accuracy of ambiguity detection and interpretation. We scope our work to coordination ambiguity (CA) and prepositional-phrase attachment ambiguity (PAA) because of the prevalence of these types of ambiguity in natural-language requirements [1]. We evaluate our approach on 20 industrial requirements documents. These documents collectively contain more than 5000 requirements from seven distinct application domains. Over this dataset, our approach detects CA and PAA with an average precision of 80% and an average recall of 89% (90% for cases of unacknowledged ambiguity). The automatic interpretations that our approach yields have an average accuracy of 85%. Compared to baselines that use generic corpora, our approach, which uses domain-specific corpora, has 33% better accuracy in ambiguity detection and 16% better accuracy in interpretation.
Research center :
Interdisciplinary Centre for Security, Reliability and Trust (SnT) > SVV - Software Verification and Validation
Disciplines :
Computer science
Author, co-author :
EZZINI, Saad ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SVV
ABUALHAIJA, Sallam  ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SVV
Arora, Chetan;  Deakin University
SABETZADEH, Mehrdad ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SVV
BRIAND, Lionel ;  University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SVV
External co-authors :
yes
Language :
English
Title :
Using Domain-specific Corpora for Improved Handling of Ambiguity in Requirements
Publication date :
May 2021
Event name :
43rd International Conference on Software Engineering
Event date :
from 25-05-2021 to 28-05-2021
Audience :
International
Main work title :
Proceedings of the 43rd International Conference on Software Engineering (ICSE'21), Madrid 25-28 May 2021
Publisher :
IEEE
Peer reviewed :
Peer reviewed
FnR Project :
FNR12632261 - Early Quality Assurance Of Critical Systems, 2018 (01/01/2019-31/12/2021) - Mehrdad Sabetzadeh
Funders :
FNR - Luxembourg National Research Fund [LU]
Available on ORBilu :
since 12 February 2021

Statistics


Number of views
378 (57 by Unilu)
Number of downloads
743 (47 by Unilu)

Scopus citations®
 
48
Scopus citations®
without self-citations
38
OpenCitations
 
14
WoS citations
 
30

Bibliography


Similar publications



Contact ORBilu