Paper published in a book (Scientific congresses, symposiums and conference proceedings)
WikiDoMiner: Wikipedia Domain-Specific Miner
Ezzini, Saad; Abualhaija, Sallam; Sabetzadeh, Mehrdad
2022, in ACM SIGSOFT CONFERENCE ON THE FOUNDATIONS OF SOFTWARE ENGINEERING
Peer reviewed
 

Files


Full Text
WikiDoMiner.pdf
Publisher postprint (807.64 kB)
Details



Keywords :
Requirements Engineering; Natural-language Requirements; Natural Language Processing; Domain-specific Corpus Generation; Wikipedia
Abstract :
We introduce WikiDoMiner -- a tool for automatically generating domain-specific corpora by crawling Wikipedia. WikiDoMiner helps requirements engineers create an external knowledge resource that is specific to the underlying domain of a given requirements specification (RS). Being able to build such a resource is important since domain-specific datasets are scarce. WikiDoMiner generates a corpus by first extracting a set of domain-specific keywords from a given RS, and then querying Wikipedia for these keywords. The output of WikiDoMiner is a set of Wikipedia articles relevant to the domain of the input RS. Mining Wikipedia for domain-specific knowledge can be beneficial for multiple requirements engineering tasks, e.g., ambiguity handling, requirements classification, and question answering. WikiDoMiner is publicly available on Zenodo under an open-source license (https://doi.org/10.5281/zenodo.6672682).
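The two-step flow described in the abstract (extract keywords from an RS, then query Wikipedia for them) might be sketched roughly as follows. This is an illustrative sketch only, not WikiDoMiner's actual implementation: the plain frequency count is a stand-in for whatever keyword extraction the tool uses, and the function names, the stopword list, and the User-Agent string are all hypothetical. The only external interface assumed is the public MediaWiki Action API search endpoint (`action=query`, `list=search`).

```python
import json
import re
import urllib.parse
import urllib.request
from collections import Counter

# Hypothetical, deliberately tiny stopword list; a real pipeline would use a fuller one.
STOPWORDS = {"the", "a", "an", "of", "to", "and", "shall", "be", "in",
             "is", "for", "with", "that", "by", "on", "each"}

WIKIPEDIA_API = "https://en.wikipedia.org/w/api.php"


def extract_keywords(requirements_text, top_n=5):
    """Return the top_n most frequent non-stopword tokens.

    Assumption: raw term frequency stands in for the tool's keyword extraction.
    """
    tokens = re.findall(r"[a-z]+", requirements_text.lower())
    counts = Counter(t for t in tokens if t not in STOPWORDS and len(t) > 2)
    return [word for word, _ in counts.most_common(top_n)]


def build_search_url(keyword, limit=3):
    """Build a MediaWiki Action API full-text search URL for one keyword."""
    params = {
        "action": "query",
        "list": "search",
        "srsearch": keyword,
        "srlimit": limit,
        "format": "json",
    }
    return WIKIPEDIA_API + "?" + urllib.parse.urlencode(params)


def search_titles(keyword, limit=3):
    """Query Wikipedia (network access required) and return matching article titles."""
    request = urllib.request.Request(
        build_search_url(keyword, limit),
        headers={"User-Agent": "corpus-sketch/0.1"},  # hypothetical client name
    )
    with urllib.request.urlopen(request) as response:
        data = json.load(response)
    return [hit["title"] for hit in data["query"]["search"]]


def collect_article_titles(requirements_text, top_n=5):
    """Map each extracted keyword to the Wikipedia article titles found for it."""
    return {kw: search_titles(kw) for kw in extract_keywords(requirements_text, top_n)}
```

Calling `collect_article_titles` on the text of a requirements specification would return candidate article titles per keyword; fetching the article bodies and filtering them for domain relevance are left out of this sketch.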
Research center :
Interdisciplinary Centre for Security, Reliability and Trust (SnT) > SVV - Software Verification and Validation
Disciplines :
Computer science
Author, co-author :
Ezzini, Saad; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SnT) > SVV
Abualhaija, Sallam; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SnT) > SVV
Sabetzadeh, Mehrdad
External co-authors :
yes
Language :
English
Title :
WikiDoMiner: Wikipedia Domain-Specific Miner
Publication date :
2022
Event name :
ACM SIGSOFT CONFERENCE ON THE FOUNDATIONS OF SOFTWARE ENGINEERING
Event date :
from 14-11-2022 to 18-11-2022
Main work title :
ACM SIGSOFT CONFERENCE ON THE FOUNDATIONS OF SOFTWARE ENGINEERING
Publisher :
Association for Computing Machinery
Peer reviewed :
Peer reviewed
FNR Project :
FNR12632261 - Early Quality Assurance Of Critical Systems, 2018 (01/01/2019-31/12/2021) - Mehrdad Sabetzadeh
Funders :
FNR - Fonds National de la Recherche [LU]
Available on ORBilu :
since 07 November 2022

Statistics


Number of views
77 (15 by Unilu)
Number of downloads
39 (2 by Unilu)

Scopus citations®
4
Scopus citations® without self-citations
1
OpenCitations
3
