Legal requirements; semantic legal metadata; natural language processing
Abstract :
[en] [Context] Semantic legal metadata provides information that helps with understanding and interpreting the meaning of legal provisions. Such metadata is important for the systematic analysis of legal requirements. [Objectives] Our work is motivated by two observations: (1) The existing requirements engineering (RE) literature does not provide a harmonized view on the semantic metadata types that are useful for legal requirements analysis. (2) Automated support for the extraction of semantic legal metadata is scarce, and further does not exploit the full potential of natural language processing (NLP). Our objective is to take steps toward addressing these limitations. [Methods] We review and reconcile the semantic legal metadata types proposed in RE. Subsequently, we conduct a qualitative study aimed at investigating how the identified metadata types can be extracted automatically. [Results and Conclusions] We propose (1) a harmonized conceptual model for the semantic metadata types pertinent to legal requirements analysis, and (2) automated extraction rules for these metadata types based on NLP. We evaluate the extraction rules through a case study. Our results indicate that the rules generate metadata annotations with high accuracy.
Research center :
Interdisciplinary Centre for Security, Reliability and Trust (SnT) > Software Verification and Validation Lab (SVV Lab)
Disciplines :
Computer science
Author, co-author :
SLEIMI, Amin ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT)
SANNIER, Nicolas ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT)
SABETZADEH, Mehrdad ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT)
BRIAND, Lionel ; University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT)
Dann, John; Ministry of State > Central Legislative Service
External co-authors :
no
Language :
English
Title :
Automated Extraction of Semantic Legal Metadata Using Natural Language Processing
Publication date :
August 2018
Event name :
the 26th IEEE International Requirements Engineering Conference
Event date :
from 20-08-2018 to 24-08-2018
Audience :
International
Main work title :
the 26th IEEE International Requirements Engineering Conference, Banff, Alberta, 20-24 August 2018
Pages :
124-135
Peer reviewed :
Peer reviewed
FnR Project :
FNR11554296 - Automated Retrieval Of Metadata From Legal Texts, 2016 (01/01/2017-31/12/2018) - Mehrdad Sabetzadeh
T. D. Breaux and A. I. Antón, "Analyzing regulatory rules for privacy and security requirements, " IEEE Transactions on Software Engineering, vol. 34, no. 1, pp. 5-20, 2008.
A. K. Massey, P. N. Otto, L. J. Hayward, and A. I. Antón, "Evaluating existing security and privacy requirements for legal compliance, " Requirements Engineering, vol. 15, no. 1, pp. 119-137, 2010.
N. Zeni, N. Kiyavitskaya, L. Mich, J. R. Cordy, and J. Mylopoulos, "GaiusT: supporting the extraction of rights and obligations for regulatory compliance, " Requirements Engineering, vol. 20, no. 1, pp. 1-22, 2015.
D. G. Gordon and T. D. Breaux, "Reconciling multi-jurisdictional legal requirements: A case study in requirements water marking, " in Proceedings of the 20th IEEE International Requirements Engineering Conference (RE'12), 2012, pp. 91-100.
N. Sannier, M. Adedjouma, M. Sabetzadeh, and L. C. Briand, "An automated framework for detection and resolution of cross references in legal texts, " Requirements Engineering, vol. 22, no. 2, pp. 215-237, 2017.
A. Massey, "Legal requirements metrics for compliance analysis, " Ph. D. dissertation, North Carolina State University, Raleigh, North Carolina, USA, 2012.
T. Breaux, "Legal requirements acquisition for the specification of legally compliant information systems, " Ph. D. dissertation, North Carolina State University, Raleigh, North Carolina, USA, 2009.
A. Siena, J. Mylopoulos, A. Perini, and A. Susi, "Designing lawcompliant software requirements, " in Proceedings of the 28th International Conference on Conceptual Modeling (ER'09), 2009, pp. 472-486.
J. Bhatia, T. D. Breaux, and F. Schaub, "Mining privacy goals from privacy policies using hybridized task recomposition, " ACM Transactions on Software Engineering and Methodology, vol. 25, no. 3, pp. 22: 1-22: 24, 2016.
J. C. Maxwell and A. I. Antón, "The production rule framework: developing a canonical set of software requirements for compliance with law, " in Proceedings of the ACM International Health Informatics Symposium (IHI'10), 2010, pp. 629-636.
N. Zeni, E. A. Seid, P. Engiel, S. Ingolfo, and J. Mylopoulos, "Building large models of law with NómosT, " in Proceedings of the 35th International Conference on Conceptual Modeling (ER'16), 2016, pp. 233-247.
N. Kiyavitskaya, N. Zeni, T. D. Breaux, A. I. Antón, J. R. Cordy, L. Mich, and J. Mylopoulos, "Automating the extraction of rights and obligations for regulatory compliance, " in Proceedings of the 27th International Conference on Conceptual Modeling (ER'08), 2008, pp. 154-168.
A. Siena, I. Jureta, S. Ingolfo, A. Susi, A. Perini, and J. Mylopoulos, "Capturing variability of law with nómos 2, " in Proceedings of the 31st International Conference on Conceptual Modeling (ER'12), 2012, pp. 383-396.
J. Hirschberg and C. D. Manning, "Advances in natural language processing, " Science, vol. 349, no. 6245, pp. 261-266, 2015.
J. F. Horty, Agency and Deontic Logic, ser. Oxford scholarship online. Oxford University Press, USA, 2001.
W. N. Hohfeld, "Fundamental legal conceptions as applied in judicial reasoning, " The Yale Law Journal, vol. 26, no. 8, pp. 710-770, 1917.
P. Giorgini, F. Massacci, J. Mylopoulos, and N. Zannone, "Modeling security requirements through ownership, permission and delegation, " in Proceedings of the 13th IEEE International Conference on Requirements Engineering (RE'05), 2005, pp. 167-176.
T. D. Breaux, M. W. Vail, and A. I. Antón, "Towards regulatory compliance: Extracting rights and obligations to align requirements with regulations, " in Proceedings of the 14th IEEE International Requirements Engineering Conference (RE'06), 2006, pp. 46-55.
N. Kiyavitskaya, N. Zeni, L. Mich, J. R. Cordy, and J. Mylopoulos, "Text mining through semi automatic semantic annotation, " in Proceedings of the 6th International Conference on Practical Aspects of Knowledge Management (PAKM'06), 2006, pp. 143-154.
S. Ingolfo, I. Jureta, A. Siena, A. Perini, and A. Susi, "Nòmos 3: Legal compliance of roles and requirements, " in Proceedings of the 33rd International Conference on Conceptual Modeling (ER'14), 2014, pp. 275-288.
S. Ghanavati, D. Amyot, and A. Rifaut, "Legal goal-oriented requirement language (legal GRL) for modeling regulations, " in Proceedings of the 6th International Workshop on Modeling in Software Engineering (MISE'14), 2014, pp. 1-6.
S. Ghanavati, "Legal-urn framework for legal compliance of business processes, " Ph. D. dissertation, University of Ottawa, Ottawa, Ontario, Canada, 2013.
G. Boella, L. D. Caro, L. Humphreys, L. Robaldo, P. Rossi, and L. van der Torre, "Eunomos, a legal document and knowledge management system for the web to provide relevant, reliable and up-to-date information on the law, " Artificial Intelligence and Law, vol. 24, no. 3, pp. 245-283, 2016.
W. Peters, M. Sagri, and D. Tiscornia, "The structuring of legal knowledge in LOIS, " Artificial Intelligence and Law, vol. 15, no. 2, pp. 117-135, 2007.
G. Sartor, P. Casanovas, M. Biasiotti, and M. Fernndez-Barrera, Approaches to Legal Ontologies: Theories, Domains, Methodologies. Springer, 2013.
R. Hoekstra, J. Breuker, M. D. Bello, and A. Boer, "The LKIF core ontology of basic legal concepts, " in Proceedings of the 2nd Workshop on Legal Ontologies and Artificial Intelligence Techniques (LOAIT'07), 2007, pp. 43-63.
J. Breuker, A. Boer, R. Hoekstra, and K. van den Berg, "Developing content for LKIF: ontologies and frameworks for legal reasoning, " in Proceedings of the 19th Annual Conference on Legal Knowledge and Information Systems (JURIX'06), 2006, pp. 169-174.
A. Boer, R. Winkels, and F. Vitali, "Proposed XML standards for law: Metalex and LKIF, " in Proceedings of the 20th Annual Conference on Legal Knowledge and Information Systems (JURIX'07), 2007, pp. 19-28.
T. Athan, H. Boley, G. Governatori, M. Palmirani, A. Paschke, and A. Z. Wyner, "OASIS LegalRuleML, " in Proceedings of the International Conference on Artificial Intelligence and Law (ICAIL'13), 2013, pp. 3-12.
H. Lam, M. Hashmi, and B. Scofield, "Enabling reasoning with Legal-RuleML, " in Proceedings of the 10th International Symposium on Rule Technologies. Research, Tools, and Applications (RuleML'16), 2016, pp. 241-257.
"Specification of RuleML 1. 02, " http://wiki. ruleml. org/index. php/Specification of RuleML 1. 02.
C. Arora, M. Sabetzadeh, L. C. Briand, and F. Zimmer, "Automated checking of conformance to requirements templates using natural language processing, " IEEE Transactions on Software Engineering, vol. 41, no. 10, pp. 944-968, 2015.
-, "Extracting domain models from natural-language requirements: Approach and industrial evaluation, " in Proceedings of the ACM/IEEE 19th International Conference on Model Driven Engineering Languages and Systems (MODELS'16), 2016, pp. 250-260.
G. Lucassen, M. Robeer, F. Dalpiaz, J. M. E. M. van der Werf, and S. Brinkkemper, "Extracting conceptual models from user stories with visual narrator, " Requirements Engineering, vol. 22, no. 3, pp. 339-358, 2017.
T. Quirchmayr, B. Paech, R. Kohl, H. Karey, and G. Kasdepke, "Semiautomatic rule-based domain terminology and software feature-relevant information extraction from natural language user manuals, " Empirical Software Engineering, 2018.
Y. Elrakaiby, A. Ferrari, P. Spoletini, S. Gnesi, and B. Nuseibeh, "Using argumentation to explain ambiguity in requirements elicitation interviews, " in Proceedings of the 25th IEEE International Requirements Engineering Conference (RE'17), 2017, pp. 51-60.
B. Rosadini, A. Ferrari, G. Gori, A. Fantechi, S. Gnesi, I. Trotta, and S. Bacherini, "Using NLP to detect requirements defects: An industrial experience in the railway domain, " in Proceedings of the 23rd International Working Conference on Requirements Engineering: Foundation for Software Quality (REFSQ'17), 2017, pp. 344-360.
J. Bhatia, M. C. Evans, S. Wadkar, and T. D. Breaux, "Automated extraction of regulated information types using hyponymy relations, " in Proceedings of the 3rd International Workshop on Artificial Intelligence for Requirements Engineering (AIRE'16), 2016, pp. 19-25.
M. C. Evans, J. Bhatia, S. Wadkar, and T. D. Breaux, "An evaluation of constituency-based hyponymy extraction from privacy policies, " in Proceedings of the 25th IEEE International Requirements Engineering Conference (RE'17), 2017, pp. 312-321.
F. DellOrletta, S. Marchi, S. Montemagni, B. Plank, and G. Venturi, "The splet2012 shared task on dependency parsing of legal texts, " in the 4th Workshop on Semantic Processing of Legal Texts (SPLeT'12), 2012, pp. 42-51.
J. Saldaña, The Coding Manual for Qualitative Researchers. Sage, 2015.
J. Cohen, "A coefficient of agreement for nominal scales, " Educational and Psychological Measurement, vol. 20, no. 1, 1960.
J. Landis and G. G. Koch, "The measurement of observer agreement for categorical data, " Biometrics, vol. 33, no. 1, pp. 159-174, 1977.
J. C. Maxwell, A. I. Antón, P. P. Swire, M. Riaz, and C. M. McCraw, "A legal cross-references taxonomy for reasoning about compliance requirements, " Requirements Engineering, vol. 17, no. 2, pp. 99-115, 2012.
N. Sannier, M. Adedjouma, M. Sabetzadeh, and L. C. Briand, "Automated classification of legal cross references based on semantic intent, " in Proceedings of the 22nd International Working Conference on Requirements Engineering: Foundation for Software Quality (REFSQ'16), 2016, pp. 119-134.
R. Levy and G. Andrew, "Tregex and tsurgeon: Tools for querying and manipulating tree data structures, " in Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC'06), 2006, pp. 2231-2234.
B. Sagot, "The Lefff, a Freely Available and Large-coverage Morphological and Syntactic Lexicon for French, " in Proceedings of the International Conference on Language Resources and Evaluation (LREC'10), 2010, pp. 2745-2751.
S. Petrov, L. Barrett, R. Thibaux, and D. Klein, "Learning accurate, compact, and interpretable tree annotation, " in Proceedings of the 21st International Conference on Computational Linguistics and 44th Annual Meeting of the Association for Computational Linguistics (ACL'06), 2006.
J. Nivre, J. Hall, J. Nilsson, A. Chanev, G. Eryigit, S. Kübler, S. Marinov, and E. Marsi, "Maltparser: A language-independent system for datadriven dependency parsing, " Natural Language Engineering, vol. 13, no. 2, pp. 95-135, 2007.
R. T. McDonald and J. Nivre, "Characterizing the errors of datadriven dependency parsing models, " in Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL'07), 2007, pp. 122-131.
J. K. Kummerfeld, D. L. W. Hall, J. R. Curran, and D. Klein, "Parser showdown at the wall street corral: An empirical investigation of error types in parser output, " in Proceedings of the Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL'12), 2012, pp. 1048-1059.