[en] Neural network (NN) components are being increasingly incorporated into software systems. Neural network properties are determined by their architecture, as well as the training and testing datasets used. The engineering of datasets and neural networks is a challenging task that requires methods and tools to satisfy customers’ expectations. The lack of tools that support requirements specification languages makes it difficult for engineers to describe dataset and neural network recognition skill requirements. Existing approaches often rely on traditional ad hoc approaches, without precise requirement specifications for data selection criteria, to build these datasets. Moreover, these approaches do not focus on the requirements of the neural network’s expected recognition skills. We aim to overcome this issue by defining a domain-specific language that precisely specifies dataset requirements and expected recognition skills after training for an NN-based system. In this paper, we present a textual domain-specific language (DSL) called SEMKIS-DSL (Software Engineering Methodology for the Knowledge management of Intelligent Systems) that is designed to support software engineers in specifying the requirements and recognition skills of neural networks. This DSL is proposed in the context of our general SEMKIS development process for neural network engineering. We illustrate the DSL’s concepts using a running example that focuses on the recognition of handwritten digits. We show some requirements and recognition skills specifications and demonstrate how our DSL improves neural network recognition skills.
Disciplines :
Sciences informatiques
Auteur, co-auteur :
JAHIC, Benjamin ; University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Computer Science (DCS)
GUELFI, Nicolas ; University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Computer Science (DCS)
RIES, Benoit ; University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Computer Science (DCS)
Co-auteurs externes :
no
Langue du document :
Anglais
Titre :
SEMKIS-DSL: A Domain-Specific Language to Support Requirements Engineering of Datasets and Neural Network Recognition
Date de publication/diffusion :
01 avril 2023
Titre du périodique :
Information
eISSN :
2078-2489
Maison d'édition :
Multidisciplinary Digital Publishing Institute (MDPI), Suisse
Heyn H.M. Knauss E. Muhammad A.P. Eriksson O. Linder J. Subbiah P. Pradhan S.K. Tungal S. Requirement engineering challenges for ai-intense systems development Proceedings of the 2021 IEEE/ACM 1st Workshop on AI Engineering-Software Engineering for AI (WAIN) Online 30–31 May 2021 89 96
Xiao T. Xia T. Yang Y. Huang C. Wang X. Learning from massive noisy labeled data for image classification Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Boston, MA, USA 7–12 June 2015 2691 2699
Ros G. Sellart L. Materzynska J. Vazquez D. Lopez A.M. The synthia dataset: A large collection of synthetic images for semantic segmentation of urban scenes Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Las Vegas, NV, USA 27–30 June 2016 3234 3243
You Q. Luo J. Jin H. Yang J. Building a large scale dataset for image emotion recognition: The fine print and the benchmark Proceedings of the AAAI Conference on Artificial Intelligence Phoenix, AZ, USA 12–17 February 2016 Volume 30
Jahic B. SEMKIS: A Contribution to Software Engineering Methodologies for Neural Network Development Ph.D. Thesis University of Luxembourg Esch/Alzette, Luxembourg 2022
Jahić B. Guelfi N. Ries B. Software engineering for dataset augmentation using generative adversarial networks Proceedings of the 2019 IEEE 10th International Conference on Software Engineering and Service Science (ICSESS) Beijing, China 18–20 October 2019 59 66
Jahić B. Guelfi N. Ries B. Specifying key-properties to improve the recognition skills of neural networks Proceedings of the 2020 European Symposium on Software Engineering Online 6–8 November 2020 60 71
Fondement F. Silaghi R. Defining model driven engineering processes Proceedings of the Third International Workshop in Software Model Engineering (WiSME) Lisbon, Portugal 11–15 October 2004
Kosar T. Martı P.E. Barrientos P.A. Mernik M. A preliminary study on various implementation approaches of domain-specific language Inf. Softw. Technol. 2008 50 390 405 10.1016/j.infsof.2007.04.002
LeCun Y. Cortes C. Burges C.J.C. The MNIST Database of Handwritten Digits 2022 Available online: http://yann.lecun.com/exdb/mnist/ (accessed on 28 March 2023)
Bettini L. Implementing Domain-Specific Languages with Xtext and Xtend Packt Publishing Ltd. Birmingham, UK 2016
Jahič B. SEMKIS-DSL Complete Grammar (v2.0) 2022 Available online: https://github.com/Benji91/lu.uni.lassy.phdthesis.semkis.toolkit.experimentations/blob/main/eclipse-workspace-semkis-dsl/lu.uni.lassy.phd.dsl.semkis.parent/lu.uni.lassy.phd.dsl.semkis/src/lu/uni/lassy/phd/dsl/semkis/Semkis.xtext (accessed on 28 March 2023)
Jahič B. SEMKIS-DSL (v2.0) 2022 Available online: https://github.com/Benji91/lu.uni.lassy.phdthesis.semkis.toolkit.experimentations (accessed on 28 March 2023)
Sommerville I. Engineering Software Products Pearson London, UK 2020 Volume 355
Object Management Group B Unified Modeling Language 2.5.1 Object Management Group Milford, MA, USA 2017
Rahimi M. Guo J.L. Kokaly S. Chechik M. Toward requirements specification for machine-learned components Proceedings of the 2019 IEEE 27th International Requirements Engineering Conference Workshops (REW) Jeju Island, Republic of Korea 23–27 September 2019 241 244
Villamizar H. Escovedo T. Kalinowski M. Requirements engineering for machine learning: A systematic mapping study Proceedings of the 2021 47th Euromicro Conference on Software Engineering and Advanced Applications (SEAA) Online 1–3 September 2021 29 36
Habibullah K.M. Horkoff J. Non-functional requirements for machine learning: Understanding current use and challenges in industry Proceedings of the 2021 IEEE 29th International Requirements Engineering Conference (RE) Notre Dame, IN, USA 20–24 September 2021 13 23
Vogelsang A. Borg M. Requirements engineering for machine learning: Perspectives from data scientists Proceedings of the 2019 IEEE 27th International Requirements Engineering Conference Workshops (REW) Jeju Island, Republic of Korea 23–27 September 2019 245 251
Hu B.C. Salay R. Czarnecki K. Rahimi M. Selim G. Chechik M. Towards requirements specification for machine-learned perception based on human performance Proceedings of the 2020 IEEE Seventh International Workshop on Artificial Intelligence for Requirements Engineering (AIRE) Zurich, Switzerland 1 September 2020 48 51
Zhao T. Huang X. Design and implementation of DeepDSL: A DSL for deep learning Comput. Lang. Syst. Struct. 2018 54 39 70 10.1016/j.cl.2018.04.004
Elango V. Rubin N. Ravishankar M. Sandanagobalane H. Grover V. Diesel: DSL for linear algebra and neural net computations on GPUs Proceedings of the 2nd ACM SIGPLAN International Workshop on Machine Learning and Programming Languages Philadelphia, PA, USA 18 June 2018 42 51
Podobas A. Svedin M. Chien S.W. Peng I.B. Ravichandran N.B. Herman P. Lansner A. Markidis S. StreamBrain: An HPC DSL for Brain-like Neural Networks on Heterogeneous Systems Proceedings of the 11th International Symposium on Highly Efficient Accelerators and Reconfigurable Technologies Online 21–23 June 2021
Cueva-Lovelle J.M. García-Díaz V. Pelayo G. Bustelo C. Pascual-Espada J. Towards a standard-based domain-specific platform to solve machine learning-based problems Int. J. Interact. Multimed. Artif. Intell. 2015 3 6 12
Agrawal A. Modi A. Passos A. Lavoie A. Agarwal A. Shankar A. Ganichev I. Levenberg J. Hong M. Monga R. et al. TensorFlow Eager: A multi-stage, Python-embedded DSL for machine learning Proc. Mach. Learn. Syst. 2019 1 178 189
Sujeeth A. Lee H. Brown K. Rompf T. Chafi H. Wu M. Atreya A. Odersky M. Olukotun K. OptiML: An implicitly parallel domain-specific language for machine learning Proceedings of the 28th International Conference on Machine Learning (ICML-11) Bellevue, WA, USA 28 June–2 July 2011 609 616
Ries B. Guelfi N. Jahic B. An mde method for improving deep learning dataset requirements engineering using alloy and uml Proceedings of the 9th International Conference on Model-Driven Engineering and Software Development SCITEPRESS, Online 8–10 February 2021 41 52
Dosovitskiy A. Ros G. Codevilla F. Lopez A. Koltun V. CARLA: An open urban driving simulator Proceedings of the Conference on Robot Learning Mountain View, California, USA 13-15 November 2017 1 16
Pei K. Cao Y. Yang J. Jana S. Deepxplore: Automated whitebox testing of deep learning systems Proceedings of the 26th Symposium on Operating Systems Principles Shanghai, China 28–31 October 2017 1 18
Fremont D.J. Dreossi T. Ghosh S. Yue X. Sangiovanni-Vincentelli A.L. Seshia S.A. Scenic: A language for scenario specification and scene generation Proceedings of the 40th ACM SIGPLAN Conference on Programming Language Design and Implementation Phoenix, AZ, USA 22–26 June 2019 63 78
Guelfi N. A formal framework for dependability and resilience from a software engineering perspective Open Comput. Sci. 2011 1 294 328 10.2478/s13537-011-0025-x