Reference : A Machine Learning Approach for Automated Filling of Categorical Fields in Data Entry...
Scientific journals : Article
Engineering, computing & technology : Computer science
http://hdl.handle.net/10993/50909
A Machine Learning Approach for Automated Filling of Categorical Fields in Data Entry Forms
English
Belgacem, Hichem mailto [University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SVV >]
Li, Xiaochen mailto [Dalian University of Technology > > > ; University of Luxembourg > Interdisciplinary Center for Security, Reliability and Trust]
Bianculli, Domenico mailto [University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SVV >]
Briand, Lionel mailto [University of Luxembourg > Interdisciplinary Centre for Security, Reliability and Trust (SNT) > SVV >]
In press
ACM Transactions on Software Engineering and Methodology
Association for Computing Machinery (ACM)
Yes
1049-331X
United States
[en] Form filling ; Data entry forms ; Machine Learning ; Software data quality ; User interfaces
[en] Users frequently interact with software systems through data entry forms. However, form filling is time-consuming and error-prone. Although several techniques have been proposed to auto-complete or pre-fill
fields in the forms, they provide limited support to help users fill categorical fields, i.e., fields that require
users to choose the right value among a large set of options.

In this paper, we propose LAFF, a learning-based automated approach for filling categorical fields in data
entry forms. LAFF first builds Bayesian Network models by learning field dependencies from a set of historical
input instances, representing the values of the fields that have been filled in the past. To improve its learning
ability, LAFF uses local modeling to effectively mine the local dependencies of fields in a cluster of input
instances. During the form filling phase, LAFF uses such models to predict possible values of a target field,
based on the values in the already-filled fields of the form and their dependencies; the predicted values
(endorsed based on field dependencies and prediction confidence) are then provided to the end-user as a list of
suggestions.
We evaluated LAFF by assessing its effectiveness and efficiency in form filling on two datasets, one of
them proprietary from the banking domain. Experimental results show that LAFF is able to provide accurate
suggestions with a Mean Reciprocal Rank value above 0.73. Furthermore, LAFF is efficient, requiring at most
317 ms per suggestion.
http://hdl.handle.net/10993/50909
10.1145/3533021

File(s) associated to this reference

Fulltext file(s):

FileCommentaryVersionSizeAccess
Open access
all-in-newMain.pdfAuthor postprint846.62 kBView/Open

Bookmark and Share SFX Query

All documents in ORBilu are protected by a user license.