Guided Inductive Logic Programming: Cleaning Knowledge Bases with Iterative User Feedback

Wu, Yan; Chen, Jinchuan; Haxhidauti, Plarent; Ellampallil Venugopal, Vinu; Theobald, Martin

Download

Paper published in a book (Scientific congresses, symposiums and conference proceedings)

Guided Inductive Logic Programming: Cleaning Knowledge Bases with Iterative User Feedback

Wu, Yan; Chen, Jinchuan; Haxhidauti, Plarent et al.

2020 • In Guided Inductive Logic Programming: Cleaning Knowledge Bases with Iterative User Feedback

Peer reviewed

Permalink
https://hdl.handle.net/10993/45625

Files (1)Send to Details Statistics Bibliography Similar publications

Files

Full Text

Guided_Inductive_Logic_Programming_Cleaning_Knowledge_Bases_with_Iterative_User_Feedback (5).pdf

Publisher postprint (716.97 kB)

Download

All documents in ORBilu are protected by a user license.

Send to

RIS BibTex APA Chicago Permalink X Linkedin

Details

Abstract :

[en] Domain-oriented knowledge bases (KBs) such as DBpedia and YAGO are largely constructed by applying a set of predefined extraction rules to the semi-structured contents of Wikipedia articles. Although both of these large-scale KBs achieve very high average precision values (above 95% for YAGO3), subtle mistakes in a few of the underlying extraction rules may still impose a substantial amount of systematic extraction mistakes for specific relations. For example, by applying the same regular expressions to extract person names of both Asian and Western nationality, YAGO erroneously swaps most of the family and given names of Asian person entities. For traditional rule-learning approaches based on Inductive Logic Programming (ILP), it is very difficult to detect these systematic extraction mistakes, since they usually occur only in a relatively small subdomain of the relations’ arguments. In this paper, we thus propose a guided form of ILP, coined “GILP”, that iteratively asks for small amounts of user feedback over a given KB to learn a set of data-cleaning rules that (1) best match the feedback and (2) also generalize to a larger portion of facts in the KB. We propose both algorithms and respective metrics to automatically assess the quality of the learned rules with respect to the user feedback.

Disciplines :

Computer science

Author, co-author :

Wu, Yan

Chen, Jinchuan

Haxhidauti, Plarent

Ellampallil Venugopal, Vinu ; University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Computer Science (DCS)

Theobald, Martin ; University of Luxembourg > Faculty of Science, Technology and Medicine (FSTM) > Department of Computer Science (DCS)

External co-authors :

yes

Language :

English

Title :

Guided Inductive Logic Programming: Cleaning Knowledge Bases with Iterative User Feedback

Publication date :

12 March 2020

Event name :

GCAI 2020. 6th Global Conference on Artificial Intelligence (GCAI 2020)

Event date :

10-02-2020

Main work title :

Guided Inductive Logic Programming: Cleaning Knowledge Bases with Iterative User Feedback

Pages :

92–106

Peer reviewed :

Peer reviewed

Available on ORBilu :

since 18 January 2021

Statistics

Number of views

82 (6 by Unilu)

Number of downloads

86 (5 by Unilu)

More statistics