HUSCAP logo Hokkaido Univ. logo

Hokkaido University Collection of Scholarly and Academic Papers >
Graduate School of Information Science and Technology / Faculty of Information Science and Technology >
Peer-reviewed Journal Articles, etc >

Rule-based Approach to Extracting Location, Creator and Membership-related Information from Wikipedia-based Information-rich Taxonomy for ConceptNet Expansion

Files in This Item:
Rule-based Approach to Extracting Location, Creator and Membership-relate__.pdf217.58 kBPDFView/Open
Please use this identifier to cite or link to this item:http://hdl.handle.net/2115/63954

Title: Rule-based Approach to Extracting Location, Creator and Membership-related Information from Wikipedia-based Information-rich Taxonomy for ConceptNet Expansion
Authors: Krawczyk, Marek Browse this author
Rzepka, Rafal Browse this author →KAKEN DB
Araki, Kenji Browse this author →KAKEN DB
Issue Date: Jul-2016
Publisher: IJCAI
Journal Title: Proceedings of Language Sense on Computers IJCAI 2016 Workshop
Abstract: In this paper we present a method for extract- ing IsA assertions (hyponymy relations), AtLoca- tion assertions (informing of the location of an object or place), LocatedNear assertions (informing of neighboring locations), CreatedBy asser- tions (informing of the creator of an object) and MemberOf assertions (informing of group mem- bership) automatically from Japanese Wikipedia XML dump files. These assertions would be suitable for introduction to the Japanese part of the ConceptNet common sense knowledge ontology. We use the Hyponymy extraction tool v1.0, which analyzes definition, category and hierarchy structures of Wikipedia articles to extract IsA asser- tions and produce an information-rich taxonomy. From this taxonomy we extract additional informa- tion, in this case AtLocation, LocatedNear, Cre- atedBy and MemberOf types of assertions, using our original method. The presented experiments prove that we achieved our research goal on a large scale: both methods produce satisfactory results, and we were able to acquire 5,866,680 IsA assertions with 96.0% reliability, 131,760 AtLoca- tion assertion pairs with 93.5% reliability, 6,217 LocatedNear assertion pairs with 98.5% reliability, 270,230 CreatedBy assertion pairs with 78.5% reliability and 21,053 MemberOf assertions with 87.0% reliability. Our method surpassed the baseline system in terms of both precision and the number of acquired assertions.
Conference Name: International Joint Conference on Artificial Intelligence
Conference Sequence: 25
Conference Place: New York City, New York
Type: proceedings
URI: http://hdl.handle.net/2115/63954
Appears in Collections:情報科学院・情報科学研究院 (Graduate School of Information Science and Technology / Faculty of Information Science and Technology) > 雑誌発表論文等 (Peer-reviewed Journal Articles, etc)

Submitter: RZEPKA Rafal

Export metadata:

OAI-PMH ( junii2 , jpcoar_1.0 )

MathJax is now OFF:


 

 - Hokkaido University