|
Hokkaido University Collection of Scholarly and Academic Papers >
Graduate School of Information Science and Technology / Faculty of Information Science and Technology >
Peer-reviewed Journal Articles, etc >
Rule-based Approach to Extracting Location, Creator and Membership-related Information from Wikipedia-based Information-rich Taxonomy for ConceptNet Expansion
Title: | Rule-based Approach to Extracting Location, Creator and Membership-related Information from Wikipedia-based Information-rich Taxonomy for ConceptNet Expansion |
Authors: | Krawczyk, Marek Browse this author | Rzepka, Rafal Browse this author →KAKEN DB | Araki, Kenji Browse this author →KAKEN DB |
Issue Date: | Jul-2016 |
Publisher: | IJCAI |
Journal Title: | Proceedings of Language Sense on Computers IJCAI 2016 Workshop |
Abstract: | In this paper we present a method for extract- ing IsA assertions (hyponymy relations), AtLoca- tion assertions (informing of the location of an object or place), LocatedNear assertions (informing of neighboring locations), CreatedBy asser- tions (informing of the creator of an object) and MemberOf assertions (informing of group mem- bership) automatically from Japanese Wikipedia XML dump files. These assertions would be suitable for introduction to the Japanese part of the ConceptNet common sense knowledge ontology. We use the Hyponymy extraction tool v1.0, which analyzes definition, category and hierarchy structures of Wikipedia articles to extract IsA asser- tions and produce an information-rich taxonomy. From this taxonomy we extract additional informa- tion, in this case AtLocation, LocatedNear, Cre- atedBy and MemberOf types of assertions, using our original method. The presented experiments prove that we achieved our research goal on a large scale: both methods produce satisfactory results, and we were able to acquire 5,866,680 IsA assertions with 96.0% reliability, 131,760 AtLoca- tion assertion pairs with 93.5% reliability, 6,217 LocatedNear assertion pairs with 98.5% reliability, 270,230 CreatedBy assertion pairs with 78.5% reliability and 21,053 MemberOf assertions with 87.0% reliability. Our method surpassed the baseline system in terms of both precision and the number of acquired assertions. |
Conference Name: | International Joint Conference on Artificial Intelligence |
Conference Sequence: | 25 |
Conference Place: | New York City, New York |
Type: | proceedings |
URI: | http://hdl.handle.net/2115/63954 |
Appears in Collections: | 情報科学院・情報科学研究院 (Graduate School of Information Science and Technology / Faculty of Information Science and Technology) > 雑誌発表論文等 (Peer-reviewed Journal Articles, etc)
|
Submitter: RZEPKA Rafal
|