HUSCAP logo Hokkaido Univ. logo

Hokkaido University Collection of Scholarly and Academic Papers >
Graduate School of Information Science and Technology / Faculty of Information Science and Technology >
Peer-reviewed Journal Articles, etc >

Extracting location and creator-related information from Wikipedia-based information-rich taxonomy for ConceptNet expansion

This item is licensed under:Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International

Files in This Item:
Extracting location and creator-related information from Wikipedia-based information-rich taxonomy for ConceptNet expansion.pdf1.38 MBPDFView/Open
Please use this identifier to cite or link to this item:http://hdl.handle.net/2115/71396

Title: Extracting location and creator-related information from Wikipedia-based information-rich taxonomy for ConceptNet expansion
Authors: Krawczyk, Marek Browse this author
Rzepka, Rafal Browse this author →KAKEN DB
Araki, Kenji Browse this author →KAKEN DB
Keywords: common sense knowledge
knowledge extraction
ConceptNet
Issue Date: Sep-2016
Publisher: Elsevier
Journal Title: Knowledge-Based Systems
Volume: 108
Start Page: 125
End Page: 131
Publisher DOI: 10.1016/j.knosys.2016.05.004
Abstract: Our research goal is to generate new assertions suitable for introduction to theJapanese part of the ConceptNet common sense knowledge ontology. In thispaper we present a method for extracting IsA assertions (hyponymy relations),AtLocation assertions (informing of the location of an object or place), Located-Near assertions (informing of neighboring locations) and CreatedBy assertions(informing of the creator of an object) automatically from Japanese WikipediaXML dump les. We use the Hyponymy extraction tool v1.0, which analyzesde nition, category and hierarchy structures of Wikipedia articles to extractIsA assertions and produce an information-rich taxonomy. From this taxon-omy we extract additional information, in this case AtLocation, LocatedNearand CreatedBy types of assertions, using our original method. The presentedexperiments prove that we achieved our research goal on a large scale: bothmethods produce satisfactory results, and we were able to acquire 5,866,680 IsAassertions with 96.0% reliability, 131,760 AtLocation assertion pairs with 93.5%reliability, 6,217 LocatedNear assertion pairs with 98.5% reliability and 270,230CreatedBy assertion pairs with 78.5% reliability. Our method surpassed thebaseline system in terms of both precision and the number of acquired asser-tions.
Rights: © 2016, Elsevier. This manuscript version is made available under the CC-BY-NC-ND 4.0 license http://creativecommons.org/licenses/by-nc-nd/4.0/
http://creativecommons.org/licenses/by-nc-nd/4.0/
Type: article (author version)
URI: http://hdl.handle.net/2115/71396
Appears in Collections:情報科学院・情報科学研究院 (Graduate School of Information Science and Technology / Faculty of Information Science and Technology) > 雑誌発表論文等 (Peer-reviewed Journal Articles, etc)

Submitter: RZEPKA Rafal

Export metadata:

OAI-PMH ( junii2 , jpcoar_1.0 )

MathJax is now OFF:


 

 - Hokkaido University