Hokkaido University Collection of Scholarly and Academic Papers >
Graduate School of Information Science and Technology / Faculty of Information Science and Technology >
Peer-reviewed Journal Articles, etc >
Extracting location and creator-related information from Wikipedia-based information-rich taxonomy for ConceptNet expansion
This item is licensed under:Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International
Title: | Extracting location and creator-related information from Wikipedia-based information-rich taxonomy for ConceptNet expansion |
Authors: | Krawczyk, Marek Browse this author | Rzepka, Rafal Browse this author →KAKEN DB | Araki, Kenji Browse this author →KAKEN DB |
Keywords: | common sense knowledge | knowledge extraction | ConceptNet |
Issue Date: | Sep-2016 |
Publisher: | Elsevier |
Journal Title: | Knowledge-Based Systems |
Volume: | 108 |
Start Page: | 125 |
End Page: | 131 |
Publisher DOI: | 10.1016/j.knosys.2016.05.004 |
Abstract: | Our research goal is to generate new assertions suitable for introduction to theJapanese part of the ConceptNet common sense knowledge ontology. In thispaper we present a method for extracting IsA assertions (hyponymy relations),AtLocation assertions (informing of the location of an object or place), Located-Near assertions (informing of neighboring locations) and CreatedBy assertions(informing of the creator of an object) automatically from Japanese WikipediaXML dump les. We use the Hyponymy extraction tool v1.0, which analyzesde nition, category and hierarchy structures of Wikipedia articles to extractIsA assertions and produce an information-rich taxonomy. From this taxon-omy we extract additional information, in this case AtLocation, LocatedNearand CreatedBy types of assertions, using our original method. The presentedexperiments prove that we achieved our research goal on a large scale: bothmethods produce satisfactory results, and we were able to acquire 5,866,680 IsAassertions with 96.0% reliability, 131,760 AtLocation assertion pairs with 93.5%reliability, 6,217 LocatedNear assertion pairs with 98.5% reliability and 270,230CreatedBy assertion pairs with 78.5% reliability. Our method surpassed thebaseline system in terms of both precision and the number of acquired asser-tions. |
Rights: | © 2016, Elsevier. This manuscript version is made available under the CC-BY-NC-ND 4.0 license http://creativecommons.org/licenses/by-nc-nd/4.0/ | http://creativecommons.org/licenses/by-nc-nd/4.0/ |
Type: | article (author version) |
URI: | http://hdl.handle.net/2115/71396 |
Appears in Collections: | 情報科学院・情報科学研究院 (Graduate School of Information Science and Technology / Faculty of Information Science and Technology) > 雑誌発表論文等 (Peer-reviewed Journal Articles, etc)
|
Submitter: RZEPKA Rafal
|