Extracting location and creator-related information from Wikipedia-based information-rich taxonomy for ConceptNet expansion

Krawczyk, Marek; Rzepka, Rafal; Araki, Kenji

doi:10.1016/j.knosys.2016.05.004


Hokkaido University \| Library \| HUSCAP	Advanced Search		言語

	Home
	About HUSCAP
	Open Access Policy

	Browse by Author

Browse
	Communities & Collections

	Scholarly Journals
	Theses
	Doctoral Dissertations Listed by Graduate Schools
	Conference Procs.
	Events

	HUSCAP Senior (in Japanese)

	Societies

	Downloads (country)

For university staff
	How to post your papers to HUSCAP
	Publication of theses
	Helpline about theses publication

Open Archives Compliant

You can search our collection also at:
	Google
	Google Scholar
	CiNii
	IRDB
	OAIster
	NDLTD

Hokkaido University Collection of Scholarly and Academic Papers >
Graduate School of Information Science and Technology / Faculty of Information Science and Technology >
Peer-reviewed Journal Articles, etc >

Extracting location and creator-related information from Wikipedia-based information-rich taxonomy for ConceptNet expansion

This item is licensed under:Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International

Files in This Item:

Extracting location and creator-related information from Wikipedia-based information-rich taxonomy for ConceptNet expansion.pdf

1.38 MB

PDF

View/Open

Please use this identifier to cite or link to this item:http://hdl.handle.net/2115/71396

Title:	Extracting location and creator-related information from Wikipedia-based information-rich taxonomy for ConceptNet expansion
Authors:	Krawczyk, Marek Browse this author
	Rzepka, Rafal Browse this author →KAKEN DB
	Araki, Kenji Browse this author →KAKEN DB
Keywords:	common sense knowledge
	knowledge extraction
	ConceptNet
Issue Date:	Sep-2016
Publisher:	Elsevier
Journal Title:	Knowledge-Based Systems
Volume:	108
Start Page:	125
End Page:	131
Publisher DOI:	10.1016/j.knosys.2016.05.004
Abstract:	Our research goal is to generate new assertions suitable for introduction to theJapanese part of the ConceptNet common sense knowledge ontology. In thispaper we present a method for extracting IsA assertions (hyponymy relations),AtLocation assertions (informing of the location of an object or place), Located-Near assertions (informing of neighboring locations) and CreatedBy assertions(informing of the creator of an object) automatically from Japanese WikipediaXML dump les. We use the Hyponymy extraction tool v1.0, which analyzesde nition, category and hierarchy structures of Wikipedia articles to extractIsA assertions and produce an information-rich taxonomy. From this taxon-omy we extract additional information, in this case AtLocation, LocatedNearand CreatedBy types of assertions, using our original method. The presentedexperiments prove that we achieved our research goal on a large scale: bothmethods produce satisfactory results, and we were able to acquire 5,866,680 IsAassertions with 96.0% reliability, 131,760 AtLocation assertion pairs with 93.5%reliability, 6,217 LocatedNear assertion pairs with 98.5% reliability and 270,230CreatedBy assertion pairs with 78.5% reliability. Our method surpassed thebaseline system in terms of both precision and the number of acquired asser-tions.
Rights:	© 2016, Elsevier. This manuscript version is made available under the CC-BY-NC-ND 4.0 license http://creativecommons.org/licenses/by-nc-nd/4.0/
Rights:	http://creativecommons.org/licenses/by-nc-nd/4.0/
Type:	article (author version)
URI:	http://hdl.handle.net/2115/71396
Appears in Collections:	情報科学院・情報科学研究院 (Graduate School of Information Science and Technology / Faculty of Information Science and Technology) > 雑誌発表論文等 (Peer-reviewed Journal Articles, etc)

Submitter: RZEPKA Rafal

OAI-PMH ( junii2 , jpcoar_1.0 )

- Hokkaido University