Recallable Question Answering-Based Re-Ranking Considering Semantic Region for Cross-Modal Retrieval

Yanagi, Rintaro; Togo, Ren; Ogawa, Takahiro; Haseyama, Miki

doi:10.1109/OJSP.2023.3238280


Hokkaido University \| Library \| HUSCAP	Advanced Search		言語

	Home
	About HUSCAP
	Open Access Policy

	Browse by Author

Browse
	Communities & Collections

	Scholarly Journals
	Theses
	Doctoral Dissertations Listed by Graduate Schools
	Conference Procs.
	Events

	HUSCAP Senior (in Japanese)

	Societies

	Downloads (country)

For university staff
	How to post your papers to HUSCAP
	Publication of theses
	Helpline about theses publication

Open Archives Compliant

You can search our collection also at:
	Google
	Google Scholar
	CiNii
	IRDB
	OAIster
	NDLTD

Hokkaido University Collection of Scholarly and Academic Papers >
Graduate School of Information Science and Technology / Faculty of Information Science and Technology >
Peer-reviewed Journal Articles, etc >

Recallable Question Answering-Based Re-Ranking Considering Semantic Region for Cross-Modal Retrieval

Files in This Item:

The file(s) associated with this item can be obtained from the following URL: https://doi.org/10.1109/OJSP.2023.3238280

Title:	Recallable Question Answering-Based Re-Ranking Considering Semantic Region for Cross-Modal Retrieval
Authors:	Yanagi, Rintaro Browse this author
	Togo, Ren Browse this author
	Ogawa, Takahiro Browse this author →KAKEN DB
	Haseyama, Miki Browse this author →KAKEN DB
Keywords:	Cross-modal retrieval
	image retrieval
	question-answering
	recallability
	re-ranking
Issue Date:	19-Jan-2023
Publisher:	IEEE (Institute of Electrical and Electronics Engineers)
Journal Title:	IEEE Open Journal of Signal Processing
Volume:	4
Start Page:	1
End Page:	11
Publisher DOI:	10.1109/OJSP.2023.3238280
Abstract:	Question answering (QA)-based re-ranking methods for cross-modal retrieval have been recently proposed to further narrow down similar candidate images. The conventional QA-based re-ranking methods provide questions to users by analyzing candidate images, and the initial retrieval results are re-ranked based on the user's feedback. Contrary to these developments, only focusing on performance improvement makes it difficult to efficiently elicit the user's retrieval intention. To realize more useful QA-based re-ranking, considering the user interaction for eliciting the user's retrieval intention is required. In this paper, we propose a QA-based re-ranking method with considering two important factors for eliciting the user's retrieval intention: query-image relevance and recallability. Considering the query-image relevance enables to only focus on the candidate images related to the provided query text, while, focusing on the recallability enables users to easily answer the provided question. With these procedures, our method can efficiently and effectively elicit the user's retrieval intention. Experimental results using Microsoft Common Objects in Context and computationally constructed dataset including similar candidate images show that our method can improve the performance of the cross-modal retrieval methods and the QA-based re-ranking methods.
Type:	article
URI:	http://hdl.handle.net/2115/88973
Appears in Collections:	情報科学院・情報科学研究院 (Graduate School of Information Science and Technology / Faculty of Information Science and Technology) > 雑誌発表論文等 (Peer-reviewed Journal Articles, etc)

OAI-PMH ( junii2 , jpcoar_1.0 )

- Hokkaido University