Record Details

Multilingual PRF : English lends a helping hand

DSpace at IIT Bombay

 
Field Value
 
Title Multilingual PRF : English lends a helping hand
 
Creator CHINNAKOTLA, MK
RAMAN, K
BHATTACHARYYA, P
 
Subject information-retrieval
models
multilingual
pseudo-relevance feedback
language models
query expansion
 
Description In this paper, we present a novel approach to Pseudo-Relevance Feedback (PRF) called Multilingual PRF (MultiPRF). The key idea is to harness multilinguality. Given a query in one language, we take the help of another language to ameliorate two well-known problems of PRF: (a) the expansion terms from PRF are primarily based on co-occurrence relationships with query terms, so other lexically and semantically related terms, such as morphological variants and synonyms, are not explicitly captured; and (b) PRF is quite sensitive to the quality of the initially retrieved top k documents and is thus not robust. In MultiPRF, a query in language L(1) is translated into language L(2), PRF is performed on a collection in language L(2), and the resultant feedback model is translated from L(2) back into L(1). The final feedback model is obtained by combining the translated model with the original feedback model of the query in L(1). Experiments were performed on standard CLEF collections in languages with widely differing characteristics, viz. French, German, Finnish and Hungarian, with English as the assisting language. We observe that MultiPRF outperforms PRF and is more robust, with consistent and significant improvements across these languages. A thorough analysis of the results reveals that the second language helps in obtaining both co-occurrence-based conceptual terms and lexically and semantically related terms. Additionally, the use of the second-language collection reduces sensitivity to the performance of the initial retrieval, thereby making the approach more robust.
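
The translate-back-and-combine step described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not say how the models are translated or combined, so the probabilistic dictionary, the linear interpolation, the weight of 0.5, and every function and variable name below are assumptions chosen only for illustration, with French as L(1) and English as L(2).

```python
from collections import defaultdict


def translate_model(model_l2, translation_probs):
    """Map a feedback model over L2 terms back into L1.

    model_l2: dict mapping an L2 term to its probability.
    translation_probs: hypothetical dictionary mapping an L2 term to
        a dict of L1 terms with P(L1 term | L2 term).
    """
    translated = defaultdict(float)
    for l2_term, p_term in model_l2.items():
        for l1_term, p_trans in translation_probs.get(l2_term, {}).items():
            translated[l1_term] += p_term * p_trans
    total = sum(translated.values())
    if total == 0:
        return {}
    # Renormalise so the translated model is a probability distribution.
    return {term: p / total for term, p in translated.items()}


def combine_models(model_l1, translated_model, weight):
    """Linearly interpolate two L1 models (an assumed combination rule)."""
    terms = set(model_l1) | set(translated_model)
    return {
        term: (1 - weight) * model_l1.get(term, 0.0)
        + weight * translated_model.get(term, 0.0)
        for term in terms
    }


# Toy data, invented for this example only.
feedback_l1 = {"voiture": 0.6, "moteur": 0.4}      # PRF model in L1
feedback_l2 = {"car": 0.5, "automobile": 0.5}      # PRF model in L2
dictionary = {
    "car": {"voiture": 0.8, "auto": 0.2},
    "automobile": {"automobile": 0.5, "voiture": 0.5},
}

translated = translate_model(feedback_l2, dictionary)
final_model = combine_models(feedback_l1, translated, weight=0.5)

for term, prob in sorted(final_model.items(), key=lambda kv: -kv[1]):
    print(f"{term}\t{prob:.3f}")
```

In this toy run the combined model keeps the original expansion terms and also gains "auto" and "automobile", terms that reach L(1) only through the assisting language, which is the kind of lexically related term the abstract says plain PRF tends to miss.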
 
Publisher ASSOC COMPUTING MACHINERY
 
Date 2011-10-25T14:33:07Z
2011-12-15T09:11:58Z
2010
 
Type Proceedings Paper
 
Identifier SIGIR 2010: PROCEEDINGS OF THE 33RD ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 659-666
978-1-60558-896-4
http://dspace.library.iitb.ac.in/xmlui/handle/10054/15746
http://hdl.handle.net/100/2403
 
Source 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Geneva, Switzerland, July 19-23, 2010
 
Language English