2003Konferenzveröffentlichung DOI: 10.18452/19970
Using Thesauri in Cross-Language Retrieval of German and French Indexed Collections
For CLEF 2002, Berkeley’s group one experimented with Russian, French and English as query languages, and investigated thesaurus-aided retrieval for the special CLEF collections GIRT and Amaryllis. Two techniques were used to locate source language topic terms within the controlled vocabulary and replace them with the document language thesaurus terms to form the query sent against the collection index. This form of controlled vocabulary-aided translation is called thesaurus matching. Results show that thesaurus-aided cross-language retrieval performs slightly worse than machine translation retrieval on average, but can yield decidedly better results for particular queries. In addition, Berkeley submitted runs to the monolingual and bilingual (French and German) CLEF main tasks. We found that bilingual retrieval sometimes outperforms monolingual retrieval and postulate reasons to explain this phenomenon.
Files in this item