Advances in Cross-Language Information Retrieval: Third Workshop of the Cross-Language Evaluation Forum, CLEF 2002. Rome, Italy, September 19-20, 2002. Revised Papers

The third campaign of the Cross-Language Evaluation Forum (CLEF) for European languages was held from January to September 2002. Participation in this campaign showed a slight rise in the number of participants, with 37 groups from both academia and industry, and a steep rise in the number of experiments they submitted for one or more of the five official tracks. The campaign culminated in a two-day workshop held in Rome, Italy, 19-20 September, immediately following the Sixth European Conference on Digital Libraries (ECDL 2002), attended by nearly 70 researchers and system developers. The objective of the workshop was to bring together the groups that had participated in CLEF 2002 so that they could report on the results of their experiments. Attendance at the workshop was thus limited to participants in the campaign plus several invited guests with recognized expertise in the multilingual information access field.

This volume contains thoroughly revised and expanded versions of the preliminary papers presented at the workshop, accompanied by a complete run-down and detailed analysis of the results, and it thus provides an exhaustive record of the CLEF 2002 campaign. CLEF 2002 was conducted within the framework of a project of the Information Society Technologies programme of the European Commission (IST-2000-31002). The campaign was organized in collaboration with the US National Institute of Standards and Technology (NIST) and with the support of the DELOS Network of Excellence for Digital Libraries. The support of NIST and DELOS in the running of the evaluation campaign is gratefully acknowledged. We would also like to thank the other members of the Workshop Steering Committee for their assistance in the coordination of this event.
Contents
Cross-Language Spoken Document Retrieval
Overview of Results
Cross-Language Spoken Document Retrieval
Cross-Language and More
UTACLIR CLEF 2002
Combining Evidence for Cross-Language Information Retrieval
Portuguese-English Experiments Using Latent Semantic Indexing
Thomson Legal and Regulatory Experiments for CLEF 2002
Monolingual German Runs
Assessing Automatically Extracted Bilingual Lexicons
Comparing User-Assisted and Automatic Query Translation
Exploring the Effect of Query Translation
Cross-Language Spoken Document Retrieval Experiments
on the TREC SDR Collection
N Kando
Bilingual to Spanish Runs
Merging Mechanisms in Multilingual Information Retrieval
Reidsma D Hiemstra F de Jong and W Kraaij
Some Experiments with the Dutch Collection
Monolingual Experiments
Italian Monolingual Information Retrieval with PROSIT
Improving the Automatic Retrieval of Text Documents
Experiments in Term Expansion Using Thesauri in Spanish
Pliers and Snowball at CLEF 2002
Experiments with a Chunker and Lucene
Using Thesauri in Cross-Language Retrieval of German
Bilingual to French Runs
Bilingual to Italian Runs
Bilingual to Dutch Runs
Monolingual Spanish Runs
Monolingual Finnish Runs
Monolingual Italian Runs
Monolingual Swedish Runs
GIRT Domain-Specific Runs
Author Index
Other editions
Advances in Cross-Language Information Retrieval: Third Workshop of the ... Martin Braschler, Julio Gonzalo, Martin Kluck - 2003
Common terms and phrases
algorithm Amaryllis approach automatic average precision base runs baseline bigrams bilingual runs Braschler C.A. Peters campaign CLEF CLIR combined compared compound Computer corpora corpus Cross-Language Information Retrieval decompounding dictionary document collection documents retrieved Dutch English queries evaluation expansion term experiments Finnish GIRT iCLEF improvement indexing Italian keywords lemmatization LNCS machine translation merging strategies method monolingual retrieval monolingual runs morphological multilingual n-grams Neuchâtel NTCIR number of documents number of relevant obtained official runs Okapi original query participants phrases Prec query expansion query terms query translation ranked list relevance assessments relevance feedback relevant documents result lists retrieval effectiveness retrieval performance retrieval results score searchers Section SIGIR similarity Spanish spoken document Spoken Document Retrieval statistical stemmer stemming stemming algorithm stopword submitted Swedish Systran Table target language TD Auto techniques test collection Text REtrieval Conference thesaurus title and description Topic Language track translation disambiguation TREC words Workshop