The Wikipedia XML corpus

From Brede Wiki
Jump to: navigation, search
Paper (help)
The Wikipedia XML corpus
Authors: Ludovic Denoyer, Patrick Gallinari
Citation: ACM SIGIR Forum 40 (1): 64-69. 2006 June
Database(s):
DOI: 10.1145/1147197.1147210.
Link(s): http://www.sigir.org/forum/2006J/2006j_sigirforum_denoyer.pdf
Search
Web: Bing Google Yahoo!Google PDF
Article: BASE Google Scholar PubMed
Restricted: DTU Digital Library
Other: NIF
Services
Format: BibTeX
Extract: Talairach coordinates from linked PDF: CSV-formated wiki-formated

The Wikipedia XML corpus describes a XML corpus of Wikipedia for use in information retrieval tasks.

The data set available at:

http://www-connex.lip6.fr/~denoyer/wikipediaXML/

The dataset is not the XML dataset distributed from the Wikimedia download site.

The paper is from the INEX Workshop 2006: Dagstuhl, Germany.

Personal tools