org.apache.lucene.benchmark.utils (lucene-3.0.1-src)



ExtractReuters   Splits the Reuters SGML documents into simple text files containing: Title, Date, Dateline, Body.
ExtractWikipedia   Extracts the downloaded Wikipedia dump into separate files for indexing.
NoDeletionPolicy   Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements.
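
To illustrate the kind of splitting the ExtractReuters entry describes, here is a minimal sketch in Java. It is not the ExtractReuters source: the class name `ReutersSplitSketch` and the methods `field` and `split` are hypothetical, and it assumes each article is wrapped in a `<REUTERS ...>` element with `<TITLE>`, `<DATE>`, `<DATELINE>` and `<BODY>` children. It works on an in-memory string, writes no files, and does not decode SGML entities.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical sketch; not the actual ExtractReuters implementation.
final class ReutersSplitSketch {

    // Assumption: one <REUTERS ...> ... </REUTERS> element per article.
    private static final Pattern ARTICLE =
            Pattern.compile("<REUTERS\\b[^>]*>(.*?)</REUTERS>", Pattern.DOTALL);

    // Returns the trimmed text of the first <tag>...</tag> pair, or "" if absent.
    static String field(String article, String tag) {
        Pattern p = Pattern.compile("<" + tag + ">(.*?)</" + tag + ">", Pattern.DOTALL);
        Matcher m = p.matcher(article);
        return m.find() ? m.group(1).trim() : "";
    }

    // Produces one plain-text document per article, with the fields
    // on separate lines in the order Title, Date, Dateline, Body.
    static List<String> split(String sgml) {
        List<String> docs = new ArrayList<>();
        Matcher m = ARTICLE.matcher(sgml);
        while (m.find()) {
            String article = m.group(1);
            docs.add(String.join("\n",
                    field(article, "TITLE"),
                    field(article, "DATE"),
                    field(article, "DATELINE"),
                    field(article, "BODY")));
        }
        return docs;
    }
}
```

Each string returned by `split` could then be written to its own file; a missing field simply yields an empty line in this sketch.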