org.apache.lucene.analysis.cn
public final class ChineseFilter
java.lang.Object
   org.apache.lucene.util.AttributeSource
      org.apache.lucene.analysis.TokenStream
         org.apache.lucene.analysis.TokenFilter
            org.apache.lucene.analysis.cn.ChineseFilter

All Implemented Interfaces:
    Closeable

A TokenFilter that removes tokens found in a stop word table. TODO:
  1. Add Chinese stop words, such as \ue400
  2. Dictionary-based Chinese word extraction
  3. Intelligent Chinese word extraction
Field Summary
public static final String[] STOP_WORDS
Fields inherited from org.apache.lucene.analysis.TokenFilter:
input
Constructor:
 public ChineseFilter(TokenStream in) 
Method Summary for org.apache.lucene.analysis.cn.ChineseFilter:
incrementToken
Methods from org.apache.lucene.analysis.TokenFilter:
close,   end,   reset
Methods from org.apache.lucene.analysis.TokenStream:
close,   end,   incrementToken,   reset
Methods from org.apache.lucene.util.AttributeSource:
addAttribute,   addAttributeImpl,   captureState,   clearAttributes,   cloneAttributes,   equals,   getAttribute,   getAttributeClassesIterator,   getAttributeFactory,   getAttributeImplsIterator,   hasAttribute,   hasAttributes,   hashCode,   restoreState,   toString
Methods from java.lang.Object:
clone,   equals,   finalize,   getClass,   hashCode,   notify,   notifyAll,   toString,   wait,   wait,   wait
Method Detail for org.apache.lucene.analysis.cn.ChineseFilter:
 public boolean incrementToken() throws IOException
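
The idea of a filter that advances through an upstream token source and skips entries found in a stop word table can be sketched in plain Java. This is not the Lucene implementation: the class StopTableFilterSketch, its STOP_TABLE contents, the case-insensitive lookup, and the term() accessor are all hypothetical stand-ins chosen for illustration, since this page does not list the real STOP_WORDS values or any further rules the filter may apply. Only the general contract (incrementToken() returns true while another token is available and false at the end of the stream) mirrors the method documented above.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.Iterator;
import java.util.List;
import java.util.Locale;
import java.util.Set;

public class StopTableFilterSketch {
    // Hypothetical stop word table; the actual ChineseFilter.STOP_WORDS
    // contents are not shown on this page.
    static final Set<String> STOP_TABLE =
            new HashSet<>(Arrays.asList("a", "and", "the", "of"));

    private final Iterator<String> input; // stand-in for the wrapped TokenStream
    private String current;               // stand-in for the current term

    StopTableFilterSketch(Iterator<String> input) {
        this.input = input;
    }

    // Advances to the next upstream token that is not in the stop table.
    // Returns false once the upstream source is exhausted.
    boolean incrementToken() {
        while (input.hasNext()) {
            String token = input.next();
            // Assumption: lookups ignore case.
            if (!STOP_TABLE.contains(token.toLowerCase(Locale.ROOT))) {
                current = token;
                return true;
            }
        }
        current = null;
        return false;
    }

    // Returns the token selected by the last successful incrementToken() call.
    String term() {
        return current;
    }

    // Convenience helper: drains the filter into a list.
    static List<String> filter(List<String> tokens) {
        StopTableFilterSketch f = new StopTableFilterSketch(tokens.iterator());
        List<String> kept = new ArrayList<>();
        while (f.incrementToken()) {
            kept.add(f.term());
        }
        return kept;
    }

    public static void main(String[] args) {
        // "the" and "of" are dropped; the remaining tokens pass through in order.
        System.out.println(filter(Arrays.asList("the", "history", "of", "中", "文")));
    }
}
```

The loop inside incrementToken() is the part worth noting: a filter of this kind may consume several upstream tokens before it produces one, so callers should rely only on the boolean result rather than on a one-to-one correspondence with the input.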