Save This Page
Home » lucene-3.0.1-src » org.apache » lucene » analysis » [javadoc | source]
public final class: ASCIIFoldingFilter [javadoc | source]

All Implemented Interfaces:

This class converts alphabetic, numeric, and symbolic Unicode characters which are not in the first 127 ASCII characters (the "Basic Latin" Unicode block) into their ASCII equivalents, if one exists. Characters from the following Unicode blocks are converted; however, only those characters with reasonable ASCII alternatives are converted: See: The set of character conversions supported by this class is a superset of those supported by Lucene's ISOLatin1AccentFilter which strips accents from Latin1 characters. For example, 'à' will be replaced by 'a'.
Fields inherited from org.apache.lucene.analysis.TokenFilter:
 public ASCIIFoldingFilter(TokenStream input) 
Method from org.apache.lucene.analysis.ASCIIFoldingFilter Summary:
foldToASCII,   incrementToken
Methods from org.apache.lucene.analysis.TokenFilter:
close,   end,   reset
Methods from org.apache.lucene.analysis.TokenStream:
close,   end,   incrementToken,   reset
Methods from org.apache.lucene.util.AttributeSource:
addAttribute,   addAttributeImpl,   captureState,   clearAttributes,   cloneAttributes,   equals,   getAttribute,   getAttributeClassesIterator,   getAttributeFactory,   getAttributeImplsIterator,   hasAttribute,   hasAttributes,   hashCode,   restoreState,   toString
Methods from java.lang.Object:
clone,   equals,   finalize,   getClass,   hashCode,   notify,   notifyAll,   toString,   wait,   wait,   wait
Method from org.apache.lucene.analysis.ASCIIFoldingFilter Detail:
 public  void foldToASCII(char[] input,
    int length) 
    Converts characters above ASCII to their ASCII equivalents. For example, accents are removed from accented characters.
 public boolean incrementToken() throws IOException