Home » lucene-3.0.1-src » org.apache » lucene » analysis » shingle »

org.apache.lucene.analysis.shingle

Abstract Classes:

ShingleMatrixFilter.TokenSettingsCodec   Strategy used to code and decode meta data of the tokens from the input stream regarding how to position the tokens in the matrix, set and retreive weight, et c.  code | html

Classes:

ShingleAnalyzerWrapper   A ShingleAnalyzerWrapper wraps a ShingleFilter around another Analyzer code | html
ShingleAnalyzerWrapper.SavedStreams     code | html
ShingleFilter  

A ShingleFilter constructs shingles (token n-grams) from a token stream. 

code | html
ShingleMatrixFilter  

A ShingleMatrixFilter constructs shingles (token n-grams) from a token stream. 

code | html
ShingleMatrixFilter.Matrix   A column focused matrix in three dimensions:
Token[column][row][z-axis] {
    {{hello}, {greetings, and, salutations}},
    {{world}, {earth}, {tellus}}
};
todo consider row groups to indicate that shingles is only to contain permutations with texts in that same row group. 
code | html
ShingleMatrixFilter.Matrix.Column     code | html
ShingleMatrixFilter.Matrix.Column.Row     code | html
ShingleMatrixFilter.OneDimensionalNonWeightedTokenSettingsCodec   Using this codec makes a ShingleMatrixFilter act like org.apache.lucene.analysis.shingle.ShingleFilter code | html
ShingleMatrixFilter.SimpleThreeDimensionalTokenSettingsCodec   A full featured codec not to be used for something serious.  code | html
ShingleMatrixFilter.TokenPositioner   Used to describe how a org.apache.lucene.analysis.Token is to be inserted to a org.apache.lucene.analysis.shingle.ShingleMatrixFilter.Matrix code | html
ShingleMatrixFilter.TwoDimensionalNonWeightedSynonymTokenSettingsCodec   A codec that creates a two dimensional matrix by treating tokens from the input stream with 0 position increment as new rows to the current column.  code | html