java.lang.Object
  org.apache.lucene.analysis.Analyzer
    fr.gouv.culture.sdx.search.lucene.analysis.AbstractAnalyzer
      fr.gouv.culture.sdx.search.lucene.analysis.DefaultAnalyzer
        fr.gouv.culture.sdx.search.lucene.analysis.Analyzer_br
org.apache.lucene.analysis.br.BrazilianAnalyzer
public class Analyzer_br
Analyzer for the Brazilian language. Supports an external list of stopwords (words that will not be indexed at all) and an external list of exclusions (words that will not be stemmed, but will still be indexed).
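The difference between the two lists can be illustrated with a minimal, self-contained sketch. It does not use this class or Lucene: the class name StopAndExclusionSketch, the method names, and the toy stemmer (which only strips a trailing "s") are hypothetical stand-ins chosen for illustration, not the Brazilian stemming actually applied by the analyzer.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Set;

public class StopAndExclusionSketch {

    // Hypothetical toy stemmer: strips one trailing "s" from longer words.
    // This is NOT the Brazilian stemmer; it only makes the effect visible.
    static String toyStem(String term) {
        return term.length() > 3 && term.endsWith("s")
                ? term.substring(0, term.length() - 1)
                : term;
    }

    // Returns the terms that would be indexed under the two rules:
    // stopwords are dropped entirely, exclusions are kept unstemmed,
    // and every other term is stemmed.
    static List<String> analyze(String text, Set<String> stopwords, Set<String> exclusions) {
        List<String> indexed = new ArrayList<>();
        for (String raw : text.toLowerCase(Locale.ROOT).split("\\s+")) {
            if (raw.isEmpty() || stopwords.contains(raw)) {
                continue; // stopword: not indexed at all
            }
            indexed.add(exclusions.contains(raw) ? raw : toyStem(raw));
        }
        return indexed;
    }

    public static void main(String[] args) {
        Set<String> stopwords = Set.of("de", "os");
        Set<String> exclusions = Set.of("paris");
        // prints [livro, paris]
        System.out.println(analyze("Os livros de Paris", stopwords, exclusions));
    }
}
```

In this sketch "os" and "de" disappear from the output, "livros" is reduced by the toy stemmer, and "paris" is indexed unchanged because it is on the exclusion list.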
Field Summary | |
---|---|
protected static java.lang.String |
ANALYZER_TYPE
Deprecated. |
static java.lang.String[] |
BRAZILIAN_STOP_WORDS
Deprecated. List of typical Brazilian stopwords. |
Fields inherited from class fr.gouv.culture.sdx.search.lucene.analysis.DefaultAnalyzer |
---|
ATTRIBUTE_EXCLUDE_STEMS, ATTRIBUTE_USE_STOP_WORDS, DEFAULT_STOP_WORDS, EXCLUDE_STEM_ELEMENT, EXCLUDE_STEMS_ELEMENT, excludeTable, stopTable |
Fields inherited from class fr.gouv.culture.sdx.search.lucene.analysis.AbstractAnalyzer |
---|
logger |
Constructor Summary | |
---|---|
Analyzer_br(java.io.File stopwords)
Deprecated. Builds an analyzer with the given stop words. |
|
Analyzer_br(java.util.Set stopwords)
Deprecated. Builds an analyzer with the given stop words. |
|
Analyzer_br(java.lang.String[] stopwords)
Deprecated. Builds an analyzer with the given stop words. |
Method Summary | |
---|---|
protected java.lang.String |
getAnalyzerType()
Deprecated. |
void |
setStemExclusionTable(java.io.File exclusionlist)
Deprecated. Builds an exclusion list from the words contained in the given file. |
void |
setStemExclusionTable(java.util.Set exclusionlist)
Deprecated. Builds an exclusion list from a Set. |
void |
setStemExclusionTable(java.lang.String[] exclusionlist)
Deprecated. Builds an exclusion list from an array of Strings. |
org.apache.lucene.analysis.TokenStream |
tokenStream(java.lang.String fieldName,
java.io.Reader reader)
Deprecated. Creates a TokenStream which tokenizes all the text in the provided Reader. |
Methods inherited from class fr.gouv.culture.sdx.search.lucene.analysis.DefaultAnalyzer |
---|
buildExcludeTable, buildStopTable, configure, getDefaultStopWords, tokenStream |
Methods inherited from class fr.gouv.culture.sdx.search.lucene.analysis.AbstractAnalyzer |
---|
enableLogging, toSAX |
Methods inherited from class org.apache.lucene.analysis.Analyzer |
---|
getPositionIncrementGap |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
protected static final java.lang.String ANALYZER_TYPE
public static final java.lang.String[] BRAZILIAN_STOP_WORDS
Constructor Detail |
---|
public Analyzer_br(java.lang.String[] stopwords)
Parameters: stopwords

public Analyzer_br(java.util.Set stopwords)
Parameters: stopwords

public Analyzer_br(java.io.File stopwords) throws java.io.IOException
Parameters: stopwords
Throws: java.io.IOException
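The three constructor variants accept the same kind of word list from different sources: an array, a Set, or a File. The sketch below shows one way such inputs could be normalized to a Set using only the JDK. It is an illustration, not the code of this class: the class name WordListSketch and its methods are hypothetical, and the file layout (one word per line, UTF-8) is an assumption, since this page does not describe the expected file format.

```java
import java.io.File;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.util.Arrays;
import java.util.LinkedHashSet;
import java.util.Set;

public class WordListSketch {

    // Array variant: copy the words into a Set, dropping duplicates.
    static Set<String> fromArray(String[] words) {
        return new LinkedHashSet<>(Arrays.asList(words));
    }

    // File variant. ASSUMPTION: one word per line, UTF-8 encoded.
    // Blank lines are skipped and surrounding whitespace is trimmed.
    // Reading can fail, hence the IOException in the signature.
    static Set<String> fromFile(File file) throws IOException {
        Set<String> words = new LinkedHashSet<>();
        for (String line : Files.readAllLines(file.toPath(), StandardCharsets.UTF_8)) {
            String word = line.trim();
            if (!word.isEmpty()) {
                words.add(word);
            }
        }
        return words;
    }

    public static void main(String[] args) throws IOException {
        File tmp = File.createTempFile("stopwords", ".txt");
        tmp.deleteOnExit();
        Files.write(tmp.toPath(), Arrays.asList("de", "", " os "), StandardCharsets.UTF_8);
        System.out.println(fromFile(tmp));                        // prints [de, os]
        System.out.println(fromArray(new String[] {"de", "os"})); // prints [de, os]
    }
}
```

With real code, the resulting Set (or the original array or File) would be handed to the matching constructor; callers of the File variant need to handle java.io.IOException.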
Method Detail |
---|
public void setStemExclusionTable(java.lang.String[] exclusionlist)
Parameters: exclusionlist

public void setStemExclusionTable(java.util.Set exclusionlist)
Parameters: exclusionlist

public void setStemExclusionTable(java.io.File exclusionlist) throws java.io.IOException
Parameters: exclusionlist
Throws: java.io.IOException

public final org.apache.lucene.analysis.TokenStream tokenStream(java.lang.String fieldName, java.io.Reader reader)
tokenStream in interface Analyzer
tokenStream in class DefaultAnalyzer

protected java.lang.String getAnalyzerType()
getAnalyzerType in class DefaultAnalyzer
See also: fr.gouv.culture.sdx.search.lucene.analysis.AbstractAnalyzer#getAnalyserType()