Analyzer_br (SDX 2.4.1 API)

Overview

Package

Class

Tree

Deprecated

Index

Help

PREV CLASS NEXT CLASS

FRAMES NO FRAMES

SUMMARY: NESTED | FIELD | CONSTR | METHOD

DETAIL: FIELD | CONSTR | METHOD

fr.gouv.culture.sdx.search.lucene.analysis
Class Analyzer_br

java.lang.Object
  org.apache.lucene.analysis.Analyzer
      fr.gouv.culture.sdx.search.lucene.analysis.AbstractAnalyzer
          fr.gouv.culture.sdx.search.lucene.analysis.DefaultAnalyzer
              fr.gouv.culture.sdx.search.lucene.analysis.Analyzer_br

All Implemented Interfaces:: Analyzer, java.io.Serializable, org.apache.avalon.framework.configuration.Configurable, org.apache.avalon.framework.logger.LogEnabled, org.apache.excalibur.xml.sax.XMLizable

Deprecated. use org.apache.lucene.analysis.br.BrazilianAnalyzer

public class Analyzer_br
extends DefaultAnalyzer
extends DefaultAnalyzer

Analyzer for brazilian language. Supports an external list of stopwords (words that will not be indexed at all) and an external list of exclusions (word that will not be stemmed, but indexed).

Version:: $Id: BrazilianAnalyzer.java,v 1.0 2001/02/13 21:29:04
Author:: João Kramer
See Also:: Serialized Form

Field Summary
`protected static java.lang.String`	`ANALYZER_TYPE` Deprecated.
`static java.lang.String[]`	`BRAZILIAN_STOP_WORDS` Deprecated. List of typical brazilian stopwords.

Fields inherited from class fr.gouv.culture.sdx.search.lucene.analysis.DefaultAnalyzer
`ATTRIBUTE_EXCLUDE_STEMS, ATTRIBUTE_USE_STOP_WORDS, DEFAULT_STOP_WORDS, EXCLUDE_STEM_ELEMENT, EXCLUDE_STEMS_ELEMENT, excludeTable, stopTable`

Fields inherited from class fr.gouv.culture.sdx.search.lucene.analysis.AbstractAnalyzer
`logger`

Constructor Summary
`Analyzer_br(java.io.File stopwords)` Deprecated. Builds an analyzer with the given stop words.
`Analyzer_br(java.util.Set stopwords)` Deprecated. Builds an analyzer with the given stop words.
`Analyzer_br(java.lang.String[] stopwords)` Deprecated. Builds an analyzer with the given stop words.

Method Summary
`protected java.lang.String`	`getAnalyzerType()` Deprecated.
`void`	`setStemExclusionTable(java.io.File exclusionlist)` Deprecated. Builds an exclusionlist from the words contained in the given file.
`void`	`setStemExclusionTable(java.util.Set exclusionlist)` Deprecated. Builds an exclusionlist from a Hashtable.
`void`	`setStemExclusionTable(java.lang.String[] exclusionlist)` Deprecated. Builds an exclusionlist from an array of Strings.
`org.apache.lucene.analysis.TokenStream`	`tokenStream(java.lang.String fieldName, java.io.Reader reader)` Deprecated. Creates a TokenStream which tokenizes all the text in the provided Reader.

Methods inherited from class fr.gouv.culture.sdx.search.lucene.analysis.DefaultAnalyzer
`buildExcludeTable, buildStopTable, configure, getDefaultStopWords, tokenStream`

Methods inherited from class fr.gouv.culture.sdx.search.lucene.analysis.AbstractAnalyzer
`enableLogging, toSAX`

Methods inherited from class org.apache.lucene.analysis.Analyzer
`getPositionIncrementGap`

Methods inherited from class java.lang.Object
`clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait`

Field Detail

ANALYZER_TYPE

protected static final java.lang.String ANALYZER_TYPE

Deprecated.

See Also:: Constant Field Values

BRAZILIAN_STOP_WORDS

public static final java.lang.String[] BRAZILIAN_STOP_WORDS

Deprecated.

List of typical brazilian stopwords.

Constructor Detail

Analyzer_br

public Analyzer_br(java.lang.String[] stopwords)

Deprecated.

Builds an analyzer with the given stop words.

Parameters:: stopwords -

Analyzer_br

public Analyzer_br(java.util.Set stopwords)

Deprecated.

Builds an analyzer with the given stop words.

Parameters:: stopwords -

Analyzer_br

public Analyzer_br(java.io.File stopwords)
            throws java.io.IOException

Deprecated.

Builds an analyzer with the given stop words.

Parameters:: stopwords -
Throws:: java.io.IOException

Method Detail

setStemExclusionTable

public void setStemExclusionTable(java.lang.String[] exclusionlist)

Deprecated.

Builds an exclusionlist from an array of Strings.

Parameters:: exclusionlist -

setStemExclusionTable

public void setStemExclusionTable(java.util.Set exclusionlist)

Deprecated.

Builds an exclusionlist from a Hashtable.

Parameters:: exclusionlist -

setStemExclusionTable

public void setStemExclusionTable(java.io.File exclusionlist)
                           throws java.io.IOException

Deprecated.

Builds an exclusionlist from the words contained in the given file.

Parameters:: exclusionlist -
Throws:: java.io.IOException

tokenStream

public final org.apache.lucene.analysis.TokenStream tokenStream(java.lang.String fieldName,
                                                                java.io.Reader reader)

Deprecated.

Creates a TokenStream which tokenizes all the text in the provided Reader.

Specified by:: tokenStream in interface Analyzer
Overrides:: tokenStream in class DefaultAnalyzer

Returns:: A TokenStream build from a StandardTokenizer filtered with StandardFilter, StopFilter, GermanStemFilter and LowerCaseFilter.

getAnalyzerType

protected java.lang.String getAnalyzerType()

Deprecated.

Overrides:: getAnalyzerType in class DefaultAnalyzer

See Also:: fr.gouv.culture.sdx.search.lucene.analysis.AbstractAnalyzer#getAnalyserType()

Overview

Package

Class

Tree

Deprecated

Index

Help

PREV CLASS NEXT CLASS

FRAMES NO FRAMES

SUMMARY: NESTED | FIELD | CONSTR | METHOD

DETAIL: FIELD | CONSTR | METHOD

fr.gouv.culture.sdx.search.lucene.analysis Class Analyzer_br

ANALYZER_TYPE

BRAZILIAN_STOP_WORDS

Analyzer_br

Analyzer_br

Analyzer_br

setStemExclusionTable

setStemExclusionTable

setStemExclusionTable

tokenStream

getAnalyzerType

fr.gouv.culture.sdx.search.lucene.analysis
Class Analyzer_br