org.apache.lucene.analysis.icu.segmentation

Class ICUTokenizerConfig

  • java.lang.Object
    • org.apache.lucene.analysis.icu.segmentation.ICUTokenizerConfig
    • Constructor Detail

      • ICUTokenizerConfig

        public ICUTokenizerConfig()
        Sole constructor. (For invocation by subclass constructors, typically implicit.)
    • Method Detail

      • getBreakIterator

        public abstract BreakIterator getBreakIterator(int script)
        Return a breakiterator capable of processing a given script.
      • getType

        public abstract String getType(int script,
                                       int ruleStatus)
        Return a token type value for a given script and BreakIterator rule status.
      • combineCJ

        public abstract boolean combineCJ()
        true if Han, Hiragana, and Katakana scripts should all be returned as Japanese