Class SimpleTransformer

java.lang.Object
com.yahoo.language.simple.SimpleTransformer
All Implemented Interfaces:
Transformer

public class SimpleTransformer extends Object implements Transformer
Converts all accented characters into their de-accented counterparts followed by their combining diacritics, then strips off the diacritics using a regex.
Author:
Simon Thoresen Hult
  • Constructor Details

    • SimpleTransformer

      public SimpleTransformer()
  • Method Details

    • accentDrop

      public String accentDrop(String input, Language language)
      Description copied from interface: Transformer
      Remove accents from input text.
      Specified by:
      accentDrop in interface Transformer
      Parameters:
      input - text to transform
      language - language of input text
      Returns:
      text with accents removed, or input-text if the feature is unavailable