normalizers package

Functions

make_normalizer(config)

Normalizer factory

Classes

CharacterNormalizer(config)

Character normalizer that removes numeric digits

EmailParseNormalizer(config)

Email parser normalizer parse an email string to extract email body.

HTMLNormalizer(config)

HTML normalizer that removes HTML markup

LowercaseNormalizer(config)

Lowercase normalizer that lowercases everything

Normalizer(config)

The Normalizer step applies specific normalizations to fields for each Document.

PunctuationNormalizer(config)

Punctuation normalizer strips punctuation

SentimentTermNormalizer(config)

Extract positive and negative terms/phrases from given text.

SpacyNormalizer(config)

Multi-Lingual Spacy Text Analyzer.

StopwordsNormalizer(config)

Stopwords normalizer strips stopwords