CharacterNormalizer

class squirro.lib.nlp.steps.normalizers.CharacterNormalizer(config)

Bases: squirro.lib.nlp.steps.normalizers.Normalizer

The character normalizer removes numeric digits from the input fields.

Input - all input fields need to be of type str or list[str].

Output - all output fields are filled with data of type str or list[str], matching the type of the corresponding input field.

Parameters

type (str) – character

Example

{
    "step": "normalizer",
    "type": "character",
    "input_fields": ["body"],
    "output_fields": ["normalized_body"]
}
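
To make the effect of this configuration concrete, here is a minimal standalone sketch of digit removal applied to str and list[str] fields. It does not use the Squirro library and is not its implementation: the helper names `strip_digits` and `apply_step`, and the use of a plain dict in place of a Document, are assumptions made only for illustration.

```python
import re
from typing import Dict, List, Union

Text = Union[str, List[str]]

# Note: in Python 3, \d on str patterns also matches non-ASCII Unicode digits.
_DIGITS = re.compile(r"\d")


def strip_digits(value: Text) -> Text:
    """Remove numeric digits from a string or from each string in a list."""
    if isinstance(value, str):
        return _DIGITS.sub("", value)
    return [_DIGITS.sub("", item) for item in value]


def apply_step(fields: Dict[str, Text], config: Dict) -> Dict[str, Text]:
    """Hypothetical stand-in for the step: pair each input field with its output field."""
    result = dict(fields)
    for src, dst in zip(config["input_fields"], config["output_fields"]):
        result[dst] = strip_digits(fields[src])
    return result


config = {
    "step": "normalizer",
    "type": "character",
    "input_fields": ["body"],
    "output_fields": ["normalized_body"],
}

doc_fields = {"body": "Order 66 shipped in 2 days"}
out = apply_step(doc_fields, config)
print(out["normalized_body"])
```

In this sketch only the digits are dropped, so the whitespace around them is left untouched, and the original `body` field is kept alongside the new `normalized_body` field.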

Methods Summary

process_doc(doc)

Process a document

Methods Documentation

process_doc(doc)

Process a document

Parameters

doc (Document) – Document to process

Returns

Processed document

Return type

Document