HTMLMappingNormalizer
class HTMLMappingNormalizer(config)
Bases: Normalizer
The HTML Normalizer removes HTML markup.
Input: all input fields need to be of type str.
Output: all output fields are filled with data of type str.
Parameters:
Example
{
  "step": "normalizer",
  "type": "html_mapping",
  "input_fields": ["body"],
  "parse_html5": false,
  "output_fields": ["normalized_body"],
  "mapping_index_fields": ["mapping_index_body"]
}
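To illustrate what a step configured like this might produce, here is a minimal sketch in plain Python. It is not the library's implementation. It assumes that documents behave like dictionaries and that a mapping index field holds, for each character of the normalized text, the index of that character in the original input. The names strip_html_with_mapping and normalize_doc are hypothetical, and the regex-based tag removal is a deliberate simplification that ignores comments, scripts, entities, and the parse_html5 option.

```python
import re

# Naive tag pattern (assumption for this sketch only); a real HTML parser
# handles comments, scripts and malformed markup far more robustly.
TAG_RE = re.compile(r"<[^>]*>")


def strip_html_with_mapping(raw):
    """Return (text, mapping), where mapping[i] is the index in `raw`
    of the character that ends up at position i in `text`."""
    chars = []
    mapping = []
    pos = 0
    for match in TAG_RE.finditer(raw):
        # Copy everything between the previous tag and this one.
        for i in range(pos, match.start()):
            chars.append(raw[i])
            mapping.append(i)
        pos = match.end()
    # Copy the text after the last tag.
    for i in range(pos, len(raw)):
        chars.append(raw[i])
        mapping.append(i)
    return "".join(chars), mapping


def normalize_doc(doc, input_fields, output_fields, mapping_index_fields):
    """Hypothetical helper: apply the sketch to each configured field."""
    for src, dst, idx in zip(input_fields, output_fields, mapping_index_fields):
        text, mapping = strip_html_with_mapping(doc[src])
        doc[dst] = text
        doc[idx] = mapping
    return doc


doc = {"body": "<p>Hi <b>you</b></p>"}
normalize_doc(doc, ["body"], ["normalized_body"], ["mapping_index_body"])
print(doc["normalized_body"])     # Hi you
print(doc["mapping_index_body"])  # [3, 4, 5, 9, 10, 11]
```

Under these assumptions, an offset found in normalized_body (for example, the start of a match) can be translated back to the original body by a lookup in mapping_index_body.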
Methods Summary
process_doc(doc): Process a document.
Methods Documentation