Tokenizer#
- class Tokenizer(config)#
Bases:
BatchedStep
The
Tokenizer
step takes specified fields and splits them into tokens to be used by a downstream step.
Bases: BatchedStep
The Tokenizer
step takes specified fields and splits them into tokens
to be used by a downstream step.