SklearnProjector#
- class SklearnProjector(config)#
Bases:
ProjectorThe generic scikit-learn
Projectorstep projects from one vector space to another. For more info see Decomposition.Note - So far only svd is supported -> TruncatedSVD
Input - all input fields need to be of type
list[floatorint] ornumpy.ndarrayOutput - all output fields are filled with data of type
numpy.ndarraywith shape (‘n_components’,)- Parameters:
Example
{ "step": "projector", "type": "sklearn", "model_type": "svd", "n_components": 100, "input_field": "embedded_text", "output_field": "embedded_text" }
Methods Summary
load()Load a step
process_batch(batch)Process a batch of documents.
save()Save a step
train(docs)Train on a step of a set of documents
Methods Documentation
- load()#
Load a step
- process_batch(batch)#
Process a batch of documents. If not defined will default to using self.process_doc for each document in the batch.
- save()#
Save a step