Troubleshooting & FAQ#
Something is not working properly. How can I debug it?
You can add debug step before or after the step, which is problematic.
{
"step": "debugger",
"type": "log_fields",
"fields": ["body"],
"log_level": "warning"
}
As a result you will see something similar to:
2022-06-21 10:22:37,087 : squirro.lib.nlp.steps.debuggers.fields_debugger : WARNING : ++++++++++++++++++++++++++++++++++++++++++++++++++++
2022-06-21 10:22:37,087 : squirro.lib.nlp.steps.debuggers.fields_debugger : WARNING : Logging fields for Document '0' (skipped=False)
2022-06-21 10:22:37,087 : squirro.lib.nlp.steps.debuggers.fields_debugger : WARNING : 'body' (truncated) ----> '<html><body><p>This is a fake Squirro Item. It is composed of a couple fake sentences.</p></body></h'
Note that the list of fields needs to be adjusted based on the steps before and after.
- How can I test my ML Worklflow before uploading to the Squirro Platform?
Download the libNLP here and install it using pip
Create a python script as described in the runner documentation and execute it
- How can I uploading ML Worklflow to the Squirro Platform?
Use the Squirro Client function to create and upload a new ML Workflow
Example:
client = SquirroClient(None, None, cluster=CLUSTER)
client.authenticate(refresh_token=TOKEN)
client.new_machinelearning_workflow(
project_id=PROJECT_ID,
name = NAME_OF_THE_WORKFLOW,
config=CONFIG,
ml_models=PATH # this only needs to be set if trained models or custom steps are existing
)
- How can I use a ML Workflow in the enrich pipeline?
Follow the instructions in How To Publish ML Models Using the Squirro Client.
- I have developed my own machine learning approach and would like to use it for data enrichment in Squirro
Follow the instructions in How to Integrate a Custom ML Classifier.