When factoring in Prediction Explanations: https://app.datarobot.com/docs/modeling/analyze-models/understand/pred-explain/index.html
We found during testing that with out prediction explanations the response time is 20 milliseconds. With pred exp its 5 seconds. The latter being too long for every transaction in real time, as it's not good for the customer experience. We are looking to split the deployments and score at intervals (eg. 12/hrs)
Has anyone had a similar experience and presumably this is the normal expectation because of the extra compute required?
Thanks
Yeah awesome thanks Bogdan, I had read SHAP was multiple times (5-20x) faster, although limited to certain models (eg. linear, tree based, etc)
Appreciate the response, I will investigate.
Regards
Hello Katandra!
This is proper behavior for permutation-based prediction explanations. If latency and prediction explanations are key components then try to create another project with SHAP explanations - this way latency should remain the same magnitude as without any explanations.