Solved: MLOPS support for Spark models that aren’t DataRob... - DataRobot Community

skip-turner · ‎02-12-2021

Hi all - Can I use MLOps with non-DataRobot Spark models? If I deploy a DataRobot JAR model with Apache Spark API on Databricks, can I use DataRobot MLOps with that deployment?

doyouevendata · ‎02-12-2021

Hi Skip -

We have an article on how to deploy a DataRobot scoring code jar model on Databricks, and monitor it from DataRobot MLOps. This leverages an agent architecture that reports performance and feature/prediction data back to DataRobot. You can find it here.

You can additionally monitor non-DataRobot models as well. Note the actual code calls simply send performance stats (number of rows, time to score) and data (features, prediction/model outputs) back to DataRobot, which has an entry created for the deployment based on model characteristics and a training dataset you provide. The model itself is independent from the monitoring operations; any custom model, or model created by another platform, could be monitored.

View solution in original post

tim · ‎02-12-2021

Yes and yes!

In general, you can use DataRobot MLOps with models deployed anywhere! DataRobot provides MLOps tracking agents that can be used monitor models where they live, and this info communicated back to the DataRobot MLOps platform, providing data drift, service health, and accuracy monitoring for your model.

A special case is a model deployed to Spark (Hadoop, Databricks, AWS EMR). Provided you could put your predictions and features in a Spark DataFrame, you will be able to monitor the model with ease. Hope this helps!

If you interested, I would encourage you checkout the webinar and articles linked below

https://community.datarobot.com/t5/sessions/monitoring-all-your-models-with-datarobot-mlops-agent/ba...

https://community.datarobot.com/t5/resources/working-with-remote-models/ta-p/7517

https://community.datarobot.com/t5/resources/how-to-monitor-spark-models-with-datarobot-mlops/ta-p/7...

View solution in original post

doyouevendata · ‎02-12-2021

Hi Skip -

We have an article on how to deploy a DataRobot scoring code jar model on Databricks, and monitor it from DataRobot MLOps. This leverages an agent architecture that reports performance and feature/prediction data back to DataRobot. You can find it here.

You can additionally monitor non-DataRobot models as well. Note the actual code calls simply send performance stats (number of rows, time to score) and data (features, prediction/model outputs) back to DataRobot, which has an entry created for the deployment based on model characteristics and a training dataset you provide. The model itself is independent from the monitoring operations; any custom model, or model created by another platform, could be monitored.

tim · ‎02-12-2021

Yes and yes!

In general, you can use DataRobot MLOps with models deployed anywhere! DataRobot provides MLOps tracking agents that can be used monitor models where they live, and this info communicated back to the DataRobot MLOps platform, providing data drift, service health, and accuracy monitoring for your model.

A special case is a model deployed to Spark (Hadoop, Databricks, AWS EMR). Provided you could put your predictions and features in a Spark DataFrame, you will be able to monitor the model with ease. Hope this helps!

If you interested, I would encourage you checkout the webinar and articles linked below

https://community.datarobot.com/t5/sessions/monitoring-all-your-models-with-datarobot-mlops-agent/ba...

https://community.datarobot.com/t5/resources/working-with-remote-models/ta-p/7517

https://community.datarobot.com/t5/resources/how-to-monitor-spark-models-with-datarobot-mlops/ta-p/7...

MLOPS support for Spark models that aren’t DataRobot?

MLOPS support for Spark models that aren’t DataRobot?

MLOps

Paxata Cache Folder

how to transform the var type in workbench

Understanding Model

Time Series Modelling

Trial Walkthrough Issue