Detect data drift from the different version model... - DataRobot Community

Rod · ‎10-17-2021

Hi, there.

I have a model deployment that retrains the model on a schedule.

Is there any method to detect data drift between the different model versions?

For example, DatasetV1 for ModelV1 training, DatasetV2 for ModelV2 training.

In abstract terms, the time ranges are:

DatasetV1: 1~10, DatasetV2: 11~20 or 1~20.

How can I tell whether the data has drifted between 1~10 and 11~20 (or 1~20)?

IraWatt · ‎10-18-2021

Hi @Rod, can you clarify what you mean by 'mothed'? Data Drift is calculated by comparing the training data to the prediction data (also known as inference data). Drift is specific to the training data used, so for ModelV2 (trained on 11~20) to monitor drift, it would need to predict on 1~10. Conversely, ModelV1 (trained on 1~10) would need to predict on 11~20.
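To make the "compare training data to prediction data" idea concrete, here is a minimal offline sketch using the Population Stability Index (PSI), one common drift metric. It is not DataRobot's implementation and does not use the DataRobot API; the function name `psi`, the bin count, and the sample values are all illustrative assumptions.

```python
import math


def psi(baseline, comparison, bins=10):
    """Population Stability Index between two numeric samples (illustrative helper)."""
    lo, hi = min(baseline), max(baseline)
    width = (hi - lo) / bins or 1.0  # fall back to 1.0 if the feature is constant

    def proportions(values):
        counts = [0] * bins
        for v in values:
            # Values outside the baseline range are clamped into the edge bins.
            idx = int((v - lo) / width)
            counts[min(max(idx, 0), bins - 1)] += 1
        # A small floor avoids log(0) when a bin is empty.
        return [max(c / len(values), 1e-6) for c in counts]

    p = proportions(baseline)
    q = proportions(comparison)
    return sum((qi - pi) * math.log(qi / pi) for pi, qi in zip(p, q))


# Made-up values of one feature, standing in for DatasetV1 (1~10)
# and DatasetV2 (11~20); real data would come from your own datasets.
dataset_v1 = [float(x % 10) for x in range(1000)]
dataset_v2 = [float(x % 10) + 3.0 for x in range(1000)]

print(psi(dataset_v1, dataset_v1))  # identical samples: no drift
print(psi(dataset_v1, dataset_v2))  # shifted sample: larger value
```

The first dataset plays the role of the training (baseline) data and the second the role of the prediction data, so swapping the arguments corresponds to swapping which model's training set is treated as the reference.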

Rod · ‎10-18-2021

@IraWatt Thanks for your reply.

That was a typo, I hope you don't mind 🙂

The dataset timeline is 1, 2, 3, ..., 18, 19, 20, from oldest to newest.

As stated above, ModelV1 predicts on 10~20 and ModelV2 predicts on 20~30.

So, to clarify my question: is it feasible to detect data drift between two training datasets in DataRobot?

Thanks.

IraWatt · ‎10-18-2021

No worries 🙂 Yes, DataRobot can detect data drift between any two datasets. The difference between the two datasets is shown in the 'Feature Drift vs. Feature Importance' chart and the 'Feature Details' chart.

Rod · ‎10-19-2021

Hi, @IraWatt

I have already read the Data Drift documentation, but I couldn't tell which part covers "detect data drift between two training sets."

If there is a demo screenshot on the official site, that would be ideal.

I would appreciate your help with this.

Thanks.

IraWatt · ‎10-19-2021

Hey @Rod, the key point is that it doesn't matter that the data is 'training data'. All you need is a trained model: have it predict on some new data (in this case, the training data used for the other model), and the Data Drift tab will show how the two datasets differ. I am assuming the target is the same for both models?
