cancel
Showing results for 
Search instead for 
Did you mean: 

API used for running project and data missing

API used for running project and data missing

When running autopilot from API, how can we know what will happen with missing data?

What techniques will Datarobot implement? How do you impute categorical variables?

please also send any any documentation pointers for this

t_chandra

Labels (1)
2 Replies
dalilaB
DataRobot Alumni

For numerical value, the median is used for imputing the data.  If you go to a trained model blueprint and click on the missing value imputed, you will see how we perform the impute.  We just impute using the medial, but if you click on copy and edit, you will be able to choose random imputing.

dalilaB_1-1648238388608.png

 

Now, for categorical features.  We add a missing class to represent the missing values.  

 

By the way, if you click on Copy and Edit (orange top right corner), you can not only edit the blueprint but see all the additional pre-processing

functionality we offer.  Just click on a node and then the + sign to see the list of pre-processing approaches we have.

 

dalilaB_2-1648238916400.png

 


I hope this answers your question.

 

 

0 Kudos