Quick Help: Data Import

cancel
Showing results for 
Search instead for 
Did you mean: 

Quick Help: Data Import

Importing AI-ready data into DataRobot is a simple process.

Make sure that your data is in a tabular format that adheres to the following minimum data format requirements.

Data format requirements

  • Supported File Types: csv, tsv, dsv, xls, xlsx, sas7bdat, bz2, gz, zip, tar, tgz
  • Supported Variable Types: numeric, categorical, boolean, text, date, currency, percentage, and length
  • Minimum Rows Required: 20
  • Maximum Rows Allowed: The maximum rows allowed for Trial users is 100,000

Specifying a target column

To build a predictive model using DataRobot, you need to specify a target for your data. A target is simply a column in your data with a header name that is easy to remember. DataRobot will automatically determine the type of machine learning problem based on the data in your dataset—multiclass classification, regression, or even Time Series.

(The dataset attached to this article shows an example of a defined target column. You can download this dataset and use it to test out model building.)

Importing from other data sources

While the AI Platform Trial is limited to local file imports, DataRobot provides a wide range of JDBC-compliant data sources. The URL supports importing data from a variety of sources, from HTTP to S3. You can use HDFS for ingesting data from Hadoop.

Is your data AI-ready?

Preparing data for Machine Learning can be an arduous task. Thankfully DataRobot Paxata makes data prep a snap. Learn more about DataRobot Paxata and get started with a trial here

 

Importing Data Modeling Options Deployment
Labels (1)
Version history
Revision #:
21 of 21
Last update:
‎12-08-2020 05:36 PM
Updated by:
 
Contributors