@MagmaMan
Is connecting to Oracle Datawarehouse seamless in Paxata?
Yes, we provide a database connector that supports many databases including:
We also support database connectivity to the following Hive versions:
- Hive (CDH5) - Version: CDH 5.12-5.14
- Hive (HDP2) - Version: HDP 2.6.3
For details about configuring a database connection in Paxata, please view:
https://community.datarobot.com/t5/admin-corner/how-to-configure-a-jdbc-data-source/m-p/6493#M46
For a list of all connectivity options in our 2018.2 release, please view:
https://community.datarobot.com/t5/admin-corner/what-are-the-data-connections-i-can-access-within-pa...
How does the processing happen?
From what I currently understand, your workflow would look like this in Paxata:
- Configure the JDBC Connector for your Oracle warehouse
- Configure the Hive Connector for your Hive database
- Import data from Oracle using the JDBC Connector, either by browsing to a table or by using SQL queries, to create a cached Data Set in the Paxata Data Library
- Import data from Hive using the Hive Connector, either by browsing to a table or by using SQL queries, to create a cached Data Set in the Paxata Data Library
- Create a Project in Paxata to perform Data Preparation
- Load your initial Data Set into the Project. This would likely be your Oracle data
- Add additional Data Sets to the Project via append (similarly structured data, to create a longer data set) or via lookup (join additional data to each row)
- Transform your data into a final prepared structure
- Export the Answer Set to Hive using the Hive Connector
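To make the difference between append and lookup in the steps above concrete, here is a minimal sketch in plain Python. It is not Paxata code or a Paxata API: the `append` and `lookup` functions, the sample rows, and the column names (`order_id`, `customer_id`, `region`) are all hypothetical and only illustrate the two operations on small in-memory data.

```python
from typing import Dict, List

Row = Dict[str, object]


def append(base: List[Row], extra: List[Row]) -> List[Row]:
    """Stack similarly structured rows under the base rows (a longer data set)."""
    return base + extra


def lookup(base: List[Row], ref: List[Row], key: str) -> List[Row]:
    """Join columns from ref onto each base row that has a matching key.

    Base rows without a match are kept unchanged. If ref repeats a key,
    the last matching ref row is used. Base values win on column conflicts.
    """
    index = {r[key]: r for r in ref}
    return [{**index.get(row[key], {}), **row} for row in base]


# Hypothetical stand-ins for data imported from Oracle and Hive.
oracle_orders = [
    {"order_id": 1, "customer_id": 10, "amount": 250.0},
    {"order_id": 2, "customer_id": 11, "amount": 75.5},
]
more_orders = [{"order_id": 3, "customer_id": 10, "amount": 20.0}]
hive_customers = [{"customer_id": 10, "region": "EMEA"}]

combined = append(oracle_orders, more_orders)               # more rows
prepared = lookup(combined, hive_customers, "customer_id")  # more columns
```

Append grows the data set by rows, while lookup grows it by columns. In this sketch the order for customer 11 has no matching customer row, so it passes through without a `region` value.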
Paxata just pulls the data from Oracle directly into Spark every time the project is run?
After your initial Data Preparation (above), you can schedule a recurring execution of your Data Preparation Project using our Automation feature.
- You can choose to import new data from your databases on each execution or to use the latest cached version available in the Data Library. During execution, Paxata will load the data into Spark according to your specification.
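The choice between re-importing and reusing the cached version can be pictured with the short Python sketch below. It is an illustration only, not how Paxata is implemented: `resolve_input`, the `library` dictionary standing in for the Data Library, and the `fetch` callable standing in for a database import are all hypothetical names.

```python
from typing import Callable, Dict, List

Row = Dict[str, object]


def resolve_input(
    name: str,
    library: Dict[str, List[Row]],
    fetch: Callable[[], List[Row]],
    refresh: bool,
) -> List[Row]:
    """Return the rows for one scheduled run.

    refresh=True  -> import from the source again and update the cached copy.
    refresh=False -> reuse the latest cached copy, importing only if none exists.
    """
    if refresh or name not in library:
        library[name] = fetch()
    return library[name]


# Hypothetical source whose contents change between runs.
fetch_calls = []

def fetch_orders() -> List[Row]:
    fetch_calls.append(1)
    return [{"order_id": n} for n in range(len(fetch_calls))]


library: Dict[str, List[Row]] = {}
first = resolve_input("orders", library, fetch_orders, refresh=False)   # no cache yet: imports
cached = resolve_input("orders", library, fetch_orders, refresh=False)  # reuses the cached copy
fresh = resolve_input("orders", library, fetch_orders, refresh=True)    # imports again
```

With `refresh=False` the second run never touches the source, so it sees the same rows as the first run. With `refresh=True` the run picks up whatever the source holds at execution time.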
Thanks for your question.
Bill