Overview¶
Alation Cloud Service Applies to Alation Cloud Service instances of Alation
Customer Managed Applies to customer-managed instances of Alation
Enhanced Connector Enhanced connectors add extended capabilities and require a separate entitlement in addition to your Alation platform license.
The Dataiku OCF connector is used to catalog Dataiku data sources in Alation. It extracts Dataiku objects such as Projects, Workspaces, Reports, Recipes, Data sources, and other metadata. After extraction, users can search and find Dataiku objects, curate the corresponding catalog object pages, and understand data transformations using lineage diagrams.
The connector is distributed as a .zip package that you upload and install in the Alation application. You can download the latest connector package from the Alation Customer Portal (Customer Portal > Connectors).
Team¶
You may need assistance from the following administrators to install and configure this connector:
Dataiku Administrator
Creates a service account for Alation.
Provides the connection information and JDBC URI.
Provides authentication information and assists in configuring authentication.
Alation Server Admin
Ensures that Alation Connector Manager is installed and running (or installs it).
Installs the connector.
Creates and configures the Dataiku data source in the Alation catalog.
Performs initial extraction and prepares the data source for Alation users.
Scope¶
The table below lists the features supported by the connector and the metadata objects extracted.
Feature |
Scope |
Availability |
|---|---|---|
Authentication |
||
Basic (username and password) |
Authentication with a service account using a username and password |
Yes |
SSO via OAuth |
Authentication in Compose using the OAuth protocol |
No |
Metadata extraction (MDE) |
||
Default MDE |
Extraction of metadata from Dataiku based on default queries in the connector code |
Yes |
Query-based MDE |
Extraction of metadata based on custom extraction queries provided by a user |
No |
Extracted metadata objects |
||
Project |
List of projects |
Yes |
Dataset |
List of datasets |
Yes |
Recipe |
List of recipes |
Yes |
Model |
List of models |
Yes |
Dashboard |
List of dashboards |
Yes |
Insight |
Insights |
Yes |
Notebook |
List of notebooks |
Yes |
Webapp |
List of webapps |
Yes |
Analysis |
List of analysis |
Yes |
Sampling and profiling |
||
Table sampling |
Retrieval of data samples from extracted tables |
No |
Column sampling |
Retrieval of data samples from extracted columns |
No |
Custom query-based table sampling |
Ability to use custom queries for sampling specific tables |
No |
Custom query-based column sampling |
Ability to use custom queries for profiling specific columns |
No |
Deep column profiling |
Profiling of columns with calculation of value distribution statistics |
No |
Dynamic profiling |
Ability for individual users to connect with their own database accounts to retrieve table and column samples and profiles |
No |
Query Log Ingestion (QLI) |
||
Table-based QLI |
Ingestion of query history based on a table or view that contains query history data |
No |
Custom query-based QLI |
Ingestion of query history based on a custom query |
No |
JOINs and filters |
Calculation of JOIN and filter information based on ingested query history |
No |
Lineage |
||
Cross-system lineage |
Ability to map the data source in Dataiku to an actual data source in Alation |
Yes |
Compose |
||
Customer-managed (on-premise) Alation instances |
Compose for on-premise Alation instances |
Yes |
Alation Cloud Service instances |
Compose on Alation Cloud Service instances. Depending on your network configuration, you may need to use Alation Agent to connect to your data source. Compose with Agent is supported. |
Yes |
Object mapping¶
The connector maps Dataiku objects to Alation catalog objects as listed below.
Dataiku object |
Mapped object in Alation |
Description |
|---|---|---|
Projects |
Folder |
Owner, Creation Time, Modification Time, Project Description, and Project Permissions |
Models |
Report |
Model Summary |
Recipes |
Report |
Recipe Code |
Datasets |
Datasource |
Datasource objects are cataloged as datasource columns |
Insights |
Report |
Report objects are cataloged as measures and dimensions |
Dashboard |
Folder |
Owner; Creation Time; Modification Time |
Analysis |
Folder |
Owner; Creation Time; Modification Time |
Notebook |
Report |
Source code |
Webapp |
Report |
|
Chart |
Report |
Report objects are cataloged as measures and dimensions |