Overview

Alation Cloud Service Applies to Alation Cloud Service instances of Alation

Customer Managed Applies to customer-managed instances of Alation

Enhanced Connector Enhanced connectors add extended capabilities and require a separate entitlement in addition to your Alation platform license.

The Dataiku OCF connector is used to catalog Dataiku data sources in Alation. It extracts Dataiku objects such as Projects, Workspaces, Reports, Recipes, Data sources, and other metadata. After extraction, users can search and find Dataiku objects, curate the corresponding catalog object pages, and understand data transformations using lineage diagrams.

The connector is distributed as a .zip package that you upload and install in the Alation application. You can download the latest connector package from the Alation Customer Portal (Customer Portal > Connectors).

Team

You may need assistance from the following administrators to install and configure this connector:

  • Dataiku Administrator

    • Creates a service account for Alation.

    • Provides the connection information and JDBC URI.

    • Provides authentication information and assists in configuring authentication.

  • Alation Server Admin

    • Ensures that Alation Connector Manager is installed and running (or installs it).

    • Installs the connector.

    • Creates and configures the Dataiku data source in the Alation catalog.

    • Performs initial extraction and prepares the data source for Alation users.

Scope

The table below lists the features supported by the connector and the metadata objects extracted.

Feature

Scope

Availability

Authentication

Basic (username and password)

Authentication with a service account using a username and password

Yes

SSO via OAuth

Authentication in Compose using the OAuth protocol

No

Metadata extraction (MDE)

Default MDE

Extraction of metadata from Dataiku based on default queries in the connector code

Yes

Query-based MDE

Extraction of metadata based on custom extraction queries provided by a user

No

Extracted metadata objects

Project

List of projects

Yes

Dataset

List of datasets

Yes

Recipe

List of recipes

Yes

Model

List of models

Yes

Dashboard

List of dashboards

Yes

Insight

Insights

Yes

Notebook

List of notebooks

Yes

Webapp

List of webapps

Yes

Analysis

List of analysis

Yes

Sampling and profiling

Table sampling

Retrieval of data samples from extracted tables

No

Column sampling

Retrieval of data samples from extracted columns

No

Custom query-based table sampling

Ability to use custom queries for sampling specific tables

No

Custom query-based column sampling

Ability to use custom queries for profiling specific columns

No

Deep column profiling

Profiling of columns with calculation of value distribution statistics

No

Dynamic profiling

Ability for individual users to connect with their own database accounts to retrieve table and column samples and profiles

No

Query Log Ingestion (QLI)

Table-based QLI

Ingestion of query history based on a table or view that contains query history data

No

Custom query-based QLI

Ingestion of query history based on a custom query

No

JOINs and filters

Calculation of JOIN and filter information based on ingested query history

No

Lineage

Cross-system lineage

Ability to map the data source in Dataiku to an actual data source in Alation

Yes

Compose

Customer-managed (on-premise) Alation instances

Compose for on-premise Alation instances

Yes

Alation Cloud Service instances

Compose on Alation Cloud Service instances. Depending on your network configuration, you may need to use Alation Agent to connect to your data source. Compose with Agent is supported.

Yes

Object mapping

The connector maps Dataiku objects to Alation catalog objects as listed below.

Dataiku object

Mapped object in Alation

Description

Projects

Folder

Owner, Creation Time, Modification Time, Project Description, and Project Permissions

Models

Report

Model Summary

Recipes

Report

Recipe Code

Datasets

Datasource

Datasource objects are cataloged as datasource columns

Insights

Report

Report objects are cataloged as measures and dimensions

Dashboard

Folder

Owner; Creation Time; Modification Time

Analysis

Folder

Owner; Creation Time; Modification Time

Notebook

Report

Source code

Webapp

Report

Chart

Report

Report objects are cataloged as measures and dimensions