With the introduction of Analytics Marketplace in v7.1.0, discovery and search capabilities across multiple data catalog platforms has been significantly expanded.
You can search for the associated database table assets without leaving the Metric Insights UI and without recreating them as objects in Metric Insights. Global Search will return the assets with the list of the available metadata and links to proceed to the asset in the data catalog tool.
The process consists of the following steps:
- Create a Dataset to retrieve all Table Assets metadata into Metric Insights.
- Create a derived Dataset from the Table Assets Dataset to generate JSON objects for metadata fields available in the data catalog tool and include any additional custom fields specific to your environment.
NOTE: The Columns and Tags fields are already formatted in JSON. - Add the Dataset to Search and map the Dataset Columns.
- Add Metadata Fields to Search — include the fields from Step 2 so they are indexed and displayed in the search results for each object, as shown in the final screen.
PREREQUISITES:
- Established connectivity with Collibra.
1. Create Dataset from Tables
For more details on how to create Datasets, refer to Understanding Datasets.
For more details on how to create a Dataset form Collibra, refer to Create Dataset from Collibra.
- Create a Dataset from the Tables object under All Data Assets.
- Note the Dataset ID to be used in Step 2.
2. Create Dataset from Tables Dataset
- Data Source: select the Data Source used as Data Storage when creating Dataset from the Collibra Tables object.
- Query: the query is intended to:
- Set Object Type Column to "Table";
- Transform standard Collibra fields, such as Status, Created By, Last Modified By, etc., into JSON objects.
- Transform user-added fields into JSON objects.
- This step will ensure that metadata is searchable and appears in search properly formatted.
- The query must include the ID of Dataset created from the Collibra All Table Assets object in "
FROM dataset_datasetID".
- Make sure the Dataset Column Length for "columns" is set high enough. This may be necessary if your table contains many columns or very detailed descriptions.
SELECT display_name
, asset_url
, domain
, "Table" as "type"
, columns
, JSON_OBJECT('Status', status ) AS "Status"
, JSON_OBJECT('Created By',created_by
, 'Created On',created_on
) AS "Created"
, JSON_OBJECT('Last Modified By',last_modified_by
, 'Last Modified On', last_modified_on
) AS "Last Modified"
from dataset_XXX
Acceptable JSON Formats:
{"Pii Data": ""}
{"Data Steward": "Clyde, Curt", "Business Owner": "Clint", "Technical Owner": "Kyle"}
[{"label":"Catalog","value":"Gifts"}]
[{"name":"Product Type","description":"High level grouping of our products."}]
4. Add Dataset to Search
Access Admin > System > Search Setup
- Under Datasets to include in Search, [+Add Dataset].
- Dataset & View: choose the Dataset created in Step Create Dataset from Tables Dataset.
- [Save]
5. Map Dataset Columns
Use the following mapping example with Allow click through to external tool enabled.
| Field | Dataset Column |
|---|---|
| Title Column | display_name |
| External Catalog Type | Collibra |
| External Catalog Asset URL | asset_url |
| Content Type Column | domain |
| Object Type Column | type |
For more details on configuring search across Datasets and Access, refer to Configure Search Across Specific Dataset (v7.1.0+).
6. Add Dataset Columns for Search
- Under Dataset Columns for Search, [+Add Dataset Columns].
- Dataset Column: choose a custom field column from the Dataset.
- Column data is formatted in JSON: enable. Must be enabled for all added columns, if multiple.
- Icon: optionally, select an icon.
- [Save] and add as many as needed.
- [Back to Search Setup] and [Update All Search Indexes]at the upper right.