Skip to main content

Catalog

The Catalog gives users a view of the data assets available in their workspace — tables, schemas, ownership, lineage, and access permissions — all in one place.

Services Behind This Feature

ServiceRole
BFF APIProxies requests from the Cogrion UI to the catalog backend.
DatahubMetadata catalog and lineage
SupersetData exploration and visualization
TrinoQuery engine
RangerProvides the access control layer — permissions visible in the catalog are enforced here at query time.
Hive MetastoreTable catalog

What a Table Detail Page Shows

When a user opens a specific table in the Catalog, they see:

TabContent
ColumnsColumn names, data types, Datahub tags, and descriptions
Sample dataA preview of actual row data from the table (fetched via Trino)
History / LineageUpstream and downstream tables — which tables feed this one and which depend on it
Data qualityResults from data quality checks run against the table

Lineage

Table and column lineage is populated by the Airflow Datahub plugin. When an Airflow DAG runs a pipeline that reads from or writes to tables, the plugin emits lineage events to Datahub automatically. Lineage is visible in the Catalog without any manual configuration on the data engineering side.

Go Deeper