Trino Cluster
The Trino cluster capability deploys the actual Trino coordinator and worker pods into the cluster. It is distinct from the Trino foundation bundle, which only deploys the Trino Gateway and supporting infrastructure.
Spark Team
The Spark Team capability provisions an isolated Kubernetes namespace for a group of Spark users. One instance is deployed per team — multiple instances can coexist in the same cluster.
Celeborn
Celeborn is a remote shuffle service for distributed compute frameworks. It offloads Spark's shuffle phase — the expensive inter-stage data exchange — from executor pods to a dedicated, persistent shuffle cluster.
Cube
Cube is the semantic layer engine. It sits between Trino and downstream consumers (APIs, BI tools, AI agents) and provides a structured, versioned model of business metrics and dimensions — abstracting raw SQL schema into reusable semantic objects.
Ontology Backend
The Ontology Backend manages the semantic model — the structured definition of business entities, metrics, dimensions, and their relationships. It serves this model to Cube and to AI features that need business context to generate accurate SQL or explanations.
Text-to-SQL
Text-to-SQL translates natural language questions into SQL queries against the tenant's data. It uses the ontology model to understand business terminology and Trino to execute the resulting queries.
Workflow Backend
The Workflow Backend is the internal service that bridges the Cogrion UI and Airflow for DAG and workflow operations. It handles workflow lifecycle management — creating, triggering, and monitoring pipelines — and is the layer the BFF talks to for all workflow-related actions.
Agent-Based ML Platform
The Agent-Based ML Platform is an AI-driven backend that automates end-to-end machine learning workflows. It orchestrates dataset ingestion, feature engineering, model training, explainability, and pipeline creation — using LLM reasoning to decide and execute steps autonomously.
Pipeline Backend
The Pipeline Backend provides lower-level pipeline primitive operations, sitting downstream of the Workflow Backend in the dependency chain.
Dashboard Access Management
The Dashboard Access Management backend manages sharing and permission control for Superset dashboards. It is the service the BFF calls when a user grants or revokes access to a dashboard from the Cogrion UI.
Workspace File Management
The Workspace File Management backend manages user files in the JupyterHub workspace — browsing, uploading, downloading, and organizing files that live in S3. It is the service the BFF calls for all file operations in the Cogrion workspace UI.
Delta Spark Product Capabilities
This is an integration bundle — it deploys no long-running services. Its job is to wire together the Layer 2 stacks after they are all deployed: granting the right Ranger policies to platform service accounts and seeding any static objects that cross-service features depend on.
Chatbot Backend
The Chatbot Backend is the AI assistant service that powers the Cogrion conversational UI. It handles chat sessions, maintains conversation history, and routes requests to an LLM provider.