Documentation

Knowledge Base

Find guides, tutorials, and documentation to help you get the most out of CubePath services.

Showing 8 of 8 guides

Apache Airflow Installation and Configuration

Deploy Apache Airflow for workflow orchestration on Linux. Covers installation methods, DAG creation, executor configuration (Local, Celery), connections, monitoring, and production setup.

April 4, 2026Read

Data Engineering

Apache Spark Installation on Linux

Install Apache Spark for distributed data processing on Linux. Learn standalone cluster setup, PySpark configuration, job submission, Spark SQL, resource management, and monitoring.

April 4, 2026Read

Data Engineering

Metabase Business Intelligence Installation

Deploy Metabase for self-hosted business intelligence on Linux. Covers database connections, question builder, custom dashboards, embedding, user permissions, and performance tuning.

April 4, 2026Read

Data Engineering

Apache Superset Data Visualization Installation

Install Apache Superset for data exploration and visualization on Linux. Learn database connections, chart creation, dashboard design, SQL Lab, caching, and user role management.

April 4, 2026Read

Data Engineering

Vector Log Collection and Transformation

Install Vector for high-performance log collection and transformation on Linux. Covers sources, transforms, sinks, VRL scripting, topology design, and integration with monitoring stacks.

April 4, 2026Read

Data Engineering

Fluent Bit Lightweight Log Processor

Deploy Fluent Bit as a lightweight log processor on Linux. Learn input plugins, parsing rules, filtering, output destinations, Kubernetes integration, and memory-efficient configuration.

April 4, 2026Read

Data Engineering

Apache NiFi Data Flow Installation

Install Apache NiFi for visual data flow management on Linux. Covers processor configuration, flow design, data provenance, security, clustering, and common ETL patterns.

April 4, 2026Read

Data Engineering

dbt Data Transformation Tool Installation

Install dbt (data build tool) for SQL-based data transformation on Linux. Learn project setup, model creation, testing, documentation, incremental models, and CI/CD integration.

April 4, 2026Read