Find guides, tutorials, and documentation to help you get the most out of CubePath services.

Deploy Apache Airflow for workflow orchestration on Linux. Covers installation methods, DAG creation, executor configuration (Local, Celery), connections, monitoring, and production setup.

Install Apache Spark for distributed data processing on Linux. Learn standalone cluster setup, PySpark configuration, job submission, Spark SQL, resource management, and monitoring.

Deploy Metabase for self-hosted business intelligence on Linux. Covers database connections, question builder, custom dashboards, embedding, user permissions, and performance tuning.

Install Apache Superset for data exploration and visualization on Linux. Learn database connections, chart creation, dashboard design, SQL Lab, caching, and user role management.

Install Vector for high-performance log collection and transformation on Linux. Covers sources, transforms, sinks, VRL scripting, topology design, and integration with monitoring stacks.

Deploy Fluent Bit as a lightweight log processor on Linux. Learn input plugins, parsing rules, filtering, output destinations, Kubernetes integration, and memory-efficient configuration.

Install Apache NiFi for visual data flow management on Linux. Covers processor configuration, flow design, data provenance, security, clustering, and common ETL patterns.

Install dbt (data build tool) for SQL-based data transformation on Linux. Learn project setup, model creation, testing, documentation, incremental models, and CI/CD integration.