Find guides, tutorials, and documentation to help you get the most out of CubePath services.

Deploy vLLM on Linux for high-throughput LLM inference with PagedAttention. Learn installation, model loading, OpenAI-compatible API, quantization, and GPU memory optimization.
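As a taste of the OpenAI-compatible API the guide covers, here is a minimal sketch of querying a running vLLM server using only the standard library. The URL, port 8000 (vLLM's usual default), and the model name are assumptions for illustration:

```python
import json
import urllib.request

# Assumption: vLLM's OpenAI-compatible server is running locally on its default port.
VLLM_URL = "http://localhost:8000/v1/completions"

def build_completion_request(model: str, prompt: str, max_tokens: int = 64) -> urllib.request.Request:
    """Build an OpenAI-style /v1/completions request for a vLLM server."""
    payload = {"model": model, "prompt": prompt, "max_tokens": max_tokens}
    return urllib.request.Request(
        VLLM_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )

req = build_completion_request("mistralai/Mistral-7B-Instruct-v0.2", "Hello,")
# With a server running: body = urllib.request.urlopen(req).read()
```

Because the API mirrors OpenAI's, existing OpenAI client libraries can also be pointed at the vLLM base URL unchanged.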

Install Open WebUI to create a ChatGPT-like interface for self-hosted LLMs. Covers Docker deployment, Ollama integration, user management, RAG pipelines, and customization.
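To illustrate the Docker deployment with Ollama integration, here is a sketch that composes the commonly documented `docker run` invocation; the host port, volume name, and Ollama URL are adjustable assumptions:

```python
def open_webui_docker_cmd(ollama_url: str = "http://host.docker.internal:11434",
                          host_port: int = 3000) -> list[str]:
    """Compose a `docker run` command for Open WebUI (image and internal
    port 8080 follow the project's documented defaults)."""
    return [
        "docker", "run", "-d",
        "-p", f"{host_port}:8080",                       # WebUI listens on 8080 inside the container
        "--add-host=host.docker.internal:host-gateway",  # let the container reach Ollama on the host
        "-e", f"OLLAMA_BASE_URL={ollama_url}",
        "-v", "open-webui:/app/backend/data",            # persist users, chats, and settings
        "--name", "open-webui",
        "ghcr.io/open-webui/open-webui:main",
    ]

cmd = open_webui_docker_cmd()
# To actually launch: subprocess.run(cmd, check=True)
```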

Use MinIO as S3-compatible storage for ML workflows. Learn bucket organization for models and datasets, versioning, Python SDK integration, and high-performance data pipelines.
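The bucket-organization idea can be sketched as plain key-layout helpers; the `models/` and `datasets/` prefixes, bucket name, and filenames below are illustrative conventions, not MinIO requirements:

```python
def model_object_key(name: str, version: int, filename: str) -> str:
    """Versioned model layout: models/<name>/v<version>/<file>."""
    return f"models/{name}/v{version}/{filename}"

def dataset_object_key(name: str, split: str, filename: str) -> str:
    """Dataset layout grouped by split: datasets/<name>/<split>/<file>."""
    return f"datasets/{name}/{split}/{filename}"

key = model_object_key("bert-base", 3, "weights.safetensors")
# With the `minio` package and a running server, an upload would be roughly:
# Minio("localhost:9000", access_key=..., secret_key=..., secure=False) \
#     .fput_object("ml-artifacts", key, "weights.safetensors")
```

Keeping the version in the key makes rollbacks a matter of changing one path component.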

Install NVIDIA GPU drivers on Linux for compute and AI workloads. Covers driver selection, DKMS setup, kernel compatibility, verification, and troubleshooting common installation issues.
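For the verification step, a common check is parsing `nvidia-smi` query output. This sketch separates the parsing (tested on a hardcoded sample line, since no GPU is assumed here) from the actual call:

```python
import csv
import io
import subprocess

def parse_gpu_query(csv_text: str) -> list[dict]:
    """Parse `nvidia-smi --query-gpu=name,driver_version --format=csv,noheader` output."""
    rows = csv.reader(io.StringIO(csv_text))
    return [{"name": name.strip(), "driver": driver.strip()} for name, driver in rows]

def query_gpus() -> list[dict]:
    """Run nvidia-smi (assumes a working driver install) and parse the result."""
    out = subprocess.run(
        ["nvidia-smi", "--query-gpu=name,driver_version", "--format=csv,noheader"],
        capture_output=True, text=True, check=True,
    ).stdout
    return parse_gpu_query(out)

# Sample output line for illustration only:
gpus = parse_gpu_query("NVIDIA A100-SXM4-80GB, 550.54.14\n")
```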

Install and configure the NVIDIA CUDA Toolkit for GPU computing on Linux. Learn version selection, cuDNN setup, environment variables, multi-version management, and verification.

Deploy a secure Jupyter Notebook/Lab server on Linux for remote data science work. Covers installation, password protection, SSL with Nginx reverse proxy, and kernel management.
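A minimal sketch of the server-side settings such a setup typically uses, as a `jupyter_server_config.py` fragment (not standalone-executable: Jupyter supplies the `c` configuration object when it loads this file; the port and paths are assumptions matching a local Nginx upstream):

```python
# ~/.jupyter/jupyter_server_config.py -- loaded by Jupyter, which provides `c`
c.ServerApp.ip = "127.0.0.1"      # bind locally; Nginx terminates SSL and proxies here
c.ServerApp.port = 8888           # assumed port, matching the Nginx upstream block
c.ServerApp.open_browser = False  # headless server, no local browser
# Set a hashed login password interactively, e.g. with: jupyter server password
```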

Install Ollama on Linux to run large language models locally. Learn model management, GPU acceleration, API usage, Open WebUI integration, and performance optimization.
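As a sketch of the API usage the guide covers, here is a non-streaming request to Ollama's `/api/generate` endpoint using only the standard library; the model name is an assumption and port 11434 is Ollama's default:

```python
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/generate"  # Ollama's default local port

def build_generate_request(model: str, prompt: str) -> urllib.request.Request:
    """Build a non-streaming /api/generate request for a local Ollama server."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )

req = build_generate_request("llama3", "Why is the sky blue?")
# With Ollama running: json.loads(urllib.request.urlopen(req).read())["response"]
```

With `"stream": False` the server returns one JSON object instead of a stream of partial chunks.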

Deploy MLflow on Linux for experiment tracking, model registry, and ML lifecycle management. Covers server setup, backend storage, artifact stores, and team collaboration.
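The server-setup pieces fit together in one `mlflow server` invocation; this sketch composes it, with the SQLite backend store, S3/MinIO artifact root, and port 5000 (MLflow's default) as illustrative choices:

```python
def mlflow_server_cmd(backend_uri: str = "sqlite:///mlflow.db",
                      artifact_root: str = "s3://mlflow-artifacts",
                      host: str = "0.0.0.0", port: int = 5000) -> list[str]:
    """Compose an `mlflow server` command: the backend store holds experiment,
    run, and registry metadata; the artifact root holds models and files."""
    return [
        "mlflow", "server",
        "--backend-store-uri", backend_uri,
        "--default-artifact-root", artifact_root,
        "--host", host, "--port", str(port),
    ]

cmd = mlflow_server_cmd()
# subprocess.run(cmd) would start the tracking server for the whole team.
```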

Deploy TensorFlow Serving for production model inference on Linux. Learn installation methods, model versioning, REST and gRPC APIs, GPU support, and performance tuning.

Install Stable Diffusion WebUI (AUTOMATIC1111) on a Linux server for AI image generation. Covers dependency setup, GPU configuration, model downloads, and remote access.
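For remote use, the WebUI exposes an HTTP API when launched with the `--api` flag; this sketch builds a `txt2img` request against it, with port 7860 (the usual default) and all generation parameters as illustrative assumptions:

```python
import json
import urllib.request

# Assumption: AUTOMATIC1111 WebUI launched with --api, on its default port.
SD_URL = "http://localhost:7860/sdapi/v1/txt2img"

def build_txt2img_request(prompt: str, steps: int = 20,
                          width: int = 512, height: int = 512) -> urllib.request.Request:
    """Build a txt2img request for a running Stable Diffusion WebUI API."""
    payload = {"prompt": prompt, "steps": steps, "width": width, "height": height}
    return urllib.request.Request(
        SD_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )

req = build_txt2img_request("a lighthouse at dusk, oil painting")
# The response JSON carries base64-encoded images under the "images" key.
```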