Skip to main content

Edge & IoT Blueprint

In Development

This blueprint is in active design. No committed timeline. Content reflects planned architecture based on platform foundation and air-gap capabilities.

Purpose: For platform engineers and IoT architects, explains the planned Edge & IoT blueprint for lightweight distributed clusters with store-and-forward GitOps and device ingestion.

Overview

Edge and IoT deployments need clusters that run on constrained hardware, operate offline for extended periods, and ingest data from device fleets. This blueprint extends the platform foundation with lightweight cluster profiles, offline-first GitOps, and device ingestion patterns (MQTT/AMQP).

Key Capabilities

CapabilityDescriptionFoundation Component
Lightweight ClustersRight-sized profiles for constrained hardwareKubespray with reduced resource footprint
Offline GitOpsStore-and-forward reconciliation during connectivity lossFluxCD + local Gitea mirror (air-gap pattern)
Device IngestionMQTT and AMQP protocols for device fleet dataStrimzi MQTT Bridge (from streaming blueprint)
Air-Gap FirstSigned packages for fully disconnected sitesopenCenter-AirGap three-zone model
Fleet ManagementConsistent configuration across hundreds of edge sitesGit-based overlays + FluxCD per-site reconciliation
ARM64 SupportWorkloads on ARM-based edge hardwarePlatform images built for multi-arch (planned)

Edge Cluster Profiles

ProfileControl PlaneWorkersResourcesUse Case
Micro1 node (combined)04 CPU, 8 GB RAMSingle-purpose gateway/sensor
Mini1 node (combined)1–28 CPU, 16 GB RAMSmall retail/branch site
Standard3 nodes (HA)2–416 CPU, 32 GB RAMMedium edge site
HA Edge3 nodes (HA)4+32+ CPU, 64+ GB RAMLarge site with high availability

Offline Behavior

When an edge site loses connectivity:

  1. FluxCD continues reconciling from local Gitea mirror
  2. Observability stack runs locally (Prometheus, Loki)
  3. Alerts route to local notification channel
  4. Changes queue in local Git; sync when connectivity returns
  5. Device data buffers locally; forwards to central when available

When connectivity returns:

  1. Local Gitea syncs with central repository
  2. Queued changes reconcile
  3. Buffered telemetry and device data forwards to central
  4. Fleet management verifies site convergence

Service Comparison (Core vs Edge)

ServiceCore DCEdge SiteNotes
PrometheusFullFull (local)Federated to central when connected
GrafanaFullOptionalCentral dashboards aggregate edge metrics
LokiFullFull (local)Local retention, forward when connected
KyvernoFullFullSame policies at every site
FluxCDFullFullReconciles from local mirror
VeleroFullReducedLocal backup target, remote when connected
HarborFullNot deployedImages served from bastion registry
KeycloakFullCached tokensOperates with cached OIDC tokens during offline

Composition

Relationship to Telco Blueprint

The Telco blueprint addresses similar fleet-management and edge concerns for telecommunications infrastructure. Key differences:

  • Telco focuses on network function workloads, carrier-grade reliability, and central DC → edge site patterns
  • Edge & IoT focuses on constrained hardware, device ingestion protocols, and extended offline operation

Both share air-gap deployment patterns and fleet-wide GitOps consistency from the platform foundation.

Further Reading