Cilium is a cloud native technology for networking, observability, and security.[1] It is based on the kernel technology eBPF, originally for better networking performance, and now leverages many additional features for different use cases. The core networking component has evolved from only providing a flat Layer 3 network for containers to including advanced networking features, like BGP and Service mesh, within a Kubernetes cluster, across multiple clusters, and connecting with the world outside Kubernetes.[1] Hubble was created as the network observability component and Tetragon was later added for security observability and runtime enforcement.[1] Cilium runs on Linux and is one of the first eBPF applications being ported to Microsoft Windows through the eBPF on Windows project.[7]
History
Evolution from Networking CNI (Container Network Interface)
Cilium began as a networking CNI[8] for container workloads. It was originally IPv6 only and supported multiple container orchestrators, like Kubernetes. The original vision for Cilium was to build an intent and identity-based high-performance container networking platform.[9] As the cloud native ecosystem expanded, Cilium added new projects and features to address new problems in the space.
The table below summarises some of the most significant milestones of this evolution:
December 2015 - Initial commit to the Cilium project[10]
May 2016 - Network policy was added, expanding the scope beyond just networking[11]
August 2016 - Cilium was initially announced during LinuxCon as a project providing fast IPv6 container networking with eBPF and XDP.[9] Today, Cilium has been adopted by major cloud provider's Kubernetes offerings and is one of the most widely used CNIs.
August 2017 - ebpf-go was created as a library to read, modify, and load eBPF programs and attach them to various hooks.[12]
April 2018 - Cilium 1.0 is the first stable release[13]
November 2019 - Hubble was launched to provide eBPF-based observability to network flows[14]
August 2020 - Chosen by Google as the basis for their Kubernetes Dataplane v2[15]
September 2021 - AWS picks Cilium for Networking & Security on EKS Anywhere[16]
October 2021 - Pwru was launched for tracing network packets in the Linux kernel with advanced filtering capabilities[17][18]
October 2021 - Accepted into CNCF as an incubation level project[19]
December 2021 - Cilium Service Mesh launched to help manage traffic between services[20]
May 2022 - Tetragon open sourced to cover security observability and runtime enforcement[21][22]
April 2023 - Cilium Mesh launched to connect workloads and machines across cloud, on-prem, and edge[25][26][27]
April 2023 - First CiliumCon hosted as a part of KubeCon[28]
October 2023 - Cilium becomes a CNCF Graduated project [29]
CNCF
Cilium was accepted into the Cloud Native Computing Foundation on October 13th, 2021 as an incubation-level project. It applied to become a graduated project on October 27th 2022.[19] It became a Graduated project one year later. Cilium is one of the fastest-moving projects in the CNCF ecosystem.[30]
Adoption
Cilium has been adopted by many large-scale production users, including over 100 that have stated it publicly,[31] for example:
Datadog uses Cilium as their CNI and kube-proxy replacement[32][33]
Ascend uses Cilium as their one CNI across multiple cloud providers[34]
Bell Canada uses Cilium and eBPF for telco networking[35][36]
Cosmonic uses Cilium for their Nomad-based PaaS[37][38][39]
IKEA uses Cilium for their self-hosted bare-metal private cloud[40]
Sky uses Cilium as their CNI and for network security[42]
The New York Times uses Cilium on EKS for multi-region multi-tenant shared clusters[43]
Trip.com uses Cilium both on premise and in AWS[44]
Cilium is the CNI for many cloud providers including Alibaba,[45] APPUiO,[46] Azure,[47] AWS,[16] DigitalOcean,[48] Exoscale,[49] Google Cloud,[15] Hetzner,[50] and Tencent Cloud.[51]
Projects Overview
Cilium
Cilium began as a container networking project. With the growth of Kubernetes and container orchestration, Cilium became a CNI,[8] providing basic things like configuring container network interfaces and Pod to Pod connectivity. From the beginning, Cilium based its networking on eBPF rather than iptables or IPVS, betting that eBPF would become the future of cloud native networking.[52]
Cilium’s eBPF based dataplane provides a simple flat Layer 3 network with the ability to span multiple clusters in either a native routing or overlay mode with Cilium Cluster Mesh. It is Layer 7-protocol aware and can enforce network policies on Layer 3 to Layer 7 and with FQDN using an identity-based security model that is decoupled from network addressing.
Cilium implements distributed load balancing for traffic between Pods and to external services, and is able to fully replace kube-proxy,[53] using XDP, socket-based load-balancing and efficient hash tables in eBPF. It also supports advanced functionality like integrated ingress and egress gateways,[54] bandwidth management, a stand-alone load balancer, and service mesh.[55]
Cilium is the first CNI to support advanced kernel features such as BBR TCP congestion control[56] and BIG TCP[57] for Kubernetes Pods.[58]
Hubble
Hubble is the observability, service map, and UI of Cilium which is shipped with the CNI.[59][60] It can be used to observe individual network packet flows, view network policy decisions to allow or block traffic, and build up service maps showing how Kubernetes services are communicating.[61] Hubble can export this data to Prometheus, OpenTelemetry, Grafana, and Fluentd for further analysis of Layer 3/4 and Layer 7 metrics.[62]
Tetragon
Tetragon is the security observability and runtime enforcement project of Cilium.[63] Tetragon is a flexible Kubernetes-aware security observability and runtime enforcement tool that applies policy and filtering directly with eBPF. It allows users to monitor and observe the complete lifecycle of every process execution on their machine, translate policies for file monitoring, network observability, container security, and more into eBPF programs, and do synchronous monitoring, filtering, and enforcement completely in the kernel.
Go eBPF Library
ebpf-go is a pure-Go library to interact with the eBPF subsystem in the Linux kernel.[64] It has minimal external dependencies, emphasises reliability and compatibility, and is widely deployed in production.
Pwru
pwru ("Packet, where are you?") is an eBPF-based tool for tracing network packets in the Linux kernel with advanced filtering capabilities. It allows fine-grained introspection of kernel state to facilitate debugging network connectivity issues. Under the hood, pwru attaches eBPF debugging programs to all Linux kernel functions which are responsible for processing network packets.
This gives a user finer-grained view into a packet processing in the kernel than with tcpdump, Wireshark, or more traditional tools. Also, it can show packet metadata such as network namespace, processing timestamp, internal kernel packet representation fields, and more.
Use Cases
Networking
Cilium began as a networking project and has many features that allow it to provide a consistent connectivity experience from Kubernetes workloads to virtual machines and physical servers running in the cloud, on-premises, or at the edge. Some of these include:
Container Network Interface (CNI)[65] - Provides networking for Kubernetes clusters
Layer 4 Load Balancer[66] - Based on Maglev[67][68] and XDP[69] for handling north/south traffic
Cluster Mesh[70] - Combines multiple Kubernetes clusters into one network
Bandwidth and Latency Optimization[71] - Fair Queueing, TCP Optimization, and Rate Limiting
kube-proxy replacement[72] - Replaces iptables with eBPF hash tables
BGP[73] - Integrates into existing networks and provides load balancing in bare metal clusters
Egress Gateway[74] - Provides a static IP for integration into external workloads
Service Mesh[75][76] - Includes ingress, TLS termination, canary rollouts, rate limiting, and circuit breaking
Gateway API[77] - Fully conformant implementation for managing ingress into Kubernetes clusters
SRv6[78] - Defines packet processing in the network as a program
BBR support for Pods[79] - Allows for better throughput and latency for Internet traffic
NAT 46/64 Gateway[80] - Allows IPv4 services to talk with IPv6 ones and vice versa
BIG TCP for IPv4/IPv6[81] - Enables better performance by reducing the number of packets traversing the stack
Cilium Mesh[82][83] - Connects workloads running outside Kubernetes to ones running inside it
Observability
Being in the kernel, eBPF has complete visibility of everything that is happening on a machine. Cilium leverages this with the following features:
Service Map[84] - Provides a UI for network flows and policy
Network Flow Logs[85] - Provides Layer 3/4 and DNS visibility connected to identity
Network Protocol Visibility[86] - Including HTTP, gRPC, Kafka, UDP, and SCTP
Metrics & Tracing Export[87] - Sends data to Prometheus, OpenTelemetry, or other storage system
Security
eBPF can stop events in the kernel for security. Cilium projects leverage this through the following features: