The k3s cluster will also be configured for high availability (HA) using embedded etcd via the --cluster-init flag.
- Install k3s manually
curl -sfL https://get.k3s.io | K3S_KUBECONFIG_MODE="644" INSTALL_K3S_EXEC="server" sh -s - --flannel-backend none \
--disable traefik \
--disable servicelb \
--disable-network-policy \
--disable-kube-proxy \
--kube-controller-manager-arg bind-address=0.0.0.0 \
--kube-scheduler-arg bind-address=0.0.0.0 \
--etcd-expose-metrics \
--cluster-init
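Because the cluster is brought up with --flannel-backend none and no CNI yet, the nodes will sit in NotReady until Cilium is installed further down. A quick sanity check against the default k3s kubeconfig (world-readable thanks to K3S_KUBECONFIG_MODE="644"):

```bash
# Nodes will report NotReady until the CNI (Cilium) is installed below.
export KUBECONFIG=/etc/rancher/k3s/k3s.yaml
kubectl get nodes -o wide
```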
- Add extra masters if need be.
curl -sfL https://get.k3s.io | K3S_TOKEN=SECRET sh -s - server --server https://<hostname or ip>:6443 --flannel-backend none \
--disable traefik \
--disable servicelb \
--disable-network-policy \
--disable-kube-proxy \
--kube-controller-manager-arg bind-address=0.0.0.0 \
--kube-scheduler-arg bind-address=0.0.0.0 \
--etcd-expose-metrics
- Add workers
curl -sfL https://get.k3s.io | K3S_TOKEN="<token>" K3S_URL=https://<ip:port> sh -
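The join token used above (K3S_TOKEN) can be read from the first server node at the standard k3s path:

```bash
# Run on the first master; this value is the K3S_TOKEN for joining extra masters and workers.
sudo cat /var/lib/rancher/k3s/server/node-token
```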
- I recommend using the bootstrap values from kubernetes/core/cilium/bootstrap/values.yaml
helm repo add cilium https://helm.cilium.io/
helm install cilium cilium/cilium -f kubernetes/core/cilium/bootstrap/values.yaml --namespace kube-system
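Before joining more nodes, it's worth waiting for the agents to come up. A rough check (the second line assumes you also have the Cilium CLI installed):

```bash
# Watch the Cilium agent pods roll out.
kubectl -n kube-system get pods -l k8s-app=cilium -w

# Or, with the Cilium CLI, block until everything reports OK.
cilium status --wait
```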
Note: Add an entry to the BGP Neighbors table with the IP address of the Node you're adding.
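On the cluster side, the peering has to mirror what the router expects. As a rough sketch only (the ASNs, node selector, and router address below are placeholders, and the exact CRD depends on your Cilium version), a CiliumBGPPeeringPolicy looks something like this:

```yaml
apiVersion: cilium.io/v2alpha1
kind: CiliumBGPPeeringPolicy
metadata:
  name: bgp-peering
spec:
  nodeSelector:
    matchLabels:
      kubernetes.io/os: linux    # placeholder selector; match your nodes
  virtualRouters:
    - localASN: 64512            # placeholder ASN
      exportPodCIDR: true
      neighbors:
        - peerAddress: "10.0.0.1/32"   # the router, i.e. the BGP neighbor above
          peerASN: 64512               # placeholder ASN
```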
I use Tailscale on the router and another instance inside the cluster. Both of them broadcast my network and act as exit nodes for Tailscale clients.
- Apple TV exit node
- Cluster exit node
Having both isn't strictly necessary: if you lose internet, you won't be able to reach either of them remotely anyway. The exit node inside the cluster is the main one, but if I'm playing around with things inside the cluster and it breaks, I may no longer be able to use that exit node.
Not putting unnecessary strain on the router is the reason I'm running two exit nodes and two subnet broadcasts. If I have internet access and the cluster explodes for whatever reason, I'll at least still be able to access my network remotely.
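For reference, advertising a subnet and offering a machine as an exit node with a plain Tailscale install boils down to something like the following (the subnet is a placeholder for my LAN range; routes and exit nodes still need approval in the Tailscale admin console):

```bash
# Broadcast the local LAN and advertise this machine as an exit node.
sudo tailscale up --advertise-routes=10.0.0.0/24 --advertise-exit-node
```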
Be sure to set the Pod CIDR to the one you have chosen if you aren't using the k3s default of 10.42.0.0/16; otherwise, you will more than likely have issues.
The API server address must also be correct, otherwise the CNI will not be installed on the nodes. I have the three master node IP addresses registered with the HAProxy instance on my router on port 6443.
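These knobs live in kubernetes/core/cilium/bootstrap/values.yaml. As a minimal sketch of the values that matter here (the address below is a placeholder for my HAProxy frontend):

```yaml
kubeProxyReplacement: true        # k3s was installed with --disable-kube-proxy
k8sServiceHost: 10.0.0.2          # placeholder: HAProxy address fronting the masters
k8sServicePort: 6443
ipam:
  mode: cluster-pool
  operator:
    clusterPoolIPv4PodCIDRList:
      - 10.42.0.0/16              # must match the Pod CIDR you chose
```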
As I have UniFi hardware, there's no getting around the UniFi software controller, and it has to run somewhere outside the cluster. You have two options:
- Get a cloud key / security gateway
- Self-host on something reliable like a NAS.
I opted for option 2 because it's cheaper and does almost the same thing, except you manage your own backups and so on. When I just had APs this didn't bother me; however, now with switches, your NAS must be up and running before you can set up the switch.
Maybe this could've run on the router, but I did not want to introduce more stress there.
Ingress and Gateway API can co-exist; keep in mind that the DNS names simply have to be unique.
You'll notice in my repo that most of my external/internal services have both a route and an Ingress. After using the Gateway API extensively with Cilium, I've found that its implementation is not stable enough yet. For this reason I keep both, and when Cilium's Gateway API implementation decides to stop functioning, I still have the Ingress available.
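As a rough illustration of keeping the DNS unique (all names and hosts below are made up), the same service might be exposed under one hostname via a Gateway API route and a different one via Ingress:

```yaml
apiVersion: gateway.networking.k8s.io/v1
kind: HTTPRoute
metadata:
  name: app
spec:
  parentRefs:
    - name: external             # hypothetical Gateway
  hostnames:
    - app.example.com            # Gateway API hostname
  rules:
    - backendRefs:
        - name: app
          port: 80
---
apiVersion: networking.k8s.io/v1
kind: Ingress
metadata:
  name: app
spec:
  ingressClassName: nginx
  rules:
    - host: app-ing.example.com  # fallback hostname, distinct from the route's
      http:
        paths:
          - path: /
            pathType: Prefix
            backend:
              service:
                name: app
                port:
                  number: 80
```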
For the ingress controller, we need to add the following so that it picks up the real client IP address from the Cloudflare LB at L7:
data:
  use-forwarded-headers: "true"
  forwarded-for-header: "CF-Connecting-IP"
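Assuming the controller is ingress-nginx deployed via its Helm chart, those keys end up under controller.config in the chart values, roughly:

```yaml
controller:
  config:
    use-forwarded-headers: "true"
    forwarded-for-header: "CF-Connecting-IP"
```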
Generate a personal access token (PAT) that can create repositories by checking all permissions under repo. If a pre-existing repository is to be used the PAT’s user will require admin permissions on the repository in order to create a deploy key.
Flux Installation
export GITHUB_USER=<user>
export GITHUB_TOKEN=<token>
flux bootstrap github \
--owner=$GITHUB_USER \
--repository=home-cluster \
--branch=main \
--path=./kubernetes/bootstrap \
--personal
- Create an age key and install it as the SOPS decryption secret:
age-keygen -o age.agekey
cat age.agekey |
kubectl create secret generic sops-age \
--namespace=flux-system \
--from-file=age.agekey=/dev/stdin
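On the encryption side, a .sops.yaml at the root of the repository tells sops which files to encrypt against the age public key printed by age-keygen above. A sketch (the regexes and recipient are examples):

```yaml
# .sops.yaml — example creation rules; replace the recipient with your age public key.
creation_rules:
  - path_regex: .*\.sops\.ya?ml
    encrypted_regex: "^(data|stringData)$"
    age: age1examplepublickeyxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
```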
- Add these secrets
flux create secret oci ghcr-auth \
--url=ghcr.io \
--username=flux \
--password=${GITHUB_PAT}
flux create secret oci dockerio-auth \
--url=registry-1.docker.io \
--username=<username> \
--password=<password>
SOPS is only used for the HelmRelease(s) required by Bitwarden and external-secrets. Previously it was used throughout the repository; however, with external-secrets, Bitwarden, and webhooks, we've been able to reduce this dependency to just those pieces.
External Secrets uses a bitwarden-cli container to retrieve my vault items and creates Kubernetes secrets from them. Since external-secrets doesn't use the Bitwarden API directly, we have to go through a container running the CLI, driven by webhooks.
Also keep in mind that, since the bitwarden-cli container exposes your Bitwarden vault, it's good practice to limit who can communicate with it. See the network policy at kubernetes/core/bitwarden/network-policy.yaml
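The shape of that setup is a webhook-provider SecretStore pointed at the bitwarden-cli service. A rough sketch only (the service name, namespace, and JSON path below are assumptions; bw serve listens on 8087 by default):

```yaml
apiVersion: external-secrets.io/v1beta1
kind: ClusterSecretStore
metadata:
  name: bitwarden-fields
spec:
  provider:
    webhook:
      # Hypothetical service/namespace for the bitwarden-cli deployment.
      url: "http://bitwarden-cli.bitwarden.svc.cluster.local:8087/object/item/{{ .remoteRef.key }}"
      result:
        # Assumed response shape from `bw serve`; adjust to how your items are stored.
        jsonPath: "$.data.fields"
      headers:
        Content-Type: application/json
```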
- Ensure you use this sops-age secret for decrypting.
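Concretely, the Flux Kustomizations that contain SOPS-encrypted manifests reference that secret for decryption, along these lines (the name and path are just examples):

```yaml
apiVersion: kustomize.toolkit.fluxcd.io/v1
kind: Kustomization
metadata:
  name: example              # hypothetical Kustomization
  namespace: flux-system
spec:
  interval: 10m
  path: ./kubernetes/core    # example path
  prune: true
  sourceRef:
    kind: GitRepository
    name: flux-system
  decryption:
    provider: sops
    secretRef:
      name: sops-age
```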
Device | Count | OS Disk Size | Data Disk Size | RAM | Operating System | Purpose |
---|---|---|---|---|---|---|
J4125 RS34g | 1 | 250Gi mSATA | - | 16Gi | Opnsense 23 | Router |
Unifi Core Switch XG-16 | 1 | - | - | - | Unifi OS - 6.x | Switch |
Unifi Enterprise 24 PoE | 1 | - | - | - | Unifi OS - 6.x | Switch |
Beelink U59 N5105 | 3 | 500Gi M2 SATA | - | 16Gi | Ubuntu 22.04 | Kubernetes Masters |
MS-01 | 2 | 1 Ti U.2 NVMe | - | 64Gi | Ubuntu 22.04 | Kubernetes Worker |
NVIDIA - GPU PC | 1 | 2Ti NVMe | - | 32Gi | Ubuntu 22.04 | Kubernetes Worker |
Synology 920+ | 1 | 26Ti HDD / 2Ti NVMe | - | 4Gi | DSM 7 | NAS |
sudo apt install \
nftables \
nfs-common \
curl \
containerd \
open-iscsi \
vim \
gnupg \
net-tools \
dnsutils
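open-iscsi is in that list for storage provisioners that mount volumes over iSCSI; if yours does, make sure the initiator daemon is actually enabled:

```bash
# Enable and start the iSCSI initiator daemon.
sudo systemctl enable --now iscsid
```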
root@dlp:~# ls /etc/netplan
00-installer-config.yaml.org 01-netcfg.yaml
# edit network settings
root@dlp:~# vi /etc/netplan/01-netcfg.yaml
# change all like follows
# replace the interface name, IP address, DNS, Gateway to your environment value
# for [mode] section, set a mode you'd like to use
network:
  ethernets:
    enp1s0:
      dhcp4: false
      dhcp6: false
    enp7s0:
      dhcp4: false
      dhcp6: false
  bonds:
    bond0:
      addresses: [10.0.0.30/24]
      routes:
        - to: default
          via: 10.0.0.1
          metric: 100
      interfaces:
        - enp1s0
        - enp7s0
      parameters:
        mode: 802.3ad
        mii-monitor-interval: 100
  version: 2
# apply changes
root@dlp:~# netplan apply
# after setting bonding, [bonding] is loaded automatically
root@dlp:~# lsmod | grep bond
bonding 196608 0
tls 114688 1 bonding
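To confirm the LACP bond actually negotiated with the switch, the kernel exposes the bond state:

```bash
# Shows the bonding mode, LACP partner details, and per-slave link status.
cat /proc/net/bonding/bond0
```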
sudo apt install \
nftables \
nfs-common \
net-tools \
nvidia-driver-545 \
open-iscsi \
ca-certificates \
curl \
gnupg \
lsb-release \
containerd \
vim
curl -fsSL https://nvidia.github.io/libnvidia-container/gpgkey | sudo gpg --dearmor -o /usr/share/keyrings/nvidia-container-toolkit-keyring.gpg \
&& curl -s -L https://nvidia.github.io/libnvidia-container/stable/deb/nvidia-container-toolkit.list | \
sed 's#deb https://#deb [signed-by=/usr/share/keyrings/nvidia-container-toolkit-keyring.gpg] https://#g' | \
sudo tee /etc/apt/sources.list.d/nvidia-container-toolkit.list
sudo apt update && sudo apt install nvidia-container-toolkit
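After installing the toolkit, containerd still needs to be told about the NVIDIA runtime. If you're configuring the standalone containerd installed above, the toolkit ships a helper for that:

```bash
# Registers the nvidia runtime in /etc/containerd/config.toml, then restart containerd.
sudo nvidia-ctk runtime configure --runtime=containerd
sudo systemctl restart containerd
```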
Thanks to all the people who donate their time to the Home Operations Discord community. Be sure to check out kubesearch.dev for ideas on how to deploy applications or get ideas on what you may deploy.
- onedr0p
- bjw-s
- buroa
- LilDrunkenSmurf
For all their hard work and dedication.