$ guides / kubernetes / kubernetes-bound-service-account-tokens ▌

Operations Guides

Kubernetes bound service account tokens: rotation, audience, and expiry

Pods fail with 401 Unauthorized, CSI volume mounts hang with token errors, or security audits surface long-lived credentials that never rotate. Bound tokens are projected, audience-scoped, and short-lived. Legacy tokens are static Secrets that persist forever. After Kubernetes 1.24, both coexist in most upgraded clusters. This guide covers the TokenRequest API lifecycle, kubelet rotation behavior, audience binding, and version-specific changes from 1.24 through 1.33 to help you diagnose auth failures and remove stale credentials safely.

What this means

Kubernetes bound service account tokens are JWTs issued by the TokenRequest API. Each token is bound to a specific Pod, carries an aud claim array, and expires. The default TTL is 1 hour (3600 seconds), with a minimum of 10 minutes (600 seconds). The kubelet requests the token from the API server using the Pod’s ServiceAccount and mounts it via the serviceaccount volume plugin. It rotates the file when the token age exceeds 80 percent of its TTL or when it is older than 24 hours. At the default TTL, rotation begins roughly 48 minutes after issuance.

If no audience is specified in the TokenRequest, the token audience defaults to the API server identifier (the value of --service-account-issuer). Recipients must verify that they identify as one of the aud values. A token projected with audience: vault cannot be validated by the Kubernetes TokenReview API unless api or the matching issuer audience is included in the audience list.

Projected token files are written with mode 0600, or 0640 if the Pod sets securityContext.fsGroup. Legacy secret-mounted tokens default to world-readable 0644.

Legacy tokens are Secret objects of type kubernetes.io/service-account-token. They are static, unbound, and carry no expiration. Since v1.24, Kubernetes no longer auto-creates these Secrets for every ServiceAccount, but clusters upgraded from older versions often retain them. From v1.29, unused legacy tokens are labeled kubernetes.io/legacy-token-invalid-since and tracked by the serviceaccount_stale_tokens_total metric. The LegacyServiceAccountTokenCleanUp controller, GA in v1.30, automatically deletes unused legacy tokens one year after they are marked invalid (two years of total inactivity).

flowchart LR
    A[TokenRequest API] -->|binds to Pod, audience, TTL| B[Projected Volume]
    B --> C[Kubelet Volume Plugin]
    C -->|rotate at >80% TTL or >24h| D[Token File on Disk]
    D --> E[Application or CSI Driver]
    E -->|verify aud| F[K8s API or External IDP]

Common causes

Cause	What it looks like	First thing to check
Audience mismatch	Token rejected by external provider or TokenReview returns invalid audience	Decode the JWT `aud` claim and compare it to the verifier’s expected audience
Kubelet rotation stall	401 after roughly 48-60 minutes; token file modification time is old	`stat` the projected token file inside the Pod and compare its age to `expirationSeconds`
Legacy secret deleted or expired	Pre-1.24 workload gets 401; Secret reference is missing or empty	`kubectl get secrets` for type `kubernetes.io/service-account-token` in the namespace
CSI driver token request blocked	Volume mount fails with “failed to get service account token attributes”	`CSIDriver.spec.tokenRequests` audience against NodeRestriction enforcement
Stale legacy token accumulation	etcd database grows; security scan flags non-expiring credentials	Secrets labeled `kubernetes.io/legacy-token-invalid-since`
Provider breakage post-1.24	Terraform or CD pipeline fails because `default_secret_name` is absent	Provider version and explicit token Secret creation

Quick checks

# Decode a projected token to inspect claims
kubectl exec -n <ns> <pod> -- cat /var/run/secrets/kubernetes.io/serviceaccount/token | \
  cut -d. -f2 | base64 -d 2>/dev/null | jq '{aud, exp, iat}'

# Check token file age in seconds to verify rotation cadence
kubectl exec -n <ns> <pod> -- sh -c 'echo $(( $(date +%s) - $(stat -c %Y /var/run/secrets/kubernetes.io/serviceaccount/token) ))'

# List legacy service account token Secrets across the cluster
kubectl get secrets -A --field-selector type=kubernetes.io/service-account-token \
  -o custom-columns='NAMESPACE:.metadata.namespace,NAME:.metadata.name' --no-headers

# Check a CSIDriver's token request audience
kubectl get csidriver <name> -o jsonpath='{.spec.tokenRequests}'

# Check API server token issuer setting on a control plane node
ps aux | grep kube-apiserver | grep -o '\-\-service-account-issuer=[^ ]*'

# Find stale legacy tokens marked invalid (v1.29+)
kubectl get secrets -A -o json | \
  jq -r '.items[] | select(.metadata.labels."kubernetes.io/legacy-token-invalid-since") | "\(.metadata.namespace)/\(.metadata.name)"'

How to diagnose it

Identify the token type. If the Pod relies on a Secret reference created before 1.24, it is likely legacy. If the Pod spec uses a projected volume or automount on a 1.24+ cluster, it is bound.
Inspect the JWT claims. Decode aud, exp, and iat. Confirm the aud array contains every audience the recipient expects. For Kubernetes TokenReview, the audience must include api or the matching issuer identifier.
Verify rotation. Check the token file age on the Pod filesystem. At the default 1-hour TTL, kubelet begins rotation after roughly 48 minutes. If the file age exceeds 80 percent of expirationSeconds, investigate kubelet volume health and node conditions on that node.
Check kubelet logs. Look for errors related to projected volume updates or TokenRequest calls. On systemd-based nodes, use journalctl -u kubelet. PLEG delays, disk pressure, or volume manager stalls can block the kubelet from writing the rotated token.
Audit legacy secrets. In clusters upgraded past 1.24, list Secrets by type kubernetes.io/service-account-token. Look for the kubernetes.io/legacy-token-invalid-since label introduced in v1.29. The cleanup controller removes these automatically in 1.30+ if they remain unused; verify controller activity before manual deletion.
Examine CSI drivers. If the failure occurs during volume mount, inspect CSIDriver.spec.tokenRequests. In v1.32, ServiceAccountNodeAudienceRestriction caused regressions with azureFile. In v1.33 it was re-enabled by default. Ensure the requested audience is present in the Pod spec or explicitly granted via RBAC with verb request-serviceaccounts-token-audience.
Correlate 401 errors. Match the failing Pod’s service account, the token’s exp claim, and the API server or external service logs. Clock skew between node and control plane can make a valid token appear expired.

Metrics and signals to monitor

Signal	Why it matters	Warning sign
`apiserver_request_total{resource="serviceaccounts",subresource="token"}`	TokenRequest API load; spikes indicate rotation storms or runaway CSI drivers	Sustained rate more than 2× baseline without deployment activity
`serviceaccount_stale_tokens_total` (v1.29+)	Counts legacy tokens marked invalid	Non-zero or increasing count
Projected token file age on disk	Direct indicator of kubelet rotation health	File age greater than 80% of `expirationSeconds`
API server audit log `responseStatus.code=401`	Confirms token rejection and identifies the service account	Sudden spike correlated with rotation window
`apiserver_storage_objects{resource="secrets"}`	Secret growth in etcd	Increasing count, especially in clusters with many legacy tokens
`storage_operation_duration_seconds` from kubelet	Node restriction or token request failures block mounts	Duration greater than 60 seconds with token-related events

Fixes

If the cause is audience mismatch

Add the required audience to the Pod’s projected volume spec. If a token is used for both an external vault (audience: vault) and the Kubernetes API (audience: api), include both in the audiences list. Do not assume the default issuer audience satisfies external verifiers.

If the cause is rotation stall

Evict the Pod to force a fresh projected volume mount on a healthy kubelet. Warning: Eviction is disruptive and will terminate running containers.

Check the node for kubelet volume manager errors, PLEG delays, or disk pressure that prevent the kubelet from updating the token. Restarting the kubelet remounts all projected volumes on the node, but this is disruptive to every Pod on the node.

If the cause is legacy token accumulation

Delete manually created legacy Secrets only after confirming no workload references them. Warning: Deleting an in-use Secret will cause immediate 401 failures.

For auto-generated legacy Secrets, rely on the LegacyServiceAccountTokenCleanUp controller (GA in v1.30) to remove tokens labeled invalid after one year of inactivity. You may delete Secrets carrying the kubernetes.io/legacy-token-invalid-since label if you have verified they are unused.

If the cause is CSI driver token request failure

Upgrade to Kubernetes 1.33 or later if you hit the v1.32 azureFile regression. Ensure the CSI driver token request audience is declared in the Pod’s projected volume or that the node’s kubelet has RBAC to request that audience via the request-serviceaccounts-token-audience verb.

If the cause is provider breakage post-1.24

Update Terraform providers and Helm charts that rely on kubernetes_service_account.default_secret_name. Create an explicit Secret of type kubernetes.io/service-account-token if a static reference is required, or switch to kubectl create token or the TokenRequest API in your pipeline.

Prevention

Use projected volumes for all new workloads. Set automountServiceAccountToken: false on ServiceAccounts that do not need API access, and mount projected tokens explicitly where needed.
Set expirationSeconds to a value your workload and kubelet can sustain. The default 3600 seconds is appropriate for most cases; avoid values below 600 seconds unless you have verified kubelet can rotate under load.
Audit Secrets quarterly. Remove legacy tokens before they accumulate. Enable LegacyServiceAccountTokenCleanUp if not already active.
Validate CSI driver audiences in staging before upgrading to versions that enforce ServiceAccountNodeAudienceRestriction.
Monitor TokenRequest rates and 401 errors per service account. Treat a rising 401 rate from a known service account as a token hygiene alert.

How Netdata helps

Surfaces API server request rates for the serviceaccounts/token subresource to detect TokenRequest storms.
Tracks kubelet volume operation latency and PLEG health to catch rotation delays before tokens expire.
Monitors etcd object counts and control plane storage growth to surface legacy token accumulation.
Alerts on node disk pressure that can block kubelet projected volume updates.
Correlates API server 401/403 rates with node-level signals to narrow the failure domain.

See How the Kubernetes control plane works for the mental model behind TokenRequest and kubelet volume management.

See Kubernetes API server certificate rotation for handling control-plane credential expiry.

See Kubernetes API server slow or unresponsive if TokenRequest latency is part of a broader API degradation.

See Kubernetes anonymous API access to distinguish token failures from anonymous auth probes.

The Netdata solution

Kubernetes monitoring with Netdata

Netdata monitors Kubernetes with per-second metrics across the control plane, nodes, and every pod, with ML anomaly detection and zero per-pod configuration. Correlate API-server and etcd latency, kubelet PLEG stalls, scheduling pressure, and OOMKills in one place.

See Kubernetes monitoring → Start monitoring free

Kubernetes bound service account tokens: rotation, audience, and expiry

Kubernetes bound service account tokens: rotation, audience, and expiry

What this means

Common causes

Quick checks

How to diagnose it

Metrics and signals to monitor

Fixes

If the cause is audience mismatch

If the cause is rotation stall

If the cause is legacy token accumulation

If the cause is CSI driver token request failure

If the cause is provider breakage post-1.24

Prevention

How Netdata helps

Related guides

Kubernetes monitoring with Netdata