DaemonSet Details
Name: do-node-agent-amd-device-metrics-exporter
Namespace: kube-system
Pods: 0/0
Selector: app: do-node-agent-amd-de...
Kubectl Commands
- View
- Delete
- Describe
Containers
Name | Image | Ports | ... |
---|---|---|---|
amdgpu-metrics-exporter-container | ghcr.io/digitalocean-packages/amd-gpu-de... | 5000/TCP | ... |
do-node-agent | ghcr.io/digitalocean-packages/do-agent:3... | N/A | ... |
Init Containers
Name | Image | Ports | ... |
---|---|---|---|
dynamic-config | gcr.io/distroless/static-debian12:debug-... | N/A | ... |
driver-init | busybox:1.36 | N/A | ... |
Metadata
Creation Time: 2025-08-14T22:04:45Z
Labels:
- app: do-node-agent-amd-de...
- c3.doks.digitalocean.com/component: do-node-agent...
- c3.doks.digitalocean.com/plane: data...
- doks.digitalocean.com/managed: true...
Annotation:
- deprecated.daemonset.template.generation: 2...
name: do-node-agent-amd-device-metrics-exporternamespace: kube-systemuid: 69947fe6-da27-4ba9-9099-88aa6c810404resourceVersion: '134440246'generation: 2creationTimestamp: '2025-08-14T22:04:45Z'labels:app: do-node-agent-amd-device-metrics-exporterc3.doks.digitalocean.com/component: do-node-agentc3.doks.digitalocean.com/plane: datadoks.digitalocean.com/managed: 'true'annotations:deprecated.daemonset.template.generation: '2'
- name: amdgpu-metrics-exporter-containerimage: ghcr.io/digitalocean-packages/amd-gpu-device-metrics-exporter:v1.3.1workingDir: /rootports:- containerPort: 5000protocol: TCPenv:- name: NODE_NAMEvalueFrom:fieldRef:apiVersion: v1fieldPath: spec.nodeName- name: METRICS_EXPORTER_PORTvalue: '5000'resources: {}volumeMounts:- name: dev-volumemountPath: /dev- name: sys-volumemountPath: /sys- name: pod-resourcesmountPath: /var/lib/kubelet/pod-resources- name: exporter-health-grpc-volumemountPath: /var/lib/amd-metrics-exporter- name: exporter-slurm-jobmountPath: /var/run/exporter- name: metrics-config-volumemountPath: /etc/metrics/terminationMessagePath: /dev/termination-logterminationMessagePolicy: FileimagePullPolicy: AlwayssecurityContext:privileged: true- name: do-node-agentimage: ghcr.io/digitalocean-packages/do-agent:3.18.2command:- /bin/do-agentargs:- '@/etc/config/do-agent-config'- '--path.procfs=/host/proc'- '--path.sysfs=/host/sys'- >---k8s-metrics-path=http://kube-state-metrics.kube-system.svc.cluster.local:8080/metrics- '--gpu-metrics-path=http://127.0.0.1:5000/metrics'- >---additional-label=kubernetes_cluster_uuid:f6ce2907-8531-4ab3-861e-4e2affa620b1resources:limits:memory: 300Mirequests:cpu: 102mmemory: 80MivolumeMounts:- name: procreadOnly: truemountPath: /host/proc- name: sysreadOnly: truemountPath: /host/sys- name: rootreadOnly: truemountPath: /host/rootmountPropagation: HostToContainer- name: dynamic-configmountPath: /etc/configterminationMessagePath: /dev/termination-logterminationMessagePolicy: FileimagePullPolicy: IfNotPresent
currentNumberScheduled: 0numberMisscheduled: 0desiredNumberScheduled: 0numberReady: 0observedGeneration: 2
selector:matchLabels:app: do-node-agent-amd-device-metrics-exportertemplate:metadata:creationTimestamp: nulllabels:app: do-node-agent-amd-device-metrics-exporterdoks.digitalocean.com/managed: 'true'annotations:clusterlint.digitalocean.com/disabled-checks: resource-requirements,hostpath-volumespec:volumes:- name: prochostPath:path: /proctype: ''- name: syshostPath:path: /systype: ''- name: roothostPath:path: /type: ''- name: dynamic-configemptyDir: {}- name: host-kubectlhostPath:path: /usr/bin/kubectltype: File- name: dev-volumehostPath:path: /devtype: Directory- name: sys-volumehostPath:path: /systype: Directory- name: pod-resourceshostPath:path: /var/lib/kubelet/pod-resourcestype: Directory- name: exporter-health-grpc-volumehostPath:path: /var/lib/amd-metrics-exportertype: DirectoryOrCreate- name: exporter-slurm-jobhostPath:path: /var/run/exportertype: DirectoryOrCreate- name: metrics-config-volumeconfigMap:name: do-node-agent-device-metrics-exporter-charts-configmapdefaultMode: 420initContainers:- name: dynamic-configimage: gcr.io/distroless/static-debian12:debug-nonroot-amd64command:- sh- '-c'- >set -o errexitset -o pipefailset -o nounsetKUBECTL=/host/usr/bin/kubectlPOOL_ID="$(${KUBECTL} get node ${NODE_NAME} -ojsonpath='{.metadata.labels.doks\.digitalocean\.com/node-pool-id}')"[[ -z "${POOL_ID}" ]] && echo "Pool ID label missing" && exit 1echo "--additional-label=kubernetes_node_pool_uuid:${POOL_ID}" >/etc/config/do-agent-configecho "Pool ID configured: ${POOL_ID}"env:- name: NODE_NAMEvalueFrom:fieldRef:apiVersion: v1fieldPath: spec.nodeNameresources: {}volumeMounts:- name: dynamic-configmountPath: /etc/config- name: host-kubectlmountPath: /host/usr/bin/kubectlterminationMessagePath: /dev/termination-logterminationMessagePolicy: FileimagePullPolicy: IfNotPresent- name: driver-initimage: busybox:1.36command:- sh- '-c'- >-while [ ! -d /host-sys/class/kfd ] || [ ! -d/host-sys/module/amdgpu/drivers/ ]; do echo "amdgpu driver is notloaded "; sleep 2 ;doneresources: {}volumeMounts:- name: sys-volumemountPath: /host-systerminationMessagePath: /dev/termination-logterminationMessagePolicy: FileimagePullPolicy: IfNotPresentsecurityContext:privileged: truecontainers:- name: amdgpu-metrics-exporter-containerimage: ghcr.io/digitalocean-packages/amd-gpu-device-metrics-exporter:v1.3.1workingDir: /rootports:- containerPort: 5000protocol: TCPenv:- name: NODE_NAMEvalueFrom:fieldRef:apiVersion: v1fieldPath: spec.nodeName- name: METRICS_EXPORTER_PORTvalue: '5000'resources: {}volumeMounts:- name: dev-volumemountPath: /dev- name: sys-volumemountPath: /sys- name: pod-resourcesmountPath: /var/lib/kubelet/pod-resources- name: exporter-health-grpc-volumemountPath: /var/lib/amd-metrics-exporter- name: exporter-slurm-jobmountPath: /var/run/exporter- name: metrics-config-volumemountPath: /etc/metrics/terminationMessagePath: /dev/termination-logterminationMessagePolicy: FileimagePullPolicy: AlwayssecurityContext:privileged: true- name: do-node-agentimage: ghcr.io/digitalocean-packages/do-agent:3.18.2command:- /bin/do-agentargs:- '@/etc/config/do-agent-config'- '--path.procfs=/host/proc'- '--path.sysfs=/host/sys'- >---k8s-metrics-path=http://kube-state-metrics.kube-system.svc.cluster.local:8080/metrics- '--gpu-metrics-path=http://127.0.0.1:5000/metrics'- >---additional-label=kubernetes_cluster_uuid:f6ce2907-8531-4ab3-861e-4e2affa620b1resources:limits:memory: 300Mirequests:cpu: 102mmemory: 80MivolumeMounts:- name: procreadOnly: truemountPath: /host/proc- name: sysreadOnly: truemountPath: /host/sys- name: rootreadOnly: truemountPath: /host/rootmountPropagation: HostToContainer- name: dynamic-configmountPath: /etc/configterminationMessagePath: /dev/termination-logterminationMessagePolicy: FileimagePullPolicy: IfNotPresentrestartPolicy: AlwaysterminationGracePeriodSeconds: 30dnsPolicy: ClusterFirstWithHostNetnodeSelector:doks.digitalocean.com/gpu-brand: amdkubernetes.io/os: linuxserviceAccountName: do-agentserviceAccount: do-agenthostNetwork: truesecurityContext: {}schedulerName: default-schedulertolerations:- operator: ExistspriorityClassName: system-node-criticalupdateStrategy:type: RollingUpdaterollingUpdate:maxUnavailable: 1maxSurge: 0revisionHistoryLimit: 10
metadata:name: do-node-agent-amd-device-metrics-exporternamespace: kube-systemuid: 69947fe6-da27-4ba9-9099-88aa6c810404resourceVersion: '134440246'generation: 2creationTimestamp: '2025-08-14T22:04:45Z'labels:app: do-node-agent-amd-device-metrics-exporterc3.doks.digitalocean.com/component: do-node-agentc3.doks.digitalocean.com/plane: datadoks.digitalocean.com/managed: 'true'annotations:deprecated.daemonset.template.generation: '2'spec:selector:matchLabels:app: do-node-agent-amd-device-metrics-exportertemplate:metadata:creationTimestamp: nulllabels:app: do-node-agent-amd-device-metrics-exporterdoks.digitalocean.com/managed: 'true'annotations:clusterlint.digitalocean.com/disabled-checks: resource-requirements,hostpath-volumespec:volumes:- name: prochostPath:path: /proctype: ''- name: syshostPath:path: /systype: ''- name: roothostPath:path: /type: ''- name: dynamic-configemptyDir: {}- name: host-kubectlhostPath:path: /usr/bin/kubectltype: File- name: dev-volumehostPath:path: /devtype: Directory- name: sys-volumehostPath:path: /systype: Directory- name: pod-resourceshostPath:path: /var/lib/kubelet/pod-resourcestype: Directory- name: exporter-health-grpc-volumehostPath:path: /var/lib/amd-metrics-exportertype: DirectoryOrCreate- name: exporter-slurm-jobhostPath:path: /var/run/exportertype: DirectoryOrCreate- name: metrics-config-volumeconfigMap:name: do-node-agent-device-metrics-exporter-charts-configmapdefaultMode: 420initContainers:- name: dynamic-configimage: gcr.io/distroless/static-debian12:debug-nonroot-amd64command:- sh- '-c'- >set -o errexitset -o pipefailset -o nounsetKUBECTL=/host/usr/bin/kubectlPOOL_ID="$(${KUBECTL} get node ${NODE_NAME} -ojsonpath='{.metadata.labels.doks\.digitalocean\.com/node-pool-id}')"[[ -z "${POOL_ID}" ]] && echo "Pool ID label missing" && exit 1echo "--additional-label=kubernetes_node_pool_uuid:${POOL_ID}" >/etc/config/do-agent-configecho "Pool ID configured: ${POOL_ID}"env:- name: NODE_NAMEvalueFrom:fieldRef:apiVersion: v1fieldPath: spec.nodeNameresources: {}volumeMounts:- name: dynamic-configmountPath: /etc/config- name: host-kubectlmountPath: /host/usr/bin/kubectlterminationMessagePath: /dev/termination-logterminationMessagePolicy: FileimagePullPolicy: IfNotPresent- name: driver-initimage: busybox:1.36command:- sh- '-c'- >-while [ ! -d /host-sys/class/kfd ] || [ ! -d/host-sys/module/amdgpu/drivers/ ]; do echo "amdgpu driver is notloaded "; sleep 2 ;doneresources: {}volumeMounts:- name: sys-volumemountPath: /host-systerminationMessagePath: /dev/termination-logterminationMessagePolicy: FileimagePullPolicy: IfNotPresentsecurityContext:privileged: truecontainers:- name: amdgpu-metrics-exporter-containerimage: ghcr.io/digitalocean-packages/amd-gpu-device-metrics-exporter:v1.3.1workingDir: /rootports:- containerPort: 5000protocol: TCPenv:- name: NODE_NAMEvalueFrom:fieldRef:apiVersion: v1fieldPath: spec.nodeName- name: METRICS_EXPORTER_PORTvalue: '5000'resources: {}volumeMounts:- name: dev-volumemountPath: /dev- name: sys-volumemountPath: /sys- name: pod-resourcesmountPath: /var/lib/kubelet/pod-resources- name: exporter-health-grpc-volumemountPath: /var/lib/amd-metrics-exporter- name: exporter-slurm-jobmountPath: /var/run/exporter- name: metrics-config-volumemountPath: /etc/metrics/terminationMessagePath: /dev/termination-logterminationMessagePolicy: FileimagePullPolicy: AlwayssecurityContext:privileged: true- name: do-node-agentimage: ghcr.io/digitalocean-packages/do-agent:3.18.2command:- /bin/do-agentargs:- '@/etc/config/do-agent-config'- '--path.procfs=/host/proc'- '--path.sysfs=/host/sys'- >---k8s-metrics-path=http://kube-state-metrics.kube-system.svc.cluster.local:8080/metrics- '--gpu-metrics-path=http://127.0.0.1:5000/metrics'- >---additional-label=kubernetes_cluster_uuid:f6ce2907-8531-4ab3-861e-4e2affa620b1resources:limits:memory: 300Mirequests:cpu: 102mmemory: 80MivolumeMounts:- name: procreadOnly: truemountPath: /host/proc- name: sysreadOnly: truemountPath: /host/sys- name: rootreadOnly: truemountPath: /host/rootmountPropagation: HostToContainer- name: dynamic-configmountPath: /etc/configterminationMessagePath: /dev/termination-logterminationMessagePolicy: FileimagePullPolicy: IfNotPresentrestartPolicy: AlwaysterminationGracePeriodSeconds: 30dnsPolicy: ClusterFirstWithHostNetnodeSelector:doks.digitalocean.com/gpu-brand: amdkubernetes.io/os: linuxserviceAccountName: do-agentserviceAccount: do-agenthostNetwork: truesecurityContext: {}schedulerName: default-schedulertolerations:- operator: ExistspriorityClassName: system-node-criticalupdateStrategy:type: RollingUpdaterollingUpdate:maxUnavailable: 1maxSurge: 0revisionHistoryLimit: 10status:currentNumberScheduled: 0numberMisscheduled: 0desiredNumberScheduled: 0numberReady: 0observedGeneration: 2