open-nomad/devices/gpu/nvidia
Kris Hicks 93155ba3da
Add gocritic to golangci-lint config (#9556)
2020-12-08 12:47:04 -08:00
..
cmd cleanup driver eventor goroutines 2020-05-26 11:04:04 -04:00
nvml
README.md Update devices/gpu/nvidia/README.md 2019-01-23 17:44:24 -08:00
device.go nvidia: support disabling the nvidia plugin (#8353) 2020-07-21 10:11:16 -04:00
device_test.go nvidia: support disabling the nvidia plugin (#8353) 2020-07-21 10:11:16 -04:00
fingerprint.go Device manager 2018-11-07 10:43:15 -08:00
fingerprint_test.go
stats.go Add gocritic to golangci-lint config (#9556) 2020-12-08 12:47:04 -08:00
stats_test.go devices/nvidia: memory state as the summary stat 2018-12-10 12:18:24 -05:00

README.md

This package provides an implementation of nvidia device plugin

Behavior

Nvidia device plugin uses NVML bindings to get data regarding available nvidia devices and will expose them via Fingerprint RPC. GPUs can be excluded from fingerprinting by setting the ignored_gpu_ids field. Plugin sends statistics for fingerprinted devices every stats_period period.

Config

The configuration should be passed via an HCL file that begins with a top level config stanza:

config {
  ignored_gpu_ids = ["uuid1", "uuid2"]
  fingerprint_period = "5s"
}

The valid configuration options are:

  • ignored_gpu_ids (list(string): []): list of GPU UUIDs strings that should not be exposed to nomad
  • fingerprint_period (string: "1m"): interval to repeat the fingerprint process to identify possible changes.