Metrics
The following list contains the core metrics reported by Linaro MAP.
Only available metrics are exported to JSON. For example, if there is no Lustre filesystem then the Lustre metrics will not be included. If any custom metrics are loaded, they will be included in the JSON, but are not documented here.
For more information on the metrics, see Metrics view
CPU Instructions: see CPU instructions
instr_fp
instr_int
instr_mem
instr_vector_fp
instr_vector_int
instr_branch
instr_scalar_fp
: The percentage of time each rank spends in standard x87 floating-point operations.instr_scalar_int
: The percentage of time each rank spends in standard integer operations.instr_implicit_mem
: Implicit memory accesses. The percentage of time spent executing instructions with implicit memory accesses.instr_other
: The percentage of time each rank spends in instructions which cannot be categorized as any of the ones given above.
CPU Time: see CPU time
cpu_time_percentage
: See CPU time in CPU timeuser_time_percentage
: See User-mode CPU time in CPU timesystem_time_percentage
: See Kernel-mode CPU time in CPU timevoluntary_context_switches
: See Voluntary context switches (1/s) in CPU timeinvoluntary_context_switches
: See Involuntary context switches (1/s) in CPU timeloadavg
: See System load in CPU time
I/O: see I/O
Lustre
lustre_bytes_read
: Lustre read transfer (B/s)lustre_bytes_written
: Lustre write transfer (B/s)lustre_rchar_total
: Lustre bytes readlustre_wchar_total
: Lustre bytes written
Memory: see Memory
MPI: see MPI calls
mpi_call_time
: See MPI call duration (ns) in MPI callsmpi_sent
: See MPI sent in MPI callsmpi_recv
: See MPI received in MPI callsmpi_calls
: Number of MPI calls per second per processmpi_p2p
: See MPI point-to-point in MPI callsmpi_collect
: See MPI collective operations in MPI callsmpi_p2p_bytes
: See MPI point-to-point bytes in MPI callsmpi_collect_bytes
: See MPI collective bytes in MPI calls
Accelerator (NVIDIA): see Accelerator
nvidia_gpu_usage
: See GPU utilization in Acceleratornvidia_memory_used_percent
: See GPU memory usage in Acceleratornvidia_memory_used
: GPU memory usage in bytes
Accelerator (AMD): see Accelerator
rocm.gpu_utilization
: See GPU utilization in Acceleratorrocm.gpu_memory_used_percent
: See GPU memory usage in Acceleratorrocm.memory_utilization
: See GPU memory utilization in Acceleratorrocm.gpu_memory_used
: GPU memory usage in bytes
Energy: see Energy
nvidia_power
: See NVIDIA GPU power usage (mW/node) in Energyrocm.gpu_power
: See AMD GPU power usage (mW/node) in Energyrapl_power
: See CPU power usage (W/node) in Energysystem_power
: See System power usage (W/node) in Energyrapl_energy
: CPU energy, integral ofrapl_power (J)
system_energy
: CPU energy, integral ofsystem_power (J)