Skip to main content

Metrics Reference

Last update:

This instruction describes the product metrics collected by the service.

The list of products that support metrics will be expanded.

Dedicated Servers

Monitoring is supported by servers with a KVM console, which is available for every dedicated server, except for the Chipcore product line servers and servers in a dedicated rack — see more on the Servers in a Dedicated Rack page on selectel.ru.

The set of IPMI metrics may differ depending on the server model.

Each metric transmits a timestamp — the time the metric was collected. You can convert the value using online services, for example, on the unixtimestamp.com.

Metric name
Type, unit of measurement
DescriptionLabels
ipmi_bmc_info
Gauge, UNIT
BMC (Baseboard Management Controller) metadata — firmware, manufacturer, and others. The metric value is always 1
  • datacenter — the name of the data center where the server is hosted;
  • location_id — the pool where the server is hosted;
  • resource_id — server UUID;
  • bmc_url — BMC address;
  • firmware_revision — BMC firmware version;
  • manufacturer_id — manufacturer (vendor) ID;
  • system_firmware_version — system firmware version (BIOS, UEFI)
up
Gauge, bool

Checking the availability of the metric source:

  • 1 — successful;
  • 0 — failure
  • datacenter — name of the data center where the server is hosted;
  • location_id — the pool where the server is hosted;
  • resource_id — server UUID
ipmi_up
Gauge, bool

Checking IPMI availability:

  • 1 — successful;
  • 0 — failure
  • datacenter — name of the data center where the server is hosted;
  • collector — name of the exporter's internal collector responsible for collecting a specific group of metrics;
  • location_id — the pool where the server is hosted;
  • resource_id — server UUID
ipmi_scrape_duration_seconds
Gauge, seconds
Time taken for IPMI exporter to collect metrics
  • datacenter — name of the data center where the server is hosted;
  • location_id — the pool where the server is hosted;
  • resource_id — server UUID
ipmi_sel_events_count_by_state
Gauge, count
Number of events from the SEL (System Event Log), grouped by event criticality level
  • datacenter — name of the data center where the server is hosted;
  • location_id — the pool where the server is hosted;
  • resource_id — server UUID;
  • state — event criticality level
ipmi_chassis_cooling_fault_state
Gauge, bool

Server cooling system status:

  • 1 — healthy;
  • 0 — failure
  • datacenter — name of the data center where the server is hosted;
  • location_id — the pool where the server is hosted;
  • resource_id — server UUID
ipmi_chassis_drive_fault_state
Gauge, bool

Drive status:

  • 1 — healthy;
  • 0 — failure
  • datacenter — name of the data center where the server is hosted;
  • location_id — the pool where the server is hosted;
  • resource_id — server UUID
ipmi_chassis_power_state
Gauge, bool

Server power status:

  • 1 — on;
  • 0 — off
  • datacenter — name of the data center where the server is hosted;
  • location_id — the pool where the server is hosted;
  • resource_id — server UUID
ipmi_fan_speed_state
Gauge, enum

Fan status:

  • 0 — healthy;
  • 1 — requires attention;
  • 2 — failure
  • datacenter — name of the data center where the server is hosted;
  • location_id — the pool where the server is hosted;
  • resource_id — server UUID;
  • id — sensor identifier;
  • name — sensor name
ipmi_power_state
Gauge, enum

Server component power status:

  • 0 — healthy;
  • 1 — requires attention;
  • 2 — failure
  • datacenter — name of the data center where the server is hosted;
  • location_id — the pool where the server is hosted;
  • resource_id — server UUID;
  • id — sensor ID;
  • name — sensor name
ipmi_temperature_state
Gauge, enum

Server component temperature status:

  • 0 — healthy;
  • 1 — requires attention;
  • 2 — failure
  • datacenter — name of the data center where the server is hosted;
  • location_id — the pool where the server is hosted;
  • resource_id — server UUID;
  • id — sensor ID;
  • name — sensor name
ipmi_voltage_state
Gauge, enum

Voltage status:

  • 0 — healthy;
  • 1 — requires attention;
  • 2 — failure
  • datacenter — name of the data center where the server is hosted;
  • location_id — the pool where the server is hosted;
  • resource_id — server UUID;
  • id — sensor ID;
  • name — sensor name
ipmi_sensor_state
Gauge, enum

Server component status:

  • 0 — healthy;
  • 1 — requires attention;
  • 2 — failure
  • datacenter — name of the data center where the server is hosted;
  • location_id — the pool where the server is hosted;
  • resource_id — server UUID;
  • id — sensor ID;
  • name — sensor name;
  • type — sensor type
ipmi_fan_speed_rpm
Gauge, RPM
Fan rotation speed
  • datacenter — name of the data center where the server is hosted;
  • location_id — the pool where the server is hosted;
  • resource_id — server UUID;
  • id — sensor ID;
  • name — sensor name
ipmi_power_watts
Gauge, watts
Power consumption
  • datacenter — name of the data center where the server is hosted;
  • location_id — the pool where the server is hosted;
  • resource_id — server UUID;
  • id — sensor ID;
  • name — sensor name
ipmi_temperature_celsius
Gauge, degrees °C
Server component temperature
  • datacenter — name of the data center where the server is hosted;
  • location_id — the pool where the server is hosted;
  • resource_id — server UUID;
  • name — sensor name;
  • id — sensor ID
ipmi_voltage_volts
Gauge, volts
Voltage
  • datacenter — name of the data center where the server is hosted;
  • location_id — the pool where the server is hosted;
  • resource_id — server UUID;
  • id — sensor ID;
  • name — sensor name
ipmi_sensor_value
Gauge
IPMI sensor data. The unit of measurement depends on the sensor type
  • datacenter — name of the data center where the server is hosted;
  • location_id — the pool where the server is hosted;
  • resource_id — server UUID;
  • id — sensor ID;
  • name — sensor name;
  • type — sensor type

Cloud servers

The cloud server namespace is compute.

Metric name
Type, unit of measurement
DescriptionLabels
server_meta_info
Gauge, UNIT
Cloud server metadata
  • project_name — project name;
  • server_uuid — cloud server ID;
  • server_name — cloud server name
server_state_info
Gauge, UNIT

Current cloud server status:

  • 0 (no state) — the server state is unknown or undefined;
  • 1 (running) — the server is running;
  • 2 (blocked) — the server is blocked (waiting for resources or completion of a disk read or write operation);
  • 3 (paused) — the server is paused;
  • 4 (shutdown) — the server is shutting down;
  • 5 (shut off) — the server is stopped and off;
  • 6 (crashed) — the server has crashed;
  • 7 (pmsuspended) — the server is suspended and has entered sleep mode
  • server_uuid — cloud server ID
cpu_count
Gauge, count
Number of vCPUs on the cloud server
  • server_uuid — cloud server ID
vcpu_usage_ratio
Gauge, %
Utilization of each cloud server vCPU in %
  • server_uuid — cloud server ID;
  • vcpu — virtual processor number
cpu_usage_ratio
Gauge, %
Cloud server CPU utilization in %
  • server_uuid — cloud server ID
memory_total_bytes
Gauge, bytes
Memory size on the cloud server
  • server_uuid — cloud server ID
memory_usage_bytes
Gauge, bytes
Amount of memory used by the cloud server
  • server_uuid — cloud server ID
memory_usage_ratio
Gauge, %
Percentage of RAM being used by the cloud server
  • server_uuid — cloud server ID
memory_major_faults_total
Counter, items
Number of major page faults that have occurred on the cloud server since it was last started (major page faults)
  • server_uuid — cloud server ID
disk_meta_info
Gauge, UNIT
Cloud server disk metadata
  • server_uuid — cloud server ID;
  • disk_uuid — cloud server disk ID;
  • disk_type="network" / "file" — disk type
disk_write_requests_rate
Gauge, operations/s
Number of write operations per second on the cloud server disk
  • server_uuid — cloud server ID;
  • disk_uuid — cloud server disk ID
disk_read_requests_rate
Gauge, operations/s
Number of read operations per second from the cloud server disk
  • server_uuid — cloud server ID;
  • disk_uuid — cloud server disk ID
disk_read_requests_limit
Gauge, operations/s
Maximum number of read operations for the cloud server disk
  • server_uuid — cloud server ID;
  • disk_uuid — cloud server disk ID
disk_write_requests_limit
Gauge, operations/s
Maximum number of write operations for the cloud server disk
  • server_uuid — cloud server ID;
  • disk_uuid — cloud server disk ID
disk_flush_requests_rate
Gauge, operations/s
Number of operations for flushing the cloud server disk cache
  • server_uuid — cloud server ID;
  • disk_uuid — cloud server disk ID
disk_read_bytes_rate
Gauge, bytes/s
Number of bytes per second read from the cloud server disk
  • server_uuid — cloud server ID;
  • disk_uuid — cloud server disk ID
disk_write_bytes_rate
Gauge, bytes/s
Number of bytes per second written to the cloud server disk
  • server_uuid — cloud server ID;
  • disk_uuid — cloud server disk ID
network_meta_info
Gauge, UNIT
Cloud server network interface metadata
  • server_uuid — cloud server ID;
  • virtual_interface — cloud server port ID
network_receive_packets_rate
Gauge, packets/s
Number of packets received per second on the cloud server network interface
  • server_uuid — cloud server ID;
  • virtual_interface — cloud server port ID
network_transmit_packets_rate
Gauge, packets/s
Number of packets transmitted per second through the cloud server network interface
  • server_uuid — cloud server ID;
  • virtual_interface — cloud server port ID
network_receive_bits_rate
Gauge, bits/s
Number of bits received per second on the cloud server network interface
  • server_uuid — cloud server ID;
  • virtual_interface — cloud server port ID
network_transmit_bits_rate
Gauge, bits/s
Number of bits transmitted per second through the cloud server network interface
  • server_uuid — cloud server ID;
  • virtual_interface — cloud server port ID