cpufreq collector fails when any core is offline #2577

kitzmiller · 2023-01-17T15:43:09Z

Host operating system: output of `uname -a`

Linux myhost 6.1.5-060105-generic #202301121238 SMP PREEMPT_DYNAMIC Thu Jan 12 13:10:27 UTC 2023 x86_64 x86_64 x86_64 GNU/Linux

node_exporter version: output of `node_exporter --version`

  node_exporter, version 1.5.0 (branch: HEAD, revision: 1b48970ffcf5630534fb00bb0687d73c66d1c959)
  build user:       root@6e7732a7b81b
  build date:       20221129-18:59:09
  go version:       go1.19.3
  platform:         linux/amd64

same behavior on

node_exporter, version 1.3.1 (branch: debian/sid, revision: 1.3.1-1)
  build user:       team+pkg-go@tracker.debian.org
  build date:       20220114-23:26:34
  go version:       go1.17.3
  platform:         linux/amd64

node_exporter command line flags

none

node_exporter log output

Jan 17 07:35:31 myhost prometheus-node-exporter[1028]: ts=2023-01-17T12:35:31.485Z caller=collector.go:169 level=error msg="collector failed" name=cpufreq duration_seconds=0.001987722 err="read /sys/devices/system/cpu/cpu15/cpufreq/cpuinfo_max_freq: device or resource busy"

Are you running node_exporter in Docker?

no

What did you do that produced an error?

I disabled a CPU core with:

echo 0 > /sys/devices/system/cpu/cpu15/online

What did you expect to see?

I expected the metrics node_cpu_frequency_max_hertz, node_cpu_frequency_min_hertz, node_cpu_scaling_frequency_hertz, node_cpu_scaling_frequency_max_hertz, node_cpu_scaling_frequency_min_hertz for the remaining online cores to still be available.

What did you see instead?

The above metrics were not present when any core is disabled. Reenabling the core reenables the metrics. The error above is added to /var/log/syslog every minute.

The text was updated successfully, but these errors were encountered:

taherkk · 2023-03-03T16:24:26Z

Hi @discordianfish ,

I want to contribute to this issue.

I have added a check to ensure the CPU is online before reading the frequency files (except cpu0) in systems_cpu.go under the "github.com/prometheus/procfs/sysfs" package.

This has solved the issue. I am new to contributing to open source.
Can someone guide me through the next steps?

discordianfish · 2023-03-07T13:23:44Z

Maybe #2605 fixed that issue for you as well? Try the current version in master

taherkk · 2023-03-08T17:09:37Z

No it didn't. Since this change resolved the bug in collector and not procfs.
I found a better way to find offline cpus though using "/sys/devices/system/cpu/offline".

taherkk · 2023-03-31T08:38:44Z

We can close this issue it should be resolved by prometheus/procfs#497

discordianfish added bug accepted good first issue platform/Linux Linux specific issue labels Feb 17, 2023

This was referenced Mar 5, 2023

SystemCPUfreq fails when any core is offline prometheus/procfs#496

Closed

Bug Fix: SystemCPUfreq fails when any core is offline prometheus/procfs#497

Merged

discordianfish closed this as completed Apr 5, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cpufreq collector fails when any core is offline #2577

cpufreq collector fails when any core is offline #2577

kitzmiller commented Jan 17, 2023

taherkk commented Mar 3, 2023 •

edited

Loading

discordianfish commented Mar 7, 2023

taherkk commented Mar 8, 2023 •

edited

Loading

taherkk commented Mar 31, 2023

cpufreq collector fails when any core is offline #2577

cpufreq collector fails when any core is offline #2577

Comments

kitzmiller commented Jan 17, 2023

Host operating system: output of uname -a

node_exporter version: output of node_exporter --version

node_exporter command line flags

node_exporter log output

Are you running node_exporter in Docker?

What did you do that produced an error?

What did you expect to see?

What did you see instead?

taherkk commented Mar 3, 2023 • edited Loading

discordianfish commented Mar 7, 2023

taherkk commented Mar 8, 2023 • edited Loading

taherkk commented Mar 31, 2023

Host operating system: output of `uname -a`

node_exporter version: output of `node_exporter --version`

taherkk commented Mar 3, 2023 •

edited

Loading

taherkk commented Mar 8, 2023 •

edited

Loading