Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Filter terminated pods from node request metrics. #104

Merged

Conversation

jefchien
Copy link

Description: The node_<cpu|memory>_request metrics and metrics derived from them (node_<cpu|memory>_reserved_capacity) differ from the output of kubectl describe node <node_name>. This is because kubectl filters out terminated pods. See linked issue for more details.

Adds a filter for terminated (succeeded/failed state) pods.

Link to tracking Issue: open-telemetry#27262

Testing: Added unit test to validate pod state filtering. Built and deployed changes to cluster. Deployed cpu-test pod.

image

The gap is when the change was deployed. The metric can be see spiking up while the cpu-test pod is running and then returns to the previous request size after it has terminated.

@jefchien jefchien merged commit 1957d95 into amazon-contributing:aws-cwa-dev Oct 2, 2023
86 of 87 checks passed
@jefchien jefchien deleted the filter-terminated-pods branch October 2, 2023 16:07
lisguo pushed a commit to lisguo/opentelemetry-collector-contrib that referenced this pull request Oct 20, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants