Validator Hub Telemetry #559

Merged · 13 commits · Feb 13, 2024
Conversation

@thekaranacharya (Contributor) commented Jan 26, 2024

What is this about?

Adds manual instrumentation using the OpenTelemetry Python SDK to capture anonymous usage metadata and send it to a private OpenSearch sink for analysis; a rough sketch of the approach follows the list below.

The following is everything we capture:

  • User ID
  • The unique ID of the Guard object
  • The LLM provider API name
  • Boolean flags indicating whether custom reask prompts and instructions were used (not the actual content of the prompts or instructions)
  • The names of the validators used, both in-house and custom (names only)
  • The on_fail action configured for each validator, e.g. fix, reask, refrain, or noop
  • The result of each validator: pass/fail (again, just the outcome, not the content)
  • The number of times reask was performed in each guard call
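As a rough illustration (not the exact code in this PR), manual instrumentation with the OpenTelemetry Python SDK could look like the sketch below. The span name, attribute keys, and OTLP endpoint are assumptions for illustration; only anonymous metadata is set as span attributes, never prompt or response content.

```python
# A minimal sketch, assuming an OTLP/HTTP collector sits in front of the
# private OpenSearch sink. Names and values here are illustrative only.
from opentelemetry import trace
from opentelemetry.sdk.trace import TracerProvider
from opentelemetry.sdk.trace.export import BatchSpanProcessor
from opentelemetry.exporter.otlp.proto.http.trace_exporter import OTLPSpanExporter

provider = TracerProvider()
provider.add_span_processor(
    BatchSpanProcessor(
        # Hypothetical collector endpoint, not the one used by this PR.
        OTLPSpanExporter(endpoint="https://telemetry.example.com/v1/traces")
    )
)
trace.set_tracer_provider(provider)
tracer = trace.get_tracer("guardrails")

with tracer.start_as_current_span("guard-call") as span:
    # Anonymous usage metadata only -- no prompt/response content.
    span.set_attribute("user.id", "anon-1234")
    span.set_attribute("guard.id", "guard-5678")
    span.set_attribute("llm.provider", "openai")
    span.set_attribute("custom_reask_prompt.used", True)
    span.set_attribute("custom_reask_instructions.used", False)
    span.set_attribute("validator.names", ["ValidLength", "MyCustomValidator"])
    span.set_attribute("validator.on_fail_actions", ["fix", "reask"])
    span.set_attribute("validator.results", ["pass", "fail"])
    span.set_attribute("reask.count", 1)
```

With a BatchSpanProcessor, spans are buffered and exported in the background, so instrumentation adds minimal overhead to the guard call itself.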

The following are some of the insights we can derive from the raw trace data:

  • Number of uses per validator, guard, user, LLM provider, outcome, and on_fail action (see the aggregation sketch after this list)
  • Number of times a custom reask prompt or instruction was provided
  • Most popular validator, on_fail action, and LLM provider
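For instance, a count of uses per validator could be computed with a terms aggregation over the exported span attributes. The sketch below uses opensearch-py; the host, index name, and field path are assumptions and depend on how the collector maps spans to OpenSearch documents.

```python
# A minimal sketch using opensearch-py; index and field names are
# hypothetical, not the actual schema used by this PR.
from opensearchpy import OpenSearch

client = OpenSearch(hosts=["https://opensearch.example.com:9200"])  # hypothetical host

response = client.search(
    index="guardrails-traces",  # hypothetical index
    body={
        "size": 0,
        "aggs": {
            "uses_per_validator": {
                "terms": {"field": "attributes.validator.names.keyword"}
            }
        },
    },
)
for bucket in response["aggregations"]["uses_per_validator"]["buckets"]:
    print(bucket["key"], bucket["doc_count"])
```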

Why do we need this?

  • Observability is crucial for understanding and maintaining complex distributed systems. As modern applications adopt microservices and other distributed architectures, it becomes hard to trace the flow of requests and identify issues that arise across services. Observability tools like OpenTelemetry provide a unified way to collect and analyze this data, allowing teams to monitor performance, troubleshoot problems, and optimize system behavior, which helps maintain reliability and performance in dynamic, distributed environments.
  • This manual instrumentation lets us observe usage metadata, gain valuable insights, surface bugs, and improve the library iteratively.

@thekaranacharya marked this pull request as ready for review January 29, 2024 17:48