Define HTTP attributes that MUST be provided at span creation time #1916

lmolkova · 2021-09-09T16:56:24Z

Changes

This change clarifies which HTTP attributes should be provided before making sampling decision: all attributes that are required and available before span starts.

Optional attributes are not provided at start time to avoid the overhead of calculating and adding them

Note it actually goes against current spec recommendation (see #620 for discussion):

Whenever possible, users SHOULD set any already known attributes at span creation instead of calling SetAttribute later.

https://github.com/open-telemetry/opentelemetry-specification/blob/main/specification/trace/api.md#span-creation

Related issues #1747

Oberon00 · 2021-09-10T12:20:11Z

Note it actually goes against current spec recommendation:

Whenever possible, users SHOULD set any already known attributes at span creation instead of calling SetAttribute later.

https://github.com/open-telemetry/opentelemetry-specification/blob/main/specification/trace/api.md#span-creation

And I would like to change that as well because it seems to be not very efficient

Please don't open new, mostly unrelated issues in the PR description 😃
I agree that there is a fundamental performance problem with the current sampling API, but this should be discussed in #620.

Oberon00 · 2021-09-10T12:23:24Z

Your change currently fails the semantic convention check. I see two possibilities:

Update the markdown generation part of the semantic convention generator would need to be adapted to describe sampling_relevant see https://github.com/open-telemetry/build-tools/blob/main/semantic-conventions/src/opentelemetry/semconv/templating/markdown/__init__.py. Documentation that currently says "do not use" also needs to be updated (please don't add an additional table column though. we already have many tables that need horizontal scrolling).
Manually write the "at creation time" restrictions outside the  tags for now.

semantic_conventions/trace/http.yaml

Oberon00 · 2021-09-10T12:32:44Z

I agree to the semantic content of this PR, but it needs to be changed to pass the semantic convention check.

bogdandrutu

Please fix the checks, everything else LGTM

github-actions · 2021-09-23T03:17:10Z

This PR was marked stale due to lack of activity. It will be closed in 7 days.

specification/trace/semantic_conventions/http.md

yurishkuro

The change sneaks MUST in the text

jmacd · 2021-09-23T16:11:11Z

This doesn't look like "sneaking" to me, it's just a grammar question.

@yurishkuro are you suggesting to change

"following attributes MUST be provided at span creation time (when provided at all)"

to something like:

"If they will be provided during the lifetime of a span, the following attributes MUST be provided at span creation time."

?

yurishkuro · 2021-09-23T16:30:00Z

@jmacd the PR title says "should be provided", but the text says MUST, that's why I called it "sneaking in".

I would not object to SHOULD, I do object to MUST, I think it is an unreasonable requirement.

jmacd · 2021-09-23T16:37:24Z

"If condition is met, something MUST be." sounds equivalent to SHOULD, it's just more specific. Right?

lmolkova · 2021-09-23T18:32:34Z

I.e. how I think optimization should look like:

if tracer.shouldStart("name", ...) 
    sp:= tracer.start("name", ..., sampling_relevant_attributes)
    if sp.IsRecording() {
        sp.SetAttribute(optional attributes)
         ...

jmacd · 2021-09-23T18:54:19Z

If we want to optimize the sampling API, we should take it up in #620 and let this PR proceed. A lazy-attribute mechanism is how I'd do this, e.g.,

    sp:= tracer.start("name", ..., tracing.WithLazyAttribute("http.path", func() attribute.Value { return ... }))
    // ...
    sp.SetAttribute(attribute.LazyString("http.thing", func() attribute.Value { return ... }))
    if sp.IsRecording() {
        expensiveVar := expensiveMethod()
        calculated := doLogic(expensiveVar)
        sp.SetAttribute(attribute.String("http.expensive", calculated))
    }

#620 is saying we should be able to refactor the Sampler API to allow the Sampler to include this "shouldSample(name)" logic and selectively evaluate any of the lazy span attributes in order to make its decision. Most call-sites would be able to use the lazy attribute and avoid a call to sp.IsRecording(), if we had such a thing, like the example above.

yurishkuro · 2021-09-23T19:05:34Z

The problem I have with SHOULD is that instrumentations just do not follow it.

So either they don't care about following the spec (MUST wouldn't change that), or they have reasons (MUST would break them without a path forward).

lmolkova · 2021-09-23T19:09:50Z

So either they don't care about following the spec (MUST wouldn't change that), or they have reasons (MUST would break them without a path forward).

my assumptions about reasons (and I own Azure SDK http instrumentation, so know a bit about it):

historic implementations - follow old versions and owners not aware of sampling and creation time attribute requirements
not-optimal (optional attributes)
not-optimal (no early check)

Let me give you another reason for MUST (a bit beyond sampling): calculating metrics without a new instrumentation code. Is it worth a MUST assuming #620 solves perf concerns (however it does it)?

.

yurishkuro · 2021-09-23T22:47:04Z

Is it worth a MUST assuming #620 solves perf concerns (however it does it)?

Are we prepared to park this PR until #620 (which is ~18mo old) is resolved?

lmolkova · 2021-09-23T23:14:22Z

Is it worth a MUST assuming #620 solves perf concerns (however it does it)?

Are we prepared to park this PR until #620 (which is ~18mo old) is resolved?

I'm happy to change it to SHOULD (please, it will be a MUST soon) until #620 is resolved. Do you agree that it's the only blocker for MUST?

yurishkuro · 2021-09-23T23:53:12Z

Sorry, but I still haven't seen a convincing argument why this needs to be a requirement. I cannot overcome the fact that the only reason you want to require these fields is to do sampling, and not every conceivable form of sampling, but a very specific form that most people do not actually use today. And even if we start talking about attribute-based sampling, there can be plenty of other attributes could be way more valuable in sampling decisions than these http attributes, so why require these specific ones? This simply does not pass the bar for me for a narrow use case to be elevated to a requirement. I think you'd get as much mileage from SHOULD as from MUST.

calculating metrics without a new instrumentation code

Of the three RED metrics, error and duration cannot be calculated until span is finished, so it doesn't matter when span attributes are set.

lmolkova · 2021-09-24T00:06:05Z

I don't care much about this change, I want to have clear rules for implementations to follow.

And even if we start talking about attribute-based sampling, there can be plenty of other attributes could be way more valuable in sampling decisions than these http attributes, so why require these specific ones?

I'm trying to move HTTP spec forward, not sampling story. I want to have a good, consistent e2e story for users. Lack of clarity in this spec causes inconsistencies in instrumentations and broken experience for users - MUST is a way to provide clarity.

Of the three RED metrics, error and duration cannot be calculated until span is finished, so it doesn't matter when span attributes are set.

For metrics, we don't even have to start a real span if it's sampled out - we can have a lightweight one that only supports a few required attributes, provided at start time and measures duration. But you're right that all required attributes indeed can be provided after start for it to work.

iNikem · 2021-09-24T05:49:34Z

a very specific form that most people do not actually use today.

Please be careful with such sweeping statements. In my pond the situation is the opposite: nobody cares about probabilistic sampling but a lot of people want attribute-based sampling (or Views, as you called them in another issue).

anuraaga · 2021-09-24T06:32:15Z

For better or worse, the sample method accepts Attributes. I agree with @iNikem that this is an important use case - but without a consistent base then that use case basically gets blocked. If some library, or some language even, does not populate an attribute while others do, setting up any sort of attribute-based sampling becomes a nightmare for users, if it's even possible.

It seems to be the same as the required field for conventions - presumably they are defined so that backends have a consistent base to work with, even though not all backends will actually use them all. But that base needs to be defined. I don't think I see a definition for what required actually means in RFC-speak - if required is a MUST, I'd expect this definition to also be MUST, or otherwise both could be SHOULD.

There is performance overhead in providing attributes on-start since in most cases it requires allocation of a dictionary, instead of doing this:

This comment specifically seems to point out the dangers - we can't provide a consistent experience to users if one language SIG happens to deem the performance implication too high and doesn't implement these at sampling time while other languages do.

specification/trace/semantic_conventions/http.md

Oberon00 · 2021-10-06T09:59:24Z

Do you think we can merge this? Then we could also finish up the build-tools stuff (merging open-telemetry/build-tools#70 without changing to SHOULD and cutting a new release that also contains the event name stuff from open-telemetry/build-tools#67)

lmolkova · 2021-10-06T20:35:55Z

Do you think we can merge this? Then we could also finish up the build-tools stuff (merging open-telemetry/build-tools#70 without changing to SHOULD and cutting a new release that also contains the event name stuff from open-telemetry/build-tools#67)

@Oberon00 yes, I believe so, thanks! and then I'll follow up on open-telemetry/build-tools#70

specification/trace/semantic_conventions/http.md

CHANGELOG.md

lmolkova · 2021-10-12T14:58:14Z

Discussed MUST vs SHOULD over Tuesday Instrumentation SIG meeting (10/5/21) , and came to the agreement to start with MUST, the summary:

Feasibility
- [out of scope of this PR] we should probably separate server and client specs more and make attributes sets more consistent (e.g. url for client and components for server)
- there is no reason to believe that it's not feasible, i.e.
  - clients have url before the request starts (Java's netty client has only components though)
  - and servers have components ready before they start span (have to start after reading context from headers)
SHOULD vs MUST:
- SHOULD does not have enough power, especially for people that are far from tracing UX
- Consistency is important (for sampling and common required attributes); instrumentations tend to be inconsistent, staying within current spec
- Start with a MUST and check if 90%+ of instrumentation can comply. If not, switch to SHOULD
There is no punishment for not complying, except some broken experience
- eventually, we may have a certification test for instrumentations, but no strong enforcement

bogdandrutu · 2021-10-12T15:35:21Z

@yurishkuro @open-telemetry/technical-committee we discussed this in the spec meeting, and decide to file an issue (to track if this is the right decision or not) that needs to be resolved before the stability of this document, and proceed with the current proposal.

carlosalberto · 2021-10-14T12:48:40Z

Merging as further details will be decided as part of #2011, as discussed in the previous Spec SIG call.

lmolkova requested review from a team September 9, 2021 16:56

github-actions bot assigned jmacd Sep 9, 2021

Oberon00 reviewed Sep 10, 2021

View reviewed changes

semantic_conventions/trace/http.yaml Show resolved Hide resolved

lmolkova force-pushed the define-http-sampling-attributes branch from bde6c05 to 2671561 Compare September 13, 2021 18:22

Oberon00 added area:semantic-conventions Related to semantic conventions spec:trace Related to the specification/trace directory labels Sep 14, 2021

bogdandrutu approved these changes Sep 15, 2021

View reviewed changes

lmolkova mentioned this pull request Sep 15, 2021

Support sampling_relevant attributes open-telemetry/build-tools#68

Merged

lmolkova force-pushed the define-http-sampling-attributes branch from 2671561 to e33430c Compare September 15, 2021 17:49

github-actions bot added the Stale label Sep 23, 2021

arminru mentioned this pull request Sep 23, 2021

Upgrade semconv generator to v0.7.0 #1959

Merged

lmolkova closed this Sep 23, 2021

lmolkova reopened this Sep 23, 2021

arminru requested review from a team September 23, 2021 15:55

arminru removed the Stale label Sep 23, 2021

yurishkuro reviewed Sep 23, 2021

View reviewed changes

specification/trace/semantic_conventions/http.md Show resolved Hide resolved

arminru reviewed Sep 23, 2021

View reviewed changes

specification/trace/semantic_conventions/http.md Show resolved Hide resolved

yurishkuro previously requested changes Sep 23, 2021

View reviewed changes

jmacd approved these changes Sep 23, 2021

View reviewed changes

lmolkova changed the title ~~Define HTTP attributes that should be provided at span creation time~~ Define HTTP attributes that MUST be provided at span creation time Sep 23, 2021

jmacd mentioned this pull request Sep 23, 2021

Sampling decison is too late to gain much performance #620

Open

lmolkova mentioned this pull request Sep 24, 2021

Sampling relevant: minor improvements open-telemetry/build-tools#70

Merged

carlosalberto approved these changes Sep 25, 2021

View reviewed changes

bogdandrutu reviewed Sep 29, 2021

View reviewed changes

specification/trace/semantic_conventions/http.md Show resolved Hide resolved

Oberon00 approved these changes Oct 2, 2021

View reviewed changes

specification/trace/semantic_conventions/http.md Show resolved Hide resolved

pyohannes approved these changes Oct 5, 2021

View reviewed changes

arminru approved these changes Oct 12, 2021

View reviewed changes

specification/trace/semantic_conventions/http.md Show resolved Hide resolved

CHANGELOG.md Outdated Show resolved Hide resolved

Define HTTP attributes that should be provided at span creation time

f34abb0

lmolkova force-pushed the define-http-sampling-attributes branch from 6cace67 to f34abb0 Compare October 12, 2021 15:08

jsuereth approved these changes Oct 12, 2021

View reviewed changes

lmolkova mentioned this pull request Oct 12, 2021

Confirm that HTTP instrumentations can provide sampling-relevant attributes at creation time #2011

Closed

Merge branch 'main' into define-http-sampling-attributes

729f3d0

carlosalberto merged commit ecc2635 into open-telemetry:main Oct 14, 2021

fbogsany mentioned this pull request Nov 1, 2021

net_http instrumentation not following semantic conventions for net.peer.name open-telemetry/opentelemetry-ruby#998

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Define HTTP attributes that MUST be provided at span creation time #1916

Define HTTP attributes that MUST be provided at span creation time #1916

lmolkova commented Sep 9, 2021 •

edited

Loading

Oberon00 commented Sep 10, 2021

Oberon00 commented Sep 10, 2021 •

edited

Loading

Oberon00 commented Sep 10, 2021

bogdandrutu left a comment

github-actions bot commented Sep 23, 2021

yurishkuro left a comment

jmacd commented Sep 23, 2021

yurishkuro commented Sep 23, 2021

jmacd commented Sep 23, 2021

lmolkova commented Sep 23, 2021 •

edited

Loading

jmacd commented Sep 23, 2021

yurishkuro commented Sep 23, 2021

lmolkova commented Sep 23, 2021

yurishkuro commented Sep 23, 2021

lmolkova commented Sep 23, 2021

yurishkuro commented Sep 23, 2021

lmolkova commented Sep 24, 2021 •

edited

Loading

iNikem commented Sep 24, 2021

anuraaga commented Sep 24, 2021 •

edited

Loading

Oberon00 commented Oct 6, 2021

lmolkova commented Oct 6, 2021 •

edited

Loading

lmolkova commented Oct 12, 2021

bogdandrutu commented Oct 12, 2021

carlosalberto commented Oct 14, 2021

Define HTTP attributes that MUST be provided at span creation time #1916

Define HTTP attributes that MUST be provided at span creation time #1916

Conversation

lmolkova commented Sep 9, 2021 • edited Loading

Changes

Oberon00 commented Sep 10, 2021

Oberon00 commented Sep 10, 2021 • edited Loading

Oberon00 commented Sep 10, 2021

bogdandrutu left a comment

Choose a reason for hiding this comment

github-actions bot commented Sep 23, 2021

yurishkuro left a comment

Choose a reason for hiding this comment

jmacd commented Sep 23, 2021

yurishkuro commented Sep 23, 2021

jmacd commented Sep 23, 2021

lmolkova commented Sep 23, 2021 • edited Loading

jmacd commented Sep 23, 2021

yurishkuro commented Sep 23, 2021

lmolkova commented Sep 23, 2021

yurishkuro commented Sep 23, 2021

lmolkova commented Sep 23, 2021

yurishkuro commented Sep 23, 2021

lmolkova commented Sep 24, 2021 • edited Loading

iNikem commented Sep 24, 2021

anuraaga commented Sep 24, 2021 • edited Loading

Oberon00 commented Oct 6, 2021

lmolkova commented Oct 6, 2021 • edited Loading

lmolkova commented Oct 12, 2021

bogdandrutu commented Oct 12, 2021

carlosalberto commented Oct 14, 2021

lmolkova commented Sep 9, 2021 •

edited

Loading

Oberon00 commented Sep 10, 2021 •

edited

Loading

lmolkova commented Sep 23, 2021 •

edited

Loading

lmolkova commented Sep 24, 2021 •

edited

Loading

anuraaga commented Sep 24, 2021 •

edited

Loading

lmolkova commented Oct 6, 2021 •

edited

Loading