aiohttp client #421

joshuahlang · 2020-02-14T02:28:36Z

Initial attempt at an aiohttp-client integration. Still needs some work to document the usage at a minimum.

GoPavel · 2020-02-16T13:22:19Z

I think that trace_config's hooks on request start, request end, and request exception are good, but not enough. Your typical usage of aiohttp:

async with session.get(...) as response:
     # this code out of the span
     ...

Currently, trace hooks track the life-time of aiohttp.ClientRequest, but when errors occur with response(JsonError, from raise_for_status) your span not fail and have StatusCanonicalCode.OK.
So you should track the life-time of aiohttp.ClientResponse. It is also important for stream request, where any problem with stream connection should be detected by tracing:

async with session.get(...) as response:
    # now ClientRequest is closed.
   async for chunk in response.content:
       # this code out of the span
       ...

I tried to find a proper way to hook it, but on_response_chunk_received seems not to work.
So I can suggest only replace response_class with a wrapper.
My minimalistic response wrapper example:

class ClientResponseWithTracing(aiohttp.ClientResponse):
    def _start_span(self):
        ctx = tracer.start_as_current_span(self.url, kind=SpanKind.CLIENT)
        span_exit = ctx.__exit__
        span: Span = ctx.__enter__()
        setattr(self, '_span', span)
        setattr(self, '_exit_span', span_exit)

    def _release_span(self):
        if not hasattr(self, '_span'):
            return
        span: Span = getattr(self, '_span')
        exc_val: Optional[BaseException] = self.content.exception()
        if exc_val is not None:
            add_exception_to_events(exc_val, span)
            exc_type, exc_tb = type(exc_val), exc_val.__traceback__
            span.set_status(Status(StatusCanonicalCode.INTERNAL, exc_type.__name__))
            getattr(self, '_exit_span')(exc_type, exc_val, exc_tb)
        else:
            getattr(self, '_exit_span')(None, None, None)
        delattr(self, '_span')

    def close(self):
        self._release_span()
        super().close()

    def release(self) -> Any:
        self._release_span()
        super().release()

async def on_request_end_hook(session, trace_config_ctx, params):
    ...
    params.response._start_span()

joshuahlang · 2020-02-17T18:12:07Z

Calling raise_for_status shouldn't prevent the trace status from being inferred from the HTTP status code. That happens prior to the aiohttp.ClientResponse object being returned to the caller. You're right that the on_response_chunk_received callback isn't very functional. It appears only to be triggered when calling aiohttp.ClientResponse.read here.

This raises an interesting question: What should be included in the span itself? E.g. if the server returns a non-JSON response, but the client tries to parse it as JSON (e.g. calling aiohttp.ClientResponse.json()) should that be reported as an error in the span for the HTTP request? It probably should for a connection related error, such as the server failing to send the complete response content. It becomes a bit more questionable if the server returns a content-type encoding that doesn't match the actual content encoding, and even more still if the client incorrectly calls aiohttp.ClientResponse.json() on a response

GoPavel · 2020-02-18T15:44:00Z

I want to pay attention to the stream queries. We can receive status 200 and after that, the stream can suddenly close with ClientPayloadError. I am certain that this exception related to the request span.

I have been a bit hasty when passing any exception to _exit_span in my ClientResponseWithTracing. Maybe the best-effort way is to try to decide whether the error related to request problems or related to business logic error.
For example:

if  isinstance(exc_val, aiohttp.ClientError) or isinstance(exc_val, json.JSONDecodeError):
   add_exception_to_events(exc_val, span)
   exc_type, exc_tb = type(exc_val), exc_val.__traceback__
   span.set_status(Status(StatusCanonicalCode.INTERNAL, exc_type.__name__))
   getattr(self, '_exit_span')(exc_type, exc_val, exc_tb)
else:
   getattr(self, '_exit_span')(None, None, None)

GoPavel · 2020-02-18T16:11:26Z

Calling raise_for_status shouldn't prevent the trace status from being inferred from the HTTP status code. That happens prior to the aiohttp.ClientResponse object being returned to the caller. You're right that the on_response_chunk_received callback isn't very functional. It appears only to be triggered when calling aiohttp.ClientResponse.read here.

Yes, you right. I tested with the default behavior when you call raise_for_status manually. Maybe, in that case, the user is responsible for HTTP status handling.

This raises an interesting question: What should be included in the span itself? E.g. if the server returns a non-JSON response, but the client tries to parse it as JSON (e.g. calling aiohttp.ClientResponse.json()) should that be reported as an error in the span for the HTTP request? It probably should for a connection related error, such as the server failing to send the complete response content. It becomes a bit more questionable if the server returns a content-type encoding that doesn't match the actual content encoding, and even more still if the client incorrectly calls aiohttp.ClientResponse.json() on a response

Maybe OT-spec will state something about that in the future.

c24t · 2020-02-27T23:40:47Z

@joshuahlang is this ready for review?

joshuahlang · 2020-02-28T02:59:24Z

There's a few open issues in the comments already, but does can be addressed in the formal review. I'll remove the WIP prefix once I finish the unit tests, which hopefully will be by EOD 2020-02-28 PST

joshuahlang · 2020-02-29T02:34:39Z

@joshuahlang is this ready for review?

Ready now. Still an open need to update the eachdist.py/tox.ini to not attempt to install eggs on versions of python which they don't support. E.g. this module (ext-aiohttp-client) is not supported on Python 3.4 and causes the 3.4 coverage task to fail.

toumorokoshi

I think a couple minor discussions on response code standards, but looks good otherwise!

ext/opentelemetry-ext-aiohttp-client/src/opentelemetry/ext/aiohttp_client/__init__.py

ext/opentelemetry-ext-aiohttp-client/tests/test_aiohttp_client_integration.py

toumorokoshi · 2020-03-09T17:58:10Z

ext/opentelemetry-ext-aiohttp-client/tests/test_aiohttp_client_integration.py

+                )
+                self.InMemoryExporter.clear()
+
+    def test_url_filter_option(self):


I believe query parameters should be a part of the atttribute:

https://github.com/open-telemetry/opentelemetry-specification/blob/master/specification/data-http.md#common-attributes

Full HTTP request URL in the form scheme://host[:port]/path?query[#fragment]. Usually the fragment is not transmitted over HTTP, but if it is known, it should be included nevertheless.

If the client wants to strip them for whatever reason, then they should be removed from the span completely. This is merely testing that a client-provided callback is actually called and used to process the URL before adding it to the span. In this case it simply removes all query params, but likely clients will use it to selectively remove API keys or PII from query params.

ah ok. a little nervous that the test is one that causes behavior which violates the spec, but sounds good.

Should the spec should be updated to account for this? Some things still pass secrets or PII in query strings. May be good to acknowledge that in the spec?

toumorokoshi · 2020-03-09T19:16:31Z

Currently, trace hooks track the life-time of aiohttp.ClientRequest, but when errors occur with response(JsonError, from raise_for_status) your span not fail and have StatusCanonicalCode.OK.
So you should track the life-time of aiohttp.ClientResponse. It is also important for stream request, where any problem with stream connection should be detected by tracing:

This is an interesting one. in that context, what does on_request_end really mean? docs seem a little sparse: https://docs.aiohttp.org/en/stable/tracing_reference.html#aiohttp.TraceConfig.on_request_end

This raises an interesting question: What should be included in the span itself? E.g. if the server returns a non-JSON response, but the client tries to parse it as JSON (e.g. calling aiohttp.ClientResponse.json()) should that be reported as an error in the span for the HTTP request? It probably should for a connection related error, such as the server failing to send the complete response content. It becomes a bit more questionable if the server returns a content-type encoding that doesn't match the actual content encoding, and even more still if the client incorrectly calls aiohttp.ClientResponse.json() on a response

Agree with @GoPavel that this is a good discussion for the spec. But today, HTTP instrumentation do not concern themselves with handling mismatched content types, or anything to do with processing the response, except for the timing.

One major implementation philosophy of OpenTelemetry is to not interfere with the application runtime as much as possible. I think content type validation would be significant overhead we don't want to be involved in.

This module is only supported on Python3.5, which is the oldest supported by aiohttp.

joshuahlang · 2020-05-04T15:53:22Z

@toumorokoshi Had some time this weekend to rebase this off the latest in OTel.

toumorokoshi · 2020-05-05T05:39:05Z

@joshuahlang great! Looks like CI is still failing but seems unrelated to your changes, taking a look

ext/opentelemetry-ext-aiohttp-client/src/opentelemetry/ext/aiohttp_client/version.py

scripts/jaeger.sh

scripts/coverage.sh

codeboten

Thanks for coming back to this! I've added some minor comments but overall it looks great. I'm requesting changes because of the following missing files: CHANGELOG.md, LICENSE, MANIFEST.in.

Hopefully they'll be pretty quick to add, you can see an example here: https://github.com/open-telemetry/opentelemetry-python/tree/master/ext/opentelemetry-ext-requests

ext/opentelemetry-ext-aiohttp-client/src/opentelemetry/ext/aiohttp_client/__init__.py

ext/opentelemetry-ext-aiohttp-client/tests/test_aiohttp_client_integration.py

ocelotl

Just for the record, we are not adding instrumentations (previously known as integrations) as children of BaseInstrumentor. Nevertheless, this can be approved now and migrated later to an instrumentation if that is more convenient, I guess.

scripts/coverage.sh

ocelotl · 2020-05-05T21:27:49Z

ext/opentelemetry-ext-aiohttp-client/src/opentelemetry/ext/aiohttp_client/__init__.py

+
+
+# TODO: refactor this code to some common utility
+def http_status_to_canonical_code(status: int) -> StatusCanonicalCode:


This code looks very similar to this one.

It does, hence the TODO. Not sure where it could be shared from though. opentelemetry-sdk perhaps? I'm not very familiar with the OTel project structure

ocelotl · 2020-05-05T21:46:54Z

ext/opentelemetry-ext-aiohttp-client/src/opentelemetry/ext/aiohttp_client/__init__.py

+        elif callable(trace_config_ctx.span_name):
+            request_span_name = str(trace_config_ctx.span_name(params))
+        else:
+            request_span_name = str(trace_config_ctx.span_name)


Is it really necessary to have an argument that can be either a callable or a string? Is it feasible that the span_name argument is only a string type argument and leave the processing of params (if necessary) to the caller of on_request_start who will have the responsibility of providing the span name to on_request_start?

on_request_start is called by aiohttp internally. aiohttp doesn't know the caller's context (e.g. that a request has a sensitive value in the query string). The higher-level caller instrumenting aiohttp.client doesn't have access to mutate the params directly. params is an internal construct used by aiohttp for tracing.

ext/opentelemetry-ext-aiohttp-client/src/opentelemetry/ext/aiohttp_client/__init__.py

ext/opentelemetry-ext-aiohttp-client/tests/test_aiohttp_client_integration.py

codeboten

Looks like only a couple of unused imports left to cleanup in the tests, otherwise this is looking good. Thanks!

toumorokoshi · 2020-05-07T03:41:03Z

@joshuahlang thanks so much for all the work! Exciting to get this in.

Adding initial aiohttp client. This module is only supported on Python3.5, which is the oldest supported by aiohttp. Co-authored-by: Yusuke Tsutsumi <yusuke@tsutsumi.io>

* chore: update README * chore: update README * fix: review comments

joshuahlang requested a review from a team February 14, 2020 02:28

joshuahlang force-pushed the aiohttp-client branch 3 times, most recently from 2d8b1ec to ff58602 Compare February 15, 2020 01:16

joshuahlang force-pushed the aiohttp-client branch from ff58602 to d8160cd Compare February 20, 2020 19:22

joshuahlang force-pushed the aiohttp-client branch 2 times, most recently from 17f907f to 0384670 Compare February 28, 2020 02:45

joshuahlang force-pushed the aiohttp-client branch 2 times, most recently from cda2d24 to 958aa88 Compare February 29, 2020 02:16

joshuahlang changed the title ~~WIP: aiohttp client~~ aiohttp client Feb 29, 2020

toumorokoshi suggested changes Mar 9, 2020

View reviewed changes

joshuahlang force-pushed the aiohttp-client branch 3 times, most recently from 35c0d76 to d329d55 Compare March 10, 2020 17:17

joshuahlang force-pushed the aiohttp-client branch from d329d55 to 311ba67 Compare March 30, 2020 19:44

ext-aiohttp-client implementation

1d51ed2

This module is only supported on Python3.5, which is the oldest supported by aiohttp.

joshuahlang force-pushed the aiohttp-client branch from 311ba67 to 1d51ed2 Compare May 2, 2020 18:51

Merge branch 'master' into aiohttp-client

d7286d2

Merge branch 'master' into aiohttp-client

e1b34ae

toumorokoshi approved these changes May 5, 2020

View reviewed changes

codeboten reviewed May 5, 2020

View reviewed changes

ext/opentelemetry-ext-aiohttp-client/src/opentelemetry/ext/aiohttp_client/version.py Outdated Show resolved Hide resolved

codeboten reviewed May 5, 2020

View reviewed changes

scripts/jaeger.sh Outdated Show resolved Hide resolved

codeboten reviewed May 5, 2020

View reviewed changes

scripts/coverage.sh Show resolved Hide resolved

codeboten suggested changes May 5, 2020

View reviewed changes

Changes from code review feedback

cc3cdb6

joshuahlang force-pushed the aiohttp-client branch from 568d522 to cc3cdb6 Compare May 5, 2020 20:42

ocelotl approved these changes May 5, 2020

View reviewed changes

More review feedback

76600a2

joshuahlang force-pushed the aiohttp-client branch from 14548cc to 76600a2 Compare May 5, 2020 23:29

Fix lint issues

641cdbe

joshuahlang force-pushed the aiohttp-client branch from dbdaff3 to 641cdbe Compare May 6, 2020 00:35

codeboten approved these changes May 6, 2020

View reviewed changes

Remove unused imports

579ce7a

joshuahlang force-pushed the aiohttp-client branch from 210ef13 to 579ce7a Compare May 6, 2020 21:12

Merge branch 'master' into aiohttp-client

ec82fcd

toumorokoshi merged commit e800d26 into open-telemetry:master May 7, 2020

srikanthccv pushed a commit to srikanthccv/opentelemetry-python that referenced this pull request Nov 1, 2020

Update readme (open-telemetry#421)

8b20e41

* chore: update README * chore: update README * fix: review comments

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

aiohttp client #421

aiohttp client #421

joshuahlang commented Feb 14, 2020

GoPavel commented Feb 16, 2020 •

edited

Loading

joshuahlang commented Feb 17, 2020

GoPavel commented Feb 18, 2020

GoPavel commented Feb 18, 2020

c24t commented Feb 27, 2020

joshuahlang commented Feb 28, 2020

joshuahlang commented Feb 29, 2020

toumorokoshi left a comment

toumorokoshi Mar 9, 2020

joshuahlang Mar 10, 2020

toumorokoshi May 5, 2020

joshuahlang May 5, 2020

toumorokoshi commented Mar 9, 2020

joshuahlang commented May 4, 2020

toumorokoshi commented May 5, 2020

codeboten left a comment

ocelotl left a comment

ocelotl May 5, 2020

joshuahlang May 5, 2020

ocelotl May 5, 2020

joshuahlang May 5, 2020

codeboten left a comment

toumorokoshi commented May 7, 2020



		# TODO: refactor this code to some common utility
		def http_status_to_canonical_code(status: int) -> StatusCanonicalCode:

aiohttp client #421

aiohttp client #421

Conversation

joshuahlang commented Feb 14, 2020

GoPavel commented Feb 16, 2020 • edited Loading

joshuahlang commented Feb 17, 2020

GoPavel commented Feb 18, 2020

GoPavel commented Feb 18, 2020

c24t commented Feb 27, 2020

joshuahlang commented Feb 28, 2020

joshuahlang commented Feb 29, 2020

toumorokoshi left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

toumorokoshi commented Mar 9, 2020

joshuahlang commented May 4, 2020

toumorokoshi commented May 5, 2020

codeboten left a comment

Choose a reason for hiding this comment

ocelotl left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

codeboten left a comment

Choose a reason for hiding this comment

toumorokoshi commented May 7, 2020

GoPavel commented Feb 16, 2020 •

edited

Loading