-
Notifications
You must be signed in to change notification settings - Fork 92
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fix boundaries for Symfony HttpKernel auto instrumentation #317
base: main
Are you sure you want to change the base?
Fix boundaries for Symfony HttpKernel auto instrumentation #317
Conversation
Thanks for opening your first pull request! If you haven't yet signed our Contributor License Agreement (CLA), then please do so that we can accept your contribution. A link should appear shortly in this PR if you have not already signed one. |
|
e78c015
to
19f787e
Compare
@@ -80,6 +80,30 @@ public static function register(): void | |||
array $params, | |||
?Response $response, | |||
?\Throwable $exception | |||
): void { | |||
$scope = Context::storage()->scope(); | |||
if (null === $scope || null === $exception) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
So if an exception occurs, this post callback will run, otherwise the terminate
one will? If so, we should document this.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That is correct.
I'm assuming this should be documented in the README.md, right?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The examples on this page seem to suggest that the handle
and terminate
can both run.
So it looks like you'd expect 2 spans to be created and then with these changes only close one of them? I'm not so sure that behaviour is correct.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There appears to be a parameter to configure whether handle
should throw here.
So when $catch = false
the handle post-hook might have an exception and terminate
will not have been reached yet. I think this would be fine as it handles the recording of the exception and detaches the scope.
However, if $catch = false
and there is no exception, then terminate()
would be called after handle
and the terminate
post-hook could try to close a scope which does not belong to it.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
-
If
handle
throws an exception which is never caught, thenterminate
will never be reached either, according to theHttpKernelRunner
implementation.
In this situation, it is correct to close the scope on thepost
hook ofhandle
. -
If
handle
does not throw an exception (or has$catch = true
), thenterminate
would run afterwards, closing the scope.
The only problem I see is there is not a 100% guarantee that terminate
will always be executed after handle
.
For example, something else might go wrong in HttpKernelRunner
after handle
was called, e.g. fastcgi_finish_request()/litespeed_finish_request()
might crash, flush()
might crash, etc, leaving the scope still open.
Unsure how much of a real issue this is, in most cases.
Normally this would be solved by hooking into HttpKernelRunner::run
with both pre
and post
hooks, as it covers both handle
and terminate
, however $request
and $response
are not available as parameters there, so I am at a loss right now.
The main issue is that the parent
span is lost in between handle
and terminate
, due to handle
detaching it, thus producing two different traces for the same request:
- one containing everything from the start of the HTTP request up to the end of
handle
- one containing just the dispatch of
TerminateEvent
done by$this->kernel->terminate
afterwards
the second should be part of the first, as it is still logically part of the same HTTP request lifecycle.
We need either a compromise or another solution entirely.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for that @dciprian-petrisor - I had another look with less bleary eyes and I finally caught up to you guys 💪
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I am having the same problem for Drupal, but opted to do an auto root span for now, which obviously duplicates the logic to do that.
That way I have a duplicated span for the Kernel::handle() + Kernel::terminate(), but that does not matter as it's still within one root trace.
The Auto Root Span uses the SDK ShutdownHandler::register so that might be also an option here.
The other thing is:
You could hook into KernelRunner::run(), but not open the scope here, but instead close it just during the post() [if ::run() was called in pre].
While that is more logic to handle, because handle() needs to check that ::run was actually called, so that it can close it, cross hook frontiers are a good way to handle this.
::run pre -> $runCalled = 1
::handle pre -> Open root scope like normal
::handle post -> if ($runCalled) {} -> Do not bother to close scope, else fallback to current logic
::terminate pre -> Open span like normal
::terminate post -> Close span like normal
::run post -> Close scope
In general I personally will still call the Auto Root Scope hook as early as possible (e.g. DrupalKernel::__construct in my case), because else there is always an ::IO extension call, which escapes.
Until that's converted to SPI obviously there are still race conditions.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is my implementation:
<?php
namespace OpenTelemetry\Contrib\Instrumentation\Drupal;
use Drupal\Core\DrupalKernel;
use OpenTelemetry\SDK\Trace\AutoRootSpan;
use function OpenTelemetry\Instrumentation\hook;
class DrupalAutoRootSpan {
public static function register(): void {
hook(
DrupalKernel::class,
'__construct',
static::registerAutoRootSpan(),
NULL,
);
}
/**
* @param \OpenTelemetry\API\Instrumentation\CachedInstrumentation $instrumentation
*
* @return \Closure
*/
public static function registerAutoRootSpan(): \Closure {
return static function (DrupalKernel $kernel, array $params, string $class, string $function, ?string $filename, ?int $lineno) {
$request = AutoRootSpan::createRequest();
if ($request) {
AutoRootSpan::create($request);
AutoRootSpan::registerShutdownHandler();
}
};
}
}
It works really well.
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## main #317 +/- ##
============================================
- Coverage 79.60% 78.66% -0.95%
+ Complexity 729 610 -119
============================================
Files 69 60 -9
Lines 2952 2606 -346
============================================
- Hits 2350 2050 -300
+ Misses 602 556 -46
Flags with carried forward coverage won't be shown. Click here to find out more.
... and 9 files with indirect coverage changes Continue to review full report in Codecov by Sentry.
|
Fixes open-telemetry/opentelemetry-php#1443