Add OpenTelemetry Application Performance Monitoring to core performance tests [#3352459]

Problem/Motivation

Postponed on #3346765: Add PerformanceTestBase for allowing browser performance assertions within FunctionalJavaScriptTests and #3352389: Add open-telemetry/sdk and open-telemetry/exporter-otlp as dev dependencies.

See #638078: Automated performance testing for core for rationale.

We're not able to easily see changes on core backend and frontend performance over time, this MR adds the capability to display graphs in a grafana/tempo instance as well as browse individual traces to see what's different.

Steps to reproduce

Proposed resolution

Performance dashboard in Grafana showing time to first byte and largest contentful paint for front and node/1

#3346765: Add PerformanceTestBase for allowing browser performance assertions within FunctionalJavaScriptTests gives us the ability to pass or fail tests based on certain performance characteristics, but this only works for things you can count.

For things that can't be counted but only timed (like how long a request takes from first byte to first paint), we could instead look at graphing these over time and flagging changes.

Open Telemetry is a suite of tools that will allow us to do this via adding some development dependencies, and setting up a 'collector' on Drupal.org infrastructure.

We can add the capability to PerformanceTestBase to send traces to OpenTelemetry, these can then be made available for browsing in signoz. signoz will show both trends over time and allow you to drill down into individual traces. See the screenshot for a proof of concept.

Traces are currently using data from the performance navigation API: https://developer.mozilla.org/en-US/docs/Web/API/PerformanceNavigationTi... via executeScript() - this lets us log this information without running any extra JavaScript on the tested site.

Note that the MR doesn't include the OpenTelemetry collector or the configuration to enable this, that will have to be handled on the infrastructure side. However there is a test repo using this patch that gives you a full end-to-end installation via ddev. https://github.com/tag1consulting/google-drupal

Remaining tasks

Once we have the initial framework for sending traces, we can add more tests, and more data to traces.

#3377660: Add authenticated user telemetry tests
#3377657: Add database query spans to otel traces
#3377654: Support 'interaction to next paint' (or close equivalent) in performance testing
#3377655: Add script, style, image HTTP requests to otel traces
#3379761: Add the commit hash to OpenTelemetry traces
#3379757: Track largestContentfulPaint::candidate events in PerformanceTestBase and allow assertions on them
#3379750: Figure out cold start issue with chromedriver performance logs

User interface changes

API changes

A new PerformanceTestTrait is added, it is included in the existing PerformanceTestBase.

A PerformanceData value object has been added in the Drupal\Tests namespace, this is only used by PerformanceTestTrait.

To get performance data to assert on within a test:

Before:

  $this->drupalGet('node/1');
  $this->assertSame(2, $this->styleSheetCount);
  $this->assertSame(2, $this->scriptCount);

After, to record performance data to assert on, explicitly enable profiling via calling the 'collectPerformanceData()' method, which takes a callable.

    $performance_data = $this->collectPerformanceData(function () {
      $this->drupalGet('node/1');
    });
    $this->assertSame(2, $performance_data->getStylesheetCount());
    $this->assertSame(1, $performance_data->getScriptCount());

The former behaviour of always recording performance data and adding information on PerformanceTestBase properties is removed, but is not in a tagged release yet, so no bc is provided, just existing tests updated:

To log telemetry to open telemetry, call the new ::logTelemetry() method, API is otherwise the same as ::collectPerformancedata(), so it's possible to combine both telemetry logging but also assert on PerformanceData from the same request.

    $this->logTelemetry('umamiFrontPageWarmCache', function () {
      $this->drupalGet('<front>');
    });

Data model changes

Release notes snippet

Tests extending PerformanceTestBase can now additionally send OpenTelemetry traces to an open telemetry endpoint, but setting the OTEL_COLLECTORenvironment variable. An OpenTelemetry collecter must be accessible from the environment running the test.

Comment	File	Size	Author
#92	100x_PerformanceTest.patch	2.01 KB	spokje
#91	1000x_PerformanceTest.patch	2.01 KB	spokje
#85	3352459-followup.patch	915 bytes	catch
#60	Screenshot 2023-09-18 at 2.28.37 PM.png	498.62 KB	smustgrave
#54	3352459-nr-bot.txt	90 bytes	needs-review-queue-bot
#52	3352459-nr-bot.txt	90 bytes	needs-review-queue-bot
#39	Screenshot from 2023-07-18 14-55-45.png	596.39 KB	catch
#39	Screenshot from 2023-07-18 13-32-25.png	529 KB	catch
#38	Screenshot from 2023-07-18 10-58-46.png	148.2 KB	catch
#37	Screenshot from 2023-07-15 17-25-35.png	157.5 KB	catch
#36	Screenshot from 2023-07-13 22-31-43.png	203.66 KB	catch
#33	Screenshot from 2023-07-09 13-39-27.png	217.17 KB	catch
#31	Screenshot from 2023-07-09 13-28-34.png	341.19 KB	catch
#31	Screenshot from 2023-07-09 13-27-50.png	290.58 KB	catch
#30	Screenshot from 2023-07-07 22-42-13.png	267.55 KB	catch
#28	Screenshot from 2023-07-04 11-23-56.png	321.37 KB	catch
#22	3352459-nr-bot.txt	2.08 KB	needs-review-queue-bot
#19	3352459-19.patch	6.21 KB	catch
#13	Screenshot from 2023-05-13 09-36-06.png	229.58 KB	catch
#12	3352459-12.patch	6.21 KB	catch
#11	3352459-11.patch	6.62 KB	catch
#10	3352459-10.patch	5.64 KB	catch
#9	3352459-9.patch	5.55 KB	catch
#8	3352459-8.patch	7.24 KB	catch
#6	3352459-6.patch	7.22 KB	catch
#6	3352459-interdiff.txt	2.82 KB	catch
#5	3352459-5.patch	7.21 KB	catch
#4	3352459-4.patch	7.22 KB	catch
#3	3352459.patch	7.2 KB	catch
#2	3352459.patch	6.67 KB	catch

Issue fork drupal-3352459

Show commands

Start within a Git clone of the project using the version control instructions.

Add & fetch this issue fork’s repository

Or, if you do not have SSH keys set up on git.drupalcode.org:

Add & fetch this issue fork’s repository

3352459-trait changes, plain diff MR !4900
Check out this branch for the first time

Check out existing branch, if you already have it locally

1 hidden branch

3352459-log-traces-from

changes, plain diff MR !4226

Check out this branch for the first time

Check out existing branch, if you already have it locally

About issue forks

Comments

Comment #1

5 April 2023 at 16:16

catch created an issue. See original summary.

Title:	Log traces from performance tests to OpenTelemetry	» [PP-2] Log traces from performance tests to OpenTelemetry
Status:	Active	» Postponed

Status	File	Size
hidden	3352459.patch	6.67 KB
hidden	3352459.patch	7.2 KB
hidden	3352459-4.patch	7.22 KB
hidden	3352459-5.patch	7.21 KB

Title:	Log traces from performance tests to OpenTelemetry	» Add OpenTelemetry Application Performance Monitoring to core performance tests
Issue summary:	View changes

Status:	Reviewed & tested by the community	» Needs work
Issue tags:		+Needs followup

Status:	Fixed	» Reviewed & tested by the community
Issue tags:	-Needs followup, -Needs change record updates