Problem/Motivation
#1014086: Stampedes and cold cache performance issues with css/js aggregation changes aggregate URLs to a hash + query string with theme, delta, language and libraries.
The libraries list is the 'minimum representative set' (i.e. leaves of the dependency tree that are not dependents of anything else), but can still get quite long. user/1 visiting the front page of the standard profile ends up with query strings nearly 600 chars long. That gives us some headroom before it reaches 1,000, but it's not indefinite.
There is no RFC limit on URL or query string length, and we no longer support old versions of IE that used to have an arbitrary limit of around 2,000, but it's still possible for servers to set a maximum length (for example see discussion in https://stackoverflow.com/questions/812925/what-is-the-maximum-possible-...).
The most likely way someone would run into this is a site with 300 modules on quite a restrictive hosting platform, which is entirely possible, even if it'll be relatively uncommon.
@alexpott suggested compressing the query string if it's going to go over 950 characters.
Steps to reproduce
Proposed resolution
There are a couple of ways we could do this:
1. In the original issue, before we had the 'minimum representative set' option for libraries, #1014086-100: Stampedes and cold cache performance issues with css/js aggregation suggested keeping a lookup table, using a base64 increment or similar, so that each library could be represented by just one or two characters in the query string. We'd have to build that lookup table on library discovery and consult it both when building URLs and when building aggregates. Somewhat complex to implement, but it would be extremely efficient for the actual URL length, which would end up looking something like A%2c0%2cf%2cbA
instead of drupal/once%2cdrupal/backbone%2cdrupal/ajax
2. @alexpott suggested using gzcompress(), i.e.:
strlen(base64_encode(gzcompress("contextual/drupal.contextual-links%2Csystem/base%2Colivero/global-styling%2Ccore/drupal.active-link%2Colivero/powered-by-block%2Colivero/feed /drupal.debounce%2Ctoolbar/toolbar%2Cuser/drupal.user.icons%2Ccore/shepherd%2Ctour/tour-styling%2Ctour/tour%2Ccore/drupal.tabbingmanager%2Ccontextual/drupal.contextual-tool
This reduces 544 characters => 324 characters.
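For illustration, option 2's round trip can be sketched in a few lines. This is a hedged sketch, not the patch's code; the library list below is a shortened stand-in for the full string quoted above:

```php
<?php

// Option 2 sketch: compress the comma-separated library list and
// base64-encode it so it is safe to carry in a query string.
$libraries = 'contextual/drupal.contextual-links,system/base,olivero/global-styling,'
  . 'core/drupal.active-link,olivero/powered-by-block,toolbar/toolbar,'
  . 'user/drupal.user.icons,core/drupal.tabbingmanager';

$compressed = base64_encode(gzcompress($libraries));

// Reversing the steps gives back the original list.
assert(gzuncompress(base64_decode($compressed)) === $libraries);
printf("%d => %d characters\n", strlen($libraries), strlen($compressed));
```

The repetition in library names ("drupal.", "olivero/", etc.) is what makes zlib effective on such short strings.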
I think we should go with #2, at least until there's a compelling reason to go with #1, because #2 will be trivial to implement, and #1 will be hard.
A further complication is we've been hoping to use the URL information for ajaxPageState: #3279206: Dynamically determine ajaxPageState based on libraries. I guess if we're only ever going to pass that list back to PHP at some point, maybe it's fine to just pass the encoded list around? But is it that simple?
Remaining tasks
User interface changes
API changes
Data model changes
Release notes snippet
Comment | File | Size | Author |
---|---|---|---|
#42 | 3303067-42.patch | 13.94 KB | catch |
#42 | 3303067-42-interdiff.txt | 722 bytes | catch |
#38 | 3303067-39.patch | 13.93 KB | catch |
#38 | interdiff-39.txt | 1.97 KB | catch |
#37 | 3303067-37.patch | 14.94 KB | catch |
Comments
Comment #2
nod_: We can also add a new data attribute on the script tags that holds this information in clear format, so we don't need to mess around with URL parsing.
Comment #3
nod_: I think it'd be that simple. We don't use it in the frontend; it's only sent back to PHP.
Comment #4
catch: Took a look at this.
I started off trying to implement compression only when the list was going to be over 950 chars, but then wondered if we really need the two code-paths:
We need to deal with both include and exclude, which increases the risk of the query string getting very long, since any limit will apply to the entire string rather than to each part. Compressing only above 475 chars per parameter is quite a low threshold, and if we start checking whether both strings are present etc., that seems a bit overkill.
The main reason to conditionally compress would be to save some CPU cycles, and maybe to make debugging easier. However, this is all behind caching, and if we move some code around, we do the compression once per asset type, not once per asset group. I'm also not sure two different formats help debugging that much over a shorter code path.
Additionally, given the query string is on every asset URL, compressing is going to save some bytes on the HTML page itself: say we've got a 550-char string that's on four JS assets and four CSS assets, that's 2,200 characters, and compression gets that down closer to 1,000.
This has all given me another idea too, but uploading a theoretically working patch before trying that.
Comment #5
catch: Alright, the more comprehensive version seems like it might be a goer. No interdiff because it's more or less an entirely new patch, but here's what's different.
The one new constraint is that this now needs to handle a nested array, so we can't use implode()/explode(). We also can't use PHP's serialize()/unserialize() because the value is user data, so we use json_encode()/json_decode() instead.
The 'include' and 'exclude' query parameters potentially have a lot of strings in common; since we might have to compress both, we might as well compress them into a single string. Compressing one slightly larger string should also be more CPU-efficient than compressing two smaller ones.
We can also include the theme in there, since that's common to every URL and similarly may contain duplicate strings (i.e. 'olivero' provides a couple of 'olivero/foo' libraries).
We also drop the separate 'include/exclude/theme' query argument keys, and chuck all this in a single 'data' key, which doesn't save a lot but slightly offsets the extra characters from using json_encode().
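The single-'data'-key idea can be sketched like this (a hedged illustration: the key names and values are made up for the example, not the patch's exact format):

```php
<?php

// Sketch of bundling theme + include + exclude into one compressed 'data'
// query parameter. serialize() is avoided because the value round-trips
// through user-controllable input; json_encode()/json_decode() are safe
// and handle the nested arrays that implode()/explode() can't.
$data = [
  'theme' => 'olivero',
  'include' => ['core/once', 'olivero/global-styling'],
  'exclude' => ['core/drupal.ajax'],
];

$query_value = base64_encode(gzcompress(json_encode($data)));
$restored = json_decode(gzuncompress(base64_decode($query_value)), TRUE);
assert($restored === $data);
```

Compressing everything together lets zlib exploit the strings shared between 'include', 'exclude' and the theme name.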
Comparisons of a real example (front page of a 10.1.x standard install as user/1 again):
HEAD: 638 chars
Only compressing 'include', and using implode(): 438 chars
Newest approach: 477 chars
So this results in a slightly longer compressed string, but it's more robust against a string with both include and exclude query parameters.
One more possible approach here, but again going to upload progress.
Comment #6
catch: One more iteration.
Instead of using json_encode()/json_decode(), have the new UrlHelper methods only accept a string.
However, we still compress all the bits of the URL we can, by custom-encoding the array: 437 chars.
This results in a URL literally one character shorter than when we only encode 'include', but it'll be more robust when there's a list of libraries in 'exclude' and cheaper to generate. If someone wanted to use the compress/uncompress functions with json_encode()/json_decode() they still can - just call it on an array before passing in the string.
Part of the reason I wanted to try a slightly more generic approach is because there's a possibility we can use this for #956186: Allow AJAX to use GET requests too.
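The string-only helper approach described in this comment might look roughly like the following sketch (the class is a stand-in and the method names follow the discussion, not necessarily the committed core code):

```php
<?php

// String-in/string-out helpers, roughly as discussed: callers that need
// to compress an array can json_encode() it first themselves.
final class UrlHelperSketch {

  /**
   * Compresses a string for use as a query parameter value.
   */
  public static function compressQueryParameter(string $data): string {
    return base64_encode(gzcompress($data));
  }

  /**
   * Reverses ::compressQueryParameter().
   */
  public static function uncompressQueryParameter(string $compressed): string|false {
    $decoded = base64_decode($compressed, TRUE);
    return $decoded === FALSE ? FALSE : gzuncompress($decoded);
  }

}

$libraries = 'system/base,olivero/global-styling,core/drupal.active-link';
$short = UrlHelperSketch::compressQueryParameter($libraries);
assert(UrlHelperSketch::uncompressQueryParameter($short) === $libraries);
```

Keeping the API string-based avoids baking a serialization format into the helper.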
Comment #7
catch
Comment #8
catch
Comment #9
catch: Missed the cspell issue; rewording to get around it.
Comment #10
nod_: On the umami frontpage with this URL, `~` as a separator: 794.
Not too happy about having the language, theme, include, exclude params as set indexes of the data array; feels like a future headache.
One thing to be aware of is that using `gzcompress` makes it impossible to use from the JS (at least without using something like zlib, which adds quite a bit of JS just for that).
One thing we could do easily is replace the `,` separator with `~`, because that one isn't urlencoded, so we save 2 chars for each separator, which can add up as seen above. I haven't seen a `~` in a library name declaration, so that should be safe enough; we could always prevent people from using that character in the library definition.
Comment #12
catch: This is true, but if we just need to pass it back to PHP, could the AJAX API just pass the encoded data and we deal with it in PHP again? We might need a proper generic API for PHP to get the relevant information in that case, though.
Yeah I kind of think #4 (but maybe with the compression API in UrlHelper like the latter patches) might end up being the way to go here.
Comment #13
catch: Backed that out; new version has the same logic as #4, but drops the LibraryCompresser class in favour of the UrlHelper methods from later patches.
Also improved the test coverage a bit.
Interdiff is against #4 since it's basically a clean-up of that approach.
Wow, that is a big difference; we might want to open a new issue just for that change since it should be an easy one?
Comment #14
catch: Hadn't fully removed LibraryCompresser.
Comment #15
catch: More test improvements, and accounting for another failure condition: we don't want logs getting flooded if people put garbage in the 'include' or 'exclude' query parameters. Since it's a client error, not a logic error, we suppress the warning; test coverage added for it.
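The failure mode being handled here can be illustrated with a small sketch (the function name is hypothetical): feeding garbage to gzuncompress() raises a PHP warning, so the decode path suppresses it and returns FALSE instead of flooding the logs:

```php
<?php

// Hypothetical helper showing the suppressed-warning pattern: malformed
// 'include'/'exclude' values are a client error, so return FALSE quietly
// rather than letting gzuncompress() emit a warning on every bad request.
function uncompress_query_parameter(string $value): string|false {
  $decoded = base64_decode($value, TRUE);
  if ($decoded === FALSE) {
    return FALSE;
  }
  return @gzuncompress($decoded);
}

// Valid round trip still works.
assert(uncompress_query_parameter(base64_encode(gzcompress('core/once'))) === 'core/once');
// Valid base64, but not zlib data: FALSE, no warning emitted.
assert(uncompress_query_parameter(base64_encode('not zlib data')) === FALSE);
// Not even valid base64 (strict mode rejects it): FALSE.
assert(uncompress_query_parameter('!!not base64!!') === FALSE);
```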
Comment #16
catch: Whitespace.
Comment #17
nod_: That comment is either unnecessary or in the wrong place.
Could be `explode(',', $exclude_string)` here, no?
That should be `$include[]` here, no? (Without the "s".)
Comment #18
catch: Should address #17.
Comment #19
nod_: Still at 528 chars, all good. It makes sense that compressing only the include and exclude values gives the shortest string, since there's a lot of repetition inside it ("core/", "core/drupal.", "umami/", etc.), so it checks out that it's the most efficient.
Tried replacing "," with "~", and the string ends up longer, at 540 chars. So no need to open a follow-up for that, since the separator never appears in the URL anyway.
This will mean that if we use that include (and exclude) parameters for ajaxPageState, the backend will need to be able to receive a list of encoded arguments instead of the exhaustive list of libraries on the page. Not relevant to this issue and not a big deal in itself, something to keep in mind for BC and all that.
Given all that it's RTBC for me.
Comment #20
catch: I'm also hopeful this will help with #956186: Allow AJAX to use GET requests.
Comment #21
Wim Leers: 🙈 Nit: 80 cols formatting
🤓 I was confused by "uncompress" — never seen that before! So did some searching and … apparently "uncompress" is rarely used, "uncompressed" is: to indicate that something is not/never was compressed.
I think one usually uses "decompress" — just like "encode & decode".
Sorry 😬
🤔 Why did these get moved to a different location in the code?
s/include/exclude/
Wow, super elegant test coverage! 🤩
🤓 Nit: this feels like it belongs in a separate test method.
Comment #22
catch: #21.2: So I nearly used ::decompress() here, but PHP uses gzuncompress() https://www.php.net/manual/en/function.gzuncompress.php and that's what we're using to compress/un-/de-compress, so I went with that. I also checked out "uncompress", and it appears to be more widely used for the specific case of compressed files, i.e. https://www.merriam-webster.com/dictionary/uncompress
So... I don't have a strong preference, but that was the thinking behind it. Do you still think it should be decompress?
#21.3: The query string is nearly identical for every aggregate, so this means we compress the same string once, outside the foreach loop. It's behind a cache, but it doesn't hurt. If there were a suitable place, we could even do it once between CSS and JS, since the list of libraries is the same, but there's no obvious spot for that at the moment.
Comment #23
Wim Leers: Hah! That's more than good enough for me 🤓
Comment #24
catch: Should hopefully address #21.
Comment #25
nod_: Patch still applies and works. Review from #21 addressed.
All good :)
Comment #26
olli: @see uncompressQueryParameter()
It seems to double-encode the unsafe characters from base64 encoding (+, / and =).
With "base64url" encoding, the parameter would be a few characters shorter. Something like this: https://www.php.net/manual/en/function.base64-encode.php#123098 ?
@see compressQueryParameter()
Do we need to add ext-zlib to composer.json for the gz* functions?
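The suggested base64url variant can be sketched as follows (the function names are illustrative; the php.net comment linked above takes the same approach):

```php
<?php

// base64url sketch per RFC 4648 section 5: '+' and '/' become '-' and
// '_', and the '=' right-padding is stripped. base64_decode() tolerates
// missing padding in non-strict mode, so decoding still works.
function base64url_encode(string $data): string {
  return rtrim(strtr(base64_encode($data), '+/', '-_'), '=');
}

function base64url_decode(string $data): string|false {
  return base64_decode(strtr($data, '-_', '+/'));
}

$compressed = base64url_encode(gzcompress('core/once,core/drupal.ajax'));

// No characters remain that urlencode() would need to escape.
assert(urlencode($compressed) === $compressed);
assert(gzuncompress(base64url_decode($compressed)) === 'core/once,core/drupal.ajax');
```

Avoiding the percent-escapes for +, / and = is where the saved characters come from.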
Comment #27
andypost: The zlib PHP extension can't be disabled, so there's no need to change composer.json.
Comment #28
catch: Marking needs work for #26; 'base64url' encoding is an easy change to make, and worth it to save some characters.
Comment #29
catch
Comment #30
catch: Fixing the typo.
Comment #31
Wim Leers: Nit: I don't think we support Internet Explorer anymore? 🤓 (Also: 🥳)
Fascinating that we can uncompress even though `=` was omitted?! 🤯
🤔 This looks like the message for `!$query->has('include')`, but it's the message for `::uncompressQuery()` returning `FALSE`?
EDIT: ah, yes, the "has" check already happened above. We just need to tweak this message :)
Comment #32
nod_: Some libraries in umami changed, so I went back to the same commit I used in #10 to have numbers we can compare. With the latest patch we're at 502; we have a winner.
For reference, using commit 6a1855c2, `~` as a separator: 794.
Comment #33
catch: #31.1: Removed the IE reference.
#31.2: This is because `=` is only used for right-padding the string and doesn't form part of the actual compressed data, so we're able to just strip it (note that it's replaced with `''`). We could also have used rtrim() there, but that'd mean more nested string manipulation. I added a comment.
#31.3: Adjusted the comment to match the language used for the 'exclude' error.
#31.4/#32: nice!
Comment #34
nod_: Seems like we're good to go now.
Comment #35
Wim Leers: #33: TIL `=` is used for right-padding data in base64 encoding! I actually always wondered what that was for, and never looked into it! 😅 Thanks for teaching me something! 🙏
#32: very nice!
RTBC++
Just realized one last thing 😬 What happens when you have cached responses (in Dynamic Page Cache, Page Cache, Varnish, a CDN, or any other reverse proxy) that still contain uncompressed aggregate URL query strings? Do those still work?
Oh, HAH! #1014086: Stampedes and cold cache performance issues with css/js aggregation is `10.1.x`-only, so no existing sites are on this yet! And when updating to a new minor, caches are cleared already anyway 👍
Comment #36
alexpott: I don't think we need to move the code blocks around. I think we can add `UrlHelper::compressQueryParameter()` where it is needed, and not move all the code and change the flow, just in case someone somewhere is relying on the `if ($libraries) {` behaviour ... and to keep the changes to the minimum necessary to implement this.
Comment #37
catch: That's a good point. Kept the new additional comment, since it's more important now that we have to base64-encode and compress the string, but moved things back where they were otherwise.
Comment #38
catch: More can be inside the `if ($libraries)`.
Comment #40
catch
Comment #41
olli: The double encoding is gone now.
Drop "urlencoded" or replace it with "URL-safe" or "Takes a compressed query parameter and converts it back to the original."?
@see \Drupal\Component\Utility\UrlHelper::compressQueryParameter()
Other than that looks good to me.
Comment #42
catch: Should address #41.
Comment #43
b_sharpe (at ImageX): #42 looks good and addresses #41; working as expected for me and tests are passing. I don't see anything else here. RTBC.
Comment #44
alexpott: Committed 6d62cf2 and pushed to 10.1.x. Thanks!
The new code has test coverage and the URLs are smaller - nice.
Comment #46
Wim Leers: YAY!
Hopefully see y'all in #3303067: Compress aggregate URL query strings next 🤓
Comment #47
voleger: #1945262: Replace custom weights with dependencies in library declarations; introduce "before" and "after" for conditional ordering
Comment #48
Wim Leers: @voleger, oops, yes, that's what I meant 😅