Fix and improve comment cache tag usage [#2530846]

Problem/Motivation

To reproduce:

1. Create an article
2. Add a comment
3. Go to the frontpage (still logged in); the comment shows up.
4. In a separate browser window, visit the frontpage as an anonymous user so that it gets cached.
5. Back as the logged-in user, add another comment.
6. Go back to the frontpage as that user. The comment count is correct, thanks to the placeholdered node links.
7. Refresh the page in the anonymous browser window. It still shows the old comment count.

Additionally, node/1 with comments adds the comment_list tag. That seems like a really bad idea, because any added or updated comment will invalidate all the render cached nodes that have comments.

The effect of this cache tag can be seen on the following New Relic throughput graph (node_pages_throughput.png, attached in #31):

Each of those bumps is a comment.

Proposed resolution

Unless I'm missing something, I think we can fix both issues by removing the comment_list cache tag from the comment formatter and instead invalidating the cache tag of the commented entity. I've actually been doing this in a custom hook in my install profile for months. When I (finally) enabled page cache by default, I noticed another related bug that is specific to my install profile, which reminded me of this.
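To make the difference concrete, here is a minimal sketch of tag-based invalidation. It is a toy model, not Drupal's cache API: the TagCache class and the cache IDs are hypothetical, and only the tag names (comment_list, node:1, node:2) follow the issue.

```php
<?php
// Toy model only: TagCache is a hypothetical class, not Drupal's cache API.
final class TagCache {
  /** @var array<string, array{data: string, tags: string[]}> */
  private array $items = [];

  public function set(string $cid, string $data, array $tags): void {
    $this->items[$cid] = ['data' => $data, 'tags' => $tags];
  }

  public function has(string $cid): bool {
    return isset($this->items[$cid]);
  }

  // Drops every item that carries at least one of the given tags.
  public function invalidateTags(array $tags): void {
    foreach ($this->items as $cid => $item) {
      if (array_intersect($item['tags'], $tags)) {
        unset($this->items[$cid]);
      }
    }
  }
}

// Before: every rendered node with comments carries the shared list tag.
$before = new TagCache();
$before->set('node:1:full', '<article>1</article>', ['node:1', 'comment_list']);
$before->set('node:2:full', '<article>2</article>', ['node:2', 'comment_list']);
// Saving any comment invalidates the list tag, so both nodes are evicted.
$before->invalidateTags(['comment_list']);

// After: no shared tag; a comment on node 1 invalidates only node:1.
$after = new TagCache();
$after->set('node:1:full', '<article>1</article>', ['node:1']);
$after->set('node:2:full', '<article>2</article>', ['node:2']);
$after->invalidateTags(['node:1']);

var_dump($before->has('node:2:full')); // bool(false)
var_dump($after->has('node:2:full'));  // bool(true)
```

In the second setup, the unrelated node stays cached, which is the behaviour the proposed resolution is after.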

Additionally, I think we can actually remove the placeholdering of node links then. Opened #2556767: Remove placeholdering of node links for that.

Remaining tasks

User interface changes

API changes

Data model changes

Comment	File	Size	Author
#31	node_pages_throughput.png	28.88 KB	Berdir
#28	comment-caching-2530846-28-interdiff.txt	1.25 KB	mbovan
#28	comment-caching-2530846-28-comment-only.patch	8.96 KB	mbovan
#28	8.0.x: PHP 5.5 & MySQL 5.5 13,510 pass
#28	comment-caching-2530846-28-test-only.patch	3.84 KB	mbovan
#28	8.0.x: PHP 5.5 & MySQL 5.5 13,509 pass, 1 fail
#20	comment-caching-2530846-20-interdiff.txt	4.81 KB	Berdir
#20	comment-caching-2530846-20-comment-only.patch	8.87 KB	Berdir
#20	8.0.x: PHP 5.5 & MySQL 5.5 13,141 pass
#20	comment-caching-2530846-20-test-only.patch	3.67 KB	Berdir
#20	8.0.x: PHP 5.5 & MySQL 5.5 13,140 pass, 1 fail
#10	comment-caching-2530846-10-interdiff.txt	4.23 KB	Berdir
#10	comment-caching-2530846-10-comment-only.patch	5.1 KB	Berdir
#10	8.0.x: PHP 5.5 & MySQL 5.5 12,865 pass, 2 fail
#10	comment-caching-2530846-10.patch	8.05 KB	Berdir
#10	8.0.x: PHP 5.5 & MySQL 5.5 12,863 pass, 5 fail
#3	comment-caching-2530846-3.patch	2.15 KB	Berdir
#3	8.0.x: PHP 5.5 & MySQL 5.5 12,777 pass, 6 fail
#2	comment-caching-2530846-2-interdiff.txt	597 bytes	Berdir
#2	comment-caching-2530846-2.patch	4.59 KB	Berdir
#2	8.0.x: PHP 5.5 & MySQL 5.5 12,774 pass, 10 fail
#1	comment-caching-2530846-1.patch	4.55 KB	Berdir
#1	8.0.x: PHP 5.5 & MySQL 5.5 CI error


Comments

Comment #1

Berdir

German

Switzerland

CreditAttribution: Berdir at MD Systems GmbH commented 10 July 2015 at 16:42

Status:

Active

» Needs review

File	Size
comment-caching-2530846-1.patch	4.55 KB
8.0.x: PHP 5.5 & MySQL 5.5 CI error

Should be that easy :)

For testing, also removed the placeholdering of node links. Can still revert that if we want to do that in a separate issue.

Comment #2

Berdir

German

Switzerland

CreditAttribution: Berdir at MD Systems GmbH commented 10 July 2015 at 16:44

File	Size
comment-caching-2530846-2.patch	4.59 KB
8.0.x: PHP 5.5 & MySQL 5.5 12,774 pass, 10 fail
comment-caching-2530846-2-interdiff.txt	597 bytes

Adding the user.permissions cache tag.

Comment #3

Berdir

German

Switzerland

CreditAttribution: Berdir at MD Systems GmbH commented 10 July 2015 at 16:45

File	Size
comment-caching-2530846-3.patch	2.15 KB
8.0.x: PHP 5.5 & MySQL 5.5 12,777 pass, 6 fail

And here is a patch with just the comment changes.

Comment #4

Berdir

German

Switzerland

CreditAttribution: Berdir at MD Systems GmbH commented 10 July 2015 at 16:51

Issue summary:

View changes

Comment #5

Berdir

German

Switzerland

CreditAttribution: Berdir at MD Systems GmbH commented 10 July 2015 at 16:52

+++ b/core/modules/comment/src/Entity/Comment.php
@@ -153,6 +154,10 @@ public function preSave(EntityStorageInterface $storage) {
+    Cache::invalidateTags($commented_entity->getCacheTags());

That needs to use getCacheTagsToInvalidate() now of course. Not going to upload a 4th patch until test results are back, though ;)

Comment #6

Fabianx CreditAttribution: Fabianx as a volunteer commented 10 July 2015 at 17:03

Actually, I'd like to use placeholders for the whole comment area, especially to avoid the problem that the node is invalidated when new comments are created.

However removing the list cache tag is fine I think.

Comment #7

10 July 2015 at 17:10

The last submitted patch, 1: comment-caching-2530846-1.patch, failed testing.

Comment #8

10 July 2015 at 17:12

The last submitted patch, 2: comment-caching-2530846-2.patch, failed testing.

Comment #9

10 July 2015 at 17:19

Status:

Needs review

» Needs work

The last submitted patch, 3: comment-caching-2530846-3.patch, failed testing.

Comment #10

Berdir

German

Switzerland

CreditAttribution: Berdir at MD Systems GmbH commented 10 July 2015 at 19:42

Status:

Needs work

» Needs review

File	Size
comment-caching-2530846-10.patch	8.05 KB
8.0.x: PHP 5.5 & MySQL 5.5 12,863 pass, 5 fail
comment-caching-2530846-10-comment-only.patch	5.1 KB
8.0.x: PHP 5.5 & MySQL 5.5 12,865 pass, 2 fail
comment-caching-2530846-10-interdiff.txt	4.23 KB

@Fabianx: I don't really think that would work with how placeholders work right now. We'd have to rebuild the comments every time or add another cache in between. That said, it's a formatter, so whatever that one is doing is very easy to replace; you can just write one that uses a placeholder.

Note that the node links section has nothing to do with the actual comments, so removing placeholders for that doesn't mean anything for the actual comments.

That said, we can't remove the list cache tag without adding a replacement, and invalidating the node tag will not work with your idea. So for that we'd need

Fixed a few tests and prevented fatal errors if there is no commented entity. That wouldn't have required changing the unit test, but I only had that idea after I had already changed it ;)

Comment #11

Wim Leers

Ghent 🇧🇪🇪🇺

CreditAttribution: Wim Leers at Acquia commented 10 July 2015 at 19:55

+++ b/core/modules/comment/src/Tests/CommentDefaultFormatterCacheTagsTest.php
--- a/core/modules/comment/src/Tests/Views/CommentUserNameTest.php
+++ b/core/modules/comment/src/Tests/Views/CommentUserNameTest.php

Unnecessary change.

+++ b/core/modules/node/src/NodeViewBuilder.php
@@ -34,15 +34,7 @@ public function buildComponents(array &$build, array $entities, array $displays,
-        $build[$id]['links'] = array(
-          '#lazy_builder' => [get_called_class() . '::renderLinks', [
-            $entity->id(),
-            $view_mode,
-            $langcode,
-            !empty($entity->in_preview),
-          ]],
-          '#create_placeholder' => TRUE,
-        );
+        $build[$id]['links'] = $this->renderLinks($entity, $view_mode, $langcode, !empty($entity->in_preview));

We can do this thanks to #2429257: Bubble cache contexts, which predates the use of placeholdering for node/comment links!

This should yield a small performance improvement too.

+++ b/core/modules/node/src/NodeViewBuilder.php
@@ -75,7 +67,7 @@ protected function getBuildDefaults(EntityInterface $entity, $view_mode, $langco
    *   The node entity ID.

Now outdated.

+++ b/core/modules/node/src/NodeViewBuilder.php
@@ -87,23 +79,25 @@ protected function getBuildDefaults(EntityInterface $entity, $view_mode, $langco
+      '#cache' => [
+        'contexts' => ['user.permissions'],
+      ]

How do we know that we want user.permissions here? Is it just a "sane default"?

Comment #12

Berdir

German

Switzerland

CreditAttribution: Berdir at MD Systems GmbH commented 10 July 2015 at 20:00

4. Yes, I was just wondering if that makes a difference with the tests. The problem is that we are using #theme links I think, so #2495779: Make #theme => links take cacheability metadata as an argument. Maybe we can just switch to #type links?

Also, I think statistics failed, not sure what to do with that :(

Comment #13

10 July 2015 at 20:07

The last submitted patch, 10: comment-caching-2530846-10.patch, failed testing.

Comment #14

10 July 2015 at 20:07

Status:

Needs review

» Needs work

The last submitted patch, 10: comment-caching-2530846-10-comment-only.patch, failed testing.

Comment #15

Berdir

German

Switzerland

CreditAttribution: Berdir at MD Systems GmbH commented 10 July 2015 at 22:19

This should yield a small performance improvement too.

If you're displaying 50 or so nodes on a page then the performance improvement that you get from this is not small :)

We can open a separate issue to do the node links switch, but what are we going to do about statistics_node_links_alter()? That's just not cacheable, and we definitely don't want to invalidate to support that.

My only idea right now is that we set a max-age to a configurable time, like an hour or so. Page cache caches it anyway, indefinitely, so what do we lose, really?

Comment #16

catch

he/him

English

CreditAttribution: catch commented 11 July 2015 at 14:51

Statistics should probably use a placeholder itself? Or mixed placeholder/js like history module.

1 hour TTL will affect cache hit rate for everything else and there's an issue to bubble max age to the page cache.
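A rough sketch of why a one-hour TTL on one element affects everything around it once max-age bubbles up: the containing page can only be cached as long as its shortest-lived part. The merge_max_ages() helper and the PERMANENT constant below are hypothetical names for illustration, with -1 assumed to mean "cache permanently".

```php
<?php
// Hypothetical helper for illustration; -1 stands for "cache permanently".
const PERMANENT = -1;

function merge_max_ages(int ...$maxAges): int {
  // Permanent children place no limit, so only finite ages matter.
  $finite = array_filter($maxAges, fn (int $age): bool => $age !== PERMANENT);
  return $finite ? min($finite) : PERMANENT;
}

// A listing of three node teasers; one carries a one-hour statistics counter.
$listing = merge_max_ages(PERMANENT, 3600, PERMANENT);
echo $listing, "\n"; // 3600
```

Under this assumption, the whole listing expires after an hour even though two of its three parts could have been cached indefinitely.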

Comment #17

Berdir

German

Switzerland

CreditAttribution: Berdir at MD Systems GmbH commented 11 July 2015 at 20:11

I thought about a placeholder too, but that won't help with page cache, which will still cache this indefinitely (yes, this is already a problem in HEAD). And I'm even less sure it's worth having an additional JS request just to display that number.

That's the problem with optimizations.. it really depends on the site what makes sense. If you have a lot of requests, I'd imagine it's a lot more effective to just update the node once per hour compared to having a placeholder/JS request that needs to be called every time.

Comment #18

Fabianx CreditAttribution: Fabianx as a volunteer commented 13 July 2015 at 15:55

#17: The trick is to do both:

Compared to #post_render_cache, #placeholders can be lazy created _and_ cached at the same time! :)

Example:

$build['lazy'] = [
  '#cache' => [
    'keys' => ['node_statistics', $this->getId()],
    'max-age' => 3600,
    'tags' => ['node_read:' . $this->getId()],
  ],
  '#lazy_builder' => ['SomeClass::someMethod', ['foo']],
];

That way the nodes will be cached indefinitely, while the statistics are refreshed e.g. every hour.
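The idea can be sketched outside Drupal as two cache entries with different lifetimes. Everything here is a toy model under stated assumptions: ExpiringCache, render_node(), the cache IDs and the [stats] marker are hypothetical, and the clock is passed in explicitly so the behaviour is easy to follow.

```php
<?php
// Toy model: ExpiringCache is hypothetical and takes the clock as an argument.
final class ExpiringCache {
  private array $items = [];

  // $maxAge is in seconds; null means "never expires".
  public function set(string $cid, string $data, int $now, ?int $maxAge): void {
    $expires = $maxAge === null ? null : $now + $maxAge;
    $this->items[$cid] = ['data' => $data, 'expires' => $expires];
  }

  public function get(string $cid, int $now): ?string {
    $item = $this->items[$cid] ?? null;
    if ($item === null || ($item['expires'] !== null && $now >= $item['expires'])) {
      return null;
    }
    return $item['data'];
  }
}

function render_node(ExpiringCache $cache, int $now, callable $countViews): string {
  // The node markup is cached permanently and contains a placeholder marker.
  $node = $cache->get('node:1', $now);
  if ($node === null) {
    $node = '<article>Body [stats]</article>';
    $cache->set('node:1', $node, $now, null);
  }
  // The statistics fragment has its own entry with a one-hour max-age.
  $stats = $cache->get('node_statistics:1', $now);
  if ($stats === null) {
    $stats = $countViews() . ' reads';
    $cache->set('node_statistics:1', $stats, $now, 3600);
  }
  return str_replace('[stats]', $stats, $node);
}

$views = 10;
$cache = new ExpiringCache();
$counter = function () use (&$views): int { return $views; };

$first = render_node($cache, 0, $counter);     // counter computed: 10 reads
$views = 25;
$stale = render_node($cache, 1800, $counter);  // fragment still cached: 10 reads
$fresh = render_node($cache, 3600, $counter);  // fragment expired: 25 reads
```

The node entry is never rebuilt in this run; only the small statistics fragment is recomputed once its hour is up.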

Once we have multipleGet (on the roadmap, but no API change needed, so just major priority), all placeholders can be lazily gotten at the page / smart cache / render strategy level.

And with render_strategies someone would be flexible to convert that even into a JS callback instead or remove it or ...

Comment #19

Wim Leers

Ghent 🇧🇪🇪🇺

CreditAttribution: Wim Leers at Acquia commented 3 August 2015 at 19:14

Issue tags:

+D8 cacheability

Comment #20

Berdir

German

Switzerland

CreditAttribution: Berdir at MD Systems GmbH commented 3 August 2015 at 20:42

Status:

Needs work

» Needs review

File	Size
comment-caching-2530846-20-test-only.patch	3.67 KB
8.0.x: PHP 5.5 & MySQL 5.5 13,140 pass, 1 fail
comment-caching-2530846-20-comment-only.patch	8.87 KB
8.0.x: PHP 5.5 & MySQL 5.5 13,141 pass
comment-caching-2530846-20-interdiff.txt	4.81 KB

Fixed the test failure and addressed #11.1. The other points there are about the node links, which are no longer part of this patch. Will open a new issue for that.

Please welcome a new member to our zoo aka test suite ;) test-only is supposed to fail, let's see if that works as expected.

Comment #21

3 August 2015 at 22:17

The last submitted patch, 20: comment-caching-2530846-20-test-only.patch, failed testing.

Comment #22

Wim Leers

Ghent 🇧🇪🇪🇺

CreditAttribution: Wim Leers at Acquia commented 4 August 2015 at 07:41

Status:

Needs review

» Needs work

  .-''''-. _    
 ('    '  '0)-/)
 '..____..:    \._
   \u  u (        '-..------._
   |     /      :   '.        '--.
  .nn_nn/ (      :   '            '\
 ( '' '' /      ;     .             \
  ''----' "\          :            : '.
         .'/                           '.
        / /                             '.
       /_|       )                     .\|
         |      /\                     . '
         '--.__|  '--._  ,            /
                      /'-,          .'
                     /   |        _.' 
                snd (____\       /    
                          \      \    
                           '-'-'-'

+++ b/core/modules/comment/src/Tests/CommentCacheTagsTest.php
@@ -29,6 +31,16 @@ class CommentCacheTagsTest extends EntityWithUriCacheTagsTestBase {
+  protected $entityTestHippopotamidae;

This is actually plural, even though we only ever herd a single hippopotamus. In which case the singular form would be hippopotamida.

+++ b/core/modules/comment/src/Tests/CommentCacheTagsTest.php
@@ -81,6 +93,45 @@ protected function createEntity() {
+  public function testCommentEntity() {
+
+    $this->verifyPageCache($this->entityTestCamelid->urlInfo(), 'MISS');

Extraneous newline.

+++ b/core/modules/comment/src/Tests/CommentCacheTagsTest.php
@@ -81,6 +93,45 @@ protected function createEntity() {
+    $this->entity->save();
+    $this->verifyPageCache($this->entityTestCamelid->urlInfo(), 'MISS');
+    $this->verifyPageCache($this->entityTestHippopotamidae->urlInfo(), 'HIT');

Hrm, aren't the camelid and hippopotamida completely separate entities? Ah, this is because before this patch, we set the comment list cache tags. Can you add a comment to the test to make that clear?

Comment #23

Berdir

German

Switzerland

CreditAttribution: Berdir at MD Systems GmbH commented 5 August 2015 at 19:37

1. You, sir, win the nitpick of the month award ;) I've copied this from https://en.wikipedia.org/wiki/Hippopotamidae (or actually https://en.wikipedia.org/wiki/Hippopotamus) which never uses the singular version? Also nothing prevents us from adding more than one hippo? ;)

3. Exactly. Suggestions on how to write that comment? Comments that refer to "that's how it worked in the past" are IMHO always a bit tricky; on the other hand, this is a regression test for exactly that not happening.

Comment #24

Wim Leers

Ghent 🇧🇪🇪🇺

CreditAttribution: Wim Leers at Acquia commented 5 August 2015 at 22:13

1. YAY!!! :P The Wikipedia page says: the family Hippopotamidae, which means it's plural. And, the typehint prevents us from having multiple hippos assigned to that one variable, so :)

3. A comment like "Ensure only the commented entity is affected" should be sufficient, and not suffer from the "X in the past" problem?

Comment #25

catch

he/him

English

CreditAttribution: catch commented 5 August 2015 at 23:26

Hippopotamidae is the family for hippopotamus in the sense that canidae is the family of canis and vulpes. So it's not really plural, it's 'the thing above a genus'. By the way it is fun discussing taxonomy on a d.o issue :P

So it's hippopotamus for singular and any of hippopotamuses/hippopotami/hippos for plural according to wikipedia, although we could probably use family/genus if we wanted to.

Or... there is the collective noun which can be any of bloat/pod/dale/herd.

Comment #26

webchick

she/they

English

Vancouver 🇨🇦

CreditAttribution: webchick at Acquia commented 6 August 2015 at 05:02

NERDS!! :P ;)

Comment #27

Berdir

German

Switzerland

CreditAttribution: Berdir at MD Systems GmbH commented 6 August 2015 at 05:50

What I wanted to get is the same relationship with hippos as the test already has by using Camelids (which is already plural, btw, just not the variable) and Llama.

By the way it is fun discussing taxonomy on a d.o issue :P

Yes ;)

Comment #28

mbovan CreditAttribution: mbovan at MD Systems GmbH commented 24 August 2015 at 09:47

Status:

Needs work

» Needs review

File	Size
comment-caching-2530846-28-test-only.patch	3.84 KB
8.0.x: PHP 5.5 & MySQL 5.5 13,509 pass, 1 fail
comment-caching-2530846-28-comment-only.patch	8.96 KB
8.0.x: PHP 5.5 & MySQL 5.5 13,510 pass
comment-caching-2530846-28-interdiff.txt	1.25 KB

Rerolled and fixed #22.2 and #22.3.

Comment #29

24 August 2015 at 10:12

The last submitted patch, 28: comment-caching-2530846-28-test-only.patch, failed testing.

Comment #30

Wim Leers

Ghent 🇧🇪🇪🇺

CreditAttribution: Wim Leers at Acquia commented 24 August 2015 at 14:09

Status:

Needs review

» Reviewed & tested by the community

Comment #31

Berdir

German

Switzerland

CreditAttribution: Berdir at MD Systems GmbH commented 24 August 2015 at 19:22

Issue summary:	View changes
Related issues:		+#2556767: Remove placeholdering of node links

File	Size
node_pages_throughput.png	28.88 KB

Added a New Relic throughput graph (node_pages_throughput.png) to the issue summary.

Also updated the issue summary a bit and opened #2556767: Remove placeholdering of node links.

Comment #32

catch

he/him

English

CreditAttribution: catch commented 24 August 2015 at 21:03

Status:

Reviewed & tested by the community

» Fixed

Thanks for the throughput graph, flat throughput graphs are nicer.

Committed/pushed to 8.0.x, thanks!

Comment #33

24 August 2015 at 21:03

catch committed dd23c88 on 8.0.x

Issue #2530846 by Berdir, mbovan: Fix and improve comment cache tag...

Comment #34

7 September 2015 at 21:04

Status:

Fixed

» Closed (fixed)

Automatically closed - issue fixed for 2 weeks with no activity.
