Metatag has media browser token in meta description [#3063056]

Problem/Motivation

On the site I work on, we ran into an issue where, if a page has an image inserted using media browser at the very start of the body, the Metatag module outputs part of the media browser token as the meta description.

An easy workaround is to manually enter the desired description instead of using the [node:summary] token, but we would prefer not to have to hunt down every page that has this problem.

Proposed resolution

The quick solution I can see is to run the node body that is stored in $options['token data'] through the tidyValue function before passing it into the token_replace function. This gives media browser a chance to process its tokens and gives tidyValue a chance to strip out any HTML and whitespace before token_replace truncates the body text.

Remaining tasks

Review needed

User interface changes

API changes

Data model changes

Release notes snippet

Original report by tristangraves

On the site I work on, we ran into an issue where, if a page has an image inserted using media browser at the very start of the body, the Metatag module was outputting part of the media browser token as the meta description. This seems to be caused by the way token_replace builds [node:summary] from the node body when there is no summary, combined with the character limit of the teaser display.

The Metatag module defaults to using [node:summary] for the meta description. When Metatag processes this to create the description, it runs it through token_replace to deal with any tokens. The token_replace function sees that there is no text in the summary of this node (or that no summary field is in use at all) and uses the node body as a fallback. As it converts the token, it automatically uses the teaser display for the body, which on our site has the default limit of 600 characters. This is where the problem comes in. Media browser tokens tend to be extremely long (at least on this site), and in my case the media browser token was longer than 600 characters, causing it to be cut off.

The processed value, still containing the broken media browser token, then returns to the Metatag module to be parsed further, eventually reaching tidyValue, which checks for media browser tokens and processes them. Since a chunk of the token has been cut off, the conversion fails, but Metatag continues on and uses the leftover token bits for the meta description.
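
The failure described above can be reproduced outside Drupal with a small simulation. This is only a sketch, not Metatag or Media code: `truncate`, `expand_media_tokens`, and the token layout are hypothetical stand-ins, assuming a media token of the form `[[{...JSON...}]]` and a plain cut at 600 characters.

```python
import json
import re

TEASER_LIMIT = 600  # teaser trim length reported for this site

# Assumed token shape: a JSON object wrapped in double square brackets.
MEDIA_TOKEN = re.compile(r"\[\[(\{.*?\})\]\]", re.DOTALL)


def truncate(text, limit=TEASER_LIMIT):
    """Hypothetical stand-in for the teaser trim: a plain character cut."""
    return text[:limit]


def expand_media_tokens(text):
    """Hypothetical stand-in for media token handling: drop complete tokens."""
    return MEDIA_TOKEN.sub("", text).strip()


# A body that starts with a token longer than the teaser limit.
token = "[[" + json.dumps(
    {"fid": "42", "type": "media", "attributes": {"alt": "x" * 700}}
) + "]]"
body = token + "The actual first sentence of the page."

summary = truncate(body)                    # the cut lands in the middle of the token
description = expand_media_tokens(summary)  # no closing "]]" left, so nothing matches

print(description[:40])
```

Because the closing `]]` is lost in the cut, the pattern never matches, and the description ends up being 600 characters of raw token instead of page text.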

The solution I came up with is to run the node body that is stored in $options['token data'] through the tidyValue function before passing it into the token_replace function. This gives media browser a chance to process its tokens and gives tidyValue a chance to strip out any HTML and whitespace before token_replace truncates the body text.
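
To illustrate why the order matters, here is a rough simulation of the proposed sequence: clean first, truncate second. It is a sketch under assumptions, not the patch itself. `tidy_value` and `summarize` are hypothetical, simplified stand-ins (the real tidyValue and token_replace do more), and the `[[{...}]]` token shape is assumed.

```python
import html
import json
import re

TEASER_LIMIT = 600  # teaser trim length reported for this site

# Assumed token shape: a JSON object wrapped in double square brackets.
MEDIA_TOKEN = re.compile(r"\[\[(\{.*?\})\]\]", re.DOTALL)
TAGS = re.compile(r"<[^>]+>")


def tidy_value(text):
    """Simplified stand-in: drop media tokens, strip HTML, collapse whitespace."""
    text = MEDIA_TOKEN.sub("", text)
    text = TAGS.sub("", text)
    return " ".join(html.unescape(text).split())


def summarize(text, limit=TEASER_LIMIT):
    """Simplified stand-in for the teaser trim: a plain character cut."""
    return text[:limit]


def build_description(body):
    """Clean the whole body before the cut, so no token is split in half."""
    return summarize(tidy_value(body))


token = "[[" + json.dumps(
    {"fid": "42", "type": "media", "attributes": {"alt": "x" * 700}}
) + "]]"
body = token + "<p>The actual   first sentence.</p>"

print(build_description(body))
```

With the token removed while it is still intact, the 600-character budget is spent on readable text rather than on token fragments.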

Comment	File	Size	Author	Test result
#9	metatag-n3063056-9.interdiff.txt	7.75 KB	DamienMcKenna
#9	metatag-n3063056-9.patch	8.18 KB	DamienMcKenna	7.x-1.x: PHP 7.2 & MySQL 5.5, D7 278 pass
#8	metatag-has-media-token-in-meta-description-3063056-8.patch	5.57 KB	tristangraves	7.x-1.x: PHP 7.2 & MySQL 5.5, D7 278 pass
#7	metatag-has-media-token-in-meta-description-3063056-7.patch	5.45 KB	tristangraves	7.x-1.x: PHP 7.2 & MySQL 5.5, D7 99 pass, 298 fail
#2	metatag-has-media-token-in-meta-description-3063056-2.patch	5.31 KB	tristangraves	7.x-1.x: PHP 7.1 & MySQL 5.5, D7 278 pass

Comments

Comment #1

20 June 2019 at 15:30

tristangraves created an issue. See original summary.

Comment #2

tristangraves commented 20 June 2019 at 15:31

File	Size
metatag-has-media-token-in-meta-description-3063056-2.patch	5.31 KB
7.x-1.x: PHP 7.1 & MySQL 5.5, D7 278 pass

Comment #3

tristangraves commented 20 June 2019 at 15:45

Status:

Active

» Needs review

Comment #4

tristangraves commented 20 June 2019 at 21:48

Updated the summary to use the Issue Summary Template. Thanks @nmillin for showing me this existed.

Comment #5

DamienMcKenna at Mediacurrent commented 21 June 2019 at 21:08

Assigned:

tristangraves

» Unassigned

Comment #6

DamienMcKenna at Mediacurrent commented 10 September 2019 at 17:46

Status:

Needs review

» Needs work

This needs to be language-safe, so hardcoding it to use "und" isn't a good idea here.

Comment #7

tristangraves commented 22 November 2019 at 19:38

Status:

Needs work

» Needs review

File	Size
metatag-has-media-token-in-meta-description-3063056-7.patch	5.45 KB
7.x-1.x: PHP 7.2 & MySQL 5.5, D7 99 pass, 298 fail

I updated the code to get a language code from field_language to use instead of hardcoding "und". I haven't had any experience working with multilingual sites or code, so I'm not sure if this is the correct way to do this. Please let me know if I have more work to do on this or if I completely missed the mark.
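
For readers unfamiliar with the "und" problem, the general idea is to look up the language key under which the body is actually stored rather than assuming one. The sketch below only simulates that with plain dictionaries. `body_langcode` and `body_value` are hypothetical helpers, the fallback order is an assumption, and none of this reproduces what field_language really does.

```python
LANGUAGE_NONE = "und"  # Drupal 7's code for "language not specified"


def body_langcode(node, preferred=None):
    """Pick the language key the body is stored under, or None if absent."""
    if not node or not node.get("body"):
        return None  # no node, or no body field data, to inspect
    body = node["body"]
    if preferred in body:
        return preferred
    if LANGUAGE_NONE in body:
        return LANGUAGE_NONE
    return next(iter(body))  # assumed fallback: first stored language


def body_value(node, preferred=None):
    """Return the first body value for the chosen language, or ''."""
    langcode = body_langcode(node, preferred)
    if langcode is None:
        return ""
    items = node["body"][langcode]
    return items[0].get("value", "") if items else ""


french_node = {"body": {"fr": [{"value": "Bonjour"}]}}
neutral_node = {"body": {"und": [{"value": "Hello"}]}}

print(body_value(french_node, "fr"))
print(body_value(neutral_node, "fr"))
```

A lookup hardcoded to "und" would find nothing on `french_node`. The sketch also returns an empty string when there is no node at all, rather than failing.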

Comment #8

tristangraves commented 25 November 2019 at 19:00

File	Size
metatag-has-media-token-in-meta-description-3063056-8.patch	5.57 KB
7.x-1.x: PHP 7.2 & MySQL 5.5, D7 278 pass

It looks like tests were failing for the last patch because I wasn't making sure there was a node to work with when checking the language. This patch should fix things, I think.

Comment #9

DamienMcKenna at Mediacurrent commented 27 November 2019 at 20:01

Parent issue:

» #2958474: Plan for Metatag 7.x-1.26

File	Size
metatag-n3063056-9.patch	8.18 KB
7.x-1.x: PHP 7.2 & MySQL 5.5, D7 278 pass
metatag-n3063056-9.interdiff.txt	7.75 KB

Thanks for putting that together, and for the test coverage, it's appreciated!

I split out the JSON encoded string into arrays that are then passed through json_encode(), just to make maintenance easier.
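
As a rough illustration of why building the token from a data structure is easier to maintain than a hand-written JSON string, here is the same idea in Python, with `json.dumps` playing the role of json_encode(). The `media_token` helper and the field names (`fid`, `view_mode`, `type`, `attributes`) are assumptions for the example and are not taken from the patch.

```python
import json


def media_token(fid, view_mode="default", attributes=None):
    """Build a media-style token from plain data instead of a literal string."""
    data = {
        "fid": str(fid),
        "view_mode": view_mode,
        "type": "media",
        "attributes": attributes or {},
    }
    # The encoder handles quoting and escaping, so edits stay readable.
    return "[[" + json.dumps(data) + "]]"


token = media_token(7, attributes={"alt": 'A "quoted" caption', "width": "640"})
print(token)

# Round trip: strip the brackets and decode to check the token is valid JSON.
decoded = json.loads(token[2:-2])
print(decoded["attributes"]["alt"])
```

Changing an attribute means editing one dictionary entry, and embedded quotes cannot break the fixture the way they can in a hand-escaped string.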

Comment #10

17 December 2019 at 18:56

DamienMcKenna committed 2c52df6 on 7.x-1.x authored by tristangraves

Issue #3063056 by tristangraves, DamienMcKenna: Metatag has media...

Comment #11

DamienMcKenna at Mediacurrent commented 17 December 2019 at 18:56

Status:

Needs review

» Fixed

Committed. Thanks!

Comment #12

31 December 2019 at 18:59

Status:

Fixed

» Closed (fixed)

Automatically closed - issue fixed for 2 weeks with no activity.

Comment #13

ron_s commented 5 January 2020 at 01:00

@tristangraves, fyi, the patch in this thread is causing a critical problem with the newest 7.x-1.26 release:
https://www.drupal.org/project/metatag/issues/3102817