Document SafeMarkup::set in AllowedTagsXssTrait::fieldFilterXss [#2501441]

Problem/Motivation

AllowedTagsXssTrait::fieldFilterXss() calls SafeMarkup::set() which is meant to be for internal use only.

Proposed resolution

Similar code comment as #2501403: Document SafeMarkup::set in Xss::filter to be added.

~~Remove the call by refactoring the code.~~
If refactoring is not possible, thoroughly document where the string is coming from and why it is safe, and why SafeMarkup::set() is required.

Remaining tasks

~~Evaluate whether the string can be refactored to one of the formats outlined in this change record: https://www.drupal.org/node/2311123~~
Identify whether there is existing automated test coverage for the sanitization of the string. If there is, list the test in the issue summary. If there isn't, add an automated test for it.
If the string cannot be refactored, the SafeMarkup::set() usage needs to be thoroughly audited and documented.

Manual testing steps (for XSS and double escaping)

Not necessary, we are only adding documentation.

User interface changes

N/A

API changes

N/A

Comment	File	Size	Author
#12	document-2501441-12.patch	1020 bytes	cilefen
#12
#8	safemarkup-set-for-filterxss-2501441-8.patch	682 bytes	mlncn
#8
#6	safemarkup-set-for-filterxss-2501441-5.patch	523 bytes	joelpittet
#6
#6	interdiff.txt	743 bytes	joelpittet
#5	safemarkup-set-for-filterxss-2501441-3.patch	522 bytes	mlncn
#5
#3	safemarkup-set-for-filterxss-2501441-3.patch	522 bytes	mlncn
#3

Support from Acquia helps fund testing for Drupal Acquia logo

Comments

Comment #1

star-szr

he/him

English

CreditAttribution: star-szr as a volunteer commented 5 June 2015 at 23:22

Issue summary:

View changes

Comment #2

mlncn CreditAttribution: mlncn at Agaric commented 6 June 2015 at 17:57

Assigned:

Unassigned

» mlncn

Comment #3

mlncn CreditAttribution: mlncn at Agaric commented 6 June 2015 at 19:10

Status:

Active

» Needs review

File	Size
safemarkup-set-for-filterxss-2501441-3.patch	522 bytes

Followed up on #2501403 to note that when we run something marked safe from Xss::filter() it should still be marked safe when all we do is put it through HTML::normalize() without combining with any unsafe input.

As a documentation-only change, not adding test. We could repeat Drupal\Component\Utility\Xss tests in core/tests/Drupal/Tests/Core/Field/ but that would be needlessly redundant.

Comment #4

6 June 2015 at 19:12

Status:

Needs review

» Needs work

The last submitted patch, 3: safemarkup-set-for-filterxss-2501441-3.patch, failed testing.

Comment #5

mlncn CreditAttribution: mlncn at Agaric commented 6 June 2015 at 19:35

Status:

Needs work

» Needs review

File	Size
safemarkup-set-for-filterxss-2501441-3.patch	522 bytes

With a patch that's not backwards this time.

Comment #6

joelpittet

English

Vancouver

CreditAttribution: joelpittet as a volunteer commented 7 June 2015 at 02:30

Assigned:	mlncn	» Unassigned
Status:	Needs review	» Reviewed & tested by the community

File	Size
interdiff.txt	743 bytes
safemarkup-set-for-filterxss-2501441-5.patch	523 bytes

Just added a \ to the namespace because that is more common in core. But otherwise it's RTBC. Thanks @mlncn

Comment #7

xjm

she/her

English

CreditAttribution: xjm at Acquia commented 9 June 2015 at 20:33

Status:

Reviewed & tested by the community

» Needs work

Thanks @mlncn and @joelpittet.

@mlncn's comment helps explain why this is safe:

when we run something marked safe from Xss::filter() it should still be marked safe when all we do is put it through HTML::normalize() without combining with any unsafe input.

However, the comment in the patch itself isn't quite as clear. :) Can we spell it out a little more as to why normalizing filtered HTML should also be added to the safe list? In general, I'd also like us to document not only why the SafeMarkup use is indeed safe, but why it is necessary and appropriate.

If the HTML output isn't already in the normal form, this single line of code is adding not one but two potentially lengthy entries to the SafeMarkup list: one in Xss::filter() itself, and then a variant of it that's normalized. See #2488538: Add SafeMarkup::remove() to free memory from marked strings when they're printed and #2295823: Ensure that we don't store excessive lists of safe strings for why this is potentially a concern. In this case since it's internal to filtering/sanitization APIs, we might decide that it's fine, or at least any working around it would be needless disruption or overhead. But I'd like to at least raise the question. :)

Also, a minor note: the method name should be followed by () so it gets linked properly on api.d.o and such.

Comment #8

mlncn CreditAttribution: mlncn at Agaric commented 11 June 2015 at 17:20

Status:

Needs work

» Needs review

File	Size
safemarkup-set-for-filterxss-2501441-8.patch	682 bytes

Better explanation and parenthesis for the methods! Thanks @xjm

This is a place where i suppose we could safemarkup::remove() the now-obsolete string (filterXss without the HTML normalize). Perhaps we could do a follow-up for that if #2488538 lands and we want to extend that approach.

Comment #9

joelpittet

English

Vancouver

CreditAttribution: joelpittet as a volunteer commented 12 June 2015 at 01:29

Status:

Needs review

» Reviewed & tested by the community

This seems clearer now and still succinct.

Comment #10

xjm

she/her

English

CreditAttribution: xjm at Acquia commented 12 June 2015 at 04:06

Status:	Reviewed & tested by the community	» Needs review
Issue tags:		+Needs followup

Thanks @mlncn, that's much clearer. This is most likely committable as it is right now, but I have an additional suggestion. Iterating on @mlncn's patch:

All known XSS vectors are filtered out by \Drupal\Component\Utility\Xss::filter(), all tags in the markup are allowed intentionally by the trait, and no danger is added in by \Drupal\Component\Utility\HTML::normalize(). Since the normalized value is essentially the same markup, designate this string as safe as well. This method is an internal part of field sanitization, so the resultant, sanitized string should be printable as is.

The intent there is to additionally make it clear that other code should not use this as a pattern, because it's a truly internal call.

And then I think maybe follow it with an @todo referencing #2488538: Add SafeMarkup::remove() to free memory from marked strings when they're printed and/or #2450993: Rendered Cache Metadata created during the main controller request gets lost suchlike is in order -- actually, I think we should file a new followup that's postponed on one or both of those.

What say ye?

Comment #11

cilefen CreditAttribution: cilefen commented 14 June 2015 at 20:26

I am looking at this at the NJ sprint.

Comment #12

cilefen CreditAttribution: cilefen commented 14 June 2015 at 20:49

File	Size
document-2501441-12.patch	1020 bytes

Comment #13

mlncn CreditAttribution: mlncn at Agaric commented 14 June 2015 at 23:26

Status:

Needs review

» Reviewed & tested by the community

I would say that nails it! Very good call @xjm on the new follow-up issue and thank you @cilefen for making it; looking forward to the follow-ups and removing the @todo :-)

Comment #14

xjm

she/her

English

CreditAttribution: xjm at Acquia commented 14 June 2015 at 23:40

Status:

Reviewed & tested by the community

» Fixed

Looks great, thanks!

This issue is a required part of a critical task and is allowed per https://www.drupal.org/core/beta-changes. Committed and pushed to 8.0.x.

Comment #15

14 June 2015 at 23:40

xjm committed 48d0043 on 8.0.x

Issue #2501441 by mlncn, joelpittet, cilefen: Document SafeMarkup::set...

Comment #16

28 June 2015 at 23:44

Status:

Fixed

» Closed (fixed)

Automatically closed - issue fixed for 2 weeks with no activity.

Document SafeMarkup::set in AllowedTagsXssTrait::fieldFilterXss

Problem/Motivation

Proposed resolution

Remaining tasks

Manual testing steps (for XSS and double escaping)

User interface changes

API changes

Comments

Comment #1

Comment #2

Comment #3

Comment #4

Comment #5

Comment #6

Comment #7

Comment #8

Comment #9

Comment #10

Comment #11

Comment #12

Comment #13

Comment #14

Comment #15

Comment #16

Parent issue

Related issues

Thank you to these Drupal contributors

News items

Our community

Documentation

Drupal code base

Governance of community