Ensure that SafeString objects can be used in non-HTML contexts [#2509218]

Comment	File	Size	Author
#166	2509218-165.patch	5.61 KB	jhedstrom
#166	interdiff.txt	678 bytes	jhedstrom
#164	interdiff.txt	2.46 KB	wim leers
#164	2509218-163.patch	5.81 KB	wim leers
#162	Screen Shot 2015-09-25 at 14.26.52.png	38.43 KB	wim leers
#161	2509218-161.patch	5.8 KB	jhedstrom
#161	interdiff.txt	797 bytes	jhedstrom
#154	safe_markup-contexts-2509218-153.patch	5.81 KB	plach
#153	safe_markup-contexts-2509218-153.interdiff.txt	2.28 KB	plach
#148	safe_markup-contexts-2509218-148.patch	5.88 KB	plach
#148	safe_markup-contexts-2509218-148.interdiff.txt	847 bytes	plach
#145	2509218-145.patch	5.88 KB	stefan.r
#145	interdiff-144-145.txt	1.55 KB	stefan.r
#144	safe_markup-contexts-2509218-144.patch	5.99 KB	plach
#144	safe_markup-contexts-2509218-144.interdiff.txt	6.7 KB	plach
#132	increment.txt	633 bytes	pwolanin
#132	2509218-132.patch	9.99 KB	pwolanin
#126	2509218-126.patch	9.98 KB	stefan.r
#126	interdiff-122-126.txt	1.74 KB	stefan.r
#122	2509218-122.patch	10.03 KB	imiksu
#122	interdiff.txt	1.3 KB	imiksu
#120	2509218-120.patch	10.03 KB	stefan.r
#120	interdiff-119-120.txt	711 bytes	stefan.r
#119	2509218-119.patch	10.02 KB	stefan.r
#119	interdiff-111-119.txt	865 bytes	stefan.r
#111	increment.txt	3.39 KB	pwolanin
#111	2509218-111.patch	10.04 KB	pwolanin
#108	increment.txt	1.38 KB	pwolanin
#108	2509218-107.patch	9.56 KB	pwolanin
#103	2509218-99.patch	9.68 KB	stefan.r
#103	interdiff-94-99.txt	12.04 KB	stefan.r
#101	2509218-98.patch	9.64 KB	stefan.r
#101	interdiff-94-98.patch	11.96 KB	stefan.r
#94	interdiff.txt	5.17 KB	dawehner
#94	2509218-94.patch	9.04 KB	dawehner
#88	safe_markup-contexts-2509218-88.patch	8.86 KB	plach
#88	safe_markup-contexts-2509218-88.interdiff.txt	3.75 KB	plach
#86	safe_markup-contexts-2509218-86.interdiff.txt	9.4 KB	plach
#86	safe_markup-contexts-2509218-86.patch	7.76 KB	plach
#83	interdiff.txt	2.59 KB	dawehner
#83	2509218-83.patch	5.23 KB	dawehner
#82	2509218-82.patch	2.63 KB	pwolanin
#80	2509218-80.patch	1.67 KB	pwolanin
#61	safe_markup_contexts-2509218-61.patch	23.6 KB	almaudoh
#61	interdiff.txt	14.2 KB	almaudoh
#59	safe_markup_contexts-2509218-56.patch	14.19 KB	almaudoh
#56	safe_markup-contexts-2509218-56.review.txt	14.19 KB	plach
#56	safe_markup-contexts-2509218-56.patch	66.5 KB	plach
#54	safe_markup-contexts-2509218-54.patch	57.01 KB	plach
#54	safe_markup-contexts-2509218-54.review.txt	15.87 KB	plach
#54	safe_markup-contexts-2509218-54.interdiff.txt	821 bytes	plach
#51	safe_markup-contexts-2509218-49.review.txt	15.6 KB	plach
#49	safe_markup-contexts-2509218-49.patch	56.75 KB	plach
#49	safe_markup-contexts-2509218-49.interdiff.txt	14.76 KB	plach
#40	safe_markup-contexts-2509218-41.patch	49.76 KB	plach
#40	safe_markup-contexts-2509218-41.interdiff.txt	8.91 KB	plach
#18	interdiff.txt	1.3 KB	subhojit777
#18	make_behave_like_in-2509218-18.patch	18.95 KB	subhojit777
#13	interdiff.txt	1.94 KB	effulgentsia
#13	2509218.13.patch	14.38 KB	effulgentsia
#12	2509218.12.patch	12.32 KB	effulgentsia
#8	2509218.8.patch	12.32 KB	alexpott
#4	interdiff.txt	672 bytes	effulgentsia
#4	2509218-4.patch	21.24 KB	effulgentsia
#2	2509218-2.patch	20.47 KB	effulgentsia
	SafeMarkup-remove-passthrough.patch	2.75 KB	effulgentsia

Comment #1

20 June 2015 at 00:10

Status:

Needs review

» Needs work

The last submitted patch, SafeMarkup-remove-passthrough.patch, failed testing.

Log in or register to post comments

Status	File	Size
new	2509218-2.patch	20.47 KB

Comment #3

20 June 2015 at 03:09

Status:

Needs review

» Needs work

The last submitted patch, 2: 2509218-2.patch, failed testing.

Log in or register to post comments

Status	File	Size
new	2509218-4.patch	21.24 KB
new	interdiff.txt	672 bytes

Comment #5

xjm

she/her

English

commented 20 June 2015 at 17:51

Priority:

Normal

» Major

+1 for "deprecating" the option this way.

It's interesting that the patch is green when my Views patch has a failure though -- didn't figure out how that happened yet. :)

+++ b/core/lib/Drupal/Component/Utility/SafeMarkup.php
@@ -209,38 +209,28 @@ public static function checkPlain($text) {
-   *   - !variable: Inserted as is, with no sanitization or formatting. Only
-   *     use this when the resulting string is being generated for one of:
-   *     - Non-HTML usage, such as a plain-text email.
-   *     - Non-direct HTML output, such as a plain-text variable that will be
-   *       printed as an HTML attribute value and therefore formatted with
-   *       self::checkPlain() as part of that.
-   *     - Some other special reason for suppressing sanitization.

I think we need to keep documentation of this indicating it is a legacy placeholder type that now behaves the same as @ but is deprecated in 8.0.x and that support for it will be removed before 9.0.0.

Log in or register to post comments

Comment #6

effulgentsia commented 31 August 2015 at 18:37

Title:	Make ! behave like @ in SafeMarkup::format()	» Allow t() to work for non-HTML text
Status:	Needs review	» Needs work

#2558791: "!"-prefixed tokens should Xss::filterAdmin() but not affect safeness takes a different (and IMO, better) approach to the ! semantics. But I still think the 'html' => FALSE option for t() is worth adding, so retitling.

Log in or register to post comments

Comment #7

joelpittet

he/him

English

Vancouver

commented 31 August 2015 at 19:52

+++ b/core/lib/Drupal/Core/StringTranslation/TranslationManager.php
@@ -176,6 +171,26 @@ protected function doTranslate($string, array $options = array()) {
+    $html = isset($options['html']) ? $options['html'] : TRUE;

Could this be a bit more generic so it's inline with other strategies in twig? Doesn't have to be exact 'is_safe' => ['html']

But the idea is that you can make it safe for different contexts. ['js', 'html'] or ['all'], what do you think?
https://github.com/twigphp/Twig/blob/5fdbd991bfcf5ea2492c9ab074a2d2cde18...
https://github.com/twigphp/Twig/blob/5fdbd991bfcf5ea2492c9ab074a2d2cde18...

Also I have tests over here #2531824: Attribute class to check safe strings before escaping (has tests) for a bug that looks like it's partially addressed by this patch for feed_icon.

Log in or register to post comments

Comment #8

alexpott

he/they

English

🇪🇺🌍

commented 7 September 2015 at 14:23

Status:

Needs work

» Needs review

Status	File	Size
new	2509218.8.patch	12.32 KB

I think given #2557113: Make t() return a TranslationWrapper object to remove reliance on a static, unpredictable safe list we have another option here. If t() returns a TranslationWrapper then we can add a method on it to do this. See patch.

Log in or register to post comments

Comment #9

7 September 2015 at 14:48

Status:

Needs review

» Needs work

The last submitted patch, 8: 2509218.8.patch, failed testing.

Log in or register to post comments

Comment #10

stefan.r commented 8 September 2015 at 16:59

I like the idea in #8, it would solve part of the concern in #2558791: "!"-prefixed tokens should Xss::filterAdmin() but not affect safeness

+++ b/core/lib/Drupal/Core/StringTranslation/TranslationInterface.php
@@ -136,4 +136,31 @@ public function formatPluralTranslated($count, $translation, array $args = array
+   * Never call translate($user_text) where $user_text is text that a user

s/translate/translateForNonHtml/

+++ b/core/lib/Drupal/Core/StringTranslation/TranslationInterface.php
@@ -136,4 +136,31 @@ public function formatPluralTranslated($count, $translation, array $args = array
+   * entered; doing so can lead security problems. The output of this method is
+   * intended for non-HTML usages. If the caller wants to ensure no HTML tags

Should we be more forceful about how dangerous this is if it ends up in HTML (or javascript)?

+++ b/core/lib/Drupal/Core/StringTranslation/TranslationInterface.php
@@ -136,4 +136,31 @@ public function formatPluralTranslated($count, $translation, array $args = array
+   *   on the first character of the key, the value is escaped and/or themed.

Maybe just "wrapped in " as opposed to themed?

+++ b/core/lib/Drupal/Core/StringTranslation/TranslationManager.php
@@ -265,4 +265,15 @@ public function getNumberOfPlurals($langcode = NULL) {
+  public function translateForNonHtml($string, array $args = array(), array $options = array()) {
+    $string = $this->doTranslate($string, $options);

Just an idea, but if we're worried about developers not thinking when they use this, maybe a 'strip_tags' key in the options array that defaults to TRUE...

+++ b/core/lib/Drupal/Core/StringTranslation/TranslationWrapper.php
@@ -128,4 +145,13 @@ public function jsonSerialize() {
+   * Returns the translation unescaped for usages in non-HTML.

Should we list a few example use cases here, i.e. error logs and plain text email messages?

+++ b/core/modules/contact/contact.module
@@ -7,8 +7,10 @@
+

newline

Log in or register to post comments

Comment #11

stefan.r commented 8 September 2015 at 17:16

+++ b/core/lib/Drupal/Core/StringTranslation/TranslationManager.php
@@ -265,4 +265,15 @@ public function getNumberOfPlurals($langcode = NULL) {
+      $string = strtr($string, $args);

If we have a string that's intended for non-HTML output only this muddles the meaning of the placeholder prefixes a bit as both @, ! and any other prefix would output the unescaped/unfiltered string... but I guess there's no way around that when we have a TranslationWrapper that has a single translatable string that may be output to both HTML and non-HTML.

Log in or register to post comments

Comment #12

effulgentsia commented 10 September 2015 at 17:49

Status:

Needs work

» Needs review

Status	File	Size
new	2509218.12.patch	12.32 KB

Just a reroll.

Log in or register to post comments

Comment #13

effulgentsia commented 10 September 2015 at 18:00

Title:

Allow t() to work for non-HTML text

» Make ! behave like @ in SafeMarkup::format(), and add a new API for t() to work for non-HTML text

Status	File	Size
new	2509218.13.patch	14.38 KB
new	interdiff.txt	1.94 KB

Per #2506427-29: [meta] !placeholder causes strings to be escaped and makes the sanitization API harder to understand, restoring this issue to its original scope.

Log in or register to post comments

Comment #14

10 September 2015 at 18:19

The last submitted patch, 12: 2509218.12.patch, failed testing.

Log in or register to post comments

Comment #15

10 September 2015 at 18:30

Status:

Needs review

» Needs work

The last submitted patch, 13: 2509218.13.patch, failed testing.

Log in or register to post comments

Comment #16

10 September 2015 at 22:33

The last submitted patch, 12: 2509218.12.patch, failed testing.

Log in or register to post comments

Comment #17

10 September 2015 at 22:41

The last submitted patch, 13: 2509218.13.patch, failed testing.

Log in or register to post comments

Comment #18

subhojit777

he/him

Bengali

commented 12 September 2015 at 12:45

Status:

Needs work

» Needs review

Status	File	Size
new	make_behave_like_in-2509218-18.patch	18.95 KB
new	interdiff.txt	1.3 KB

Reducing the number of failing tests

Log in or register to post comments

Comment #19

subhojit777

he/him

Bengali

commented 12 September 2015 at 12:47

+++ b/core/modules/system/src/Tests/Common/XssUnitTest.php
@@ -41,7 +41,7 @@ function testT() {
-    $this->assertEqual($text, 'Verbatim text: <script>', 't replaces verbatim string as-is.');
+    $this->assertEqual($text, 'Verbatim text: &lt;script&gt;', 't replaces and escapes string.');
   }
 
   /**
diff --git a/core/modules/system/src/Tests/Theme/TwigTransTest.php b/core/modules/system/src/Tests/Theme/TwigTransTest.php

diff --git a/core/modules/system/src/Tests/Theme/TwigTransTest.php b/core/modules/system/src/Tests/Theme/TwigTransTest.php
index 21aef18..d30e753 100644

index 21aef18..d30e753 100644
--- a/core/modules/system/src/Tests/Theme/TwigTransTest.php

--- a/core/modules/system/src/Tests/Theme/TwigTransTest.php
+++ b/core/modules/system/src/Tests/Theme/TwigTransTest.php

+++ b/core/modules/system/src/Tests/Theme/TwigTransTest.php
+++ b/core/modules/system/src/Tests/Theme/TwigTransTest.php
@@ -139,7 +139,7 @@ protected function assertTwigTransTags() {

@@ -139,7 +139,7 @@ protected function assertTwigTransTags() {
     );
 
     $this->assertRaw(
-      'PAS-THRU: &"<>',
+      'PAS-THRU: &amp;&quot;&lt;&gt;',

I have just updated the tests since now ! will behave similar to @.

Log in or register to post comments

Comment #20

12 September 2015 at 13:12

Status:

Needs review

» Needs work

The last submitted patch, 18: make_behave_like_in-2509218-18.patch, failed testing.

Log in or register to post comments

Comment #21

12 September 2015 at 13:14

The last submitted patch, 18: make_behave_like_in-2509218-18.patch, failed testing.

Log in or register to post comments

Comment #22

joelpittet

he/him

English

Vancouver

commented 13 September 2015 at 21:46

We need to move #2506445-170: Replace !placeholder with @placeholder in t() and format_string() for non-URLs in tests here. This is the diff:

+++ b/core/modules/contact/src/Tests/ContactPersonalTest.php
@@ -79,12 +79,12 @@ function testSendPersonalContactMessage() {
-      '!site-name' => $this->config('system.site')->get('name'),
-      '!subject' => $message['subject[0][value]'],
-      '!recipient-name' => $this->contactUser->getUsername(),
+      '@site-name' => $this->config('system.site')->get('name'),
+      '@subject' => $message['subject[0][value]'],
+      '@recipient-name' => $this->contactUser->getUsername(),
...
-    $this->assertEqual($mail['subject'], t('[!site-name] !subject', $variables), 'Subject is in sent message.');
-    $this->assertTrue(strpos($mail['body'], 'Hello ' . $variables['!recipient-name']) !== FALSE, 'Recipient name is in sent message.');
+    $this->assertEqual($mail['subject'], t('[@site-name] @subject', $variables), 'Subject is in sent message.');
+    $this->assertTrue(strpos($mail['body'], 'Hello ' . $variables['@recipient-name']) !== FALSE, 'Recipient name is in sent message.');

These are for email tokens.

Log in or register to post comments

Comment #23

xjm

she/her

English

commented 14 September 2015 at 00:06

Status:

Needs work

» Postponed

I think we should probably postpone this on #2557113: Make t() return a TranslationWrapper object to remove reliance on a static, unpredictable safe list at this point, since that will change this patch.

Log in or register to post comments

Comment #24

xjm

she/her

English

commented 14 September 2015 at 00:07

Log in or register to post comments

Comment #25

xjm

she/her

English

commented 14 September 2015 at 00:18

Title:	Make ! behave like @ in SafeMarkup::format(), and add a new API for t() to work for non-HTML text	» Add new API for t() to work for non-HTML text
Issue tags:		+Needs followup

Also, I think we should actually separate the two scopes of this issue. "Add new API for t() to work for non-HTML text" can mean simply using the new behavior from #2557113: Make t() return a TranslationWrapper object to remove reliance on a static, unpredictable safe list to allow @placeholder to be used for emails since they would then be escaped at the same time as other strings, which will improve the DX overall by simplifying things.

However, deprecating !placeholder and making it behave like @placeholder is much broader in scope and does not have consensus. We may actually remove it instead, and anyway fixing the test failures for that change will duplicate efforts on #2506445: Replace !placeholder with @placeholder in t() and format_string() for non-URLs in tests and friends.

Can we create a separate issue for that if we still think it's worth doing (as opposed to just removing them all)?

Log in or register to post comments

Comment #26

xjm

she/her

English

commented 14 September 2015 at 00:21

Title:

Add new API for t() to work for non-HTML text

» Allow t() to work for non-HTML text

Log in or register to post comments

Comment #27

xjm

she/her

English

commented 14 September 2015 at 00:24

Also, there are some incorrect hunks in the current patch.

Log in or register to post comments

Comment #28

effulgentsia commented 15 September 2015 at 17:55

In #18, the working assumption was that once #2557113: Make t() return a TranslationWrapper object to remove reliance on a static, unpredictable safe list lands, the API for getting a translated string for non-HTML use (such as email subject or JSON output) would be something like:

t(...)->getTranslationForNonHtml()

with that method name still open for discussion (other options might be ->toPlainText(), etc.).

On a hangout earlier today with a bunch of people, @catch brought up another idea:

Html::toText(t(...))

where that toText() method could call MailFormatHelper::htmlToText() (or we move that implementation into the Html class).

A benefit of that is that it could be the same API for when we want to convert the HTML output of something other than t() to plain-text, such as:

Html::toText($token_service->replace(...))

A drawback might be that if you look at that current implementation of MailFormatHelper::htmlToText(), maybe that's a lot of specific choices on what to do that don't really make sense for strings coming out of t()? Then again, maybe they do?

Log in or register to post comments

Comment #29

effulgentsia commented 15 September 2015 at 18:05

Priority:

Major

» Critical

Also, raising this to Critical, because it's impossible to remove the last usages of !placeholder from core, such as:

$message['subject'] .= t('[!site-name] !subject', $variables, $options);

without it.

Log in or register to post comments

Comment #30

xjm

she/her

English

commented 15 September 2015 at 18:10

One thing #28 does not mention is the risk of adding "!placeholder by another name".

Also, I'm confused that all the suggestions in #28 involve adding methods. I thought that the idea was that converting to late rendering of the translations would mean that the placeholders could be escaped then, in the render process, so they wouldn't be escaped otherwise, same as the expectation for Twig.

Log in or register to post comments

Comment #31

dawehner

German

commented 15 September 2015 at 18:28

Also, I'm confused that all the suggestions in #28 involve adding methods. I thought that the idea was that converting to late rendering of the translations would mean that the placeholders could be escaped then, in the render process, so they wouldn't be escaped otherwise, same as the expectation for Twig.

Well yeah at some point though you need to convert the object to a string. Maybe we could also go with TranslationWrapper->render($strategy)

Log in or register to post comments

Comment #32

catch

he/him

English

commented 16 September 2015 at 09:30

Title:	Allow t() to work for non-HTML text	» Ensure that the results of t() can be used as plain text for non-HTML contexts
Issue summary:	View changes
Status:	Postponed	» Active

Re-titled and updated issue summary to try to summarize the two approaches.

Also if we go for option #2, this wouldn't need to be postponed, so moving back to active for the discussion at least.

Log in or register to post comments

Comment #33

plach

he/him

Italian

Venezia

commented 16 September 2015 at 09:53

Not sure whether this is BS, since I just started to get my feet wet with the SafeMarkup stuff, but it seems to me that here we are needing an alternative sanitization logic. If we were able to instantiate a different sanitization service depending on the mime type of the output, we could use placeholders semantically and let each service figure out the most appropriate sanitization strategy. We could default to text/html as sanitization context and allow to specify alternative ones via $options. For example:

$args = ['@user_name' => $account->getName()];
// HTML email, @user_name is escaped.
t('Welcome @user_name!', $args);
// Plain text email, @user_name is not escaped.
t('Welcome @user_name!', $args, ['output' => 'text/plain']);

Log in or register to post comments

Comment #34

dawehner

German

commented 16 September 2015 at 10:43

Well, the problem is that like for tokens, the place which generates the t() cannot know yet, how its gonna be used, so making it lazy would help with that.
@plach
I think we should do something, that is as parallel as twig, which means you would pass a sanitization strategy?

Log in or register to post comments

Comment #35

catch

he/him

English

commented 16 September 2015 at 10:59

Well, the problem is that like for tokens, the place which generates the t() cannot know yet, how its gonna be used, so making it lazy would help with that.

So that's true for t() strings that end up in e-mail bodies - but we (should) already use MailHelper::htmlToText() for that.

I'm not sure it's true for e-mail subjects though - otherwise those places wouldn't be explicitly using the !placeholder to avoid sanitization.

i.e. $message['subject'] .= t('[!site-name] !subject', $variables, $options); from contact.module

Log in or register to post comments

Comment #36

dawehner

German

commented 16 September 2015 at 11:42

Well, IMHO for email subjects we should be able to strip all tags.

Log in or register to post comments

Comment #37

plach

he/him

Italian

Venezia

commented 16 September 2015 at 14:10

@dawehner:

Well, the problem is that like for tokens, the place which generates the t() cannot know yet, how its gonna be used, so making it lazy would help with that.

On one hand we cannot have reliable output sanitization without knowing the output format, OTOH I realize that code generating token might miss the required contextual information, at least currently.

I think we'd have two ways to cope with this:

We require a string requiring sanitization to be provided the output format as contextual information, as I was suggesting in the example above. For tokens this would mean passing the output format in $options, as we are doing for language right now. How we'd implement that practically I'm not sure about: maybe a user could choose between [title] and [title:plain] or stuff like that.
Indeed a better solution DX-wise could be to lazily sanitize/stringify dynamic strings. For instance we could extend the approach introduced in #2557113: Make t() return a TranslationWrapper object to remove reliance on a static, unpredictable safe list and introduce a DynamicStringWrapper that could be passed around until contextual information about output format is finally available. This would allow us to have something like the following:
```
$dynamic_string = t('Welcome, @user_name!', ['@user_name' => $account->getName()]);

// Render the string as HTML, inherit Twig autoescaping.
$build = ['#markup' => $dynamic_string];

// Send as email content.
function mymodule_send_welcome_email(DynamicStringWrapper $dynamic_string) {
 $dynamic_string->setOutputFormat('text/plain');
 // Send plaintext email, no escaping.
}

// Include it as HTML attribute, e.g. <input type="submit" value="$dynamic_string">.
function _drupal_render_attribute(DynamicStringWrapper $dynamic_string) {
 $dynamic_string->setOutputFormat('text/html; x-drupal-context: attribute');
 // Escape attribute delimiters and type-specific content, e.g. URL protocol.
}
```
A token value could simply be wrapped into a child TokenStringWrapper having simply @value as hard-coded string pattern.
In this scenario @value and :url placeholders would determine the value's semantics instead of the sanitization logic: the former would indicate any plain string, while the second would indicate a URL string. This way, depending on the output format, we would know whether escaping the value or sanitizing the URL protocol is needed, for instance.

Well, IMHO for email subjects we should be able to strip all tags.

I think this is not enough: if the string was "HTML-escaped" previously, stripping tags would have no effect and lots of bogus < and > could creep into the mail subject.

Log in or register to post comments

Comment #38

catch

he/him

English

commented 16 September 2015 at 14:19

Just discussed this with plach, I'd thought that #2569485: Add AttributeSafeStringInterface and UriAttributeSafeStringInterface was a competing approach to this, but actually I think we should use both.

You get a $translated_string object.

The object has a ->renderAsHtmlAttribute() method.

This strips tags (to avoid  tags either being invalidly not escaped, or uglily escaped in an HTML attribute.

It also encodes entities to avoid XSS.

And returns an AttributeSafeStringInterface.

You can then pass an AttributeSafeStringInterface into AttributeString, so that it doesn't end up getting run through Html::escape() when it's already in the right format.

If we only did renderAsPlainText() then that might be OK for an e-mail subject but it's not necessarily OK for an HTML attribute.

If we only do AttributeSafeStringInterface then you can still end up with either escaped or unescaped HTML tags in attributes.

If we do both all cases are covered and it's clear what should be used for what.

Log in or register to post comments

Comment #39

plach

he/him

Italian

Venezia

commented 17 September 2015 at 11:58

Assigned:	Unassigned	» plach
Issue tags:		+D8 Accelerate

I'll experiment a bit with this, although it would be better to get #2557113: Make t() return a TranslationWrapper object to remove reliance on a static, unpredictable safe list in first.

Log in or register to post comments

Comment #40

plach

he/him

Italian

Venezia

commented 17 September 2015 at 13:51

Status:

Active

» Needs review

Status	File	Size
new	safe_markup-contexts-2509218-41.interdiff.txt	8.91 KB
new	safe_markup-contexts-2509218-41.patch	49.76 KB

Here is a first stab, just to get an idea of how things could look like. This includes also #2557113-185: Make t() return a TranslationWrapper object to remove reliance on a static, unpredictable safe list, the interdiff is the new code.

Log in or register to post comments

Comment #41

17 September 2015 at 14:20

Status:

Needs review

» Needs work

The last submitted patch, 40: safe_markup-contexts-2509218-41.patch, failed testing.

Log in or register to post comments

Comment #42

17 September 2015 at 15:20

The last submitted patch, 40: safe_markup-contexts-2509218-41.patch, failed testing.

Log in or register to post comments

Comment #43

catch

he/him

English

commented 17 September 2015 at 15:46

So right now the first argument to t() is literal HTML and does not get escaped or filtered again (since it's marked as a SafeString). We also allow translated strings to have HTML in them.

We should probably make that more explicit than it currently is in the documentation, https://api.drupal.org/api/drupal/core%21includes%21bootstrap.inc/functi... doesn't really make it clear that the first argument is HTML at all, except saying don't put variables in there.

Given in this issue we're talking about three different formats of returned string - HTML + plain text + HTML attribute, we need to figure out what that looks like.

Let's say we start with the following string:

t('The &lt;em&gt; tag makes your text look like <em>"this"</em>.')

1. HTML:

Nothing happens, you get this back:

The &lt;em&gt; tag makes your text look like <em>"this"</em>.

Which will look like

The tag makes your text look like "this".

in the browser (i.e. a select option or the title tag of an image or whatever).

2. Plain text:

plach suggested html_entity_decode(strip_tags($string)); The other option would be factoring out https://api.drupal.org/api/drupal/core!lib!Drupal!Core!Mail!MailFormatHe...

That would return:

The tag makes your text look like "this".

Fine for e-mail subject lines and similar.

3. HTML attribute.

For this, I think we'd want to take the plain text string, then Html::escape() it, so:

Html::escape(html_entity_decode(strip_tags($string)));

That gets us:

'The tag makes your text look like "this".

Which will look like:

The tag makes your text look like "this".

In the browser.

Having typed that out makes me wonder the following (assuming the above is what we want, it might not be):

If we can get a plain text output from t(), then we can pass that plain text to Attributes, and AttributeString would Html::escape() it - and maybe that's enough of an API for passing the results of t() to attributes (which Views currently does).

Log in or register to post comments

Comment #44

catch

he/him

English

commented 17 September 2015 at 16:07

Then that same example with arguments:


t('The @tag makes your text look like @result', ['@tag' =>'<em>', '@result' => SafeString::create('<em>"this"</em>']);

1. HTML:

First argument is escaped, second argument is marked as a SafeString so is not escaped. Return is correct HTML.

2. Plain text:

First argument is not escaped- this is fine.

Second argument is ... we could strip_tags() safe string arguments but erggh.

I think it'd be easier to get the HTML string first, then apply the same html_entity_decode(strip_tags($string)) to that. Gets us the same result regardless of whether the HTML is in the string or replacements then.

3. Attributes, as before we just Html::escape() the plain text value.

Log in or register to post comments

Comment #45

stefan.r commented 17 September 2015 at 17:18

Html::escape(html_entity_decode(strip_tags($string)));

Hmm, not sure this is secure for all attributes, we may still need some special cases for other attributes such as on* then, and think about what to do about attributes that are not wrapped in double quotes.

2. Plain text:

The Html::escape(html_entity_decode(strip_tags($string))); might be fine for email subjects but the htmlToText helper is much nicer for longer text, I also quite like some of the things in https://github.com/soundasleep/html2text that we could be doing in htmlToText too.

Log in or register to post comments

Comment #46

catch

he/him

English

commented 17 September 2015 at 19:23

Hmm, not sure this is secure for all attributes, we may still need some special cases for other attributes such as on* then, and think about what to do about attributes that are not wrapped in double quotes.

So at the moment we just don't support passing user-entered content to on* (and never should). But if we wanted to provide a way to encode them properly I think we need a new issue for that - would need to be applied to Attributes and Xss::attributes() too.

The Html::escape(html_entity_decode(strip_tags($string))); might be fine for email subjects but the htmlToText helper is much nicer for longer text

Yes I think that's an open question.

Log in or register to post comments

Comment #47

effulgentsia commented 17 September 2015 at 20:56

I haven't thought through #43 and its related comments yet, so this is not a commentary on that, but just want to respond to this part from the issue summary:

"!placeholder by another name", which somewhat turns the pro into a con.

I don't see this as a con, because IMO the "another name" part is enough to address the "makes the sanitization API harder to understand" part of the parent issue. Here's what I mean:

In Drupal 7, ! can be used to signify many different things, among which are:

The output of t() will be used in non-HTML context, such as an email subject, so don't escape HTML entities, because email clients do not render email subjects as HTML.
The output of t() will eventually be used in HTML context, but something else will escape it, such as drupal_attributes() or form_select_options().
The value I'm passing is the output of drupal_render(), t(), or some other rendering function that does its own escaping, so don't re-escape that.
The value I'm passing is not one of the above, but is something I know to be safe for my own reasons (e.g., I'm passing a literal string of HTML, or an implode() of safe parts with a literal glue), and don't (re-)escape that.
The value I'm passing doesn't have any characters that require HTML encoding, such as a machine name, or a URL without a query string or fragment, so don't waste CPU time on a no-op check_plain().

Note that with each of the above cases, the calling code in question might be incorrect in its assumptions (each item below maps to the same number above), each mistake resulting in an XSS vulnerability:

You thought you were returning a string to a caller for use in an email subject, but instead the caller used it within a drupal_set_message().
You thought the values within #options were the responsibility of Form API to escape (which it is in Drupal 7), but then Form API changed to not do it (which happened in Drupal 8).
You thought you were passing the result of a safe rendering function (such as drupal_render()), but suppose the HTML you have is really the result of a Views plugin function that wasn't sufficiently well documented as to whether it needs to return text or HTML, and some Views plugin out there got it wrong (yes, this really happened during D8's development).
Plain human error: See https://www.drupal.org/node/2537866 for a recent, but by far not the only, example.
You thought you were passing a URL without any query string or fragment, but some other module has a hook_url_outbound_alter() implementation that adds those.

The current implementation of ! in Drupal 8 HEAD solves nicely for the first two cases, whether you're right or wrong. If you're right, you get exactly what you asked for, and the resulting string isn't marked safe and doesn't need to be for those contexts. If you're wrong, you end up getting the escaping of the entire string, which is exactly what you should get when a string you thought you were returning for non-HTML use gets used in HTML.

The current implementation of ! in HEAD also solves nicely for the 3rd case if you're right, but causes escaping of the entire t() output (not just the incorrect placeholder) if you're wrong. And the current implementation breaks down even more for the last two cases, regardless of if your assumptions are right or wrong.

The problems with cases 3-5 are the reasons to remove ! entirely. But not arguments against solving just cases 1 and 2 with an API that's clear about that reduced scope.

Log in or register to post comments

Comment #48

catch

he/him

English

commented 17 September 2015 at 21:13

The output of t() will eventually be used in HTML context, but something else will escape it, such as drupal_attributes() or form_select_options().

So for me #2 is the tricky one, which #43 attempts to unpick, and I think plach's patch is compatible with what's in #43.

The problem being is that the output of t(), with the same strings and context, could be used as either an attribute value or an HTML fragment, certainly the Views info stuff is like that - and those need two different strategies.

I'd also add an example #6, which is the reverse problem of #1:

6. You thought you were creating a string for drupal_set_message() (or any HTML output), but someone put it into the subject of an e-mail instead.

#43 allows us to handle that case too.

Log in or register to post comments

Comment #49

plach

he/him

Italian

Venezia

commented 17 September 2015 at 22:11

Status:

Needs work

» Needs review

Status	File	Size
new	safe_markup-contexts-2509218-49.interdiff.txt	14.76 KB
new	safe_markup-contexts-2509218-49.patch	56.75 KB

12 files were hidden/shown/deleted

Status	File	Size
hidden	SafeMarkup-remove-passthrough.patch	2.75 KB
hidden	2509218-2.patch	20.47 KB
hidden	2509218-4.patch	21.24 KB
hidden	interdiff.txt	672 bytes
hidden	2509218.8.patch	12.32 KB
hidden	2509218.12.patch	12.32 KB
hidden	2509218.13.patch	14.38 KB
hidden	interdiff.txt	1.94 KB
hidden	make_behave_like_in-2509218-18.patch	18.95 KB
hidden	interdiff.txt	1.3 KB
hidden	safe_markup-contexts-2509218-41.interdiff.txt	8.91 KB
hidden	safe_markup-contexts-2509218-41.patch	49.76 KB

This implements #43 - #44 and provides unit tests for that. I will start looking at test failures as soon as #2557113: Make t() return a TranslationWrapper object to remove reliance on a static, unpredictable safe list is committed, since I don't want to waste time chasing every iteration. Most of the failures are unit tests that just need to be adapted to the new API.

Log in or register to post comments

Comment #50

plach

he/him

Italian

Venezia

commented 17 September 2015 at 22:32

Assigned:

plach

» Unassigned

Done for tonight

Log in or register to post comments

Comment #51

plach

he/him

Italian

Venezia

commented 17 September 2015 at 22:39

Issue tags:

+Needs issue summary update

Status	File	Size
new	safe_markup-contexts-2509218-49.review.txt	15.6 KB

Here's a version of #49 including only the parts added here, except for some small bits in TranslationInterface and TranslationManager.

Log in or register to post comments

Comment #52

17 September 2015 at 22:38

Status:

Needs review

» Needs work

The last submitted patch, 49: safe_markup-contexts-2509218-49.patch, failed testing.

Log in or register to post comments

Comment #53

17 September 2015 at 22:40

The last submitted patch, 49: safe_markup-contexts-2509218-49.patch, failed testing.

Log in or register to post comments

Comment #54

plach

he/him

Italian

Venezia

commented 17 September 2015 at 22:49

Status	File	Size
new	safe_markup-contexts-2509218-54.interdiff.txt	821 bytes
new	safe_markup-contexts-2509218-54.review.txt	15.87 KB
new	safe_markup-contexts-2509218-54.patch	57.01 KB

1 file was hidden/shown/deleted

Status	File	Size
hidden	safe_markup-contexts-2509218-49.patch	56.75 KB

Added missing PHP docs

Log in or register to post comments

Comment #55

subhojit777

he/him

Bengali

commented 18 September 2015 at 06:33

Status:

Needs work

» Needs review

Log in or register to post comments

Comment #56

plach

he/him

Italian

Venezia

commented 18 September 2015 at 06:41

Status	File	Size
new	safe_markup-contexts-2509218-56.patch	66.5 KB
new	safe_markup-contexts-2509218-56.review.txt	14.19 KB

1 file was hidden/shown/deleted

Status	File	Size
hidden	safe_markup-contexts-2509218-54.patch	57.01 KB

Rebased on top of #2557113-225: Make t() return a TranslationWrapper object to remove reliance on a static, unpredictable safe list.

Log in or register to post comments

Comment #57

18 September 2015 at 07:00

The last submitted patch, 54: safe_markup-contexts-2509218-54.patch, failed testing.

Log in or register to post comments

Comment #58

18 September 2015 at 07:02

The last submitted patch, 54: safe_markup-contexts-2509218-54.patch, failed testing.

Log in or register to post comments

Comment #59

almaudoh commented 19 September 2015 at 10:12

Status	File	Size
new	safe_markup_contexts-2509218-56.patch	14.19 KB

I really like this approach. Re-uploaded the review.txt patch in #56 since #2557113: Make t() return a TranslationWrapper object to remove reliance on a static, unpredictable safe list is now in.

+++ b/core/lib/Drupal/Core/StringTranslation/TranslationWrapper.php
@@ -119,34 +130,37 @@ public function getOptions() {
+   * TODO
    *
-   * @return string
-   *   The translated string.
+   * This should be an injected factory, likely a plugin manager.
+   *
+   * @return \Drupal\Core\OutputFormatter\OutputFormatterInterface
    */

Working on a service / plugin manager for replacing placeholders based on content type.

Log in or register to post comments

Comment #60

almaudoh commented 19 September 2015 at 12:39

Assigned:

Unassigned

» almaudoh

Log in or register to post comments

Comment #61

almaudoh commented 19 September 2015 at 13:01

Assigned:	almaudoh	» Unassigned
Issue tags:		+Needs tests

Status	File	Size
new	interdiff.txt	14.2 KB
new	safe_markup_contexts-2509218-61.patch	23.6 KB

In this patch...
1. Added a plugin manager with @OutputFormatter annotation to manage the different kinds of output formatters.
2. Moved the three output formatters to the Plugin/OutputFormatter directory

Needs tests for the new classes.

Log in or register to post comments

Comment #62

19 September 2015 at 13:30

Status:

Needs review

» Needs work

The last submitted patch, 61: safe_markup_contexts-2509218-61.patch, failed testing.

Log in or register to post comments

Comment #63

19 September 2015 at 14:06

The last submitted patch, 61: safe_markup_contexts-2509218-61.patch, failed testing.

Log in or register to post comments

Comment #64

pwolanin commented 19 September 2015 at 16:37

This patch looks like it's off track in terms of scope and what it's doing. Let's make it as small as possible - ideally just doc and no API changes

Log in or register to post comments

Comment #65

plach

he/him

Italian

Venezia

commented 19 September 2015 at 16:37

Thanks, some copy/paste issues:

+++ b/core/lib/Drupal/Core/OutputFormatter/OutputFormatterManager.php
@@ -0,0 +1,119 @@
+ * Contains \Drupal\Core\Block\BlockManager.

+++ b/core/lib/Drupal/Core/OutputFormatter/OutputFormatterManager.php
@@ -0,0 +1,119 @@
+ * Manages discovery and instantiation of block plugins.
+ *
+ * @todo Add documentation to this class.
+ *
+ * @see \Drupal\Core\Block\BlockPluginInterface

+++ b/core/lib/Drupal/Core/OutputFormatter/OutputFormatterManager.php
@@ -0,0 +1,119 @@
+   * Constructs a new \Drupal\Core\Block\BlockManager object.

Log in or register to post comments

Comment #66

plach

he/him

Italian

Venezia

commented 19 September 2015 at 16:42

This patch looks like it's off track in terms of scope and what it's doing. Let's make it as small as possible - ideally just doc and no API changes

I'm not sure what API changes you are referring to, can you expand on that? Also, I don't really think we are going to address this issue just with documentation.

Log in or register to post comments

Comment #67

pwolanin commented 19 September 2015 at 17:06

Please update the issue summary before posting any more patches.

I can't even tell if this is going in the right direction. I think adding more complexity to t() and the rest of the API here might be the wrong thing.

From the issue summary I was hoping this would be mostly a documentation issue - instead it's looking light a significant API change.

Log in or register to post comments

Comment #68

pwolanin commented 19 September 2015 at 17:08

@plach - option #2 in the current issue summary suggests it can mostly be a docs issue. It's not clear to me when/if that option was discarded.

Log in or register to post comments

Comment #69

plach

he/him

Italian

Venezia

commented 19 September 2015 at 22:26

Sorry, we are currently going a different way, see #33 and #37, I will update the issue summary ASAP.

Log in or register to post comments

Comment #70

pwolanin commented 20 September 2015 at 09:26

Status:

Needs work

» Postponed (maintainer needs more info)

Discussing with alexpott at MOB. We need a conversation in person about broadening this into to a generic (but as simple as possible) system to allow us to render any SafeString with a context-relevant formatter.

Log in or register to post comments

Comment #71

catch

he/him

English

commented 20 September 2015 at 22:17

Status:

Postponed (maintainer needs more info)

» Needs work

This is still critical, and it blocks #2571673: Convert Views t() usage where it is used as an attribute value, which is also critical, even if you don't personally like the solution.

The problem is very much there, and does not 'need more info'.

Log in or register to post comments

Comment #72

stefan.r commented 21 September 2015 at 09:26

Log in or register to post comments

Comment #73

catch

he/him

English

commented 21 September 2015 at 09:48

Title:	Ensure that the results of t() can be used as plain text for non-HTML contexts	» Ensure that SafeString objects can be used in non-HTML contexts
Issue summary:	View changes

Discussed more with pwolanin and alexpott at the sprint.

I don't think we should use plugins for this, @alexpott suggested just a method that takes a formatter and returns the string in the format, that seems plenty for extensibility to me. It'll be an interface with one method.

There's still a bit of discussion as to whether when we format for an attribute, do we return a plain text string that we then escape again, or return an escaped string and communicate to AttributeString and elsewhere that it doesn't get escaped again (via AttributeFormattedStringInterface or whatever which we don't have yet). However hopefully this clarifies exactly what the need is with the updated issue summary.

Log in or register to post comments

Comment #74

catch

he/him

English

commented 21 September 2015 at 09:54

Issue summary:

View changes

Log in or register to post comments

Comment #75

catch

he/him

English

commented 21 September 2015 at 10:02

Issue summary:

View changes

Log in or register to post comments

Comment #76

catch

he/him

English

commented 21 September 2015 at 10:06

Issue summary:

View changes

Log in or register to post comments

Comment #77

wim leers

Ghent 🇧🇪🇪🇺

commented 21 September 2015 at 10:26

Issue summary:

View changes

I read through the entire issue, and had the same remarks/questions/doubts about using plugins as #73. But catch posted it 10 minutes before I did :P

@pwolanin and @catch confirmed that we are now going with option 3 in the issue summary. That's why he tried to strike through the rest in #76, but failed miserably, because <del> is not able to strike through block-level elements :P

Log in or register to post comments

Comment #78

pwolanin commented 21 September 2015 at 10:59

really more like #2

Log in or register to post comments

Comment #79

pwolanin commented 21 September 2015 at 11:46

Issue summary:

View changes

really more like #2

Log in or register to post comments

Comment #80

pwolanin commented 21 September 2015 at 11:54

Status:

Needs work

» Needs review

Status	File	Size
new	2509218-80.patch	1.67 KB

After further discussion just having an interface and classes with a static method is the simplest possible way to solve this problem.

Here's a starting patch showing the general idea.

Log in or register to post comments

Comment #81

pwolanin commented 21 September 2015 at 12:11

Issue summary:

View changes

Log in or register to post comments

Comment #82

pwolanin commented 21 September 2015 at 12:59

Status	File	Size
new	2509218-82.patch	2.63 KB

Log in or register to post comments

Comment #83

dawehner

German

commented 21 September 2015 at 15:32

Status	File	Size
new	2509218-83.patch	5.23 KB
new	interdiff.txt	2.59 KB

Added some tests

Log in or register to post comments

Comment #84

plach

he/him

Italian

Venezia

commented 21 September 2015 at 16:26

@pwolanin @dawehner:

Can we restore the test cases using placeholders that were introduced in #49?

Log in or register to post comments

Comment #85

plach

he/him

Italian

Venezia

commented 21 September 2015 at 16:32

Assigned:

Unassigned

» plach

Working on this

+++ b/core/lib/Drupal/Component/Utility/AttributeValueOutput.php
@@ -0,0 +1,26 @@
+   * @param $string|SafeStringInterface

+++ b/core/lib/Drupal/Component/Utility/OutputStrategyInterface.php
@@ -0,0 +1,23 @@
+   * @param $string|SafeStringInterface

+++ b/core/lib/Drupal/Component/Utility/PlainTextOutput.php
@@ -0,0 +1,26 @@
+   * @param $string|SafeStringInterface

Missing FQCN

+++ b/core/lib/Drupal/Component/Utility/AttributeValueOutput.php
@@ -0,0 +1,26 @@
\ No newline at end of file

+++ b/core/lib/Drupal/Component/Utility/OutputStrategyInterface.php
@@ -0,0 +1,23 @@
\ No newline at end of file

+++ b/core/lib/Drupal/Component/Utility/PlainTextOutput.php
@@ -0,0 +1,26 @@
\ No newline at end of file

Missing newline

Log in or register to post comments

Comment #86

plach

he/him

Italian

Venezia

commented 21 September 2015 at 17:19

Status	File	Size
new	safe_markup-contexts-2509218-86.patch	7.76 KB
new	safe_markup-contexts-2509218-86.interdiff.txt	9.4 KB

More tests and docs

Log in or register to post comments

Comment #87

21 September 2015 at 17:49

Status:

Needs review

» Needs work

The last submitted patch, 86: safe_markup-contexts-2509218-86.patch, failed testing.

Log in or register to post comments

Comment #88

plach

he/him

Italian

Venezia

commented 21 September 2015 at 21:33

Status	File	Size
new	safe_markup-contexts-2509218-88.interdiff.txt	3.75 KB
new	safe_markup-contexts-2509218-88.patch	8.86 KB

10 files were hidden/shown/deleted

Status	File	Size
hidden	safe_markup-contexts-2509218-56.patch	66.5 KB
hidden	safe_markup_contexts-2509218-56.patch	14.19 KB
hidden	interdiff.txt	14.2 KB
hidden	safe_markup_contexts-2509218-61.patch	23.6 KB
hidden	2509218-80.patch	1.67 KB
hidden	2509218-82.patch	2.63 KB
hidden	2509218-83.patch	5.23 KB
hidden	interdiff.txt	2.59 KB
hidden	safe_markup-contexts-2509218-86.patch	7.76 KB
hidden	safe_markup-contexts-2509218-86.interdiff.txt	9.4 KB

Fixed test failures.

Log in or register to post comments

Comment #89

plach

he/him

Italian

Venezia

commented 21 September 2015 at 21:43

Status:	Needs work	» Needs review
Issue tags:	-Needs tests

+++ b/core/lib/Drupal/Component/Utility/OutputStrategyInterface.php
@@ -0,0 +1,25 @@
+/**
+ * Common interface for output strategies.
+ */
+interface OutputStrategyInterface {

Probably this interface should provide more details about what output strategies are, when to use them and how.

Log in or register to post comments

Comment #90

catch

he/him

English

commented 21 September 2015 at 21:48

+++ b/core/lib/Drupal/Component/Utility/HtmlAttributeValueOutput.php
@@ -0,0 +1,29 @@
+ * Implements an output strategy to be used to format strings to be used as

Would be good to avoid 'to be used' twice in the sentecne.

"Implements an output strategy used to format strings for HTML attribute values."?

+++ b/core/lib/Drupal/Component/Utility/HtmlAttributeValueOutput.php
@@ -0,0 +1,29 @@
+     return Html::escape(PlainTextOutput::renderFromHtml($string));

We can skip the Html::escape() - Attributes/AttributeString will do that. I think that's what we decided this afternoon.

```
+++ b/core/tests/Drupal/Tests/Component/Utility/HtmlAttributeValueOutputTest.php
@@ -0,0 +1,71 @@
+    $output = HtmlAttributeValueOutput::renderFromHtml($markup);
```
Even if we don't have the separate output strategy for attributes vs. plain text, we could have an integration test with AttributeString to ensure it looks right after escaping?

Log in or register to post comments

Comment #91

effulgentsia commented 21 September 2015 at 22:23

+1 to the general approach here.

Re #90.2: if we keep HtmlAttributeValueOutput in this patch at all, let's add a @todo for it to do the escaping AND upcast it to a AttributeSafeStringInterface once #2569485: Add AttributeSafeStringInterface and UriAttributeSafeStringInterface is in.

Should we also add a FormattedPlainTextOutput strategy (or better name if someone comes up with one) that invokes MailFormatHelper::htmlToText()? And if we do, then should we rename PlainTextOutput to SimplePlainTextOutput or similar name, or is it ok for that one to not have any qualifying prefix?

Log in or register to post comments

Comment #92

stefan.r commented 22 September 2015 at 00:12

+1 to #91 - I had discussed this with @plach earlier today and the conclusion was a "fancier" version of the plain text output for larger text could be a nice non-critical followup here. SimplePlainTextOutput and FormattedPlainTextOutput sound great to me

Log in or register to post comments

Comment #93

almaudoh commented 22 September 2015 at 06:48

Some docs nits mostly...

+++ b/core/lib/Drupal/Component/Utility/HtmlAttributeValueOutput.php
@@ -0,0 +1,29 @@
+ * Contains \Drupal\Component\Utility\PlainTextOutput.
+ */
...
+class HtmlAttributeValueOutput implements OutputStrategyInterface {

Contains \Drupal\Component\Utility\HtmlAttributeValueOutput

+++ b/core/lib/Drupal/Component/Utility/HtmlAttributeValueOutput.php
@@ -0,0 +1,29 @@
+   * @param $string|\Drupal\Component\Utility\SafeStringInterface
+   *   An HTML string or a any object that can be cast to string.

"or a any" :)

+++ b/core/lib/Drupal/Component/Utility/OutputStrategyInterface.php
@@ -0,0 +1,25 @@
+ * @file
+ * Contains \Drupal\Component\Utility\OutputStrategyInterface.
...
+interface OutputStrategyInterface {

OutputStrategyInterface doesn't really explain for me what this does. Maybe OutputEscapeStrategyInterface or OutputFormatStrategyInterface. Or perhaps just OutputFormatInterface.

So what's the plan for implementation of these on SafeStrings? The IS is not very clear on this. Are we adding a new :: renderAsFormat(OutputStrategyInterface $strategy) method to SafeStringInterface...?

Log in or register to post comments

Comment #94

dawehner

German

commented 22 September 2015 at 07:36

Status	File	Size
new	2509218-94.patch	9.04 KB
new	interdiff.txt	5.17 KB

Smal changes here and there.

Log in or register to post comments

Comment #95

22 September 2015 at 07:54

The last submitted patch, 86: safe_markup-contexts-2509218-86.patch, failed testing.

Log in or register to post comments

Comment #96

plach

he/him

Italian

Venezia

commented 22 September 2015 at 08:47

Assigned:

plach

» Unassigned

Not working on this atm...

Log in or register to post comments

Comment #97

lauriii

he/him

Finnish

Finland

commented 22 September 2015 at 09:10

+1 for the SimplePlainTextOutput to make it possible to create more plaintext output strategies

Log in or register to post comments

Comment #98

22 September 2015 at 09:14

The last submitted patch, 88: safe_markup-contexts-2509218-88.patch, failed testing.

Log in or register to post comments

Comment #99

pwolanin commented 22 September 2015 at 09:24

I think all of these just return a simple string, so I don't think we should return a HtmlAttributeValueOutput or anything else for any of these.

Log in or register to post comments

Comment #100

pwolanin commented 22 September 2015 at 09:28

+++ b/core/lib/Drupal/Component/Utility/HtmlAttributeValueOutput.php
@@ -0,0 +1,32 @@
+   * @param $string|\Drupal\Component\Utility\SafeStringInterface

I put this here originally, but actually we can accept any object that implements __toString
Not sure how to note that in the @param

+++ b/core/lib/Drupal/Component/Utility/HtmlAttributeValueOutput.php
@@ -0,0 +1,32 @@
+    // @todo Conver the result to AttributeSafeStringInterface, see
+    //   https://www.drupal.org/node/2569485

Typo here, but I also think the comment isn't right - we may use it with AttributeSafeStringInterface but every class implementing this interface should return a string.

+++ b/core/tests/Drupal/Tests/Component/Utility/PlainTextOutputTest.php
@@ -0,0 +1,70 @@
+    $safe_string = $this->prophesize(SafeStringInterface::class);

Why not just create a SafeString object? I don't see the value of a mock here.

Log in or register to post comments

Comment #101

stefan.r commented 22 September 2015 at 10:34

Issue summary:	View changes
Issue tags:	-Needs issue summary update

Status	File	Size
new	interdiff-94-98.patch	11.96 KB
new	2509218-98.patch	9.64 KB

Making some further changes

Log in or register to post comments

Comment #102

22 September 2015 at 10:37

The last submitted patch, 101: interdiff-94-98.patch, failed testing.

Log in or register to post comments

Comment #103

stefan.r commented 22 September 2015 at 10:38

Status	File	Size
new	interdiff-94-99.txt	12.04 KB
new	2509218-99.patch	9.68 KB

Log in or register to post comments

Comment #104

22 September 2015 at 11:03

The last submitted patch, 101: 2509218-98.patch, failed testing.

Log in or register to post comments

Comment #105

22 September 2015 at 11:08

Status:

Needs review

» Needs work

The last submitted patch, 103: 2509218-99.patch, failed testing.

Log in or register to post comments

Comment #106

pwolanin commented 22 September 2015 at 11:37

I'm going to try to fix the test fails now.

Log in or register to post comments

Comment #107

almaudoh commented 22 September 2015 at 11:38

+++ b/core/lib/Drupal/Component/Utility/OutputStrategyInterface.php
--- /dev/null
+++ b/core/lib/Drupal/Component/Utility/PlainTextSimpleOutput.php

+++ b/core/lib/Drupal/Component/Utility/PlainTextSimpleOutput.php
@@ -0,0 +1,37 @@
+ * Contains \Drupal\Component\Utility\PlainTextSimpleOutput.
...
+class PlainTextSimpleOutput implements OutputStrategyInterface {

+++ b/core/tests/Drupal/Tests/Component/Utility/PlainTextSimpleOutputTest.php
@@ -0,0 +1,64 @@
+class PlainTextSimpleOutputTest extends UnitTestCase {
...
+    $output = SimplePlainTextOutput::renderFromHtml($markup);

should be SimplePlainTextOutput

Log in or register to post comments

Comment #108

pwolanin commented 22 September 2015 at 11:48

Status:

Needs work

» Needs review

Status	File	Size
new	2509218-107.patch	9.56 KB
new	increment.txt	1.38 KB

mis-named classes used in the code.

Log in or register to post comments

Comment #109

22 September 2015 at 12:18

Status:

Needs review

» Needs work

The last submitted patch, 108: 2509218-107.patch, failed testing.

Log in or register to post comments

Comment #110

stefan.r commented 22 September 2015 at 12:33

The PlainTextSimple / PlainTextFormatted was deliberate as SimplePlainText seemed more confusing but I don't care much about one or the other. If we're going to go back to SimplePlainText let's rather mention FormattedPlainText in the @todo as well.

I think we'll have to revert the SafeString change as SafeString is in Core\Render - which we're not supposed to refer to in Component?

Log in or register to post comments

Comment #111

pwolanin commented 22 September 2015 at 13:49

Status:

Needs work

» Needs review

Status	File	Size
new	2509218-111.patch	10.04 KB
new	increment.txt	3.39 KB

Silly me - the DrupalComponentTest fails if you do that. Back to using mocks, plus fix another class name use.

Log in or register to post comments

Comment #112

dawehner

German

commented 22 September 2015 at 15:50

+++ b/core/lib/Drupal/Component/Utility/OutputStrategyInterface.php
@@ -0,0 +1,35 @@
+ * Output strategies assist in transforming unsanitized HTML strings into
+ * strings that are appropriate for a given context (i.e. plain-text, HTML
+ * attributes), through performing the relevant sanitization and formatting.

I'm still convinced by the comment about unsanizited. If you pass in something like t('Foo

+++ b/core/lib/Drupal/Component/Utility/PlainTextSimpleOutput.php
@@ -0,0 +1,37 @@
+ * @todo Provide a PlainTextFormattedOutput strategy that transforms HTML
+ *   into formatted plain text for use in the email body and long texts.

Do we need to come up with the email given that we already have \Drupal\Core\Mail\MailFormatHelper::htmlToText

Log in or register to post comments

Comment #113

stefan.r commented 22 September 2015 at 16:07

@dawehner yes, 1 is confusing, let me see about rewriting that.

As to 2, earlier in the issue a PlainTextFormattedOutput option came up (to be added in a followup). As far as I can see this would be no different from MailFormatHelper, so we could just move the logic from MailFormatHelper to PlainTextFormattedOutput as the formatted plain text could be used in more contexts than just email.

Log in or register to post comments

Comment #114

dawehner

German

commented 22 September 2015 at 16:09

As to 2, earlier in the issue a PlainTextFormattedOutput option came up (to be added in a followup). As far as I can see this would be no different from MailFormatHelper, so we could just move the logic from MailFormatHelper to PlainTextFormattedOutput as the formatted plain text could be used in more contexts than just email.

Do you think this is needed as part of this issue?

Log in or register to post comments

Comment #115

22 September 2015 at 17:12

The last submitted patch, 101: interdiff-94-98.patch, failed testing.

Log in or register to post comments

Comment #116

22 September 2015 at 17:39

The last submitted patch, 101: 2509218-98.patch, failed testing.

Log in or register to post comments

Comment #117

22 September 2015 at 17:42

The last submitted patch, 103: 2509218-99.patch, failed testing.

Log in or register to post comments

Comment #118

22 September 2015 at 18:23

The last submitted patch, 108: 2509218-107.patch, failed testing.

Log in or register to post comments

Comment #119

stefan.r commented 23 September 2015 at 01:06

Status	File	Size
new	interdiff-111-119.txt	865 bytes
new	2509218-119.patch	10.02 KB

Do you think this is needed as part of this issue?

I think this should be a followup. Created #2573009: Provide a PlainTextFormattedOutput output strategy.

Log in or register to post comments

Comment #120

stefan.r commented 22 September 2015 at 18:49

Status	File	Size
new	interdiff-119-120.txt	711 bytes
new	2509218-120.patch	10.03 KB

Log in or register to post comments

Comment #121

star-szr

he/him

English

commented 23 September 2015 at 11:31

Some thoughts for now:

+++ b/core/lib/Drupal/Component/Utility/HtmlAttributeValueOutput.php
@@ -0,0 +1,33 @@
+ * Use this when rendering a given HTML string into an HTML attribute value,
+ * such as select list options. Never use this to render strings into
+ * "style"/"on*" attributes, or attributes that are not wrapped in quotes.
...
+  public static function renderFromHtml($string) {
+    return Html::escape(PlainTextSimpleOutput::renderFromHtml($string));
+  }

Didn't we say select lists are special from other attributes?

I'd also say there are use cases for including tags in a select list, for example if you are choosing different HTML tags for output of a field.

+++ b/core/lib/Drupal/Component/Utility/OutputStrategyInterface.php
@@ -0,0 +1,35 @@
+<?php
+/**
...
+ * Contains \Drupal\Component\Utility\OutputStrategyInterface.
+ */

Minor: Blank line needed above docblock.

+++ b/core/lib/Drupal/Component/Utility/OutputStrategyInterface.php
@@ -0,0 +1,35 @@
+ * appropriate for a given context (i.e. plain-text, HTML attributes), through
...
+   * a given output context (i.e. plain-text email subjects, HTML attribute

Minor: I think these i.e. should both be e.g.,.

+++ b/core/lib/Drupal/Component/Utility/PlainTextSimpleOutput.php
@@ -0,0 +1,37 @@
+ *   into formatted plain text for use in the email body and long texts.

What does long texts mean here?

Log in or register to post comments

Comment #122

imiksu

Finland

commented 23 September 2015 at 11:59

Status	File	Size
new	interdiff.txt	1.3 KB
new	2509218-122.patch	10.03 KB

Fixed minors (#121.2 and #121.3).

Log in or register to post comments

Comment #123

almaudoh commented 23 September 2015 at 12:01

Re #107 #108, #110: Sorry, didn't know there had been a decision to change the names. Patch looks good.

Log in or register to post comments

Comment #124

dawehner

German

commented 23 September 2015 at 13:42

Given our previous discussion with @stefan.r I think we should add explicit UI test coverage for select form elements with quotes in there and HTML just to see what we exactly should do.

Log in or register to post comments

Comment #125

stefan.r commented 24 September 2015 at 02:49

I'm actually not sure about options elements being special. <option><</option> in a select does /not/ double escape for me.

Log in or register to post comments

Comment #126

stefan.r commented 24 September 2015 at 02:55

Status	File	Size
new	interdiff-122-126.txt	1.74 KB
new	2509218-126.patch	9.98 KB

Log in or register to post comments

Comment #127

stefan.r commented 24 September 2015 at 02:58

Discussed this patch with @catch earlier today and we didn't see the need of escaping HTML attributes - merely turning them into plain text is enough. I do think it makes sense to have a dedicated class for them though, even if they only wrap the plain text one.

If select elements really are special let's address them in a followup as I don't think they're blocking any criticals whereas this one is. Let's get this patch in?

Log in or register to post comments

Comment #128

24 September 2015 at 03:22

Status:

Needs review

» Needs work

The last submitted patch, 126: 2509218-126.patch, failed testing.

Log in or register to post comments

Comment #129

24 September 2015 at 03:25

The last submitted patch, 126: 2509218-126.patch, failed testing.

Log in or register to post comments

Comment #130

catch

he/him

English

commented 24 September 2015 at 05:37

Select options in a followup is fine with me.

Log in or register to post comments

Comment #131

pwolanin commented 24 September 2015 at 09:37

I don't understand the last change. The help text says "Use this when rendering a given HTML string into an HTML attribute value"

If people are putting it into an attribute value (or if we wire this up to a Twig filter) it needs to be escaped.

Log in or register to post comments

Comment #132

pwolanin commented 24 September 2015 at 09:47

Status	File	Size
new	2509218-132.patch	9.99 KB
new	increment.txt	633 bytes

Log in or register to post comments

Comment #133

pwolanin commented 24 September 2015 at 09:47

Status:

Needs work

» Needs review

Log in or register to post comments

Comment #134

stefan.r commented 24 September 2015 at 09:59

@pwolanin I had discussed this with @catch and the idea was not to do any sanitization in these output strategies and arrange this with Twig instead. Which would need updated docs if that's what we want to do.

Alternatively maybe we could sanitize and then mark as safe (for output into attributes).

Log in or register to post comments

Comment #135

stefan.r commented 24 September 2015 at 10:01

re #132 why would we need to escape when outputting into a plain text context? It'd be for output into (non-URL/style/on*) attributes in any case, right?

And wouldn't that attribute value get double escaped whenever Twig has a go at that string?

Log in or register to post comments

Comment #136

24 September 2015 at 10:14

Status:

Needs review

» Needs work

The last submitted patch, 132: 2509218-132.patch, failed testing.

Log in or register to post comments

Comment #137

24 September 2015 at 10:16

The last submitted patch, 132: 2509218-132.patch, failed testing.

Log in or register to post comments

Comment #138

pwolanin commented 24 September 2015 at 11:06

@stefan.r - in the twig context I was assuming you'd use this as a filter, not let Twig do the default autoescaping.

Log in or register to post comments

Comment #139

stefan.r commented 24 September 2015 at 11:10

@pwolanin OK can you double check with @catch what we want to do here then?

Log in or register to post comments

Comment #140

plach

he/him

Italian

Venezia

commented 24 September 2015 at 12:13

@pwolanin

Regardless of what we decide, plain text should not be HTML-escaped. I think the change was applied to the wrong strategy class.

Log in or register to post comments

Comment #141

pwolanin commented 24 September 2015 at 12:44

Oh, huh - yes. I'm a little tired

Log in or register to post comments

Comment #142

catch

he/him

English

commented 24 September 2015 at 13:49

If we let Twig don't we have to add knowledge about the strategy to AttributeString since that's what we use mostly.

I don't think we should do that here. The Views case where t() is put into an attribute can do plain text then put that string into AttributeString which does the escaping.

Anything beyond that is major followup for me not release blocking. And we shouldn't add any strategies we can't use in core at all here either.

Log in or register to post comments

Comment #143

plach

he/him

Italian

Venezia

commented 24 September 2015 at 13:54

Assigned:

Unassigned

» plach

Ok, removing the HTML attribute strategy altogether. We can add it back later if needed.

Log in or register to post comments

Comment #144

plach

he/him

Italian

Venezia

commented 24 September 2015 at 14:51

Assigned:	plach	» Unassigned
Status:	Needs work	» Needs review

Status	File	Size
new	safe_markup-contexts-2509218-144.interdiff.txt	6.7 KB
new	safe_markup-contexts-2509218-144.patch	5.99 KB

21 files were hidden/shown/deleted

Status	File	Size
hidden	safe_markup-contexts-2509218-88.patch	8.86 KB
hidden	2509218-94.patch	9.04 KB
hidden	interdiff.txt	5.17 KB
hidden	interdiff-94-98.patch	11.96 KB
hidden	2509218-98.patch	9.64 KB
hidden	interdiff-94-99.txt	12.04 KB
hidden	2509218-99.patch	9.68 KB
hidden	2509218-107.patch	9.56 KB
hidden	increment.txt	1.38 KB
hidden	2509218-111.patch	10.04 KB
hidden	increment.txt	3.39 KB
hidden	interdiff-111-119.txt	865 bytes
hidden	2509218-119.patch	10.02 KB
hidden	interdiff-119-120.txt	711 bytes
hidden	2509218-120.patch	10.03 KB
hidden	interdiff.txt	1.3 KB
hidden	2509218-122.patch	10.03 KB
hidden	interdiff-122-126.txt	1.74 KB
hidden	2509218-126.patch	9.98 KB
hidden	2509218-132.patch	9.99 KB
hidden	increment.txt	633 bytes

Also improved docs a bit.

Log in or register to post comments

Comment #145

stefan.r commented 25 September 2015 at 00:39

Status	File	Size
new	interdiff-144-145.txt	1.55 KB
new	2509218-145.patch	5.88 KB

Log in or register to post comments

Comment #146

lauriii

he/him

Finnish

Finland

commented 25 September 2015 at 08:50

+++ b/core/lib/Drupal/Component/Utility/PlainTextSimpleOutput.php
@@ -0,0 +1,31 @@
+ *   into formatted plain text for use in the email body and CLI. See

Nit: into fits the previous line

+++ b/core/lib/Drupal/Core/StringTranslation/TranslatableString.php
@@ -140,9 +140,6 @@ public function render() {
-    // @todo https://www.drupal.org/node/2509218 Note that the argument
-    //   replacement is not stored so that different sanitization strategies can
-    //   be used in different contexts.

Why was this @todo removed in #88?

Log in or register to post comments

Comment #147

plach

he/him

Italian

Venezia

commented 25 September 2015 at 09:03

Assigned:

Unassigned

» plach

On this

Log in or register to post comments

Comment #148

plach

he/him

Italian

Venezia

commented 25 September 2015 at 09:16

Status	File	Size
new	safe_markup-contexts-2509218-148.interdiff.txt	847 bytes
new	safe_markup-contexts-2509218-148.patch	5.88 KB

3 files were hidden/shown/deleted

Status	File	Size
hidden	safe_markup-contexts-2509218-144.patch	5.99 KB
hidden	interdiff-144-145.txt	1.55 KB
hidden	2509218-145.patch	5.88 KB

@lauriii

Why was this @todo removed in #88?

Because we changed approach: we no longer automatically apply the output strategy from the ::render() method, instead we apply it to its return value.

Log in or register to post comments

Comment #149

lauriii

he/him

Finnish

Finland

commented 25 September 2015 at 09:13

Assigned:	plach	» Unassigned
Status:	Needs review	» Reviewed & tested by the community

This looks good for me now :) Thanks @plach!

Log in or register to post comments

Comment #150

plach

he/him

Italian

Venezia

commented 25 September 2015 at 09:16

Issue summary:	View changes
Issue tags:	-Needs followup	+API addition

I think we should be done here.

Log in or register to post comments

Comment #151

plach

he/him

Italian

Venezia

commented 25 September 2015 at 09:25

Working on a CR

Log in or register to post comments

Comment #152

almaudoh commented 25 September 2015 at 09:48

RTBC++

Log in or register to post comments

Comment #153

plach

he/him

Italian

Venezia

commented 25 September 2015 at 10:55

Status	File	Size
new	safe_markup-contexts-2509218-153.interdiff.txt	2.28 KB

CR at https://www.drupal.org/node/2574697

Discussed with @alexpott and we agreed PlainTextOutput is a better name for the strategy we are adding here, PlainTextFormattedOutput is still a good name for the upcoming advanced strategy.

Log in or register to post comments

Comment #154

plach

he/him

Italian

Venezia

commented 25 September 2015 at 11:07

Status	File	Size
new	safe_markup-contexts-2509218-153.patch	5.81 KB

And now with patch!

Log in or register to post comments

Comment #155

plach

he/him

Italian

Venezia

commented 25 September 2015 at 11:13

Created #2574723: Figure out whether we need a dedicated output strategy for select elements.

Log in or register to post comments

Comment #156

25 September 2015 at 11:13

Status:

Reviewed & tested by the community

» Needs work

The last submitted patch, 154: safe_markup-contexts-2509218-153.patch, failed testing.

Log in or register to post comments

Comment #157

25 September 2015 at 11:30

The last submitted patch, 154: safe_markup-contexts-2509218-153.patch, failed testing.

Log in or register to post comments

Comment #158

25 September 2015 at 11:34

The last submitted patch, 154: safe_markup-contexts-2509218-153.patch, failed testing.

Log in or register to post comments

Comment #159

jhedstrom

English

Portland, OR

commented 25 September 2015 at 11:54

Fails on bot are PHP Fatal error: Uncaught exception 'ReflectionException' with message 'Class Drupal\Tests\Component\Utility\PlainTextOutputTest does not exist'

+++ b/core/lib/Drupal/Core/StringTranslation/TranslatableString.php
--- /dev/null
+++ b/core/tests/Drupal/Tests/Component/Utility/PlainTextOutputTest.php

+++ b/core/tests/Drupal/Tests/Component/Utility/PlainTextOutputTest.php
@@ -0,0 +1,68 @@
+ * Contains \Drupal\Tests\Component\Utility\PlainTextSimpleOutputTest.

due to a mismatch between class and filename.

Log in or register to post comments

Comment #160

jhedstrom

English

Portland, OR

commented 25 September 2015 at 12:03

Assigned:

Unassigned

» jhedstrom

Working on #159, and whatever else @alexpott finds during review.

Log in or register to post comments

Comment #161

jhedstrom

English

Portland, OR

commented 25 September 2015 at 12:09

Assigned:	jhedstrom	» Unassigned
Status:	Needs work	» Needs review

Status	File	Size
new	interdiff.txt	797 bytes
new	2509218-161.patch	5.8 KB

3 files were hidden/shown/deleted

Status	File	Size
hidden	safe_markup-contexts-2509218-148.patch	5.88 KB
hidden	safe_markup-contexts-2509218-153.interdiff.txt	2.28 KB
hidden	safe_markup-contexts-2509218-153.patch	5.81 KB

Quick rename so tests can run while review proceeds.

Log in or register to post comments

Comment #162

wim leers

Ghent 🇧🇪🇪🇺

commented 25 September 2015 at 12:28

Issue summary:

View changes

Status	File	Size
new	Screen Shot 2015-09-25 at 14.26.52.png	38.43 KB

PlainTextFormattedOutput makes no sense to me.

How can something be both plain and formatted? Contrast with the text field types we have: — you must choose, you can't have both.

Can somebody enlighten me?

Log in or register to post comments

Comment #163

alexpott

he/they

English

🇪🇺🌍

commented 25 September 2015 at 12:41

+++ b/core/lib/Drupal/Component/Utility/PlainTextOutput.php
@@ -0,0 +1,31 @@
+ * @todo Provide a PlainTextFormattedOutput strategy that transforms HTML into
+ *   formatted plain text for use in the email body and CLI. See
+ *   https://www.drupal.org/node/2573009.

I'm too am confused by this comment. I thought this patch would give as the tools necessary to do #2572597: Replace !placeholder with @placeholder in mail code. Tbh I don't get why we have to do this - surely MailFormatHelper::htmlToText() just does what it does and this is great.

Log in or register to post comments

Comment #164

wim leers

Ghent 🇧🇪🇪🇺

commented 25 September 2015 at 12:52

Status	File	Size
new	2509218-163.patch	5.81 KB
new	interdiff.txt	2.46 KB

+++ b/core/lib/Drupal/Component/Utility/OutputStrategyInterface.php
@@ -0,0 +1,36 @@
+ * appropriate for a given context (e.g. plain-text), through performing the

s/plain-text/plain text/

+++ b/core/lib/Drupal/Component/Utility/OutputStrategyInterface.php
@@ -0,0 +1,36 @@
+ * relevant formatting. No santization is applied.

s/santization/sanitization/

```
+++ b/core/lib/Drupal/Component/Utility/PlainTextOutput.php
@@ -0,0 +1,31 @@
+ * Provides an output strategy for transforming HTML into simple plain text.
```
What is "simple" plain text? Isn't plain text always "simple"?

I'm betting this is related to the "formatted plain text", but that doesn't mean this is clear. I think it's fine to omit.

+++ b/core/lib/Drupal/Component/Utility/PlainTextOutput.php
@@ -0,0 +1,31 @@
+ * @todo Provide a PlainTextFormattedOutput strategy that transforms HTML into
+ *   formatted plain text for use in the email body and CLI. See
+ *   https://www.drupal.org/node/2573009.

Discussed with @jhedstrom, and now #162 is answered. I still think the name doesn't make sense, but we can discuss that further/refine it in the follow-up issue. I will comment there in a minute.

+++ b/core/tests/Drupal/Tests/Component/Utility/PlainTextOutputTest.php
@@ -0,0 +1,68 @@
+   * @param array $args

s/array/string[]/

+++ b/core/tests/Drupal/Tests/Component/Utility/PlainTextOutputTest.php
@@ -0,0 +1,68 @@
+   *   none.

s/none/the empty array/

EDIT: to be clear, I fixed all my nits.

Log in or register to post comments

Comment #165

wim leers

Ghent 🇧🇪🇪🇺

commented 25 September 2015 at 13:00

Ok, per #163, we need to deal with the mail thing here.

I think PlainTextFormattedOutput is an extremely confusing name. So let's try to find something better. This is the relevant code:

  /**
   * Transforms an HTML string into plain text, preserving its structure.
   *
   * The output will be suitable for use as 'format=flowed; delsp=yes' text
   * (RFC 3676) and can be passed directly to MailManagerInterface::mail() for sending.
   *
   * We deliberately use LF rather than CRLF, see MailManagerInterface::mail().
   *
   * This function provides suitable alternatives for the following tags:
   * <a> <em> <i> <strong> <b> <br> <p> <blockquote> <ul> <ol> <li> <dl> <dt>
   * <dd> <h1> <h2> <h3> <h4> <h5> <h6> <hr>
   *
   * …
   */
  public static function htmlToText($string, $allowed_tags = NULL) {

So it is transforming HTML into plain text, while preserving the HTML structure and just mapping it to a syntax defined in RFC 3676. That RFC is titled The Text/Plain Format and DelSp Parameters.

So it's actually transforming text/html to text/plain.

Suggested names based on this so far: StructuredPlainTextOutput, MailPlainTextOutput, PlainTextMailOutput.

Second, let's look at that RFC for inspiration:

3.  The Problem

   The Text/Plain media type is the lowest common denominator of
   Internet email, with lines of no more than 998 characters (by
   convention usually no more than 78), and where the carriage-return
   and line-feed (CRLF) sequence represents a line break (see [MIME-IMT]
   and [MSG-FMT]).

   Text/Plain is usually displayed as preformatted text, often in a
   fixed font.  That is, the characters start at the left margin of the
   display window, and advance to the right until a CRLF sequence is
   seen, at which point a new line is started, again at the left margin.
   When a line length exceeds the display window, some clients will wrap
   the line, while others invoke a horizontal scroll bar.

   Text which meets this description is defined by this memo as "fixed".

Suggested names based on this: PreformattedPlainTextOutput, MonospacedPlainTextOutput.

Conclusion: pick one of these names:

StructuredPlainTextOutput
MailPlainTextOutput
PlainTextMailOutput
PreformattedPlainTextOutput
MonospacedPlainTextOutput

I think PlainTextMailOutput is the best one.

Log in or register to post comments

Comment #166

jhedstrom

English

Portland, OR

commented 25 September 2015 at 13:03

Status:

Needs review

» Reviewed & tested by the community

Status	File	Size
new	interdiff.txt	678 bytes
new	2509218-165.patch	5.61 KB

3 files were hidden/shown/deleted

Status	File	Size
hidden	Screen Shot 2015-09-25 at 14.26.52.png	38.43 KB
hidden	2509218-163.patch	5.81 KB
hidden	interdiff.txt	2.46 KB

Discussed with @stefan.r and @alexpott, and decided the @todo could be removed entirely, and that this is rtbc assuming it goes green.

Log in or register to post comments

Comment #167

wim leers

Ghent 🇧🇪🇪🇺

commented 25 September 2015 at 13:14

Eh, ok. Can we document here why?

I've transplanted my commment at #165 to #2573009-3: Provide a PlainTextFormattedOutput output strategy.

Log in or register to post comments

Comment #168

jhedstrom

English

Portland, OR

commented 25 September 2015 at 13:21

re #167 the reasoning is that the new class isn't strictly needed as part of this fix, so an @todo is premature, but the new class can still be discussed in #2573009: Provide a PlainTextFormattedOutput output strategy.

Log in or register to post comments

Comment #169

alexpott

he/they

English

🇪🇺🌍

commented 25 September 2015 at 14:10

Status:

Reviewed & tested by the community

» Fixed

Thanks everyone - this looks a great solution. Committed 70bad3e and pushed to 8.0.x. Thanks!

Log in or register to post comments

Comment #170

25 September 2015 at 14:10

alexpott committed 70bad3e on 8.0.x

Issue #2509218 by plach, stefan.r, pwolanin, effulgentsia, dawehner,...

Log in or register to post comments

Comment #171

25 September 2015 at 14:21

The last submitted patch, 154: safe_markup-contexts-2509218-153.patch, failed testing.

Log in or register to post comments

Comment #172

25 September 2015 at 15:26

The last submitted patch, 161: 2509218-161.patch, failed testing.

Log in or register to post comments

Comment #173

25 September 2015 at 16:39

The last submitted patch, 164: 2509218-163.patch, failed testing.

Log in or register to post comments

Comment #174

25 September 2015 at 16:48

Status:

Fixed

» Needs work

The last submitted patch, 166: 2509218-165.patch, failed testing.

Log in or register to post comments

Comment #175

berdir

German

Switzerland

commented 25 September 2015 at 16:50

Status:

Needs work

» Fixed

Too slow old testbot, too slow.

Log in or register to post comments

Comment #176

plach

he/him

Italian

Venezia

commented 25 September 2015 at 16:53

:)

Log in or register to post comments

Comment #177

sun

German

Karlsruhe

commented 27 September 2015 at 21:02

This change attempts to introduce a custom concept of "output strategies", not a utility. Why was the code added to Utility?

Log in or register to post comments

Comment #178

plach

he/him

Italian

Venezia

commented 28 September 2015 at 08:18

@sun:

hey :)

My original patch was providing an OutputFormatter namespace, @pwolanin asked for a simplification of the approach and since both Html and SafeMarkup live in Utility, that felt like a good place also for PlainTextOutput.

Btw, I think it would useful if SafeMarkup implemented OutputStrategyInterface, providing a ::renderFromHtml() method implementation just casting its input to string. That would allow to pass it to methods receiving any output strategy as input.

Log in or register to post comments

Comment #179

12 October 2015 at 08:24

Status:

Fixed

» Closed (fixed)

Automatically closed - issue fixed for 2 weeks with no activity.

Log in or register to post comments

Ensure that SafeString objects can be used in non-HTML contexts

Problem/Motivation

Proposed resolution

Create a simple interface with a method to for HTML to text strategy classes

Remaining tasks

User interface changes

API changes

Data model changes

Comments