Improve config patch and merge for sequences [#3232243]

Comment	File	Size	Author
#39	interdiff_28-38.txt	2.86 KB	pyrello
#38	interdiff_28-38.txt	0 bytes	pyrello
#38	3232243-38.patch	10.33 KB	pyrello
#28	3232243-28.patch	9.46 KB	pyrello
#10	improve-patch-merge-sequences-3232243-10-FAILING.patch	1.09 KB	pyrello

Comment #1

9 September 2021 at 21:46

bircher created an issue. See original summary.

Comment #2

10 September 2021 at 15:00

pyrello made their first commit to this issue’s fork.

Comment #3

10 September 2021 at 15:01

pyrello opened merge request !8

Comment #4

pyrello commented 10 September 2021 at 15:02

I made a first attempt at getting the logic for sequences working better. Next step is to write a test for this, but I'll probably need some help on that front, since I don't have a lot of experience doing it.

Comment #5

12 September 2021 at 21:05

pyrello opened merge request !9

Comment #6

12 September 2021 at 21:05

pyrello closed merge request !8

Comment #7

pyrello commented 12 September 2021 at 21:07

I'm starting over with a new branch and MR so that I can establish passing tests to start with.

Comment #8

pyrello commented 12 September 2021 at 21:21

Ugh. Didn't realize that using the ProphecyTrait would be a problem, so I had to remove it to get the test runner not to error out. Also reverted the last commit prior to that to establish the initial passing test.

Comment #9

pyrello commented 12 September 2021 at 21:51

Last commit should fix the new test that was failing, but will fail on the existing testSimpleMergeExample test. I'm still not totally clear on the logic of what is happening in that test, but it fails on the last assertion.

Comment #10

pyrello commented 13 September 2021 at 00:01

Status:

Active

» Needs review

Status	File	Size
new	improve-patch-merge-sequences-3232243-10-FAILING.patch	1.09 KB

Adding a patch with just the new permission (sequence) test that was added to the MR in the last commit to demonstrate that it is failing prior to the rest of the changes in the MR.

Comment #11

13 September 2021 at 00:06

Status:

Needs review

» Needs work

The last submitted patch, 10: improve-patch-merge-sequences-3232243-10-FAILING.patch, failed testing. View results

Comment #12

bircher commented 13 September 2021 at 11:07

This is looking great already.

One small thing we need to add to the test you are adding (in #10): we should also test that the merging works again. I know the sorting can be off; that is a problem for another day.
Basically, if you take the config patch (or its inverse) and apply it to one of the examples, you should get the other one.

We can also add more contrived examples that add and remove things etc. with a data provider, but that is not a hard requirement for a first version.

I think the failing test is testing pretty much this, though in a more involved workflow. So if you can make an easier-to-understand version like I described in the first paragraph of this comment then that will be helpful for all contributors.

Comment #13

bircher commented 13 September 2021 at 11:09

Maybe it is as simple as switching the boolean value of the preserve-integer-keys argument on the merge utility.

Oh and the method should probably not be public, none of this is an API.

Comment #14

pyrello commented 13 September 2021 at 13:34

The problem that is causing the previously existing patch test to fail seems to happen because of the NestedArray::mergeDeepArray() method, which is unfortunate. When you look at the existing test for this method (https://git.drupalcode.org/project/drupal/-/blob/9.3.x/core/tests/Drupal...), I think it is pretty clear that this use case wasn't envisioned. If $preserve_integer_keys is FALSE, there is no array_search to check if the value already exists so that it doesn't get added twice.

I'm not sure if, in the world of config, it is a valid use case to have multiple elements in a sequence that are identical to one another or not. If that is a valid use case, then I'm not sure how to solve for both that and being able to merge config that may have duplicative elements that are not intended to be duplicated.
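
The duplication described above can be reproduced in a minimal sketch. PHP's built-in array_merge_recursive() is used here as a stand-in for the core utility (an assumption, since NestedArray::mergeDeepArray() is not loaded in a standalone script), and the permission lists are made up for illustration.

```php
<?php
// Hypothetical permission lists; not taken from a real site.
$active = ['permissions' => ['access content', 'view own unpublished content']];
$patch = ['permissions' => ['access content', 'administer nodes']];

// Integer-keyed values are appended and renumbered, so the shared entry
// 'access content' ends up in the merged list twice.
$merged = array_merge_recursive($active, $patch);

// One possible de-duplicating step for flat sequences: drop repeated
// values and reindex. This would also remove intentional duplicates.
$deduped = array_values(array_unique($merged['permissions']));

print_r($merged['permissions']);
print_r($deduped);
```

The de-duplicating step is exactly where the two use cases collide: it cannot tell an accidental repeat from a deliberate one.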

Comment #15

pyrello commented 13 September 2021 at 19:53

Assigned:

Unassigned

» pyrello

Comment #16

14 September 2021 at 16:26

pyrello opened merge request !11

Comment #17

pyrello commented 14 September 2021 at 20:13

I opened MR 11 because I needed to test the code from https://www.drupal.org/project/config_split/issues/3232667 and this issue at the same time, and to be able to show progress on the corresponding work that I am doing related to this. I can close it when I am able to create a "finalized" patch, or alternatively when these two issues' MRs get merged.

Comment #18

pyrello commented 14 September 2021 at 20:25

Based on my understanding of the issues that were raised in Slack about the necessity of possible duplicate entries in config sequences, it seems like we have two situations that we are trying to account for:

Some sequences should not allow duplicates. Their presence likely indicates that the config is wrong. An example of this would be permissions in a user.role.*.yml file.
Some sequences may need to allow for duplicates. I am not personally familiar with any examples of this, but there is nothing preventing anyone from using config this way, so we need to allow for the possibility.

I believe that the first case is actually the more common use case for config and if we were going to solve one versus the other in the near term, that it should be that case. @bircher and I have discussed the possibility of using some sort of plugin system that would provide a finer degree of control over individual cases. I think that is probably going to be necessary without changes to core config schema properties. If such a system comes into being, I still think the default case should be de-duplicating config sequences.

I think MR 9 is mostly accomplishing the first case above.

As the most recent test failure shows, there may still be room for adding some sort of post-merge sorting to sequences, so they will be in a predictable order.

Comment #19

pyrello commented 16 September 2021 at 21:04

Expecting this to go from 1 failing test to 2 failing tests.

Comment #20

pyrello commented 16 September 2021 at 21:12

And then back to 1 failing test.

Comment #21

bircher commented 20 September 2021 at 12:04

I just had a crazy idea: what if we make the keys of sequential arrays string keys with some smart logic before doing the diff?

Like if the array contains unique strings (such as for permissions or dependencies) we make the key csp_unique $value. Or if the array contains another array we check if it has a uuid or id component and make the key csp_uuid $value['uuid'].

Then after merging we clean it up again and remove all the keys where we added the csp_ prefix. For legitimate duplicates each one gets its own key, and for arrays which don't have a uuid, or where it is not unique, we leave it as is or just prefix it but keep it an integer so that it will be removed and added as now.

Basically the idea is to try to identify the elements better so that the diff doesn't compare by identifying it as "the first element".
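
A rough sketch of that keying idea, for sequences only. The function names csp_key_sequence() and csp_unkey_sequence() are hypothetical and exist only for this illustration; the key formats follow the description above, and the union operator stands in for whatever merge would actually be used.

```php
<?php
// Hypothetical helper: give list items predictable string keys before diffing.
function csp_key_sequence(array $items): array {
  $keyed = [];
  foreach ($items as $index => $value) {
    if (is_string($value) && !isset($keyed['csp_unique ' . $value])) {
      // Unique strings (permissions, dependencies) are keyed by value.
      $keyed['csp_unique ' . $value] = $value;
    }
    elseif (is_array($value) && isset($value['uuid']) && is_string($value['uuid']) && !isset($keyed['csp_uuid ' . $value['uuid']])) {
      // Nested items with a uuid are keyed by that uuid.
      $keyed['csp_uuid ' . $value['uuid']] = $value;
    }
    else {
      // No reliable identity, or a repeat: keep the integer key.
      $keyed[$index] = $value;
    }
  }
  return $keyed;
}

// Hypothetical cleanup: turn the keyed sequence back into a plain list.
function csp_unkey_sequence(array $keyed): array {
  return array_values($keyed);
}

$a = ['access content', 'administer nodes'];
$b = ['administer nodes', 'view own unpublished content'];

// With string keys, the array union keeps one entry per identity.
$merged = csp_unkey_sequence(csp_key_sequence($a) + csp_key_sequence($b));
print_r($merged);
```

Items that fall through to the integer-key branch are still compared by position, so they behave as they do without the keying.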

Comment #22

pyrello commented 23 September 2021 at 17:13

We had discussed how we might handle things like core.entity_view_display.node.*, especially in cases where Layout Builder is turned on and being used to provide a content type template. I think ideally, we would be able to have some sort of object comparison go through and figure out what is different for each object it finds. But I also think that how it is working right now is fine for the time being.

Here is an example of a patched config file of the above mentioned type: https://github.com/uiowa/uiowa/blob/0115ed14b9ed5115f21c149301d3a16488f2...

The issue we are trying to solve in this thread is something that makes CS2 unusable for certain types of configuration, which would be necessary for us to move forward with using it. Solving for more intelligent LB template handling and similar use cases seems like more of a nice to have that we could add in the future. So I propose a separate issue for tackling that and decoupling it from this issue.

If we are removing that from consideration, then I think we just need to figure out how to fix the sorting problem in the tests and make whatever adjustments are necessary there to make the tests work in a way that we have confidence in them.

Comment #23

alokbhatt commented 13 October 2021 at 13:50

I was facing an issue while running config-split:import for user role permissions. I was changing permissions for some roles, so I added a few permissions with a config-split.patch file. After importing, I noticed that some of the permissions assigned in the main user.role permission configuration file had changed, and because of that the admin toolbar was removed for a role.

I applied the patch https://git.drupalcode.org/project/config_split/-/merge_requests/11.patch and that fixed the problem. Now only the permissions written in the config-split.patch file are affected.

Thanks for the patch.

Comment #24

pyrello commented 13 October 2021 at 14:18

@alokbhatt - Just a word of warning that work in this issue is ongoing and so the format could yet change.

Glad to hear the patch worked for you!

Comment #25

pyrello commented 14 October 2021 at 13:48

@bircher - If you are okay with deferring the more complex object handling that we have discussed until this issue: https://www.drupal.org/project/config_split/issues/3238855, then I think the only thing needed here is to resolve the issue that causes the merge test to fail, which is that the merged sequences don't match because we are not sorting them. I'm not 100% sure that the test itself doesn't cause this problem because we are using fictitious config that is not associated with a schema that would allow the Sorter to do its thing. Some guidance on how to proceed would be appreciated.

Comment #26

14 October 2021 at 19:56

pyrello closed merge request !11

Comment #27

pyrello commented 14 October 2021 at 20:02

Status:

Needs work

» Needs review

This solution resolves the failing test. Looking for feedback on whether this approach is okay.

Comment #28

pyrello commented 14 October 2021 at 20:36

Status	File	Size
new	3232243-28.patch	9.46 KB

Adding a patch that I can reference from my project. Based on my testing, this seems stable enough for me to use for the time being.

Comment #29

bircher commented 17 October 2021 at 15:06

This already looks very good. I am just a bit afraid of the sorting, but maybe it is just me being paranoid.

I will have to test this a bit more with some real data and maybe commit it in two steps so that we have more traces in the git log of how the array utility methods change.

It may be hypothetical (and in Drupal there is always someone who has a case where it becomes practical), but I don't know if this could handle things that are duplicated in sequences on purpose (like the same layout builder block twice?).
That is why I thought it could be an option to do something more defensive, i.e. change keys from numeric to predictable strings in cases where we know what to expect (i.e. know the type of values etc.).

Also if this works for you and you are not @pyrello or @alokbhatt (and not working on the same teams) then by all means chime in and let me know! The more people that say that this fixes their problem and nobody that says that it makes things worse the more confident I am with the approach.

Comment #30

pyrello commented 18 October 2021 at 02:49

For what it's worth, I believe that for blocks in layout templates in core.entity_view_display.node.*, each block (even duplicates) would have a different UUID, so this would not be an issue in that case. I can’t say for sure that this would never come up (your point about there always being a practical case for hypotheticals in Drupal is well taken). My testing with those particular files has seemed to indicate that they patch as expected.

I get being paranoid about the sorting. That is the part of this that I am the most unsure about. But it also doesn’t break any existing tests, so it is not entirely clear without additional information what the problem with it might be.

Comment #31

tedfordgif commented 22 October 2021 at 15:03

Here are the configs in one of my codebases that have explicit integer array keys:

$ grep -r '^ *\d+:' -P config -l | sort | head -3
config/dev/devel.settings.yml
config/sync/system.site.yml
config/sync/views.view.[redacted].yml
# many more views

The integer key in devel.settings is for the error_handlers, which is used in devel.module:

      $error_handlers = devel_get_handlers();
      if (!empty($error_handlers[DEVEL_ERROR_HANDLER_STANDARD])) {
        \Drupal::messenger()->addMessage($msg, ($severity_level <= RfcLogLevel::NOTICE ? MessengerInterface::TYPE_ERROR : MessengerInterface::TYPE_WARNING), TRUE);
      }
      if (!empty($error_handlers[DEVEL_ERROR_HANDLER_BACKTRACE_KINT])) {
        print kpr(ddebug_backtrace(TRUE, 1), TRUE, $msg);
      }
      if (!empty($error_handlers[DEVEL_ERROR_HANDLER_BACKTRACE_DPM])) {
        dpm(ddebug_backtrace(TRUE, 1), $msg, 'warning');
      }

system.site has 403 and 404, but those are siblings of the "front" key, so this would technically be OK with your patch. Feels a little squishy to me, if not fishy.

For views.view.* I have four scenarios:

Filter groups: Pretty sure this would be affected (see core/modules/views/src/Plugin/views/query/Sql.php:919), but I've never had to split config at this level
Grouped Filters: Ditto, but in core/modules/views/src/Plugin/views/filter/FilterPluginBase.php:858. This sometimes has an 'All' key in the code, but that doesn't get added to the config.
Views bulk operations: Also need to preserve these keys.
Entity ID filters (e.g. term ID): Didn't bother checking if these need preserved, but I can imagine a site with a terrible development workflow where dev has different term IDs than prod, and they resort to config split, even as a temporary measure. Frankly they should pay the price, but this is just the nail in the coffin for this approach as a general one.

Why don't we just make all configs strongly typed? <ducks>

Comment #32

pyrello commented 22 October 2021 at 17:37

@tedfordgif Thanks for bringing up these cases! This truly is not an easy problem to solve.

I have no idea how to resolve the case of devel.settings.yml:

error_handlers:
  1: 1

I think by the time we get to the point where we are doing the array merge, the above would be indistinguishable from:

error_handlers:
  - 1

For the system.site.yml, that is exactly the reason why it only considers the keys to be numeric if all of them are numeric. I'm open to suggestions about how to improve it.
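
To make the "all keys numeric" rule concrete, here is a small sketch. The function name all_keys_are_integers() is hypothetical and the array values are made up; only the key shapes follow the files discussed above.

```php
<?php
// Hypothetical check: an array counts as numeric-keyed only if every key
// is an integer.
function all_keys_are_integers(array $array): bool {
  foreach (array_keys($array) as $key) {
    if (!is_int($key)) {
      return FALSE;
    }
  }
  return TRUE;
}

// Key shape of the system.site case: 403 and 404 sit next to 'front'.
$page = [403 => '', 404 => '', 'front' => '/node'];
// devel.settings error_handlers written as "1: 1" versus "- 1".
$mapping = [1 => 1];
$list = [0 => 1];

var_dump(all_keys_are_integers($page));
var_dump(all_keys_are_integers($mapping));
var_dump(all_keys_are_integers($list));
```

The mixed system.site array is left alone, but the two error_handlers variants both pass the check, which is why this rule alone cannot tell them apart.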

I'll have to look into views a bit more. While I don't think we are currently using config split to override any existing views (which is where this patch merge code would come into play), it is something I had been thinking we would be able to take advantage of after moving to Config Split 2.

I will agree that this solution breaks (or potentially breaks) some of the above config entities if they are partially split. However, I would ask, does that seem like a bigger bug than the way it currently handles something like a user.role.*.yml without these changes? Here is an example of a patch export of a permissions file from my project: https://github.com/uiowa/uiowa/blob/b2f9d8847917aff4ca62637fe1055fffa931...

Comment #33

tedfordgif commented 22 October 2021 at 18:55

Edit: I know these don't address the "do we ignore the keys" question, but they are related. I don't have time to track down the existence of the "ignore?" ticket.

Comment #34

pyrello commented 22 October 2021 at 19:41

I don't think that first issue will have any effect on the problem with sequence handling outlined by this issue. That issue is driven by the fact that the merging method used doesn't have an elegant way of handling sequences (which really just reflects the same underlying problem with PHP's array_merge_recursive() function).

I added a test to cover the devel.settings.yml case.

Comment #35

bircher commented 22 October 2021 at 20:56

Hmm, instead of checking if the keys are integers, I think we could use the same logic the YAML formatting uses to decide whether to write a dash or the number. (I am not suggesting to call that function but to copy that code or idea.)
I think it comes down to checking whether the array keys match range(0, count($arr)) or similar.
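
A minimal sketch of such a check, with a hypothetical function name. For the comparison to hold, the upper bound has to be count($array) - 1, and the empty array needs its own branch because range(0, -1) is not empty. On PHP 8.1 and later the built-in array_is_list() performs the same test.

```php
<?php
// Hypothetical helper: TRUE when the keys are exactly 0, 1, ..., n - 1 in
// order, which is the shape that gets written with dashes in YAML.
function is_sequence(array $array): bool {
  if ($array === []) {
    return TRUE;
  }
  return array_keys($array) === range(0, count($array) - 1);
}

var_dump(is_sequence([0 => 1]));              // "- 1"
var_dump(is_sequence([1 => 1]));              // "1: 1"
var_dump(is_sequence(['a', 'b', 'c']));       // plain list
var_dump(is_sequence([403 => '', 404 => ''])); // numeric keys, not a list
```

Only the first and third arrays count as sequences, so the devel.settings "1: 1" mapping keeps its key.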

I think it would also be good to clearly see where we deviate from the core utility.

I am not sure that we shouldn't also be able to deal with duplicates, but I don't have any good example of any config where that would be a thing. But for that I think the only way is to transform the keys so that this information can be kept.

What I like about this is that it solves a lot of problems.

And I think we should not interfere with the sorting here; it's fine if it gets added to the end. The sorting should be a different concern. So for the test we can add a sorter that sorts everything recursively and add a dummy test name (I will refactor it anyway sooner or later to make the name required).
Or alternatively we make assertions that ignore the order.

Comment #36

pyrello commented 25 October 2021 at 18:15

"I am not sure that we shouldn't also be able to deal with duplicates, but I don't have any good example of any config where that would be a thing. But for that I think the only way is to transform the keys so that this information can be kept." (quoted from #35)

To clarify, if we are going to do a check for duplicates, this would be within each array prior to merging. The merge by its very nature might produce duplicates and we wouldn't want to keep those. So, we would need a check prior to looping through the arrays to check this and to set the $is_sequence variable (introduced in the last commit) to be FALSE if duplicates were detected.

Comment #37

pyrello commented 25 October 2021 at 18:13

For duplicates, I think we could add another private static method that essentially does something like this: https://stackoverflow.com/a/43635394/1375741 and checks each array before processing and sets $is_sequence to FALSE if duplicates are present.

However, since we are not actually aware of any cases where valid duplicates might be used, would it make sense to leave this as is, for now, and add the duplicate check later if someone actually reports it as an issue?
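
For reference, a duplicate check of that kind could look roughly like the sketch below. The function name is hypothetical, and this is not necessarily what the linked answer does; values are serialized here so that nested arrays can be compared as well.

```php
<?php
// Hypothetical helper: TRUE if any value appears more than once.
function sequence_has_duplicates(array $items): bool {
  // Serialize each item so that nested arrays compare by content and
  // 1 and '1' stay distinct.
  $serialized = array_map('serialize', $items);
  return count($serialized) !== count(array_unique($serialized));
}

var_dump(sequence_has_duplicates(['access content', 'administer nodes']));
var_dump(sequence_has_duplicates(['access content', 'access content']));
var_dump(sequence_has_duplicates([['id' => 'a'], ['id' => 'a']]));
```

In the approach described above, a TRUE result for either input array would set $is_sequence to FALSE before the merge loop runs.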

Comment #38

pyrello commented 25 October 2021 at 18:40

Status	File	Size
new	3232243-38.patch	10.33 KB
new	interdiff_28-38.txt	0 bytes

1 file was hidden/shown/deleted

Status	File	Size
hidden	3232243-28.patch	9.46 KB

Adding another patch based on current progress.

Comment #39

pyrello commented 25 October 2021 at 18:42

Status	File	Size
new	interdiff_28-38.txt	2.86 KB

1 file was hidden/shown/deleted

Status	File	Size
hidden	interdiff_28-38.txt	0 bytes

Messed up the interdiff. Here's a new one.

Comment #40

bircher commented 25 October 2021 at 19:29

RE #37: Yes that makes sense, we can ignore duplicates for now, and then deal with it when there is a real case.

I suspect that your proposed solution for duplicates won't work though because we would need to detect duplicates also when creating the patch. So we could have duplicates before merging, we could have duplicates in the patch or we could have duplicates in the array before applying the patch.

So in order not to let the perfect get in the way of the good, I think we can live with it for now.

Just a little thing: the new methods need to be private because they certainly do not constitute an API, and I can already see people using them and then their code breaking... All of this here is just a sandbox to find out what an API in core or another module would have to do.

Comment #41

bircher commented 25 October 2021 at 19:33

Status:

Needs review

» Needs work

I'll give it a proper review when I have some more time, but for now the public methods need work (i.e. the tests need to be updated too).

I would prefer if we tested it through the public API of creating the patch and applying it, but I can live with a closure that uses call to access private functions of other classes in a test, or even reflection.
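
As an illustration of the closure approach, here is a sketch that reaches a private static method from outside its class. The class PatchMergeExample and its isSequence() method are hypothetical stand-ins, and Closure::bind() is used rather than Closure::call() because no object instance is needed for a static method.

```php
<?php
// Hypothetical class standing in for the one that holds the private helpers.
final class PatchMergeExample {

  private static function isSequence(array $array): bool {
    return $array === [] || array_keys($array) === range(0, count($array) - 1);
  }

}

// Bind a static closure to the class scope so that it is allowed to call
// the private method.
$isSequence = \Closure::bind(
  static function (array $array): bool {
    return PatchMergeExample::isSequence($array);
  },
  NULL,
  PatchMergeExample::class
);

var_dump($isSequence(['a', 'b']));
var_dump($isSequence([1 => 1]));
```

This keeps the production method private while still letting a test call it directly.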

Comment #42

pyrello commented 25 October 2021 at 20:38

Status:

Needs work

» Needs review

Seeing what happens with the tests.

Comment #43

pyrello commented 26 October 2021 at 04:01

I haven't quite figured out the testNonSequenceNumericKeyMerge test yet, but I think I have the rest of them working. If it's obvious what I'm missing about how that test should be set up, let me know.

Comment #44

pyrello commented 26 October 2021 at 04:02

Status:

Needs review

» Needs work

Comment #45

bircher commented 26 October 2021 at 08:15

Ah that is because the test you copied is way too complicated for what we want to test here.

All we need is:
$configA
$configB

then we create a $patchAB and $patchBA

then we assert that the patch applied to the config gives the other config. (ie it works both ways.)

And that one patch is the inverse of the other.
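
The shape of that test can be sketched with simplified stand-ins. The helpers create_patch(), apply_patch(), invert_patch() and same_items() are hypothetical, handle flat lists only, and are not the module's patch classes; the comparison ignores order, since sorting is treated as a separate concern.

```php
<?php
// Hypothetical, simplified patch helpers for flat lists only.
function create_patch(array $from, array $to): array {
  return [
    'added' => array_values(array_diff($to, $from)),
    'removed' => array_values(array_diff($from, $to)),
  ];
}

function apply_patch(array $config, array $patch): array {
  // Drop the removed values, then append the added ones at the end.
  return array_merge(array_diff($config, $patch['removed']), $patch['added']);
}

function invert_patch(array $patch): array {
  return ['added' => $patch['removed'], 'removed' => $patch['added']];
}

// Order-insensitive comparison of two lists.
function same_items(array $a, array $b): bool {
  sort($a);
  sort($b);
  return $a === $b;
}

$configA = ['access content', 'administer nodes'];
$configB = ['access content', 'view own unpublished content'];

$patchAB = create_patch($configA, $configB);
$patchBA = create_patch($configB, $configA);

// The patch applied to one config gives the other, in both directions.
var_dump(same_items(apply_patch($configA, $patchAB), $configB));
var_dump(same_items(apply_patch($configB, $patchBA), $configA));
// One patch is the inverse of the other.
var_dump(invert_patch($patchAB) === $patchBA);
```

In a PHPUnit test, $configA and $configB would come in as arguments and the three checks would be assertions.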

Comment #46

bircher commented 26 October 2021 at 08:18

Oh, and of course config A and B could come from a data provider, and we could optionally add an expected patch.

Comment #47

pyrello commented 26 October 2021 at 13:52

Status:

Needs work

» Needs review

Refactored the test for numeric keys which are not a sequence. In the process of doing that, I noticed that the diffArrayRecursive method was not respecting the non-sequence keys, which was causing the test to fail. I refactored to fix this issue.

Ready for review!

Comment #48

pyrello commented 26 October 2021 at 14:07

Status:

Needs review

» Needs work

Changing the logic in diffArrayRecursive broke another test.

Comment #49

pyrello commented 26 October 2021 at 14:17

Status:

Needs work

» Needs review

The test was implicitly using a mixed array (numeric and string keys), which is not what I think we are trying to test.

Comment #50

bircher commented 27 October 2021 at 10:59

I refactored the tests a bit. I hope it will be easier in the future to add test cases to the provider.
I also added it to the beginning so that it is the first thing you see and not the complicated test I made for myself.

Comment #51

pyrello commented 27 October 2021 at 12:53

These changes look great! Much more robust testing tools for the patch side of things!

Comment #52

pyrello commented 27 October 2021 at 14:35

Status:

Needs review

» Reviewed & tested by the community

I checked with @tedfordgif and he indicated that he was okay with the changes here. I tested again against my setup and am still getting consistent results for user.role.* config files on export. Marking as RTBC.

Comment #53

27 October 2021 at 18:36

bircher committed 110b275 on 2.0.x authored by pyrello

Issue #3232243 by pyrello, bircher, tedfordgif: Improve config patch and...

Comment #54

bircher commented 27 October 2021 at 18:37

Status:

Reviewed & tested by the community

» Fixed

Thanks for all the contributions!

This is a big improvement even though we left some cases unaccounted for.

Comment #55

10 November 2021 at 18:39

Status:

Fixed

» Closed (fixed)

Automatically closed - issue fixed for 2 weeks with no activity.
