Problem/Motivation

  1. During a cold start, we load a lot of config files from disk. These files must be stored on the shared filesystem when using multiple web heads. This is slow and often unreliable.
  2. We have an active config directory which looks editable but if you actually dare to edit it, things will break horribly. This is unnecessary brittleness.

Proposed resolution

  1. We retain the pluggable storage backend support for active configuration. Core ships with db storage by default, with no caching.
  2. The config.storage service is aliased to an un-cached config so that follow-up issues (or site-builders) can swap in a caching or a pre-load layer

Advantages of this approach

  • discourage people from hacking their active config
  • Less potential confusion about which files are which in the import/export workflow
  • less YAML parsing and file stats
  • more secure - the active configuration in no longer in files
  • More familiar development workflow - live database contains everything needed aside from the code

Cons or points of debate

  • harder to recover from invalid config causing a site-wide WSOD (#23), but similar to Drupal 7 in this regard
  • "discoverability" of the active configuration by browsing files is lost. However they can already be browsed using the single export form
  • The quick "copy from active" staging and export is gone. Import/Export UI already partially addresses this, but we would need to document expected workflows (with drush, with UI export, example below, etc.) in the handbook so that we can have people test them once the change is made.
  • There are some cases where there is no UI for what's in config (e.g., Breakpoint, Tour, REST modules) and editing active config directly (rather than via export/import) is safe (i.e., config entities aren't being created, deleted, or renamed, and there are no hook implementations that need to respond to what's being changed). For these cases, the development process is made more tedious by requiring either manually editing serialized db rows or going through the export/import workflow for each iterative change. However, #2226761: Change all default settings and config to fast/safe production values is an open issue for making most default Drupal settings be what is best for production, and then providing a mechanism for overriding for developer convenience. So this point could be addressed by using file storage when in DEV mode.
  • Git workflows changed. Use case described in https://drupal.org/node/1831818 would no longer be possible (but may be outdated in any case). There will be no more "native" diffing against staging stored in git; there's at least an extra step of "export your config, wipe out your separately tracked config repo that is in staging or somewhere else, and replace it with the export". Drush config-export allows to specify which config directory to export to, see this Git CMI workflow: video and example.

Remaining tasks

review/decide/commit

User interface changes

None.

API changes

No actual API changes, but some changes to developer workflow.

Follow-ups:
#2232791: Make the default core config service a db-backed storage with a cache layer.

CommentFileSizeAuthor
#116 install-env.png36.89 KBsun
#108 2161591-108.patch36.25 KBpwolanin
#108 increment.txt6.29 KBpwolanin
#100 2161591-100.patch34.91 KBpwolanin
#100 increment.txt5.26 KBpwolanin
#96 2161591-96.patch38.85 KBpwolanin
#96 increment.txt2.63 KBpwolanin
#94 2161591-94.patch36.22 KBpwolanin
#94 increment.txt3.89 KBpwolanin
#88 2161591-88.patch33.04 KBpwolanin
#88 increment.txt13.11 KBpwolanin
#81 interdiff.txt6.58 KBsun
#81 config.storage.active.80.patch33.11 KBsun
#78 2161591-78.patch32.98 KBpwolanin
#78 increment.txt573 bytespwolanin
#74 2161591-74.patch32.97 KBpwolanin
#74 increment.txt1.68 KBpwolanin
#73 2161591-73.patch33.41 KBpwolanin
#73 increment.txt10.66 KBpwolanin
#68 2161591-68.patch30.15 KBpwolanin
#68 increment.txt3.16 KBpwolanin
#66 2161591-66.patch30.25 KBpwolanin
#66 increment.txt3.73 KBpwolanin
#65 2161591-64.patch28.91 KBpwolanin
#65 increment.txt1.28 KBpwolanin
#63 2161591-63.patch28.95 KBpwolanin
#63 increment.txt7.57 KBpwolanin
#61 2161591-61.patch25.8 KBpwolanin
#61 increment.txt4.99 KBpwolanin
#59 2161591-59.patch22.42 KBpwolanin
#59 increment.txt10.7 KBpwolanin
#56 2161591-56.patch20.74 KBpwolanin
#56 increment.txt10.93 KBpwolanin
#51 2161591-51.patch20.3 KBpwolanin
#51 increment.txt2.26 KBpwolanin
#49 2161591-49.patch20.06 KBpwolanin
#49 increment.txt3.51 KBpwolanin
#47 2161591-47.patch16.55 KBpwolanin
#47 increment.txt7.32 KBpwolanin
#45 2161591-45.patch9.75 KBAnonymous (not verified)
#40 2161591-40.patch9.12 KBAnonymous (not verified)
#39 2161591-39.patch4.98 KBpwolanin
#39 increment.txt2.56 KBpwolanin
#36 2161591-36.patch2.42 KBAnonymous (not verified)
Support from Acquia helps fund testing for Drupal Acquia logo

Comments

sun’s picture

Interesting proposal — with wide-ranging consequences, so I'm not able to vote in any way yet.

Two questions though (as mentioned in IRC):

  1. There are claims about performance problems — Do we know what those actually look like? Can we back that up with numbers?

    Please note: I can certainly imagine that property-querying many config files will be costly. But it would be good to know what exact amount of performance regression we're talking about.

  2. Isn't it likely that config entities will be "full-blob"-cached at some layer anyway?

    (Entity cache or Config cache is an irrelevant detail, as long as there's a full-blob caching layer in front of the raw data)

    If so, shouldn't we rather think about materialized views for config entities?

    Materialized views would involve pretty much the same architectural challenges: (1) Auto-generating database tables, (2) using a query builder for them, and (3) maintaining their relational SQL database schema.

    The only addition would be (4) data pollution and (5) cache invalidation. However, those two are managed by D8's vastly improved caching (tags) system already, and yeah, the current config (entity) system is fully cached, so we're able to invalidate caches (derivative presentations) at the right time already.

    As an extra benefit: If we're able to provide materialized views for config entities, then we're also able to provide them for content entities. :)

chx’s picture

Materialized Views have a lot of problems the proposal doesn't. The proposal will use a single table (keyvalue) which will practically never change: only in minor versions with upgrade functions but it's hard to imagine what the heck would we want to change on a K-V storage. So there are no tables to autogenerate. Also, MV is data duplication with all the invalidate / rebuild headaches it involves, even with tags that's something to code. The beauty of the proposal is the simplicity it takes to implement it, here's how a StorageInterface implementation can talk to keyvalue:

  1. encode and decode are internal. We will use SimpleXML, I would imagine.
  2. exists can just call get -- existing config objects are going to be arrays so we can easily discern between the empty config and the nonexistent config.
  3. read / readMultiple / write / delete is equivalent to get / getMultiple / set / delete
  4. rename -- just add it to the K-V interface, if you have some freak show of a storage that can't rename then retrieve, set and delete. (I will discuss the race condition here in the next bullet point.)
  5. listAll -- you know what, we could add it to the K-V interface but I will ask msonnabaum. I am simply incapable of naming a persistent storage product that doesn't have prefix key search or can't do some trick to do it (the Redis author wrote a blogpost on autocomplete and others have made this solution even more efficient, all the dba family can iterate the keyspace -- and we very very likely won't have more than a few thousand keys). Yes, memcached can't do this but memcached is not a persistent store -- couchbase is, but couchbase can find the primary id/key by prefix ( and range and suffix ).
  6. deleteAll -- if we add listAll, add this as well. Yes, some DBs will need to call listAll and then do a deleteMultiple and that's race prone. Such is life. Those backends can get a lock. (Note that the config FileStorage does a listAll, a delete one by one and doesn't lock. That's a bug.)
chx’s picture

chx’s picture

alexpott’s picture

In the issue summary the problem is stated as

To determine which blocks to show on a page we query files on every page load. It's horrible, outdated and slow.

Afaics the relevant code is Drupal\Core\Config\Entity\Query::execute(), specifically:

public function execute() {
    // Load all config files.
    $entity_info = $this->entityManager->getDefinition($this->getEntityType());
    $prefix = $entity_info['config_prefix'] . '.';
    $prefix_length = strlen($prefix);
    $names = $this->configStorage->listAll($prefix);
    $configs = array();
    foreach ($names as $name) {
      $configs[substr($name, $prefix_length)] = \Drupal::config($name)->get();
    }
   //...

Running this through the debugger once the config cache is warm shows 0 hits to the disk. The call to Storage::listAll($prefix) results in a cache hit and so do the calls to Drupal::config($name)->get().

Admittedly reading this code shows that we could have a brilliant optimisation just swapping out the loop here for a ConfigFactory::loadMultiple()!

alexpott’s picture

yched’s picture

Other optim for trivial cases (won't help for the blocks though) : #2163919: Optimise Config EntityQueries in case there are conditions on the 'id'

yched’s picture

Also, from the OP:

To determine which blocks to show on a page we query files on every page load. It's horrible, outdated and slow

I'd say this is primarily a bad design choice from block entities to not have the theme be part of the file name (block ids are currently just the block machine name, not |theme].[machine_name]).

It's been established like 2 years ago that storing config in files means they are not efficiently queryable other than by filename prefixes, and that the choice of file name structure thus has to be driven by the "critical path" query pattern (here, get all blocks for a theme) - that, or a cache layer, which is what field does with FieldInfo::$fieldMap

Not sure why we'd suddenly find this unsustainable and re-put "store in DB" on the table after 2+ years ?

Berdir’s picture

Removing the theme prefix was explicitly changed, that was initially there. Not idea why.

alexpott’s picture

yched’s picture

Left a note in there...

tim.plunkett’s picture

Every config entity is $config_prefix.$entity_id.yml.
Blocks used to be $config_prefix.$theme_name.$entity_id.yml, and that was strange and confusing (on the API level) and there were many hacks spread throughout the block system to handle that. It was also confusing to the user, because it seemed like you should have been able to have the same block entity ID for each theme, but they had to be unique...

There were no performance concerns there one way or another.

chx’s picture

CMI took many twists and I am not sure whether the active store was editable at any point but the thing is, it's not now. If it's not human editable, why does it matter what format it is in? You can export it into YAML, version control that, edit that, etc.

yched’s picture

Blocks used to be $config_prefix.$theme_name.$entity_id.yml

Not really, Blocks are $config_prefix.$entity_id like everyone else.
It's just that $entity_id used to be $theme_name.$machine_name, and is now only $machine_name.
What made this confusing is that $machine_name is in fact named 'id' - so yeah,
$entity_id = $entity->id() = $entity->theme.$entity->id was confusing, but that's only a matter of property naming.

There are a lot of other config entities that have a compound entity id, exactly for the same kind of use case blocks have - loading the right ones for the typical business cases without parsing all of them.

moshe weitzman’s picture

I support this.

My current plan with Acquia cloud is to use Config_DB for storing Active Config in the DB. The only other option is on the shared filesystem and thats pretty scary from a performance and reliability point of view. I know that reads will usually be serviced by the cache backend but cold starts happen and then we have a serious issue.

The other motivation, as @chx mentions, is that people can't mistakenly edit an active directory because we won't have one!

So, I agree that Config_DB makes sense for core, in the spirit of Fast by Default.

chx’s picture

You'll need to find someone to work on this cos I won't as you know.

Anonymous’s picture

i'm 100% happy to work on this if we can get consensus on the following:

1. yaml for module default config and import/export become part of the contract supported by Drupal core's CMI system. we stop the nonsense that everyone wants to rely on yaml, despite the fact that in our design yaml is an implementation detail of a storage driver.

2. we retain the pluggable storage backend support for active configuration. core ships with db storage by default, with no caching.

3. core also ships with a db-backed storage with a cache layer. this will allow sites with memcached/redis support to take advantage of their existing non-db cache layer for config without writing a new storage driver. they can simply configure Drupal to use the cache-layer storage with memcached/redis as the cache backend.

i'm ok to explain the above in more detail, but i don't really want to lead the work to get consensus - i've tried and failed for some time now already.

webchick’s picture

Assigned: Unassigned » alexpott
Status: Active » Needs review

Assigning to alexpott to weigh in on #17.

alexpott’s picture

Status: Needs review » Needs work
Issue tags: +Needs issue summary update

#17.1 why is this part of this issue?
#17.3 means that we have to drop the "queryable" aspect of talked about in the original issue summary.

We need to decide what format we're going to store configuration in the db.

The fact that we've got the config export ability in the UI and Drush leads me the agree that the move of the active store to the database is a good thing. The lack of export and Drush tools was one of the reasons we went with the active store in files. Another reason why active storage in files has seemed to be a good thing was the ability to place it into version control. This idea has been discarded since this would lead to deploying directly into the active directory and not through a configuration import. The current ideas around version control and configuration are to keep the staging directory in a VCS or just another directory of exported configuration in your VCS repository. There are other reasons this is a good idea too:

  • discourage people from hacking their active config
  • one less thing to worry about in deployment
  • less YAML parsing will be a good thing
  • more secure - there have been debates about keeping you active configuration in files
pwolanin’s picture

#17.1 I agree with, but indeed should probably be a separate issue.

re: #17.3 I assume we could cache the result of each particular query?

moshe weitzman’s picture

Issue summary: View changes

Looks like have consensus on how to proceed. I updated the Issue Summary.

As to serialization format in the DB, it seems that there aren't significant advantages between var_export() and json_encode() and serialize(). See http://stackoverflow.com/questions/804045/preferred-method-to-store-php-... for some good discussion.

moshe weitzman’s picture

Title: Replace the config active storage with a queryable DB storage » Change active config from file storage to DB storage
sun’s picture

On architectural grounds, "I agree", because the entire point of rewriting CMI 2-3 times was to decouple the storage from the application-level configuration concept.

The underlying storage is nothing else than a key/value store.

The custom concept of a "ConfigStorageInterface" should not exist to begin with. Even the current FileStorage is a key/value store, because key=filename, value=string, collection=subpath. There should only be one concept of a key/value store in Drupal and it should be used wherever applicable. Let's ensure that the configuration system does not re-invent that wheel.

Because of that, I wholeheartedly agree with issues like #2161643: Add a KeyValueStore\IterableKeysInterface (allowing to retrieve keys with a given prefix), since that brings us a big leap closer to replacing that custom k/v store concept.

Likewise, the encoder/decoder for the serialization format should simply be injected, the storage doesn't care.

I wanted to suggest Drupal\Core\Serialization first, but then I wondered how the REST/Serialization module is handling serialization formats. I learned that we apparently have the Symfony Serializer component in core and REST/Serialization is a simple wrapper around that (for REST-specific purposes, not sure I get why those are separate modules...)

→ Use whichever serialization encoder you want to use. The key/value store does not and should not care.

In case Serializer is too slow/big/complex of a dependency, then let's simply do Drupal\Core\Serialization and provide standard implementations for serialize/YAML/JSON/YML that can be injected.

With that out of the way, the following detail in the outlined plan/reasoning concerns me:

discourage people from hacking their active config

Quoting myself from #2161995-3: Faulty module cannot be removed from module list; DrupalKernel rebuilds container on every page request

The idea of not supporting/allowing to edit the active configuration presents a major conflict with D8's changed paradigm of no longer silently ignoring missing or malformed modules.

As a trivial example:

  1. Install a new module.
  2. The module happens to be installed and enabled, but causes every request to blow up.
  3. The referred-to functionality (of exporting/importing/changing configuration) is no longer available/functional in the first place.

The only possible remedy is to manually remove the module from system.module.yml and run the rebuild script.

The concrete given example is to be disregarded. It is just one of many possible scenarios that one will be facing in real-world D8 development. That is underlined by the fact that the actual scenario I'm currently facing is a different one, too.

The entire architecture of Drupal 8 relies on the fact that the application is able to operate based on valid configuration data. If that is not the case for whichever reason, then the system very possibly freezes infinitely.

In that case, the concept of rebuild.php does not help either, because "rebuild" naturally implies "rebuild data structures based on the current active configuration". If configuration happens to be malformed for whichever reason, then a rebuild will fail.

Right now, (1) you can easily go to your active configuration to (2) fix whichever malformed configuration happens to blow up the system and (3) successfully rebuild.

What I'm pointing out is primarily a site building/development situation that can easily happen on a site that is "under construction." As a testament to that, the specific issue of #2161995 occurred after switching git branches. — However, I am NOT saying that a Drupal production site should have to deal with the possibility of fragile/broken configuration. So to perhaps clarify my overall stance here:

  1. +1 for killing the custom concept of ConfigStorage. → Replace with KeyValueStore + inject Serialization.

  2. Once that's done, it is piece of cake to use different KeyValueStore implementation and/or format.

    → I couldn't care less about the default in core.

    (My personal goal is to use raw JSON files via PHP-native JSON encoding/decoding without caching layer.) In other words, +1 to what @beejeebus stated with regard to keeping everything swappable/pluggable (whereas that does not mean that core supports any kind of syncing concept to aid with swapping implementations).

  3. For production sites, core should assume that the active configuration is "stable" and can be performance-optimized.

  4. I'm concerned about the DX of an active store that lives in the database or any other inaccessible location for the case of site development (and equally core development for that matter, but it's really one and the same thing).

    → If we're going to move the active store into a place that is not easily debuggable/fixable, then I wonder what kind of additional/alternative solutions we can offer to developers and site builders?

    (Any such solution needs to be able to operate without having access to anything "Drupal", not even DrupalKernel).

Thoughts?

moshe weitzman’s picture

Thanks for the feedback. It sounds like you support this patch, but you have some other (valid) concerns about "discourage people from hacking config". I think we should discuss those in a CMI conf call or in another issue.

I'm hoping that others will chime in about the proposal to use KeyValueStore. Sounds reasonable to me.

yched’s picture

Note that switching to key/value storage still keeps the "active store" non-queryable other than by config name. Meaning the current naming scheme for blocks is still sub-optimal (#8 / #14).

sun’s picture

xjm’s picture

Issue tags: +revisit before beta

@alexpott, @mtift, @Gábor Hojtsy, and I discussed this issue yesterday. While I'm not entirely opposed to this change since CMI has evolved a lot since we originally moved active config to the filesystem and all the things discussed here are legit, I'm also concerned that we have numerous critical beta blockers in CMI that must be resolved before we can ship, whereas we could totally ship with active config stored in files.

I'm not comfortable calling this a beta blocker since it's not a release blocker, but tagging "revisit before release" to make sure we look at it before we roll a beta.

I strongly recommend solving the top issues in #2187339: [META-0] CMI path to beta before we consider devoting resources to this.

pwolanin’s picture

Priority: Normal » Major

@xjm - based on our experiences trying to use config on an actual server, I would consider this a release blocker and very high priority.

I agree with beejeebus that we need to get this out of files as soon as possible, and then we can worry about tinkering with the internal implementation.

alexpott’s picture

@pwolanin care to outline what those "actual" experiences were?

catch’s picture

Issue tags: -revisit before beta +beta target, +revisit before release candidate

Tag futzing.

pwolanin’s picture

@alexpott - trying to finish up a related blog post, but I wasn't going to detail the hiccups there.

One major problem is that since we (by default) store the active config with the sites files, copying or failing to copy or update the files at he same time as the database can kill the site. In contrast, have the default active store in the DB would make it a lot easier to make a dev site that mirrors the live site since you know the config and data are in sync. In addition, people now can use tricks like redirecting file requests to live so they don't have to keep the files in sync.

We were also playing with this on Acquia hosting, and the initial effort to support Drupal 8 sets the config directory name differently for each environment for the site. This could be correct in as much as you might not want to overwrite your config when just copying uploaded files between environments, but then meant it was a lot harder to install in one environment and get a copy running in another.

pwolanin’s picture

quicksketch’s picture

These files must be stored on the shared filesystem when using multiple web heads. This is slow and often unreliable.

The config being stored as files certainly brings a new challenge to large-scale deployments of Drupal. Just as old problems (like sharing the files directory via NFS) have been considered solved for Drupal, solutions to the config directory across web servers are most likely already out there already. Something like AFS might work for config directories: local reads and writes would prevent slowness that you might get with the more typical NFS setup used for the file system. Pantheon's existing Valhalla filesystem probably already works with config files across servers (but I haven't validated that).

I don't think the summary statement is incorrect; shared filesystems *are* often slow and unreliable. But that doesn't mean there aren't solutions out there that would fit the requirements of the config directory. However, any technical solution at the OS level won't solve the issues of multiple-get or end-user visibility of the files, so those may be the better justifications here than problems with a distributed file system.

Please take my input with a fistful of salt, I'm just looking to provide additional data and understand the problems with each storage option.

Anonymous’s picture

Status: Needs work » Postponed
pwolanin’s picture

ok, then let's try to get that one in fast.

Anonymous’s picture

Assigned: alexpott »
Status: Postponed » Needs review
FileSize
2.42 KB

here's a first cut patch that makes drupal install with db storage.

pwolanin’s picture

From discussion at DevDays, the need to support file to DB storage would make releasing this in any form pre-beta much easier than post-beta.

While we could swap the implementation to unify with K/V, it's really not needed for the goal of getting active into the DB rather than in files.

The patch above also makes Drupal install much faster.

Status: Needs review » Needs work

The last submitted patch, 36: 2161591-36.patch, failed testing.

pwolanin’s picture

Status: Needs work » Needs review
FileSize
2.56 KB
4.98 KB

Trying to make the Frankentest code work

Anonymous’s picture

FileSize
9.12 KB

this follow up fixes the import/export config tests.

Status: Needs review » Needs work

The last submitted patch, 40: 2161591-40.patch, failed testing.

The last submitted patch, 39: 2161591-39.patch, failed testing.

Anonymous’s picture

Assigned: » Unassigned
Anonymous’s picture

Issue tags: +Configuration system
Anonymous’s picture

Status: Needs work » Needs review
FileSize
9.75 KB

fix use of db_like() in DatabaseBackend. lulz.

Status: Needs review » Needs work

The last submitted patch, 45: 2161591-45.patch, failed testing.

pwolanin’s picture

Status: Needs work » Needs review
FileSize
7.32 KB
16.55 KB

fixes update.php and trying to scrub out use of CONFIG_ACTIVE_DIRECTORY

Status: Needs review » Needs work

The last submitted patch, 47: 2161591-47.patch, failed testing.

pwolanin’s picture

Status: Needs work » Needs review
FileSize
3.51 KB
20.06 KB

Fix some more test fails, and add apparently assumed validation to Config class.

Status: Needs review » Needs work

The last submitted patch, 49: 2161591-49.patch, failed testing.

pwolanin’s picture

Status: Needs work » Needs review
FileSize
2.26 KB
20.3 KB

needs some doxygen cleanup, but this is more correct.

Status: Needs review » Needs work

The last submitted patch, 51: 2161591-51.patch, failed testing.

sun’s picture

Can we revert all the changes to config_get_config_directory(), constants, and also the installer, please?

There's really no need to change that here.

I'd like to retain the file based active configuration for tests. — Being able to easily study the actual configuration that got written in a test has become an enormous lifesaver for me in tons of issues that were exceptionally hard to debug. Removing that would definitely slow down progress on many major core issues.


In essence, all you want to change here are the service definitions to change the default active config storage from FileStorage to DatabaseStorage.

Lastly, this change should probably go hand in hand with #2226761: Change all default settings and config to fast/safe production values — use the database in production, use files for local development.

sun’s picture

In addition to #53, I understood the agreement of potentially doing this first in the way that #2190723: Add a KeyValueStore\FileStorage to replace e.g. ConfigStorage will still happen in parallel and/or afterwards.

That is one more reason for retaining the current installer and config code in HEAD unmodified and just simply replace the config storage implementation here.

This means to (1) add [back] the {config} table in system_schema() and (2) change the service definition to use DatabaseStorage instead of FileStorage. Further changes shouldn't be necessary, since DatabaseStorage actually was the active config storage some moons ago.

To keep the changes here compatible with #2190723, we should either leave the config.cachedstorage.storage service name alone or rename it to config.storage.active. — Renaming/re-purposing it into config.storage does not look right to me, because config.storage.staging + others still exist, and the service name should make clear which config storage is being returned.

pwolanin’s picture

@sun - I'm a little perplexed by the suggestion to use file active for tests. The attempt to over to DB has been turning up various places where we were cheating or assuming that behavior specific to File storage was generic to all config back-ends. If we wanted to be truly honest, we'd figure out how to install and test with 3 implementations, but that should be a follow-up.

In terms of the constants, etc, you mean we should create the default active directory even if we are not going to use it?

pwolanin’s picture

FileSize
10.93 KB
20.74 KB

Some test fixes, esp for the Frankentest, and puts back some of the active directory setup to minimize controversy/patch size.

pwolanin’s picture

Status: Needs work » Needs review

Status: Needs review » Needs work

The last submitted patch, 56: 2161591-56.patch, failed testing.

pwolanin’s picture

Status: Needs work » Needs review
FileSize
10.7 KB
22.42 KB

Discussed with catch - the better pattern is to not have the table as part of system_schema(), but instead create it when needed as the DB cache back-end code does as of this issue: #1167144: Make cache backends responsible for their own storage.

Taking the pattern from that patch and applying it to the DB config storage.

This also allows us to avoid changing a some of the test code

Status: Needs review » Needs work

The last submitted patch, 59: 2161591-59.patch, failed testing.

pwolanin’s picture

Status: Needs work » Needs review
FileSize
4.99 KB
25.8 KB

The ConfigStorageTestBase wants some things to fail that now succeed, and DatabaseStorageTest was trying to create the table in advance.

Here's a bit of a hack around, or maybe we should change the test?

Status: Needs review » Needs work

The last submitted patch, 61: 2161591-61.patch, failed testing.

pwolanin’s picture

Status: Needs work » Needs review
FileSize
7.57 KB
28.95 KB

Ugh, the interface and concrete classes and tests don't really agree on when/if exceptions are thrown.

This tries to make it a little more consistent and document some exceptions on the interface, and hopefully makes all the test pass.

Status: Needs review » Needs work

The last submitted patch, 63: 2161591-63.patch, failed testing.

pwolanin’s picture

FileSize
1.28 KB
28.91 KB

A few minor doc tweaks.

pwolanin’s picture

Status: Needs work » Needs review
FileSize
3.73 KB
30.25 KB

oops - throwing an exception for listAll() is blows up install. ok, let's fix the test, rather than coding to the test.

Anonymous’s picture

Status: Needs review » Needs work

pwolanin++ some nitpicks below.

+    $config =  Database::getConnection()->schema()->tableExists('config');

one too many spaces after '='.

+    // There are situations, like in the installer, where we may attempt a
+    // read without actually having the database available. In this case,
+    // catch the exception and just return an empty array so the caller can
+    // handle it if need be.
+    try {
+      return (bool) $this->connection->queryRange('SELECT 1 FROM {' . $this->connection->escapeTable($this->table) . '} WHERE name = :name', 0, 1, array(
+        ':name' => $name,
+      ), $this->options)->fetchField();
+    }
+    catch (\Exception $e) {
+      return FALSE;
+    }

this talks about an emtpy array, but seems to return false.

+   * The YAML component.
+   *
+   * @var \Symfony\Component\Yaml\Yaml
+   */
+  protected $yaml;
+  /**

missing blank line after proctected $yaml.

+    //$storage = new FileStorage($this->configDirectories[CONFIG_ACTIVE_DIRECTORY]);
+    $storage = new DatabaseStorage($this->container->get('database'), 'config');

i guess you meant to delete that FileStorage line?

pwolanin’s picture

Status: Needs work » Needs review
FileSize
3.16 KB
30.15 KB

Thanks, here's with those cleaned up.

sun’s picture

Good progress. Thanks for leaving the config directory setup of the installer in place!

I still need to apply this patch and actually review more in-depth, and perform some manual installer tests, but here's a code-only review of the patch:

  1. +++ b/core/core.services.yml
    @@ -67,16 +67,12 @@ services:
       config.storage:
    

    → config.storage.active

    Let's also move it down to the other config.storage.* definitions.

    In case 'config.storage' is required as-is, the service can be defined as an alias pointing to config.storage.active.

  2. +++ b/core/lib/Drupal/Core/Config/Config.php
    @@ -215,6 +215,11 @@ public function save() {
    +        $this->validateValue($key, $value);
    

    Hm. This looks misplaced here, since this validation is a detail of the serialization format being used by the storage.

  3. +++ b/core/lib/Drupal/Core/Config/DatabaseStorage.php
    @@ -56,24 +58,25 @@ public function __construct(Connection $connection, $table, array $options = arr
       public function exists($name) {
    ...
    +    // There are situations, like in the installer, where we may attempt a
    +    // read without actually having the database available. In this case,
    +    // catch the exception and just return an empty array so the caller can
    +    // handle it if need be.
    

    Hm. All of this special-casing should not be necessary... — at least with regard to the installer.

    Happy to help to debug this — however, the comment states "situations, like in the installer"... I assume the installer is the only cause?

  4. +++ b/core/lib/Drupal/Core/Config/DatabaseStorage.php
    @@ -114,6 +120,30 @@ public function readMultiple(array $names) {
    +      // If there was an exception, try to create the table.
    +      if (empty($this->options['throw_exception']) && $this->ensureTableExists()) {
    

    I agree that the storage/backend should have methods that allow the backend to create/manage its storage.

    However, for config (and key/value stores), I'm not sure whether automatically creating the storage on the fly is legit.

    The creation/management of actual storage bins should be left to the application setup/installation code that wraps or runs before the actual usage/consuming code.

    In particular in the installer, this automation can cause the storage to be automatically created before it is supposed to be created — possibly using preemptive storage parameters that have not been set up yet and thus were not intended. We have a good amount of bugs caused by similar automations in the installer right now, we shouldn't add a new one.

    In short, the installer (or "a" installation process) should be responsible for creating the storage bin. It should not be created automatically on demand.

    For completeness, I also think that createStorage() (ensureTableExists()) method should be accompanied by a deleteStorage() method.

  5. +++ b/core/lib/Drupal/Core/Config/DatabaseStorage.php
    @@ -122,6 +152,56 @@ public function write($name, array $data) {
    +   * @throws PDOException
    +   */
    +  protected function ensureTableExists()  {
    

    Under which circumstances is a PDOException thrown?

  6. +++ b/core/modules/config/lib/Drupal/config/Controller/ConfigController.php
    @@ -81,13 +82,15 @@ public function __construct(StorageInterface $target_storage, StorageInterface $
    +    $dumper = new Dumper();
    +    $dumper->setIndentation(2);
    ...
    +      $archiver->addString("$name.yml", $dumper->dump(\Drupal::config($name)->get(), PHP_INT_MAX, 0, TRUE));
    
    +++ b/core/modules/config/lib/Drupal/config/Form/ConfigSingleExportForm.php
    @@ -46,10 +54,14 @@ class ConfigSingleExportForm extends FormBase {
    +    $this->dumper = $dumper;
    +    $this->dumper->setIndentation(2);
    
    @@ -58,7 +70,8 @@ public function __construct(EntityManagerInterface $entity_manager, StorageInter
    +      new Dumper()
    
    @@ -151,8 +164,7 @@ public function updateExport($form, &$form_state) {
    +    $form['export']['#value'] = $this->dumper->dump($this->configStorage->read($name), PHP_INT_MAX, 0, TRUE);
    

    Part of the #2190723: Add a KeyValueStore\FileStorage to replace e.g. ConfigStorage effort that is so grossly ignored here:

    #2208633: Add a Yaml class to Drupal\Component\Serialization

    ...fully eliminates these manual overrides.

  7. +++ b/core/modules/simpletest/lib/Drupal/simpletest/DrupalUnitTestBase.php
    @@ -225,8 +225,9 @@ public function containerBuild(ContainerBuilder $container) {
         $container
    -      ->register('config.storage', 'Drupal\Core\Config\FileStorage')
    -      ->addArgument($this->configDirectories[CONFIG_ACTIVE_DIRECTORY]);
    +      ->register('config.storage', 'Drupal\Core\Config\DatabaseStorage')
    +      ->addArgument(Database::getConnection())
    +      ->addArgument('config');
    

    Do we need to change DUTB tests?

pwolanin’s picture

@sun - the auto-creation pattern was suggested by catch, and makes the whole thing easier to manage. However, I don't feel strongly abotu this.

The docs about what exceptions are thrown when needs some cleanup, but it would throw some kind of PDOException when the DB error is not that the table is already present.

The existing tests expected validation of values, hence $this->validateValue(). If we are testing that certain things like a file resource fail, then it's clearly not specific to serialization. I think this is reasonable since it's just applying essentially the same basic check that's applied in castValue() when we have a schema.

The DUTB change is certainly needed - we want to test the default active storage.

xjm’s picture

Issue summary: View changes

This already has the Needs issue summary update tag, but let's try to make it as complete as possible with the pros, the cons, and what we'd do to alleviate the cons. This decision--whichever way we go--has significant implications for all users of Drupal, sites small or large, site builders or developers.

I listed out the disadvantages I saw on during a CMI meeting today. I feel that we need to file additional issues to address these points.

  • "discoverability" of the active configuration by browsing is lost. catch suggested adding a UI in Drupal for this.
  • The quick "copy from active" staging and export is gone. Import/Export UI already partially addresses this, but we would need to document expected workflows (with drush, with UI export, etc.) in the handbook so that we can have people test them once the change is made.
  • Git workflows change or are hamstringed. Usecase described in https://drupal.org/node/1831818 would no longer be possible. There will be no more "native" diffing against staging stored in git; there's at least an extra step of "export your config, wipe out your separately tracked config repo that is in staging or somewhere else, and replace it with the export". (via drush or UI)

Also, based on a discussion with catch and berdir at Szeged, I feel we do actually need to make this beta-blocking if we are committed to it, as the upgrade path for it would be ghastly.

pwolanin’s picture

Issue summary: View changes
Issue tags: -beta target, -revisit before release candidate +beta blocker

There is already a UI of sorts for browsing your config - the single export lets you see all the existing types, and look at any individual one. That feels like it covers the need to browse, but is there additional functionality or UI changes that are needed there?

While there is no native file diff to git, this also seems like something drush can/should handle? I haven't tried recently, but Moshe discussed adding this as part of the config-import command.

The git workflow discussion you linked to seems like it may be outdated. Did you look at https://www.acquia.com/blog/drupal-8-configuration-workflows-using-git ?

Updating the issue summary a bit. I don't think the adding back a caching wrapper is for this issue and for now we are sticking with native PHP serialize()

Per your suggestion, marking this as a beta blocker.

pwolanin’s picture

FileSize
10.66 KB
33.41 KB

This patch adds a createStorage() method to the interface and classes (delete seems totally unneeded imho).

Also, sets up a service alias so site builders could swap in a caching layer if desired.

pwolanin’s picture

FileSize
1.68 KB
32.97 KB

A couple small doc fixes.

tim.plunkett’s picture

I also think the issue summary needs an update. We need to be able to weigh the pros and cons properly.

Just one note about the code:

+++ b/core/core.services.yml
@@ -85,6 +78,10 @@ services:
+  config.storage: "@config.storage.uncached"

Hm, didn't know this worked.

We used to have

plugin.manager.entity:
  alias: entity.manager

Maybe we should use that pattern instead?

scor’s picture

Issue summary: View changes
Issue tags: -Needs issue summary update
catch’s picture

Priority: Major » Critical

If it's a beta blocker it's also critical.

I personally find the active/staging reversal workflow for git that relies on files very confusing. Export/commit seems a lot more predictable with less chance of messing around in the wrong directory or getting your local.settings.php variables switched one day.

Additionally, export/commit/import is a workflow that will work regardless of what storage is used for active configuration. Whether it's MySQL, Redis, YAML you'll always be able to export/commit/import.

If we ship with file storage in core and encourage the staging/active directory swap, but every large site + hosted solution ends up using database storage, then some people are going to get a nasty surprise when they take on a site that's using the opposite method.

pwolanin’s picture

Issue summary: View changes
FileSize
573 bytes
32.98 KB

Changed the alias format in the yml file and made some more issue summary updates

sun’s picture

Thanks! :-)

  1. +  config.storage.uncached:
    

    Hm. Can we rename this to .active, please?

    Since there are multiple different config storages, the purpose of each service ID is to tell which config storage is being returned.

    All of the individual config.storage.* services are not cached (they cannot), they provide direct access to the storage.

  2.    public function exists($name) {
    ...
    +    try {
    ...
    +    catch (\Exception $e) {
    

    Still not happy with these... the system should know when it's able and ready to use a storage.

    I could help to sort out the installer environment to make these changes obsolete?

    (I'm asking upfront, because I don't really feel "welcome" on this issue here, and I don't want to cause any drama...)

  3. +  protected function ensureTableExists()  {
    +    try {
    +      if (!$this->connection->schema()->tableExists($this->table)) {
    +        $this->connection->schema()->createTable($this->table, static::schemaDefinition());
    

    Is there a situation in which createTable() is called after the storage has been set up already?

    I can't really imagine one?

    In other words, can't we simply make createTable() create the table and be done? (sans tableExists() check)

  4. Can we make the schema definition method protected? If it's public, then it would have to go onto the interface, but it only makes sense for the database storage. Doesn't appear to be called from anywhere else.


The DUTB change is certainly needed - we want to test the default active storage.

We have dedicated tests for each config storage implementation, which assert that each implementation meets all requirements. Therefore, tests can use any of the implementations.

Debugging problems and failures related to configuration is a lot easier when you have direct/verbose access to the configuration files. You're instantly able to see that e.g. a config file is entirely missing, or that a bug causes a config file to suddenly miss a value, or that some values might be using a wrong data type, etc.pp.

(Sadly) there's still a lot that can go wrong with configuration, and often times it's hard to pinpoint the root cause that led to a problem. That is, because the nature of configuration is that it just configures parameters, which instruct the application to do what's being configured. Thus, if the configuration is wrong, the application may still work, but just simply doesn't behave as expected.

In a properly decoupled system (that is D8), the chance for unexpected configuration to get written is large(r), because many different components, controllers, and plugins are interacting with each other autonomously. Unlike static service definitions, configuration is dynamic by design.

That's my primary (development) concern of moving the active config storage away from files into a storage that is harder to access and introspect.

The concern only applies to development. I do not have any objections at all for using the database as active config storage on production sites.

Therefore, I see a clear difference with regard to which storage adequately fulfills the needs of different environments/use-cases. From an architectural standpoint, using a particular storage backend for a particular environment is not different to using e.g. Memcache as cache backend in production, but a Memory backend in development.

— And whether you like it or not ;-), that is what ultimately brought me to the conclusion that it is wrong to hard-code any kind of storage implementation. You should be able to use the implementation that meets the requirements of your environment best.

As with any other persistent, non-volatile storage, you are locked into an implementation as soon as you hit the installer (i.e., there's no way to easily switch to a different storage later on). There's no reason for why that cannot or would not work. → In fact, the effort to make it happen is 90% done already, various child issues are RTBC + just waiting for commits.

In the end, we're shooting for pretty much the identical goal, and this patch does not actually introduce a problem that would present a hard blocker for completing the effort. I especially want to thank you for not removing the config file directory setup as part of this patch. The other issues do not make an implementation create its own storage yet, so that is definitely a step in the right direction, too (the new code here will just simply be moved into the k/v DatabaseStorage).

And just to further underline that: Even if this issue here would not happen, but the others would, then production sites using multiple webheads (e.g., sites hosted on Acquia's infrastructure?) will be able to use the database as active storage (or whichever implementation suits best).

Long story short, I see the point of avoiding slow YAML parsing in production, I can also see the problem with infrastructures involving multiple webheads, and thus the proposal makes some level of sense for production environments. However, at the same time, I'm very concerned about development environments (not limited to debugging and testing), and that's why I believe we need a solution that is able to address environment-specific concerns separately. But due to the reasons outlined above, there's really no need to make a drama out of this; we're able to deliver a sensible solution and most of the work is done already.

Therefore, I can live with this patch going in before the other patches, if we're committed to do them, too. In other words, it would merely be great if we could make this change "compatible" (mostly done already, only remaining issues in the review above, AFAICS), and I also hope that we can agree that one hard-coded default assumption won't be able to meet the (conflicting) requirements of all possible environments. :-)

pwolanin’s picture

@sun: brief comments in response.

we can change to .active or whatever, I don't have a strong opinion as long as there's some agreement that it's a clear naming.

Regardless of whether it's during install or otherwise, we should catch exceptions unless they are documented on the interface. So maybe we should re-write the comments with that in mind.

making the schema method protected is fine.

Possibly there is a better place to call the createStorage() method instead of in BootstrapConfigStorageFactory, or we can adjust that in a follow-up. I thought it made since there since you might implement the function named in setting drupal_bootstrap_config_storage and never want the table created at all.

sun’s picture

Briefly discussed with @pwolanin in IRC. A few adjustments:

  1. Renamed config.storage.uncached into config.storage.active.
  2. Create active config storage in installer, instead of every kernel (re)build.
  3. Changed visibility of schemaDefinition() to protected.

This patch does not revert the active storage back to FileStorage for tests. — That's still my only remaining issue with this patch. (The change to DB also does not improve test performance; I would notice, because I have a terribly slow filesystem a.k.a. Windows.)

effulgentsia’s picture

I haven't read the patch, but I'm +1 to the issue. I haven't read every comment, so I'm curious: is there anyone following this who's against it?

Status: Needs review » Needs work

The last submitted patch, 81: config.storage.active.80.patch, failed testing.

tim.plunkett’s picture

I need more time to gather my thoughts and explain my opinions.
But just to avoid the false assumption of consensus:

I'm firmly -1 on this issue.

sun’s picture

@pwolanin and I had a minor disagreement behind the scenes on the question of whether createStorage() should be

(A) explicit; i.e., calling it twice should throw an error/exception, or
(B) idempotent; i.e., essentially ensureStorage(), calling it twice does nothing.

After debating and disagreeing for a while, we asked others. :-) Since I/we plan to move/port that code into KeyValueStore later on, @msonnabaum was an adequate field expert to ask.

@msonnabaum wants to see the same implementation as for cache bins (idempotent), and the methods should be protected. — That is, because alternative storage implementations don't need any storage to get created first.

That said, he later mentioned an interesting idea that might be a fair middle-ground: Instead of try/catch in all methods, make the constructor ensure it once.

I guess we can leave that constructor idea/approach for later. For completeness, I was for the explicit approach, mainly because (1) this is about a required, non-volatile, persistent storage (!= cache) and (2) I don't like the idea of arbitrary code being able to call into createStorage() at any time "just to be sure". — However, @msonnabaum's recommendation eliminates that entirely, because the method cannot be called from anywhere else to begin with. So I'm happy to concede :-)

This means that the installer would essentially do "nothing", and the first time the config.storage.active service is requested and a write operation is attempted, it will try to create the storage.

webchick’s picture

We just had an epic discussion in IRC about this issue. The big hairy things that folks were concerned about to my recollection were:

a) The first point in the current "con" list: "harder to recover from invalid config causing a site-wide WSOD" In fact, "harder" seems like a pretty major understatement. Assuming you somehow found your way from a WSOD to the specific config key in the database that was causing the problem, your only recourse is directly modifying serialized text with SQL. :\

b) There are a few things in core (breakpoints, Tour) that do not provide any UI for editing, beyond the single file export browser thing. The inability to iteratively develop on those without either the SQL dance seems like a huge drawback, DX-wise. Yes, there's always install/uninstall but it's totally conceivable to want to write tours for existing data-providing modules as new features are added, and you would not want to lose all of your data just to tweak a sentence.

Whoever else was there, feel free to add your 2 cents.

webchick’s picture

Oh and a couple of counter-points to the above that I missed. beejeebus pointed out that if we're worried about invalid config, using files as storage is a much riskier proposition than the DB, which is atomic. And also that a lot of the workflow issues could likely be worked around in contrib.

pwolanin’s picture

Status: Needs work » Needs review
FileSize
13.11 KB
33.04 KB

Based on the discussion with sun, here's back to the automatic storage creation

xjm’s picture

The inability to iteratively develop on those without either the SQL dance seems like a huge drawback, DX-wise. Yes, there's always install/uninstall but it's totally conceivable to want to write tours for existing data-providing modules as new features are added, and you would not want to lose all of your data just to tweak a sentence.

I don't think this is a huge problem--or it's a problem that exists in HEAD anyway--it's just a little more awkward with DB storage. As far as I know, this would be the workflow:

  1. Make sure your staging dir is populated with your active config, by exporting the whole tree if necessary.
  2. Export the active config for your tour or whatever using drush or the UI.
  3. Tweak and stick it in staging.
  4. Config synch.
  5. Test.
  6. Rinse, repeat.
  7. When it's ready, put it back in the default config and commit.

And yeah, I can see devel including a tool to hack config directly in the UI to save module and theme developers time.

pwolanin’s picture

88: 2161591-88.patch queued for re-testing.

catch’s picture

\That said, he later mentioned an interesting idea that might be a fair middle-ground: Instead of try/catch in all methods, make the constructor ensure it once.

This would mean running db_table_exists() every request, it was already discussed and rejected for cache bin creation.

pwolanin’s picture

Waiting for sun to comment with some of thoughts about making it easier to swap the active storage for different installs. However, I think that can and should be a follow-up since this patch simply replaces on hard-coded class with another, and actually reduces the number of places it's fixed by using the service alias.

tim.plunkett’s picture

I don't think its responsible to continue to hardcode it. Either we make it swappable or we leave it alone.

pwolanin’s picture

FileSize
3.89 KB
36.22 KB

I tried to make the config.storage.active added in CoreServiceProvider like the uuid service, but I'm getting exceptions during the installer since the container builder is perhaps looking for it earlier due to the aliases.

I'm not sure it's possible to really get around hard-coding the storage for the kernel during install.

This patch adds a cleaner setting to get the instance to use in bootsrapping the kernel, but someone else would need to figure if it's possible to override the service definition that early in install. Possibly this was failing since the Settings class was still empty.

Status: Needs review » Needs work

The last submitted patch, 94: 2161591-94.patch, failed testing.

pwolanin’s picture

Status: Needs work » Needs review
FileSize
2.63 KB
38.85 KB

oops, LanguageServiceProvider also uses the bootstrap config. This should fix it.

Status: Needs review » Needs work

The last submitted patch, 96: 2161591-96.patch, failed testing.

pwolanin’s picture

Status: Needs work » Needs review

96: 2161591-96.patch queued for re-testing.

Status: Needs review » Needs work

The last submitted patch, 96: 2161591-96.patch, failed testing.

pwolanin’s picture

Status: Needs work » Needs review
FileSize
5.26 KB
34.91 KB

So, this was relying on the setting being in settings.php, but it fails for the test that starts install with an empty settings.php

This moves back closer to what in already in 8.x, but still provides documentation in settings.php and a method to get file storage if desired.

effulgentsia’s picture

Issue summary: View changes
effulgentsia’s picture

I don't think this is a huge problem...it's just a little more awkward with DB storage...
- Export the active config for your tour or whatever using drush or the UI.
- Tweak and stick it in staging.
- Config synch.

Yeah, I think it's that exact "little" more awkwardness that tim.plunkett objects to. I added that as an item to the cons list in #101.

pwolanin’s picture

We also have the single export/import UI. I would expect that to really be the place you'd do this, as discussed above.

tim.plunkett’s picture

If the single export/import UI is used as justification for shoehorning this issue in, I'll be very sad I ever wrote it.

Another thing I've used the file-based active store for:
As a module dev, symlinking specific files back to their original module, so that I can immediately see the changes to the module as I make changes.
This has been immensely useful when working on the core admin views.
Now, every save in the Views UI would require me to run an export of some sort, and copy it into the module I was working on.

Granted, that is not a need for a production site, but that is one of many tricks that could evolve in D8 if file storage is still available for development.
That's part of why I'm pushing hard for it to be swappable.

That said, the last couple patches seem much better. I'll have to have another look.

tim.plunkett’s picture

Title: Change active config from file storage to DB storage » Change default active config from file storage to DB storage
Status: Needs review » Needs work
  1. +++ b/core/core.services.yml
    @@ -67,16 +67,9 @@ services:
    -  config.cachedstorage.storage:
    -    class: Drupal\Core\Config\FileStorage
    -    factory_class: Drupal\Core\Config\FileStorageFactory
    -    factory_method: getActive
    
    @@ -85,6 +78,11 @@ services:
    +  config.storage.active:
    

    This would be good to leave intact with some name like config.storage.file, so that a module trying to switch it back can refer to it in its ServiceModifier.

  2. +++ b/core/includes/install.core.inc
    @@ -439,7 +439,7 @@ function install_begin_request(&$install_state) {
    -    $config = glob(config_get_config_directory(CONFIG_ACTIVE_DIRECTORY) . '/*.' . FileStorage::getFileExtension());
    +    $config = Database::getConnection()->schema()->tableExists('config');
    

    I think this is one of the few changes that just looks *wrong*. Can we add a method to the storage instead, or is this just too early?

  3. +++ b/core/lib/Drupal/Core/Config/Config.php
    @@ -215,6 +215,11 @@ public function save() {
    +    else {
    +      foreach ($this->data as $key => $value) {
    +        $this->validateValue($key, $value);
    +      }
    

    Looks out of scope, what's up?

  4. +++ b/core/modules/config/lib/Drupal/config/Form/ConfigSingleExportForm.php
    @@ -151,8 +164,7 @@ public function updateExport($form, &$form_state) {
    -    $data = $this->configStorage->read($name);
    -    $form['export']['#value'] = $this->configStorage->encode($data);
    +    $form['export']['#value'] = $this->dumper->dump($this->configStorage->read($name), PHP_INT_MAX, 0, TRUE);
    
    +++ b/core/modules/config/lib/Drupal/config/Tests/ConfigSingleImportExportTest.php
    @@ -152,7 +153,7 @@ public function testExport() {
    -    $data = \Drupal::service('config.storage')->encode($fallback_date->toArray());
    +    $data = $yaml->dump($fallback_date->toArray(), 2, 2);
    

    Not clear how these are different, and that's vaguely unfortunate.

  5. +++ b/sites/default/default.settings.php
    @@ -601,6 +601,19 @@
    + $settings['bootstrap_config_storage'] = array('Drupal\Core\Config\BootstrapConfigStorageFactory', 'getDatabaseStorage');
    

    a) I think the example should have getFileStorage() here.

    b) We need to clarify if this is actually needed. I swapped out the services locally without this line (after installing), and things seemed to work. If this is just for the installer, let's rename it. Otherwise we need to actually document when we need this.
    I did notice that \Drupal\language\LanguageServiceProvider calls it, but that seems wrong on its own.

If we were to leave the old file storage service, a module (like devel) would only need this:

class FooServiceProvider implements ServiceModifierInterface {
  public function alter(ContainerBuilder $container) {
    $container->getDefinition('config.storage.active')
      ->setClass('Drupal\Core\Config\CachedStorage')
      ->setArguments(array(new Reference('config.storage.file'), new Reference('cache.config')));
  }
}

Another feature we will need is the ability to export from the DB to the active directory while enabling a service provider like this. That of course, would be something for contrib.

Finally, I'd like to see a test module with a service modifier like the above, in order to prove that things do still work with file storage.

I think that would get us to an acceptable level of swappability.

pwolanin’s picture

@tim.plunkett

  • #1: sure
  • #2: per IRC, we could try using the bootstrap config and expect an exception when doing listAll()?
  • #3: This makes applies consistent validation with and without schema. Before this change the tests relied on an exception being thrown by the YAML dumper that was only in FileStorage in order to (misleadingly) assert that the value failed validation
  • #4: They are different since now we always export as YAML, rather than the variable serialization format of the storage back-end
  • #5: as discussed in IRC, we could make this a commented-out example instead
pwolanin’s picture

ok - talked to tim an dI misunderstood hist point #4 - we need to make the dump call consistent

pwolanin’s picture

Status: Needs work » Needs review
FileSize
6.29 KB
36.25 KB

I think this addresses Tim's concerns.

tim.plunkett’s picture

+++ b/core/modules/config/lib/Drupal/config/Controller/ConfigController.php
@@ -81,13 +82,15 @@ public function __construct(StorageInterface $target_storage, StorageInterface $
+      $archiver->addString("$name.yml", $dumper->dump(\Drupal::config($name)->get(), PHP_INT_MAX, 0, TRUE));

+++ b/core/modules/config/lib/Drupal/config/Form/ConfigSingleExportForm.php
@@ -151,8 +164,7 @@ public function updateExport($form, &$form_state) {
+    $form['export']['#value'] = $this->dumper->dump($this->configStorage->read($name), PHP_INT_MAX);

+++ b/core/modules/config/lib/Drupal/config/Tests/ConfigSingleImportExportTest.php
@@ -111,12 +127,11 @@ public function testImport() {
+      'import' => $this->dumper->dump($config->get(), PHP_INT_MAX),

@@ -152,7 +167,7 @@ public function testExport() {
+    $data = $this->dumper->dump($fallback_date->toArray(), PHP_INT_MAX);

Okay, this is really the last thing wrong with this. I *think* we want to always use , PHP_INT_MAX, 0, TRUE in all places, to be consistent with core.

pwolanin’s picture

So, I agree it's not consistent in core, really we'd only want to throw an exception on export.

the calls to dump() in \Drupal\config\Form\ConfigSingleExportForm and \Drupal\Core\Config\ConfigManager don't allow it to throw exceptions, and it's not clear that they should. If we change that behavior we'd also need to annotate all the methods with @throws.

tim.plunkett’s picture

Status: Needs review » Reviewed & tested by the community

Yeah I missed the ones in ConfigManager. We talked through each of the cases I called out, and this is fine as is.

I'll crosspost the devel.module issues to restore sanity for local dev when I open them, but this looks to be properly swappable.

xjm’s picture

Assigned: Unassigned » Dries

Thanks everyone! Dries needs to take a look at this issue, so assigning to him.

sun’s picture

Any chance to simply commit #2208633: Add a Yaml class to Drupal\Component\Serialization first, so as to avoid all of these manual/custom parameters to Yaml\Dumper::dump() throughout core? That issue + its sister issues are RTBC for several days/weeks already (and painfully chasing HEAD...)

aspilicious’s picture

Bu this one is a beta blocker...

xjm’s picture

Yep, what @aspilicious said. And Dries needs to review it either way.

sun’s picture

FileSize
36.89 KB

Since others and I raised major concerns against this change proposal, I assume it is a good idea to (1) clear up some (wide-spread) confusion and (2) outline a concrete path that is going to resolve each and every single concern that has been raised against this change proposal.

Doing so requires lots of context and background knowledge, but I'll try to keep it as short and structured as possible:

  1. The storage engine [+ format] used for active/runtime configuration is a regular, non-volatile data store.

    The data can be stored in whichever way you like, as long as the storage (1) is persistent, (2) is available at runtime, and (3) is not suddenly swapped out with a different one.

    → A single instance of a Drupal site will operate just fine, as long as the storage to use is defined at install time and sticks forever.

  2. The active config storage is not used for staging purposes. — The process of staging configuration is a unique and complex concept of its own that is not to be confused.

    1. Copying files from your active config directory on your local site to your production site is not supported.
    2. Manually editing config files in your active config directory is not supported.

    To stage configuration between multiple instances of your site, you have to use the built-in facilities for exporting and importing configuration.

    The "export" mechanism retrieves configuration from the active storage, performs validations, and writes to the staging storage.

    The result of that process depends on the staging storage implementation. It may be a directory of files, it may be tarball that is offered for download, or it might be a shell process that commits to git.

    → In order to stage configuration from site instance A (dev) to site instance B (prod), you have to use the export/import process.

    If you do not follow the config export process, then it is not guaranteed that the exported configuration can be imported.

  3. Identically to the active storage, the staging storage equally is a one-time (cross-)environment-specific decision.

    Both storages are nothing else than data carriers. You can use whatever suits your needs best, as long as the higher-level application subsystems are able to rely on them:

    1. The active config storage is specific to a particular site environment/installation. It is the exact same decision as choosing a database driver in the Drupal installer; i.e., MySQL vs. Postgres vs. SQLite.

      Active config storage:

      Site instance A != Site instance B != Site instance C

    2. The staging config storage is specific to a set of site instances that are representing the same "universally unique site." As long as all instances agree with each other, they can use whichever staging storage implementation they want.

      Staging config storage:

      Site instance A == Site instance B == Site instance C

    This implies the following:

    1. This whole topic is a matter of environments, not storages.
    2. The storage is irrelevant, as long as the requirements of higher-level application concepts are met.
    3. To swap out a storage, your environment has to satisfy the requirements of the config sync/export/import processes.
  4. The requirements for environments are vastly different.

    Developers need:

    1. Transparency: Easy access to configuration. By definition, configuration controls and directs the application but does not necessarily break it.
    2. Debugging: Recover from wrong configuration after e.g. switching git branches. → #1897468: Kernel rebuilds service container on every request when a module vanishes
    3. Authoring: Copying desired config files into a module's or installation profile's config default/install directory.

    Production sites need:

    1. Reliability: Race-conditions in writing runtime configuration caused by parallel requests must not exist.
    2. Scalability: Runtime configuration must be identical and synchronized across multiple web-heads.
    3. Performance: Runtime configuration should use the most performant storage engine available in a large-scale, high-traffic production environment.

    → None of the concerns should be after-thoughts. At the same time, it is impossible to meet the requirements of all environments with a single implementation.

    → None of both can be prioritized either. — We can't just simply ruin the DX for thousands of developers to account for potential problems on large-scale, high-traffic, and potentially cloud-hosted production sites leveraging multiple web-heads.

  5. To resolve all concerns and cleanly resolve the problem space, we need a proper separation of environments.

    #2226761: Change all default settings and config to fast/safe production values

    By now, I have a very concrete + feasible plan and vision for how that would work:

    1. The early Drupal installer asks you to choose the environment for the site instance you're installing.
    2. The installer automatically configures storage services (in the same way it configures the database connection info).

    Look at this in action:

     Production or Development

  6. This is not a pipe-dream. The parallel effort is >90% done already.

    The following issues are RTBC and merely waiting for commits:

    #2216527: Inject a serialization format into database key/value storage
    #2208609: Move Json utility class into Drupal\Component\Serialization
    #2208633: Add a Yaml class to Drupal\Component\Serialization

    The following issue is blocked on a naming bikeshed only:

    #2161643: Add a KeyValueStore\IterableKeysInterface (allowing to retrieve keys with a given prefix)

    With those (+ this), the following primary change will boil down to some minimal changes only:

    #2190723: Add a KeyValueStore\FileStorage to replace e.g. ConfigStorage

    As clearly visible, the parallel effort even goes one step further:

    1. It removes the entire custom storage concept from the configuration system, because it is nothing else than a key/value store.
    2. By removing the custom storage layer concept, you can use any key/value store implementation for your active configuration, regardless of whether that is a database, files, redis, mongo, or whatnot.

    → All available key/value stores are immediately available to be used as active config storage.

    To omit the storage defaults in the installer - as with any other swappable storage engine - you customize settings.php before hitting the installer:

    $settings['services']['config.storage.active'] = 'Drupal\custom\KeyValueStore\CustomStorage';
    

    (The only remaining issue with that is autoloading of custom namespaces in the early installer, but that is a completely different topic that needs to be resolved separately. — Likewise, such environment-specific service overrides are really supposed to live in a $conf_path/services.yml file, but that's yet another separate topic.)

  7. A custom storage override is a one-time install-time decision and sticks forever.

    Just like the installation profile sticks forever. You cannot simply switch to a different installation profile either.

    Just like the public files directory sticks forever. You cannot simply change that filesystem path either.

    1. There will be no way to "migrate" from one storage to another — in the same way there is no way to migrate your database from MySQL to Postgres.
    2. External tools (like Drush) can provide mechanisms for doing so.
    3. Staging configuration across site instances (each using different storages) requires you to use the config export/import mechanism.

    In case all you want to do is to change the config storage of a particular site instance from one implementation to another:

    #1613424: [META] Allow a site to be installed from existing configuration

    Steps in words:

    1. Export your current configuration.
    2. Destroy your current site instance.
    3. Re-install with preexisting configuration and choose a different environment (active config storage).

To summarize:

  • I'm +1 on this change, if we are committed to resolve the challenge of conflicting requirements for different environments in D8.

    That is not a pipe-dream, but doable within the next days — progress is only blocked on commits.

  • I'm –1 on this change, if there's no buy-in for the concrete plan outlined above.

    In that case, large-scale/high-traffic/cloud-hosted sites can and should have to tweak Drupal's storage defaults in the same way they always had to in the past. The proposed change of this issue optimizes for 1% of the total user-base. → Use Pressflow, thanks.

Lastly:

  1. Most of the ConfigStorage improvements will be moved to KeyValueStore implementations. The custom ConfigStorage* implementations will vanish entirely. The changes in this patch are helpful.
  2. I generally support this patch, because Drupal core should be fast, scalable, and reliable for production sites by default. We should do our best to reduce the (giant) amount of necessary changes to make Drupal work in production environments.
  3. The entire drama about this change proposal is unnecessary. We are able to resolve the environment-specific requirements. All of the raised concerns would be obsolete today if the patches of the parallel effort were committed already.
alexpott’s picture

@sun in order for the dev prod choice that you've mocked up in #116 to be relevant don't we need #1613424: [META] Allow a site to be installed from existing configuration too? Otherwise how can you have a production and dev version of the same site?

chx’s picture

chx’s picture

As it is usual it is almost impossible to follow which CMI issue does what and where should one chime in.

I will say this is a good change; I will say that going key-value store is a bad idea unless somehow the config-ness (that the data is recursive arrays and nothing else) is communicated to the store -- this is necessary for the store to store config for config entities in a smart way that helps config entity queries to be fast. You can't do that if you know nothing about the data you store, a generic k-v store can only run serialize over the data. Don't dumb down more than you need to!

The only remaining issue with that is autoloading of custom namespaces in the early installer, but that is a completely different topic that needs to be resolved separately.

Agreed. The discovery in DrupalKernel needs a bit of love and cleanup and namespaces is part of that. Maybe I will do that.

A custom storage override is a one-time install-time decision and sticks forever...
There will be no way to "migrate" from one storage to another...
In case all you want to do is to change the config storage of a particular site instance from one implementation to another...

There already is code migrating from one storage to another: http://drupalcode.org/project/mongodb.git/blob/0674b7b054032f4bae463cff5... it is not even horribly complicated. The only challenge was figuring out when to run this code. I delete the files but just as easily I could do a write through in my config storage and keep files up to date and then removing mongodb would be possible. I just don't want to support that stuff.

  • Commit abc7e15 on 8.x by Dries:
    Issue #2161591 by pwolanin, beejeebus, sun: Change default active config...
Dries’s picture

Assigned: Dries » Unassigned
Status: Reviewed & tested by the community » Fixed

I'm comfortable with this approach (like many others are). Committed to 8.x. Thank you!

alexpott’s picture

Status: Fixed » Active
Issue tags: +Missing change record

Considering that we've been talking about active config in files for 2+ years I think we should have an 8.x to 8.x change record for this.

pwolanin’s picture

here's a draft change notice: https://drupal.org/node/2241059

giorgio79’s picture

Somewhat related to performance: for querying this key value store we could use the innodb memcache protocol as of mysql 5.6 that can be 9x faster than simple innodb #2223757: Automatically use the memcache protocol on MySQL 5.6: 1.5x - 2x faster queries

xjm’s picture

Title: Change default active config from file storage to DB storage » Change record: Change default active config from file storage to DB storage

Thanks @pwolanin. We likely also need to update a bunch of existing change records.

xjm’s picture

I published https://drupal.org/node/2241059, because it's going to break everyone's existing D8 installs. Leaving this open though to improve it and check for other docs that need updates.

sun’s picture

How to use files for debugging configuration and hacking on D8 (or if you don't want to re-install):

https://gist.github.com/sun/10732324

Gábor Hojtsy’s picture

I updated the change notice with https://drupal.org/node/2241059/revisions/view/7140545/7145433 so it explains the benefits for people who are freaked out :D

xjm’s picture

Status: Active » Needs work

Thanks Gábor -- that's what I had in mind. :)

That leaves updates to existing ones. Looks like we need to update:
https://drupal.org/node/1500260 (soooo out of date, lol)
https://drupal.org/node/2164727 (how?)
That's all I could find, actually.

xjm’s picture

Gábor and I updated https://drupal.org/node/2164727/revisions/view/6803915/7149123. The original CMI change record still needs significant work.

xjm’s picture

Title: Change record: Change default active config from file storage to DB storage » Change default active config from file storage to DB storage
Status: Needs work » Fixed
Issue tags: -Missing change record

And I fixed https://drupal.org/node/1500260 by gutting it and linking to the far more current handbook documentation about the configuration API.

Gábor is also filing a followup to fix obsolete text in settings.php wrt the active config.

Gábor Hojtsy’s picture

While preparing the change notices, I found #2243439: Both file and db configuration storage are said to be default in settings.php and hit that bump in understanding which storage is used when, etc. The current docs are entirely untrue :)

Status: Fixed » Closed (fixed)

Automatically closed - issue fixed for 2 weeks with no activity.