Problem/Motivation
Postponed on #3492391: Make the event dispatcher available before container full bootstrap
Menu tree storage, config and cache use the same pattern: execute a query, catch an exception, try to create the table and, if that succeeds, rerun the query. This is already wasteful, and there is already demand for more of it: #2338747: Move {router} out of system.install and create the table lazy for it elsewhere.
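The pattern in question can be sketched outside of Drupal's database layer. This is an illustrative stand-in using plain PDO/SQLite (the table, column, and function names are made up for the example), not core's actual code:

```php
<?php
// Illustrative sketch of the "catch, create, rerun" pattern described
// above, using PDO/SQLite instead of Drupal's database layer. Table
// and column names are invented for this example.
$pdo = new PDO('sqlite::memory:');
$pdo->setAttribute(PDO::ATTR_ERRMODE, PDO::ERRMODE_EXCEPTION);

function registerEvent(PDO $pdo, string $event): void {
  $sql = 'INSERT INTO flood (event) VALUES (:event)';
  try {
    $pdo->prepare($sql)->execute([':event' => $event]);
  }
  catch (PDOException $e) {
    // The query failed; assume the table is missing, try to create it,
    // and if that succeeds, rerun the original query.
    $pdo->exec('CREATE TABLE flood (event TEXT NOT NULL)');
    $pdo->prepare($sql)->execute([':event' => $event]);
  }
}

registerEvent($pdo, 'user.failed_login_ip');
echo (int) $pdo->query('SELECT COUNT(*) FROM flood')->fetchColumn(); // prints 1
```

Every service repeats this try/catch/create/retry dance with its own table, which is what the proposed trait factors out.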
Proposed resolution
Create a trait for lazy table creation and use that trait in the existing backend services. Later do the same with services that now do not make use of lazy table creation.
Remaining tasks
User interface changes
API changes
The new trait Drupal\Core\Database\LazyTableCreationTrait is created.
The trait has two methods: Drupal\Core\Database\LazyTableCreationTrait::catchException() and Drupal\Core\Database\LazyTableCreationTrait::ensureTableExists(). The first suppresses the exception thrown when the table does not exist; the second tries to create the table. There is also an abstract method, Drupal\Core\Database\LazyTableCreationTrait::schemaDefinition(), that every service using the trait must implement; it should return the schema for the table.
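A self-contained sketch of the trait's shape, backed by PDO/SQLite instead of Drupal's Schema API: the method names follow the summary above, but the bodies and the DatabaseQueue-style consumer are illustrative stand-ins, not core's actual code (core's schemaDefinition() returns a Schema API array, not a DDL string).

```php
<?php
// Hedged sketch of the trait described above. PDO/SQLite stand-in.
trait LazyTableCreationTrait {

  // Every consumer must describe its own table (cf. hook_schema()).
  // A raw DDL string keeps this sketch self-contained.
  abstract protected function schemaDefinition(): string;

  // Try to create the table; TRUE if it now exists.
  protected function ensureTableExists(): bool {
    try {
      $this->connection->exec($this->schemaDefinition());
      return TRUE;
    }
    catch (PDOException $e) {
      // Creation failed, most likely because the table already exists,
      // meaning the original failure was not a missing table.
      return FALSE;
    }
  }

  // Swallow a "table missing" failure by creating the table; re-throw
  // anything that creating the table cannot fix.
  protected function catchException(PDOException $e): void {
    if (!$this->ensureTableExists()) {
      throw $e;
    }
  }

}

class DatabaseQueue {
  use LazyTableCreationTrait;

  public function __construct(private PDO $connection) {}

  protected function schemaDefinition(): string {
    return 'CREATE TABLE queue (item TEXT NOT NULL)';
  }

  public function createItem(string $item): void {
    $sql = 'INSERT INTO queue (item) VALUES (:item)';
    try {
      $this->connection->prepare($sql)->execute([':item' => $item]);
    }
    catch (PDOException $e) {
      $this->catchException($e);
      // The table exists now, so rerun the query once.
      $this->connection->prepare($sql)->execute([':item' => $item]);
    }
  }

}

$pdo = new PDO('sqlite::memory:');
$pdo->setAttribute(PDO::ATTR_ERRMODE, PDO::ERRMODE_EXCEPTION);
$queue = new DatabaseQueue($pdo);
$queue->createItem('send_mail');
echo (int) $pdo->query('SELECT COUNT(*) FROM queue')->fetchColumn(); // prints 1
```

The first createItem() call hits a missing table, catchException() creates it, and the insert is rerun; later calls skip the catch path entirely.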
A number of class constants have been deprecated and will be removed before Drupal 12.0.0:
Drupal\Core\Batch\BatchStorage::TABLE_NAME
Drupal\Core\Flood\DatabaseBackend::TABLE_NAME
Drupal\Core\Lock\DatabaseLockBackend::TABLE_NAME
Drupal\Core\Queue\DatabaseQueue::TABLE_NAME
A number of protected class variables have been deprecated and will be removed before Drupal 12.0.0:
Drupal\Core\Lock\DatabaseLockBackend::database
Drupal\Core\Routing\MatcherDumper::tableName
Comments
Comment #2
chx commented
Comment #3
chx commented
Bumped ensureTableExists to Connection; now we should install at least. Writing tests.
Comment #5
chx commented
Comment #6
chx commented
Comment #8
chx commented
Comment #9
chx commented
dawehner asked for a rename.
Comment #10
chx commented
Bumped ensureTableExists to Schema; there's one usage of it outside of the database layer, so it doesn't really matter, and it's really a schema-ish operation. Added a test for this one too. The patch is still net negative (54 LoC) despite adding a new interface and tests, and it of course enables doing this conversion for a lot of other services very cheaply.
Comment #11
chx commented
Missing interface doxygen pointed out by dawehner. Added the results of some discussions with dawehner into the summary.
Comment #12
chx commented
I realized that running a select on an empty table is pointless. Genius, that's my middle name, isn't it? Took me a bit.
Comment #13
dawehner commented
- It would be nice to explain why this schema is separated out...
- Given that it's more than just a string, we should document it if possible. Alternatively, getTable() could always return the table string.
- I'm curious... did you consider adding the table name to the SchemaProviderInterface?
- Maybe put a @see hook_schema() on here.
- Ha, this is not rudimentary anymore.
- Let's rename it to testSimpleSelectWithEnsuringTable.
Comment #14
dawehner commented
Comment #16
chx commented
All done except 4), because the query already has the table name.
Comment #17
dawehner commented
This change is pretty convenient, and in the future it could even make some of the tests faster as well as avoid the manual creation of tables in DrupalKernel tests, which would be pretty handy.
Comment #18
Crell commented
Overall this makes good sense, and I like how it's broken up into pieces within the DB API. That said, there are some improvements we can still make:
If the table doesn't exist, why do we need to create the table and then delete? Our desired post-condition is already true (there are no records with those $cids in the database), so why bother creating the table? We only need do that on insert/update.
Same comment here. If a truncate fails, our post-condition is known-true so why bother creating the table?
Not introduced by this patch so not really a blocker here, but can someone explain to me why there's a drupal_static() call inside a class? Wouldn't an object property accomplish the same thing with less hackery?
Off topic, but a reasonable improvement.
This could be read as whether or not the table existed before, not whether or not we're now sure the object exists. New wording: "TRUE if the table already existed or now exists, FALSE if there was an error and it does not."
Since that's a tests-only table, can we not move it into this class entirely rather than calling out to hook_schema?
Nitpick: static, not self.
Discussing with dawehner at BADCamp I suggested that maybe we should have a table object, rather than a "schema provider object" (since in all relevant cases we will be ensuring only a single table). However, reading through the patch I can see the benefit of having the schema defined directly on the object that is the swappable service. So I'm OK with this design.
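Crell's first point in #18 (delete need not create the table) can be sketched as follows. deleteMultiple() is a hypothetical helper over PDO/SQLite, not core's code: when deleting, a missing table already satisfies the post-condition, so the exception can simply be swallowed.

```php
<?php
// Hedged sketch of the first point in #18: on delete, a missing table
// means the desired post-condition ("no rows with these $cids") already
// holds, so there is nothing to create. deleteMultiple() is a
// hypothetical helper for illustration only.
function deleteMultiple(PDO $pdo, array $cids): void {
  if (!$cids) {
    return;
  }
  $placeholders = implode(',', array_fill(0, count($cids), '?'));
  try {
    $pdo->prepare("DELETE FROM cache WHERE cid IN ($placeholders)")
      ->execute(array_values($cids));
  }
  catch (PDOException $e) {
    // Swallow "table missing": there are no such rows anyway.
  }
}

$pdo = new PDO('sqlite::memory:');
$pdo->setAttribute(PDO::ATTR_ERRMODE, PDO::ERRMODE_EXCEPTION);
deleteMultiple($pdo, ['route:front', 'route:admin']); // no error, no table created
var_dump($pdo->query("SELECT name FROM sqlite_master WHERE name = 'cache'")->fetchColumn()); // bool(false)
```

As Crell notes, real code should swallow only a table-not-found error; the blanket catch here is a simplification of that point, not a recommendation.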
Comment #19
chx commented
1, 2: reverted (and removed the bug fix, not necessary now).
3: off topic as noted.
5, 7: fixed.
6: the schema is in the module and is used by the installSchema() call in setUp(), so why would I copy it?
Comment #20
dawehner commented
The feedback from @crell was addressed in #20.
Not copying the schema array is a valid point.
Comment #21
catch commented
I've just committed #1426804: Allow field storages to be persisted when they have no fields. - this added some test code that we should be able to remove again here. Leaving RTBC since that can happen in a separate issue, but also not committing yet because I've not been able to give this a proper look yet (I like the idea a lot, though).
Comment #22
yched commented
This indeed looks like it would let us remove some edgy test code from #1426804: Allow field storages to be persisted when they have no fields., but that would require adding a SchemaProviderInterface class for the Entity & Field table schemas, which is probably outside the scope of this issue.
On that aspect though :
1) SchemaProviderInterface::getSchema() [no param] means there has to be one SchemaProvider object per table, which doesn't look applicable for dynamic needs like, well, entities & fields. Could that method receive the $table_name as a param to allow the same object to return schemas for different tables ? Then single-table implementations like the ones in this patch can always ditch/overlook the param.
2) Naming fun...
In the "Entity/field SQL storage" space, SqlContentEntityStorageSchema is in charge of assembling entity & field tables schema and creating the tables (currently does so in the various onEntityTypeCreate() / onFieldStorageCreate() / ... methods from EntityTypeListenerInterface & FieldStorageDefinitionListenerInterface)
So it would most likely be the object to also implement the SchemaProviderInterface added here.
But then: SqlContentEntityStorageSchema implements SchemaProviderInterface?
I.e. "a Schema is a SchemaProvider"? That sounds quite recursive :-/
That's a case of two different-but-related systems using the same name for slightly different things. Naming things in the "SQL entity storage schema" issue was quite debated IIRC. In the end, the "Schema" objects in the "SQL entity storage schema" land are more "handlers for some schema logic".
If this issue adds its notion of SchemaProviderInterface (which I think is a perfectly reasonable name), and Entity/field SQL storage has to use it, then maybe it would make sense for SqlContentEntityStorageSchema and its existing interfaces to move to being "SchemaProviders" too, instead of just "Schemas"...
Anyway - item 2) is probably more for discussion with the Entity Storage folks, probably in the followup issue where we move entity/field tables to the on-demand-creation mechanism added here.
Comment #23
chx commented
> Could that method receive the $table_name as a param to allow the same object to return schemas for different tables? Then single-table implementations like the ones in this patch can always ditch/overlook the param.
Very nice idea! I didn't want the schema to return an array indexed by table names because the single-table use is common, but your suggestion makes a lot of sense. So setting to CNW to move {cachetags} in.
Comment #24
plach commented
@yched:
In the early Entity Storage patches, SqlContentEntityStorageSchema was called ContentEntitySchemaHandler or something like that. I guess SqlContentEntityStorageSchemaProvider would definitely make more sense, although "handler" is the right word IMO, as it does more than just return a schema definition. I think we removed the handler suffix for consistency with the other entity handlers, but I've never been a big fan of that choice. Anyway, I guess such a rename would need to happen in a BC way and, given the new policies, I'm not even sure that would be enough to let it be approved.
Comment #25
yched commented
@chx, @plach:
Thinking a bit more about passing $table_name to SchemaProviderInterface::getSchema(), I don't think that would be enough for SqlContentEntityStorageSchema(). It means getSchema($table_name) needs to reverse-engineer the table name to figure out which table it is (the base table, the revision table, the data table, some single-field table...) - that is, do the opposite of TableMappingInterface::getXxxTableName(). That would be some brittle regexp logic - not even sure that's doable since we hash some parts to avoid long table names.
Would be more helpful if SchemaProviderInterface::getSchema() received the "internal code for the table" : 'base_table', 'revision_table', 'data_table', 'field_table for field foobar'...
Meaning, SchemaProviderInterface::getSchema() should receive the table name (good enough for other, non-entity cases) + an arbitrary $context that was passed to $query->executeEnsuringTable($schemaProvider, $context) ?
(the code that does the query knows which table it is talking about, and is able to pass the right $context to describe that table)
Comment #26
andypost commented
@chx++
Having this in core will help history and comment_statistics have their own storage on a per-bundle basis.
The tracker could get some facelift too.
Comment #27
yched commented
Wondering how the patch would work for queries JOINing several tables.
- One single $query->executeEnsuringTable[s?]($schemaProvider), with the same $schemaProvider used for all the tables in the query? (Not really compatible with the per-table $context param suggested in #25.)
- One $query->executeEnsuringTable($schemaProvider, $table_name) per table present in the query?
Comment #28
yched commented
FTR, regarding #25 and #27: I'd hate to block this patch on requests for the (somewhat complex...) Entity/Field case. Having on-demand creation for Entity/Field tables would definitely make tests faster and simpler to write, but moving *all* runtime Entity/Field requests (SqlStorageController + EFQ + Views...) to "beware, the tables might not exist yet, use $query->executeEnsuringTable() correctly" is not likely to be completely trivial, and having on-demand table creation for other systems is totally worthwhile in itself.
Just trying to see if we can shape the initial feature in a way that keeps the door open for entities without too much API changes. I know, that's a fine line to scope creep :-/
Comment #29
chx commented
More and more I tend to think that while adding a table name serves this issue well, anything else should be a followup. Any query doing a JOIN is not fit for this. It's possible the field table case is too complicated to be covered by this simple mechanism. It is useful for so much else, though.
Comment #30
catch commented
This might be an awful idea, but what if we just put this logic directly into execute()?
The try/catch has no real cost.
In the catch case, if there's no schema definition we'd just rethrow the exception; that will have a bit of overhead, but PDO exceptions are rare enough that it shouldn't matter.
Also I'm wondering what happens if the lazy table creation is implemented, but the first query to that table is in a query where it's not the base table. That feels like a race condition but also possible - could we check other tables in the query if the base table is OK, and only rethrow the exception after doing that?
Comment #31
chx commented
Great idea with an optional schema_provider argument on execute() itself. But I'd rather answer your multi-table question the same way I answered yched's: if there is more than one table, simply re-throw. This solution is for simple cases, I would say.
Comment #32
chx commented
By the way, this is why I still love working on Drupal core: I thought this was ready, but since then we got two really great ideas! Please bring them on :)
Comment #33
Crell commented
I think it's important to target the 80-90% use case here, not all use cases. To that end:
1) If you're a use case where you're not working with the base table, I guess this doesn't work for you. That's a case where you'd still need to handle a variant of this logic manually (just as the current use cases are doing now). If even that isn't sufficient, you're back to hook_schema() (i.e., what you need to do already). Fields may be such a use case where following this logic manually is necessary, or at least a task for a second pass.
2) I would prefer to not fold this logic into execute(). That complects executing the query with the (rare) case of lazy schema generation. If you want to mess with lazy schema generation you should know what you're doing. Keep each method single-purpose. (SRP and all that.)
Comment #34
chx commented
Crell's argument for 2) sounds compelling, but I will wait for catch's answer.
Comment #35
catch commented
I'm OK with not handling entities and fields here. I do think we should open a follow-up to explore that more, though, and try to leave that open in this patch as much as possible; pretty much what yched said in #28.
@Crell this is another case where 'complection' doesn't really stand up as a magic/dirty word when looking at the overall context of the change.
Having the separate method only works if every read from the table is from exactly the same system that's handling the write. In the case of entities/fields there's at least three systems that interact with the storage (entity CRUD, entity field query and Views). If we don't use this for fields (which clearly is the most advanced and difficult to handle use case), there are other places where lazy table creation would be useful, that also have Views integration.
The obvious example in core is dblog. The Logger/DbLog class is currently in dblog module, and there's a hook_schema(). However it seems quite likely that we'll want to move the actual logger to core/lib at some point and drop the hook_schema(). This would allow the logger to be used independently of the UI at least, and remove the implicit dependency of the class on hook_schema() and module installation which was my original motivation for opening #1167144: Make cache backends responsible for their own storage.
However dblog module would still provide views integration, services.yml and extra UI bits like displaying individual log messages.
With the logic in execute(), dblog only needs to do the hook_schema() -> SchemaProviderInterface change, which is pure implementation detail and no API change (with the partial exception of the schema no longer showing up in drupal_get_schema() but that's never actually used any more and we're close to dropping it at this point).
With the logic in executeEnsuringTable(), Views would need to be able to distinguish between queries that need executeEnsuringTable() and queries that don't. If the former, it would also need an addition to the hook_views_data() API to allow modules to communicate whether a table is lazy-created or not.
(Side note, but any module can define Views integration for anything, I could see 'cache inspector' and 'queue inspector' modules showing up in contrib).
So while having two separate public methods keeps the concepts separate in the database layer, it will slowly have the effect of bleeding knowledge about lazy table creation - both the general concept and whether particular tables are lazy-created or not, around different subsystems in core, creating considerably more coupling overall.
Given that, this looks like an example of SRP taken too far to me, found this discussion which looks relevant too:
From http://programmers.stackexchange.com/questions/150760/single-responsibil...
Apart from that: I'd expect that by Drupal 9 we move almost entirely away from hook_schema(), and more or less all tables are created via this mechanism, because a major trend since Drupal 5 or so has been to decouple various subsystems from specific database implementations (cache, lock, queue, entities). Most of what we have left in core hook_schema() definitions is things that simply have not been gotten to yet; @andypost gave three good examples. So the use cases for querying a table that's not lazy-created have already become increasingly narrow, and this will tend towards the default (let alone that hand-writing database queries is becoming increasingly rare anyway with Views and EFQ).
Comment #36
yched commented
Interesting points in #35.
Not sure I get how putting the logic in execute() is that different from having a dedicated executeEnsuringTable() method though. Whatever the method, a SchemaProviderInterface object needs to be passed to the query anyway, right ?
So the code that writes the query has to know 1) that the table is lazy-created and 2) what the right SchemaProvider for the table is. execute() doesn't bring much over executeEnsuringTable() regarding "who needs to know what"?
(except executeEnsuringTable() is easier to add in query_alter() ?)
Comment #37
plach commented
Adding to #35: I think doing this for entities/fields would be tricky because atm the events that trigger schema creation/changes are completely separated from actual querying. Hence I'm not sure it would be easy to find a point in the various execution flows where it makes sense to call an explicit executeEnsuringTable() method (and pass a schema provider) and still keep things storage-agnostic. I have no actual concern, just the feeling that this would not play well with the current architecture... (Obviously this is not meant to be a blocker at all :)
Comment #38
catch commented
Thinking through it, that's making me think that having to provide the SchemaProviderInterface object at all is potentially brittle. It's not more brittle than what we do now, but if we apply the pattern to places like dblog then it gets trickier.
The only way to handle that would be to add either something declarative or an event so that when the exception is caught, it then figures out whether the table has a matching SchemaProviderInterface class somewhere - but that adds a fair bit of dependencies.
Comment #39
yched commented
Which is a bit like hook_schema() with an indirection: mapping table names to "the name of a class that can provide the schema" instead of the schema array directly...
Yeah, sounds a bit complicated in terms of instantiating the classes - they are likely to be business objects too, rather than just SchemaProviderInterface objects, and thus have dependencies...
What this patch gives us is a mechanism for on-demand creation of tables, where the impact is that all queries on those tables need to account for "the table might not exist yet" and provide the SchemaProvider for the table.
That is good enough for self-enclosed systems where all queries are generated by using a defined API - which is a general trend anyway since we tend to abstract from hardcoded SQL implementations. --> log, cache, history, comment_statistics...
Then, as @catch points out, there's Views, which virtually allows exposing any SQL table, bypassing the APIs above.
But then could Views simply try / catch its query and consider that "Exception, table does not exist" means "empty result set" ?
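The idea in the question above can be sketched with a hypothetical query runner (PDO/SQLite stand-in, not Views' actual code): a lazily created table that was never written to holds no rows, so "missing table" and "empty result set" are equivalent for display purposes.

```php
<?php
// Hedged sketch of #39's suggestion: treat "table does not exist" as an
// empty result set instead of failing. executeViewQuery() is a
// hypothetical helper; Views' real execution path differs.
function executeViewQuery(PDO $pdo, string $sql): array {
  try {
    return $pdo->query($sql)->fetchAll(PDO::FETCH_ASSOC);
  }
  catch (PDOException $e) {
    // A lazily created table that was never written to has no rows,
    // so report an empty result rather than an error.
    return [];
  }
}

$pdo = new PDO('sqlite::memory:');
$pdo->setAttribute(PDO::ATTR_ERRMODE, PDO::ERRMODE_EXCEPTION);
var_dump(executeViewQuery($pdo, 'SELECT * FROM watchdog'));
```

A real implementation would want to swallow only the missing-table error (e.g. via an isTableMissingException()-style check), since this blanket catch also hides genuine query failures.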
Comment #40
chx commented
Had a massive discussion with catch; summary updated accordingly. I believe this works even for the field case, and it definitely works for Views. Now, can we get some PostgreSQL folks to implement isTableMissingException() quickly?
Comment #41
amateescu commented
Here you go :)
Comment #42
plach commented
Why not just return $e->getCode() === '42P01'; ?
Comment #43
amateescu commented
Because I'm dumb? :P
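For context on plach's one-liner: PDO exposes the SQLSTATE via getCode(), and PostgreSQL reports a missing relation as SQLSTATE 42P01 ("undefined_table"). A sketch of the driver-level check (the method name isTableMissingException comes from the patch under discussion; here it is shown as a bare function for illustration):

```php
<?php
// Sketch of the check suggested in #42. PDOException::getCode() yields
// the SQLSTATE string for driver errors; PostgreSQL uses 42P01
// ("undefined_table") for a missing relation.
function isTableMissingException(PDOException $e): bool {
  return $e->getCode() === '42P01';
}

// A generic exception (default code 0) is not a missing-table error;
// the strict string comparison rejects it.
var_dump(isTableMissingException(new PDOException('some other failure'))); // bool(false)
```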
Comment #44
yched commented
@chx @catch: great to hear about the new plan!
What I don't get though is:
Which SchemaProviders do we call exactly? How do we find them?
Comment #45
chx commented
There is no interdiff here; the change is too big, so treat it as a completely new patch. amateescu, thanks for the pgsql patch.
The execute method debate is decided by patching query in the connection class instead.
Edit: re #44 we find them in the backtrace.
Comment #47
chx commented
I didn't use the options carefully prepared in config and caught the wrong exception twice. Also, two minor leftovers removed. Drupal standard now installs in peace. Whether the tests pass, I do not know :)
Comment #49
dawehner commented
- We talked about making that a little bit more explicit: 'create_missing_table'.
- We should explain why returning early is okay: this basically optimizes for SELECT queries, right?
- Let's get rid of cuf.
- What about using the += pattern here as well? ... Oh, and by the way, on_table_exception seems to be an older name for on_table_missing?
- Is there a reason to have this method even though it is not used at all? (Neither storm nor myself could find a use of it.)
- ... old documentation.
- Let's document it ... in general it is confusing for it to be NULL by default; what about an empty string? It should document that most of the time you can pretty much ignore the table name, unless you need to support multiple tables.
- This is a bit confusing: what about documenting that just setting cache entries does not save cache tags; that only happens if someone invalidates them.
Comment #50
chx commented
Comment #52
chx commented
Ah yes, MenuTreeStorageTest asserts that reading creates the table. That no longer happens, so I refactored the test slightly to save instead.
Comment #53
yched commented
Hate to be the downer here, but I can't say I'm in love with the "fetch missing info from wherever we can find it higher in the call stack" thing.
- It feels like uncommon/surprising black magic.
- It feels brittle too: many things can be in the call stack, including objects that happen to be SchemaProviders for unrelated systems. Since it seems a lot of simple systems won't check the table name param, you can end up creating the wrong table?
- De facto, it moves from (previous approach) "only code that knows the right SchemaProvider can query a lazy table" to (current approach) "only the SchemaProvider can query a lazy table". I don't see the gain; it's just more constraining? The previous approach (explicitly pass the SchemaProvider for the query) already had that "only the subsystem can query its tables" aspect, but more flexibly and without black magic.
- If joins are still not supported, we haven't gained in terms of use cases?
Won't fight this if everyone else likes the idea, but I'm not too yay myself :-)
Comment #54
amateescu commented
Edit: Stepped into a different bug.
Comment #55
dawehner commented
@amateescu
Are you sure you don't run into the following bug? #2349441: Regression: 'Front page route not found' Exception when installing Drupal against an existing database
Comment #56
amateescu commented
@dawehner, yup, that was the problem. Manual installation works fine on MySQL but fails on Postgres:
Edit: I can try looking into this a bit later today if no one beats me to it.
Comment #57
chx commented
@yched we can fix that. We were very close to fixing that, in fact: the database passes in the table name already, and the caller checks whether there is a schema returned for that table name. Cache already returns a schema if it is something it is interested in. All we need to do is change the interface to make the table name required and make the implementations check it. In fact I had this coded, and this is why I switched from $bt['class'] to $bt['object']: this way getSchema() can look at $this->table / $this->bin. But I backed out because I saw MenuTreeStorage calling $this->getSchema() and I thought that we could have our cake and eat it too, but no. You are right, this is brittle, so MenuTreeStorage will need to pass in the table name. That's really not a big deal, because the win is gigantic: query strings work without bothering to pass the SchemaProvider in to query/queryRange/queryTemporary.
Comment #58
chx commented
Edit: nevermind.
Comment #59
chx commented
Edit: nevermind. Double post, somehow.
Comment #61
chx commented
@amateescu, is this enough to fix pgsql? No doubt this is necessary, because pgsql overrides the query() function without calling the parent (oopsie); I am just not sure whether it's enough.
Comment #62
amateescu commented
@chx, nope, doesn't seem to be enough. I also tried to debug it for a few hours but didn't get anywhere... I'll try again tomorrow.
Comment #63
amateescu commented
Almost there :)
There are multiple problems with Postgres here:
1) In Drupal\Core\Database\Driver\pgsql\Insert::execute(), we set the return option to Database::RETURN_NULL, which makes the subsequent handling fail because $handled will be NULL on an insert into a cache table. Added a hacky workaround to return an array instead and just use its first value.
2) Since we've removed various pieces of code that ensured tables were created before running INSERT queries, we now have to extend the fixes/workarounds from #2181291: Prevent a query from aborting the entire transaction in pgsql to Drupal\Core\Database\Driver\pgsql\Insert::execute() as well as handle it inside Drupal\Core\Database\Driver\pgsql\Connection::query().
3) I'm now getting an error which appears to be a general problem with inserting serialized PHP data into a bytea column, so I wonder how the installation even works without this patch.
Edit: to answer myself - it works by magic, and I see that cache_discovery gets populated with serialized data just fine :/
To be continued tomorrow...
Comment #64
chx commented
Discussed with @amateescu yesterday after he realized that the PostgreSQL insert relies on the schema. Here's the plan:
Comment #65
amateescu commented
Except that the info from 1. (a $schema array) is not what's necessary for insert :) But that's no biggie, we can just query information_schema again.
Anyway, the plan above allowed me to make some progress. A lot more tables are created now but some are still not, like cache_data, cache_menu, cache_render and menu_tree.
This interdiff is against #61 if anyone wants to play with it during the weekend.
Comment #66
Crell commented
I have to say, trying to crawl up the call stack to get to an object with a given interface strikes me as a "holy crap, why would you do such a thing?" action. That's relying on all sorts of implicit behavior and statefulness that feels like it's going to blow up in our faces, or more likely someone else's face.
To whatever extent a subsystem maintainer can veto something, I do veto this approach.
The original approach seemed far more contained and reasonable to me. The only caveat is Views, which to me seems like a non-issue. If the table isn't there, there's nothing to show anyway so who cares? Let's solve the 80% case, not over-complicate it with language black magic (which has performance concerns, too; generating the call stack is not a fast operation) to try and cover every possible edge case.
Comment #67
chx commented
Supporting query strings is not an edge case, and there you have problems doing this any other way. Performance concerns are likely invalid, since this is a very rare occurrence coupled with one of the slowest possible database operations: creating a table.
Comment #68
chx commented
But I have said everything I had to say multiple times now. I have carried this issue this far; if people don't like it, that's not really a problem of mine. Without any regrets, I am unfollowing this issue. MongoDB now carries a little DB driver anyway to facilitate an easy install, and it has a Schema class that black-holes every call to it anyway. If people want simpler and speedier tests they can continue with either the latest or earlier patches. I am out.
Comment #69
dawehner commented
As a compromise, can we replace the auto-detection of the used class and pass along the schema provider as part of the $options? All of $connection->query(), $connection->select(), $connection->update() etc. provide it.
Comment #70
catch commented
If we did it as part of $options, that gives us a potential route to supporting more than one table/schema provider (or the table not being the base table) later on, as well as static queries. I'm thinking about an eventual situation where Views handles locating the schema provider, if one exists, for each table in a query and adds it to $options.
Views isn't the only issue here; EntityQuery is almost as problematic (except that it's at least linked to the API dealing with the schema, whereas Views isn't at all, so EntityQuery probably has a closer route to discovering that information).
Using the backtrace isn't a performance concern for the reasons chx mentioned (we're already doing a DDL operation if we get to that point). I agree it's weird, but so is an extra execute() method; $options does seem like a decent compromise between the two.
Comment #71
Crell commented
Using the options array makes more sense, I agree. It's not quite what it was originally intended for, but such expansion is why it's an array rather than just a primary/replica boolean. Let's see if we can make that work.
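The compromise from #69-#71 can be sketched as follows. The option name 'schema_provider' and the SchemaProviderInterface name come from the discussion above; everything else (the Connection wrapper, the table_name option, the PDO/SQLite backing) is an illustrative assumption, not the patch's actual code:

```php
<?php
// Hedged sketch of passing the schema provider via $options: the
// connection consults it only when a query fails, and only if the
// caller opted in.
interface SchemaProviderInterface {
  // Returns the DDL for $table_name, or NULL if the table is not ours.
  public function getSchema(string $table_name): ?string;
}

class Connection {
  public function __construct(private PDO $pdo) {}

  public function query(string $sql, array $args = [], array $options = []): PDOStatement {
    try {
      $statement = $this->pdo->prepare($sql);
      $statement->execute($args);
      return $statement;
    }
    catch (PDOException $e) {
      // Only consult a provider when one was passed in $options.
      $provider = $options['schema_provider'] ?? NULL;
      $ddl = $provider?->getSchema($options['table_name'] ?? '');
      if ($ddl === NULL) {
        throw $e;
      }
      $this->pdo->exec($ddl);
      $statement = $this->pdo->prepare($sql);
      $statement->execute($args);
      return $statement;
    }
  }
}

// Usage: a cache-bin-like provider that knows a single table.
$provider = new class implements SchemaProviderInterface {
  public function getSchema(string $table_name): ?string {
    return $table_name === 'cache_render'
      ? 'CREATE TABLE cache_render (cid TEXT PRIMARY KEY, data BLOB)'
      : NULL;
  }
};

$pdo = new PDO('sqlite::memory:');
$pdo->setAttribute(PDO::ATTR_ERRMODE, PDO::ERRMODE_EXCEPTION);
$connection = new Connection($pdo);
$connection->query(
  'INSERT INTO cache_render (cid, data) VALUES (?, ?)',
  ['route:front', 'serialized-data'],
  ['schema_provider' => $provider, 'table_name' => 'cache_render']
);
echo (int) $connection->query('SELECT COUNT(*) FROM cache_render')->fetchColumn(); // prints 1
```

Queries that pass no provider keep today's behavior (the exception propagates), which is why this reads as a compromise between backtrace magic and a separate executeEnsuringTable() method.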
Comment #72
tstoeckler commented
Spot on. Entity Query already has the storage injected, so it can get the schema quite easily.
Comment #73
dawehner commented
@amateescu
Great work!! Did you manage to make a little bit more progress in the meantime?
Here is a first try, based upon the last patch in #61, to implement the idea mentioned in #69.
After fixing a couple of calls to $connection->select() in MenuTreeStorage, this could actually pass.
Comment #74
amateescu commented
@dawehner, nope. And now I'll wait until the new approach/patch is RTBC before doing any other Postgres fixes.
Comment #75
dawehner commented
Cool that you agree; having a conflict here would be bad.
It does? Afaik this should be part of \Drupal\Core\Entity\Query\Sql\Query, but I'm not really sure whether I see the entity storage at the moment; can you enlighten me?
On the other hand, I'm trying to understand what the point of that is. We could just specify $options['create_missing_table'] = TRUE; which would return empty results in both Views and entity query and be done. There is no real requirement to create the tables for that.
Comment #77
dawehner commented
It's green again.
Updated the issue summary to include the new idea.
dawehner passes the ball to @amateescu.
Comment #78
amateescu commented
Thanks, but, as I said earlier, I'll resume working on this only when it's RTBC or at least fully reviewed and agreed upon by the db maintainers.
Comment #79
yched commented
The IS still contains "climb the backtrace and call the SchemaProvider interfaces..."
Also, it's not clear what the status of selects with JOINs is in the latest approach? (Asking for entity/fields in EFQ/Views.)
Comment #80
dawehner commented
To be honest, I'm personally okay with not supporting every possible use case.
Updated the IS a bit more.
Comment #81
yched commented
Sure, as I said earlier, I absolutely do not intend to block this on supporting every use case. Just trying to have a clear view of what's supported and what's not, because that's the difference between "this can be used by self-enclosed systems like cache or history" and "this can be leveraged by entity storage".
Comment #82
dawehner commented
So @yched, do you agree with the patch as it is? We need an RTBC (which is odd, because that is the wrong state) so that @amateescu picks it up and tries to fix the PGSQL problems.
Comment #83
amateescu commented
Well, let me clarify that a bit. I'm waiting for an 'unofficial' RTBC (as in, not the issue status) from people who want this patch to go in a specific direction or have strong opinions about it, like @catch, @Crell or @yched.
I hope no one gets this the wrong way, it's just that there are a lot of other patches to work on instead of learning Postgres (which I installed and used for the first time just a couple of weeks ago to help @chx and test that the error code from #41 works as expected).
Comment #84
Crell commented
I think I noted this earlier, but may not have been clear about it... we don't need to create the table lazily on delete, since we don't need it to be there for the post condition to be satisfied. However, we DO need to catch a table-not-found case and continue gracefully without letting the exception bubble up. I'd suggest just catching the exception outside the foreach and swallowing it.
I think the same applies to a few other cases below.
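Crell's suggestion — skip lazy creation on delete and just swallow the missing-table exception outside the loop — could be sketched roughly like this. This is an illustrative sketch, not code from the patch; the function name and chunking are made up, and it assumes Drupal's DatabaseExceptionWrapper is what bubbles up.

```php
<?php

use Drupal\Core\Database\Connection;
use Drupal\Core\Database\DatabaseExceptionWrapper;

/**
 * Deletes rows without lazily creating the table (hypothetical sketch).
 *
 * If the table was never created there is nothing to delete, so the
 * post condition is already satisfied and the exception is swallowed.
 */
function delete_expired(Connection $connection, string $table, array $ids): void {
  try {
    foreach (array_chunk($ids, 100) as $chunk) {
      $connection->delete($table)
        ->condition('id', $chunk, 'IN')
        ->execute();
    }
  }
  catch (DatabaseExceptionWrapper $e) {
    // Table does not exist: continue gracefully, per #84. Note the catch
    // sits outside the foreach, so one missing table aborts the whole loop
    // silently instead of retrying per chunk.
  }
}
```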
This doesn't say where the table definition comes from. I assume it's from schema API but that's not at all clear. But isn't the idea to allow the schema to be passed in, not pulled from a static hook_schema definition?
Also, I do not understand the TRUE meaning. If a query fails an exception gets thrown, so the return value is irrelevant. Don't load too much meaning into a single property.
Also, schema_provider is not documented?
$handled should never be null. From the code here I don't see how it should ever be anything but a boolean. The code below says it can also be a statement, but that's still an object. Just don't let handleMissingTable ever return null, period.
$options['create_missing_table'] should never be empty. defaultOptions() should ensure that. If this is a boolean check just do that.
I really don't get the point of create_missing_table at all. If the schema provider defines one table, we're done. If it defines multiple... wouldn't we need all of them? This feels seriously over-complex for what should be a simple task.
Why is this not a method on Connection instead, where it could be overridden per driver if needed? Database should just be the connection management and utility facades; this is not just a utility facade; it's internal behavior of a database driver.
As above.
There's a lot of these lines. Does that mean this code was all that buggy before...?
womp womp.
I agree with the overall direction, but the internals feel very over-complicated at present.
I would see this as simply needing a schema_provider option. In case of table-not-found, install everything the schema_provider gives you and try again. If it still doesn't work or one wasn't provided, just exception as normal.
I can only assume that the double and undocumented keys here are for complex cases like Field API, but I believe the setup here puts the complexity in the wrong place. There should be no parameters on ProviderInterface::getSchema(). It should return all tables to be created. In a more complex case like Field API where the tables might be conditionally named, well, then you don't make the calling object itself a provider. Just instantiate one with appropriate constructor parameters. Something like (mostly pseudo-code):
FieldProvider can then provide all of the necessary tables for that given field (or whatever) based on $field_name (or whatever other setup happens other than getSchema()). That's a totally legit thing to do, and cleaner than optimizing for the degenerate case of Foo being its own provider. (If it can be, that's cool, but we shouldn't count on that always being the case.)
That lets us separate out the schema definition logic to its own object, keep the Foo class or whatever it represents clean, and keeps the handling code inside the DB API shorter and with fewer overloaded properties. The presence of schema_provider itself triggers the "on failure do this and try again" logic, and its absence means... do exactly what happens now.
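Crell's "mostly pseudo-code" did not survive extraction, but the proposal in the surrounding text can be sketched as follows. All names here (SchemaProviderInterface, FieldProvider, the table-name pattern) are hypothetical illustrations of the idea, not the actual patch code.

```php
<?php

// Hypothetical sketch of #84: getSchema() takes no parameters and returns
// ALL tables the provider is responsible for.
interface SchemaProviderInterface {

  /**
   * Returns all table definitions this provider is responsible for.
   *
   * @return array
   *   An array of schema definitions, keyed by table name.
   */
  public function getSchema(): array;

}

// Complex case (Field API-style): the table names depend on runtime state,
// so the provider is configured through its constructor rather than via
// parameters on getSchema().
class FieldProvider implements SchemaProviderInterface {

  public function __construct(protected string $fieldName) {}

  public function getSchema(): array {
    return [
      'field_data_' . $this->fieldName => [/* column definitions */],
      'field_revision_' . $this->fieldName => [/* column definitions */],
    ];
  }

}

// The query then just carries the provider along; its presence triggers the
// "on failure, install everything and try again" logic:
// $options['schema_provider'] = new FieldProvider($field_name);
```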
Comment #85
Crell commented
Comment #86
dawehner commented
Thank you @Crell for your review!
Note: given that we have both create_missing_table and schema_provider, we don't create tables on delete calls.
Worked a bit on the documentation here.
At least in my version of the code, in case you have a different exception ... you will return NULL;
Yeah, you are right here.
Fixed it.
Well, we want to at least support the cache backends ... which have a dynamic table name.
Comment #87
amateescu commented@Crell is right, $handled should never be null. that's a problem that we need to fix anyway for pgsql, see why in #63.
Also:
Wouldn't that be fixed by something like this?
Comment #89
Crell commented
Dynamic table name, sure. Then getSchema() can return a table definition with the right name. If it's a method on the backend itself, it has access to $this->bin. If it's a separate object, pass $this->bin to that object's constructor. Problem solved.
Really, this should be a lot easier than the last patch was making it out to be. :-)
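Applied to the cache backend's dynamic table name from #86/#89, that could be as small as the following. This is a hypothetical sketch: the interface name, the simplified schema array, and the `cache_` prefix are illustrative, not taken from the patch.

```php
<?php

// Sketch of #89: a cache backend acting as its own schema provider for a
// dynamically named table. getSchema() needs no parameters because the
// backend already holds the bin name.
class DatabaseBackend /* implements SchemaProviderInterface */ {

  public function __construct(protected string $bin) {}

  public function getSchema(): array {
    return [
      // The dynamic table name is derived from constructor state.
      'cache_' . $this->bin => [
        'fields' => [
          'cid' => ['type' => 'varchar_ascii', 'length' => 255, 'not null' => TRUE],
          'data' => ['type' => 'blob', 'size' => 'big'],
          'expire' => ['type' => 'int', 'not null' => TRUE, 'default' => 0],
        ],
        'primary key' => ['cid'],
      ],
    ];
  }

}
```

The schema array above is a deliberately trimmed-down stand-in for the real cache bin schema.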
Comment #90
yched commented
Not opposed to that, but just trying to clarify what it would mean when applied to entities/fields.
This means that the first request on an entity type, regardless of the actual tables being requested, creates all the base tables and all the per-field tables for the configurable fields that exist at that point in time.
1) In practice it probably means little actual perf gain for tests, since we create all field tables anyway.
If not for test perf, there's IMO little interest in bothering to move the entity/field storage to the lazy model?
2) Say some new fields are created in the UI after that.
- Their tables are not created immediately, since in that scenario we moved to the "lazy" model.
- They are created by the next first request involving one of those fields, which triggers a SchemaProvider call
- The SchemaProvider returns all the previous tables + the new ones for the fields that were created since the last request that had to call the SchemaProvider.
- This means the "lazy create" supporting code needs to check which tables already exist, so that we don't create them again.
(We also have spent a bit more time re-computing schemas we don't need for tables that already exist - no biggie, since that is a rare runtime occurrence, but that still feels like a clustered, non-cached version of hook_schema())
So, as compared to "SchemaProvider::getSchema() receives the missing table name and only returns the schema for that one":
- some complexity moves from one place to the other, but can't be fully avoided?
- "tables appearing in bursts depending on the sequence of field creations and requests being executed" feels a little more confusing/unpredictable than "a table appears when it is requested".
- it's much less interesting for test perf.
Then again, it's still not clear to me whether the system designed here intends to support missing tables in JOINs. If it doesn't, then entities/fields are out of scope to begin with, and the above is irrelevant.
Comment #91
catch commented
Would an array of schema providers in $options help the fields case? It feels like that might be enough to support multiple tables. Of course entity/field then needs to figure out what that array is supposed to be, but on the exception it could iterate each schema provider to check. That would be fewer tables at a time than every single field on an entity, for the case yched brings up.
Comment #92
Crell commented
Perhaps I'm not understanding Field API's usage patterns properly. Wouldn't it be querying Field-at-a-time, and so only creating the tables for a given field? (Remember, we only need to bother creating tables on write, not on read, since for a SELECT we'd still end up with empty data anyway in the end so no sense mucking with the schema.)
Comment #93
yched commented
@Crell
Not good enough, unfortunately, because a *LEFT* JOIN on a table with no data does not mean the result set is empty.
Imagine the following case:
- You have a View that lists nodes. Let's say the current set of nodes is such that the View actually returns results.
- You add a new field to a node type, and change the view so that it filters on "nodes WHERE that new field is empty".
- With current HEAD, the new field table is created immediately. It is empty, but the Views query LEFT JOINs to it, so the View still returns the same result set as before. That is correct: there *are* nodes, and the new field is "empty" on them.
- With the current proposed approach, the View is suddenly empty. Until someone actually adds one field value in one arbitrary node (doesn't even have to be one of the nodes selected by the View), which creates the table, and the View is back to returning all its previous nodes. That would feel very broken :-)
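The scenario above can be made concrete with a query sketch. The table and field names below are illustrative, not from core; the point is that if `node__field_new` does not exist, the whole SELECT throws, so "treat the missing table as empty" cannot simply return the base-table rows.

```php
<?php

use Drupal\Core\Database\Connection;

// Sketch of the Views-style query from the scenario: list nodes WHERE the
// new field is empty. A LEFT JOIN to an empty table still returns all
// base rows; a LEFT JOIN to a *missing* table throws instead.
function nodes_with_empty_field(Connection $connection) {
  $query = $connection->select('node_field_data', 'n');
  $query->leftJoin('node__field_new', 'f', 'n.nid = f.entity_id');
  $query->fields('n', ['nid', 'title'])
    ->isNull('f.field_new_value');
  return $query->execute();
}
```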
An approach where:
- the SchemaProvider is responsible for a set of tables, and receives the name of the table that needs to be created,
- and the query can specify an array of SchemaProviders (e.g for our case one per entity type involved in the query); when a table is deemed missing, they are called in turn until one of them says "yep, that's mine" and actually returns a schema.
would seem to solve that.
That also means the "create on table not exists" logic would need to account for all the tables that are part of the query, and possibly behave differently based on the type of JOIN ?
[edit: hm, not completely sure that "behave differently depending on the join" part would actually be needed]
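The resolution loop yched describes — hand each provider the missing table name until one claims it — might be sketched as follows. This is hypothetical: the provider objects and their `getSchema(string $table): ?array` signature are assumptions for illustration, not the actual patch.

```php
<?php

use Drupal\Core\Database\Connection;

/**
 * Asks each schema provider in turn to supply the missing table (sketch).
 *
 * @return bool
 *   TRUE if a provider claimed the table and it was created, so the caller
 *   can retry the query; FALSE if the original exception should bubble up.
 */
function ensure_missing_table(Connection $connection, string $table, array $providers): bool {
  foreach ($providers as $provider) {
    $schema = $provider->getSchema($table);
    if ($schema !== NULL) {
      // "Yep, that's mine": create it and let the caller rerun the query.
      $connection->schema()->createTable($table, $schema);
      return TRUE;
    }
  }
  return FALSE;
}
```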
Comment #94
yched commented
Restating this to be extra clear: I'm just trying to outline what (I think) it would take to be able to leverage that lazy-create feature for entities/fields because, in other issues, @catch and @alexpott seemed to expect that it will solve a couple of race conditions on entity/field table creation that currently require weird workarounds in some tests. And also, yes, it's likely that it would make our tests run faster.
I don't intend to block a simpler version from getting in, and I'm perfectly fine with a decision of "yeah, too complex, screw entities/fields, let's just have a nice lazy-creation mechanism that works great for simpler APIs".
Or "let's get a simple version in first, and possibly enrich it to account for more complex uses in a later issue" (which would then imply BC break concerns, of course).
In other words: not my call :-)
Comment #95
kgoel commented
Need to address #84 and #87. Just a re-roll right now, since the patch didn't apply.
Comment #96
daffie commented
For the testbot.
Comment #98
kgoel commented
My bad for not adding the "do not test" suffix, as the patch was going to fail anyway. Working on addressing #84 and #87.
Comment #99
kgoel commented
Probably this is not useful, but posting the patch anyway.
Comment #100
dawehner commented
Let's always send it to the testbot.
Comment #101
yched commented
Given that the discussion has stalled since then, I'd vote for doing this :-/
Comment #102
bzrudi71 commented
Seems that this patch would help us fix some PostgreSQL-related issues while in a transaction. Didn't read the whole story, but +1 for this approach here! Adding the PostgreSQL issue as related.
Comment #103
amateescu commented
@bzrudi71, you might also be interested in comments #62, #63 and #65, where I was trying to make this patch work for PostgreSQL, before the patch started to drift in a different direction.
Comment #104
dawehner commented
Tagging; it's not trivial, btw.
Comment #105
almaudoh commented
Rerolled. Took three steps and an hour. Small changes due to some changes in SQL queries in HEAD. The patch is smaller due to removals related to caching system changes.
Comment #107
almaudoh commented
Not really read the code yet, but shouldn't this be $schema[$table_name]?
Comment #108
almaudoh commented
The main cause of the failures: an inconsistency between the implementations of 'create_missing_table' and 'missing_table_name'.
Comment #109
almaudoh commented
Fixed #107 and #108.
Comment #113
basic commented
This patch is causing testbot issues and not succeeding on the bots. Please review/re-roll the patch before submitting again.
Comment #115
almaudoh commented
I wonder why testbot keeps re-queuing it :(
Comment #116
almaudoh commented
Re-rolled and fixed a stray debug...
Comment #119
jthorson commented
Comment #120
almaudoh commented
Reroll.
Comment #121
almaudoh commented
Comment #125
dawehner commented
The more I think about this particular issue, the more I think it is wrong to do this in this subsystem. The query parts are separated from the schema at the moment, and this patch would break that.
Comment #126
Crell commented
I'm pretty sure this won't happen until 8.1 anyway...
dawehner: Can you clarify? You access the schema through the connection object already anyway.
Comment #127
dawehner commented
Fair, but I still think that it connects two subsystems which ideally would not know about each other.
Comment #128
Crell commented
You mean the DB connection and the schema management? How would those be fully decoupled? The schema needs a connection in order to do anything...
Comment #129
fgm commented
I /think/ the idea is that the schema needs a storage (hence a connection) to implement its operations, but the schema mechanism itself should not need it.
Comment #137
andypost commented
Looks like the issue brings some confusion. In #2664830-72: Add search capability to help topics
it is not clear which schema management to use: hook_schema vs. a definition from a service's method.
Comment #142
catch commented
Comment #143
catch commented
Comment #147
andypost commented
Comment #151
bhanu951 commented
Tried to re-roll the patch from #120 to the 11.x branch.
Comment #152
bhanu951 commented
Comment #153
smustgrave commented
The reroll seems to have test failures.
Comment #155
daffie commented
I have updated the IS and created a CR.
Comment #157
daffie commented
The MR is ready for review.
Comment #158
nicxvan commented
Oh, I've had my eye on this; it will make the hook_schema deprecation much nicer.
Comment #159
andypost commented
Added a suggestion.
Comment #160
daffie commented
@andypost: Thank you for the review.
Changing the class variable to a readonly variable and setting its value is not allowed in PHP. From the PHP documentation:
See: https://www.php.net/manual/en/language.oop5.properties.php
I have changed the variables to be of the string type.
Comment #161
catch commented
Out of interest, why could we not use constants for this case? The classes should be able to override those.
Comment #162
daffie commented
@catch: In some cases it can be a constant; in other cases it cannot, like with Drupal\Core\Menu\MenuTreeStorage, Drupal\Core\Routing\MatcherDumper and Drupal\Core\KeyValueStore\DatabaseStorage. In those cases the table name is set in the class constructor.
Comment #163
mondrake commented
A bit late to the party... but why not use events instead of a trait? #3410480: [META] Use events in Database Schema operations
For example (conceptual, not tested):
from
to
OR
from
to
EDIT: ... or, else, how about a SchemaGenerator service with methods similar to the two events above?
Comment #165
mondrake commented
A PoC for #163 in MR!10397.
Comment #166
mondrake commented
Hm... looking at the MR now, I'm starting to figure out that we could simplify further and let the dynamic query layer handle this:
Comment #167
catch commented
#163-#166 are incredibly concise, so if we can make those work, that is very tempting.
Comment #168
mondrake commented
#166 would have BC concerns (db drivers would have to match the behavior of the onFailureEnsureSchema() method on all classes extending from Query), so maybe for now we can settle on a proxy of that.
Comment #171
mondrake commented
MR!10403 is for review. It includes the framework and some conversions, but not all of them. Once/if the framework is agreed upon, we can easily complete the remaining ones.
An interesting next step would be to pass the schema as a value object instead of an array, but there are other issues for that.
Comment #172
daffie commented
Comment #174
mondrake commented
Thanks for the review, @daffie! Let's continue with the new approach then. Before completing the conversions: add tests for Connection::executeEnsuringSchemaOnFailure().
Comment #175
mondrake commented
MR!10403 is now doing #163, deprecates/converts all uses of ::ensureTableExists() and ::catchException(), and is green on all databases. A review at this stage would be helpful.
#166 is just a piece of cake away with this underlying layer, so I will push that forward.
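For orientation, a call site converted away from the ensureTableExists()/catchException() pattern might look roughly like this. Connection::executeEnsuringSchemaOnFailure() is named in comment #174, but the signature and usage below are assumptions for illustration, not taken from the MR.

```php
<?php

// Before (the old pattern): try the insert, catch the exception, call
// ensureTableExists(), then rerun the insert.
//
// After (sketch): the connection wraps the operation and, on a missing-table
// failure, dispatches a schema request built from the supplied definition,
// then retries the callable.
$result = $this->connection->executeEnsuringSchemaOnFailure(
  fn () => $this->connection->insert('flood')
    ->fields($fields)
    ->execute(),
  ['flood' => $this->schemaDefinition('flood')],
);
```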
Comment #176
mondrake commented
Now Flood\DatabaseBackend implements #166, and if this is OK we need to convert the rest where possible (i.e. calls to dynamic queries extending Query).
Pausing now to let reviewing happen.
Comment #177
mondrake commented
Filed #3492391: Make the event dispatcher available before container full bootstrap as a possible follow-up.
Comment #178
daffie commented
@mondrake: It looks like a beautiful solution. My problem is that the solution will not work with MongoDB. The solution relies on the database throwing an exception when the table does not exist. However, that is not what MongoDB does. MongoDB just creates a table when one does not exist on an insert. A table in MongoDB is called a collection. It will be a table with no validation added to make sure that it only allows data in the right form to be added. See: https://www.mongodb.com/docs/manual/reference/method/db.collection.inser....
The reason I started to work on this issue was to add a hook to allow the database driver to change the table schema. The database driver for MongoDB stores timestamps as MongoDB\BSON\UTCDateTime instead of an integer value, as is done by the core-supported databases.
If we could make it work for MongoDB, that would be great. I will need some time to think about how I can make it work with MongoDB. If you have some ideas or suggestions, then please speak up.
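One conceivable direction for the MongoDB case (the thread later discusses exactly this): the MongoDB driver registers its own higher-priority subscriber for the schema-ensuring event, does its own schema translation, and stops propagation so core never processes the request. The event class name comes from this thread, but its namespace and API here are assumptions; this is a rough, unverified sketch.

```php
<?php

use Symfony\Component\EventDispatcher\EventSubscriberInterface;
// The namespace below is an assumption; only the class name appears in
// the thread.
use Drupal\Core\Database\Event\ExecuteMethodEnsuringSchemaEvent;

class MongodbSchemaRequestSubscriber implements EventSubscriberInterface {

  public static function getSubscribedEvents(): array {
    // Register with a higher priority than core's SchemaRequestSubscriber
    // so this listener runs first.
    return [
      ExecuteMethodEnsuringSchemaEvent::class => ['onSchemaRequest', 100],
    ];
  }

  public function onSchemaRequest(ExecuteMethodEnsuringSchemaEvent $event): void {
    // MongoDB creates collections implicitly on insert, so instead of
    // creating a table, translate the requested schema as needed (e.g.
    // integer timestamps to MongoDB\BSON\UTCDateTime).
    // ...
    // Stop propagation so core's subscriber does not also process it.
    $event->stopPropagation();
  }

}
```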
Comment #179
daffie commented
Wrongly assigned the issue to @acbramley.
Comment #180
mondrake commented
@daffie, well, that's why I suggest using events; that gives us the possibility to extend/override how they are processed. At least conceptually, the MongoDB driver could implement a subscriber/listener for the ExecuteMethodEnsuringSchemaEvent, make it so that it processes the event before core's SchemaRequestSubscriber does, do its alternative processing of the schema request, and stop propagation of the event so that core no longer processes it afterwards. Of course, this needs to be proven in practice.
Comment #181
daffie commented
Comment #182
mondrake commented
Postponed on #3492391: Make the event dispatcher available before container full bootstrap
Comment #184
catch commented
Adding the issue this is postponed on to the issue summary.