TaxonomyVocabularyHierarchyUpdateTest::testTaxonomyUpdateParents() is prone to breaking on PostgreSQL [#3031479]

Problem/Motivation

In #2957381: Data model problems with Vocabulary hierarchy we added an update path test which tries to verify that an index exists by looking up its name. However, there are two problems with that approach:

- the real intention of the test was to check whether an index with a given set of columns exists on a table;
- on PostgreSQL, if the name of an index exceeds 63 chars (the identifier limit) and its table is renamed, the new name of the index can not be determined in any way.

Proposed resolution

Add a database schema introspection test trait and use it in \Drupal\Tests\taxonomy\Functional\Update\TaxonomyVocabularyHierarchyUpdateTest::testTaxonomyUpdateParents().

Remaining tasks

Review.

User interface changes

Nope.

API changes

A new trait is available for tests that need to check whether an index with a given set of columns exists.

Data model changes

Nope.

Release notes snippet

Nope.

Comment	File	Size	Author
#26	3031479-26-followup.patch	913 bytes	amateescu
#26	8.7.x: PHP 7.2 & MySQL 5.5 24,993 pass PHP 7.1 & PostgreSQL 9.1 24,993 pass PHP 7.1 & SQLite 3.15 24,993 pass
#18	interdiff-18.txt	6.44 KB	amateescu
#18	3031479-18.patch	16.83 KB	amateescu
#18	8.7.x: PHP 7.2 & MySQL 5.5 24,994 pass PHP 7.1 & PostgreSQL 9.1 24,990 pass PHP 7.1 & SQLite 3.15 24,990 pass
#13	interdiff-13.txt	10.25 KB	amateescu
#13	3031479-13.patch	13.69 KB	amateescu
#13	8.7.x: PHP 7.2 & MySQL 5.5 24,960 pass PHP 7.1 & PostgreSQL 9.1 24,960 pass PHP 7.1 & SQLite 3.15 24,960 pass
#6	interdiff-6.txt	10.62 KB	amateescu
#6	3031479-6.patch	10.78 KB	amateescu
#6	8.7.x: PHP 7.2 & MySQL 5.5 24,960 pass PHP 5.5 & MySQL 5.5 24,606 pass PHP 5.5 & PostgreSQL 9.1 24,603 pass PHP 5.5 & SQLite 3.8 24,602 pass PHP 7.1 & PostgreSQL 9.1 24,957 pass PHP 7.1 & SQLite 3.15 24,957 pass
#2	3031479.patch	4.96 KB	amateescu
#2	8.7.x: PHP 7.2 & MySQL 5.5 24,957 pass PHP 7.1 & PostgreSQL 9.1 24,957 pass PHP 7.1 & SQLite 3.15 24,957 pass

Support from Acquia helps fund testing for Drupal Acquia logo

Comments

Comment #1

7 February 2019 at 23:36

amateescu created an issue. See original summary.

Comment #2

amateescu CreditAttribution: amateescu for Pfizer, Inc. commented 7 February 2019 at 23:37

Status:

Active

» Needs review

File	Size
3031479.patch	4.96 KB
8.7.x: PHP 7.2 & MySQL 5.5 24,957 pass PHP 7.1 & PostgreSQL 9.1 24,957 pass PHP 7.1 & SQLite 3.15 24,957 pass

This should do it.

Comment #3

jibran

he/him

Urdu

Sydney, Australia

CreditAttribution: jibran as a volunteer commented 8 February 2019 at 03:43

+++ b/core/tests/Drupal/Tests/Core/Database/SchemaIntrospectionTestTrait.php
@@ -0,0 +1,92 @@
+  protected function getIndexColumnNames($table_name) {

This function doesn't need to be in a test trait. Maybe move it to some helpful location and call it in assert methods?

Comment #4

amateescu CreditAttribution: amateescu for Pfizer, Inc. commented 8 February 2019 at 07:29

I don't think it needs to be public API yet, that's why I put it in a trait.

Comment #5

jibran

he/him

Urdu

Sydney, Australia

CreditAttribution: jibran as a volunteer commented 8 February 2019 at 07:42

Status:

Needs review

» Reviewed & tested by the community

Let's see what core committer think. Do we need a test only patch?

+++ b/core/tests/Drupal/Tests/Core/Database/SchemaIntrospectionTestTrait.php
@@ -0,0 +1,92 @@
+        $index_columns = $connection->query("SELECT	i.relname AS index_name, string_agg(a.attname, ',' ORDER BY a.attname ASC) AS index_columns FROM pg_class t, pg_class i, pg_index ix, pg_attribute a WHERE t.oid = ix.indrelid AND i.oid = ix.indexrelid AND a.attrelid = t.oid AND a.attnum = ANY(ix.indkey) AND t.relkind = 'r' AND t.relname = :table_name GROUP BY i.relname", [

Wired whitespace between SELECT and i.

Comment #6

amateescu CreditAttribution: amateescu for Pfizer, Inc. commented 8 February 2019 at 11:26

File	Size
3031479-6.patch	10.78 KB
8.7.x: PHP 7.2 & MySQL 5.5 24,960 pass PHP 5.5 & MySQL 5.5 24,606 pass PHP 5.5 & PostgreSQL 9.1 24,603 pass PHP 5.5 & SQLite 3.8 24,602 pass PHP 7.1 & PostgreSQL 9.1 24,957 pass PHP 7.1 & SQLite 3.15 24,957 pass
interdiff-6.txt	10.62 KB

If we improve the new assertion methods to also take the index type as an argument, we can also use them to improve the readability of another existing test.

Fixed #5 as well.

Comment #7

jibran

he/him

Urdu

Sydney, Australia

CreditAttribution: jibran as a volunteer commented 8 February 2019 at 12:45

+++ b/core/tests/Drupal/Tests/Core/Database/SchemaIntrospectionTestTrait.php
@@ -0,0 +1,135 @@
+  protected function getIndexColumnNames($table_name, $index_type) {

Do we need tests for this method? :D

Comment #8

amateescu CreditAttribution: amateescu for Pfizer, Inc. commented 8 February 2019 at 13:06

I think the assertions added by this patch are enough to demonstrate that the method does its job extremely well.

Comment #9

9 February 2019 at 14:56

Status:

Reviewed & tested by the community

» Needs work

The last submitted patch, 6: 3031479-6.patch, failed testing. View results

Comment #10

amateescu CreditAttribution: amateescu for Pfizer, Inc. commented 9 February 2019 at 17:25

Status:

Needs work

» Reviewed & tested by the community

Probably a random fail, queued a retest.

Comment #11

mondrake

🇮🇹

CreditAttribution: mondrake as a volunteer commented 9 February 2019 at 18:49

Status:

Reviewed & tested by the community

» Needs review

+++ b/core/tests/Drupal/Tests/Core/Database/SchemaIntrospectionTestTrait.php
@@ -0,0 +1,135 @@
+  /**
+   * Returns the column names used by an index of a table.
+   *
+   * @param string $table_name
+   *   A table name.
+   * @param string $index_type
+   *   The type of the index. Can be one of 'index', 'unique' or 'primary'.
+   *
+   * @return array
+   *   An array keyed by the index name and values as strings in the form of
+   *   'column1,column2'.
+   */
+  protected function getIndexColumnNames($table_name, $index_type) {
+    assert(in_array($index_type, ['index', 'unique', 'primary'], TRUE));
+    $connection = \Drupal::database();
+

Couldn't this be a method on the Schema API? Like findPrimaryKeyColumns($table). That way also contrib drivers could add their own implementation.

Edit - see #2881522: Add a Schema::findPrimaryKeyColumns method to remove database specific logic from test

Comment #12

plach

he/him

Italian

Venezia

CreditAttribution: plach as a volunteer commented 13 February 2019 at 09:12

Given that we already have ::findPrimaryKeyColumns(), I guess a ::findIndexColumns() method would make sense.

Comment #13

amateescu CreditAttribution: amateescu for Pfizer, Inc. commented 11 February 2019 at 10:17

File	Size
3031479-13.patch	13.69 KB
8.7.x: PHP 7.2 & MySQL 5.5 24,960 pass PHP 7.1 & PostgreSQL 9.1 24,960 pass PHP 7.1 & SQLite 3.15 24,960 pass
interdiff-13.txt	10.25 KB

@mondrake, having a method which can be implemented by alternative database drivers makes very much sense. This way they could even try to run more of \Drupal\KernelTests\Core\Database\SchemaTest in their own tests.

Decided to go with a introspectIndexSchema() method in the Schema API, to match the existing introspectSchema() already defined by SQLite.

Comment #14

jibran

he/him

Urdu

Sydney, Australia

CreditAttribution: jibran as a volunteer commented 11 February 2019 at 10:36

Status:

Needs review

» Reviewed & tested by the community

4 files were hidden/shown/deleted

File	Size
3031479.patch	4.96 KB
8.7.x: PHP 7.2 & MySQL 5.5 24,957 pass PHP 7.1 & PostgreSQL 9.1 24,957 pass PHP 7.1 & SQLite 3.15 24,957 pass
3031479-6.patch	10.78 KB
8.7.x: PHP 7.2 & MySQL 5.5 24,960 pass PHP 5.5 & MySQL 5.5 24,606 pass PHP 5.5 & PostgreSQL 9.1 24,603 pass PHP 5.5 & SQLite 3.8 24,602 pass PHP 7.1 & PostgreSQL 9.1 24,957 pass PHP 7.1 & SQLite 3.15 24,957 pass
interdiff-6.txt	10.62 KB
interdiff-13.txt	10.25 KB

Perfect! this looks much better now.

Comment #15

plach

he/him

Italian

Venezia

CreditAttribution: plach as a volunteer commented 13 February 2019 at 09:12

Status:

Reviewed & tested by the community

» Needs work

+++ b/core/lib/Drupal/Core/Database/Schema.php
@@ -545,6 +545,28 @@ protected function findPrimaryKeyColumns($table) {
+    throw new \RuntimeException("The '{$this->connection->driver()}' database driver does not implement " . __METHOD__);

I was about to suggest to switch to BadMethodCallException, but given that the driver is configurable, a runtime exception makes sense to me.

+++ b/core/tests/Drupal/Tests/Core/Database/SchemaIntrospectionTestTrait.php
@@ -0,0 +1,83 @@
+    foreach ($this->getIndexColumnNames($table_name, $index_type) as $index_name => $index_columns) {
...
+    foreach ($this->getIndexColumnNames($table_name, $index_type) as $index_name => $index_columns) {

I was initially confused by the $index_name variable, since it is unused. Can we remove it?

+++ b/core/tests/Drupal/Tests/Core/Database/SchemaIntrospectionTestTrait.php
@@ -0,0 +1,83 @@
+      if ($column_names == $index_columns) {
+        $this->assertTrue(TRUE);
+      }
+    }
+    $this->assertFalse(FALSE);
...
+      if ($column_names == $index_columns) {
+        $this->assertFalse(FALSE);
+      }
+    }
+    $this->assertTrue(TRUE);

It seems to me that this logic is a bit messed up: all assertions here, especially the last lines, are no-ops. There's no way for that code to fail :)
I guess the original goal was to fail as soon as a non-matching condition was found and only assert TRUE if all were met. Something like this:

    foreach ($this->getIndexColumnNames($table_name, $index_type) as $index_columns) {
      if ($column_names !== $index_columns) {
        $this->assertTrue(FALSE);
        return; // Not strictly needed, probably.
      }
    }
    $this->assertTrue(TRUE);

Also, would it make sense to use strict comparison here?

```
+++ b/core/tests/Drupal/Tests/Core/Database/SchemaIntrospectionTestTrait.php
@@ -0,0 +1,83 @@
+    $introspect_index_schema = new \ReflectionMethod(get_class($schema), 'introspectIndexSchema');
```
What about making the method public? Being able to introspect the actual schema might turn out to be useful while doing fancy stuff with it ;)
Moreover I'm not sure we normally "open" protected methods this way in tests, given that we are not testing the method return value itself.
Which leads me to the next point :)
```
+++ b/core/tests/Drupal/Tests/Core/Database/SchemaIntrospectionTestTrait.php
@@ -0,0 +1,83 @@
+      $indexes = [$index_schema['primary key']];
```
For consistency would it make sense to add a key name here as well? Otherwise the index name will be an integer in this case and that might cause issues with loose equality in the caller code.

Regardless of that, it seems that we are not actually checking that all drivers are returning the exact same data structure (although it seems they are), given that index name keys are unused. Probably it would be a good idea to add a small test explicitly testing ::introspectIndexSchema() itself.

Comment #16

plach

he/him

Italian

Venezia

CreditAttribution: plach as a volunteer commented 13 February 2019 at 09:12

I guess the original goal was to fail as soon as a non-matching condition was found and only assert TRUE if all were met.

Now that I think about it, that wouldn't work if multiple indexes were returned. However the logic as it is now doesn't seem to work either.

Comment #17

plach

he/him

Italian

Venezia

CreditAttribution: plach as a volunteer commented 13 February 2019 at 09:12

Maybe something like this?

  protected function assertIndexOnColumns($table_name, array $column_names, $index_type = 'index') {
    foreach ($this->getIndexColumnNames($table_name, $index_type) as $index_columns) {
      if ($column_names === $index_columns) {
        $this->assertTrue(TRUE);
        return;
      }
    }
    $this->assertTrue(FALSE);
  }

  protected function assertNoIndexOnColumns($table_name, array $column_names, $index_type = 'index') {
    foreach ($this->getIndexColumnNames($table_name, $index_type) as $index_columns) {
      if ($column_names === $index_columns) {
        $this->assertTrue(FALSE);
        return;
      }
    }
    $this->assertTrue(TRUE);
  }

Comment #18

amateescu CreditAttribution: amateescu for Pfizer, Inc. commented 12 February 2019 at 07:47

Status:

Needs work

» Reviewed & tested by the community

File	Size
3031479-18.patch	16.83 KB
8.7.x: PHP 7.2 & MySQL 5.5 24,994 pass PHP 7.1 & PostgreSQL 9.1 24,990 pass PHP 7.1 & SQLite 3.15 24,990 pass
interdiff-18.txt	6.44 KB

Thanks for the review!

Re #15:

And it's also consistent with the other unimplemented method from that class :)
Sure thing, removed it.
Ok.. that's embarassing :| Fixed the assertion logic. We shouldn't use a strict comparison there because that means the order of the columns would need to match as well..
I would very much like to have a public introspectSchema() method, but this one is only about getting the index definitions and I think it should stay "hidden" because it's not very useful on its own.
We do a lot of reflection for protected methods in tests, the search results for new \ReflectionMethod throughout core will surprise you :D
The new assertion methods are intentionally not dealing with index names at all (because of PostgreSQL), so I changed the other two places to use integer keys for consistency :)
Added full test coverage for the new introspectIndexSchema() method.

Comment #19

plach

he/him

Italian

Venezia

CreditAttribution: plach as a volunteer commented 13 February 2019 at 09:13

Thanks for the new test, it looks great!

1: Good point :)
2: 👍
3: That's what I was asking: would it make sense to be able to assert also the column order? I mean, since you are writing a test you can bother providing the correct order, and if the test starts failing maybe that means something got messed up with the schema. Not feeling strongly about this but I thought it would be good to point it out. Or are we implying that the some drivers may not provide a predictable order?
4:

+++ b/core/tests/Drupal/KernelTests/Core/Database/SchemaTest.php
@@ -274,6 +274,83 @@ public function testSchema() {
+    $introspect_index_schema = new \ReflectionMethod(get_class($this->schema), 'introspectIndexSchema');
+    $introspect_index_schema->setAccessible(TRUE);
+    $index_schema = $introspect_index_schema->invoke($this->schema, $table_name);

That's what I meant earlier: we normally make protected methods accessible in tests to assert their result, not to get access to internal APIs. The latter seems to suggest we need a public API instead. However, I'm fine with leaving the method protected, if the goal is to have a complete API instead.

5: Thanks!

+++ b/core/lib/Drupal/Core/Database/Driver/mysql/Schema.php
@@ -631,7 +631,7 @@ protected function introspectIndexSchema($table) {
-      elseif ($row->Non_unique != 0) {
+      elseif ($row->Non_unique == 0) {

Tricky :)

Was this caught by the new test?

Comment #20

amateescu CreditAttribution: amateescu for Pfizer, Inc. commented 12 February 2019 at 10:20

Or are we implying that the some drivers may not provide a predictable order?

Exactly! More specifically, the pgsql query to get the index column names does not return them in the order they were defined in the schema array, and if we add any kind of ordering to the query, it still won't match the initial schema order.

That's what I meant earlier: we normally make protected methods accessible in tests to assert their result, not to get access to internal APIs. The latter seems to suggest we need a public API instead. However, I'm fine with leaving the method protected, if the goal is to have a complete API instead.

Now that we have a test for asserting its result, I hope you feel better with leaving it protected :)

Was this caught by the new test?

It was caught by fixing assertIndexOnColumns(), which convinced me that we definitely need a proper test for the whole method.

Comment #21

plach

he/him

Italian

Venezia

CreditAttribution: plach as a volunteer commented 13 February 2019 at 09:13

Now that we have a test for asserting its result, I hope you feel better with leaving it protected :)

Yep, this looks good to go to me :)

Comment #22

plach

he/him

Italian

Venezia

CreditAttribution: plach as a volunteer commented 13 February 2019 at 09:14

Saving credits

Comment #23

13 February 2019 at 09:15

plach committed 431b5b4 on 8.7.x

Issue #3031479 by amateescu, jibran, mondrake:...

Comment #24

plach

he/him

Italian

Venezia

CreditAttribution: plach as a volunteer commented 13 February 2019 at 09:16

Status:

Reviewed & tested by the community

» Fixed

Committed 431b5b4 and pushed to 8.7.x. Thanks!

Comment #25

mondrake

🇮🇹

CreditAttribution: mondrake as a volunteer commented 13 February 2019 at 11:10

+++ b/core/tests/Drupal/KernelTests/Core/Database/SchemaTest.php
@@ -309,6 +274,83 @@ public function testSchema() {
+      'primary key' => ['id', 'test_field_1'],
+      'unique keys' => [
+        '2' => ['test_field_2'],
+        '3_4' => ['test_field_3', 'test_field_4'],
+      ],
+      'indexes' => [
+        '4' => ['test_field_4'],
+        '4_5' => ['test_field_4', 'test_field_5'],
+      ],

Uhm, numeric only index name like '2' can create issues in contrib as these can be interpreted as numbers rather than identifiers, e.g.

Drupal\KernelTests\Core\Database\SchemaTest::testIntrospectIndexSchema
Doctrine\DBAL\Exception\SyntaxErrorException: An exception occurred while executing 'CREATE UNIQUE INDEX 2 ON test97095778mhun9p3n (test_field_2)':

SQLSTATE[42000]: Syntax error or access violation: 1064 You have an error in your SQL syntax; check the manual that corresponds to your MySQL server version for the right syntax to use near '2 ON test97095778mhun9p3n (test_field_2)' at line 1

Is it a big deal to change those to sth like

+++ b/core/tests/Drupal/KernelTests/Core/Database/SchemaTest.php
@@ -309,6 +274,83 @@ public function testSchema() {
+      'primary key' => ['id', 'test_field_1'],
+      'unique keys' => [
+        'uk_2' => ['test_field_2'],
+        'uk_3_4' => ['test_field_3', 'test_field_4'],
+      ],
+      'indexes' => [
+        'idx_4' => ['test_field_4'],
+        'idx_4_5' => ['test_field_4', 'test_field_5'],
+      ],

Comment #26

amateescu CreditAttribution: amateescu for Pfizer, Inc. commented 13 February 2019 at 11:23

Status:

Fixed

» Needs review

File	Size
3031479-26-followup.patch	913 bytes
8.7.x: PHP 7.2 & MySQL 5.5 24,993 pass PHP 7.1 & PostgreSQL 9.1 24,993 pass PHP 7.1 & SQLite 3.15 24,993 pass

It's not a big deal at all, let's use those keys as they are used in real-world use cases, by combining the names of the columns.

I tested this locally on all three db drivers supported by core and it works fine as expected.

Comment #27

mondrake

🇮🇹

CreditAttribution: mondrake as a volunteer commented 13 February 2019 at 11:31

Status:

Needs review

» Reviewed & tested by the community

Great. Thanks :)

Comment #28

plach

he/him

Italian

Venezia

CreditAttribution: plach as a volunteer commented 13 February 2019 at 12:22

I was thinking about other ways a contrib driver could be affected by this: should we catch the not-implemented exception and prevent the test from failing if the implementation is missing?

Comment #29

mondrake

🇮🇹

CreditAttribution: mondrake as a volunteer commented 13 February 2019 at 12:29

#28 actually only tests would fail here - not normal runtime code. And only if you run the core testsuite under the contrib driver. So I find the exception raised as a good 'call for action' to contrib driver developers that would have to implement the missing pieces if they want to keep up with the evolving db abstraction layer. BTW this also applies to ::findPrimaryKeyColumns().

Maybe a change record would be good, anyway?

Comment #30

mondrake

🇮🇹

CreditAttribution: mondrake as a volunteer commented 13 February 2019 at 12:41

See https://www.drupal.org/node/2974956 for the change record that was done for #2881522: Add a Schema::findPrimaryKeyColumns method to remove database specific logic from test

Comment #31

plach

he/him

Italian

Venezia

CreditAttribution: plach as a volunteer commented 13 February 2019 at 13:28

Well, this is not a public API so a CR would only be helpful for DB driver developers, but why not? :)

Comment #32

plach

he/him

Italian

Venezia

CreditAttribution: plach as a volunteer commented 13 February 2019 at 13:30

#28 actually only tests would fail here - not normal runtime code.

Yep, I was suggesting to catch the failure in the test, however if core starts using that method in runtime code, it will break when using a driver not implementing that, so I agree it's fine to let the test fail.

Comment #33

13 February 2019 at 13:37

plach committed f5d6ace on 8.7.x

Issue #3031479 follow-up by amateescu, mondrake: Replaced numeric index...

Comment #34

plach

he/him

Italian

Venezia

CreditAttribution: plach as a volunteer commented 13 February 2019 at 13:38

Status:

Reviewed & tested by the community

» Fixed

Committed #26 as f5d6ace and pushed it to 8.7.x. Thanks!

Let's keep this fixed now ;)

Comment #35

27 February 2019 at 13:39

Status:

Fixed

» Closed (fixed)

Automatically closed - issue fixed for 2 weeks with no activity.