Unable to filter on columns of entity reference fields [#2958587]

We are encountering an issue where we are unable to filter by column values in a reference field. The docs page for Filtering says:

Tip: You can filter by sub-properties of a field. For example, a path like field_phone.country_code will work even though field_phone isn't a relationship.

But this doesn't seem to apply for entity reference fields. For example, if I have an image field attached to my article content type, the following requests to filter by the alternate text column values fail:

GET: /jsonapi/node/article?filter[field_image.alt][value]=test
Symfony\Component\HttpKernel\Exception\BadRequestHttpException: Invalid nested filtering. The field `alt`, given in the path `field_image.alt`, does not exist. in Drupal\jsonapi\Context\FieldResolver->resolveInternal() (line 135 of web/modules/contrib/jsonapi/src/Context/FieldResolver.php).

Note that the following EntityQuery does in fact work:

$query = \Drupal::entityQuery('node');
$query->condition('type', 'article');
$query->condition('field_image.alt', 'Test');
$query->execute();

Comment	File	Size	Author
#23	2958587-23.patch	14.06 KB	wim leers
#23	interdiff.txt	721 bytes	wim leers
#21	2958587-21.patch	13.77 KB	gabesullice
#21	interdiff-17-21.txt	10.89 KB	gabesullice
#15	2958587-15.patch	11.2 KB	gabesullice
#15	interdiff-12-15.txt	2.18 KB	gabesullice
#15	2958587-15.tests_only.patch	3.06 KB	gabesullice
#6	2958587.tests_only.patch	815 bytes	gabesullice
#6	2958587.patch	5.1 KB	gabesullice
#12	2958587-12.tests_only.patch	5.23 KB	gabesullice
#12	interdiff-6-12.txt	13.74 KB	gabesullice
#12	2958587-12.patch	13.38 KB	gabesullice
#17	interdiff-15-17.txt	3.31 KB	gabesullice
#17	2958587-17.patch	11.21 KB	gabesullice

Comments

Comment #1

4 April 2018 at 18:47

Dave Reid created an issue. See original summary.

Comment #2

dave reid

he/him

English

Nebraska USA

commented 4 April 2018 at 18:48

Issue summary:

View changes

Comment #3

dave reid

he/him

English

Nebraska USA

commented 4 April 2018 at 18:57

Issue summary:

View changes

Comment #4

wim leers

Ghent 🇧🇪🇪🇺

commented 5 April 2018 at 10:34

Priority:	Normal	» Major
Issue tags:		+Needs tests

Thanks for reporting this! This seems at least a major bug.

Comment #5

wim leers

Ghent 🇧🇪🇪🇺

commented 23 April 2018 at 15:37

#2962457: Handle computed fields in entity queries: throwing a helpful exception is better than a PHP fatal error seems related.

Comment #6

gabesullice

English

commented 26 April 2018 at 11:12

Status	File	Size
new	2958587.tests_only.patch	815 bytes
new	2958587.patch	5.1 KB

This is actually a separate issue from computed fields (although they both have to do with the field resolver), but they don't need to be handled together.

The issue here is that we are blindly saying that any path part after a reference field must be a field on the targeted entity and are ignoring the fact that reference fields can have other properties.

The most obvious case is target_id, which is not a reference property like the (computed) entity property on entity reference fields.

Another issue I discovered while doing this is that it would also be impossible to target references that do not have the entity property. The language property on a language field is an example of this kind of reference (although it's inapplicable to filtering because it's computed).

Tests and patch attached.

Comment #7

gabesullice

English

commented 26 April 2018 at 11:13

Status:

Active

» Needs review

Comment #8

26 April 2018 at 11:18

The last submitted patch, 6: 2958587.tests_only.patch, failed testing. View results

Comment #9

wim leers

Ghent 🇧🇪🇪🇺

commented 26 April 2018 at 14:22

Issue tags:

-Needs tests

This is more complex than I first thought. Or maybe it's my still congested head that's having trouble understanding this.

+++ b/src/Context/FieldResolver.php
@@ -139,14 +141,32 @@ class FieldResolver {
+      // Determine if the next path part is a sub-property of the field.

Nit: s/is/will/ (because it's for the next iteration of the loop.

```
+++ b/src/Context/FieldResolver.php
@@ -139,14 +141,32 @@ class FieldResolver {
+        // If the property is a reference, then we can continue resolving the
+        // path.
```
Hm … why is this necessary? Following multiple levels of entity references already was working before. That's why we have the while ($part = array_shift($parts)) { loop. Why are we now skipping ahead one iteration of the loop?

+++ b/src/Context/FieldResolver.php
@@ -139,14 +141,32 @@ class FieldResolver {
+        // sub-properties, not subsequent fields.
...
+      // sub-properties of the field.

Nit: s/sub-property/property/

Fields have properties. Subproperties are not a thing.

+++ b/src/Context/FieldResolver.php
@@ -139,14 +141,32 @@ class FieldResolver {
+      if (!empty($parts)) {
+        $reference_breadcrumbs[] = 'entity';
+      }

@@ -174,7 +194,7 @@ class FieldResolver {
-    $entity_path = implode('.entity.', $references);
+    $entity_path = implode('.', $references);

Why this move?

+++ b/src/Context/FieldResolver.php
@@ -320,4 +340,49 @@ class FieldResolver {
+  protected static function isCandidateDefinitionProperty($part, $candidate_definitions) {

Nit: array $candidate_definitions

+++ b/tests/src/Kernel/Context/FieldResolverTest.php
@@ -75,6 +75,11 @@ class FieldResolverTest extends JsonapiKernelTestBase {
+      ['field_test_ref1.target_id', 'field_test_ref1.target_id'],

This uses \Drupal\Core\TypedData\DataReferenceTargetDefinition, which is not an instance of \Drupal\Core\TypedData\DataReferenceDefinitionInterface. Why is this now working?

+++ b/tests/src/Kernel/Context/FieldResolverTest.php
@@ -75,6 +75,11 @@ class FieldResolverTest extends JsonapiKernelTestBase {
+      ['field_test_ref1.entity.field_test_text.value', 'field_test_ref1.entity.field_test_text.value'],

This is just a normal DataDefinition … I don't understand why this didn't work in HEAD, even though I can see that the test is failing.

I can see that it's fixing certain use cases simply because your newly added tests are failing and are passing after making the changes. But I don't understand yet why the changes make things work. I could step through it with a debugger, but I suspect it might be faster for you to explain?

Comment #10

gabesullice

English

commented 26 April 2018 at 16:11

Ah, yes, this is complicated. The change is relatively simple, but how it all fits together is definitely not.

Just as a reminder before I dig in: The entity query system expects uid.entity.name but JSON API makes it possible to just do uid.name to be mirror the dot-separated syntax of include paths. The current logic blindly assumes entity is the "glue" between path parts, but that's not truly the case.

Okay, so:

- An entity reference field has two properties, target_id and entity
- target_id is an instance of DataReferenceTargetDefinition
- entity is an instance of DataReferenceDefinition (missing "Target")
- Our heuristic for determining if a field is a reference field is: "does the field have a DataReferenceTargetDefinition property?"
That's here:

$main_property_definition = $item_definition->getPropertyDefinition(
  $item_definition->getMainPropertyName()
);

// Check if the field is a flavor of an Entity Reference field.
if (!$main_property_definition instanceof DataReferenceTargetDefinition) {
  return [];
}

- If the current part is a reference based on that heuristic, we added it to the "breadcrumbs"

=== If the field is a reference ===
- Then, we advance to the next part of the path, assuming the next part is a field on the targeted entity type. (e.g. uid.name on a node would say "uid" is a reference, therefore continue on the path and evaluate "name" against the User entity type)
- When we are done resolving the path, we then concatenate all the breadcrumbs with "entity"
- This works because the entity query system looks at the property definitions of a field and checks for DataReferenceDefinitions and "entity" is the property for those on most (perhaps all) entity reference-like fields.
- To say it differently, the entity query system does not use the target_id property to determine what entity table to join against, it uses the DataReferenceDefinition identified in the path (usually "entity" is this property)

=== If the field is not a reference ===
- We assume all remaining parts target deltas or properties of the field (or properties of those properties, etc etc. in the case of nested Map or ComplexData property types, hence "sub-properties"). An example of this might be body.2.value.

Where does this fail:

If the field is a reference, then we assumed (wrongly) that the next part of the path must be a field on the target entity type.
However, if the part is actually a property of the reference field, that possibility is never considered.
It's also possible that the DataReferenceDefinition is not actually named entity, in that case, it could never have worked with JSON API, because we always assume that it is.

How does this fix things:
If the field is a reference, we now "peek" ahead to the next path part to see if it's actually a property of the reference field.

If it's not, we continue with the path described above.

If it is, then we check if it's a reference property or not (instanceof DataReferenceDefinition).
If it is a reference property like entity (or anything other string), then we push that onto the breadcrumbs: $reference_breadcrumbs[] = array_shift($parts);.

If it is not a reference property, but it is a property of the reference field, then we assume that all remaining parts of the string are "sub-properties" of that property.

In explaining this, I realized two errors/unhandled cases in my logic 😅I've written the above as if the logic is correct.

The two flaws I found are:

I'm not actually checking for indices, as in the 1 in body.1.value
The continue; is incorrect. The loop still needs to do some processing in case the next part is a reference. So:

if (static::isReferenceProperty($parts[0], $candidate_definitions)) {
  $reference_breadcrumbs[] = array_shift($parts);
  continue;
}
return $this->constructInternalPath($reference_breadcrumbs, $parts);

becomes:

if (!static::isReferenceProperty($parts[0], $candidate_definitions)) {
  return $this->constructInternalPath($reference_breadcrumbs, $parts);
}
$reference_breadcrumbs[] = array_shift($parts);
// rest of loop still gets evaluated

More tests and patch to follow.

Comment #11

wim leers

Ghent 🇧🇪🇪🇺

commented 27 April 2018 at 12:19

…
However, if the part is actually a property of the reference field, that possibility is never considered.
…
[…] in that case, it could never have worked with JSON API, because we always assume that it is.

Thanks for making it all click :) I still don't understand the field_test_text.value case though. Text fields aren't referencing anything at all.

I realized two errors/unhandled cases in my logic 😅

Looking forward to the next iteration!

I'm not actually checking for indices

I thought about this too. But I'm not sure you can support this; whether you can depends not on logic in FieldResolver, but of capabilities of the Entity Field Query system?

P.S.: I think this could be made much easier to understand (through refactoring) by having more methods, that each do one thing explicitly.

Comment #12

gabesullice

English

commented 27 April 2018 at 14:11

Issue tags:

+API-First Initiative

Status	File	Size
new	2958587-12.patch	13.38 KB
new	interdiff-6-12.txt	13.74 KB
new	2958587-12.tests_only.patch	5.23 KB

Okay, I tried to flatten out the logic and more verbose commentary. I added more test cases as well. I'm attaching an interdiff as well, but I doubt it'll be that useful.

I still don't understand the field_test_text.value case though. Text fields aren't referencing anything at all.

I don't understand what you don't understand. If you mean:

+      ['field_test_ref1.entity.field_test_text.value', 'field_test_ref1.entity.field_test_text.value'],

It was the entity part that was failing because it wasn't possible to specify a property of any kind on the entity reference.

I thought about this too. But I'm not sure you can support this; whether you can depends not on logic in FieldResolver, but of capabilities of the Entity Field Query system?

The field resolver is primarily an entity query validator with two JSON API specific transformations:

honor field aliases
allow the reference property field to be elided after an entity reference field

It is therefore possible to support this because it's possible in the entity query system.

I think this could be made much easier to understand (through refactoring) by having more methods, that each do one thing explicitly.

Indeed, I ended up adding more methods and also repeating some logic rather than trying to cleverly reduce if/else statements.

I could step through it with a debugger, but I suspect it might be faster for you to explain?

I still think stepping through with a debugger would be helpful.

Comment #13

27 April 2018 at 14:14

The last submitted patch, 12: 2958587-12.patch, failed testing. View results
- codesniffer_fixes.patch Interdiff of automated coding standards fixes only.

Comment #14

27 April 2018 at 14:15

The last submitted patch, 12: 2958587-12.tests_only.patch, failed testing. View results
- codesniffer_fixes.patch Interdiff of automated coding standards fixes only.

Comment #15

gabesullice

English

commented 27 April 2018 at 14:52

Status	File	Size
new	2958587-15.patch	11.2 KB
new	interdiff-12-15.txt	2.18 KB
new	2958587-15.tests_only.patch	3.06 KB

Whoops, that included some other debugging stuff.

Comment #16

27 April 2018 at 14:56

The last submitted patch, 15: 2958587-15.tests_only.patch, failed testing. View results

Comment #17

gabesullice

English

commented 27 April 2018 at 15:02

Status	File	Size
new	2958587-17.patch	11.21 KB
new	interdiff-15-17.txt	3.31 KB

And fixing the CS violations.

Comment #18

wim leers

Ghent 🇧🇪🇪🇺

commented 7 May 2018 at 11:55

Status:

Needs review

» Needs work

#12

Okay, I tried to flatten out the logic and more verbose commentary. I added more test cases as well. I'm attaching an interdiff as well, but I doubt it'll be that useful.

The many early returns you added make it much easier to understand this logic — thank you! And this interdiff actually was useful! I applied #12 locally to look at helper methods not being introduced in #12, but #12's interdiff allowed me to see what you were changing and why.

It was the entity part that was failing […]

D'oh, of course! 🙈😅 Thanks for clarifying!

It is therefore possible to support this because it's possible in the entity query system.

I didn't know the entity query system supported this! I just checked \Drupal\Core\Entity\Query\QueryInterface::condition(), and sure enough, it's there. Grep for "delta".
This actually revealed to me that there is a special query condition called %delta, which allows one to find entities which have e.g. 5 items (5 deltas). Does JSON API's querying/filtering already support this? If not, can we add an explicit failing test? Can we create an issue to clarify this?

Interdiff review:

+++ b/src/Context/FieldResolver.php
@@ -117,20 +119,21 @@ class FieldResolver {
+    // This complex expression is needed to handle the string, "0", which would
+    // otherwise be evaluated as FALSE.
+    while (!is_null(($part = array_shift($parts)))) {

This comment is most definitely necessary! This is where PHP the language gets in the way, and such careful conditions (and accompanying components) are an absolute necessity!

+++ b/src/Context/FieldResolver.php
@@ -117,20 +119,21 @@ class FieldResolver {
-      // remaining path parts are for sub-properties.
+      // remaining path parts are targeting field deltas and/or field
+      // properties.

MUCH clearer ❤️

+++ b/src/Context/FieldResolver.php
@@ -117,20 +119,21 @@ class FieldResolver {
-      $candidate_definitions = $this->getFieldItemDefinitions(
-        $resource_types,
-        $field_name
-      );
+      // Different resource types have different field definitions.
+      $candidate_definitions = $this->getFieldItemDefinitions($resource_types, $field_name);

Nit: the comment is fine, but can we revert the code reformatting? It's an unnecessary distraction.

+++ b/src/Context/FieldResolver.php
@@ -141,34 +144,45 @@ class FieldResolver {
+      // If the next part is a delta, as in "body.0.value", then we add it to
+      // the breadcrumbs and remove it from the parts that still must be
+      // processed.
+      if (static::isDelta($parts[0])) {
+        $reference_breadcrumbs[] = array_shift($parts);
+      }
+
+      // If there are no remaining path parts, the process is finished.
+      if (empty($parts)) {

This implies you could have an expression like body.0 or field_tags.3.

Do we actually support those? This logic suggests we do. If we do, then we should have explicit test coverage for it.

+++ b/src/Context/FieldResolver.php
@@ -141,34 +144,45 @@ class FieldResolver {
+        // If the property is not a reference property, then all
+        // remaining parts must be further property specifiers.
+        if (!static::isReferenceProperty($parts[0], $candidate_definitions)) {

This says "must be", but it doesn't validate this; it only checks the next expression path part. Can we add a @todo to validate this in the future?

+++ b/src/Context/FieldResolver.php
@@ -141,34 +144,45 @@ class FieldResolver {
+      else {
+        // The next path part is neither a delta nor a field property, so it
+        // must be a field on a targeted resource type. We need to guess the
+        // intermediate reference property since one was not provided.
+        $reference_breadcrumbs[] = static::getDataReferencePropertyName($candidate_definitions, $parts);
       }

Woah! I think this would benefit from an example in the comment.

I stepped through this with a debugger. I first thought this was necessary only for the most complex edge cases. But actually it's necessary for even something as simple as type.uuid! This comment sounds more complex than it is: it usually will result in 'entity' being appended to $reference_breadcrumbs.

+++ b/src/Context/FieldResolver.php
@@ -340,6 +354,48 @@ class FieldResolver {
+   * Determine the reference property name from the given field definitions.

Nit: s/Determine/Determines/

+++ b/src/Context/FieldResolver.php
@@ -340,6 +354,48 @@ class FieldResolver {
+        if ($property_definition instanceof DataReferenceDefinitionInterface) {

Can't this reuse static::isReferenceProperty()?

+++ b/src/Context/FieldResolver.php
@@ -340,6 +354,48 @@ class FieldResolver {
+  protected static function isDelta($part) {
+    return (bool) preg_match('/^[0-9]+$/', $part);
+  }

👌

Comment #19

wim leers

Ghent 🇧🇪🇪🇺

commented 7 May 2018 at 11:59

And one more review of the entire patch:

+++ b/src/Context/FieldResolver.php
@@ -320,4 +352,93 @@ class FieldResolver {
+      $message = sprintf('Ambiguous path. Try one of the following: %s, before the given path: %s', $choices, $given);
+      throw new BadRequestHttpException($message);

There's no explicit test coverage for this yet. Having that would also help address #18.6.

+++ b/tests/src/Kernel/Context/FieldResolverTest.php
@@ -86,12 +86,38 @@ class FieldResolverTest extends JsonapiKernelTestBase {
+      // Entity reference with followed by field property, with and without
+      // multiple deltas.
...
+      // Entity reference with followed by field, with and without reference
+      // properties.

These comments should be turned into array keys for these test cases; these array keys are considered *test case labels* by PHPUnit.

So when any of them fail, you'll see "test case #15 ()" rather than just "test case #15").

Comment #20

wim leers

Ghent 🇧🇪🇪🇺

commented 7 May 2018 at 12:00

Once #18 and #19 are addressed (mostly nits, but nits that help improve maintainability), this is RTBC. This is obviously a very important bugfix, comes with explicit test coverage (just see how many more test cases \Drupal\Tests\jsonapi\Kernel\Context\FieldResolverTest::resolveInternalProvider() gets!), and helps make the crucial FieldResolver component of JSON API more maintainable!

Comment #21

gabesullice

English

commented 7 May 2018 at 17:01

Status	File	Size
new	interdiff-17-21.txt	10.89 KB
new	2958587-21.patch	13.77 KB

Nit: the comment is fine, but can we revert the code reformatting? It's an unnecessary distraction.

Done.

This implies you could have an expression like body.0 or field_tags.3. Do we actually support those? This logic suggests we do. If we do, then we should have explicit test coverage for it.

We do, and there is coverage for it :) See:

+++ b/tests/src/Kernel/Context/FieldResolverTest.php
@@ -86,12 +86,38 @@ class FieldResolverTest extends JsonapiKernelTestBase {
+      ['field_test_ref1.0.target_id', 'field_test_ref1.0.target_id'],
+      ['field_test_ref1.1.target_id', 'field_test_ref1.1.target_id'],

Can we add a @todo to validate this in the future?

Done.

I first thought this was necessary only for the most complex edge cases. But actually it's necessary for even something as simple as type.uuid! This comment sounds more complex than it is: it usually will result in 'entity' being appended to $reference_breadcrumbs.

I flipped the logic so the common case is first, added more commentary and an example.

18.7: Done

Can't this reuse static::isReferenceProperty()?

Not really. I renamed the method to make it more specific to its use case.

There's no explicit test coverage for this yet. Having that would also help address #18.6.

Without a custom field type, this isn't possible to test. I'm happy to remove the logic, make a follow-up issue and hard-code the word entity in the method for now. What do you think? Add the robust logic now, without a test, or have a hardcoded value now with a follow-up and test?

These comments should be turned into array keys for these test cases; these array keys are considered *test case labels* by PHPUnit.

Done!

Comment #22

gabesullice

English

commented 7 May 2018 at 17:05

Status:

Needs work

» Needs review

Comment #23

wim leers

Ghent 🇧🇪🇪🇺

commented 8 May 2018 at 09:06

Status	File	Size
new	interdiff.txt	721 bytes
new	2958587-23.patch	14.06 KB

We do, and there is coverage for it :)

😅 I even saw it after I wrote that remark. Forgot to remove that remark. Sorry!

I flipped the logic so the common case is first, added more commentary and an example.

18.7: Done

✔️

Without a custom field type, this isn't possible to test. […] What do you think? Add the robust logic now, without a test, or have a hardcoded value now with a follow-up and test?

Let's add the robust logic now and add a test in a follow-up. (Also: I was gonna say we already have a jsonapi_test_field_type test module, but that doesn't actually allow you to test this particular

… which means I can add a @todo and don't need to wait for you to just add a single comment for this patch to be RTBC'able/committable 😃🎉

Comment #24

wim leers

Ghent 🇧🇪🇪🇺

commented 8 May 2018 at 09:07

This is the issue I created for that explicit test coverage btw: #2971281: Test coverage: ambiguous filter expression paths.

Comment #25

wim leers

Ghent 🇧🇪🇪🇺

commented 8 May 2018 at 09:09

Status:

Needs review

» Reviewed & tested by the community

And I of course meant to RTBC in #23 — I said it, but didn't do it. Sorry for the noise.

Comment #26

8 May 2018 at 09:20

Wim Leers committed 1f8ddc9 on 8.x-1.x authored by gabesullice

Issue #2958587 by gabesullice, Wim Leers, Dave Reid: Unable to filter on...

Wim Leers committed e38d36c on 8.x-2.x authored by gabesullice

Issue #2958587 by gabesullice, Wim Leers, Dave Reid: Unable to filter on...

Comment #27

wim leers

Ghent 🇧🇪🇪🇺

commented 8 May 2018 at 09:20

Status:

Reviewed & tested by the community

» Fixed

Green on both 8.5 & 8.6, let's ship this excellent bugfix!

Comment #28

22 May 2018 at 09:24

Status:

Fixed

» Closed (fixed)

Automatically closed - issue fixed for 2 weeks with no activity.

Unable to filter on columns of entity reference fields

Comments

Comment #1

Comment #2

Comment #3

Comment #4

Comment #5

Comment #6

Comment #7

Comment #8

Comment #9

Comment #10

Comment #11

Comment #12

Comment #13

Comment #14

Comment #15

Comment #16

Comment #17

Comment #18

Comment #19

Comment #20

Comment #21

Comment #22

Comment #23

Comment #24

Comment #25

Comment #26

Comment #27

Comment #28

Related issues

Referenced by

News items

Our community

Documentation

Drupal code base

Governance of community