Indexing status can show a negative percentage of content as having been indexed [#2178643]

Comment	File	Size	Author
#54	2178643-54.patch	10.21 KB	ianthomas_uk
#49	2178643-49.patch	10.22 KB	ianthomas_uk
#48	interdiff.txt	2.93 KB	jhodgdon
#48	2178643-48.patch	10.1 KB	jhodgdon
#45	2178643-45.patch	10.09 KB	ianthomas_uk
#41	2178643-41.patch	10.09 KB	ianthomas_uk
#39	2178643-39.patch	10.09 KB	ianthomas_uk
#38	2178643.38.patch	9.98 KB	jayeshanandani
#36	2178643.34.patch	9.05 KB	jayeshanandani
#32	2178643-32.patch	9.41 KB	jayeshanandani
#22	explain-18-count.png	15.79 KB	jhodgdon
#22	explain-12-count.png	20.74 KB	jhodgdon
#22	explain-18-find-nodes.png	17.69 KB	jhodgdon
#22	explain-12-find-nodes.png	22.23 KB	jhodgdon
#18	2178643-18.patch	9.02 KB	ianthomas_uk
#12	2178643-12.patch	9.07 KB	jhodgdon
#11	2178643-11.patch	8.91 KB	jhodgdon

Comment #1

ianthomas_uk

he/him

English

Brighton, UK

commented 22 January 2014 at 13:52

I've can confirm that this is related to the translations, the bug is in NodeSearch::indexStatus(), where the first query will count one per node but the second will count one per translation.

A simple fix would be to change the second COUNT(*) to DISTINCT(nid).

A better fix might be for all indexing logic to count based on translations. The would also remove the need for the special case in NodeSearch::updateIndex() ($counter). There's probably a reason it doesn't do this already, I'll need to have a closer look.

Log in or register to post comments

Comment #2

jhodgdon

she/her

English

commented 22 January 2014 at 14:51

Good catch. There are probably several things in the search module that weren't updated for languages fully.

Log in or register to post comments

Comment #3

ianthomas_uk

he/him

English

Brighton, UK

commented 23 January 2014 at 00:26

I'm not sure which way to go with this, at the moment there is a weird hybird of sometimes counting nodes, and sometimes counting translations. The confusion centres around the code in NodeSearch::updateIndex(), which was introduced in #1669876-10: Add missing language functionality in search module. I assume the thinking was that the throttle would be set to a suitable value for a single language, and multi-language nodes will take much longer to index, so fewer should be done at once.

At the moment, the indexing status is meant to count nodes, but the indexing throttle is meant to apply to translations. That means if you have 10 nodes and you set the throttle to 10, then one cron run will show:
- 10 items processed if they have one translation each
- 5 items processed if they have two translations each (2 * 5 nodes = 10 translations)
- 3 items processed if they have three translations each (because the fourth node would take it over the throttle)
- 0 items processed when it hits a node with >10 translations (because it can never fit within the throttle)

We need to decide one way or the other if we are counting nodes or translations. I think we should count nodes, and just accept that more complicated nodes take longer to index. I think users would expect the limits to apply to the number of nodes.

If we were to count translations properly, then that would include splitting the indexing of a single node across two cron runs, which will slow down indexing.

I'll add a comment to the other issue to see if anyone objects to switching back to a node-based throttle.

Log in or register to post comments

Comment #4

jhodgdon

she/her

English

commented 23 January 2014 at 00:38

I agree that we should count nodes in the throttle, but the UI should probably warn (on that setting) that multi-lingual nodes may take longer to index than single-language nodes. It's also true that some node types will take longer to index than others, though... the throttle is just a guestimate...

Really, it would be even better to do what the Queue system does and throttle based on time. Or better yet, use the queue system, since that is precisely what it is set up for. There's an issue on that somewhere.

Regarding the indexing status, I think we have to report it as number of nodes because as far as I know, the only way to count the number of un-indexed items with a simple db query is by node, right? We don't want to have the counting process be a long and complicated for loop.

Log in or register to post comments

Comment #5

ianthomas_uk

he/him

English

Brighton, UK

commented 23 January 2014 at 01:19

as I know, the only way to count the number of un-indexed items with a simple db query is by node

No, we can count the number of unindexed translations with an SQL query (doing so accidentally is what caused this bug).

better yet, use the queue system

The issue with the queue system is it doesn't handle items being re-queued very well.

it would be even better to ... throttle based on time

That would be pretty easy. We could even have a total time and split it by search page ourselves (e.g. you allocate 10 seconds for indexing, we spend 5 seconds indexing NodeSearch, and 5 seconds indexing FooSearch. If NodeSearch was fully indexed, then we'd spend the whole time on NodeSearch.)

It would make my patch on the cron_limit issue obsolete of course ;)

Log in or register to post comments

Comment #6

jhodgdon

she/her

English

commented 23 January 2014 at 16:59

I agree, using the queue system is more complex. Let's discuss that on the other issue dedicated to the queue question (which will likely not be done anyway).

So... I am OK with any of the options:

a) Be more careful about reporting: report number of nodes, not translations, in indexing status, and keep the throttling based on number of nodes.

b) Throttle based on (approximate) number of node translations, with the understanding that we'd always fully-index all the translations when processing a given node. So we'd stop when the number of translations that have been indexed meets or exceeds the throttle number, at the end of indexing a node.

c) Throttle based on time. Be careful in reporting so that everything is based on nodes.

My thoughts:

1. It seems like the simplest fix for just the observed bug reported here would be (a).

2. It also seems to me that if we are going to change how the throttling is done, the most sensible thing to do would be (c), because the objective of throttling is to make sure cron doesn't time out, so basing it on time is the obvious solution. However, it would I think require a change in the API -- the best way (IMO) to implement it would be for search_cron() to pass in a $max_time parameter to the updateIndex() method on each plugin, wouldn't it?

3. Given all of that, probably we should just do (a), and maybe file a separate issue to change throttling to being time-based?

Log in or register to post comments

Comment #7

ianthomas_uk

he/him

English

Brighton, UK

commented 24 January 2014 at 00:33

OK, let's settle on node-based counting and implement the fix suggested in #1 (as suggested #6a). We can consider this issue fixed when a patch for that is committed.

The more I think about it the more I think a translation-based count doesn't make sense, especially when you're indexing multiple content types. A node with five translations but two fields would probably index faster than a node with one translation but 50 fields.

I've therefore opened #2180389: Switch from item-based to time-based index throttling to switch to time-based indexing.

Log in or register to post comments

Comment #8

jhodgdon

she/her

English

commented 24 January 2014 at 16:25

Sounds like a plan.

Log in or register to post comments

Comment #9

jhodgdon

she/her

English

commented 28 January 2014 at 22:21

Issue summary:	View changes
Related issues:		+#2179081: [meta] Inventory search.module tests. File issues to address test holes., +#2180389: Switch from item-based to time-based index throttling

Added issue summary. Also this will take care of one of the tests needed in #2179081: [meta] Inventory search.module tests. File issues to address test holes..

Log in or register to post comments

Comment #10

jhodgdon

she/her

English

commented 31 January 2014 at 20:59

Assigned:

Unassigned

» jhodgdon

I'm working on this.

Log in or register to post comments

Comment #11

jhodgdon

she/her

English

commented 4 February 2014 at 02:21

Status:

Active

» Needs review

Status	File	Size
new	2178643-11.patch	8.91 KB

Here is a patch. NodeSearch now throttles and calculates status based on number of nodes, not number of translations, and there are tests.

Log in or register to post comments

Comment #12

jhodgdon

she/her

English

commented 4 February 2014 at 14:54

Status	File	Size
new	2178643-12.patch	9.07 KB

I added a couple more lines to the end of the test (thought of one more thing that should be tested). Otherwise, this is the same patch. Didn't bother with an interdiff as there has not been a review yet.

Log in or register to post comments

Comment #13

jhodgdon

she/her

English

commented 13 February 2014 at 20:38

12: 2178643-12.patch queued for re-testing.

Log in or register to post comments

Comment #14

ianthomas_uk

he/him

English

Brighton, UK

commented 16 February 2014 at 22:32

+++ b/core/modules/node/lib/Drupal/node/Plugin/Search/NodeSearch.php
@@ -290,24 +290,18 @@ protected function addNodeRankings(SelectExtender $query) {
-    $result = $this->database->queryRange("SELECT DISTINCT n.nid, d.reindex FROM {node} n LEFT JOIN {search_dataset} d ON d.type = :type AND d.sid = n.nid WHERE d.sid IS NULL OR d.reindex <> 0 ORDER BY d.reindex ASC, n.nid ASC", 0, $limit, array(':type' => $this->getPluginId()), array('target' => 'slave'));
+    $result = $this->database->queryRange("SELECT n.nid FROM {node} n LEFT JOIN (SELECT sd.sid, MAX(sd.reindex) as reindex FROM {search_dataset} sd WHERE sd.type = :type GROUP BY sd.sid) d ON d.sid = n.nid WHERE d.sid IS NULL OR d.reindex <> 0 ORDER BY d.reindex ASC, n.nid ASC", 0, $limit, array(':type' => $this->getPluginId()), array('target' => 'slave'));

Do we need the subquery here? We're want each unindexed nid once, can't we do that with SELECT n.nid, max(d.reindex) ... GROUP BY n.nid?

+++ b/core/modules/node/lib/Drupal/node/Plugin/Search/NodeSearch.php
@@ -362,7 +356,8 @@ public function resetIndex() {
-    $remaining = $this->database->query("SELECT COUNT(*) FROM {node} n LEFT JOIN {search_dataset} d ON d.type = :type AND d.sid = n.nid WHERE d.sid IS NULL OR d.reindex <> 0", array(':type' => $this->getPluginId()))->fetchField();
+    $remaining = $this->database->query("SELECT COUNT(*) FROM {node} n LEFT JOIN (SELECT sd.sid, MAX(sd.reindex) as reindex FROM {search_dataset} sd WHERE sd.type = :type GROUP BY sd.sid) d ON d.sid = n.nid WHERE d.sid IS NULL OR d.reindex <> 0", array(':type' => $this->getPluginId()))->fetchField();

This one I think we could do as "SELECT count(n.nid) ..."

If we do need the longer queries, they would be easier to read if split onto multiple lines.

The changes to the test look good.

Log in or register to post comments

Comment #15

jhodgdon

she/her

English

commented 18 February 2014 at 18:07

Assigned:

jhodgdon

» Unassigned

Regarding comment #14:

1) I think we do need a subquery is the most efficient way to do this query and the most database-independent, and I tried a bunch of other methods and found they were not doing what I expected; also using a sub-query makes the join a lot faster. Please feel free to try and make it better if you can, if you can make sure all the tests pass. I think this is the best option.

2) Again, I tried a bunch of variations and this is the only one I found that would make the tests pass. Please feel free to try to improve it if you can, but again I think this is the best option that actually works.

3) And regarding your other comment about splitting up the query into separate lines, I haven't seen that done elsewhere in Core (or at least in the Search code), so I am not inclined to do it here?

Log in or register to post comments

Comment #16

ianthomas_uk

he/him

English

Brighton, UK

commented 21 February 2014 at 14:13

I was thinking of something along the lines of

SELECT COUNT(n.nid) FROM node n
LEFT JOIN search_dataset sd ON sd.sid = n.nid AND sd.type = 'node_search'
WHERE sd.sid IS NULL OR sd.reindex <> 0

I can't test this manually though, because I can't translate my nodes at the moment for some reason. I'll come back to this when I can translate them.

The coding standards (https://drupal.org/node/2497) include examples of different ways that people are formatting multi-line SQL, but I had a quick look over core and couldn't find any examples either. We should probably be using Database\Connection::select().

Log in or register to post comments

Comment #17

jhodgdon

she/her

English

commented 21 February 2014 at 16:03

Well, good luck, I don't think that query in #16 will work, but you can try. We have tests for the desired behavior in this patch, so if you can get them to pass, you can try another patch. I'd be interested to know about the performance implications though -- from my reading, the subquery would be expected to be faster than joining first.

Regarding using select(), sure, we could do that. I left it as one-line queries because that is what it was before. I have heard in the past from Berdir that these are also more efficient than using select(), but I am not sure if that is still the case.

Log in or register to post comments

Comment #18

ianthomas_uk

he/him

English

Brighton, UK

commented 24 February 2014 at 20:10

Status	File	Size
new	2178643-18.patch	9.02 KB

2 files were hidden/shown/deleted

Status	File	Size
hidden	2178643-11.patch	8.91 KB
hidden	2178643-12.patch	9.07 KB

This patch has my join-based queries, which pass tests locally. The queries in HEAD, patch #12 and patch #18 are as followed

Nodes to index, where :type = $this->getPluginId()

HEAD

SELECT DISTINCT n.nid, d.reindex FROM {node} n 
LEFT JOIN {search_dataset} d ON d.type = :type AND d.sid = n.nid 
WHERE d.sid IS NULL OR d.reindex <> 0 
ORDER BY d.reindex ASC, n.nid ASC

#12

SELECT n.nid FROM {node} n 
LEFT JOIN (SELECT sd.sid, MAX(sd.reindex) as reindex FROM {search_dataset} sd  WHERE sd.type = :type  GROUP BY sd.sid) d ON d.sid = n.nid 
WHERE d.sid IS NULL OR d.reindex <> 0 
ORDER BY d.reindex ASC, n.nid ASC

#18

SELECT n.nid, MAX(sd.reindex) FROM {node} n
LEFT JOIN {search_dataset} sd ON sd.sid = n.nid AND sd.type = :type
WHERE sd.sid IS NULL OR sd.reindex <> 0 
GROUP BY n.nid 
ORDER BY MAX(sd.reindex) is null DESC, MAX(sd.reindex) ASC, n.nid ASC

Number of nodes still needing indexing

HEAD

SELECT COUNT(*) FROM {node} n 
LEFT JOIN {search_dataset} d ON d.type = :type AND d.sid = n.nid 
WHERE d.sid IS NULL OR d.reindex <> 0

#12

SELECT COUNT(*) FROM {node} n 
LEFT JOIN (SELECT sd.sid, MAX(sd.reindex) as reindex FROM {search_dataset} sd WHERE sd.type = :type GROUP BY sd.sid) d ON d.sid = n.nid
WHERE d.sid IS NULL OR d.reindex <> 0

#18

SELECT COUNT(DISTINCT n.nid) FROM {node} n 
LEFT JOIN {search_dataset} sd ON sd.sid = n.nid AND sd.type = :type 
WHERE sd.sid IS NULL OR sd.reindex <> 0

There are no other changes. I would expect a join to be faster, than a subquery, as the dbms has more chances to optimise the query plan, but we'd need to do some profiling to be sure. Is there a recommended way to do that on a large db? I guess we'd need devel_generate.

IMO the join-based queries are easier to read. Note the additional order by clause in the first query is a new feature, deliberately prioritising content that has never been indexed over outdated content.

Log in or register to post comments

Comment #19

ianthomas_uk

he/him

English

Brighton, UK

commented 24 February 2014 at 20:02

One point I forgot to mention is that neither version of the queries copes with the situation where there is one translation of a node indexed and up to date (i.e. in search_dataset with reindex = 0) and one translation of a node has never been indexed (i.e. not in search_dataset). It's not possible to detect this using just these two tables, because we assume that the missing translation just doesn't exist.

I can't find a way that this currently causes a problem, because both adding a translation and updating any translation currently updates search_dataset for all translations. Maybe langcode should just be removed from search_dataset?

Log in or register to post comments

Comment #20

jhodgdon

she/her

English

commented 25 February 2014 at 16:20

RE #19... Right, we do not need to worry about this at all. In Node::postSave(), it calls node_reindex_node_search($nid), and this will be called whenever any change is made to any translation, including adding a new translation. We are always calling search_mark_for_reindex('node_search', $nid), and never making it translation-specific.

Anyway... I am pretty sure your new queries are less efficient than the previous ones, because they are potentially joining a lot more rows, and then doing the grouping after the join -- from what I've read, it is more efficient to do the group first as the queries in #12 do.

I'm also not sure your new queries will run on PostgreSQL -- did you test it?

Also, the Order By clauses seem very weird to me, like "ORDER BY MAX(sd.reindex) is null DESC". The query in #12 has the same effect as it orders by d.reindex ASC, which will put the NULL entries first, and I think it's cleaner. I don't share your enthusiasm for the new queries being easier to read... but whatever. We need to fix this one way or another.

Log in or register to post comments

Comment #21

jhodgdon

she/her

English

commented 1 March 2014 at 00:46

Well, I still think the queries in my patch are more efficient, but in the interest of getting this patch in and not wanting to argue endlessly, I am willing to use the queries in #18. They are working correctly, and I just tested in PostgreSQL too, it's fine.

So, I am good for RTBC on either the patch in #18 or the patch in #12. But since I wrote most of the code in both (except for the changed queries in #18), we need someone else to review the patch(es).

I am going to ask in IRC for someone who is more of a query optimization expert than I am to weigh in on the queries in #12 vs. #18. They give us the same results, as demonstrated by the tests.

Log in or register to post comments

Comment #22

jhodgdon

she/her

English

commented 1 March 2014 at 13:42

Status	File	Size
new	explain-12-find-nodes.png	22.23 KB
new	explain-18-find-nodes.png	17.69 KB
new	explain-12-count.png	20.74 KB
new	explain-18-count.png	15.79 KB

I realized I had a D7 database with some search data indexed -- about 2500 entries in search_dataset. I ran the queries in #12 and #18 with EXPLAIN (due to caching, query times are hard to measure). This may not count as a huge data set, but at least it is not insignificant. Nothing else was running on my local DB at the time.

Results:

a) Nodes to index:

#12
SELECT n.nid FROM {node} n
LEFT JOIN (SELECT sd.sid, MAX(sd.reindex) as reindex FROM {search_dataset} sd  WHERE sd.type = :type  GROUP BY sd.sid) d ON d.sid = n.nid
WHERE d.sid IS NULL OR d.reindex <> 0
ORDER BY d.reindex ASC, n.nid ASC

#18
SELECT n.nid, MAX(sd.reindex) FROM {node} n
LEFT JOIN {search_dataset} sd ON sd.sid = n.nid AND sd.type = :type
WHERE sd.sid IS NULL OR sd.reindex <> 0
GROUP BY n.nid
ORDER BY MAX(sd.reindex) is null DESC, MAX(sd.reindex) ASC, n.nid ASC

#12 explain output:
explain #18 output

#18 explain output:
explain #18 output

b) Counting nodes:

#12
SELECT COUNT(*) FROM {node} n
LEFT JOIN (SELECT sd.sid, MAX(sd.reindex) as reindex FROM {search_dataset} sd WHERE sd.type = :type GROUP BY sd.sid) d ON d.sid = n.nid
WHERE d.sid IS NULL OR d.reindex <> 0

#18
SELECT COUNT(DISTINCT n.nid) FROM {node} n
LEFT JOIN {search_dataset} sd ON sd.sid = n.nid AND sd.type = :type
WHERE sd.sid IS NULL OR sd.reindex <> 0

#12 explain output:
explain #18 output

#18 explain output:
explain #18 output

Hopefully that will help figure this out?

Log in or register to post comments

Comment #23

jhodgdon

she/her

English

commented 1 March 2014 at 13:53

OK... I guess the information I read about doing this type of join earlier was incorrect or didn't consider indexing. From what I can tell, the queries in #18 look like they are more efficient, given this EXPLAIN output, because they are making better use of indexes (the inner SELECT table isn't indexed).

So, I'm +1 for RTBC on the queries in #18 that ian_thomas_uk supplied.

ianthomas_uk: I wrote the rest of the code in this patch. Would you care to review that and mark the patch RTBC (hopefully) or whatever? Thanks!

Log in or register to post comments

Comment #24

ianthomas_uk

he/him

English

Brighton, UK

commented 2 March 2014 at 16:36

I'm not sure what to do here.

The rest of the patch looks good, and fixes the standard use case without breaking anything.

However, the queries could still end up with the wrong values if modules call the API functions themselves (as described in #19). The correct fix to that is just to drop the $langcode parameter from all these functions and tables, and ideally refactor into the appropriate classes at the same time. But that's a much bigger change and I think there are more important things to do. There's a related bug where if you call search_reindex(null, null, $langcode) then the langcode will be ignored.

So I'm tempted to say we should go with the current patch (which fixes the observed problems), and file a followup to fix the little-used API.

Log in or register to post comments

Comment #25

jhodgdon

she/her

English

commented 3 March 2014 at 18:18

RE #24, which API functions are you talking about specifically that would malfunction?

Log in or register to post comments

Comment #26

jhodgdon

she/her

English

commented 3 March 2014 at 22:38

So... Regarding #19/#24, I do not think we need to worry about this, because:

The patch here is about what the NodeSearch plugin does regarding index throttling and status reporting. There is no problem here -- the NodeSearch plugin and node.module will always mark an entire node as needing a reindex if any translation is updated, so it is all self-consistent. And the queries in question for this issue are part of NodeSearch plugin, not generalized, so that is fine.

For another hypothetical plugin that uses the search index, we have several API functions:

a) search_index() causes {search_dataset} tol be updated, for the particular "sid" value and language code that it was called with, to have reindex=0 for that particular type/sid/language.

b) search_reindex() causes {search_dataset} and {search_index} to be cleared for that particular item (type/sid/language) requested.

c) search_mark_for_reindex() -- that function does not have a language code parameter, so all existing languages' reindex values for the given type/sid will be marked to the current request time when this is called. We could add a $langcode parameter, but that's a separate issue.

So, this hypothetical other plugin is free to report whatever it wants in its search indexing status and do whatever it wants in its update index method. I don't see how any of (b-d) are problems in that regard, since it can do its own queries on the {search_dataset} table joined with its particular table. What's the problem you are seeing?

Log in or register to post comments

Comment #27

ianthomas_uk

he/him

English

Brighton, UK

commented 4 March 2014 at 01:41

The queries could produce invalid data for a valid database, but I can't see how NodeSearch could create that valid database, short of a fatal error in the loop in indexNode(). The "valid database" could contain:
-1 node
-2 translations
-1 entry in search_dataset with reindex = 0
The queries would tell you that everything was indexed, because we know we've indexed every node (we check by joining the node table) and we know that we've indexed every translation that we know about (because reindex = 0).

You could get the database into that state by calling search_reindex with a $sid and/or $type, plus $langcode, while a node was waiting to be indexed.

Also, a call to search_reindex(null, null, 'en') will delete the index for all languages (separate bug, not a regression).
Finally, it's poorly named - I'd expect to have a freshly populated index after calling search_reindex, but I'd get an empty one (again, a separate issue).

To summarise, it's not a good data structure / api, but this patch doesn't make that any worse.

Log in or register to post comments

Comment #28

jhodgdon

she/her

English

commented 4 March 2014 at 14:35

I feel like you're trying to find problems that don't exist.

search_reindex() is *designed and documented* to clear out the whole search index if you call it with type/sid == NULL -- it ignores language code in that case. That is not a bug -- that is how to clear out the whole search index with an API call. The name of the function is stupid, I'll grant you that, but it's been like this since Drupal 5, and the name of that function is not relevant to whether the API is broken or not, or this issue at all.

In any case... NodeSearch and the Node module never call search_reindex with $langcode, and certainly never calls search_reindex(null, null, $langcode). So I think NodeSearch is using the database/API in a consistent way. If other plugins (with presumably their own indexing type) choose to use the API in an inconsistent way, that is their choice, but not a good choice. I really don't see what the problem is at all with the API or the database or this patch.

Can we just review *this patch* and if you think there are other problems, discuss them elsewhere please please please?

Log in or register to post comments

Comment #29

ianthomas_uk

he/him

English

Brighton, UK

commented 4 March 2014 at 15:13

Sorry for going off-topic above. These are real problems that I spotted while reviewing the change, but they are not regressions caused by the change, so shouldn't block the patch.

I'm also +1 to RTBC #18, but I'd be happier if someone else could have a look too.

Log in or register to post comments

Comment #30

ianthomas_uk

he/him

English

Brighton, UK

commented 5 March 2014 at 16:55

Status:

Needs review

» Reviewed & tested by the community

Summary of patch reviews so far:
- jhodgdon posted a patch on #12. I was happy with it apart from two SQL queries.
- I posted a patch with alternative queries on #18. jhodgdon initially didn't like them but later came round. The queries are equivilent so it's partly personal preference. Performance is not known to be a problem, but #18 does look faster.
- I had concerns about the API, but these are not caused by this patch. I've opened #2211241: Refactor search_reindex() into separate functions as a followup.

Therefore after another look over the patch I'm now happy to RTBC.

Log in or register to post comments

Comment #31

alexpott

he/they

English

🇪🇺🌍

commented 8 March 2014 at 10:23

Status:	Reviewed & tested by the community	» Needs work
Issue tags:		+Needs reroll

2178643-18.patch no longer applies.

error: patch failed: core/modules/search/lib/Drupal/search/Tests/SearchMultilingualEntityTest.php:92
error: core/modules/search/lib/Drupal/search/Tests/SearchMultilingualEntityTest.php: patch does not apply

Log in or register to post comments

Comment #32

jayeshanandani commented 9 March 2014 at 11:49

Status:	Needs work	» Needs review
Issue tags:	-Needs reroll

Status	File	Size
new	2178643-32.patch	9.41 KB

Rerolled #18.

Log in or register to post comments

Comment #33

ianthomas_uk

he/him

English

Brighton, UK

commented 9 March 2014 at 11:58

Status:

Needs review

» Needs work

+++ b/core/modules/search/lib/Drupal/search/Tests/SearchMultilingualEntityTest.php
@@ -93,53 +100,60 @@ function setUp() {
-    $plugin->setSearch('English OR Hungarian', array('f' => array('language:hu')), array());
-    $search_result = $plugin->execute();
+
+    $this->plugin->setSearch('English OR Hungarian', array('f' => array('langcode:hu')), array());
+    $search_result = $this->plugin->execute();

Extra whitespace.
Why is language being changed to langcode? It's REALLY, REALLY important when resolving conflicts to changed lines to check if other parts of the line have been changed. The intentional change here was $plugin to $this->plugin.

+++ b/core/modules/search/lib/Drupal/search/Tests/SearchMultilingualEntityTest.php
@@ -93,53 +100,60 @@ function setUp() {
+

More whitespace

Log in or register to post comments

Comment #34

9 March 2014 at 12:50

The last submitted patch, 32: 2178643-32.patch, failed testing.

Log in or register to post comments

Comment #35

9 March 2014 at 12:50

The last submitted patch, 32: 2178643-32.patch, failed testing.

Log in or register to post comments

Comment #36

jayeshanandani commented 9 March 2014 at 13:18

Status:

Needs work

» Needs review

Status	File	Size
new	2178643.34.patch	9.05 KB

Last reroll was chaging some other code. Modified that and uploading a new reroll.

Log in or register to post comments

Comment #37

9 March 2014 at 14:19

Status:

Needs review

» Needs work

The last submitted patch, 36: 2178643.34.patch, failed testing.

Log in or register to post comments

Comment #38

jayeshanandani commented 9 March 2014 at 23:42

Status:

Needs work

» Needs review

Status	File	Size
new	2178643.38.patch	9.98 KB

Last Rerolled missed some modifications. Updated the patch with those and added new modficiations so test that doesnt fail.

Log in or register to post comments

Comment #39

ianthomas_uk

he/him

English

Brighton, UK

commented 10 March 2014 at 00:46

Status	File	Size
new	2178643-39.patch	10.09 KB

2 files were hidden/shown/deleted

Status	File	Size
hidden	2178643-32.patch	9.41 KB
hidden	2178643.34.patch	9.05 KB

@jhodgon Did you deliberately combine the tests in #11? I didn't spot it before, but it means we're now testing searching in a function called testIndexingThrottle(). Here's a patch that splits them again, based on #18 (because I started it before #38 was posted).

#38 looks good as a straight reroll of #18.

Log in or register to post comments

Comment #40

jhodgdon

she/her

English

commented 10 March 2014 at 16:11

Status:

Needs review

» Needs work

Yest, it was intentional. It is a lot more efficient to have all the tests in one method, because setup doesn't have to be done twice. So let's instead of splitting that up, change the name of the method.

Log in or register to post comments

Comment #41

ianthomas_uk

he/him

English

Brighton, UK

commented 11 March 2014 at 00:01

Status:

Needs work

» Needs review

Status	File	Size
new	2178643-41.patch	10.09 KB

3 files were hidden/shown/deleted

Status	File	Size
hidden	explain-18-count.png	15.79 KB
hidden	2178643.38.patch	9.98 KB
hidden	2178643-39.patch	10.09 KB

This is a revision of #38 with an alternative name and comment.

-   * Tests for indexing throttle with nodes in multiple languages.
-  function testIndexingThrottle() {
+   * Tests the indexing throttle and search results with multilingual nodes.
+  function testMultilingualSearch() {

Other than this I would have been happy to RTBC #38.

Log in or register to post comments

Comment #42

11 March 2014 at 00:33

Status:

Needs review

» Needs work

The last submitted patch, 41: 2178643-41.patch, failed testing.

Log in or register to post comments

Comment #43

ianthomas_uk

he/him

English

Brighton, UK

commented 11 March 2014 at 00:56

Status:

Needs work

» Needs review

41: 2178643-41.patch queued for re-testing.

Log in or register to post comments

Comment #44

jhodgdon

she/her

English

commented 11 March 2014 at 15:19

Status:

Needs review

» Needs work

This looks good except in the test there is still this comment:

     // Update the index and then run the shutdown method.
     // See testIndexingThrottle() for further explanation.

that refers to a now nonexistent method.

Log in or register to post comments

Comment #45

ianthomas_uk

he/him

English

Brighton, UK

commented 11 March 2014 at 19:30

Status:

Needs work

» Needs review

Status	File	Size
new	2178643-45.patch	10.09 KB

2 files were hidden/shown/deleted

Status	File	Size
hidden	2178643-18.patch	9.02 KB
hidden	2178643-41.patch	10.09 KB

I've just removed those two lines - the full explaination it was referring to is only a few lines above anyway.

Log in or register to post comments

Comment #46

jhodgdon

she/her

English

commented 12 March 2014 at 14:07

Priority:	Minor	» Normal
Status:	Needs review	» Reviewed & tested by the community

OK, I'm +1 for RTBC again, and this time I think I can mark it since I didn't make any of the changes since the last RTBC (which finally amounted to mostly rerolling).

Log in or register to post comments

Comment #47

alexpott

he/they

English

🇪🇺🌍

commented 24 March 2014 at 08:45

Status:

Reviewed & tested by the community

» Needs work

+++ b/core/modules/search/lib/Drupal/search/Tests/SearchMultilingualEntityTest.php
@@ -156,5 +166,36 @@ function testSearchingMultilingualFieldValues() {
+  protected function checkIndexCounts($remaining, $total, $message) {

This should be called assertIndexCounts

Log in or register to post comments

Comment #48

jhodgdon

she/her

English

commented 24 March 2014 at 16:08

Status:

Needs work

» Needs review

Status	File	Size
new	2178643-48.patch	10.1 KB
new	interdiff.txt	2.93 KB

Good idea. New patch, changing the test method checkIndexCounts to assertIndexCounts.

Log in or register to post comments

Comment #49

ianthomas_uk

he/him

English

Brighton, UK

commented 26 March 2014 at 23:08

Status	File	Size
new	2178643-49.patch	10.22 KB

3 files were hidden/shown/deleted

Status	File	Size
hidden	2178643-45.patch	10.09 KB
hidden	2178643-48.patch	10.1 KB
hidden	interdiff.txt	2.93 KB

Changes in #48 are as described, but the patch already needed a reroll.

Log in or register to post comments

Comment #50

26 March 2014 at 23:52

Status:

Needs review

» Needs work

The last submitted patch, 49: 2178643-49.patch, failed testing.

Log in or register to post comments

Comment #51

ianthomas_uk

he/him

English

Brighton, UK

commented 27 March 2014 at 00:12

Status:

Needs work

» Needs review

49: 2178643-49.patch queued for re-testing.

Failures in
Drupal\node\Tests\NodeTranslationUITest
Drupal\path\Tests\PathLanguageTest

Log in or register to post comments

Comment #52

jhodgdon

she/her

English

commented 27 March 2014 at 16:45

Status:

Needs review

» Reviewed & tested by the community

Reroll is good. Only difference (besides reroll) between the patch in #50 and patch in #45 is the requested change to the assert method in the test class. Tests are passing. So back to RTBC.

Log in or register to post comments

Comment #53

28 March 2014 at 03:40

Status:

Reviewed & tested by the community

» Needs work

The last submitted patch, 49: 2178643-49.patch, failed testing.

Log in or register to post comments

Comment #54

ianthomas_uk

he/him

English

Brighton, UK

commented 28 March 2014 at 08:06

Status:

Needs work

» Reviewed & tested by the community

Status	File	Size
new	2178643-54.patch	10.21 KB

1 file was hidden/shown/deleted

Status	File	Size
hidden	2178643-49.patch	10.22 KB

Simple reroll caused by $this->entityManager->getStorageController('node'); becoming $this->entityManager->getStorage('node');

Log in or register to post comments

Comment #55

1 April 2014 at 13:01

Status:

Reviewed & tested by the community

» Needs work

The last submitted patch, 54: 2178643-54.patch, failed testing.

Log in or register to post comments

Comment #56

jhodgdon

she/her

English

commented 1 April 2014 at 14:12

Status:

Needs work

» Needs review

54: 2178643-54.patch queued for re-testing.

Log in or register to post comments

Comment #57

jhodgdon

she/her

English

commented 1 April 2014 at 14:12

Status:

Needs review

» Reviewed & tested by the community

Log in or register to post comments

Comment #58

alexpott

he/they

English

🇪🇺🌍

commented 9 April 2014 at 12:56

Status:

Reviewed & tested by the community

» Fixed

Committed 3c488c4 and pushed to 8.x. Thanks!

Log in or register to post comments

Comment #59

9 April 2014 at 12:57

Commit 3c488c4 on 8.x by alexpott:

Issue #2178643 by ianthomas_uk, jhodgdon, jayeshanandani: Indexing...

Log in or register to post comments

Comment #60

23 April 2014 at 13:00

Status:

Fixed

» Closed (fixed)

Automatically closed - issue fixed for 2 weeks with no activity.

Log in or register to post comments

Indexing status can show a negative percentage of content as having been indexed

Problem/Motivation

Proposed resolution

Remaining tasks

User interface changes

API changes

Original report by @ianthomas_uk

Comments

Nodes to index, where :type = $this->getPluginId()

Number of nodes still needing indexing

Related issues