Support paging through multiple requests [#2640516]

Comment	File	Size	Author
#137	migrate_plus-support_paging-2640516-137.diff	33.48 KB	nadavoid
#130	migrate_plus-support-paging-2640516-130.patch	17.2 KB	jacobbell84
#125	migrate_plus-support-paging-2640516-125.patch	17.15 KB	jacobbell84
#124	interdiff-123-124.txt	1021 bytes	kekkis
#124	migrate_plus-support-paging-2640516-124.patch	16.97 KB	kekkis
#123	migrate_plus-support-paging-2640516-123.patch	16.8 KB	chuyenlv
#121	interdiff_119_121.txt	3.86 KB	jcandan
#121	migrate_plus-support-paging-2640516-121.patch	16.81 KB	jcandan
#119	migrate_plus-pager_selector_error-2640516-119.patch	15.6 KB	kekkis
#119	interdiff-118-119.txt	1.01 KB	kekkis
#118	migrate_plus-pager_selector_error-2640516-118.patch	15.39 KB	slayne40
#114	migrate_plus-pager_selector_error-2640516.patch	15.33 KB	quadbyte
#112	migrate_plus-pager_selector_error-2640516-112.patch	15.3 KB	quadbyte
#109	migrate_plus-pager_selector_error-2640516-108---altered-on-line-202.patch	15.08 KB	auth
#107	migrate_plus-pager_selector_error-2640516-107.patch	15.1 KB	monymirza
#102	migrate_plus-pager_selector_error-2640516-102.patch	19.35 KB	rob230
#100	interdiff-2640516-98-100.txt	1.78 KB	rob230
#100	migrate_plus-pager_selector_error-2640516-100.patch	0 bytes	rob230
#98	migrate_plus-pager_selector_error-2640516-98.patch	18.08 KB	sleopold
#96	migrate_plus-pager_selector_error-2640516-96.patch	17.59 KB	KevinVanRansbeeck
#95	migrate_plus-pager_selector_error-2640516.patch	912 bytes	peter törnstrand
#94	support-paging-through-multiple-requests-2640516-94.patch	19.18 KB	hmdnawaz
#93	migrate_plus-support_paging-2640516-92.patch	19.49 KB	scott_euser
#93	interdiff-2640516-90-92.txt	329 bytes	scott_euser
#91	interdiff-2640516-88-90.txt	438 bytes	scott_euser
#91	migrate_plus-support_paging-2640516-90.patch	19.49 KB	scott_euser
#90	interdiff-2640516-88-90.txt	438 bytes	scott_euser
#88	interdiff-2640516-86-88.txt	505 bytes	scott_euser
#88	migrate_plus-support_paging-2640516-88.patch	19.35 KB	scott_euser
#88	2021-09-20_13-38.png	6.9 KB	scott_euser
#86	reroll_diff_83-86.txt	2.77 KB	ronaldmulero
#86	migrate_plus-support_paging-2640516-86.patch	19.36 KB	ronaldmulero
#83	reroll_diff_81-83.txt	5.7 KB	ronaldmulero
#83	migrate_plus-support_paging-2640516-83.patch	18.85 KB	ronaldmulero
#81	interdiff-81.txt	607 bytes	szloredan
#81	migrate_plus-support_paging-2640516-81.patch	18.79 KB	szloredan
#79	interdiff-78.txt	694 bytes	segi
#79	migrate_plus-support_paging-2640516-78.patch	18.64 KB	segi
#76	migrate_plus-support_paging-2640516-76.patch	18.53 KB	ruslan piskarov
#72	migrate_plus-support_paging-2640516-72.patch	18.15 KB	robin.ingelbrecht
#70	migrate_plus-support_paging-2640516-70.patch	18.15 KB	prudloff
#69	interdiff-2640516.68.69.txt	2.06 KB	pixlkat
#69	migrate_plus-support_paging-2640516-69.patch	17.76 KB	pixlkat
#68	migrate_plus-support_paging-2640516-68.patch	17.2 KB	duaelfr
#68	interdiff-2640516.65.68.txt	1.58 KB	duaelfr
#65	migrate_plus-support_paging-2640516-65.patch	16.87 KB	duaelfr
#65	interdiff-2640516.63.65.txt	2.17 KB	duaelfr
#62	interdiff_60-62.txt	782 bytes	weekbeforenext
#60	migrate_plus-support_paging-2640516-60.patch	15.91 KB	m.lebedev
#60	interdiff_48-60.txt	9.98 KB	m.lebedev
#48	migrate_plus-support_paging-2640516-48.patch	16.39 KB	sdstyles
#48	interdiff_45_48.txt	672 bytes	sdstyles
#45	migrate_plus-support_paging-2640516-45.patch	17.32 KB	jcandan
#45	interdiff_41_45.txt	551 bytes	jcandan
#41	interdiff_39_41.txt	1.84 KB	jcandan
#41	migrate_plus-support_paging-2640516-41.patch	17.33 KB	jcandan
#39	migrate_plus-support_paging-2640516-39.png	386.28 KB	jcandan
#39	interdiff_37_39.txt	13.64 KB	jcandan
#39	migrate_plus-support_paging-2640516-39.patch	17.28 KB	jcandan
#35	migrate_plus-support_paging-2640516-35.patch	13.19 KB	m.lebedev
#33	migrate_plus-support_paging-2640516-33.patch	12.96 KB	m.lebedev
#27	migrate_plus-support_paging-2640516-27.patch	12.96 KB	berenddeboer
#26	migrate_plus-support_paging-2640516-26.patch	13.29 KB	berenddeboer
#25	migrate_plus-test-case.patch	4.1 KB	berenddeboer
#23	migrate_plus-support_paging-2640516-23.patch	9 KB	mortona2k
#19	migrate_plus-support_paging-2640516-19.patch	9.38 KB	berenddeboer
#16	interdiff-2640516-12-16.txt	752 bytes	badrange
#16	migrate_plus-support_paging-2640516-16.patch	8.36 KB	badrange
#12	interdiff-2640516.txt	8.44 KB	drclaw
#12	migrate_plus-support_paging-2640516-12.patch	8.34 KB	drclaw
#5	support_paging-2640516-5.patch	8.08 KB	Grayside
#37	migrate_plus-support_paging-2640516-37.patch	13.19 KB	m.lebedev
#63	migrate_plus-support_paging-2640516-63.patch	16.84 KB	weekbeforenext
#63	interdiff_37-63.txt	13.52 KB	weekbeforenext
#62	migrate_plus-support_paging-2640516-62.patch	16.44 KB	weekbeforenext
#63	interdiff_62-63.txt	12.04 KB	weekbeforenext

Comment #1

24 December 2015 at 22:32

mikeryan created an issue. See original summary.

Comment #2

mikeryan commented 19 May 2016 at 19:47

Component:	Source plugins	» Plugins

Comment #3

Grayside commented 15 September 2016 at 23:08

Assigned:	Unassigned	» Grayside

Next step after #2608610: Add support for list/item pattern.

Comment #4

Grayside commented 16 September 2016 at 17:36

Here's my rough game plan going into this. Note that I'm attempting a generic approach, but I am not going to work on XML/SOAP.

Configuration

I think paging should be opt-in
A selector to find the "next" link in the document body seems necessary. Raw JSON might use next, HAL would be _links/next/href.

Links Header

DataParserPluginBase::nextSource() should call $this->getNextLinks() and prepend the resulting URLs to $this->urls. getNextLinks() will be abstract.

Links Header: JSON

Json::openSourceUrls() can extract link headers from the data fetcher. Should data fetchers in general have a getStreamMetadata() method or should it check for the Http plugin specifically?
Paging being so intrinsic to fetching, it makes sense to move more of the Link header parsing into the data fetcher. No need for HTTP structure to leak into JSON parsing.

Links Header: XML (Blocked)

XMLReader is not currently using the HTTP data fetcher at all. Link Header support is dependent on switching to it. (E.g., download the XML doc via fetcher then open from temporary file location).

Links Header: SOAP (Blocked)

See XML.

Next Links

Next Links: JSON

Json::openSourceUrls calls getSourceData() to retrieve the data for import, but that might exclude data for the paging process.
Building on work done in #2608610-10: Add support for list/item pattern, we could inject a "pager selector" into getSourceData to extract the next link. Should we use static caching pattern to avoid repeatedly decoding?

Next Links: XML

TBD, but appears like it can be approximately similar to JSON in that we could parse out the next link based on a selector.

Next Links: SOAP

TBD, but appears like it can be approximately similar to JSON in that we could parse out the next link based on a selector.
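
To make the Links Header part of this plan concrete, here is a minimal sketch of pulling the rel="next" URL out of a Link header value. The function name find_next_link() and the example URLs are hypothetical and not part of migrate_plus. The regular expression is a simplification that assumes parameter values contain no commas or semicolons.

```php
<?php

/**
 * Returns the URL tagged rel="next" in an HTTP Link header value, or NULL.
 *
 * Hypothetical helper for illustration only; not part of migrate_plus.
 */
function find_next_link(string $link_header): ?string {
  // Each entry looks like: <https://api.example.com/items?page=2>; rel="next"
  $pattern = '/<([^>]*)>((?:\s*;\s*[^;,]+)*)/';
  if (!preg_match_all($pattern, $link_header, $matches, PREG_SET_ORDER)) {
    return NULL;
  }
  foreach ($matches as $match) {
    // $match[1] is the URL, $match[2] holds its ";param=value" pairs.
    if (preg_match('/;\s*rel\s*=\s*"?([^";]*)"?/i', $match[2], $rel)) {
      // A rel value may list several space-separated relation types.
      $types = preg_split('/\s+/', strtolower(trim($rel[1])));
      if (in_array('next', $types, TRUE)) {
        return $match[1];
      }
    }
  }
  return NULL;
}

// Example Link header with made-up URLs.
$header = '<https://api.example.com/items?page=2>; rel="next", '
  . '<https://api.example.com/items?page=9>; rel="last"';
$next = find_next_link($header);
```

A fetcher could call something like this on the response headers and hand the result to the parser, which would keep HTTP details out of the JSON parsing.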

Comment #5

Grayside commented 22 September 2016 at 05:00

Assigned:	Grayside	» Unassigned
Status:	Active	» Needs review

Status	File	Size
new	support_paging-2640516-5.patch	8.08 KB

Here we go. This patch will conflict with #2608610: Add support for list/item pattern as both needed some of the same changes to Json::getSourceData().

Comment #6

mikeryan commented 11 October 2016 at 20:25

Version:	8.x-1.x-dev	» 8.x-2.x-dev

Comment #7

mikeryan commented 11 October 2016 at 21:07

Thanks for this patch - I'm going to need to think on this bit, just want to let you know I'm not ignoring it as I catch up on reviews...

Comment #8

Grayside commented 26 October 2016 at 17:07

I suspect the current implementation of this or #2608610: Add support for list/item pattern is disrupting rollbacks. Got this second-hand so I do not have a lot of details, but likely related to item counts.

Comment #9

mikeryan commented 2 December 2016 at 14:45

Status:	Needs review	» Needs work

+++ b/src/DataFetcherPluginInterface.php
@@ -46,4 +46,16 @@ interface DataFetcherPluginInterface {
+  public function getNextLinksFromMetadata($url);

Haven't gotten around to a thorough review and testing, but I'd like to see the interface be more general - getNextLinks(). The implementation could, based on configuration, derive the links from headers, selectors, or query strings (e.g., ?page=1)

Comment #10

Grayside commented 2 December 2016 at 18:43

Sure. I named it getNextLinksFromMetadata() because to my mind, getNextLinks() sounded like specifically the Link HTTP header and metadata is more generic.

Would like to wait on more feedback before diving back in myself.

There is also this spurious warning in the logs under some configuration. Haven't traced which aspect of my usage is triggering it, as I have 2-3 variations of the pager configuration in place.

Notice: Undefined index: pager_selector in Drupal\migrate_plus\Plugin\migrate_plus\data_parser\Json->getNextLinks() (line 234 of /var/www/build/html/modules/contrib/migrate_plus/src/Plugin/migrate_plus/data_parser/Json.php).

Comment #11

hugovk commented 17 February 2017 at 13:09

This would be a very useful addition. What's the current status? Thanks!

Comment #12

drclaw commented 9 September 2017 at 16:03

Status	File	Size
new	migrate_plus-support_paging-2640516-12.patch	8.34 KB
new	interdiff-2640516.txt	8.44 KB

Here's a re-rolled patch for 8.x-4.x from #5 with a few changes and a few additions:

Changes to original patch:

I named the next-URL getter methods getNextUrls, which IMO is more technically correct. It also matches the language we use throughout the fetcher/parser plugins.
It looked like the original patch was making a second request to the active URL to get pager data off the response. I don't think Guzzle caches responses (at least not without a plugin), so I added some code to store the active URL's source data so that we don't make extraneous requests.
Pager config is now nested under the 'pager' key instead of being a sibling to it. This is to support different paging types. I've named the paging type from the original patch "urls", since it pulls URLs off the response.

Example config for the "urls" paging type:

source:
  plugin: url
  data_fetcher_plugin: http
  data_parser_plugin: json
  pager:
    type: urls
    selector: "pager/next"

And the payload might look something like this

{
  "data": [
    {
      ... etc ...
    }
  ],
  "pager": {
    "next": "https://api.example.com/api/v1/collection?page=3",
    "prev": "https://api.example.com/api/v1/collection?page=1"
  }
}

Additions:

Added support for cursor-based paging

Some APIs (like the one I'm using) don't return full URLs for the next and previous pages. Instead you get a cursor value that you need to insert into the URL yourself. The Twitter API docs explain this technique pretty well and have some examples: https://dev.twitter.com/overview/api/cursoring.

Here's an example of the pager config for this:

source:
  plugin: url
  data_fetcher_plugin: http
  data_parser_plugin: json
  pager:
    type: cursor
    selector: "cursor/next"
    # The key will be used as the parameter name in the url
    key: cursor

And the payload might look like

{
  "data": [
    {
      ... etc ...
    }
  ],
  "cursor": {
    "next": 4,
    "prev": "3"
  }
}
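
As a rough illustration of the cursor technique, the sketch below inserts the cursor value into the query string under the configured key, and returns NULL when the response carries no cursor. next_cursor_url() is a hypothetical helper, not code from the patch, and it drops any URL fragment.

```php
<?php

/**
 * Builds the next request URL for a cursor pager, or NULL on the last page.
 *
 * Hypothetical helper: $key is the pager "key" from the config, $cursor is
 * the value found at the pager "selector" in the decoded response.
 */
function next_cursor_url(string $base_url, string $key, $cursor): ?string {
  // A null, false or empty cursor means there is no further page.
  if ($cursor === NULL || $cursor === FALSE || $cursor === '') {
    return NULL;
  }
  // Collect the existing query parameters, if any.
  $parts = parse_url($base_url);
  $query = [];
  if (isset($parts['query'])) {
    parse_str($parts['query'], $query);
  }
  // Overwrite any cursor left over from the previous request.
  $query[$key] = (string) $cursor;
  // Everything before the first "?" or "#" is scheme, host and path.
  $path = strtok($base_url, '?#');
  return $path . '?' . http_build_query($query);
}

// Made-up URL; the cursor value 4 matches the example payload.
$next = next_cursor_url('https://api.example.com/api/v1/collection?limit=10&cursor=3', 'cursor', 4);
```

Returning NULL for a missing cursor gives the caller a clear signal to stop paging instead of requesting a malformed URL.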

Comment #13

drclaw commented 9 September 2017 at 16:04

Version:	8.x-2.x-dev	» 8.x-4.x-dev
Status:	Needs work	» Needs review

Ack, metadata :)

Comment #14

badrange commented 11 September 2017 at 10:35

Happy to see that this patch has progressed! I just tried it out in our project and it seemed to do its job, running for a long time, until it suddenly stopped with this error message:

Migration failed with source plugin exception: Error message: cURL error 3: <url> malformed (see http://curl.haxx.se/libcurl/c/libcurl-errors.html) at .  [error]

Comment #15

badrange commented 11 September 2017 at 12:05

And it is not surprising that it fails; the last page in our datasource does return meta/next: null when there are no more items. I guess it would make sense for the pager to stop if the data source returns "null" or "false"?

https://api.hel.fi/linkedevents/v1/event/?end=2017-09-12&include=locatio...

Comment #16

badrange commented 11 September 2017 at 12:54

Status	File	Size
new	migrate_plus-support_paging-2640516-16.patch	8.36 KB
new	interdiff-2640516-12-16.txt	752 bytes

A simple check to see if the url is empty might do the trick.

Comment #17

heddn commented 11 September 2017 at 13:45

Status:	Needs review	» Needs work
Issue tags:		+Needs tests

We could use some tests.

Comment #18

drclaw commented 18 September 2017 at 17:37

@badrange I think you're right, a simple check for data should do it!

@heddn Yeah tests would be great. I'll try and get to those in the next couple weeks.

Comment #19

berenddeboer commented 13 November 2017 at 06:34

Status:	Needs work	» Needs review

Status	File	Size
new	migrate_plus-support_paging-2640516-19.patch	9.38 KB

3 files were hidden/shown/deleted

Status	File	Size
hidden	support_paging-2640516-5.patch	8.08 KB
hidden	migrate_plus-support_paging-2640516-12.patch	8.34 KB
hidden	interdiff-2640516.txt	8.44 KB

This patch really should be committed guys. Works great. Have extended it to handle the case where paging is done by page numbers. I.e. no previous/next, just ?page=1 kind of style.

Comment #20

13 November 2017 at 06:42

Status:	Needs review	» Needs work

The last submitted patch, 19: migrate_plus-support_paging-2640516-19.patch, failed testing. View results

Comment #21

svendecabooter commented 21 November 2017 at 08:51

@berenddeboer your patch does not apply, because it is not diffed from the module root directory, but rather from your Drupal installation root.

Comment #22

heddn commented 22 November 2017 at 15:19

Issue tags:		+Needs reroll

I agree this is pretty important. But with the ease of adding a unit or kernel test these days, I'm going to ask we do that. So leaving at NW. Plus there's the need for a re-roll per #21.

Comment #23

mortona2k commented 1 January 2018 at 23:59

Status	File	Size
new	migrate_plus-support_paging-2640516-23.patch	9 KB

Rerolled the patch.

Comment #24

berenddeboer commented 18 January 2018 at 01:05

Saying you need a test is easier said than done; it definitely doesn't appear to be easy. For example, it seems you may need an HTTP source for JSON.

I made a first attempt at such a test using a file uri, see attached file, and the error I get is:

../vendor/bin/phpunit --filter=MigrateHttpJsonCursoringTest
PHPUnit 4.8.36 by Sebastian Bergmann and contributors.

Testing 
F

Time: 9.09 seconds, Memory: 230.00MB

There was 1 failure:

1) Drupal\Tests\migrate_plus\Kernel\MigrateHttpJsonCursoringTest::testTableDestination
Migration failed with source plugin exception: The URI 'file:////home/berend/src/snl/nzad/modules/contrib/migrate_plus/tests/src/Kernel/cursoring.json' is invalid. You must use a valid URI scheme. Use base: for a path, e.g., to a Drupal file that needs the base path. Do not use this for internal paths controlled by Drupal.
Failed asserting that false is true.

/home/berend/src/snl/nzad/core/tests/Drupal/KernelTests/AssertLegacyTrait.php:23
/home/berend/src/snl/nzad/core/modules/migrate/tests/src/Kernel/MigrateTestBase.php:195
/home/berend/src/snl/nzad/core/modules/migrate/src/MigrateExecutable.php:192
/home/berend/src/snl/nzad/modules/contrib/migrate_plus/tests/src/Kernel/MigrateHttpJsonCursoringTest.php:103

FAILURES!
Tests: 1, Assertions: 1, Failures: 1.

Remaining deprecation notices (1)

Drupal\taxonomy\Tests\TaxonomyTestTrait is deprecated in Drupal 8.4.0 and will be removed before Drupal 9.0.0. Instead, use \Drupal\Tests\taxonomy\Functional\TaxonomyTestTrait: 1x
    1x in KernelTestSuite::suite from Drupal\Tests\TestSuites

How on earth do I fix that???

Comment #25

berenddeboer commented 18 January 2018 at 01:11

Status	File	Size
new	migrate_plus-test-case.patch	4.1 KB

3 files were hidden/shown/deleted

Status	File	Size
hidden	migrate_plus-support_paging-2640516-16.patch	8.36 KB
hidden	interdiff-2640516-12-16.txt	752 bytes
hidden	migrate_plus-support_paging-2640516-19.patch	9.38 KB

Oops, here are the files.

Comment #26

berenddeboer commented 18 January 2018 at 01:40

Status	File	Size
new	migrate_plus-support_paging-2640516-26.patch	13.29 KB

1 file was hidden/shown/deleted

Status	File	Size
hidden	migrate_plus-test-case.patch	4.1 KB

OK, let's do this better. Here is a full patch including the test. The test will fail. I have added two comments for testing this against a local webserver, and it passes then.

Comment #27

berenddeboer commented 19 January 2018 at 04:12

Status	File	Size
new	migrate_plus-support_paging-2640516-27.patch	12.96 KB

And another attempt to get a patch that can be applied.

Comment #28

berenddeboer commented 19 January 2018 at 04:13

1 file was hidden/shown/deleted

Status	File	Size
hidden	migrate_plus-support_paging-2640516-26.patch	13.29 KB

Comment #29

dweidner commented 3 September 2018 at 16:39

What is holding this feature back? Looking for something similar to process a large HAL+JSON endpoint. I would like to help, even though I have no experience in contributing to drupal projects.

Comment #30

ressa commented 5 October 2018 at 10:04

Status:	Needs work	» Needs review

I could also use this feature, maybe the patch just needs a review? I am changing status to Needs Review.

Comment #31

heddn commented 22 October 2018 at 15:20

Status:	Needs review	» Needs work
Issue tags:	-Needs tests, -Needs reroll

We now have some tests, but those are failing. And I don't see a lot of docs in this patch on how to configure a json migration with paging. Let's beef things up and get the failing test to pass.

Comment #32

ressa commented 3 November 2018 at 21:59

I wonder if the patch will also add an option for paging through multiple files? This works fine currently:

urls:
  - 'public://migrate_files/result-1-100.json'
  - 'public://migrate_files/result-101-200.json'
  - 'public://migrate_files/result-201-300.json'
  - 'public://migrate_files/result-301-400.json'
...

(I got the tip from Mike Ryan in Using Migrate API with a multi-page / paginated source.)

But if you have hundreds of files, something like this would be better:

urls: 'public://migrate_files/result-($1)-($2).json'
url-offsets: [1,100,4]

Meaning: Start from 1, get 100 records, repeat four times.

Comment #33

m.lebedev commented 7 December 2018 at 08:24

Status:	Needs work	» Needs review

Status	File	Size
new	migrate_plus-support_paging-2640516-33.patch	12.96 KB

For test.

Comment #34

7 December 2018 at 08:31

Status:	Needs review	» Needs work

The last submitted patch, 33: migrate_plus-support_paging-2640516-33.patch, failed testing. View results
- codesniffer_fixes.patch Interdiff of automated coding standards fixes only.

Comment #35

m.lebedev commented 7 December 2018 at 09:07

Status:	Needs work	» Needs review

Status	File	Size
new	migrate_plus-support_paging-2640516-35.patch	13.19 KB

1 file was hidden/shown/deleted

Status	File	Size
hidden	migrate_plus-support_paging-2640516-33.patch	12.96 KB

Rerolled; changed a path of a test file.

Comment #36

7 December 2018 at 09:17

Status:	Needs review	» Needs work

The last submitted patch, 35: migrate_plus-support_paging-2640516-35.patch, failed testing. View results
- codesniffer_fixes.patch Interdiff of automated coding standards fixes only.

Comment #37

m.lebedev commented 7 December 2018 at 09:56

Status:	Needs work	» Needs review

Status	File	Size
new	migrate_plus-support_paging-2640516-37.patch	13.19 KB

1 file was hidden/shown/deleted

Status	File	Size
hidden	migrate_plus-support_paging-2640516-35.patch	13.19 KB

Last try. I took an example from data_parser/JsonTest.php.

Besides, I have another problem:
Notice: Undefined index: next in Drupal\migrate_plus\Plugin\migrate_plus\data_parser\Json->getSourceData()

    foreach ($selectors as $selector) {
      if (!empty($selector)) {
        $return = $return[$selector]; // <-- Notice.
      }
    }

I get data from a resource that may not have a next page link on the last page.
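
One way to avoid the notice, sketched under the assumption that the selector is a slash-separated path as in the pager config: check each key before descending and return NULL when it is missing. select_value() is a hypothetical name, not the patch's code.

```php
<?php

/**
 * Follows a slash-separated selector into decoded JSON data.
 *
 * Hypothetical helper: returns NULL instead of raising a notice when any
 * segment of the selector is missing from the data.
 */
function select_value(array $data, string $selector) {
  $current = $data;
  foreach (explode('/', trim($selector, '/')) as $segment) {
    // Skip empty segments such as those produced by "a//b".
    if ($segment === '') {
      continue;
    }
    // Stop as soon as the path cannot be followed any further.
    if (!is_array($current) || !array_key_exists($segment, $current)) {
      return NULL;
    }
    $current = $current[$segment];
  }
  return $current;
}

// A last page without a "next" key yields NULL rather than a notice.
$last_page = ['list' => [['id' => 1]]];
$next = select_value($last_page, 'next');
```

The caller can then treat NULL as "no more pages" and stop requesting.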

Comment #38

7 December 2018 at 09:34

Status:	Needs review	» Needs work

The last submitted patch, 37: migrate_plus-support_paging-2640516-37.patch, failed testing. View results
- codesniffer_fixes.patch Interdiff of automated coding standards fixes only.

Comment #39

jcandan commented 5 January 2019 at 18:20

Status	File	Size
new	migrate_plus-support_paging-2640516-39.patch	17.28 KB
new	interdiff_37_39.txt	13.64 KB
new	migrate_plus-support_paging-2640516-39.png	386.28 KB

Fixed the failing test, changed names to match test performed, and added appropriate doc-blocks.

Added a new pager type: paginator. This pager type allows one to deal with paginated data where the next URLs cannot be derived from the content. The example configuration below demonstrates how the page_key value would iteratively increase based on the configured paginator_type (e.g. api.example.com/v1/?offset=100&limit=50).

source:
  . . .
  urls: https://api.example.com/v1?format=json
  pager:
    type: paginator
    paginator_type: starting_item  # page_number [default]
    default_num_items: 50  # [required] 
    page_key: offset  # page [default]
    size_key: limit  # pagesize [default]

Options:

paginator_type: Use "page_number" when the pager uses page numbers to determine the item to start at; use "starting_item" when the pager uses the item number to start at.
default_num_items: The first call is made without a size specified; one must therefore define the default size. There is a @todo noted to address this.
page_key: The URL parameter key that should be used for incrementing pages.
size_key: The URL parameter key that specifies the number of items per page.

As I wrote this description, I realized that unless we do add the ability to specify the size on the first run, default_num_items is unnecessary. I may address this in a follow-up patch.

Need help with TESTS
Though I got the tests passing, I do not feel that the test, as supplied by previous patches, tests the available pager options adequately. I think it would be best to inject a mock http client into the parser and iterate over mock endpoints, as this issue addresses. I made a few attempts, but could not figure out how to accomplish this.

This pager type addition was inspired by External Entities, and met my needs perfectly.

[Image: External Entities Pager Settings form]

Comment #40

jcandan commented 5 January 2019 at 18:36

Additionally, I removed this bit of code which allows one to configure the paginator size type. This was also inspired by the External Entities UI (see image above in #39). However, I had a hard time understanding its use-case. If someone feels it necessary to add to the patch, feel free.

  // Use "num_items_per_page" when the pager uses this parameter
  // to determine the amount of items on each page, use "ending_item"
  // when the pager uses this parameter to determine the number of the
  // last item on the page.
  // @todo Handle ending_item pagination size type
  // Set default paginator size type.
  $paginator_size_type_options = ['num_items_per_page', 'ending_item'];
  $paginator_size_type = $paginator_size_type_options[0];
  // Check configured size type.
  if (!empty($this->configuration['pager']['size_type'])) {
    if (!in_array($this->configuration['pager']['size_type'], $paginator_size_type_options)) {
      // Not set to one of the two available options.
      throw new MigrateException(
        'Pager "size_type" must be configured as either "num_items_per_page" or "ending_item" ("num_items_per_page" is default).'
      );
    }
    $paginator_size_type = $this->configuration['pager']['size_type'];
  }
  if ($paginator_size_type === 'ending_item') {
    $next_end = $next_start + $next_end;
  }

Comment #41

jcandan commented 5 January 2019 at 22:11

Status	File	Size
new	migrate_plus-support_paging-2640516-41.patch	17.33 KB
new	interdiff_39_41.txt	1.84 KB

Re-rolled patch #39. Fixed coding standards in patched files.

Comment #42

achap commented 11 January 2019 at 11:59

Hi @jcandan. Thanks for the patch. Is it possible to elaborate on this config key? Maybe an example. It's not really clear to me. "paginator_type Use "page_number" when the pager uses page numbers to determine the item to start at, use "starting_item" when the pager uses the item number to start at."

Otherwise, it is working.

Comment #43

jcandan commented 11 January 2019 at 23:22

@achap, see the URL parameters in the example given in #39. There, the parameter keys are limit and offset. Now, think in terms of how that is paginated: in this case, the offset tells the API endpoint how many items to skip, which is the same as listing the starting item (e.g. with a page size of 10, offset=0 would show items 0-9, offset=10 would show items 10-19, etc.). In another example, the parameter keys might be p for page and ps for page size. The pagination increment is representative of pages of data (p=1, p=2, etc.), in which case paginator_type would be set to page_number. Examples of both of these are https://librivox.org/api/info and http://rijksmuseum.github.io/ respectively. Hope this helps clarify.
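
A minimal sketch of the difference between the two paginator types, assuming the request URL is built by appending the page and size parameters. paginator_url() and its arguments are hypothetical and only mirror the config keys from #39; this is not the patch's implementation.

```php
<?php

/**
 * Builds the request URL for a given page of a "paginator" pager.
 *
 * Hypothetical helper. $page_index counts pages from 0; an API whose first
 * page is numbered 1 would need the caller to pass 1 for the first page
 * when using "page_number".
 */
function paginator_url(string $base_url, string $paginator_type, string $page_key, string $size_key, int $page_size, int $page_index): string {
  // starting_item: the parameter counts items to skip.
  // page_number: the parameter counts pages.
  $value = $paginator_type === 'starting_item' ? $page_index * $page_size : $page_index;
  // Append to an existing query string if the base URL already has one.
  $separator = strpos($base_url, '?') === FALSE ? '?' : '&';
  return $base_url . $separator . http_build_query([
    $page_key => $value,
    $size_key => $page_size,
  ]);
}

// Third page (index 2) of 50 items with offset/limit style parameters.
$offset_url = paginator_url('https://api.example.com/v1?format=json', 'starting_item', 'offset', 'limit', 50, 2);
// Page 3 of 10 items with p/ps style parameters.
$page_url = paginator_url('https://api.example.com/v1', 'page_number', 'p', 'ps', 10, 3);
```

With starting_item the parameter grows by the page size on each request, while with page_number it grows by one.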

Comment #44

achap commented 12 January 2019 at 01:00

@jcandan yes makes perfect sense. Thank you!

Comment #45

jcandan commented 12 January 2019 at 17:37

Status	File	Size
new	interdiff_41_45.txt	551 bytes
new	migrate_plus-support_paging-2640516-45.patch	17.32 KB

Re-rolled patch #41 (a re-roll of #39). Fixed one last missed coding standards item.

Comment #46

jcandan commented 12 January 2019 at 17:38

3 files were hidden/shown/deleted

Status	File	Size
hidden	migrate_plus-support_paging-2640516-39.png	386.28 KB
hidden	migrate_plus-support_paging-2640516-41.patch	17.33 KB
hidden	interdiff_39_41.txt	1.84 KB

Comment #47

jcandan commented 12 January 2019 at 17:39

5 files were hidden/shown/deleted

Status	File	Size
hidden	migrate_plus-support_paging-2640516-23.patch	9 KB
hidden	migrate_plus-support_paging-2640516-27.patch	12.96 KB
hidden	migrate_plus-support_paging-2640516-37.patch	13.19 KB
hidden	migrate_plus-support_paging-2640516-39.patch	17.28 KB
hidden	interdiff_37_39.txt	13.64 KB

Comment #48

sdstyles commented 16 January 2019 at 13:02

Status	File	Size
new	interdiff_45_48.txt	672 bytes
new	migrate_plus-support_paging-2640516-48.patch	16.39 KB

2 files were hidden/shown/deleted

Status	File	Size
hidden	interdiff_41_45.txt	551 bytes
hidden	migrate_plus-support_paging-2640516-45.patch	17.32 KB

Use the data fetcher plugin to check whether the URL is valid; this also works with URLs that require authorization.

Comment #49

sdstyles commented 16 January 2019 at 13:04

Status:	Needs work	» Needs review

Comment #50

audriusb commented 21 January 2019 at 16:04

I am having an issue with the latest patches. They do not return correct data.
See this call:
$data = $this->getSourceData($url, $this->configuration['pager']['selector']);
It passes 2 parameters, but the function is
protected function getSourceData($url)
so the second parameter is completely ignored.

The latest working patch with the 2nd parameter is #37.

Never mind, I missed #39 with the instructions. Does the initial call have to be made without a page limit? What if it returns the whole set then? I am going back to #37 for now.

Comment #51

m.lebedev commented 21 January 2019 at 17:10

Import does not work starting from #39.

source:
  pager:
    type: urls
    selector: "next"

In function getNextUrls:

$data = $this->getSourceData($url, $this->configuration['pager']['selector']);

but the function takes one argument:
protected function getSourceData($url)

How do I set this up for "http://devel.loc/api/v1/request?page=85"?
{
  "self": "http://devel.loc/api/v1/request?page=84",
  "first": "http://devel.loc/api/v1/request?page=0",
  "last": "http://devel.loc/api/v1/request?page=87",
  "prev": "http://devel.loc/api/v1/request?page=83",
  "next": "http://devel.loc/api/v1/request?page=85",
  "list": [
What type of pager should I use?

Comment #52

audriusb commented 22 January 2019 at 09:16

You need to use type: paginator instead of the type: urls you currently use to work with the latest patch. Additional params need to be set, as described in #39.

Comment #53

m.lebedev commented 22 January 2019 at 12:28

ok..

1)
source:
  pager:
    type: paginator
    paginator_type: page_number
    default_num_items: 100
    page_key: page
    size_key: limit

When the parser tries to get the next link after the last page, I get an error:
"Error message: Client error: `GET http://devel.loc/api/v1/request?page=88&limit=100` resulted in a `404 Not Found`"

If this:

// Service may return 404 for last page, ensure next_urls are valid.
foreach ($next_urls as $key => $next_url) {
  $response = $this->getDataFetcherPlugin()->getResponse($next_url);

  if ($response->getStatusCode() !== 200) {
    unset($next_urls[$key]);
  }
}

is changed to this:

// Service may return 404 for last page, ensure next_urls are valid.
foreach ($next_urls as $key => $next_url) {
  try {
    $response = $this->getDataFetcherPlugin()->getResponse($next_url);

    if ($response->getStatusCode() !== 200) {
      unset($next_urls[$key]);
    }
  }
  catch (\Exception $e) {
    unset($next_urls[$key]);
  }
}

then it will work.

2) what about other types of pager? They don't work now

Comment #54

cameron prince commented 5 March 2019 at 15:09

I've tested the patch against a SharePoint API and it seems to work very well for importing paginated responses. The one thing that bugs me is that it seems to break drush ms. When I debug the command, I see that it's paging through the API just like the import, trying to get the total number of rows to display.

Do you think listing the paginated migrations with maybe "N/A" or "---" for their values in the status report would be a good solution?

Comment #55

weynhamz

Ningbo

commented 6 March 2019 at 05:12

Just a heads up: specifically for JSON:API, this issue proposes creating a jsonapi data_parser plugin for JSON:API responses, which also adds support for links/next. Will there be any conflict with the work here?

Comment #56

heddn

English

Nicaragua

commented 6 March 2019 at 14:35

Status:

Needs review

» Needs work

Re #54: there's a skip_count: true option for all source plugins.

+++ b/src/DataFetcherPluginBase.php
@@ -29,4 +29,11 @@ abstract class DataFetcherPluginBase extends PluginBase implements DataFetcherPl
+  public function getNextUrls($url) {

+++ b/src/DataFetcherPluginInterface.php
@@ -47,4 +47,17 @@ interface DataFetcherPluginInterface {
diff --git a/src/DataParserPluginBase.php b/src/DataParserPluginBase.php

I think we need more docs on how to use this feature in the doxygen.

+++ b/src/DataFetcherPluginInterface.php
@@ -47,4 +47,17 @@ interface DataFetcherPluginInterface {
+  public function getNextUrls($url);

I think we need more docs on how to use this feature in the doxygen.

+++ b/tests/src/Kernel/MigrateHttpJsonCursoringTest.php
@@ -0,0 +1,139 @@
+  public function testHttpJsonCursoring() {

I don't see where we are using the pager, as the test data only has a single page.

Comment #57

cameron prince commented 7 March 2019 at 17:05

Thanks @heddn! skip_count: true took care of the issue with the status report.

Comment #58

cameron prince commented 4 April 2019 at 22:03

I'm seeing a few bugs with this now that we're actually testing things and comparing counts. For instance, if the row limit is set to 50, we're seeing missing records. If we set the row limit down to 10, we get all results.

Also, when adding debugging to the Http class, I see that the same URL is queried multiple times. It seems to go forward two steps, then back one, such as: startrow=10, 20, 10, 20, 30, 20, 30, 40, etc.

Comment #59

cameron prince commented 6 May 2019 at 20:14

Another thing we've found with this patch is that when a migration is configured with a pager, it will no longer work if you change the data_fetcher_plugin to file. In our case, we were providing a local .json file as the source for a test. It wasn't obvious that the failure was related, but removing the pager resolved the issue.

Comment #60

m.lebedev commented 16 May 2019 at 07:48

Status	File	Size
new	interdiff_48-60.txt	9.98 KB
new	migrate_plus-support_paging-2640516-60.patch	15.91 KB

I created a patch based on comment #53.

Comment #61

hudri

German

Austria

commented 26 June 2019 at 13:01

The last working patch for me is #37. I believe this is because I need to use the pager: type: urls together with item_selector config key.

Given a common data structure like

{
  data: { ...content payload... }
  links: {
    prev: 'remote api provides full uri to prev data',
    next: 'remote api provides full uri to next data'
  }
}

and a migration config like

source:
  plugin: url
  data_fetcher_plugin: http
  data_parser_plugin: json
  urls: '...'
  item_selector: data
  skip_count: true
  pager:
    type: urls
    selector: 'links/next'

then this patch fails on migrate:import with "URI must be a string or UriInterface". The cause is in src/Plugin/migrate_plus/data_parser/Json.php:

  protected function getNextUrls($url) {
    $next_urls = [];

    if (!empty($this->configuration['pager']['type'])) {
      // The next link can never be found here; the $item_selector parameter from #37 is mandatory.
      $data = $this->getSourceData($url);

getSourceData() can't return the next URL because it is limited to the payload within item_selector.
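
To illustrate the point, here is a minimal sketch that runs outside Drupal. selectByPath() is a hypothetical helper, not the module's getSourceData(); it only shows why a pager selector has to be resolved against the full response rather than against the data already narrowed by item_selector.

```php
<?php

/**
 * Walks a slash-separated selector through a decoded JSON array.
 *
 * Hypothetical helper; returns NULL when any path component is missing.
 */
function selectByPath(array $data, string $selector) {
  $current = $data;
  foreach (explode('/', trim($selector, '/')) as $part) {
    if ($part === '') {
      continue;
    }
    if (!is_array($current) || !array_key_exists($part, $current)) {
      return NULL;
    }
    $current = $current[$part];
  }
  return $current;
}

$response = json_decode(
  '{"data":[{"id":1},{"id":2}],"links":{"prev":null,"next":"https://example.com/api?page=2"}}',
  TRUE
);

// Narrow to the items first, as item_selector: data does.
$items = selectByPath($response, 'data');

// Looking for the pager link inside the items finds nothing.
$next_from_items = is_array($items) ? selectByPath($items, 'links/next') : NULL;

// Looking in the full response finds the link.
$next_from_full = selectByPath($response, 'links/next');

var_dump($next_from_items, $next_from_full);
```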

Comment #62

weekbeforenext

English

Asheville, NC

commented 12 August 2019 at 21:17

Status	File	Size
new	migrate_plus-support_paging-2640516-62.patch	16.44 KB
new	interdiff_60-62.txt	782 bytes

2 files were hidden/shown/deleted

Status	File	Size
hidden	migrate_plus-support_paging-2640516-48.patch	16.39 KB
hidden	interdiff_48-60.txt	9.98 KB

The patch from #60 worked for me, but my API endpoint returns a warning message instead of NULL when there are no more results.

This is a patch that avoids the error:

In Json.php line 124:
                                             
  Passed variable is not an array or object

My configuration for pagination looks like this:

  pager:
    type: paginator
    default_num_items: 20
    paginator_type: starting_item
    page_key: startNum
    size_key: numResults

Comment #63

weekbeforenext

English

Asheville, NC

commented 21 August 2019 at 16:17

Status:

Needs work

» Needs review

Status	File	Size
new	migrate_plus-support_paging-2640516-63.patch	16.84 KB
new	interdiff_62-63.txt	12.04 KB
new	interdiff_37-63.txt	13.52 KB

4 files were hidden/shown/deleted

Status	File	Size
hidden	interdiff_60-62.txt	782 bytes
hidden	migrate_plus-support_paging-2640516-60.patch	15.91 KB
hidden	interdiff_45_48.txt	672 bytes
shown	migrate_plus-support_paging-2640516-37.patch	13.19 KB

I had a need for the 'urls' pager type, so then I experienced the errors you all speak of.

I created a new patch where both the 'urls' and 'paginator' pager types work. I've also included interdiffs between patch #37 and patch #62.

Comment #64

weekbeforenext

English

Asheville, NC

commented 21 August 2019 at 16:20

2 files were hidden/shown/deleted

Status	File	Size
hidden	migrate_plus-support_paging-2640516-62.patch	16.44 KB
hidden	migrate_plus-support_paging-2640516-37.patch	13.19 KB

Comment #65

duaelfr

he/him

French

Montpellier, France

commented 9 September 2019 at 14:33

Status	File	Size
new	interdiff-2640516.63.65.txt	2.17 KB
new	migrate_plus-support_paging-2640516-65.patch	16.87 KB

I found out that there was an issue if the current page number was "0".
Here is the fix.

Comment #66

heddn

English

Nicaragua

commented 9 September 2019 at 22:19

I wonder whether we would be better off if this piggy-backed on the work in #3040427: Allow callback for Url source, and single item Json plugin? We could build a default callback for Drupal's JSON on top of that.

Comment #67

duaelfr

he/him

French

Montpellier, France

commented 10 September 2019 at 08:48

@heddn Thanks for your feedback. I believe both features could work together.
In my current case, I need to build 20K+ URLs, so the other issue would be perfect, but each of these URLs can be paginated and I cannot know whether it is before reading it (the page count is in the response).

Comment #68

duaelfr

he/him

French

Montpellier, France

commented 10 September 2019 at 15:19

Version:

8.x-4.x-dev

» 8.x-5.x-dev

Status	File	Size
new	interdiff-2640516.65.68.txt	1.58 KB
new	migrate_plus-support_paging-2640516-68.patch	17.2 KB

2 files were hidden/shown/deleted

Status	File	Size
hidden	interdiff-2640516.63.65.txt	2.17 KB
hidden	migrate_plus-support_paging-2640516-65.patch	16.87 KB

Bumped version to 5.x and added a way to select the max page from the source in "page" mode.
I feel like we should find a cleaner way to handle these use cases. We might need a new plugin type, don't you think?

Comment #69

pixlkat commented 12 September 2019 at 13:44

Status	File	Size
new	migrate_plus-support_paging-2640516-69.patch	17.76 KB
new	interdiff-2640516.68.69.txt	2.06 KB

In reference to the comment in #58: while debugging this, I discovered that if you set the `default_num_items` config variable to anything greater than the default number of rows returned by the API, rows get skipped. The *initial* URL is not built with the pager values, so the first response contains only the API's default number of rows while the pager advances by the size you specified, and the difference between the two is never fetched. I got around this by setting the page size in my initial URL.

The API we are using pagination with returns the number of rows in the response as part of the JSON. I've updated the patch in 68 to use the `selector` configuration option to return that value and use it to decide if I need to add another URL to the array. I realize this may be somewhat of a snowflake, but it eliminated the need to request the URL to decide if it should be added. Our API will return a 200 whether or not there are any results or even an error in the query.
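
The row-count check described above can be sketched roughly like this. It is only an illustration under my own assumptions: hasMorePages() and the rowCount key are hypothetical, and the rule used (a full page suggests more rows, a short or empty page is the last one) is one possible reading, not necessarily what the patch does.

```php
<?php

/**
 * Decides whether another page is worth requesting, based on a row count
 * reported in the response body.
 *
 * Hypothetical helper: a page that came back full suggests more rows may
 * follow; a short, empty, or count-less response is treated as the end.
 */
function hasMorePages(array $response, string $countKey, int $pageSize): bool {
  $returned = (int) ($response[$countKey] ?? 0);
  return $returned > 0 && $returned >= $pageSize;
}

// Full page: keep going.
var_dump(hasMorePages(['rowCount' => 50], 'rowCount', 50));

// Short page: this was the last one.
var_dump(hasMorePages(['rowCount' => 12], 'rowCount', 50));

// A 200 response that only carries an error message: stop.
var_dump(hasMorePages(['error' => 'bad query'], 'rowCount', 50));
```

The advantage is that no extra request is needed just to find out whether the next URL is valid.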

I would also support the idea of using a callback to enable pager support rather than trying to create a patch that is all things to all people.

Comment #70

prudloff commented 31 December 2019 at 17:56

Status	File	Size
new	migrate_plus-support_paging-2640516-70.patch	18.15 KB

The patch in #69 was very useful to us!

However, I found two remaining issues:

- When using a pager on an API endpoint that implements rel="next" in Link headers, the pager behaves strangely, because Json::getNextUrls() and Http::getNextUrls() both want to add the same pages.
- When using an item_selector that returns some empty items, the pager goes to the next page even if the current page is not finished. The attached patch fixed this issue for us.

Comment #71

robin.ingelbrecht commented 3 March 2020 at 11:53

Status:

Needs review

» Needs work

I found another use case where this patch crashes. I have multiple URLs set up and each of them has multiple pages.
The following line in DataParserPluginBase::addNextUrls() leaves the array keys non-sequential, which results in a NULL URL in DataParserPluginBase::nextSource():

      $this->urls = array_unique($this->urls);

Adding the following fixes the problem:

      $this->urls = array_unique($this->urls);
      $this->urls = array_values($this->urls);
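
The effect is easy to reproduce in plain PHP. This is a standalone sketch with made-up URLs, not code from the patch:

```php
<?php

$urls = [
  'https://example.com/a?page=1',
  'https://example.com/b?page=1',
  'https://example.com/a?page=1',
  'https://example.com/a?page=2',
];

// array_unique() drops the duplicate but keeps the original keys,
// so index 2 no longer exists.
$unique = array_unique($urls);
var_dump(array_keys($unique));
var_dump(isset($unique[2]));

// array_values() re-indexes the array so the keys are sequential again.
$reindexed = array_values($unique);
var_dump(array_keys($reindexed));
```

Code that steps through the list by incrementing an integer index hits the missing key 2 in the first array, which matches the NULL URL described above.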

Comment #72

robin.ingelbrecht commented 3 March 2020 at 11:55

Status:

Needs work

» Needs review

Status	File	Size
new	migrate_plus-support_paging-2640516-72.patch	18.15 KB

Comment #73

axel80 commented 21 September 2020 at 17:05

Hi, thanks for the very useful patch.
I ran some tests on patch #72 with the YouTube Data API v3, which uses a cursor for pagination.
The available cursors are stored in the following JSON fields:

"nextPageToken": for next available page
"prevPageToken": for the previous available page

A cursor is NOT returned if the corresponding page is not available (i.e. the last API call doesn't have "nextPageToken" in the response).

I ran a test with the following configuration:

pager:
    type: cursor
    selector: nextPageToken
    key: pageToken

If I run the drush command, the import executes successfully with the info message [info] Undefined index: nextPageToken Json.php:83

$ drush migrate:import youtube_video_gam_json --verbose
 [info] Undefined index: nextPageToken Json.php:83

  16/164 [==>-------------------------]   9% 7 secs
  32/164 [=====>----------------------]  19% 10 secs
  48/164 [========>-------------------]  29% 13 secs
  64/164 [==========>-----------------]  39% 17 secs
  80/164 [=============>--------------]  48% 19 secs
  96/164 [================>-----------]  58% 23 secs
 112/164 [===================>--------]  68% 26 secs
 128/164 [=====================>------]  78% 29 secs
 144/164 [========================>---]  87% 32 secs [info] Undefined index: nextPageToken Json.php:83
 [notice] Processed 154 items (154 created, 0 updated, 0 failed, 0 ignored) - done with 'youtube_video_gam_json'

The items not imported are duplicated URLs that are skipped (the unique constraint is not satisfied).

The same test executed from the UI fails after importing the first 2 items. This is what I get on the page:

An AJAX HTTP error occurred.
HTTP Result Code: 200
Debugging information follows.
Path: /en/batch?id=14&op=do_nojs&op=do
StatusText: OK
ResponseText: 
Notice:  Undefined index: nextPageToken in /Users/alessandro.senatore/sites/app4gym/web/modules/contrib/migrate_plus/src/Plugin/migrate_plus/data_parser/Json.php on line 83
{"status":true,"percentage":"1","message":"Migrating \u003Cem class=\u0022placeholder\u0022\u003EYuotube Playlist GAM - JSON\u003C\/em\u003E","label":"Importing \u003Cem class=\u0022placeholder\u0022\u003EYuotube Playlist GAM - JSON\u003C\/em\u003E (1%)."}
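
The "Undefined index: nextPageToken" notice above comes from reading a key that the last page does not contain. A minimal sketch of a guarded lookup follows; nextCursorUrl() is a hypothetical helper, the URL and token are made up, and none of it is code from the patch.

```php
<?php

/**
 * Builds the URL for the next page of a cursor-paged API.
 *
 * Hypothetical helper; returns NULL when the response carries no cursor.
 */
function nextCursorUrl(string $url, array $response, string $selector, string $key) {
  // The null coalescing operator avoids an "Undefined index" notice when
  // the last page omits the cursor field.
  $cursor = $response[$selector] ?? NULL;
  if ($cursor === NULL || $cursor === '') {
    return NULL;
  }
  $parts = parse_url($url);
  $query = [];
  parse_str($parts['query'] ?? '', $query);
  // Add or replace the cursor parameter, keeping the other parameters.
  $query[$key] = $cursor;
  return $parts['scheme'] . '://' . $parts['host'] . ($parts['path'] ?? '')
    . '?' . http_build_query($query, '', '&');
}

$base = 'https://api.example.com/playlistItems?part=snippet&maxResults=16';

// A page that has a successor.
$next = nextCursorUrl($base, ['nextPageToken' => 'CBAQAA'], 'nextPageToken', 'pageToken');

// The last page: no cursor in the response, so no next URL and no notice.
$last = nextCursorUrl($base, ['items' => []], 'nextPageToken', 'pageToken');

var_dump($next, $last);
```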

Comment #74

azussman commented 23 September 2020 at 18:58

Is anyone else having issues with the paginator? I am trying to map a Drupal 7 content type to a Drupal 8 content type. I am using Views Datasource to create a JSON endpoint on my Drupal 7 CMS. I've created the endpoint and have successfully mapped the fields into my Drupal 8 site. This works great with my test data, but I am looking at a massive migration of 300K+ items, so I want to use pagination. The JSON endpoint provides this snippet to use:

pager: {
     pages: 2,
     page: 0,
     count: 4,
     limit: 2
}

I tried using this data to map onto the paginator as such:

  pager:
    type: paginator
    paginator_type: starting_item
    default_num_items: 'pager/count'
    page_key: 'pager/page'
    size_key: 'pager/limit'

This returns a warning `A non-numeric value encountered Json.php:255`. It does return page 1, but it does not fetch page 2. If I hardcode the values into all those fields, the migration stops working altogether and just sits idle, returning no error message or feedback about what could be hanging the request.

Any help or information around this would be greatly appreciated.

Comment #75

ruslan piskarov

he/him

Ukrainian

Kyiv, Ukraine

commented 14 October 2020 at 08:37

Also, the following code does not always work:

if ($response->getStatusCode() !== 200) {
  unset($next_urls[$key]);
}

In my case I get the following result with status 200:

{
    "item_type": "asset",
    "total_count": 253,
    "offset": 5200,
    "limit": 100,
    "items": []
}

We also need to check whether the "item_selector" (in my case "items") is empty, and if so call unset($next_urls[$key]);

Comment #76

ruslan piskarov

he/him

Ukrainian

Kyiv, Ukraine

commented 5 November 2020 at 09:32

Status	File	Size
new	migrate_plus-support_paging-2640516-76.patch	18.53 KB

I have updated the #72 patch for the case where an API returns a 200 status code but with an empty 'item_selector'.
For some reason, I can't create an interdiff locally.
I updated the following:

if ($response->getStatusCode() !== 200) {
  unset($next_urls[$key]);
}
else {
  // In some cases, the API returns a 200 status code
  // but with an empty 'item_selector'.
  $sourceData = $this->getSourceData($next_url, $this->itemSelector);
  // empty() is also TRUE for NULL, so no separate is_null() check is needed.
  if (empty($sourceData)) {
    unset($next_urls[$key]);
  }
}
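
The same idea as a standalone sketch, using the response from #75. pageHasItems() is a hypothetical helper for illustration only and assumes the items sit under a single top-level key:

```php
<?php

/**
 * Returns TRUE when a decoded response has at least one item under the
 * given top-level key. Hypothetical helper for illustration.
 */
function pageHasItems(array $response, string $itemKey): bool {
  return !empty($response[$itemKey]) && is_array($response[$itemKey]);
}

// A 200 response past the end of the data set: valid JSON, no items.
$past_end = json_decode(
  '{"item_type":"asset","total_count":253,"offset":5200,"limit":100,"items":[]}',
  TRUE
);

// A regular page with one item.
$regular = json_decode(
  '{"item_type":"asset","total_count":253,"offset":0,"limit":100,"items":[{"id":1}]}',
  TRUE
);

var_dump(pageHasItems($past_end, 'items'));
var_dump(pageHasItems($regular, 'items'));
```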

Comment #77

12 November 2020 at 13:09

Ruslan Piskarov opened merge request !1

Comment #78

10 February 2021 at 13:07

segi made their first commit to this issue’s fork.

Comment #79

segi commented 10 February 2021 at 13:42

Status	File	Size
new	migrate_plus-support_paging-2640516-78.patch	18.64 KB
new	interdiff-78.txt	694 bytes

I found a bug when I used patch #76.

Notice: Undefined index: next in Drupal\migrate_plus\Plugin\migrate_plus\data_parser\Json->getSourceData() (line 83 of .../modules/contrib/migrate_plus/src/Plugin/migrate_plus/data_parser/Json.php)

I fixed it in a new patch, and I also pulled the latest changes from the original repo into the MR.

Comment #80

JeremyFrench commented 10 March 2021 at 11:50

I ran into an issue using this patch with a target of a D7 restws site. Trying to pull back terms in a vocabulary ended up with an encoding error on the URL.

https://www.example.com/taxonomy_term?vocabulary=5&page=1

In the page, this was being returned in the json as a Unicode code (which may be the issue and nothing to do with this patch).

,"next":"https:\/\/www.example.com\/taxonomy_term?vocabulary=5\u0026page=1",

The pager config is like this

 pager:
    type: urls
    selector: 'next'

Trying to call the page with &amp; in the URL gives a 404.
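
For reference, a small standalone check of how these escapes behave in plain PHP. It does not show where the &amp; gets introduced in the migration; it only shows that the \u0026 escape by itself decodes cleanly, and how an HTML-escaped URL could be repaired if that turns out to be the cause.

```php
<?php

$json = '{"next":"https:\/\/www.example.com\/taxonomy_term?vocabulary=5\u0026page=1"}';
$decoded = json_decode($json, TRUE);

// json_decode() turns the \u0026 escape back into a literal ampersand.
echo $decoded['next'], PHP_EOL;

// If a URL has been HTML-escaped somewhere along the way, decoding the
// entities restores a requestable URL.
$escaped = 'https://www.example.com/taxonomy_term?vocabulary=5&amp;page=1';
$repaired = html_entity_decode($escaped);
echo $repaired, PHP_EOL;
```

Both lines print the same URL with a plain & between the parameters.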

Comment #81

szloredan commented 6 May 2021 at 17:26

Status	File	Size
new	migrate_plus-support_paging-2640516-81.patch	18.79 KB
new	interdiff-81.txt	607 bytes

I have a case where I need the sourceData parsed from the URLs, so a getter would be useful to avoid running the fetch again for the same URL. I added a patch for this.

Comment #82

jcandan commented 25 August 2021 at 16:18

RE: #74, @azussman:

The paginator pager type is meant to specify the parameters passed in the URL, not the item selector path to read pagination metadata. The example given in #39 shows how URL parameters might be used in pagination, e.g. ?offset=100&limit=50 puts you at page three. And the pager configuration example works to iterate through that pagination without the next link supplied as seen in these other pager types.
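
As a rough sketch of that idea: pagedUrl() below is a hypothetical helper, and the zero-based page index and the exact meaning of page_number versus starting_item are my assumptions, not taken from the patch.

```php
<?php

/**
 * Builds the URL for a zero-based page index.
 *
 * Hypothetical helper: 'page_number' sends the page index itself,
 * 'starting_item' sends the offset of the first row on that page.
 */
function pagedUrl(string $base, string $pageKey, string $sizeKey, int $pageIndex, int $size, string $type): string {
  $position = $type === 'starting_item' ? $pageIndex * $size : $pageIndex;
  $separator = strpos($base, '?') === FALSE ? '?' : '&';
  return $base . $separator . http_build_query([$pageKey => $position, $sizeKey => $size], '', '&');
}

// Third page (index 2) of 50 rows each, addressed by offset.
$by_offset = pagedUrl('https://example.com/api/items', 'offset', 'limit', 2, 50, 'starting_item');

// The same page addressed by page number, on a URL that already has a query.
$by_page = pagedUrl('https://example.com/api/items?status=open', 'page', 'limit', 2, 50, 'page_number');

echo $by_offset, PHP_EOL, $by_page, PHP_EOL;
```

So page_key and size_key name URL parameters; they are not selectors into the response body.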

Comment #83

ronaldmulero commented 16 September 2021 at 22:36

Status	File	Size
new	migrate_plus-support_paging-2640516-83.patch	18.85 KB
new	reroll_diff_81-83.txt	5.7 KB

Re-roll of #81 against the latest 8.x-5.x-dev branch.

Comment #84

16 September 2021 at 22:41

Status:

Needs review

» Needs work

The last submitted patch, 83: migrate_plus-support_paging-2640516-83.patch, failed testing. View results

Comment #85

roam2345 commented 17 September 2021 at 13:57

@ronaldmulero thank you kindly for that. We are actively working with this issue this week to consume an endpoint that requires it. Looking through the patch, I don't see any documentation on the YAML definition required to use it. Having tried some of the options put forward in this thread, I'm still struggling to get it to work.

Could that be added to the README? Or could an example be added somewhere in this module?

Thanks.
Lathan

Comment #86

ronaldmulero commented 19 September 2021 at 13:38

Status	File	Size
new	migrate_plus-support_paging-2640516-86.patch	19.36 KB
new	reroll_diff_83-86.txt	2.77 KB

@lathan See #39 for @jcandan's example of how to use the patch.

Rerolled the patch to fix that failing test:
see reroll_diff_83-86.txt

Only 2 changes since #83:

@@ -217,9 +217,6 @@ of Json.php: That else block replaces the entire source array with NULL, which can't be right.
@@ -14,7 +14,7 @@ of JsonTest.php: property must be declared protected.

Comment #87

ronaldmulero commented 19 September 2021 at 14:25

Status:

Needs work

» Needs review

Comment #88

scott_euser commented 20 September 2021 at 12:41

Status	File	Size
new	2021-09-20_13-38.png	6.9 KB
new	migrate_plus-support_paging-2640516-88.patch	19.35 KB
new	interdiff-2640516-86-88.txt	505 bytes

Thanks everyone for the work on this - very helpful!

+++ b/src/Plugin/migrate_plus/data_parser/Json.php
@@ -22,43 +25,76 @@ class Json extends DataParserPluginBase implements ContainerFactoryPluginInterfa
     // Otherwise, we're using xpath-like selectors.
-    $selectors = explode('/', trim($this->itemSelector, '/'));
+    $selectors = explode('/', trim($item_selector, '/'));
+    $return = $this->sourceData;
     foreach ($selectors as $selector) {
-      if (!empty($selector) || $selector === '0') {
-        $source_data = $source_data[$selector];
+      if (!empty($selector) && !empty($return[$selector])) {
+        $return = $return[$selector];
       }
     }
-    return $source_data;
+    return $return;

When testing this with a D7 Restful Web Services site https://www.drupal.org/project/restws I noticed that the first page does not have a 'prev' and the last page does not have a 'next'. With the keys missing the entire JSON is returned instead for the pagination. The entire JSON is of course not a URL so the migration will error with "URI must be a string or UriInterface"

Screenshot of error message - URI must be a string or UriInterface

Updated patch to ensure that getSourceData by selector will only return data for that selector and if not found, then returns NULL instead.
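
The difference between the two behaviours can be seen in a standalone sketch. lenientSelect() mirrors the loop from the hunk quoted above, and strictSelect() is my own hypothetical illustration of "return NULL when not found"; neither is the exact code in the patch.

```php
<?php

/**
 * Skips selector parts that are missing, as in the quoted hunk.
 */
function lenientSelect(array $data, string $selector) {
  $return = $data;
  foreach (explode('/', trim($selector, '/')) as $part) {
    if (!empty($part) && !empty($return[$part])) {
      $return = $return[$part];
    }
  }
  return $return;
}

/**
 * Returns NULL as soon as a selector part is missing (hypothetical).
 */
function strictSelect(array $data, string $selector) {
  $return = $data;
  foreach (explode('/', trim($selector, '/')) as $part) {
    if ($part === '') {
      continue;
    }
    if (!is_array($return) || !array_key_exists($part, $return)) {
      return NULL;
    }
    $return = $return[$part];
  }
  return $return;
}

// Last page of a paged response: there is a 'prev' link but no 'next'.
$last_page = [
  'list' => [['id' => 7]],
  'prev' => 'https://www.example.com/node?page=3',
];

// The lenient version hands back the whole response instead of a URL.
var_dump(lenientSelect($last_page, 'next') === $last_page);

// The strict version reports that there is no next page.
var_dump(strictSelect($last_page, 'next'));
```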

Comment #89

20 September 2021 at 12:48

Status:

Needs review

» Needs work

The last submitted patch, 88: migrate_plus-support_paging-2640516-88.patch, failed testing. View results

Comment #90

scott_euser commented 21 September 2021 at 08:00

Status:

Needs work

» Needs review

Status	File	Size
new	interdiff-2640516-88-90.txt	438 bytes

Updated patch to not assume that a selector has been provided.

Comment #91

scott_euser commented 21 September 2021 at 08:01

Status	File	Size
new	migrate_plus-support_paging-2640516-90.patch	19.49 KB
new	interdiff-2640516-88-90.txt	438 bytes

Comment #92

21 September 2021 at 08:05

Status:

Needs review

» Needs work

The last submitted patch, 91: migrate_plus-support_paging-2640516-90.patch, failed testing. View results

Comment #93

scott_euser commented 21 September 2021 at 08:40

Status:

Needs work

» Needs review

Status	File	Size
new	interdiff-2640516-90-92.txt	329 bytes
new	migrate_plus-support_paging-2640516-92.patch	19.49 KB

Sorry for the spam, obviously need a bit more coffee this morning!

Comment #94

hmdnawaz commented 23 November 2021 at 08:08

Status	File	Size
new	support-paging-through-multiple-requests-2640516-94.patch	19.18 KB

I have version 5.1 installed but the patch in #93 is not applying cleanly.

So I have rerolled the patch for that version.

My migration configurations are:

pager:
    type: urls
    selector: '@odata.nextLink'

Comment #95

peter törnstrand commented 14 December 2021 at 12:13

Status	File	Size
new	migrate_plus-pager_selector_error-2640516.patch	912 bytes

If the pager selector contains a path with more than one component, the pager URLs returned are wrong.

For example, with a pager selector like _links/next/href the returned value contains all data under _links.

Comment #96

KevinVanRansbeeck commented 29 December 2021 at 16:10

Status	File	Size
new	migrate_plus-pager_selector_error-2640516-96.patch	17.59 KB

Re-roll against latest 8.x-5.x branch

Comment #97

29 December 2021 at 16:14

Status:

Needs review

» Needs work

The last submitted patch, 96: migrate_plus-pager_selector_error-2640516-96.patch, failed testing. View results
- codesniffer_fixes.patch Interdiff of automated coding standards fixes only.

Comment #98

sleopold commented 18 January 2022 at 14:53

Status	File	Size
new	migrate_plus-pager_selector_error-2640516-98.patch	18.08 KB

Previous patch resulted in an `ArgumentCountError`

Comment #99

apmsooner commented 13 April 2022 at 14:17

Patch in #98 works pretty well for me except I get an error on the migrations overview page:

Notice: Undefined index: next in Drupal\migrate_plus\Plugin\migrate_plus\data_parser\Json->getSourceData() (line 82 of /var/www/web/modules/composer/migrate_plus/src/Plugin/migrate_plus/data_parser/Json.php)

Comment #100

rob230 commented 29 April 2022 at 17:54

Status	File	Size
new	migrate_plus-pager_selector_error-2640516-100.patch	0 bytes
new	interdiff-2640516-98-100.txt	1.78 KB

Patch #98 has a few issues arising from parameters being used instead of a class variable - this was not replaced in all areas.

We also have the issue that when there are not multiple pages, JSON:API doesn't output a links/next section, so I've added a check for this, which means it won't try to load a next URL if the xpath for it isn't found. Pagination has to end, so eventually there won't be a next URL. This may also fix your undefined index error, @apmsooner.

Comment #101

rob230 commented 29 April 2022 at 17:55

Status:

Needs work

» Needs review

Comment #102

rob230 commented 29 April 2022 at 18:05

Status	File	Size
new	migrate_plus-pager_selector_error-2640516-102.patch	19.35 KB

Not sure why the patch didn't upload before, hopefully this time it works.

Comment #103

29 April 2022 at 18:10

Status:

Needs review

» Needs work

The last submitted patch, 102: migrate_plus-pager_selector_error-2640516-102.patch, failed testing. View results
- codesniffer_fixes.patch Interdiff of automated coding standards fixes only.

Comment #104

samerali commented 1 May 2022 at 04:59

Thanks for the patch @rob230. Any chance we can get this applied to 6.x? I tried applying it and it would not apply.

Comment #105

apmsooner commented 6 June 2022 at 18:46

The patch in #102 didn't work for me when applied against the 5.x version. It returned no results. #98 is still working for me aside from the visual errors.

Comment #106

apmsooner commented 6 June 2022 at 18:59

FYI, if anyone is trying to migrate from WordPress REST APIs, I finally worked out a solution; here is an example. The pager type is simply "page" and the defaults work.

source:
  plugin: url
  data_fetcher_plugin: http
  data_parser_plugin: json
  urls: 'https://www.{thewordpressdomain}.com/wp-json/wp/v2/media?media_type=image&_fields=id,data,guid,modified,slug,title,caption,alt_text,media_type,mime_type,media_details,source_url&per_page=100'
  pager:
    type: page
  item_selector: /

You may also need to set authentication if the REST APIs are restricted on the WordPress site.

Comment #107

monymirza

Urdu

Islamabad

commented 2 August 2022 at 11:16

Version:	8.x-5.x-dev	» 6.0.x-dev
Status:	Needs work	» Needs review

Status	File	Size
new	migrate_plus-pager_selector_error-2640516-107.patch	15.1 KB

Re-roll against latest 6.0.x dev branch

Comment #108

2 August 2022 at 11:19

Status:

Needs review

» Needs work

The last submitted patch, 107: migrate_plus-pager_selector_error-2640516-107.patch, failed testing. View results

Comment #109

auth commented 26 September 2022 at 08:39

Status	File	Size
new	migrate_plus-pager_selector_error-2640516-108---altered-on-line-202.patch	15.08 KB

Attaching a modification to the patch in #107 where we use is_numeric instead of is_int to determine if the selector should be used to select depth instead of as an xpath-like selector.
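
The difference matters when the selector arrives as a string. A quick standalone check (whether the configured value is actually a string depends on how the YAML was written and stored, which is an assumption here):

```php
<?php

// A depth selector may arrive as the string '1' rather than the integer 1.
$from_string = '1';
$from_int = 1;
$path = 'links/next';

// is_int() only accepts real integers, so the string '1' is rejected.
var_dump(is_int($from_string));

// is_numeric() accepts integers and numeric strings alike.
var_dump(is_numeric($from_string));
var_dump(is_numeric($from_int));

// An xpath-like selector is still recognised as non-numeric.
var_dump(is_numeric($path));
```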

Comment #110

ricovandevin commented 18 November 2022 at 11:03

Status:

Needs work

» Needs review

Triggering tests.

Comment #111

ricovandevin commented 18 November 2022 at 12:21

Status:

Needs review

» Needs work

Comment #112

quadbyte commented 3 December 2022 at 15:47

Status:

Needs work

» Needs review

Status	File	Size
new	migrate_plus-pager_selector_error-2640516-112.patch	15.3 KB

Reapplied #109 to the latest 6.0.x-dev and improved the code for the "paginator" pager type to support non-scalar selectors.

Comment #113

bramdriesen

he/him

Dutch

Belgium 🇧🇪🇪🇺

commented 5 December 2022 at 06:51

Status:

Needs review

» Needs work

Patch does not apply

Comment #114

quadbyte commented 5 December 2022 at 12:27

Status	File	Size
new	migrate_plus-pager_selector_error-2640516.patch	15.33 KB

Fixed patch prefix.

Comment #115

bramdriesen

he/him

Dutch

Belgium 🇧🇪🇪🇺

commented 5 December 2022 at 13:08

Status:

Needs work

» Needs review

Setting to needs review to trigger tests :)

Comment #116

5 December 2022 at 13:13

Status:

Needs review

» Needs work

The last submitted patch, 114: migrate_plus-pager_selector_error-2640516.patch, failed testing. View results
- codesniffer_fixes.patch Interdiff of automated coding standards fixes only.

Comment #117

heddn

English

Nicaragua

commented 30 December 2022 at 18:19

Issue tags:

+Needs tests

This really needs some tests. Any chance we can add some?

Comment #118

slayne40 commented 26 January 2023 at 12:13

Status	File	Size
new	migrate_plus-pager_selector_error-2640516-118.patch	15.39 KB

For pager type urls, verify that the selector returns a URL.

Comment #119

kekkis

he/him

Finnish

Pirkkala

commented 9 March 2023 at 11:35

Status	File	Size
new	interdiff-118-119.txt	1.01 KB
new	migrate_plus-pager_selector_error-2640516-119.patch	15.6 KB

I believe the patches starting from #112 contain an error when using the 'paginator' type and when the $selector_data is scalar. The comparison operator between $num_items and $selector_data has shifted from '==' to '!=' for an unknown reason. How this manifests is that only the first page ever gets processed.

In this new patch I try to explain the thinking in the comment a bit better, plus of course fix the operator.

Leaving as Needs work since there still are no tests.

Comment #120

sassafrass commented 11 July 2023 at 16:32

I am testing using the latest patch. In my yaml, I am using the paginator type because the urls provided in the JSON endpoints are not valid for my use case.

  urls:
    - http://services.baltimorecountymd.gov/api/hub/pets/pets?status=Adoptable
    - http://services.baltimorecountymd.gov/api/hub/pets/pets?status=Lost
  pager:
    type: paginator
    selector: /metaData/
    paginator_type: page_number
    default_num_items: 10
    page_key: page
    size_key: recordsPerPage

The paginated urls are generated as expected. However, it never stops generating urls. Urls being generated are valid but have no records. For example see: https://services.baltimorecountymd.gov/api/hub/pets/pets?status=Adoptabl....

My particular use case has non-scalar $selector_data, but the JSON is not in the format expected by this segment of code, whose condition always evaluates to true.

     else {
            // If we have an array of rows
            if (count($selector_data) > 0) {
              $next_urls[] = Url::fromUri($path['path'], [
                'query' => $path['query'],
                'fragment' => $path['fragment'],
              ])->toString();
            }
          }
        }

Comment #121

jcandan commented 26 September 2023 at 16:47

Status	File	Size
new	migrate_plus-support-paging-2640516-121.patch	16.81 KB
new	interdiff_119_121.txt	3.86 KB

40 files were hidden/shown/deleted

Status	File	Size
hidden	interdiff_62-63.txt	12.04 KB
hidden	interdiff_37-63.txt	13.52 KB
hidden	migrate_plus-support_paging-2640516-63.patch	16.84 KB
hidden	interdiff-2640516.65.68.txt	1.58 KB
hidden	migrate_plus-support_paging-2640516-68.patch	17.2 KB
hidden	migrate_plus-support_paging-2640516-69.patch	17.76 KB
hidden	interdiff-2640516.68.69.txt	2.06 KB
hidden	migrate_plus-support_paging-2640516-70.patch	18.15 KB
hidden	migrate_plus-support_paging-2640516-72.patch	18.15 KB
hidden	migrate_plus-support_paging-2640516-76.patch	18.53 KB
hidden	migrate_plus-support_paging-2640516-78.patch	18.64 KB
hidden	interdiff-78.txt	694 bytes
hidden	migrate_plus-support_paging-2640516-81.patch	18.79 KB
hidden	interdiff-81.txt	607 bytes
hidden	migrate_plus-support_paging-2640516-83.patch	18.85 KB
hidden	reroll_diff_81-83.txt	5.7 KB
hidden	migrate_plus-support_paging-2640516-86.patch	19.36 KB
hidden	reroll_diff_83-86.txt	2.77 KB
hidden	2021-09-20_13-38.png	6.9 KB
hidden	migrate_plus-support_paging-2640516-88.patch	19.35 KB
hidden	interdiff-2640516-86-88.txt	505 bytes
hidden	interdiff-2640516-88-90.txt	438 bytes
hidden	migrate_plus-support_paging-2640516-90.patch	19.49 KB
hidden	interdiff-2640516-88-90.txt	438 bytes
hidden	interdiff-2640516-90-92.txt	329 bytes
hidden	migrate_plus-support_paging-2640516-92.patch	19.49 KB
hidden	support-paging-through-multiple-requests-2640516-94.patch	19.18 KB
hidden	migrate_plus-pager_selector_error-2640516.patch	912 bytes
hidden	migrate_plus-pager_selector_error-2640516-96.patch	17.59 KB
hidden	migrate_plus-pager_selector_error-2640516-98.patch	18.08 KB
hidden	migrate_plus-pager_selector_error-2640516-100.patch	0 bytes
hidden	interdiff-2640516-98-100.txt	1.78 KB
hidden	migrate_plus-pager_selector_error-2640516-102.patch	19.35 KB
hidden	migrate_plus-pager_selector_error-2640516-107.patch	15.1 KB
hidden	migrate_plus-pager_selector_error-2640516-108---altered-on-line-202.patch	15.08 KB
hidden	migrate_plus-pager_selector_error-2640516-112.patch	15.3 KB
hidden	migrate_plus-pager_selector_error-2640516.patch	15.33 KB
hidden	migrate_plus-pager_selector_error-2640516-118.patch	15.39 KB
hidden	interdiff-118-119.txt	1.01 KB
hidden	migrate_plus-pager_selector_error-2640516-119.patch	15.6 KB

Re-roll #119 for latest 6.0.x-dev.

Comment #122

26 September 2023 at 16:48

jcandan opened merge request !81

Comment #123

chuyenlv commented 13 October 2023 at 09:51

Status	File	Size
new	migrate_plus-support-paging-2640516-123.patch	16.8 KB

Re-roll #121 for 6.0.x, Drupal 10 and PHP 8.1

Comment #124

kekkis commented 23 October 2023 at 09:41

Status	File	Size
new	migrate_plus-support-paging-2640516-124.patch	16.97 KB
new	interdiff-123-124.txt	1021 bytes

2 files were hidden/shown/deleted

Status	File	Size
hidden	migrate_plus-support-paging-2640516-121.patch	16.81 KB
hidden	migrate_plus-support-paging-2640516-123.patch	16.8 KB

The return type of \Drupal\migrate_plus\Plugin\migrate_plus\data_parser\Json::getSourceData cannot reliably be declared as array, since the method might return NULL (per its own code) or any scalar (from json_decode). This is my suggestion for how to fix it, although using mixed might work as well.
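
To make the point about return types concrete, here is a small sketch of what json_decode() can hand back and of a declaration that can hold all of it. The function decode_source() is a hypothetical stand-in, not the getSourceData implementation.

```php
<?php

/**
 * Hypothetical stand-in for a parser method that returns decoded JSON.
 *
 * It only shows which return type declaration can hold everything
 * json_decode() may produce when decoding to arrays.
 */
function decode_source(string $json): array|string|int|float|bool|null {
  // With $associative = TRUE, objects become arrays, but a top-level scalar
  // stays a scalar and invalid input yields NULL.
  return json_decode($json, TRUE);
}

var_dump(decode_source('{"rows": [1, 2]}'));
var_dump(decode_source('42'));
var_dump(decode_source('"text"'));
var_dump(decode_source('not json'));
```

Declaring the return type as array would raise a TypeError for the last three calls. The wider built-in type mixed also covers all of these cases, at the cost of being less descriptive.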

Comment #125

jacobbell84 commented 7 November 2023 at 20:27

Status:

Needs work

» Needs review

Status	File	Size
new	migrate_plus-support-paging-2640516-125.patch	17.15 KB

Comment 100 introduced a logic change to the getSourceData function in the Json data parser plugin that I believe wasn't intended.

     // Otherwise, we're using xpath-like selectors.
     $selectors = explode('/', trim($item_selector, '/'));
     $return = $this->sourceData;
     foreach ($selectors as $selector) {
+      if (!isset($return[$selector])) {
+        return NULL;
+      }
       if (!empty($selector) || $selector === '0') {
         $return = $return[$selector];
       }

With that change, it's no longer possible to use the Json data parser without setting an item selector. When an empty item selector is passed in, explode() returns an array containing a single empty string, so the new isset() check fails every time and the method returns NULL. I've changed the logic to what's below, which I believe solves that problem and keeps the originally intended functionality.

    // Otherwise, we're using xpath-like selectors.
    $return = $this->sourceData;
    if (!empty($item_selector)) {
      $selectors = explode('/', trim($item_selector, '/'));
      foreach ($selectors as $selector) {
        if (!isset($return[$selector])) {
          return NULL;
        }
        $return = $return[$selector];
      }
    } elseif (sizeOf($return) === 0) {
      return NULL;
    }
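
The explode() behaviour behind this bug is easy to reproduce in isolation. The sketch below is a standalone approximation of the selector walk, under the assumption that the source data is a plain nested array. The function name select_by_path() is hypothetical and this is not the code from the patch.

```php
<?php

/**
 * Hypothetical standalone version of the selector walk described above.
 *
 * Returns the selected value, the whole data set when the selector is
 * empty, or NULL when a path segment is missing.
 */
function select_by_path(array $data, string $item_selector): mixed {
  $item_selector = trim($item_selector, '/');
  if ($item_selector === '') {
    // No selector: hand back everything, or NULL when there is nothing.
    return count($data) === 0 ? NULL : $data;
  }
  $return = $data;
  foreach (explode('/', $item_selector) as $selector) {
    if (!is_array($return) || !isset($return[$selector])) {
      return NULL;
    }
    $return = $return[$selector];
  }
  return $return;
}

// explode() on an empty string yields one empty-string element, not an
// empty array, which is why an unguarded isset() check always fails.
var_dump(explode('/', trim('', '/')));

$data = ['response' => ['docs' => ['one', 'two', 'three']]];
var_dump(select_by_path($data, 'response/docs'));
var_dump(select_by_path($data, ''));
var_dump(select_by_path($data, 'response/missing'));
```

One deliberate difference from the snippet above: the sketch compares the trimmed selector against '' instead of using empty(), because empty('0') is TRUE in PHP and a selector of '0' would otherwise be treated as no selector.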

Comment #126

7 November 2023 at 20:58

Status:

Needs review

» Needs work

The last submitted patch, 125: migrate_plus-support-paging-2640516-125.patch, failed testing. View results
- codesniffer_fixes.patch Interdiff of automated coding standards fixes only.

Comment #127

nadavoid commented 8 November 2023 at 23:04

Is there a way to use this patch for the paging scenario I have? The things in particular that I'm not sure how to solve:

The token for the next page is unpredictable; it needs to be read from the current page
The identifier for the next page is only a token, not a full URL

My JSON source has this structure:

{
  "response": {
    "docs": ["one", "two", "three"],
    "count": "329",
    "nextPageToken": "qwk9"
  }
}

The URL of the next page is the same as the current page, with the change of a URL param: &pageToken=qwk9

I'm thinking that I'll need to update the patch or extend it using custom code, but wanted to check here first. If updating the patch, is there an existing pager type that would be best to model this on? These are the pager types that I see recognized in the patch so far:

urls
cursor
page
paginator

UPDATE
The solution is cursor with this configuration:

  pager:
    type: cursor
    key: pageToken
    selector: response/nextPageToken
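
For readers wondering what this configuration amounts to, the sketch below shows the general idea of a cursor pager: read the token from the current response and set it as a query parameter on the same URL. It is an illustration only, not the migrate_plus implementation. The helper with_page_token() and the example.com endpoint are assumptions, and the helper expects an absolute URL.

```php
<?php

/**
 * Hypothetical helper: set one query parameter on a URL to a cursor token.
 */
function with_page_token(string $url, string $key, string $token): string {
  $parts = parse_url($url);
  $query = [];
  if (isset($parts['query'])) {
    parse_str($parts['query'], $query);
  }
  // Overwrite any token left over from the previous page.
  $query[$key] = $token;

  $next = $parts['scheme'] . '://' . $parts['host'];
  if (isset($parts['port'])) {
    $next .= ':' . $parts['port'];
  }
  $next .= ($parts['path'] ?? '') . '?' . http_build_query($query, '', '&');
  return $next;
}

// Assumed response body and endpoint, for illustration only.
$body = json_decode('{"response": {"docs": ["one"], "count": "329", "nextPageToken": "qwk9"}}', TRUE);
$token = $body['response']['nextPageToken'] ?? NULL;

// Paging ends when the response no longer carries a token.
if ($token !== NULL) {
  echo with_page_token('https://example.com/api/docs?rows=3&pageToken=old', 'pageToken', $token), PHP_EOL;
}
```

In this sketch, `key: pageToken` corresponds to the $key argument and `selector: response/nextPageToken` to the lookup in the decoded body.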

Comment #128

nadavoid commented 9 November 2023 at 21:51

Issue summary:

View changes

Updating issue summary.

Comment #129

nadavoid commented 9 November 2023 at 22:21

Commenting with what I understand to be the purpose of some of the pager options for the JSON plugin.

selector: The path to the JSON item identifying the next page.

type

urls: selector points to one or more URLs.
cursor: selector is just a portion of the next page URL. Set `key` to the URL parameter key that should be passed.
page: selector is a current page number, and expected to increment: 1, 2, 3, etc.
paginator: not sure

key: Parameter key in the URL, will be given a value that identifies the page to request. If key is set to page, then the resulting generated URL will contain ?page=[value-from-current-json]

Comment #130

jacobbell84 commented 29 November 2023 at 19:22

Status	File	Size
new	migrate_plus-support-paging-2640516-130.patch	17.2 KB

Fixing some incorrect variable names (a couple of spots were still using $this->itemSelector instead of $item_selector) and fixing another edge case where the item_selector is blank.

Comment #131

20 December 2023 at 12:17

guiu.rocafort.ferrer made their first commit to this issue’s fork.

Comment #132

guiu.rocafort.ferrer commented 21 December 2023 at 08:08

I updated the issue fork with the latest 6.0.x branch, so it is mergeable now. I also made a few changes so that GitLab CI is available and runs the tests, and fixed some of the tests.

Some of the changes I made, which might be debatable, are:

When the item_selector does not exist, getSourceData now returns an empty array instead of NULL. This way, the plugin acts as if there are no rows to import and does nothing.
When the item_selector is NULL or not defined, it is set to '' by default. In that case I believe getSourceData should return the whole sourceData contents.
One of the tests failed because the $item_selector parameter of getSourceData was typed as string, but the test was passing an integer value. Inside the method, is_numeric is used to check the parameter and apply the backwards-compatible depth selection, so I changed the method definition to accept integer values as well.

My plan is to next write some tests for the pager functionality, and go from there to fix potential issues and make sure it works ok, before merging into the main branch.

Comment #133

guiu.rocafort.ferrer commented 27 December 2023 at 12:42

The last commit adds test cases for the pager types urls, cursor, and page. The tests for paginator are still missing, though.

I also expanded the documentation on the pager types a little here: https://www.drupal.org/docs/8/api/migrate-api/migrate-source-plugins/mig...

Comment #134

guiu.rocafort.ferrer commented 28 December 2023 at 10:21

Status:	Needs work	» Needs review
Issue tags:	-Needs tests

I am changing the status to "Needs review", since I wrote tests for all the pager types and also updated the documentation to reflect the new functionality.

Documentation: https://www.drupal.org/docs/8/api/migrate-api/migrate-source-plugins/mig...

Comment #135

nadavoid commented 18 January 2024 at 17:59

The issue fork has merge conflicts against the main branch, with the recent update of migrate_plus to 6.0.2. The latest patches also no longer apply.

@guiu.rocafort.ferrer Could you merge the latest into your issue fork and resolve conflicts? I'll be happy to test after that, since I'm actively using this, and will continue to for quite a while.

It looks like the conflict is only in .gitlab-ci.yml

<<<<<<< HEAD
# variables:
#   SKIP_ESLINT: '1'
=======
variables:
  _TARGET_CORE: "$CORE_STABLE"
  _SHOW_ENVIRONMENT_VARIABLES: '1'
  OPT_IN_TEST_MAX_PHP: '1'
>>>>>>> 6.0.x

Comment #136

guiu.rocafort.ferrer commented 19 January 2024 at 13:56

Hi @nadavoid, the merge request should now be good to go.

Comment #137

nadavoid commented 19 January 2024 at 16:02

Status:

Needs review

» Reviewed & tested by the community

Status	File	Size
new	migrate_plus-support_paging-2640516-137.diff	33.48 KB

Thanks for the quick update @guiu.rocafort.ferrer! I really appreciate it. I've successfully applied the patch from the MR on 6.0.2 and confirmed that the cursor pagination still works in my installation.

The code looks good to me, and the tests seem comprehensive, so I'm marking this RTBC. Others who have been deeper in this issue are of course also welcome to review. I'm strongly in favor of merging soon, and if adjustments are needed or bugs are found, they can be handled in smaller followup issues.

I'm also uploading the current version of the patch so that people have something stable to use in their builds today.

Comment #138

guiu.rocafort.ferrer commented 27 January 2024 at 14:01

Issue summary:

View changes

Comment #139

longwave commented 29 February 2024 at 21:02

I'm using this with the paginator pager type and while it works well, it is repeating HTTP requests - and my source is relatively slow to respond, so this is slowing down my migrations.

If I add logging to Http::getResponse() I see that requests are made up to three times for the same URL:

get https://[redacted]?size=500... done
get https://[redacted]?size=500&offset=500... done
get https://[redacted]?size=500... done
get https://[redacted]?size=500&offset=500... done
get https://[redacted]?size=500&offset=1000... done
get https://[redacted]?size=500&offset=500... done
get https://[redacted]?size=500&offset=1000... done
get https://[redacted]?size=500&offset=1500... done
get https://[redacted]?size=500&offset=1000... done

The extra calls come from two places. Firstly:

    return array_merge(parent::getNextUrls($url), $next_urls);

where parent::getNextUrls($url) calls the Http fetcher to look for Link headers; this probably isn't necessary if a pager is configured?

Secondly:

          // Service may return 404 for last page, ensure next_urls are valid.
          foreach ($next_urls as $key => $next_url) {
            try {
              $response = $this->getDataFetcherPlugin()->getResponse($next_url);

This isn't necessary for my source, so it would be good to make this configurable as well.

I hate to say this at this stage but I wonder if the different pagers need to be their own type of plugin? That would mean they could be more easily extended for different cases. As an example my source also returns a count of total rows in each response which should mean I can automatically calculate the final page number without having to look ahead.
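
As a sketch of the last point: if each response reports the total row count, every page URL can be computed up front instead of probing for the next page. This is not part of the patch. The helper name offset_page_urls(), the example.com base URL, and the total of 1750 are assumptions, while the `size` and `offset` parameter names mirror the request log above.

```php
<?php

/**
 * Hypothetical helper: list every page URL up front from a total row count.
 */
function offset_page_urls(string $base_url, int $total, int $size): array {
  if ($size < 1) {
    throw new InvalidArgumentException('Page size must be at least 1.');
  }
  $urls = [];
  for ($offset = 0; $offset < $total; $offset += $size) {
    $query = ['size' => $size];
    // The first page carries no offset, as in the log above.
    if ($offset > 0) {
      $query['offset'] = $offset;
    }
    $urls[] = $base_url . '?' . http_build_query($query, '', '&');
  }
  return $urls;
}

// 1750 rows at 500 per page need four requests, with no look-ahead.
print_r(offset_page_urls('https://example.com/api/items', 1750, 500));
```

With the URLs known in advance, neither the Link header lookup nor the validity check on the next URL would be needed for a source like this one.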

Comment #140

guiu.rocafort.ferrer commented 4 March 2024 at 10:08

Hi @longwave, I understand your concerns about the performance issues and the redundant HTTP requests. I agree that this is not optimal and that there is room for improvement.

This issue has been open since 2015, and it has been difficult to push it forward and get it merged into the development branch, so I am worried that addressing those issues would delay it even more. I have a few sites that have been using the patch for a while now, and I believe other people might be in the same situation.

So I believe it would be better to merge this into development and then create a follow-up issue to improve performance.
What do the others think about this?

Comment #141

longwave commented 4 March 2024 at 10:46

That decision is up to the Migrate Plus maintainers. I think it would be fine to defer to followups, given that this fixes some immediate issues and that both removing the additional requests and refactoring to use plugins could be done separately.

Comment #142

googletorp commented 18 April 2024 at 17:30

I've used this in the past, and compared to the alternative (which is to fetch everything in one go) this is a major improvement. This is also an opt-in feature, so while making a lot of requests can cause problems, the developers using it can determine whether that is a problem for them. From what I understand, if you're not using the feature there won't be any performance change.

Maybe it's better to get it in, solve a lot of people's problems and get real life feedback on which performance issues would make sense to address.

I think this is RTBC.

Comment #143

heddn commented 18 July 2024 at 19:34

This is likely one of the longest running requests in the migrate plus queue. Let's land it and make incremental improvements on what we have here.

Comment #144

18 July 2024 at 19:44

heddn committed b79a5735 on 6.0.x authored by jcandan

Issue #2640516 by jcandan, m.lebedev, berenddeboer, scott_euser, segi,...

Comment #145

heddn commented 18 July 2024 at 19:44

Status:

Reviewed & tested by the community

» Fixed

Thank you everyone for sticking with this issue.

Comment #146

1 August 2024 at 19:44

Status:

Fixed

» Closed (fixed)

Automatically closed - issue fixed for 2 weeks with no activity.

Comment #147

heddn commented 5 August 2024 at 14:13

Comment #148

ressa commented 5 August 2024 at 18:48

Yes, thanks everyone for landing this feature with great tenacity. This is one of the things that makes Drupal so great: while an issue may take some time, many will eventually be resolved and make Drupal even greater.

Comment #149

init90 commented 7 August 2024 at 07:30

I've created an issue to address one of the points from #139 - #3466499: Omit an extra request when the pager is used
