Set a known timestamp if not set in the input in aggregator_parse_feed() [#1304758]

In function aggregator_parse_feed() the publication date / timestamp is derived out one of several fields. Sometimes a feed does contain none of these fields. The consequence is the item is created several times, each with the current date. The real problem is, these items are always shown on top.
This can be easily solved by replacing the timestamp with the timestamp from an earlier record of the item. I don't know how to create a diff, so I paste my current code:

    // Save this item. Try to avoid duplicate entries as much as possible. If
    // we find a duplicate entry, we resolve it and pass along its ID is such
    // that we can update it if needed.
    if (!empty($guid)) {
      $entry = db_fetch_object(db_query("SELECT iid, timestamp FROM {aggregator_item} WHERE fid = %d AND guid = '%s'", $feed['fid'], $guid));
    }
    else if ($link && $link != $feed['link'] && $link != $feed['url']) {
      $entry = db_fetch_object(db_query("SELECT iid, timestamp FROM {aggregator_item} WHERE fid = %d AND link = '%s'", $feed['fid'], $link));
    }
    else {
      $entry = db_fetch_object(db_query("SELECT iid, timestamp FROM {aggregator_item} WHERE fid = %d AND link = '%s'", $feed['fid'], $link));
    }

    // if timestamp not known from input, but from a known record, use the records-timestamp
    if ($date == 'now' && $entry->timestamp) {
      $timestamp = $entry->timestamp;
    }

My changes:
1. select from the aggregator_item record also the timestamp.
2. if the timestamp still is the default 'now' and retrieved from a record from the database, use the latter as the timestamp.

Comment	File	Size	Author
#15	aggregator-timestamp-fix-1304758-4.patch	1.99 KB	PROMES
#9	aggregator-timestamp-fix-1304758-3.patch	2.01 KB	PROMES
#4	aggregator-timestamp-fix-1304758-2.patch	2.01 KB	PROMES
#1	aggregator-timestamp-fix-1304758-1.patch	1.81 KB	NROTC_Webmaster

Support from Acquia helps fund testing for Drupal Acquia logo

Comments

Comment #1

NROTC_Webmaster CreditAttribution: NROTC_Webmaster commented 10 March 2012 at 00:52

Version:	6.22	» 6.x-dev
Status:	Active	» Needs review

File	Size
aggregator-timestamp-fix-1304758-1.patch	1.81 KB

While this seems fairly straightforward to me I'm not sure that this will be implemented.

This is also very similar to the way they implemented it in D8 although it was moved to aggregator.processor.inc

        if (!empty($item['guid'])) {
          $entry = db_query("SELECT iid, timestamp FROM {aggregator_item} WHERE fid = :fid AND guid = :guid", array(':fid' => $feed->fid, ':guid' => $item['guid']))->fetchObject();
        }
        elseif ($item['link'] && $item['link'] != $feed->link && $item['link'] != $feed->url) {
          $entry = db_query("SELECT iid, timestamp FROM {aggregator_item} WHERE fid = :fid AND link = :link", array(':fid' => $feed->fid, ':link' => $item['link']))->fetchObject();
        }
        else {
          $entry = db_query("SELECT iid, timestamp FROM {aggregator_item} WHERE fid = :fid AND title = :title", array(':fid' => $feed->fid, ':title' => $item['title']))->fetchObject();
        }
        if (!$item['timestamp']) {
          $item['timestamp'] = isset($entry->timestamp) ? $entry->timestamp : REQUEST_TIME;
        }

Comment #2

PROMES

Dutch

CreditAttribution: PROMES commented 15 March 2012 at 15:11

Thanks for your patch.
I presume that either D8 or wrong or your patch. See the equal where clauses in the second and thir query, both: where ... link = "%s" ... . In the original lines it states once link = "%s" and once title = "%s".
But the title and link in the D8 implementation are reversed between the second and third query. So I don't know when to use title or link.

     else if ($link && $link != $feed['link'] && $link != $feed['url']) {
-      $entry = db_fetch_object(db_query("SELECT iid FROM {aggregator_item} WHERE fid = %d AND link = '%s'", $feed['fid'], $link));
+      $entry = db_fetch_object(db_query("SELECT iid, timestamp FROM {aggregator_item} WHERE fid = %d AND link = '%s'", $feed['fid'], $link));
     }
     else {
-      $entry = db_fetch_object(db_query("SELECT iid FROM {aggregator_item} WHERE fid = %d AND title = '%s'", $feed['fid'], $title));
+      $entry = db_fetch_object(db_query("SELECT iid, timestamp FROM {aggregator_item} WHERE fid = %d AND link = '%s'", $feed['fid'], $link));

A further extension in my code is after the first query a test whether the entry allready exists (coming from real life data):

    if (!empty($guid)) {
      $entry = db_fetch_object(db_query("SELECT iid, timestamp FROM {aggregator_item} WHERE fid = %d AND guid = '%s'", $feed['fid'], $guid));
      // test whether this entry allready exists with anothere guid; is it an updated item?
      if (!isset($entry->iid)) {
        $entry = db_fetch_object(db_query("SELECT iid, timestamp FROM {aggregator_item} WHERE fid = %d AND link = '%s' AND title = '%s'", $feed['fid'], $link, $title));
      }
    }

I hope to improve this fine module.

Comment #3

NROTC_Webmaster CreditAttribution: NROTC_Webmaster commented 15 March 2012 at 15:24

Status:

Needs review

» Needs work

That was in your original code that you posted. Yes, it should be title and I didn't catch that when creating the patch. If you could copy your entire section of code from line 799 to the end of your changes I can create a patch for this.

You will still need to get some other people to review this and then approval from one of the core maintainers to get this in.

Comment #4

PROMES

Dutch

CreditAttribution: PROMES commented 3 May 2012 at 13:56

File	Size
aggregator-timestamp-fix-1304758-2.patch	2.01 KB

You are right. The difference is in my original code.
I now can create diffs. Attached is my current diff-file, based on version 6.26.
I don't know any other user of this patch. If someone likes this, please drop a line.

Comment #5

NROTC_Webmaster CreditAttribution: NROTC_Webmaster commented 3 May 2012 at 15:20

Status:

Needs work

» Needs review

Comment #6

3 May 2012 at 15:21

Status:

Needs review

» Needs work

The last submitted patch, aggregator-timestamp-fix-1304758-2.patch, failed testing.

Comment #7

3 May 2012 at 15:21

The last submitted patch, aggregator-timestamp-fix-1304758-2.patch, failed testing.

Comment #8

NROTC_Webmaster CreditAttribution: NROTC_Webmaster commented 3 May 2012 at 15:25

I'm not sure why this failed testing but you should make patches against the dev branch.

Comment #9

PROMES

Dutch

CreditAttribution: PROMES commented 10 May 2012 at 15:14

File	Size
aggregator-timestamp-fix-1304758-3.patch	2.01 KB

New patch against dev branch.
I presume the test fails because I used as module name: aggregator-new.module.

Comment #10

NROTC_Webmaster CreditAttribution: NROTC_Webmaster commented 10 May 2012 at 22:52

Status:

Needs work

» Needs review

Comment #11

10 May 2012 at 22:53

Status:

Needs review

» Needs work

The last submitted patch, aggregator-timestamp-fix-1304758-3.patch, failed testing.

Comment #12

PROMES

Dutch

CreditAttribution: PROMES commented 11 May 2012 at 12:56

The problems are:
1. failed testing: Detect invalid patch format. Ensure the patch only contains unix-style line endings.
2. I am working on a Windows pc, so the requirement seems not so easy to meet.
Do you know a Windows program to change the line endings into Unix style? I found a lot of same as #1 warnings on this site, but no solution.

Comment #13

NROTC_Webmaster CreditAttribution: NROTC_Webmaster commented 13 May 2012 at 20:58

I like notepad++ as it has lots of options to set it up based on the coding standards. Eclipse can integrate the drupal coding standard with DrupalCS so that may be useful but in my opinion it is rather difficult to get started with.

Comment #14

NROTC_Webmaster CreditAttribution: NROTC_Webmaster commented 13 May 2012 at 21:23

Additionally, you can go to Development toolsand pick from the list on the page.

Comment #15

PROMES

Dutch

CreditAttribution: PROMES commented 14 May 2012 at 09:48

File	Size
aggregator-timestamp-fix-1304758-4.patch	1.99 KB

Thanks NROTC_Webmaster. After some testing with the settings of Notepad++, which already was one of my editors, I presume the file is now in Unix line endings.

8 March 2013 at 22:49

Version:

8.6.x-dev

» 8.8.x-dev

Drupal 8.6.x will not receive any further development aside from security fixes. Bug reports should be targeted against the 8.8.x-dev branch from now on, and new development or disruptive changes should be targeted against the 8.9.x-dev branch. For more information see the Drupal 8 and 9 minor version schedule and the Allowed changes during the Drupal 8 and 9 release cycles.

Comment #24

8 March 2013 at 22:49

Version:

8.8.x-dev

» 8.9.x-dev

Drupal 8.8.7 was released on June 3, 2020 and is the final full bugfix release for the Drupal 8.8.x series. Drupal 8.8.x will not receive any further development aside from security fixes. Sites should prepare to update to Drupal 8.9.0 or Drupal 9.0.0 for ongoing support.

Bug reports should be targeted against the 8.9.x-dev branch from now on, and new development or disruptive changes should be targeted against the 9.1.x-dev branch. For more information see the Drupal 8 and 9 minor version schedule and the Allowed changes during the Drupal 8 and 9 release cycles.

Comment #25

8 March 2013 at 22:49

Version:

8.9.x-dev

» 9.2.x-dev

Drupal 8 is end-of-life as of November 17, 2021. There will not be further changes made to Drupal 8. Bugfixes are now made to the 9.3.x and higher branches only. For more information see the Drupal core minor version schedule and the Allowed changes during the Drupal core release cycle.

Comment #26

8 March 2013 at 22:49

Version:

9.2.x-dev

» 9.3.x-dev

Comment #27

Spokje

Dutch

CreditAttribution: Spokje at Chromatic commented 31 March 2022 at 09:20

Project:	Drupal core	» Aggregator
Version:	9.3.x-dev	» 1.x-dev
Component:	aggregator.module	» Code

The aggregator module has been removed from Core in 10.0.x-dev and now lives on as a contrib module.
Issues in the Core queue about the aggregator module, like this one, have been moved to the contrib module queue.