Ensure exceptions thrown by event subscribers are logged [#3354701]

Problem/Motivation

This was discovered by Wim in #3338666: Add functional test that proves there is reasonable UX whenever a stage event subscriber has an exception. Copying here for posterity:

th[e] stack trace [of an exception raised during batch processing] should also have been logged, to make diagnosis after the fact possible. We should assert that this indeed happened. Otherwise post-mortems in the real-world (where updates will be installed automatically, and hence no user will be present to witness the error) will become impossible.

This was discovered during form-based updates, but we want exceptions from event subscribers like this to be logged however they occur.

Proposed resolution

1. In \Drupal\package_manager\StageBase::dispatch() we should log any error with the backtrace.
2. Add test coverage in StageBaseTest for the logging.
3. After 1) we should be able to remove the logging that would now duplicate what StageBase logs.

For example:

We currently log at the end of the catch block that follows this code:
```
$stage_id = parent::begin(['drupal' => $target_version], $timeout);
$this->stage();
$this->apply();
```
We would no longer need to log this if the exception was a StageEventException instance because this would already be logged.
```
try {
  $this->postApply();
}
catch (\Throwable $e) {
  $this->logger->error($e->getMessage());
}
```
This would also not need to be logged if the exception is a StageEventException instance.
```
try {
  $this->destroy();
}
catch (StageEventException $e) {
  $this->logger->error($e->getMessage());
}
```
Looking at the comment before this, I think we could remove the try/catch around this altogether.
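
To make the first step of the proposed resolution concrete, here is a minimal, self-contained sketch of the idea: run each subscriber, and on any failure log the message together with the backtrace before rethrowing. It is not the actual StageBase::dispatch() implementation. The names ArrayLogger, SketchStageEventException and dispatch_with_logging() are hypothetical stand-ins so the example runs without Drupal, and putting the backtrace in the log context is an assumption.

```php
<?php

declare(strict_types=1);

// Hypothetical stand-in for a PSR-3 style logger; keeps records in memory.
final class ArrayLogger {

  /** @var array<int, array{message: string, context: array}> */
  public array $records = [];

  public function error(string $message, array $context = []): void {
    $this->records[] = ['message' => $message, 'context' => $context];
  }

}

// Hypothetical stand-in for the exception type that wraps subscriber failures.
final class SketchStageEventException extends \RuntimeException {}

/**
 * Calls each subscriber; logs any failure with its backtrace, then rethrows.
 *
 * @param callable[] $subscribers
 *   Callables that each receive the event object.
 */
function dispatch_with_logging(array $subscribers, object $event, ArrayLogger $logger): void {
  foreach ($subscribers as $subscriber) {
    try {
      $subscriber($event);
    }
    catch (\Throwable $e) {
      // Record the message and the backtrace so the failure can be diagnosed
      // after the fact, even if nobody was present to see it.
      $logger->error($e->getMessage(), ['backtrace' => $e->getTraceAsString()]);
      // Wrap and rethrow: callers that see this type can assume the original
      // exception has already been logged and need not log it again.
      throw new SketchStageEventException($e->getMessage(), 0, $e);
    }
  }
}
```

With logging centralized like this, a caller that catches the wrapping exception type can skip its own logger call, which is the duplication the third step proposes to remove.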

Issue fork automatic_updates-3354701

3354701-ensure-exceptions-thrown changes, plain diff MR !852

Comments

Comment #1

17 April 2023 at 18:51

phenaproxima created an issue. See original summary.

Comment #2

wim leers

Ghent 🇧🇪🇪🇺

commented 18 April 2023 at 11:20

Priority:	Normal	» Major
Issue tags:		+maintainability
Related issues:		+#3338666: Add functional test that proves there is reasonable UX whenever a stage event subscriber has an exception

Without this, figuring out what actually happened on real sites would be impossible. Bumping to Major.

Comment #3

wim leers

Ghent 🇧🇪🇪🇺

commented 19 April 2023 at 07:53

Issue tags:		+sprint
Related issues:		+#3351895: Add Drush command to allow running cron updates via console and by a separate user, for defense-in-depth

This came up again at #3351895-13: Add Drush command to allow running cron updates via console and by a separate user, for defense-in-depth.

Pulling into sprint.

Comment #4

wim leers

Ghent 🇧🇪🇪🇺

commented 19 April 2023 at 07:53

Issue tags:

+beta target, +Usability

Comment #5

yash.rode commented 26 April 2023 at 12:56

Assigned:

Unassigned

» yash.rode

Comment #6

27 April 2023 at 09:36

yash.rode opened merge request !852

Comment #7

wim leers

Ghent 🇧🇪🇪🇺

commented 28 April 2023 at 10:00

No commits yet in >36 hours. Everything going okay here, @yash.rode?

Comment #8

yash.rode commented 28 April 2023 at 10:08

I was wrong yesterday when I said the PreDestroyEvent exception is not logged; the case is that both PreDestroyEvent and PostDestroyEvent exceptions are not getting logged. I tried debugging it, but as we don't know exactly where the error messages are getting logged, I could not find out why it is happening.

Comment #9

tedbow

he/him

English

Ithaca, NY, USA

commented 1 May 2023 at 17:44

Status:	Active	» Needs work
Issue tags:		+Needs issue summary update, +Needs tests

In the quote from @Wim Leers in the summary we have

Otherwise post-mortems in the real-world (where updates will be installed automatically, and hence no user will be present to witness the error)

but in the suggested solution it says we should change UpdateErrorTest::testExceptionFromEventSubscriber(), which doesn't exist anymore since #3354325: Consolidate UpdateErrorTest. It was a functional test of what happens when the user does the update manually, so it won't cover "no user will be present".

Also the current MR changes src/BatchProcessor.php which also won't affect cron updates.

This is another example of why we should not make issues that simply quote another issue as the "Problem/Motivation": unless you really dig into the other issue, it is unclear what the scope of the problem to be addressed is.

Comment #10

phenaproxima

he/him

English

Massachusetts

commented 1 May 2023 at 17:53

@tedbow, to me this was perfectly clear. If an exception occurs from an event subscriber during batch processing, we want to be sure the exception and its backtrace are logged. Although the issue summary refers to a non-existent method, the gist is still correct, and what we're trying to implement. What's the problem here?

Comment #11

tedbow

he/him

English

Ithaca, NY, USA

commented 1 May 2023 at 18:08

re #10

If an exception occurs from an event subscriber during batch processing

but the quote from the issue summary says

Otherwise post-mortems in the real-world (where updates will be installed automatically, and hence no user will be present to witness the error) will become impossible.

(emphasis mine)

If "no user will be present to witness the error" then we are talking about cron updates, and cron updates do not use "batch processing".

Comment #12

wim leers

Ghent 🇧🇪🇪🇺

commented 2 May 2023 at 08:42

Assigned:	yash.rode	» Unassigned
Issue tags:		+Needs followup

@tedbow's point in #11 is fair, since apparently cron vs UI (non-cron) updates have different execution flow.

The real point here though is that exceptions should ALWAYS be logged, regardless of cron vs UI updates! And #11 may be correct in quoting what I said literally but is sort of ignoring the spirit: we cannot expect end users to meticulously take notes of everything they see happen. No matter whether that's during cron updates (where they CANNOT see it) or UI updates (where they COULD see it).

However … it's shocking news to me that cron updates and non-cron updates apparently do not use the exact same logic! 😳 It never even occurred to me that they'd be implemented differently. The stage life cycle is the same, only the way things get executed is different.

We need to document why these are implemented so differently. Why is it for UI-based installing of updates not okay to assume the entire stage life cycle finishes within a single request, but for cron-based installing of updates it is okay? 🤔

Unassigning @yash.rode, because @tedbow and @phenaproxima need to clarify this. We also need a follow-up issue that only either of them can work on to document the architectural differences between cron vs non-cron updates.

Comment #13

tedbow

he/him

English

Ithaca, NY, USA

commented 2 May 2023 at 12:05

The real point here though is that exceptions should ALWAYS be logged, regardless of cron vs UI updates! And #11 may be correct in quoting what I said literally but is sort of ignoring the spirit:

I agree. This should have been stated in the summary more clearly. I don't think we should be making issues and handing them off to others to follow the "spirit" of the issue.

Why is it for UI-based installing of updates not okay to assume the entire stage life cycle finishes within a single request, but for cron-based installing of updates it is okay?

Cron sets a long time limit to be able to do more operations. We could do this in the UI, but then the user has no feedback for the entire cycle.
The batch system gives the user feedback that something is happening so they don't have to wait and see nothing for the whole cycle.
After the update is staged and before it is applied in the UI, we are able to show the user the form with the "Continue" button. Here we can show them more relevant information about what was staged. Currently I think all we are showing them is the message about possible database updates from StagedDBUpdateValidator, but this is very important information and the user may decide not to run an update that has staged database updates (to run it at another time). Other contrib or custom code can also follow this same pattern of listening to StatusCheckEvent and, if there is a staged update, showing a warning if it is not cron and an error if it is cron.
In cron updates we don't allow updates to run if there are staged database updates. There wouldn't be anybody to show this information to anyway.

Comment #14

tedbow

he/him

English

Ithaca, NY, USA

commented 2 May 2023 at 12:09

Issue tags:

-Needs followup

Created a follow-up #3357632: [META] Update doc comment on BatchProcessor to specify why use the batch system

Comment #15

tedbow

he/him

English

Ithaca, NY, USA

commented 3 May 2023 at 12:11

Issue tags:

+core-post-mvp

Comment #16

omkar.podey commented 27 June 2023 at 07:07

Assigned:

Unassigned

» omkar.podey

Comment #17

omkar.podey commented 27 June 2023 at 08:56

Assigned:

omkar.podey

» tedbow

So is the conclusion that cron sets a longer time limit, and hence can be assumed to happen in a single request, so this should only be done for cron and not for UI updates? Can you update the issue summary so it's clearer what the next steps are? And could this issue be affected by #3357969: For web server dependent unattended updates run the entire life cycle in a separate process that will not be affected by hosting time limits?

Comment #18

tedbow

he/him

English

Ithaca, NY, USA

commented 27 June 2023 at 18:15

Assigned:	tedbow	» omkar.podey
Issue summary:	View changes
Issue tags:	-Needs issue summary update

@omkar.podey I have updated the summary. I think this is actionable now.

The issue will not be affected by #3357969: For web server dependent unattended updates run the entire life cycle in a separate process that will not be affected by hosting time limits

Comment #19

omkar.podey commented 28 June 2023 at 13:57

Issue summary:

View changes

Comment #20

omkar.podey commented 29 June 2023 at 07:58

Assigned:	omkar.podey	» Unassigned
Status:	Needs work	» Needs review

Comment #21

omkar.podey commented 3 July 2023 at 09:08

Assigned:	Unassigned	» omkar.podey
Status:	Needs review	» Needs work

Comment #22

omkar.podey commented 4 July 2023 at 07:30

Assigned:	omkar.podey	» Unassigned
Status:	Needs work	» Needs review

Comment #23

wim leers

Ghent 🇧🇪🇪🇺

commented 5 July 2023 at 09:52

Assigned:	Unassigned	» omkar.podey
Status:	Needs review	» Needs work

A new quarter, time to increase the expectations! 🤓 I know you can do it! 😊

#9 added Needs tests. Why is this marked Needs review if the tag is still present? Are the necessary tests present now?
Looking good, just have a few questions!

Comment #24

omkar.podey commented 5 July 2023 at 10:17

Issue tags:

-Needs tests

Comment #25

omkar.podey commented 5 July 2023 at 11:25

Assigned:	omkar.podey	» Unassigned
Status:	Needs work	» Needs review

@Wim.leers I was trying to think of a way to test the backtrace, as I'm currently logging it as context. Do you think that's necessary?

Comment #26

phenaproxima

he/him

English

Massachusetts

commented 5 July 2023 at 13:12

Assigned:

Unassigned

» phenaproxima

Self-assigning for review.

Comment #27

phenaproxima

he/him

English

Massachusetts

commented 5 July 2023 at 13:45

Assigned:	phenaproxima	» Unassigned
Status:	Needs review	» Needs work

This seems like a reasonable start, but there are some aspects that feel...porous to me. Maybe I'm missing something, though. In general I think we need better comments.

Comment #28

omkar.podey commented 6 July 2023 at 07:56

Assigned:

Unassigned

» omkar.podey

Comment #29

omkar.podey commented 6 July 2023 at 12:46

Assigned:	omkar.podey	» Unassigned
Status:	Needs work	» Needs review

Comment #30

tedbow

he/him

English

Ithaca, NY, USA

commented 6 July 2023 at 16:25

Assigned:

Unassigned

» tedbow

reviewing

Comment #31

tedbow

he/him

English

Ithaca, NY, USA

commented 6 July 2023 at 17:08

Assigned:	tedbow	» Unassigned
Status:	Needs review	» Needs work

Looking good. Just a few more things

Comment #32

tedbow

he/him

English

Ithaca, NY, USA

commented 6 July 2023 at 17:13

Assigned:

Unassigned

» omkar.podey

Comment #33

omkar.podey commented 10 July 2023 at 11:19

Assigned:	omkar.podey	» Unassigned
Status:	Needs work	» Needs review

Comment #34

tedbow

he/him

English

Ithaca, NY, USA

commented 26 October 2023 at 18:55

Status:

Needs review

» Needs work

Needs to be merged/rebased with 3.0.x

Comment #35

phenaproxima

he/him

English

Massachusetts

commented 2 November 2023 at 03:13

Assigned:	Unassigned	» tedbow
Status:	Needs work	» Needs review

Comment #36

tedbow

he/him

English

Ithaca, NY, USA

commented 2 November 2023 at 13:40

Assigned:	tedbow	» phenaproxima
Status:	Needs review	» Needs work

Comment #37

phenaproxima

he/him

English

Massachusetts

commented 2 November 2023 at 13:50

Assigned:	phenaproxima	» tedbow
Status:	Needs work	» Needs review

Comment #38

tedbow

he/him

English

Ithaca, NY, USA

commented 2 November 2023 at 14:00

Assigned:	tedbow	» Unassigned
Status:	Needs review	» Reviewed & tested by the community

Looks good!

Comment #39

2 November 2023 at 14:41

phenaproxima committed 4400bebd on 3.0.x authored by yash.rode

Issue #3354701 by omkar.podey, phenaproxima, yash.rode, tedbow, Wim...

Comment #40

phenaproxima

he/him

English

Massachusetts

commented 2 November 2023 at 14:41

Status:

Reviewed & tested by the community

» Fixed

Comment #41

16 November 2023 at 14:44

Status:

Fixed

» Closed (fixed)

Automatically closed - issue fixed for 2 weeks with no activity.
