Problem
- If a fatal error happens in many test cases after the test case was successfully set up, then the test(bot) environment runs out of disk space.
Solution
- Clear out any created db/file resources when a test runner child process exits with a fatal error.
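The intended behavior can be sketched as follows. This is a minimal Python model with invented names (`reap_child`, the table list); the real cleanup lives in run-tests.sh, which is PHP and talks to an actual database:

```python
def reap_child(exit_status, db_prefix, tables):
    """Model of the added cleanup step: if the child test runner
    exited with a non-zero status (fatal error), remove every table
    carrying the test's database prefix; otherwise leave teardown to
    the normal Simpletest code paths."""
    if exit_status != 0:
        dropped = [t for t in tables if t.startswith(db_prefix)]
        remaining = [t for t in tables if not t.startswith(db_prefix)]
        return dropped, remaining
    return [], list(tables)

# A fatally-exited child left prefixed tables behind; reap them:
dropped, remaining = reap_child(255, "simpletest42", ["simpletest42node", "node"])
```

On a clean exit (`exit_status == 0`) nothing is dropped, mirroring the fact that Simpletest's own teardown already ran.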
Comments
Comment #2
sun: Added the test-specific error.log to the review log output.
I do not see any output from the new cleanup function in the testbot review log for patch #0, even though this works correctly on my local machine. That possibly means that the simpletest_test_id table does not contain a db prefix on the testbot. (?)
Comment #4
sun: Review log for #2 contains the expected cleanup operations after the fatal error.
However, there's a second fatal error in another test case now.
The $test_id appears to be the same for all test runners? How is that possible? For the Simpletest module, it changes for every test case being executed in the batch.
Comment #5
sun: I fail to see how {simpletest_test_id}.last_prefix can work when tests are executed concurrently.
run-tests.sh calls into simpletest_last_test_get() (like the Simpletest module itself), but that function is not designed for concurrent execution in parallel threads.
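The race being described can be modeled like this. A Python sketch with invented names; the real data lives in the {simpletest_test_id} table and is written by Simpletest during test-site setup:

```python
# Simulated {simpletest_test_id} rows: one last_prefix per test_id,
# overwritten every time a test site is installed under that ID.
last_prefix = {}

def record_prefix(test_id, prefix):
    # Roughly what happens when a test site is set up.
    last_prefix[test_id] = prefix

# Two concurrent runners share the SAME test_id:
record_prefix(42, "simpletestA")  # runner 1 installs its test site
record_prefix(42, "simpletestB")  # runner 2 clobbers runner 1's prefix

# Runner 1 now reads runner 2's prefix and can no longer find
# (or clean up) its own tables.
```

With a single shared `test_id`, the last writer wins, which is exactly why cleanup after a fatal error cannot locate the right prefix under concurrency.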
Comment #7
sun: The concurrency problem with last_prefix cannot be fixed without a database schema change, which inherently means it cannot be fixed without adjusting the entire PIFR stack.
The PIFR stack must work across major versions of Drupal, so it would need a conditional, version-agnostic adaptation all over the place.
I've no interest in pursuing that.
So while this patch fixes a tough problem, one that forces @jthorson to manually reset and clean individual testbots every second day, it's not possible to commit the fix.
Comment #8
sun: Alternatively, let's see what happens if we assign a new test_id for every class. This might blow up though.
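The stop-gap idea amounts to giving every test class its own ID so prefix rows no longer collide. A hedged Python sketch (the counter stands in for what would be an auto-increment key on an INSERT into {simpletest_test_id}; class names are made up):

```python
import itertools

_auto_increment = itertools.count(100)  # stand-in for the serial key
last_prefix = {}                        # test_id -> db prefix

def new_test_id():
    """Simulate inserting a fresh row; the auto-increment key
    becomes the unique test ID."""
    return next(_auto_increment)

for cls in ["NodeTest", "UserTest", "CommentTest"]:
    tid = new_test_id()
    last_prefix[tid] = "simpletest%d" % tid

# Every class got a distinct ID and therefore its own prefix row,
# so concurrent runners no longer overwrite each other.
```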
Comment #10
sun: Looks like that stop-gap fix worked. Not sure about the PIFR implications, but the attached patch completes the fix and properly removes the bogus assumption that there's only one $test_id.
Comment #11
sun
Comment #12
sun: oink. Now the review log contains the additional cleanup debug messages for each test case:
That should only be output in the case of a fatal error. Need to think about how to fix this.
Comment #13
sun: The attached patch simplifies the changes and should resolve the verbose debug output issue for passing tests.
Comment #15
sun: So #13 actually works as intended.
However, as of now, DrupalUnitTestCase only sets and changes the database prefix, but does not update it correctly in Simpletest's test_id table. The patch in #1563620: All unit tests blow up with a fatal error fixes that.
Comment #16
catch: So #13 exposes the unit tests bug that's been hidden for months?
Comment #17
Berdir: Separate issue, but is there a way to extract the actual error and display it somehow? It can currently be very hard to see what the actual error is.
I'm not sure what exactly the state is here: does this need to go in first, or the linked issue? Or doesn't it matter? In any case, this certainly needs a version of the patch without the forced fatal error in node.test.
Comment #18
sun: The actual error is supposed to be captured and inserted into SimpleTest's assertion table. In the case of fatal errors, the actual error is read from the PHP error.log, and this happens after the fact (and is only possible in run-tests.sh, since simpletest.module runs in the same process as the test).
Note that this issue/change is about web/integration tests, not unit tests.
In order to be able to clean up the test environment after a test run with fatal error(s), the test runner needs to know the database prefix that was used to set up the test.
Simpletest normally records the database prefix for each test being run. Simpletest itself attempts to clean up the test environment after each test run. In case of a fatal error, those Simpletest functions are no longer executed.
The run-tests.sh script currently does not contain any cleanup code. This patch adds essentially the same cleanup operations to run-tests.sh, executed whenever a test runner completes.
However, run-tests.sh supports concurrently executing multiple test runner threads in sub-processes, a notion that was only added to run-tests.sh but not to Simpletest. Right now, run-tests.sh uses the same test ID for all tests being executed. This means that, under concurrency, run-tests.sh is not able to retrieve the database prefix that was used to set up the test site, because the concurrently executed test runners overwrite the recorded database prefix. Therefore, this patch changes run-tests.sh to use a separate, unique test ID for each test runner, so each runner can retrieve the database prefix and, in turn, clean up the environment after test execution.
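That mechanism can be modeled end to end. A Python sketch with invented names (`start_runner`, `cleanup_runner`); the actual change is in run-tests.sh and uses real database rows:

```python
import itertools

_ids = itertools.count(1)
test_id_table = {}  # one row per runner: test_id -> db prefix

def start_runner():
    """Each test runner inserts its own row and receives a unique
    test ID, instead of all runners sharing a single ID."""
    tid = next(_ids)
    test_id_table[tid] = "simpletest%d" % tid
    return tid

def cleanup_runner(tid, tables):
    """After the child exits (normally or fatally), look up the
    prefix under the runner's OWN test ID and drop matching tables."""
    prefix = test_id_table[tid]
    return [t for t in tables if not t.startswith(prefix)]

# Two concurrent runners no longer clobber each other's prefix:
a, b = start_runner(), start_runner()
tables = [test_id_table[a] + "node", test_id_table[b] + "users"]
tables = cleanup_runner(a, tables)  # only runner b's table remains
```

Because every runner keys its prefix under a distinct ID, the cleanup step always resolves the correct prefix, regardless of which runner finished last.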
This also implies a functional change -- the configuration option that controls whether the test environment should be cleaned after a test run is ignored.
Unit tests currently do not properly set up a database prefix, even though they are supposed to record errors and fatal errors in the same way as web tests. That's why this patch depends on #1563620: All unit tests blow up with a fatal error.
Comment #19
sun: Note: We're currently testing these patches in #1591812: Sun's Simpletest patches.
Comment #20
sun: Technically, this should work correctly now.
Comment #22
sun: Excellent.
And now, let's try to kill a bot.
Comment #24
sun: No improvement. :( Sorry.
http://qa.drupal.org/pifr/test/135299 is stuck with another, subsequent test now.
The review log still contains MySQL data file I/O errors. I don't know why.
Technically, the concurrency fix for run-tests.sh worked. But at some point, MySQL suddenly starts with I/O errors on data files.
I wonder whether the testbot client itself attempts to perform additional clean-up operations on its own, which might conflict?
EDIT: The only actual improvement is that the patch/test didn't come back as "green" (with 0 passes).
Comment #25
Berdir: That does sound like a full file system (RAM, in our case) issue...
Comment #26
sun: Right. However, that's exactly what this patch tries to prevent.
Comment #27
Berdir: Well, I think it's lying, because it's not actually removing the tables.
I applied the latest patch and started executing all tests with a concurrency of 4, and my table list kept growing and growing despite it telling me that it removed the tables. I ran some SHOW TABLES LIKE 'prefix%'; queries, and the exact number of tables that it claimed to have removed were still there.
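The verification step here amounts to counting leftover prefixed tables after cleanup claimed success. A Python stand-in for `SHOW TABLES LIKE 'prefix%'` (all names and data invented for illustration):

```python
def leftover_tables(all_tables, prefix):
    """Stand-in for `SHOW TABLES LIKE 'prefix%'`: list tables that
    still carry the test prefix after a cleanup pass."""
    return [t for t in all_tables if t.startswith(prefix)]

claimed_removed = ["simpletest99node", "simpletest99users"]
tables_after_run = ["node", "simpletest99node", "simpletest99users"]

# Cleanup claimed to have dropped these, yet they are still present:
stragglers = leftover_tables(tables_after_run, "simpletest99")
```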
Comment #28
Berdir: Ah, I know why.
Guess what's wrong here ;)
Comment #29
jthorson: Nice catch!
Let's try this one. (Same patch, without the 's'.)
Comment #31
jthorson: Nice ... expected failure, many fatals, but no PDO exceptions or "bot-killing" /tmpfs errors this time! That's easily the most exciting 'failed' test I've seen in a LONG time! :)
Comment #32
jthorson: This version strips out the entity_create('foo') and the commented code inside of index.php.
EDIT: Half of a patch which shouldn't have been here in the first place!
I believe that should be enough to run green (and thus represent the patch for actual commit).
Comment #34
jthorson: Doh ... looks like a previous patch snuck into #29.
Here are two patches ... one with the test-bot-killing fatals (#22 with the extra 's' removed), and one without (which should be the 'green' patch for application).
Comment #36
jthorson: Odd ... the without-fatal test failed the first time through, with PDO errors on an image-related test. The re-test run came back clean.
Comment #37
sun: To make sure we're not introducing random test failures, I'm sending this patch for re-testing a few more times... (already done twice)
Comment #38
sun: #34: drupal8.run-tests-fatal-clean.34.without-fatal.patch queued for re-testing.
Comment #39
Dries: I reviewed this patch and it looks good. I'm going to mark it RTBC and give it a bit more time before getting this committed.
Comment #40
Dries: Committed to 8.x. Thanks.
Comment #41
Berdir: Looking at http://qa.drupal.org/pifr/test/279838, which has some fatal errors but also passing tests: the testbot no longer lists which tests failed and which didn't, so you have to go through the detail log to find out what exactly happened. The nice thing is that you actually can do that and see a PHP fatal error there, but there is no nice overview anymore.
Can we do anything anywhere (run-tests.sh, testbot, ...) to improve that?
Comment #42
catch
Comment #43
jthorson: Item in #41 opened as #1615232: Have PIFR provide a results summary even for failed tests.
Comment #44
aspilicious: Patch and a diff to prove the patch is identical to the D8 version.
Comment #45
sun: heh. I just wanted to cry and ask why and how on earth #44 was able to pass, but "fortunately", only the reported test result is misleading: the review log contains fatal errors.
That is because this patch depends on the changes in #1563620: All unit tests blow up with a fatal error, so postponing on that :)
Comment #46
sun: The primary dependency landed: #1541958: Split setUp() into specific sub-methods
The second dependency is: #1563620: All unit tests blow up with a fatal error
After that, we can backport this one (this patch being the most straightforward of the chain).
Comment #47
sun: The primary purpose of backporting these changes was to retain a comparable testing framework between D8 and D7, so as to make future backports easier.
With the D8 feature freeze approaching and the simpletest frameworks of D8 and D7 heavily diverged, it's close to impossible to retain "similarity" between major core versions, and doing so involves a huge risk of breaking things badly in D7.
Therefore, I'm calling out the end to testing framework backports.
For D7, you get to work with the current mess. At the very least, that's "predictable."
Needless to say, actual bug fixes are excluded from that. However, given that there are 30,000+ test assertions that pass against D7, people will have to make pretty good arguments and show solid proof that there is actually a bug somewhere. ;)
Let's focus on the future. (Which isn't Simpletest either way.)
Thanks everyone for working on this and making Drupal awesome! :)