Every time a test case creates a flag, it defines a huge array of data. It's hard to see which parts of that array are just defaults that Flag needs, and which settings actually matter to the test.

The tests might be easier to read if createFlag() had an array of defaults and merged the settings passed in over them. The tests could then be trimmed down to specify only the flag settings that are relevant to the test (whether they differ from the defaults or not).
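As a rough illustration of the idea, here is a minimal sketch in Python rather than in the test suite's own code. The `create_flag` function stands in for createFlag(), and every key and value in `DEFAULT_FLAG_SETTINGS` is a made-up placeholder, not the actual settings Flag needs.

```python
import copy

# Hypothetical defaults: the real keys and values depend on what Flag requires.
DEFAULT_FLAG_SETTINGS = {
    "entity_type": "node",
    "bundles": ["article"],
    "global": False,
    "flag_short": "Flag this item",
    "unflag_short": "Unflag this item",
}


def create_flag(overrides=None):
    """Return flag settings: the defaults overlaid with the test's overrides."""
    # Deep-copy so one test cannot mutate the shared defaults for another.
    settings = copy.deepcopy(DEFAULT_FLAG_SETTINGS)
    # Shallow merge: a key given by the test replaces the default wholesale.
    settings.update(overrides or {})
    return settings


# A test that only cares about global flags states just that one setting.
flag = create_flag({"global": True})
```

With this shape, a test can still pass a setting that happens to equal the default, which keeps the relevant setting visible in the test itself. The merge here is shallow, so a nested value such as `bundles` is replaced as a whole rather than combined with the default.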