This is totally not true. Timing is true in the live music world, but for playback, most of our imaging with the exception of specific dual microphone setups rarely used, imaging is primarily volume, and phase does not play into it, not even one little bit as long as the phase response is consistent on each channel.
TIME is arrival time. Ignoring Xover phase, a flat baffle box with a 8" woofer and dome tweeter has a driver arrival delta of about 500µS or about 2kHz. In a 2 way system, the kick beater will arrive ahead of the fundamental. In a multi woofer system, the direct arrival is at multiple times, PLUS first reflections varying in both time and intensity. Imaging suffers.
PHASE is the synchronicity between fundamental and harmonics. If harmonics arrive asynchronously to fundamental, imaging suffers.
A system with TIME wrong cannot get PHASE coherent.
Most systems make no attempt to get TIME or PHASE coherent.
Imaging is NOT level (volume). Imaging is when the speakers disappear and one can
walk into the stage! Most systems fail miserably. Ditto rooms.