"Since if a speaker can produce a step response correctly, therefore it is time-phase coherent, and therefore it must be "good" - is not something I hear him saying. He does say that "all else being equal, phase coherent speakers tend to produce exceptional imaging". (My semi-quote)
Yes, I think that's what he said. I may have mistaken "exceptional imaging" for what he felt as "good". I think I've read the same article that you read and he did say "exceptional imaging". It's been awhile since I've read the article (and I can't seem to find it anymore), but I also believed he said something to the extent "if everything else has been taken care off", that is if a speaker was well designed in other regards, then "time-coherence" may have an advantage of providing "exceptional image".
Anyway, what I will write probably won't be based on my personal opinions but more based what I've seen from simulations with actual real world data. Others will have to draw their own conclusions. I only used John Atkinson's statement as an "introduction", but I will try to stay clear of using my own subjectivity.
I think that phase coherence (as an objective or success) increases the difficulty of making a good speaker by a large multiple, and that many attempts fail in many ways, including Thiel's attempts.
I agree that time-coherence is not easy and that's why besides Vandersteens, I don't know of anyone else. I have seen a few measurements from Stereophile done by John Atkinson and he did question the validity of some of the claims of being time-coherent (not Thiel though).
My document is not meant to be a vehicle for judging anyone's work, but it is meant to discuss "time coherence" in a objective way with actual simulation using real measured data (when possible), plots and graphs, and as I said, I will not include any of my subjective opinions.
And for whatever reasons, this topic is weirdly fascinating to me, and I probably won't sleep well until I put the document together :-)