Reviewers and consumers are at a great disadvantage in assessing the sound of a product, because they have never directly auditioned single-parameter changes. For example, a change in operating point (quiescent current) of 20% up or down, or a shift in topology in one part of the circuit. I grade these with a simple metric of:
1) No audible change, or at least nothing at threshold level
2) Different, but neither better or worse, just different
3) Worse (and how quickly is it noticed ... 5 seconds, 5 minutes, or an overall quality of discomfort or dislike)
4) Better (how quickly is this noticed, or is it a change in mood)
There are probably twenty or more points of adjustment in an amplifier or loudspeaker where these changes can be made. Some affect measurements, but most don’t ... they’re purely subjective. Also, conflicts can occur ... a better measurement may result in worse sound. At that point, something has gone wrong, and you need to stop and see where you have gone off-track.
We have to address what can be measured at the current state of the art and what can’t. Here’s just one example: for the the purposes of electronic design, nearly all modern capacitors are perfect. There is nothing to choose between them except voltage capacity and long-term reliability. Distortion is vanishingly small, at or below the threshold of measurement.
But ... in a high-resolution system, they all sound different. They are not neutral sonically. Mylar sounds different from polypropylene which sounds different than Teflon which sounds different from waxed or oiled paper. Metallized film sounds different than solid foil. In a vacuum-tube circuit, there are circuit nodes that actually exaggerate the coloration. Worse, DA, DF, self-inductance, or even price have little or nothing to do with sonics.
Perhaps worst of all, the notorious "burn-in" phenomenon where XYZ parts sounds really bad for the first 5, 10, 20, or 50 hours. With no change in DA, DF, or distortion measurements, and no plausible physical mechanism responsible for this. Anything that slow must be electrochemical, but what is it? Just a lot of hand-waving and supposition from the manufacturer, with no data to back it up. But plainly and clearly audible.
Not only that, some parts have essentially no break-in at all (paper and wax) while others can take 50 hours or more (polypropylene or Teflon). No explanation offered, no measurements, no underlying physical mechanism. Well, it’s not ghosts or psychic energy. It’s physics. But what physics? Nobody’s saying anything.
In loudspeakers and vacuum tubes, break-in is real, measurable, and the reasons are well-documented in papers going back to the Fifties. Caps? Nope. Why does copper wire sound different than silver? Again, no explanation. I accept this, but it is not satisfactory. Something is going on, and it is not self-hypnosis or expectancy effect. Often, the most expensive part sounds the worst, and the cheapo part sounds quite good.