Amir and Blind Testing


Let me start by saying I like watching Amir from ASR, so please let’s not get harsh or the thread will be deleted. Many times, Amir has noted that when we’re inserting a new component in our system, our brains go into (to paraphrase) “analytical mode” and we start hearing imaginary improvements. He has reiterated this many times, saying that when he switched to an expensive cable he heard improvements, but when he switched back to the cheap one, he also heard improvements because the brain switches from “music enjoyment mode” to “analytical mode.” Following this logic, which I agree with, wouldn’t blind testing, or any A/B testing be compromised because our brains are always in analytical mode and therefore feeding us inaccurate data? Seems to me you need to relax for a few hours at least and listen to a variety of music before your brain can accurately assess whether something is an actual improvement.  Perhaps A/B testing is a strawman argument, because the human brain is not a spectrum analyzer.  We are too affected by our biases to come up with any valid data.  Maybe. 

chayro

Great post!

I concur...

But "Objective measuring tool fetichists" , not Amirm perhaps, but his less enlightened disciples, will claim that sound experience, contrary to any psycho-acoustic/ physical acoustic science fields experience, will come directly and is DECIDED by and from the measured gear specs , not Amirm who is intelligent enough to give only his personal measured numbers, and will SUGGEST that his measures numbers had this meaning or this other one ...But for his disciples this suggestion is a defintive dogma... No listening experiments can contradict it with any value of any kind...Only blind test will defeat SUBJECTIVE biases....And they need to defeat it... But we cannot optimally  tune a SMALL room for ourself  WITHOUT our learned subjective  biases  with  only objective physical acoustic principles ... 😁😊

People are gullible, be it  techno fad "alleged" scientist or those other type of " fetichist who taste their brand name gear" in itself for itself without any objective context to put them at test....The more important context is a room acoustic controlled or not ...

Audio for me is investigation by listening experiments of the acoustic/psycho-acoustic dimension... It is not about a McIntosh or Schiit or Mephisto amplifier specs or price... For sure all piece of gear are different by their design but what we can do to put them at an audiophile level of optimal working ? This is the question...

There is a good amplifier at any price tag....New, old or vintage anyway...

Deception for me in audio is the ignorance of acoustic importance...Bad design will exist even after Amirm measured testing tool bench test , and sometimes good design will exist in spite of his critic with his measuring tool... And anyway what is good in some room may be bad in another room SOMETIMES...

Reality dont emerge from a simplistic formula....

There exist truly bad design but no measures specs are necessary most of the times, a little listening will do...

 

The inherent flaw of any sort of blindfolded A/B testing is when there’s unfamiliarity of the test subjects with the items being tested. For example - a blind taste test of Coke vs Pepsi is adversely affected if the test takers aren’t cola drinkers. Without a recognizable frame of reference, "best" or "better" is merely a guessing game.

So, playing a piece of music the subjects don’t know on a system that is not like their own and asking them to compare that sample to a slightly changed subsequent sample is a waste of time, not a universal truth. Most of us have several pieces of music/performances/albums that we know intimately. If the benchmark used is one of those on our systems (or an equivalent one), then comparative testing has validity, but only then.

 

Hello,

not much discussion here about statistics...  well, it's not an exciting subject really.

I thought the point of blind testing was the sample size.  One person is not enough, maybe you need more than 100.  If more than, say, 60% of them opted for A you might be able to say that A is better than B, and if it's 50/50 then that's a pretty good indication that A and B are close to the same thing.  I would venture that when only one person attempts AB you will not get conclusive results unless one of A or B is truly terrible.  

It's hard to get 100 audiophiles into the same room, and you would need good security to prevent fights breaking out.

 

 

You are right for sure...

Blind test is most used in the pharma business among others industry...There it is easy to understand why it is very important tool and testing drugs methodology ... Statistical method and blind testing goes hand in hand here...

But in audio if even singular non statistically significant blind test in some case yes can be useful, CLAIMING it must be used systematically on people to DECONSTRUCT their CONTINUOUS experience in meaningless bits is bordering ridiculous when "measuring tool fetichist" warring against "brand name testing subjective fetichist " called it "science"...

Audio is a science where A CORRELATION between subjective experience being maintained must be accounted for with objective tools and methods yes, but not systematically deconstructed by ideological claims...Audio experience is more encompassing than audio industry itself , it is not primarily about gear and tools it is about acoustic/psycho-acoustic and sound/speech/music scientific relation...

Acoustic/psycho-acoustic first and last are sciences in audio with electronic engineering between these two moments of A/P.A...Blind test is a secondary tool at best here, not a primary concern like some fetichist claim for their weak argument about audio which is : We listen and can really listen ONLY what is measured or explained by some set of measures...The rest is deceptive fraud or illusions...

Subjective Perception of tonal timbre for example may be tested by blind test usefully but not negated in itself to exist OBJECTIVELY with the utmost value even if it is a subjective determination... Some people are better than other at this task of timbre tonal perception for sure....Subjective evaluation controlled and correlated with and by objective means is the heart of music/sound experience and science but the reverse is ALSO true, objective means and tools must be subordinated also to the subjective evaluation itself , it is a true TWO WAY correlation process ...Not an ideology...

Hearing is not even understood completely by far today...

It is very revelatory that the two warring fetichist groups battle each other AROUND the gear, focusing on the gear MARKET, and not around acoustic/psycho-acoustic experiments and experience in audio thread...

And like A.I. promotion in some circle is used AGAINST human real INTELIGENCE , which is conceptual creation, in audio digital   industry  some  want and claim  to redefine human hearing experience WITHOUT the need for a  human subject....

This is not science in the two cases but ideological groups inside true science...

 

Welcome to you here by the way....

 

Hello,

not much discussion here about statistics... well, it’s not an exciting subject really.

I thought the point of blind testing was the sample size. One person is not enough, maybe you need more than 100. If more than, say, 60% of them opted for A you might be able to say that A is better than B, and if it’s 50/50 then that’s a pretty good indication that A and B are close to the same thing. I would venture that when only one person attempts AB you will not get conclusive results unless one of A or B is truly terrible.

It’s hard to get 100 audiophiles into the same room, and you would need good security to prevent fights breaking out.

 

@adambennette not much discussion here about statistics... well, it’s not an exciting subject really.

Until an appreciation of stats demonstrates how often they may be used incorrectly to draw conclusions that have no basis. That’s just a general observation, so moving right along....

If in fact there is a difference and 40% say there is none, this is saying that those 40% of people have less than optimal hearing. Isn’t that conceding that it is a poor test audience, not to be relied upon?

If we are to trust out ears (whatever that means, despite it being some kind of mantra), ought not 100% agree that there is a difference?

And if prior to the test an unknown portion of the audience cannot trust their ears, on what basis can it then be said after the test that A and B are in fact different? These hard of hearing people may be saying there is a difference when none exists.

The good thing about the one person test is that the variability in peoples hearing is removed - if you have "poor" hearing, any AB test will be subject to a query and be discounted.

If "good" hearing, then multiple tests would need to be done in accordance with good practice (not so easy, as it happens) - to find some measure of confidence would involve discussing probability theory but there does exist common sense rules of thumb which I don’t really like much.

Clear as mud?

edit - by the way, there is no requirement that the individual/s tested be into music/audio gear, whatever.  The best test subjects would be teenagers or even slightly younger.  Just sayin'

Post removed