How strange. It's not like there's any perceptual tricks like with the dress where your brain corrects for the lighting - it's just, like, one clip of audio.
EDIT: Okay, I messed with it in Audacity for a bit. If I pitch it down one octave, it doesn't sound like either; too distorted. If I pitch it down two octaves, I can just barely make out a distorted "yanny." But I had to pitch it down TWO OCTAVES to do that, so I don't know how anyone can hear "yanny" normally. I can also hear it if I apply an extremely strong high-pass filter at 2000Hz, but again, this is such a strong effect that I can't imagine anyone hearing that normally.
EDIT2: Try this out:
https://www.nytimes.com/interactive/2018/05/16/upshot/audio-clip-yanny-laurel-debate.htmlThis lets you distort the frequencies to "hear" both. I had to go to the 2nd notch from the right to start hearing both words equally, and 1 notch to hear yanny unequivocally. What's your critical point?
And, for what it's worth, the original clip says "laurel," with some distorted higher frequencies added to trip people up.