First, thanks for taking the time and trouble to post this comp!
Second, I have some major caveats to note: I listened on my work computer, with a crappy sound card, using inexpensive in-ear headphones, and I only listened to 2 x 2. That being said, I was 16 for 16 using an ABX comparator. After listening, I looked at the amplitude statistics and noted that sample A was ~1 dB average RMS louder than sample B (for Educated Guess, sample A was ~0.7 dB average RMS quieter). So I preferred A (for 2 x 2), which had better vocal clarity and high-end definition, but that preference may have been affected by the volume difference. Also, my playback is kind of light in the bass range, so I would have had trouble hearing differences in that range.
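For anyone wondering what 16 for 16 means statistically, here is a rough sketch of the usual way to read an ABX score: the one-sided binomial probability of doing at least that well by pure guessing. The function name `abx_p_value` is just something I made up for illustration, and it assumes every trial is an independent 50/50 guess.

```python
from math import comb

def abx_p_value(correct: int, trials: int) -> float:
    """Chance of getting at least `correct` of `trials` right by guessing.

    Assumes each trial is an independent coin flip (p = 0.5).
    """
    favorable = sum(comb(trials, k) for k in range(correct, trials + 1))
    return favorable / 2 ** trials

# 16 of 16 correct: 1 chance in 65536, about 1.5e-05
print(abx_p_value(16, 16))
```

Note that this only says the samples were audibly different to me. It says nothing about why, so a level mismatch alone could produce the same score.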
Long and short: I would like to equalize the volume and listen on my nice cans, DAC and headphone amp and see if that changes my initial impression...
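In case it helps anyone else do the same, this is a minimal sketch of how the level matching could be done, assuming the samples are already decoded to floats between -1.0 and 1.0 and are not pure silence. The helper names `rms_db` and `match_level` are hypothetical, and the sine waves are just stand-ins for the real files.

```python
import math

def rms_db(samples):
    """Average RMS level in dB relative to full scale (1.0)."""
    mean_square = sum(s * s for s in samples) / len(samples)
    return 10 * math.log10(mean_square)

def match_level(samples, reference):
    """Scale `samples` so their average RMS level equals `reference`'s."""
    gain_db = rms_db(reference) - rms_db(samples)
    gain = 10 ** (gain_db / 20)  # convert dB to a linear amplitude factor
    return [s * gain for s in samples]

# Stand-in signals: b is the same tone as a, 1 dB quieter.
a = [0.5 * math.sin(2 * math.pi * 440 * n / 44100) for n in range(44100)]
b = [s * 10 ** (-1 / 20) for s in a]

# Attenuate the louder sample rather than boosting the quieter one,
# so the result cannot clip.
a_matched = match_level(a, b)
print(round(rms_db(a) - rms_db(b), 2), round(rms_db(a_matched) - rms_db(b), 2))
```

Average RMS is only a rough proxy for perceived loudness, but it is the same statistic I quoted above, so it is the natural one to match for a retest.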