No reason a multi-ADC recorder can't be designed to save in whatever format is desired, including 16bit mp3 or whatever. Zoom F8N_Pro can be set to record 24bit fixed, and Zoom has confirmed that the recorder is using the switching dual ADC signal path when doing so. I believe the SD MixPre's can as well.
As for quantization errors, that problem was solved decades ago. Application of dither eliminates quantization errors. Just apply it regardless of bit depth. It's an easy fix. Manufacturers apply dither when saving to 16bit, yet for some reason don't bother when saving to 24bit. I supposed they decided back when that the recorded dynamic range of 24 bit was large enough that any quantization noise at such low levels didn't matter when folks were setting gain reasonably. Or maybe they determined that there was already enough random noise in the signal noise at those levels to effectively self-dither. If manufacturers still consider not applying dither at 24bits to be a legitimate design choice, while now simultaneously claiming that 32-bit float "solves the quantization problem", I smell rotten fish.
Again, whenever you see marketing materials use those stair-step drawings, best to stop reading right there because you are being lied to.
On the 'combining algorithms' TheJez mentions-
They are clever, they are cool, they are useful. But philosophically what bugs me deep down about all this is that the quality of a digital recorder used to be defined by absolute linearity within it's bandwidth - being measurably bit perfect in/out within that range. Use that available bandwidth anyway you like and be assured that whatever you put in you get back out again. That includes sound-design folks recording ultrasonics and pitching them down into human hearing range, which is apparently where multi-ADC combining artifacts of some recorder were noticed. This deeper problem is not about us not noticing or caring about the artifacts / non-linearities, it's about the shift from truly predictable linear transparent systems to ones which only need be perceptually transparent. I appreciate the useful extension of dynamic range.. when it is done right and effective. But when adopting the tech required to do that sacrifices measurable, linear, predictability I don't like where it can be too easily taken. It all too quickly becomes a marketing smoke and mirror game rather than one of measurable linearity. Reminds me of the mp3 perceptual codec thing all over again in sacrificing true linearity for perceptual psychoacoustics.