Beyond the practical take.. (disclaimer, I'm not a digital audio EE!)
I think those staircase waveform images are one of the biggest deciets in digital audio. First off, a lot of folks, although not so much TS members, seem to believe that staircase represents the shape of the audio output. But as most here know, the output is always smoothed to a curve. The question is how accurately the shape of the smoothed output curve reflects the input. Sampling Theorem tells us that within a given bandwidth, the number of samples needed to do that needs only to exceed that bandwidth by two times in order to be able to retrieve the complete waveform, including all values between those sample times. As long as there are two or more sample points per cycle a steady state sine wave can be fully reconstructed. Extending that, a sinc function of overlapping sine waves is further capable of recreating a complex waveform shape that varies with time.
The devil is in the details of how the waveform is decimated and recreated, the filtering to limit the signal to within the usable bandwith, not in the bit depth and sample rate itself.
It would be alot less misleading if those images showed just the individual sample points at each "step corner" and not the lines connecting them that visually form the stair step representation. The actual sample points do not get connected by straight lines, they determine a series of overlapping curves that integrate to form the curved output waveform. The overlapping curves and averaging fit output waveform to the sample points. There are no stair steps.
It would also be less misleading if such images varied the spacing between sample points along the vertical axis with change of bit-depth, and the spacing between points along the horizontal axis with change of sample-rate. But such images almost never show that. Instead they are almost always drawn showing equal spacing along both axes.. as perfectly square stair-shaped steps.
The actual bandwidth limit of a recording is most likely to be solely determined by the dynamic range of the acoustic situation, beyond that by the dynamic range capability of the microphones, and beyond that by the preamp stage or ADC. A more complicated ADC arrangement designed to switch gracefully between multiple ADCs can extend the ADC range constraint, yet the range of a single 24-bit ADC most likely already exceeds that of the acoustic environment and most microphones. Within the bandwidth limits determined by the recording chain, the information represented in a 24-bit representation is going to be the same as in a 32-bit float representation. The 32-bit float storage representation just allows the 24-bit chunk of meaningful data to be shifted up and down as needed in the digital realm, it doesn't provide greater resolution within that meaningful range.
Remember, multiple switching ADCs and 32-bit float representation of the data are two different things. We can have one without the other, which manufacturers tend to gloss that over. A multiple switching ADC scheme extends the dynamic range envelope allowing for more lax real-world input gain setting by the user.. or no setting at all. But in simple terms, the "resolution" within that envelope is defined by the sample rate and determines the high frequency limit of the system, not the reproduction accuracy within the limits of the frequency range. Higher resolution extends the frequency range, but doesn't increase accuracy within the range.
a 32-bit floating point representation of the output from a non-dithered single 16-bit ADC contains the same information as a 16-bit fixed point representation of it and vice-versa. The 32-bit floating point digital container itself is vastly larger than the 16-bit digital container, making it capable of representing a far wider dynamic range, but in this case the useful data being output from the single 16-bit ADC and stored inside it remains the same.
Likewise, the electrical noise floor of a 32-bit float recorder is going to be determined by the analog input stage and ADC of the recorder, not by the data storage format.