I remember doing one recording sbd + aud (hyper, din, in the middle of the room). I aligned the first version exactly according to the waveform and my listening. In the second version, I delayed sbd by 10 ms (maybe a little less, I don't remember it well). My friend, who has perfect hearing and good monitors, confirmed that the second version is better. The sbd didn't sound good, and by delaying it, it fell into the aud. It was not a significant change but it was positive. And when I tried to just turn down the levels of the sbd, it didn't have the same good effect. [..snip]
Was going back through this thread and realized that I'd intended, but never actually got around to replying to this post way back when.
Here's something that I think provides insight into the observation made by kuba e above, has informed how I think about all this, and which I believe may be relevant to mixing AUD and SBD..
Check out the paper by Gunter Theil that is attached to the bottom of this post titled: "Room Related Balancing Technique". It discusses an interesting technique for mixing orchestral section spot mics in with a main stereo pair microphone feed. When reading it, substitute a typical AUD recording for the orchestral main stereo pair of microphones and a SBD feed for the spot mics. After all, a SBD feed is essentially a mix of a bunch of spot mics.
This technique treats the main pair (AUD) as the primary source, reinforced by the spot mics (SBD) as secondary source, used at a lower level. Although that's the opposite of the original intent of this thread, doing it that way around is my preference whenever possible rather than to treating the dry SBD as primary and using the AUD to "spatialize it" with the addition of some room and audience sound, as that can sound upfront and clear, but almost always a bit flat, artificial, and not natural souding me. Useful when necessary, but I prefer to preserve the depth, imaging, and naturalness of a good main stereo pair or mic array when possible. More on the fundamental difference in approaches in wforwumbo's post earlier in the thread-
https://taperssection.com/index.php?topic=198253.msg2367215#msg2367215The goal of
Room Related Balancing Technique is to preserve imaging and depth cues inherent in a good main stereo AUD pair by having that content arrive first, providing first arrival information, and treat and the spot microphone signals (SBD) so that they act like early reflections reinforcing the main pair signal. To do so, the dry spot-mics are first fully aligned with the main pair (or calculations made to figure out the delays necessary to bring both into alignment), then the spot mic signals are muti-tap delayed by a few milliseconds with respect to the main pair. Each delayed copy is panned to a random position so that they collectively emulate the ever-so-slightly delayed arrival of early reflections from random directions. The spot mic signals are essentially fit in between the initial arrival of the main pair and the reverb tail of the main pair as would be discrete early reflections. In that way they provide the desired increased clarity and bring up the level of content not well enough represented in the main pair alone, but without screwing up the initial arrival information from main pair that provides good stereo imaging and depth cues.
In application to what we are doing, we'd not mulit-tap delay the SBD feed and randomly pan those delayed copies around in the same way, although we might do that if working with the individual SBD channels. Instead we'd most likely do exactly what kuba e describes - delay the SBD just enough that it is a few milliseconds behind the AUD. AUD pings our awareness first, then SBD comes in and reinforces the AUD withing the precedence-effect perceptual window, inhabiting a Goldilocks zone by not arriving too early so as to obscure the main mic cues, not or too late to blur or become echoic.
Cool stuff and a good read-