I have a recent recording I'm going to try this on. It's an instrumental ensemble - Hammond organ, guitar, bass, drums and sax - that the crowd in the tiny room just yakked really loud the whole time.
I hope to isolate the talking using the separate vocal stems method and discard as much of the conversation as I can.
Thanks for your write up about how you accomplished your edits.
EDIT - I took a short clip of the show from right at the beginning where the talking is especially bad and ran it through UVR5. Here's a clip of the results
https://soundcloud.com/roger-cox-7/robertwalter2024-05-03uvrcomp?si=822ae8d008194275ae17808e98ee4bcb&utm_source=clipboard&utm_medium=text&utm_campaign=social_sharing.
0-10 sec = raw track
10-20 sec = just removed noise (vocal track)
20-30 sec = just resulting music (instrumental track)
30-40 sec = raw track again to compare
It's better but it's not a silver bullet for this kind of thing. It definitely removed the worst of the talking and didn't degrade the music all that much. If you listen closely to the removed voices only you can hear a little of the organ that got removed too.