Running BKE_scene_eval_sequencer_sequences() removes sound from the scene if any scene strip uses the camera as input.
This patch fixes the issue, but since sound is "owned" by the scene, scene strips that use camera input will now play audio as well.
I don't know whether this can be solved nicely. I was thinking about setting seq->volume to 0. It does work, but I'm not sure it counts as a nice solution.
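
For illustration, the seq->volume idea could look roughly like the sketch below. The Strip struct, the STRIP_* constants and mute_camera_scene_strips() are simplified, hypothetical stand-ins made up for this example. They are not Blender's actual types or API, and this is not the code in the patch.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical, simplified stand-ins for the sequencer's strip data. */
enum { STRIP_TYPE_SCENE = 1, STRIP_TYPE_OTHER = 2 };
enum { STRIP_INPUT_CAMERA = 0, STRIP_INPUT_SEQUENCER = 1 };

typedef struct Strip {
  int type;           /* STRIP_TYPE_* */
  int scene_input;    /* STRIP_INPUT_*, only meaningful for scene strips */
  float volume;       /* plays the role of seq->volume */
  struct Strip *next; /* next strip in a singly linked list */
} Strip;

/* True if the strip is a scene strip that renders through the camera. */
static bool strip_uses_camera_input(const Strip *strip)
{
  return strip->type == STRIP_TYPE_SCENE && strip->scene_input == STRIP_INPUT_CAMERA;
}

/* Set the volume to 0 on every scene strip that uses camera input.
 * Returns the number of strips that were silenced. */
static int mute_camera_scene_strips(Strip *first)
{
  int muted = 0;
  for (Strip *strip = first; strip != NULL; strip = strip->next) {
    if (strip_uses_camera_input(strip)) {
      strip->volume = 0.0f;
      muted++;
    }
  }
  return muted;
}
```

Note that this overwrites the stored volume, so any value the user set on such a strip is lost, which is part of why it may not be the nice solution.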