Consider using CoreAudio dynamic aggregate devices

User picture
User picture
User picture
User picture