Do you send the audio Monitor to the same device as the Desktop source or Audio Output Capture? That'll do it.
Most people are surprised with how late the pickup is for the "speaker capture", to make up a general term. It really does capture *everything* that comes out of the speakers/headphones, with a pedantic definition of "everything".
So if you send the mic there, it'll include the mic, etc.