Another thing to watch for, is if you have a Desktop source pointing to the same device that you send the Monitor to, then that Desktop source will include a slightly delayed (usually by a second or less, but still noticeable) copy of the Monitor. Most people don't want that, so don't make those two things equal.
The Audio Output Capture source that you can put in a scene, works the same way, so watch for that too.
It's not OBS that does that, but the operating system. The OS only gives OBS one possible feed for each selection, which is after it's mixed everything into it, even if it came from the same app.