I think the existence of inaudible dog whistles is a serious blow against your hypothesis. We've had much longer experience with audio frequencies near the edge of human perception than you might realize at first blush. Much of that history predates any attempt at digital sampling. If 99.9% of people can't perceive a dog whistle at 22 kHz, they aren't going to hear that frequency played on the Alpine speakers in their car either.
Video framing, on the other hand, is relatively new... unless you count thumb-powered flipbooks and pen-and-paper animation.