GStreamer needs a clock to synchronize between audio and video. That clock is usually derived from the audio sink. In this case, something is wrong with your audio sink -- probably intel_hda related -- and so things go pear-shaped.
This problem is not in GStreamer, and is not related to the audio format.