Intro time (time to the start of the first vocals)

One requirement in radio broadcasting is knowing the length of a track's intro. Typically the DJ talks over this part, or software automatically shifts a voice track over the music. I am quite impressed with the information that is currently extracted, and I wonder whether a binary vocal / non-vocal feature detector is available and could be used to derive the intro time, and possibly the outro time as well.
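To make the request concrete, here is a minimal sketch of how I imagine such a detector could be turned into intro and outro times. It assumes the detector outputs one vocal / non-vocal flag per analysis frame at a fixed hop. The function name `intro_outro_times` and the parameters `hop_seconds` and `min_run` are hypothetical and not part of any existing API. Short vocal runs are ignored so that a single misdetected frame does not end the intro early.

```python
from typing import Optional, Sequence, Tuple


def intro_outro_times(
    vocal_flags: Sequence[bool],
    hop_seconds: float,
    min_run: int = 3,
) -> Tuple[Optional[float], Optional[float]]:
    """Return (intro_seconds, outro_seconds) from per-frame vocal flags.

    intro_seconds: time from track start to the first sustained vocal run.
    outro_seconds: time from the end of the last sustained vocal run to
    the end of the track.
    Both are None if no run of at least `min_run` vocal frames exists.
    All names here are illustrative assumptions, not an existing API.
    """
    first_start = None  # frame index where the first sustained run begins
    last_end = None     # frame index just past the last sustained run
    run_start = None    # start of the run currently being scanned

    # Append a trailing False so a run reaching the final frame is closed.
    for i, flag in enumerate(list(vocal_flags) + [False]):
        if flag:
            if run_start is None:
                run_start = i
        elif run_start is not None:
            if i - run_start >= min_run:
                if first_start is None:
                    first_start = run_start
                last_end = i
            run_start = None

    if first_start is None:
        return None, None

    total_frames = len(vocal_flags)
    intro = first_start * hop_seconds
    outro = (total_frames - last_end) * hop_seconds
    return intro, outro


# Example: 0.5 s hop, one spurious vocal frame inside the intro.
flags = [False] * 20 + [True] + [False] * 4 + [True] * 50 + [False] * 15
print(intro_outro_times(flags, hop_seconds=0.5))  # (12.5, 7.5)
```

In practice the `min_run` threshold would need tuning against the detector's real error rate, and instrumental breaks in the middle of a track are deliberately ignored here since only the first and last vocal runs matter.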

