The tone detector is quite sensitive and will trigger a 'Y' if there is audio with a peak around 2100Hz for more than a few hundred milliseconds. You can enable detection of only certain tones via the API.
↧