Vocal burst
Measure emotional expression from non-linguistic vocalizations like laughs, sighs, and gasps.
The vocal burst model measures 48 dimensions of emotional expression from non-linguistic vocalizations such as laughs, sighs,
gasps, cries, and other sounds that carry emotional meaning but are not words. Recommended input filetypes: .wav, .mp3, .mp4.
Job configuration
The vocal burst model has no configurable parameters in either API. Enable it by passing an empty object:
Output
Each prediction includes:
- Time interval: the
beginandendtimestamps in seconds - Emotion scores: scores for each of the 48 expressions
- Descriptions: scores for each of the 67 burst types (e.g., “Laugh”, “Sigh”, “Gasp”)
Burst types
The model can detect a wide range of vocal burst types, from laughter and cheering to gasps and groans.
Expressions
The vocal burst model measures the following 48 expressions. These are the same expressions measured by the facial expression and speech prosody models.

