This study proposes a methodology to find the interest levels of two speakers in a conversation. The ANN-HMM approach-a hybrid method is adopted. The hybrid method uses language input as an additional parameter in addition to the acoustic features. The language input provides a measure of classification of the input speech utterance. A combined classifier is used to make a linear decision on the emotion of the uttered speech as an arousal or valence. When the decision is fed to the Generative Factor Analyzed Hidden Markov Model (GFA-HMM) it evidently substantiates to be a better method with good accuracy rate of classification of whether the speaker is entangled in the conversation or vice-versa. The proposed method produced highly satisfactory results for the Linguistic Data Consortium (LDC) emotional prosody dataset.
展开▼