Dialogue Act Detection from Human-Human Spoken Conversations

Nithin Ramacandran

Call for Paper

June Edition

IJCA solicits high quality original research papers for the upcoming June edition of the journal. The last date of research paper submission is 20 May 2024

Submit your paper

Know more

The week's pick

Enhancing Privacy Preservation: Multi-Attribute Protection with P-Sensitive K-Anonymity

Twinkle Patel Kiran Amin

Random Articles

Efficient Algorithm for Mining Frequent Subgraphs (Static and Dynamic) based on gSpan

February

2013

Handwritten Gurmukhi Numeral Recognition using Different Feature Sets

August

2011

Spectral Entropy Estimation of HRV Data of Thyroid and Healthy subjects

September

2011

Finger Vein Verification System based on Three Methodologies of Feature Extraction

Aug

2017

Reseach Article

Dialogue Act Detection from Human-Human Spoken Conversations

by Nithin Ramacandran

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 67 - Number 5

Year of Publication: 2013

Authors: Nithin Ramacandran

10.5120/11392-6688

Nithin Ramacandran . Dialogue Act Detection from Human-Human Spoken Conversations. International Journal of Computer Applications. 67, 5 ( April 2013), 24-27. DOI=10.5120/11392-6688

@article{ 10.5120/11392-6688,

author = { Nithin Ramacandran },

title = { Dialogue Act Detection from Human-Human Spoken Conversations },

journal = { International Journal of Computer Applications },

issue_date = { April 2013 },

volume = { 67 },

number = { 5 },

month = { April },

year = { 2013 },

issn = { 0975-8887 },

pages = { 24-27 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume67/number5/11392-6688/ },

doi = { 10.5120/11392-6688 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-06T21:23:52.915919+05:30

%A Nithin Ramacandran

%T Dialogue Act Detection from Human-Human Spoken Conversations

%J International Journal of Computer Applications

%@ 0975-8887

%V 67

%N 5

%P 24-27

%D 2013

%I Foundation of Computer Science (FCS), NY, USA

Abstract

Accurate detection of dialogue acts is essential for understanding human conversations and to recognize emotions. This requires 1) the segmentation of human-human dialogs into turns, 2) the intra-turn segmentation into DA boundaries and 3) the classification of each segment according to a DA tag. Most dialogue act classification models approaches the problem of identifying the different DA segments within an utterance in separate fashion: first, DA boundary segmentation within an utterance was addressed with generative or discriminative approaches then, DA labels were assigned to such boundaries based on multi-classification. This paper, presents an effective approach to improve the accuracy of dialogue act recognition from speech signal by combining acoustic and linguistic features. This paper adopts the use of a silence removal algorithm based on Mahalanobis Distance for the segmentation of human-human dialogs into turns and proposes the keyword spotting feature to reduce the ambiguity of opinion vs. non-opinion statements and agreements vs. acknowledgements, occurs while classifying the dialogue acts.

References

Y. Liu, A. Stolcke, E. Shriberg, and M. Harper. 2004. Comparing and combining generative and posterior probability models: Some advances in sentence boundary detection in speech. In Proceedings of the Conference on Empirical Methods in Natural Language Processing
S. Quarteroni and G. Riccardi, "Dialog Act Classification in Human Human and Human Machine Conversations," in Proc. INTERSPEECH, 2010.
Silvia Quarteroni, Alexei V. Ivanov, Giuseppe Riccardi, "Simultaneous Dialog Act Segmentation And Classification from Human-Human Spoken Conversations," IEEE 2011.
G. Saha, Sandipan Chakroborty, Suman Senapati, "A New Silence Removal and Endpoint Detection Algorithm for Speech and Speaker Recognition Applications", Proc. Eleventh Conference on Speech Processing, IIT Khragpur 2005.
M. Dinarelli, S. Quarteroni, S. Tonelli, A. Moschitti, and G. Riccardi,"Annotating spoken dialogs: from speech segments to dialog acts and frame semantics," in Proc. SRSL, 2009.
J. Lafferty, A. McCallum, and F. Pereira, "Conditional random fields: Probabilistic models for segmenting and labelling sequence data," in Proc. ICML, 2001.
P. Boersma, "Praat, a system for doing phonetics by computer," Glot International, vol. 5, no. 9/10, pp. 341–345, 2001. [Online]. Available: http://www. praat. org
Atal, B. ; Rabiner, L. , "A pattern recognition approach to voiced-unvoiced-silence classification with applications to speech recognition" Acoustics, Speech, and Signal Processing [see also IEEE Transactions on Signal Processing], IEEE Transactions on , Volume: 24 , Issue: 3 , Jun 1976, Pages: 201 - 212.
D. G. Childers, M. Hand, J. M. Larar, " Silent and Voiced/Unvoied/Mixed excitation(FourWay),Classification of Speech", IEEE Transaction on ASSP, Vol-37, No-11, pp. 1771-74, Nov 1989.
A. Stolcke, K Ries, N. Coccaro, E. Shriberg, R. Bates, D. Jurafsky, P. Taylor, R. Martin, C. Van Ess-dykema, andM. Meteer, "Dialogue Act modeling and automatic tagging And recognition of conversational speech", Computational Linguistics vol. 26, 2000.
Richard. O. Duda, Peter E. Hart, David G. Strok, "Pattern Classification", A Wiley Inter science publication, John Wiley & Sons, Inc, Second Edition, 2001.
Sarma, V. ; Venugopal, D. , "Studies on pattern recognition approach to voiced-unvoiced-silence classification", Acoustics, Speech, and Signal Processing, IEEE International conference on ICASSP '78. ,Volume 3:April 1978, pages 1-4.

Index Terms

Computer Science

Information Sciences

Keywords

Dialogue Acts Silence Removal Algorithms Conditional Random Fields