CFP last date
20 May 2024
Reseach Article

Dialogue Act Detection from Human-Human Spoken Conversations

by Nithin Ramacandran
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 67 - Number 5
Year of Publication: 2013
Authors: Nithin Ramacandran
10.5120/11392-6688

Nithin Ramacandran . Dialogue Act Detection from Human-Human Spoken Conversations. International Journal of Computer Applications. 67, 5 ( April 2013), 24-27. DOI=10.5120/11392-6688

@article{ 10.5120/11392-6688,
author = { Nithin Ramacandran },
title = { Dialogue Act Detection from Human-Human Spoken Conversations },
journal = { International Journal of Computer Applications },
issue_date = { April 2013 },
volume = { 67 },
number = { 5 },
month = { April },
year = { 2013 },
issn = { 0975-8887 },
pages = { 24-27 },
numpages = {9},
url = { https://ijcaonline.org/archives/volume67/number5/11392-6688/ },
doi = { 10.5120/11392-6688 },
publisher = {Foundation of Computer Science (FCS), NY, USA},
address = {New York, USA}
}
%0 Journal Article
%1 2024-02-06T21:23:52.915919+05:30
%A Nithin Ramacandran
%T Dialogue Act Detection from Human-Human Spoken Conversations
%J International Journal of Computer Applications
%@ 0975-8887
%V 67
%N 5
%P 24-27
%D 2013
%I Foundation of Computer Science (FCS), NY, USA
Abstract

Accurate detection of dialogue acts is essential for understanding human conversations and to recognize emotions. This requires 1) the segmentation of human-human dialogs into turns, 2) the intra-turn segmentation into DA boundaries and 3) the classification of each segment according to a DA tag. Most dialogue act classification models approaches the problem of identifying the different DA segments within an utterance in separate fashion: first, DA boundary segmentation within an utterance was addressed with generative or discriminative approaches then, DA labels were assigned to such boundaries based on multi-classification. This paper, presents an effective approach to improve the accuracy of dialogue act recognition from speech signal by combining acoustic and linguistic features. This paper adopts the use of a silence removal algorithm based on Mahalanobis Distance for the segmentation of human-human dialogs into turns and proposes the keyword spotting feature to reduce the ambiguity of opinion vs. non-opinion statements and agreements vs. acknowledgements, occurs while classifying the dialogue acts.

References
  1. Y. Liu, A. Stolcke, E. Shriberg, and M. Harper. 2004. Comparing and combining generative and posterior probability models: Some advances in sentence boundary detection in speech. In Proceedings of the Conference on Empirical Methods in Natural Language Processing
  2. S. Quarteroni and G. Riccardi, "Dialog Act Classification in Human Human and Human Machine Conversations," in Proc. INTERSPEECH, 2010.
  3. Silvia Quarteroni, Alexei V. Ivanov, Giuseppe Riccardi, "Simultaneous Dialog Act Segmentation And Classification from Human-Human Spoken Conversations," IEEE 2011.
  4. G. Saha, Sandipan Chakroborty, Suman Senapati, "A New Silence Removal and Endpoint Detection Algorithm for Speech and Speaker Recognition Applications", Proc. Eleventh Conference on Speech Processing, IIT Khragpur 2005.
  5. M. Dinarelli, S. Quarteroni, S. Tonelli, A. Moschitti, and G. Riccardi,"Annotating spoken dialogs: from speech segments to dialog acts and frame semantics," in Proc. SRSL, 2009.
  6. J. Lafferty, A. McCallum, and F. Pereira, "Conditional random fields: Probabilistic models for segmenting and labelling sequence data," in Proc. ICML, 2001.
  7. P. Boersma, "Praat, a system for doing phonetics by computer," Glot International, vol. 5, no. 9/10, pp. 341–345, 2001. [Online]. Available: http://www. praat. org
  8. Atal, B. ; Rabiner, L. , "A pattern recognition approach to voiced-unvoiced-silence classification with applications to speech recognition" Acoustics, Speech, and Signal Processing [see also IEEE Transactions on Signal Processing], IEEE Transactions on , Volume: 24 , Issue: 3 , Jun 1976, Pages: 201 - 212.
  9. D. G. Childers, M. Hand, J. M. Larar, " Silent and Voiced/Unvoied/Mixed excitation(FourWay),Classification of Speech", IEEE Transaction on ASSP, Vol-37, No-11, pp. 1771-74, Nov 1989.
  10. A. Stolcke, K Ries, N. Coccaro, E. Shriberg, R. Bates, D. Jurafsky, P. Taylor, R. Martin, C. Van Ess-dykema, andM. Meteer, "Dialogue Act modeling and automatic tagging And recognition of conversational speech", Computational Linguistics vol. 26, 2000.
  11. Richard. O. Duda, Peter E. Hart, David G. Strok, "Pattern Classification", A Wiley Inter science publication, John Wiley & Sons, Inc, Second Edition, 2001.
  12. Sarma, V. ; Venugopal, D. , "Studies on pattern recognition approach to voiced-unvoiced-silence classification", Acoustics, Speech, and Signal Processing, IEEE International conference on ICASSP '78. ,Volume 3:April 1978, pages 1-4.
Index Terms

Computer Science
Information Sciences

Keywords

Dialogue Acts Silence Removal Algorithms Conditional Random Fields