Dialogue Act Detection from Human-Human Spoken Conversations

Nithin Ramacandran

Call for Paper

August Edition

IJCA solicits high quality original research papers for the upcoming August edition of the journal. The last date of research paper submission is 21 July 2025

Submit your paper

Know more

The week's pick

Navigating the Future of Cybersecurity: A Strategic Approach to Crypto Agility for Modern Enterprises

Aditya Gupta

Random Articles

Concept based Ranking of Results using an Ontology and Fuzzy Network for a Personalized Web Search Engine

November

2013

Comparing Soft Computing Techniques for Estimating Demand of Season Ticket Holders

Oct

2018

Distance Field based Haptic Rendering of Scattered Oriented Points

January

2013

Application Development Feasibility: DevOps or SRE?

Aug

2023

Reseach Article

Dialogue Act Detection from Human-Human Spoken Conversations

by Nithin Ramacandran

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 67 - Number 5

Year of Publication: 2013

Authors: Nithin Ramacandran

10.5120/11392-6688

Nithin Ramacandran . Dialogue Act Detection from Human-Human Spoken Conversations. International Journal of Computer Applications. 67, 5 ( April 2013), 24-27. DOI=10.5120/11392-6688

@article{ 10.5120/11392-6688,

author = { Nithin Ramacandran },

title = { Dialogue Act Detection from Human-Human Spoken Conversations },

journal = { International Journal of Computer Applications },

issue_date = { April 2013 },

volume = { 67 },

number = { 5 },

month = { April },

year = { 2013 },

issn = { 0975-8887 },

pages = { 24-27 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume67/number5/11392-6688/ },

doi = { 10.5120/11392-6688 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-06T21:23:52.915919+05:30

%A Nithin Ramacandran

%T Dialogue Act Detection from Human-Human Spoken Conversations

%J International Journal of Computer Applications

%@ 0975-8887

%V 67

%N 5

%P 24-27

%D 2013

%I Foundation of Computer Science (FCS), NY, USA

Abstract

Accurate detection of dialogue acts is essential for understanding human conversations and to recognize emotions. This requires 1) the segmentation of human-human dialogs into turns, 2) the intra-turn segmentation into DA boundaries and 3) the classification of each segment according to a DA tag. Most dialogue act classification models approaches the problem of identifying the different DA segments within an utterance in separate fashion: first, DA boundary segmentation within an utterance was addressed with generative or discriminative approaches then, DA labels were assigned to such boundaries based on multi-classification. This paper, presents an effective approach to improve the accuracy of dialogue act recognition from speech signal by combining acoustic and linguistic features. This paper adopts the use of a silence removal algorithm based on Mahalanobis Distance for the segmentation of human-human dialogs into turns and proposes the keyword spotting feature to reduce the ambiguity of opinion vs. non-opinion statements and agreements vs. acknowledgements, occurs while classifying the dialogue acts.

References

Y. Liu, A. Stolcke, E. Shriberg, and M. Harper. 2004. Comparing and combining generative and posterior probability models: Some advances in sentence boundary detection in speech. In Proceedings of the Conference on Empirical Methods in Natural Language Processing
S. Quarteroni and G. Riccardi, "Dialog Act Classification in Human Human and Human Machine Conversations," in Proc. INTERSPEECH, 2010.
Silvia Quarteroni, Alexei V. Ivanov, Giuseppe Riccardi, "Simultaneous Dialog Act Segmentation And Classification from Human-Human Spoken Conversations," IEEE 2011.
G. Saha, Sandipan Chakroborty, Suman Senapati, "A New Silence Removal and Endpoint Detection Algorithm for Speech and Speaker Recognition Applications", Proc. Eleventh Conference on Speech Processing, IIT Khragpur 2005.
M. Dinarelli, S. Quarteroni, S. Tonelli, A. Moschitti, and G. Riccardi,"Annotating spoken dialogs: from speech segments to dialog acts and frame semantics," in Proc. SRSL, 2009.
J. Lafferty, A. McCallum, and F. Pereira, "Conditional random fields: Probabilistic models for segmenting and labelling sequence data," in Proc. ICML, 2001.
P. Boersma, "Praat, a system for doing phonetics by computer," Glot International, vol. 5, no. 9/10, pp. 341–345, 2001. [Online]. Available: http://www. praat. org
Atal, B. ; Rabiner, L. , "A pattern recognition approach to voiced-unvoiced-silence classification with applications to speech recognition" Acoustics, Speech, and Signal Processing [see also IEEE Transactions on Signal Processing], IEEE Transactions on , Volume: 24 , Issue: 3 , Jun 1976, Pages: 201 - 212.
D. G. Childers, M. Hand, J. M. Larar, " Silent and Voiced/Unvoied/Mixed excitation(FourWay),Classification of Speech", IEEE Transaction on ASSP, Vol-37, No-11, pp. 1771-74, Nov 1989.
A. Stolcke, K Ries, N. Coccaro, E. Shriberg, R. Bates, D. Jurafsky, P. Taylor, R. Martin, C. Van Ess-dykema, andM. Meteer, "Dialogue Act modeling and automatic tagging And recognition of conversational speech", Computational Linguistics vol. 26, 2000.
Richard. O. Duda, Peter E. Hart, David G. Strok, "Pattern Classification", A Wiley Inter science publication, John Wiley & Sons, Inc, Second Edition, 2001.
Sarma, V. ; Venugopal, D. , "Studies on pattern recognition approach to voiced-unvoiced-silence classification", Acoustics, Speech, and Signal Processing, IEEE International conference on ICASSP '78. ,Volume 3:April 1978, pages 1-4.

Index Terms

Computer Science

Information Sciences

Keywords

Dialogue Acts Silence Removal Algorithms Conditional Random Fields