Speech Enhancement using Segmental Non-Negative Matrix Factorization (SNMF) and Hidden Marvok Model (HMM)

Barinder Singh; Karan Mahajan

Call for Paper

May Edition

IJCA solicits high quality original research papers for the upcoming May edition of the journal. The last date of research paper submission is 20 April 2026

Submit your paper

Know more

The week's pick

A Unified NIST SP 800-90B Validation Framework for CMOS True Random Number Generators and Quantum Random Number Generators

Che-Ping Lin

Random Articles

Reseach Article

Speech Enhancement using Segmental Non-Negative Matrix Factorization (SNMF) and Hidden Marvok Model (HMM)

by Barinder Singh, Karan Mahajan

International Journal of Computer Applications

Foundation of Computer Science (FCS), NY, USA

Volume 119 - Number 6

Year of Publication: 2015

Authors: Barinder Singh, Karan Mahajan

10.5120/21068-3738

Barinder Singh, Karan Mahajan . Speech Enhancement using Segmental Non-Negative Matrix Factorization (SNMF) and Hidden Marvok Model (HMM). International Journal of Computer Applications. 119, 6 ( June 2015), 1-2. DOI=10.5120/21068-3738

@article{ 10.5120/21068-3738,

author = { Barinder Singh, Karan Mahajan },

title = { Speech Enhancement using Segmental Non-Negative Matrix Factorization (SNMF) and Hidden Marvok Model (HMM) },

journal = { International Journal of Computer Applications },

issue_date = { June 2015 },

volume = { 119 },

number = { 6 },

month = { June },

year = { 2015 },

issn = { 0975-8887 },

pages = { 1-2 },

numpages = {9},

url = { https://ijcaonline.org/archives/volume119/number6/21068-3738/ },

doi = { 10.5120/21068-3738 },

publisher = {Foundation of Computer Science (FCS), NY, USA},

address = {New York, USA}

}

%0 Journal Article

%1 2024-02-06T23:03:17.644701+05:30

%A Barinder Singh

%A Karan Mahajan

%T Speech Enhancement using Segmental Non-Negative Matrix Factorization (SNMF) and Hidden Marvok Model (HMM)

%J International Journal of Computer Applications

%@ 0975-8887

%V 119

%N 6

%P 1-2

%D 2015

%I Foundation of Computer Science (FCS), NY, USA

Abstract

Speech Enhancement refered as to improve quality or intelligibility of speech signal. Speech signal is often degraded by additive background noise like babble noise, train noise, restaurant noise etc. Speech enhancement aims at improving the performance of speech communication systems in noisy environments. This paper proposes a segmental NMF (SNMF) speech enhancement scheme to improve the conventional frame-wise NMF-based method. In this two algorithms are derived to decompose the original nonnegative matrix associated with the magnitude spectrogram, the first algorithm is used in the spectral domain and the second algorithm is used in the temporal domain . In this paper Hidden macro model and SNMF(S) for subjective learning (SNMF-S). Then the SNMF for the objective learning (SNMF-O) will be implemented.

References

Hao-Teng Fan1, Jeih-weih Hung 1, Xugang Lu2, Syu-Siang Wang3 , and Yu Tsao3," Speech enhancement using segmental non negative matrix factorization, 2014 IEEE International Conference on Acoustic, Speech and Signal Processing (ICASSP).
R. Martin, "Noise power spectral density estimation based on optimal smoothing and minimum statistics," IEEE Transactions on Speech and Audio Processing, 9(5), pp. 504–512, 2001.
I. Cohen, "Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging," IEEE Transactions on Speech and Audio Processing, 11(5) ,pp. 466–475, 2003.
S. Srinivasan, J. Samuelsson, and W. Kleijn, "Codebook driven short-term predictor parameter estimation for speech enhancement," IEEE Transactions on Audio, Speech, and Language Processing, 14(1), pp. 163–176, 2006.
D. Y. Zhao and W. B. Kleijn, "HMM-based gain modelling for enhancement of speech in noise," IEEE Transactions on Audio, Speech, and Language Processing, 15(3), pp. 882–892, 2007.
K. El-Maleh, A. Samouelian, and P. Kabal, "Frame level noise classification in mobile environments," in Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 237–240, 1999.
Berdugo , B. and Cohen, "Noise estimation by minima controlled recursive averaging for robust speech enhancement" , IEEE Signal Proc. Letters, vol. 9, no. 1, pp. 12-15, Jan. 2002.
C. Breithaupt and R. Martin,"MMSE estimation of magnitude-squared DFT coefficients with super-gaussian priors",? in Proc. IEEE Int. Conf. Acoust. , Speech, Signal Processing, pp. 848-851, 2003.
Malah, D. , Cox, R. V. and Accardi, A. J. , "Tracking speech-presence uncertainty to improve speech enhancement in non-stationary noise environments,"? Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing, vol. 2, pp. 789-792, 15-19 Mar 1999.
Y. Ephraim," Speech Enhancement Using a Minimum Mean-Square Error Log-Spectral Amplitude Estimator", IEEE Transactions On Acoustics, Speech and Signal Processing 0096-35 18/8S/0400-0443, 1985.

Index Terms

Computer Science

Information Sciences

Keywords

Speech Enhancement Non negative Matrix Factorization (NMF) Segmental Nonnegative Matrix Factorization (SNMF)