International Journal of Computer Applications |
Foundation of Computer Science (FCS), NY, USA |
Volume 187 - Number 3 |
Year of Publication: 2025 |
Authors: Akansha Tyagi, Padmanabhan Rajan |
![]() |
Akansha Tyagi, Padmanabhan Rajan . Subspace-based Representations for Acoustic Scene Classification. International Journal of Computer Applications. 187, 3 ( May 2025), 1-8. DOI=10.5120/ijca2025924777
Real-world acoustic scene data has a complex structure that leads to high levels of overlap within an acoustic scene class. This overlap stems from various similar factors, such as different recording devices and recording locations or cities, which act as confounding factors. On the other hand, the same set of confounding factors would be present across different acoustic scene classes and can be considered as a common link across them. Utilizing this common structure, it is possible to perform multi-block analysis to learn the representation of these common links. Two formulations are proposed for the multi-block analysis of acoustic scene data, employing a common orthogonal basis extraction algorithm. The proposed formulations enhance the performance of the acoustic scene classification system by reducing the information pertaining to the recording devices and cities from the learnt acoustic scene representations. Experiments were conducted on five standard Detection and Classification of Acoustic Scenes and Events (DCASE) datasets. Across all datasets, the classification performance achieved using features derived from the multi-block formulations surpassed that of features not incorporating these formulations.