Sensors & Transducers
Vol. 263, Issue 4, December 2023, pp. 119-130
_______________
Classifying Musical Instrument Using Spatiotemporal Features
​with Deep Neural Networks
1 Chai Xiu CHIAH, 1, * Lee Choo TAY and 1 Weng Kin LAI
1
Tunku Abdul Rahman University of Management and Technology,
​Jalan Genting Kelang, Setapak, 53300 Kuala Lumpur, Malaysia
1 Tel.: 6034145123, fax: 60341423166
* E-mail: taylc@tarc.edu.my
Received: 2 October 2023 / Accepted: 8 December 2023 Published: 21 December 2023
Abstract:
Similar to trained human ears, a machine could be trained to recognize musical instruments presence in audio tracks for the purpose of musical information retrieval. The audio signal produced by an instrument has unique temporal and spectral features, which could be extracted for machine learning purpose. This research investigates the use of spatiotemporal features, which were converted from temporal and spectral features of monophonic music for this application. The performance of a recurrent neural network model called bidirectional Long Short Term Memory (Bi-LSTM) network and two convolutional neural networks (CNNs) in musical instruments classification with log Mel-spectrogram were analysed and evaluated. With a dataset of 14 musical instruments, the Bi-LSTM and 1-dimensional CNN models obtained a macro F1 score of 0.976 and 0.977 respectively, while 2-dimensional CNN model achieved 0.985.
Keywords:
Long Short-Term Memory Network (LSTM), Convolutional Neural Network (CNN), Musical instruments classification, Mel-Spectrogram, Spectral features, Spatiotemporal features.
_____________________________________________________________________