Multiple Instance Learning for Automatic Content-Based Classification of Speech Audio

Authors

  • LAXMAN BAVANDLAPALLY Author
  • THOTA SRAVANTI Author
  • JOGU PRAVEEN Author

Keywords:

Audio classification, Multiple Instance Learning (MIL), Feature extraction, mi-Graph, mi-SVM

Abstract

Speech analytics researchers are working to improve their ability to decipher audio material. This research presents a new method for classifying news audio clips based on their content, called the Multiple Instance Learning (MIL) approach. Audio classification and segmentation benefit from content-based analysis. As a starting point, a classifier that can predict the category of an audio sample has been proposed. Perceptual Linear Prediction (PLP) coefficients and Mel-Frequency Cepstral Coefficients (MFCC) are two kinds of features used for audio content identification (MFCC). For classification, two MIL approaches, mi-Graph and mi-SVM, are used. Different performance matrices are used to assess the outcomes gained via the use of various approaches. The results of the experiments clearly show that the MIL has great audio categorization capacity.

Downloads

Download data is not yet available.

Downloads

Published

21-04-2022

How to Cite

Multiple Instance Learning for Automatic Content-Based Classification of Speech Audio. (2022). International Journal of Information Technology and Computer Engineering, 10(2), 61-67. https://ijitce.org/index.php/ijitce/article/view/294