Notice of Retraction Comparison of Parametric representations of Birdcall in Gaussian Mixture model

Ricky Mohanty, Sandeep Singh Solanki

Abstract


Notice of Retraction

-----------------------------------------------------------------------
After careful and considered review of the content of this paper by a duly constituted expert committee, this paper has been found to be in violation of APTIKOM's Publication Principles.

We hereby retract the content of this paper. Reasonable effort should be made to remove all past references to this paper.

The presenting author of this paper has the option to appeal this decision by contacting ij.aptikom@gmail.com.

-----------------------------------------------------------------------


This paper focuses on the methods of automatic classifications of birds into different species based on feature extraction methods & audio recordings of their sounds. The recognition system uses Gaussian mixture model (GMM) to model 14 poultry bird species calls. Mel frequency cepstral coefficients (MFCC) parameters & wavelet parameters are used for feature vector extraction. The paper briefly explains the methods &  also evaluates the performance of these methods in Gaussian Mixture Model classification .The results depicts the performance of  Gaussian Mixture Model classification using wavelet was more efficient in terms of percentage of accuracy  at around 80% and computation was also faster.


References


Catchpole CK, Slater PJB. Bird Song: Biological Themes and Variations. Cambridge, U.K.: Cambridge Univ. Press, 1995.

Thorpe WH. Bird Song. Cambridge, U.K.: Cambridge Univ. Press, 1961.

Potter RK, Kopp GA, Green HC. Visible Speech. New York: Van Nostrand, 1947.

Kogan JA, Margoliash D. Automated recognition of bird song elements from continuous recordings using dynamic time warping and hidden Markov models: A comparative study, JournalAcoust. Soc. Am., 1998; 103(4): 2185–2196.

Fagerlund S, Harma A. Parametrization of inharmonic bird sounds for automatic recognition. 13th European Signal Processing Conference (EUSIPCO 2005). Antalya, Turkey, 2005.

Kwan C, Ho KC, Mei G. An automated acoustic system to monitor and classify birds, EURASIP Journal on Applied Signal Processing, 2006; 2006(Article ID 96706):19-38.

Lee C, Han C, Chuang C. Automatic classification of bird species from their sounds using two-dimensional cepstral coefficients, IEEE Transactions on Audio, Speech and Language Processing, 2008;16(8):1541-1550.

Juang C-F, Chen T-M. Birdsong recognition using prediction-based recurrent neural fuzzy networks, Neurocomputing, 2007; 71(1-3):121-130.

Selin A, Turunen J, Tanttu JT. Wavelets in automatic recognition of bird sounds. EURASIP Journal on Signal Processing Special Issue on Multirate Systems and Applications, 2007: 2007(1).

H¨arm¨a A, Somervuo P. Classification of the harmonic structure in bird vocalization. IEEE Int. Conf. Acoustics, Speech, Signal Processing (ICASSP 2004), Montreal, Canada, 2004.

Chen Z, Maher RC. Semi-automatic classification of bird vocalizations using spectral peak tracks. The Journal of the Acoustical Society of America, 2006; 120(5): 2974-2984.

Heller JR, Pinezich J D. Automatic recognition of harmonic bird sounds using a frequency tract extraction algorithm. The Journal of the Acoustical Society of America, 2008; 124(3): 1830-1837.

Jancovic P, Kokuer M. Automatic detection and recognition of tonal bird sounds in noisy environments. EURASIP Journal on Advances in Signal Processing, 2011(2011).

Briggs F, Lakshminarayanan B, Neal L, Fern XZ, Raich R, Hadley SJK, Hadley AS, Betts MG. Acoustic classification of multiple simultaneous bird species: A multi-instance multi-label approach. The Journal of the Acoustical Society of America, 2012; 131(6): 4640-4650.

Kumari TRJ, Jayanna HS. Limited Data Speaker Verification: Fusion of Features, International Journal of Electrical & Computer Engineering (IJECE). 2017; 7(6): 3344-3357.

Nguyen LN, et al. A New Recognition Method for Visualizing Music Emotion. International Journal of Electrical & Computer Engineering (IJECE), 2017; 7(3); 1246-1254.

Sabir B, et al. Improved algorithm for pathological and Normal Voices Identification. International Journal of Electrical & Computer Engineering (IJECE), 2017; 7(1); 238-243.

Lee C, Hsu S, Shih J, Chou C. Continuous birdsong recognition using Gaussian mixture modelling of image shape features. IEEE Transactions on Multimedia, 2013; 15(2): 454-464.

Fagerlund S. Automatic Recognition of Bird Species by Their Sounds, M.S. thesis, Helsinki Univ. Technol., Espoo, Finland, 2004.

BogertBP, Healy MJR, Tukey JW. The frequency analysis of time series for echoes: cepstrum, pseudoautocovariance, cross-cepstrum and saphe cracking. Proc. Symp. Time Series Analysis, Istanbul, Turkey, 1963; 9: 209-243.

Oppenheim A, Schafer R. Digital Signal Processing. New York: Springer, Prentice-Hall, 1975.

Davis S, Mermelstein P. Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences. IEEE Trans. Acoust., Speech, Signal Process. 1980; 28(4): 357-366.

Jankowski CR, VO JR, Lippman RP. A comparison of signal processing front ends for automatic word recognition, IEEE Trans. Speech Audio Process., 1995; 3(4): 286-292.

Dempster AP, Laird NM, Rubin DB. Maximum likelihood from incomplete data via the EM algorithm. J. Roy. Statist. Soc., Series B, 1977; 39(1): 1-38.

C M Bishop. Neural Networks for Pattern Recognition. Oxford, U.K.: Oxford Univ. Press, 1995.

Linde Y, Buzo A, Gray R. An algorithm for vector quantizer design. IEEE Trans. Commun., 1980; 28(1): 84-95.

Gray R. Vector quantization. IEEE Acoust., Speech, Signal Process. Mag., 1984; 1(2): 4-29.

Kohonen T. Self-Organizing Maps. Berlin, Germany: Springer, 1995.

SomervuoP, Harma A, Fagerlund S. Parametric representations of bird sounds for automatic species recognition. IEEE Transactions on Audio, Speech and Language Processing, 2006; 14(6): 2252-2263.




DOI: https://doi.org/10.11591/APTIKOM.J.CSIT.71

Refbacks

  • There are currently no refbacks.


Copyright (c) 2019 APTIKOM Journal on Computer Science and Information Technologies



ISSN: 2722-323X, e-ISSN: 2722-3221

CSIT Stats

 

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.