
INTERNATIONAL JOURNAL OF LATEST TECHNOLOGY IN ENGINEERING,
MANAGEMENT & APPLIED SCIENCE (IJLTEMAS)
ISSN 2278-2540 | DOI: 10.51583/IJLTEMAS | Volume XV, Issue II, February 2026
www.rsisinternational.org
models for embedded deployment on edge devices, and leveraging self-supervised pretraining on large unlabeled
wildlife audio corpora to further improve generalization.
REFERENCES
1. P. Marler, "Bird calls: Their potential for behavioral neurobiology," Ann. N.Y. Acad. Sci., vol. 1016, pp.
31–44, 2004.
2. K. Riede, "Acoustic monitoring of Orthoptera and its potential for conservation," J. Insect Conserv., vol.
2, pp. 217–223, 1998.
3. D. Stowell, "Computational bioacoustics with deep learning: A review and roadmap," PeerJ, vol. 10,
e13152, 2022.
4. J. Sueur et al., "Acoustic indices for biodiversity assessment and landscape investigation," Acta Acust.
united Ac., vol. 100, pp. 772–781, 2014.
5. S. Fagerlund, "Bird species recognition using support vector machines," EURASIP J. Adv. Signal Process.,
2007.
6. C. Kwan et al., "An automated acoustic system for monitoring wildlife," J. Acoust. Soc. Am., vol. 119,
pp. 2665–2672, 2006.
7. A. Härmä, "Automatic identification of bird species based on sinusoidal modeling," in Proc. IEEE
ICASSP, 2003, pp. 545–548.
8. P. Somervuo et al., "Parametric representations of bird sounds for automatic species recognition," IEEE
Trans. Audio Speech Lang. Process., vol. 14, pp. 2252–2263, 2006.
9. V. Morfi and D. Stowell, "Deep learning for audio event detection on low-resource datasets," J. Acoust.
Soc. Am., vol. 147, pp. 1354–1364, 2020.
10. M. Zhong et al., "Robust animal sound classification using spectro-temporal attention," Ecol. Inform., vol.
61, 2021.
11. J. Salamon and J. P. Bello, "Deep convolutional neural networks and data augmentation for environmental
sound classification," IEEE Signal Process. Lett., vol. 24, pp. 279–283, 2017.
12. S. Kahl et al., "BirdNET: A deep learning solution for avian diversity monitoring," Ecol. Inform., vol. 61,
101236, 2021.
1. 13 S. Shon et al., "Bioacoustic classification using contrastive self-supervised learning," in Proc. IEEE
ICASSP, 2022.
13. X. Wei et al., "Self-supervised audio model for rare species detection," arXiv:2401.00000, 2024.
14. A. Baevski et al., "Wav2Vec 2.0: A framework for self-supervised learning of speech representations," in
Proc. NeurIPS, 2020, pp. 12449–12460.
15. W. Hsu et al., "HuBERT: Self-supervised speech representation learning by masked prediction,"
IEEE/ACM Trans. Audio Speech Lang. Process., vol. 29, pp. 3451–3460, 2021.
16. A. Nguyen and A. Kumar, "Cross-species audio classification using transfer learning with Wav2Vec2,"
IEEE/ACM Trans. Audio Speech Lang. Process., vol. 32, pp. 50–65, 2024.
17. D. Stowell et al., "Few-shot learning for bioacoustic sound event detection," in Proc. NeurIPS, 2023.
18. T. Ganchev et al., "Automated acoustic identification of singing insects," Bioacoustics, vol. 26, pp. 141–
158, 2017.
19. S. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word
recognition," IEEE Trans. Acoust. Speech Signal Process., vol. 28, pp. 357–366, 1980.
20. D. Mitrovic et al., "Features for content-based audio retrieval," Adv. Comput., vol. 78, pp. 71–150, 2010.
21. L. Breiman, "Random forests," Mach. Learn., vol. 45, pp. 5–32, 2001.
22. M. Towsey et al., "A toolbox for animal call recognition," Bioacoustics, vol. 21, pp. 107–125, 2012.
23. A. Priyadarshani et al., "Automated birdsong recognition in complex acoustic environments," Methods
Ecol. Evol., vol. 9, pp. 1580–1594, 2018.
24. I. Potamitis et al., "Automatic bird sound detection in long real-field recordings," Appl. Acoust., vol. 80,
pp. 1–9, 2014.
25. Y. LeCun et al., "Deep learning," Nature, vol. 521, pp. 436–444, 2015.
26. K. J. Piczak, "Environmental sound classification with convolutional neural networks," in Proc. IEEE
MLSP, 2015, pp. 1–6.