0% found this document useful (0 votes)

73 views8 pages

Bird Song Recognition Through Spectrogram Processing and Labeling

This document discusses a project to develop an automated bird song recognition system using spectrogram analysis and machine learning. It addresses two main challenges: (1) obtaining manually labeled training data and (2) reducing noise in audio recordings from outdoor environments. The document describes creating a program that allows users to efficiently segment, label and review spectrograms from audio recordings to generate labeled training data for the machine learning models. It also discusses algorithms used to reduce noise in the spectrogram images in order to more clearly identify bird songs.

Uploaded by

dmzy2

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

73 views8 pages

Bird Song Recognition Through Spectrogram Processing and Labeling

Uploaded by

dmzy2

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 8

Bird Song Recognition through Spectrogram Processing and Labeling

Katie Wolf University of Minnesota - Twin Cities [email protected]

Abstract
In this work, we present solutions to two problems on a project developing an end-to-end system for collecting and analyzing bioacoustic recordings, in particular bird songs. This system will conduct automatic bird species survey, which involves gathering the presence and absence of information as well as an abundance of information of dierent bird species. The two problems discussed in this work are that (1) there needs to be an eective way to obtain manually collected data for the system to initially learn and train from and (2) a noise reduction algorithm is needed to allow for eective gathering and labeling of information because the recordings from the environment tend to be noisy.

concentrates on the problem of cleaning the data and facilitating human labeling for use with machine learning algorithms. In this paper, Section 2.1 will provide an overview of bioacoustics and past research involving automated recognition and bioacoustics. In Section 2.2, an explanation of spectrograms will be discussed and how they were used in this project. Section 3 will give an overview of the project. The program to assist in labeling data will be discussed in Section 4, and Section 5 will explain the algorithms used for cleaning the data. A nal summary will be discussed in Section 6.

2
2.1

Background
Bioacoustics

Introduction

Collecting data from wild animals in outdoor environments and analyzing them has been a tedious task that some eld workers have little enthusiasm for, considering it dicult, dull and old-fashioned work [2]. However, modern technology has enabled scientists to accomplish this task in a way that was not possible years ago. The spectrogram was a turning point in bird song research. It made it possible to analyze, measure, classify and recognize the dierent sounds a bird makes. Today, with digital and computer-based technology, the power and speed of the latest generation machines allows for easier storage of data as well as being able to manipulate and synthesize for experimental purposes [2]. While data is more easily available, the vast amount of continuous sound recordings often take, at a minimum, equivalent time to analyze as to acquire. The use of automated recognition is a developing area of great opportunity for analyzing continuous data [3]. However, even to use a machine learning method, a training set of properly labeled data is necessary, but labeling data is a time consuming process. The project 1

Bioacoustics pertain to the sounds that animals make and can often provide insight to their behavior. Bioacoustics research and tools can aid in monitoring and managing species, which is vital to the conservation and preservation of diversity. Currently, capturing and monitoring of bird species through marking individuals with radio monitoring devices or visual tags is sometimes necessary for biosurveys. The survival and successful reproduction of these captured individuals may be aected. Using bioacoustics may be one way in which to expand conservation eorts and shun this type of handling [3]. Using sound to identify certain species is not a new idea. T. A. Parker recorded the dawn choruses of bird in the Bolivian Amazon, and within 7 days he found he had captured 85% of the regional species on tape. In that same region, seven experienced ornithologists took 54 days to inventory the birds using a capture and release technique [3]. The basis of the project is to study and use bioacoustical recordings from birds to create a machine learning algorithm that, taking in an audio stream, will be used as an automated species recognition tool. Automated recognition of bioacoustics signals has been re-

Song Meter Locations Roads Stream order

PC18 ! !PC17 PC16 ! PC15 ! PC11 ! PC13 ! PC10 ! PC8 !

PC9 ! PC6 ! PC7 !

PC5 ! PC4 PC3 ! !

PC2 !

PC1

0.5

Kilometers

Figure 1. Map of the H.J. Andrews Research Forest. Each dot represents a Song Meter recording data, while the lines running through the valleys represent the water sources.

ported with encouraging results in a variety of animals including bats, birds, frogs and Orthoptera (grasshoppers, crickets and locusts) [1]. Birds species recognition in particular has been studied using support vector machines [4], sinusoidal modeling [5], hidden Markov models [6], and dynamic time warping [6]. Using prerecorded and pre-labeled data from the Cornell Maculauy Sound Library was a way in which to create a database of sounds and species labels for the machine. However, there were many challenges to the project when faced with the raw data that was collected from the H.J. Andrews Research Forest including noisy audio les and unlabeled data.

2.2

Spectrograms

Spectrograms are graphical representations of audio les, with time on the horizontal axis and frequency (going from low frequency at the top of the image and high at the bottom) on the vertical axis. The images of the graphs are what we used on the project and are generated from the audio les. For each of the audio les, we divide the sound of the signal into frames and compute the spectrum for each of the frames. A spectrum represents the intensity of the signal as a function of frequency. The spectrogram is then created as a graph of the spectra of each frame in the sound. In our case to create the spectra we divided the signals into frames with a frame-size = 1024 samples. With each frame starting a frame-step = 256 samples after the previous frame, we have a 75% overlap between 2

frames. Then for each of the frames, we take the fastFourier transform (FFT) complex coecients for i = 1,...,m where m = frame-size /2 = 512. One way in which we remove the noise in the lower frequency range is by dropping the lowest 13 FFT coecients. This leaves us with n =499 elements in each spectrum which are used to create the spectrogram.

Figure 2. The song Meter in its location in the forest. Figure 3. A clairity comparison can be seen between the spectrogram created with an audio le from the H.J. Andrews Research Forest (top) versus a spectrogram generated from an audio le from the Cornell Maculauy Sound Library(bottom)

Project

Within the H.J. Andrews Research forest, Song Meters (Figure 2) used to record data were placed at 16 locations, and, as can be seen in Figure 1, most of the surrounding area is near moving water. Each recording has some noise or echo of a noise from the rivers and streams that run through the forest. Figure 3 shows the dierence in clarity between a raw image from the forest and an image from the sound library. On the top image there is a high concentration of noise near the low frequencies not found in the bottom image, which represents the noise that is coming from the water. These bright areas that are not a part of the data we are trying to collect, also called the noise, tend to dim the actual bird songs or signals which we are trying to preserve. One of the goals of our research is to create an algorithm, or a series of algorithms that will remove these areas or de-noise the image, so that the signal is clear to one reviewing the spectrograms. There are a few reasons why we need to visibly see the signals within the image. One reason is that for the machine learning algorithms to successfully train and categorize data they will need the cleanest images possible. Another is that for our initial labeling of the raw data we needed ecologists to go through each recording and manually label the signals. In order for them to correctly see 3

and label the data, it has to be clearly displayed. This brings us to the other part of our research: creating a user interface that will allow labeling spectrograms by species in order to create a training set of data for the machine learning algorithms.

Annotation

Our rst part of the research was to create a program that would take in the audio wave, compute the FTT coecients in order to create a spectrogram, and from there use a GUI program that would allow a user to label parts of the spectrogram. The initial requirements for the program were to have it: Break up a large audio le into smaller ones so that the spectrograms would t on the screen. This was done by prompting for the size (in seconds) of the smaller segments. Play the audio les for the small segments.

Figure 4. The annotator program with the left window representing the current labels and the right window represents the annotation window where the users label the spectrograms

Flip through the segmented spectrograms (moving both forward and backward while still preserving the labels that were assigned. Contain a labeling system to tell the dierence between types of species. Save a text le for each of the individual segments with information about the spectrogram (including frame-size and frame-step) and information on the labels (including the coordinates for opposite points of the annotated rectangle and the label that was assigned to that annotation). The initial program met these requirements and had all of the basic tools needed to label the data. After working with the program we noticed that while the labeling system worked well, another window was needed in order to view past labels while still labeling the current spectrograms. This would not only keep the labeling more consistent, but would allow the user to notice any incorrect labels immediately. A second window would also be able to keep track of which key represented which labels and give more options to the user as they did there annotations on the main window. Figure 4 depicts the basic display of the two windows. For more on the programs functionality, view the user manual [11]. In order to test the program we were able to go through and manually label a day of data which consisted of 20 minute segments of sound for each hour of the day. Figure 5 is a graph of the number of bird calls that were labeled per hour for each hour. As can be seen the results show that the bird calls are mostly concentrated during the early morning between 5:00 AM and 11:00 AM. This is consistent with the well known fact that birds have a peak of singing activity in the 4

early morning, called the dawn chorus [2], and also consistent with the information given from the ecologists who work within the research forest. Between 5:00 AM and 9:00 AM they conduct point counts which consist of going from one site to another every 10 minutes and counting the number of each bird species. We have yet to compare their counted data with data from the program that was taken at the exact site and time.

Noise Reduction

In this section we specify algorithms used to clean the images so that the signals are easier to identify.

5.1

Wiener Filter

Our attenuated Wiener Filter algorithm used to estimate the noise from the signal and remove it is composed of the following steps: 1. First, boost the image contrast of by setting each pixel in the image to the square root of its value (the pixel values range from 0 (black) to 1 (white)). 2. Run a low band pass lter that removes 8% of the pixels with the lowest frequency (those that are closest to the top of the image) by setting their value to 0. 3. For each frame (columns of pixels) compute the sum of all of the bins (rows of pixels) in that frame. 4. For each of the bins, compute the bin sum for the lowest 20% of the frame sums (found in 2) and then for each pixel set the value to the value divided by the bin sum (of the lowest 20% found previously) of the bin that the pixel is in.

450

Total Bird Syllable Count 5/15/09 PC5

400

350

Number of Bird Syllables

300

250

200

150

100

0 0:15 1:15 2:15 3:15 4:15 5:15 6:15 7:15 8:15 9:15 10:15 11:15 12:15 13:15 14:15 15:15 16:15 17:15 18:15 19:15 20:15 21:15 22:15 23:15

Time of Day

Figure 5. Graph of the number of bird songs annotated for a day of data (20 minutes of each hour starting at the time listed).

5. Apply 3 and 4 again on the modied values of the pixels. To minimize the least mean squared error between the lter output and the desired signal, Wiener lter coecients are calculated. In a basic Wiener lter it is assumed that the signals are stationary processes. Because in our process we are dealing with a variable signal but a stationary noise, we use an application of the Wiener lter for additive noise reduction [10]. In our signals, we know the noise (from the moving water in the forest) is located predominantly at the low frequencies of the spectrograms and that they also stay consistent throughout the frames. In our algorithm above, by rst computing the sum of the bins for each frame, we are using the lowest sums as estimates of where the noise is alone without a signal. Then by taking a sum of these estimates for each bin and dividing each pixel in the bin by that sum we are removing our estimation of the noise from the image. Once we have completed this we apply steps 3 and 4 again to smooth out the bins. The top three images in Figure 6 illustrates this algorithm. 5

5.2

Smoothing Filters

Once we have removed most of the constant low frequency noise from the spectrogram, we want to clarify the image further by applying smoothing lters to remove the speckling in the images. This section will discuss our 3 algorithms that we implemented with promising results: Gaussian Blur, Median Filter and Hybrid Median Filter. 5.2.1 Gaussian Blur

The Gaussian lter is applied so that for each pixel in the image, a weighted average is calculated such that the central pixels contribute more considerably to the result than the pixels on the edge [8]. The Gaussian lter (2D smoothing operator) (, ) is given below: (, ) = ( + /2 ) where (x, y) are the image co-ordinates and sigma (standard deviation) is the only parameter of the lter [9].
2 2 2

Figure 6. The top image represent a raw image taken from the forest. The next two images are applications of the attenuated Weiner Filter. From6 there we applied 3 different smoothing techniques along with a brightness and contrast algorithm (listed from left to right) Gaussian Blur, Median Filter, and Hybrid Median Filter.

Figure 7. The left image represents the median of the horizontal/vertical pixels, the center image represents the median of the cross pixels, and the right pixel represents the center pixel. The Hybrid Median lter takes the median of these 3 and sets the value of the center pixel to it.

These coecients of the size k by k lter mask have only the one parameter, sigma, which should be chosen as (2w + 1) / 2 where w is the scale of the diameter of the blur. Then the size k should be chosen such that k > 2w + 1 [8]. In our algorithm we estimated w to be 2 so then we chose k = 7 (which is greater than 5) and sigma = 3 (rounded up from 5/2). 5.2.2 Median Filter

median of the neighboring cross pixels, and (3) the center pixel. As in the Median lter, the parameter k represents the lter length and was chosen as 5 in our algorithm for the same reason as above. The Figure 7 represents the 3 pixels of which the median will replace the center pixel. The rst image represents the median of the 9 pixels in the horizontal/vertical lines, the second represents the median of the 9 cross pixels, and the last represents the center pixel [7].

The algorithm to smooth an image using the Median lter is as follows: For each pixel, replace the pixel with the median of the k by k neighborhood of pixels. While conserving the sharp edges of the image, the median lter is used to reduce speckle noise. In the lter there is only one parameter k, which is the lter length. Ideally, k should be chosen with at least k = 2w + 1, where w is the noise feature representing the estimated width of the speckles (in pixels) [8]. In our algorithm we used k = 5, since the speckles averaged a width of 2 pixels. The main disadvantage of the median lter is that by taking the rectangular neighborhood of the pixels, this can damage thin lines and sharp corners of the image. One way to solve this is to use the Hybrid Median Filter described in the next section. 5.2.3 Hybrid Median Filter

5.3

Brightness and Contrast Filters

Lastly, using a grey-scale transformation, the brightness and contrast will be adjusted in order to clarify the image further. Transformations are used typically when humans are analyzing images, because it is easier to interpret if the contrast is enhanced [9]. Given the normalized value of a pixel which is between 0 and 1, we apply the algorithm below: For each pixel, take the value and multiply it by the brightness. Subtract a faction from that and multiply it by the contrast and add a fraction. Finally, apply a clamp that sets all values above one to one (white), and negative values to 0 (black). Using this after the previous smoothing lters reduces the noise between signals and also enhances the clarity of the signals themselves.

Like the Median lter, the hybrid takes the median of pixels and sets the center value to that median. However, the Hybrid Median lter takes the median of only 3 values as can be seen by the algorithm below: Replace the center pixel of a k by k neighborhood with the median of: (1) the median of the neighboring center vertical and center horizontal (2) the 7

5.4

Algorithm Results

Figure 6 displays the full noise reduction algorithm sequence applied to an image from the research forest. As can be seen each of the three smoothing algorithms,

in addition to adjusting the brightness and contrast, eectually eliminates the additional noise after the attenuated Weiner Filter is applied.

Summary

This paper discussed two important implementations that will contribute to the overall project of using machine learning techniques to automate bird species recognition. Having designed a program to assist in manually labeling data eciently, we will now be able to study and use that information as a training set for machine learning algorithms. The next step will be to test this program by comparing results of species found during the manual point counts done by ecologists in the eld to the count of species found with the labeling program collected from audio at the same times and locations as the point counts. Additionally, sets of labeled data found with this program will act as the expected results when testing the accuracy of machine learning algorithms. By reducing background noise for the raw collected data through applying various signal processing algorithms, we have images with more dened signals. This not only contributes to the accuracy of manually labeling data, but will also create a clearer distinction between the noise and the signal for more precise automation in future algorithms.

[4] S. Fagerlund. Bird species recognition using support vector machines. EURASIP J. Appl. Signal Process., 2007(1):6464, 2007. [5] A. Harma. Automatic identication of bird species based on sinusoidal modeling of syllables. International Conference on Acoustic Speech Signal Processing, 5:545548, 2003. [6] J. A. Kogan and D. Margoliash. Automated recognition of bird song elements from continuous recordings using dynamic time warping and Hidden Markov Models: A comparative study. The Journal of the Acoustical Society of America, 103(4):21852196, 1998. [7] C. Reiter. With j: image processing 1: smoothing lters. SIGAPL APL Quote Quad, 34(2):915, 2004. [8] M. Seul, L. OGorman, and M. J. Sammon. Practacal algorithms for image analysis description, examples, and code. Cambridge University Press, 2000. [9] M. Sonka, V. Hlavac, and R. Boyle. Image processing, analysis, and machine vision. Brooks/Cole Publishing Company, 1999. [10] S. V. Vaseghi. Advanced signal processing and digital noise reduction. John Wiley & Sons and B. G. Teubner Publishers, 1996. [11] K. Wolf. Sound Annotator User Guide, November 2009. Can be found at: http: //www.cra.org/Activities/craw_archive/ dmp/awards/2009/wolf/projects/user_manual/ annontator_user_manual.pdf.

Acknowledgements

This work was supported by DREU the Distributed Research Experience for Undergraduates and Oregon State University where this research was conducted. The author is grateful to Dr. Xiaoli Fern, Dr. Raviv Raich, and Ph.D. Student Forrest Briggs for their insights and contributions to this work.

References
[1] T. S. Brandes, P. Naskrecki, and H. K. Figueroa. Using image processing to detect and classify narrow-band cricket and frog calls. Journal of the Acoustical Society of America, 120(5):29502957, 2006. [2] C. Catchpole and P. Slater. Bird song : biological themes and variations. Cambridge University Press, 1995. [3] J. R. Clemmons, R. Buchholz, and A. B. Society. Behavioral approaches to conservation in the wild / edited by Janine R. Clemmons and Richard Buchholz. Cambridge University Press, Cambridge ; New York :, 1997.

Abazie I. Data Structures & Algorithms For All Programmers 2023
No ratings yet
Abazie I. Data Structures & Algorithms For All Programmers 2023
655 pages
Computer Project For ISC
61% (36)
Computer Project For ISC
52 pages
C Programming
100% (10)
C Programming
205 pages
Be First Year (First Sem Foc 2 Mark Notes)
No ratings yet
Be First Year (First Sem Foc 2 Mark Notes)
58 pages
Villanueva-Rivera Et Al 2011
No ratings yet
Villanueva-Rivera Et Al 2011
14 pages
General Principles, Connecting Computational Thinking and Program
No ratings yet
General Principles, Connecting Computational Thinking and Program
92 pages
Python Module 2 Ktu Complete Notes
100% (1)
Python Module 2 Ktu Complete Notes
22 pages
6.29 N M.Sc. IT 1 - Organized
No ratings yet
6.29 N M.Sc. IT 1 - Organized
59 pages
Exploring Animal Behavior Through Sound Volume 1
No ratings yet
Exploring Animal Behavior Through Sound Volume 1
524 pages
GE8151 Notes PSPP
No ratings yet
GE8151 Notes PSPP
100 pages
CPSC 413 Lecture Notes - Part I Department of Computer Science
No ratings yet
CPSC 413 Lecture Notes - Part I Department of Computer Science
237 pages
Monroe Township Schools
No ratings yet
Monroe Township Schools
44 pages
5320521929977595-Coding Fundamentals - Block 1 Lesson 1 - Basic Moves and The Agent
100% (1)
5320521929977595-Coding Fundamentals - Block 1 Lesson 1 - Basic Moves and The Agent
9 pages
Algorithm and Flowchart Unit - 1
No ratings yet
Algorithm and Flowchart Unit - 1
10 pages
Mellinger 2006
No ratings yet
Mellinger 2006
17 pages
Heba DSBook 2022
No ratings yet
Heba DSBook 2022
337 pages
Musical Instrument Timbres Classification With Spectum
100% (1)
Musical Instrument Timbres Classification With Spectum
10 pages
Operating Instructions PLB 7.8 Robot Guidance Systems en Im0113083
No ratings yet
Operating Instructions PLB 7.8 Robot Guidance Systems en Im0113083
188 pages
Rao, 2020
No ratings yet
Rao, 2020
25 pages
Cambridge International AS & A Level: Information Technology 9626/12
No ratings yet
Cambridge International AS & A Level: Information Technology 9626/12
12 pages
Klenova Volodin - PRACTICAL GUIDE FOR BIOACOUSTICS
No ratings yet
Klenova Volodin - PRACTICAL GUIDE FOR BIOACOUSTICS
97 pages
SRS Project
No ratings yet
SRS Project
9 pages
Algorithm, Notation, Performance Analysis
No ratings yet
Algorithm, Notation, Performance Analysis
84 pages
Interconnection Network Architectures
100% (1)
Interconnection Network Architectures
54 pages
Birdsong AI Project
No ratings yet
Birdsong AI Project
45 pages
Bachelor Thesis Informatik Thema
100% (2)
Bachelor Thesis Informatik Thema
8 pages
Automatic Large-Scale Classification of Bird Sounds Is Strongly Improved by Unsupervised Feature Learning
No ratings yet
Automatic Large-Scale Classification of Bird Sounds Is Strongly Improved by Unsupervised Feature Learning
31 pages
Exposición 2 DSP
No ratings yet
Exposición 2 DSP
28 pages
083 - XII - CS - Question Bank 01
No ratings yet
083 - XII - CS - Question Bank 01
39 pages
Mel Frequency Cepstral Coefficients Properties Optimization Due To Ultrasonic Bands and Data Structure: Application To Acoustic Signals Identification
No ratings yet
Mel Frequency Cepstral Coefficients Properties Optimization Due To Ultrasonic Bands and Data Structure: Application To Acoustic Signals Identification
9 pages
20IS42 Module-1
No ratings yet
20IS42 Module-1
79 pages
Honors Thesis Uvm
100% (3)
Honors Thesis Uvm
9 pages
Part 3
No ratings yet
Part 3
11 pages
Exposición 1 DSP
No ratings yet
Exposición 1 DSP
16 pages
Automatic Acoustic Identification of Individual Animals
No ratings yet
Automatic Acoustic Identification of Individual Animals
28 pages
Birds Soco Paper
No ratings yet
Birds Soco Paper
21 pages
Audio Classification of Bird Species Using Convolu
No ratings yet
Audio Classification of Bird Species Using Convolu
15 pages
Bird Clef CNN Architecture
No ratings yet
Bird Clef CNN Architecture
16 pages
Nietjet 0602S 2018 003
No ratings yet
Nietjet 0602S 2018 003
5 pages
Bioacoustic Detection With Wavelet-Conditioned Convolutional Neural Networks
No ratings yet
Bioacoustic Detection With Wavelet-Conditioned Convolutional Neural Networks
13 pages
CNN BatDetection Vogl Flossmann Postprint
No ratings yet
CNN BatDetection Vogl Flossmann Postprint
17 pages
Bird Species Classification MDPI
No ratings yet
Bird Species Classification MDPI
15 pages
Lostanlen2024taslp Compressed
No ratings yet
Lostanlen2024taslp Compressed
17 pages
"Visual and Acoustic Identification of Bird Species": A Seminar Report ON
No ratings yet
"Visual and Acoustic Identification of Bird Species": A Seminar Report ON
29 pages
Final Thesis
No ratings yet
Final Thesis
31 pages
Peerj 2108
No ratings yet
Peerj 2108
24 pages
1 s2.0 S0957417424019134 Main
No ratings yet
1 s2.0 S0957417424019134 Main
18 pages
Birds Ganchev
No ratings yet
Birds Ganchev
9 pages
Icdt 2024
No ratings yet
Icdt 2024
11 pages
Comparing Recurrent Convolutional Neural Networks For Large Scale Bird Species Classification
No ratings yet
Comparing Recurrent Convolutional Neural Networks For Large Scale Bird Species Classification
12 pages
A Machine Learning Framework To Classify Southeast Asia 2022 Ecological Indi
No ratings yet
A Machine Learning Framework To Classify Southeast Asia 2022 Ecological Indi
13 pages
Spectrogram Denoising and Automated Extraction of The Fundamental Frequency Variation of Dolphin Whistles
No ratings yet
Spectrogram Denoising and Automated Extraction of The Fundamental Frequency Variation of Dolphin Whistles
12 pages
Wild Bird Species Identification Based On A Lightw
No ratings yet
Wild Bird Species Identification Based On A Lightw
12 pages
Jurnal - 2
No ratings yet
Jurnal - 2
6 pages
Incze 2018
No ratings yet
Incze 2018
6 pages
J Ecolind 2021 107419
No ratings yet
J Ecolind 2021 107419
12 pages
Content Bird 1
No ratings yet
Content Bird 1
12 pages
The Caatinga Orchestra Acoustic Indices Track Temporal CH - 2021 - Ecological I
No ratings yet
The Caatinga Orchestra Acoustic Indices Track Temporal CH - 2021 - Ecological I
15 pages
Ibis - 2023 - Pérez Granados - BirdNET Applications Performance Pitfalls and Future Opportunities
No ratings yet
Ibis - 2023 - Pérez Granados - BirdNET Applications Performance Pitfalls and Future Opportunities
8 pages
Document
No ratings yet
Document
7 pages
Comp 2
No ratings yet
Comp 2
6 pages
DCASE2018Workshop Lasseck 134
No ratings yet
DCASE2018Workshop Lasseck 134
5 pages
IJRPR12605
No ratings yet
IJRPR12605
5 pages
Acoustic - Indices - For - Biodiversity - Assess20160224 16595 14xl80b With Cover Page v2
No ratings yet
Acoustic - Indices - For - Biodiversity - Assess20160224 16595 14xl80b With Cover Page v2
11 pages
A Research Survey Review of AI Solution Strategies of Job Shop Scheduling Problem
No ratings yet
A Research Survey Review of AI Solution Strategies of Job Shop Scheduling Problem
13 pages
Jurnal 5
No ratings yet
Jurnal 5
4 pages
MIT BS Computer Science - Course Descriptions
No ratings yet
MIT BS Computer Science - Course Descriptions
5 pages
Optimal Compartment Layout Design For A Naval Ship
No ratings yet
Optimal Compartment Layout Design For A Naval Ship
12 pages
Jurnal 4
No ratings yet
Jurnal 4
4 pages
Armitage & Ober. 2010
No ratings yet
Armitage & Ober. 2010
9 pages
4 Al Haddad UPM
No ratings yet
4 Al Haddad UPM
5 pages
Pad Assignment 2
No ratings yet
Pad Assignment 2
12 pages
Iaria Congress 2024 2 170 50084
No ratings yet
Iaria Congress 2024 2 170 50084
8 pages
Mixture of Mixups For Multi-Label Classification of Rare Anuran Sounds
No ratings yet
Mixture of Mixups For Multi-Label Classification of Rare Anuran Sounds
5 pages
Automated Bird Species Identification Using Machine Learning and Audio Analysis
No ratings yet
Automated Bird Species Identification Using Machine Learning and Audio Analysis
4 pages
Manual Calculation - Pencil Beam
No ratings yet
Manual Calculation - Pencil Beam
25 pages
Sukri 2020
No ratings yet
Sukri 2020
4 pages
FFT Research
No ratings yet
FFT Research
8 pages
Identification of Bird Species Using Aut
No ratings yet
Identification of Bird Species Using Aut
6 pages
Articulo 5
No ratings yet
Articulo 5
6 pages
Types of Algorithm Design Techniques
No ratings yet
Types of Algorithm Design Techniques
4 pages
G Freeman 2024 Shazam For Birds
No ratings yet
G Freeman 2024 Shazam For Birds
3 pages
Sound Classification
No ratings yet
Sound Classification
5 pages
A Comparative Study in Automatic Recognition of Broadcast Audio
No ratings yet
A Comparative Study in Automatic Recognition of Broadcast Audio
4 pages
Chapter 2
No ratings yet
Chapter 2
9 pages
Framework For Integration of Domain Knowlede Into Logistic Regression
No ratings yet
Framework For Integration of Domain Knowlede Into Logistic Regression
8 pages
Atlas of the Electrical Generators of Sleep
From Everand
Atlas of the Electrical Generators of Sleep
Dr. Mark Doidge
No ratings yet
A.I. Cancer Timebomb
From Everand
A.I. Cancer Timebomb
charles r giardina
No ratings yet
Supervised Machine Learning for Science: How to stop worrying and love your black box
From Everand
Supervised Machine Learning for Science: How to stop worrying and love your black box
Christoph Molnar
No ratings yet
Seismic Instrumentation Design: Selected Research Papers on Basic Concepts
From Everand
Seismic Instrumentation Design: Selected Research Papers on Basic Concepts
Raman K. Attri
No ratings yet

Bird Song Recognition Through Spectrogram Processing and Labeling

Uploaded by

Bird Song Recognition Through Spectrogram Processing and Labeling

Uploaded by

Bird Song Recognition through Spectrogram Processing and Labeling

Katie Wolf University of Minnesota - Twin Cities [email protected]

Song Meter Locations Roads Stream order

PC18 ! !PC17 PC16 ! PC15 ! PC11 ! PC13 ! PC10 ! PC8 !

PC9 ! PC6 ! PC7 !

PC5 ! PC4 PC3 ! !

Total Bird Syllable Count 5/15/09 PC5

Number of Bird Syllables

Brightness and Contrast Filters

You might also like