Music Genre Detection Using Machine Learning Algorithms
Music Genre Detection Using Machine Learning Algorithms
ISSN No:-2456-2165
Abstract:- Music genre classification is one example of labels, which allows them to classify new, unlabeled pieces
content-based analysis of music signals. Historically, of music into the appropriate genre category. The genres of
human-engineered features were employed to automate music that different communities write or even just listen to
this process, and in the 10-genre classification, 61% can be used to identify them. Different groups and
accuracy was attained. Even yet, it falls short of the 70% communities listen to various types of music. The music's
accuracy that humans are capable of in the identical genre is a key characteristic that distinguishes it from other
activity. Here, we suggest a novel approach that types of music. A regular person cannot recognize the genre
combines understanding of the neurophysiology of the of the music right away after listening to it. But because the
auditory system with research on human perception in distinctions between many genres of music can be hazy,
the classification of musical genres. The technique classifying them is a particularly challenging job. For
involves training a straightforward convolutional neural instance, in a test using a 10-way forced choice problem,
network (CNN) to categorise a brief portion of the music college students were able to classify the music 70%
input. The genre of the song is then identified by accurately after hearing it for just 3 seconds, and the
breaking it up into manageable chunks and combining accuracy remained constant with longer music [1].
CNN's predictions from each individual chunk. The Additionally, the amount of tagged data is sometimes
filters learned in the CNN match the Spectro temporal significantly lower than the data's dimension. For instance,
receptual field (STRF) in humans, and after training, even though the GTZAN dataset used in this work only has
this approach reaches human-level (70%) accuracy. 1000 audio tracks, each audio track is 30 seconds long and
has a sample rate of 22,050 Hz.
I. INTRODUCTION
II. LITRATURE REVIEW
Music plays a very important role in people’s lives.
Music brings like-minded people together and is the glue Numerous research papers on the classification of
that holds communities together. A music genre is a musical genres have extensively employed this kind of
category or classification of music that shares common methodology. Multiple spectrograms obtained from audio
characteristics such as musical style, instrumentation, recordings are used as inputs for CNN, and their patterns are
rhythm, melody, and cultural and historical context. extracted into a 2D convolutional layer with the appropriate
Examples of music genres include rock, pop, hip-hop, filter and kernel sizes [9]. The spectrogram is mentioned in
classical, jazz, blues, country, electronic, folk, and many CNN because the model is good at identifying picture
others. Each genre is defined by a set of conventions that details [8]. Lau proposed applying the Convolutional Neural
distinguish it from other genres and often has a dedicated Network (CNN) model using a preprocessed GTZAN
fan base and industry infrastructure. The boundaries dataset. Each song's extracted Mel-Frequency Cepstrum
between genres can sometimes be blurred, and new genres Coefficient (MFCC) spectrogram was included in the
can emerge through a fusion of existing ones or by dataset. Additionally, the feature descriptions for the audio
incorporating elements of different styles. Music genre excerpts in 3 seconds and 30 seconds were included in a
detection is the process of automatically identifying the separate.csv file [8]. Then, using Keras, he created a CNN
genre of a piece of music using algorithms and machine architecture with 5 convolutional blocks. Each block
learning techniques. The goal of music genre detection is to contained a convolutional layer with a 3x3 filter and a 1x1
classify a piece of music into one or more predefined stride, a max pooling with a 2x2 windows size and a 2x2
categories based on its acoustic features, such as timbre, stride, and a Rectifying Linear Unit (ReLU) function to
rhythm, harmony, and melody. Music genre detection is display the probabilities for 10 music genres; the genre with
used in various applications such as music recommendation the highest probability was picked as the input's
systems, music streaming platforms, and content-based classification label [8]. Twenty MFCCs were trained on 30-
music retrieval systems. The process typically involves second and 3-second pieces of music, three CNN models
analyzing the audio signal using signal processing were built on spectrograms, and a classification test was run
techniques to extract relevant features, which are then fed on the test sets following training [8]. As Lau noted, there
into machine learning models trained on labeled datasets. was a problem with the training datasets because the 3-
The models learn to recognize patterns and associations second dataset did not match the number of genres in the
between the extracted features and the corresponding genre sample Nevertheless, some genres featured fewer or more
Table 2 SVM with Oversampling Techniques Table 5 KNN with Undersampling Techniques
TREE:
Table 6 TREE
KNN:
Table 3 KNN
Random Forest:
VI. CONCLUSION
REFERENCES