machine learning for music

Machine Learning for Music

Faculty of Mathematics and Informatics, SUPetko Nikolov April 8, 2015

About Me

Machine Learning

Music Information Retrieval

Machine Learning / Automated Data Science

What’s Music Information Retrieval?

Musicology

Computer Science

Signal Processing

Machine Learning

Music Recommendations

Recommending tags

Spotify’s Shuffle Mode

● Not really random

● Certainly some processing

● Probably some MIR behind

Pandora’s Music Genome Project

● started in 2000

● 800 000 manually annotated tracks by music experts

● 450 attributes to describe music

● 25 minutes per track to label

Music Information Retrieval Evaluation eXchange annual competition featuring more than 20 tasks

state-of-the-art algorithms compete against each other

Structured Information

Retrieval

Synthesis

fingerprintingcover song detectiongenre recognitioninstrument recognitionmood detectiontranscriptionplaylist generation

beat trackingkey detectionpitch trackingvocal detectionrecommendationaudio similaritysource separation

genre recognitioninstrument recognitionmood detection

vocal detection

audio similarity

MIR Architecture

Segmentation and

Preprocessing

MIR Architecture

Segmentation and

Preprocessing

Feature Extraction

MIR Architecture

Segmentation and

Preprocessing

Feature Extraction

Machine Learning

MIR Architecture

Segmentation and

Preprocessing

Feature Extraction

Machine Learning

classical

romanticBethoven

by Daniel Barenboim

MIR Architecture

Segmentation and

Preprocessing

classical

romanticBethoven

Deep Learning

by Daniel Barenboim

MIR Architecture

Audio signal

human hearing: 20 Hz to 20 KHz

Segmentation

SegmentationFrame

52 msf1

SegmentationFrame

52 msf1 f2

SegmentationFrame

52 msf1 f2 f3

SegmentationFrame

52 msf1 f2 f3 f4

SegmentationFrame

52 msf1 f2 f3 f4 fn

Spectrum - on frame level

Discrete Fourier Transform (DFT)

time frequency

Feature extraction

Spectral Centroid

where is the ‘center of mass’ of the spectrum

Spectral Slope

fit linear regression and get the slope coef.

Spectral Slope

Spectral Correlation is the cosine distance between the frequency vectors of two consecutive framesVariation is (1.0 - correlation) respectively.

Spectral Correlation / Variation

Feature extraction - Result

f11 f12 f13 f14 f15 ……… f1m

f21 f22 f23 f24 f25 ……… f2m

centroid

correlation

Frames

Feature extraction - Result

f11 f12 f13 f14 f15 ……… f1m

f21 f22 f23 f24 f25 ……… f2m

centroid

correlation

Framesframes number vary across audio recordings

Universal Background Model

Gaussian Mixture Model

frame feature vector

Multivariate Gaussian Distribution

Gaussian Mixture Model - per track

[𝛍1,𝛍2,𝛍3,𝛍4]

Classification - Example Neural Netaik

Feature vector

Input Hidden Output

Likelihood of Rock?

Layers:

Feature vector

Input Hidden Output

Likelihood of Rock?

Layers:

Feature vector

Input Hidden Output

Likelihood of Rock?

Layers:

What’s Deep Learning?

(defn deep-learning? [neural-net] (hidden-layer? neural-net))

we are trying to learn new high-level representation having many more hidden layers

input is as raw as possible

Mel-spectrum

Deep Neural Network

Backpropagation

Deep Neural Network

Backpropagation

Deep Neural Network

Backpropagation gradient fades quickly

Deep Belief Network

Input (Mel spectrum)

Output

Hidden Layer 3

Hidden Layer 2

Hidden Layer 1Restricted Boltzmann Machine

Rock Jazz Punk Electronic

Deep Belief Network

Output

Hidden Layer 3

Hidden Layer 2

Rock Jazz Punk Electronic

Deep Auto Encoders

Mel spectrum

Mel spectrumOutput

Deep Auto Encoders

Mel spectrum

Mel spectrumOutput

Used for denoising

essentia - audio retrieval algorithms

theano - CPU/GPU symbolic optimization

scikit-learn - machine learning in Python

machine learning for music

trackgaussian mixture

slope coef

msf1 f2 f3segmentationframe

msf1 f2 f3 f4segmentationframe

msf1 f2segmentationframe

music experts

msf1 f2 f3 f4 fnspectrum

frame leveldiscrete

Data & Analytics

machine learning and azure machine learning

machine learning -...

cs7267 machine learning introduction to machine learning

machine learning introduction machine...

machine learning chapter 11. 2 machine learning what is...

a machine learning approach to discover rules for ... ·...

more like this: machine learning approaches to music...

nyai - understanding music through machine learning by brian...

fast track machine learning part 1 (machine learning...

the marriage between music and machine learning in kkbox

support v ector machine active learning for music mood...

¿qué es machine learning? usos de machine learning

machine learning with matlab · 2 agenda machine learning...

machine learning: machine learning: introduction...

on the potential of ai and machine learning for music...

machine learning for nlp - ethics and machine learning

machine learning - astro.sunysb.edu · machine learning...

incorporating machine-learning into music similarity...

machine learning on spark - uc berkeley amp...

machine learning and big data for music discovery at spotify