Transcription of Musical Audio Using Poisson Point Processes and Sequential MCMC

Bunch, Pete; Godsill, Simon

doi:10.1007/978-3-642-23126-1_6

Pete Bunch²⁰ &
Simon Godsill²⁰

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 6684))

Included in the following conference series:

International Symposium on Computer Music Modeling and Retrieval

1206 Accesses
1 Citations

Abstract

In this paper models and algorithms are presented for transcription of pitch and timings in polyphonic music extracts. The data are decomposed framewise into the frequency domain, where a Poisson point process model is used to write a polyphonic pitch likelihood function. From here Bayesian priors are incorporated both over time (to link successive frames) and also within frames (to model the number of notes present, their pitches, the number of harmonics for each note, and inharmonicity parameters for each note). Inference in the model is carried out via Bayesian filtering using a powerful Sequential Markov chain Monte Carlo (MCMC) algorithm that is an MCMC extension of particle filtering. Initial results with guitar music, both laboratory test data and commercial extracts, show promising levels of performance.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Transcribing Bach Chorales Using Particle Swarm Optimisations

Melody extraction from music using modified group delay functions

Article 03 February 2017

Application of Multiple Sound Representations in Multipitch Estimation Using Shift-Invariant Probabilistic Latent Component Analysis

References

Cemgil, A., Godsill, S.J., Peeling, P., Whiteley, N.: Bayesian statistical methods for audio and music processing. In: O’Hagan, A., West, M. (eds.) Handbook of Applied Bayesian Analysis, OUP (2010)
Google Scholar
Davy, M., Godsill, S., Idier, J.: Bayesian analysis of polyphonic western tonal music. Journal of the Acoustical Society of America 119(4) (April 2006)
Google Scholar
Gilks, W.R., Richardson, S., Spiegelhalter, D.J. (eds.): Markov Chain Monte Carlo in Practice. Chapman and Hall, Boca Raton (1996)
MATH Google Scholar
Godsill, S.J., Davy, M.: Bayesian computational models for inharmonicity in musical instruments. In: Proc. of IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, NY (October 2005)
Google Scholar
Kashino, K., Nakadai, K., Kinoshita, T., Tanaka, H.: Application of the Bayesian probability network to music scene analysis. In: Rosenthal, D.F., Okuno, H. (eds.) Computational Audio Scene Analysis, pp. 115–137. Lawrence Erlbaum Associates, Mahwah (1998)
Google Scholar
Klapuri, A., Davy, M.: Signal processing methods for music transcription. Springer, Heidelberg (2006)
Book Google Scholar
Pang, S.K., Godsill1, S.J., Li, J., Septier, F.: Sequential inference for dynamically evolving groups of objects. To appear: Barber, Cemgil, Chiappa (eds.) Inference and Learning in Dynamic Models, CUP (2009)
Google Scholar
Peeling, P.H., Li, C., Godsill, S.J.: Poisson point process modeling for polyphonic music transcription. Journal of the Acoustical Society of America Express Letters 121(4), EL168–EL175 (2007)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Signal Processing and Communications Laboratory, Department of Engineering, University of Cambridge, UK
Pete Bunch & Simon Godsill

Authors

Pete Bunch
View author publications
You can also search for this author in PubMed Google Scholar
Simon Godsill
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

CNRS - LMA, 31 Chemin Joseph Aiguier, 13402, Marseille Cedex 20, France
Sølvi Ystad
CNRS-INCM, 31 Chemin Joseph Aiguier, 13402, Marseille Cedex 20, France
Mitsuko Aramaki
CNRS-LMA, 31 Chemin Joseph Aiguier, 13402, Marseille Cedex 20, France
Richard Kronland-Martinet
Aalborg University Esbjerg, Niels Bohr Vej 8, 6700, Esbjerg, Denmark
Kristoffer Jensen

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Bunch, P., Godsill, S. (2011). Transcription of Musical Audio Using Poisson Point Processes and Sequential MCMC. In: Ystad, S., Aramaki, M., Kronland-Martinet, R., Jensen, K. (eds) Exploring Music Contents. CMMR 2010. Lecture Notes in Computer Science, vol 6684. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-23126-1_6

Download citation

DOI: https://doi.org/10.1007/978-3-642-23126-1_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-23125-4
Online ISBN: 978-3-642-23126-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Transcription of Musical Audio Using Poisson Point Processes and Sequential MCMC

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Transcribing Bach Chorales Using Particle Swarm Optimisations

Melody extraction from music using modified group delay functions

Application of Multiple Sound Representations in Multipitch Estimation Using Shift-Invariant Probabilistic Latent Component Analysis

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Transcription of Musical Audio Using Poisson Point Processes and Sequential MCMC

Abstract

Access this chapter

Subscribe and save

Buy Now

Preview

Similar content being viewed by others

Transcribing Bach Chorales Using Particle Swarm Optimisations

Melody extraction from music using modified group delay functions

Application of Multiple Sound Representations in Multipitch Estimation Using Shift-Invariant Probabilistic Latent Component Analysis

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation