Paper and Data


Introduction

Welcome to the companion site for the paper “Reconstructing the Charlie Parker Omnibook using an audio-to-score automatic transcription pipeline”. This was work carried out by Xavier Riley, a PhD candidate on the AIM programme at QMUL.

This site contains links to Soundslice pages where you can preview all the data.

Here’s a video of it in action - remember, no human intervention was required for this transcription. As we discuss in the paper, there are issues with notes in the upper register and the rhythms are also over simplified here but we hope to address these problems in future work.

Live demo

You can try the pipeline out for yourself at Replicate. You will need to add payment details but you only pay for the time that the GPU is running. This currently works out to around $0.03 for a 2 minute transcription.

Model weights

Under preparation

Automatically Transcribed Scores

As described in the paper, it hasn’t been possible yet to transcribe the complete Omnibook using the pipeline yet. We provide a link to the 32 complete transcribed scores as MusicXML below:

Download Estimated MusicXML here

Synchronised Score Previews

Feedback/Questions

We welcome feedback on the dataset - please direct this to Xavier Riley whose email address can be found in the footer of this page.

License

The dataset contains copyright restricted material and is shared with researchers under the following conditions:

  • it may only be used by the individual signing below and by members of the research group or organisation of this individual. This permission is not transferable.
  • it may be used only for non-commercial research purposes.
  • it (or data enabling the its reproduction) may not be sold, leased, published or distributed to any third party without written permission from the it administrator.

  • When research results obtained using it are publicly released (in the form of reports, publications, or derivative software), clear indication of the use of it shall be given, usually in the form of a citation of the following paper:

  • X. Riley and S. Dixon (TBD), TBD.
  • Queen Mary University of London shall not be held liable for any errors in the content of it nor damage arising from the use of it.
  • The it administrator may update these conditions of use at any time.

Acknowledgements

The author is a research student at the UKRI Centre for Doctoral Training in Artificial Intelligence and Music, supported by UK Research and Innovation [grant number EP/S022694/1].