
MIRAGE - A Comprehensive AI-Based System for Advanced Music Analysis

One main goal is to improve computers' capability to listen to and understand music, and to conceive technologies to facilitate music understanding and appreciation. One main application is to make music more accessible and engaging.

A demo of some of our new transcription technologies:

About the MIRAGE project

Read the article: "Artificial intelligence can help you understand music better"

Short description of the MIRAGE project: "Advancing AI for music analysis and transcription"

The objective is to generate rich and detailed descriptions of music, encompassing a large range of dimensions, including low-level features, mid-level structures and high-level concepts. Significant effort will be dedicated to the design of applications of value for musicology, music cognition and the general public.

We further extend the design of our leading computational framework, which aims to extract a large set of information from music, such as timbre, notes, rhythm, tonality or structure. Yet music can easily become complex. To make sense of such a subtle language, refined musicological considerations need to be formalised and integrated into the framework. Music relies heavily on repetition: motives are repeated many times within a piece, and pieces of music imitate each other and cluster into styles. Revealing this repetition is both challenging and crucial. A large range of musical styles will be considered: traditional, classical and popular; acoustic and electronic; and from various cultures. The rich description of music provided by this new computer tool will also be used to investigate elaborate notions such as emotions, groove or mental images.

The approach follows a transdisciplinary perspective, articulating traditional musicology, cognitive science, signal processing and artificial intelligence.

This project is also oriented towards the development of groundbreaking technologies for the general public. Music videos have the potential to significantly increase music appreciation, and the effect is stronger when music and video are closely articulated. Our technologies will make it possible to generate videos on the fly for any music. One challenge in music listening is that it all depends on the listeners' implicit ear training. Automated, immersive, interactive visualisations will help listeners (even those who are hearing-impaired) better understand and appreciate the music they like (or don't like yet). This will make music more accessible and engaging. It will also be possible to browse large music catalogues visually. Applications to music therapy will also be considered.

This project is carried out in collaboration with the National Library of Norway, a world leader in digitising cultural heritage. Check out the National Library of Norway's Digital Humanities Laboratory.

Objectives

The fundamental research question of MIRAGE is:

How to design a computational system that would generate detailed and rich descriptions of music along a large range of dimensions?

A number of sub-questions arise from this:

What kinds of music analysis can be undertaken using this new computational technology that would push musicology forward in new directions?
How can music perception be modelled in the form of a complex system composed of a large set of interdependent modules related to different music dimensions?
What can be discovered in terms of new predictive models describing listeners' percepts and impressions, based on this new type of music analysis?

General methodology

The envisioned methodology consists of close interaction between automated music transcription and detailed musicological analysis of the transcriptions.

Methods based on mathematical analysis of audio recordings are not able to catch all the subtle transformations at work in the music. Most listeners, even if they are neither musicians nor musicologists, are able to follow and understand the logic of the music because they more or less consciously build a somewhat refined analysis of it. For these reasons, we advocate a different approach that attempts to construct a highly detailed analysis of the music in order to grasp some of the subtler aspects and to reach a higher degree of understanding. Besides, there is a high degree of interdependency between the different structural dimensions. No particular dimension of music can be fully understood without taking the other dimensions into consideration as well. For instance, the "simple" task of tracking the beat actually requires the ability to detect repeating motifs, because beat perception can emerge from successive repetitions. But at the same time, motivic analysis requires a rich description of the music, which includes rhythm. We therefore see a circularity in the dependencies that can be addressed only by considering all these aspects together while progressively analysing the piece of music from beginning to end.

Hence, whereas a traditional audio-based approach would typically apply signal processing, machine learning or statistical operators directly to the audio recording, the proposed approach relies instead on a transcription of the audio signal into a music score, followed by an AI-based detailed analysis of the score grounded in musicological and cognitive considerations.
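
To illustrate why a symbolic, score-like representation makes repetition easier to reveal than raw audio, here is a minimal Python sketch. It is not the MIRAGE implementation: it simply assumes that a melody has already been transcribed into a list of MIDI pitch numbers, and it looks for interval patterns that occur more than once, so that a motif is recognised even when it is transposed. The function name `repeated_interval_patterns` and the example melody are hypothetical.

```python
from collections import defaultdict


def repeated_interval_patterns(pitches, length):
    """Return {interval pattern: [start indices]} for patterns occurring more than once.

    `pitches` is a list of MIDI pitch numbers (an assumed, simplified
    stand-in for a transcription); `length` is the number of successive
    intervals in each pattern.
    """
    # Describe the melody by the intervals between successive notes,
    # which makes the description independent of transposition.
    intervals = [b - a for a, b in zip(pitches, pitches[1:])]

    occurrences = defaultdict(list)
    for start in range(len(intervals) - length + 1):
        pattern = tuple(intervals[start:start + length])
        occurrences[pattern].append(start)

    # Keep only the patterns that are actually repeated.
    return {p: starts for p, starts in occurrences.items() if len(starts) > 1}


# Hypothetical melody: a four-note motif, then the same motif a fifth higher.
melody = [60, 62, 64, 60, 67, 69, 71, 67]
patterns = repeated_interval_patterns(melody, 3)
print(patterns)  # {(2, 2, -4): [0, 4]}
```

Here the motif starting on C and its transposition starting on G share the same interval pattern, although their pitches differ. A realistic analysis would also need rhythm, metre and approximate matching, which is where the interdependencies between musical dimensions described above come in.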

Structure

The project is organised into five work packages:

WP1: New Methods for Automated Transcription: Detection of notes from audio, construction of the score based on higher-level musicological analysis (provided by WP2). Tests on a large range of music from diverse cultures and genres. Through close collaboration with musicologists, traditional music from Norway and many other cultures around the world will be considered. Popular and art music from the 20th and 21st centuries will also be studied, especially music using specific instrumentation or specific performing techniques, as well as electro-acoustic music, which further challenges the task of transcription by questioning the basic definitions of "notes" or musical events and their parameters.
WP2: Comprehensive Model for Music Analysis: Modelling of the musicological analysis of music (transcriptions of audio recordings produced in WP1, MIDI files, etc.) along a large range of musical dimensions. Each musical dimension is modelled by a specific module. The complex network of interdependencies between modules is also investigated. Musicological validation following the same principles and plan as in WP1.
WP3: New Perspectives for Musicology. This WP aims at transcending musicology's capabilities through the development of new computational technologies specifically tailored to its needs. Three topics:
- Maximising the informativeness of music visualisation
- Retrieval technology tailored to musicological queries
- Unveiling music intertextuality
WP4: Theoretical and Practical Impacts on Music Cognition. We take advantage of the new analytical tool to enrich music cognition models: theoretically, in that the computational models conceived in this project can suggest blueprints for cognitive models; practically, because a better description of music enables a richer understanding of its impact on listeners. Building on the momentum gathered by our previous software MIRtoolbox in the domain of music cognition, the new computational framework for music analysis will be fully integrated into our new open source toolbox MiningSuite. One particular application consists in enriching predictive models formalising the relationships between musical characteristics and their impact on listeners' appreciation of music. In particular, we will consider music shape and mental images, groove and emotions.
WP5: Technological and Societal Repercussions. This WP examines the large range of possible impacts of the research, with a view to initiating further research and innovation projects and networks. Three axes:
- Valorisation of online music catalogues: As a continuation of the SoundTracer project, we will prototype apps allowing the general public to browse the Norwegian folk music catalogue, understand the characteristics of the different music recordings, interactively search for particular musical characteristics of their choice, such as melodies or rhythmical patterns, and get personalised recommendations based on the users' appreciation of the tunes they have already listened to. Contrary to Spotify or Apple Music, for instance, where songs are compared based on users' consumption, here music will be compared based on actual musical content as found by the analyses produced in WP2.
- Impact on the general public: The objective of this task is to compile a detailed list of possible applications of the developed technologies for the general public. We will imagine, for instance, how new methods of music visualisation can help non-experts better understand how music works and appreciate the richness of music more deeply. We will also investigate how these new technologies might be used, for instance, as a complement to traditional music criticism or to music videos. We will prototype examples of visualisations that will be published in mainstream music or technology magazines. The new capabilities of music retrieval, catalogue browsing and recommendation offered by these new technologies will be studied on various music catalogues. The final objective is to initiate new research and innovation projects around those topics.
- Music therapy tools: We will also prototype music therapy applications of our technologies. In particular, we will extend further our Music Therapy ToolBox (MTTB), dedicated to the analysis of free improvisations between therapists and clients, with the integration of visualisations related to higher-level music analysis. Here also, further applications will be envisaged for future research and innovation proposals.

Lartillot, Olivier; Swarbrick, Dana; Upham, Finn & Cancino-Chacón, Carlos Eduardo (2025). Video Visualization of a String Quartet Performance of a Bach Fugue: Design and Subjective Evaluation. Music & Science. ISSN 2059-2043. 8. doi: 10.1177/20592043251352299. Full text in Research Archive.
Visualizing music (through music notation, analytical representations, or music videos) might potentially boost the appreciation of music in all its richness. The purpose of this study was to design and test a visualization strategy aimed at explicating to a large audience with diverse backgrounds, especially novices, the multifaceted beauty of the final Contrapunctus in J.S. Bach's The Art of Fugue, performed by the Danish String Quartet. At the surface level of the musical structure, the rich fluctuation of pitch shaped by each musician was depicted in the form of undulating pitch curves. At a deeper structural level, the repetition of pitch curves, distinctive of fugues, was highlighted through vertical alignment, inspired by a technique called paradigmatic analysis, originating from anthropology and music semiology. The visualization was initially prototyped in the form of a real-time technology as part of the MusicLab Copenhagen research concert. The concert audience focused on the performance itself, and did not pay much attention to, nor appreciate, the visualization. To evaluate more thoroughly the potential of the visualization, participants with varied musical expertise and taste were invited to listen to a recorded performance of the piece and watch the visualization on their own computer. A large majority reported that they felt they understood the visualization, around half of them felt that it enhanced their musical understanding, and a small group felt that it helped them to better appreciate the music.
Lartillot, Olivier (2024). Musicological and Technological Perspectives on Computational Analysis of Electroacoustic Music. In Jensenius, Alexander Refsum (Ed.), Sonic Design: Explorations Between Art and Science. Springer Nature. ISBN 978-3-031-57892-2. p. 271–297. doi: 10.1007/978-3-031-57892-2_15. Full text in Research Archive.
Analysing electroacoustic music remains challenging, leaving this artistic treasure somewhat out of reach of mainstream musicology and many music lovers. This chapter examines electroacoustic music analysis, covering musicological investigations and desires and technological challenges and potentials. The aim is to develop new technologies to overcome the current limitations. The compositional and musicological foundations of electroacoustic music analysis are based on Pierre Schaeffer's Traité des objets musicaux. The chapter presents an overview of core analytical principles underpinning more recent musicological approaches, including R. Murray Schafer's soundscape analysis, Denis Smalley's spectro-morphology, and Lasse Thoresen's graphical formalisation. Then the state of the art in computational analysis of electroacoustic music is compiled and organised along broad themes, from detecting sound objects to estimating dynamics, facture and grain, mass, motions, space, timbre and rhythm. Finally, I sketch the principles of what could be a Toolbox des objets sonores.
Bishop, Laura; Høffding, Simon; Lartillot, Olivier Serge Gabriel & Laeng, Bruno (2023). Mental Effort and Expressive Interaction in Expert and Student String Quartet Performance. Music & Science. ISSN 2059-2043. 6. doi: 10.1177/20592043231208000.
Maidhof, Clemens; Müller, Viktor; Lartillot, Olivier; Agres, Kat; Bloska, Jodie; Asano, Rie et al. (2023). Intra- and inter-brain coupling and activity dynamics during improvisational music therapy with a person with dementia: an explorative EEG-hyperscanning single case study. Frontiers in Psychology. ISSN 1664-1078. 14. doi: 10.3389/fpsyg.2023.1155732.
Szorkovszky, Alexander; Veenstra, Frank; Lartillot, Olivier Serge Gabriel; Jensenius, Alexander Refsum & Glette, Kyrre (2023). Embodied Tempo Tracking with a Virtual Quadruped, Proceedings of the Sound and Music Computing Conference 2023. SMC Network. ISBN 978-91-527-7372-7. doi: 10.5281/zenodo.10060970. Full text in Research Archive.
Thedens, Hans-Hinrich & Lartillot, Olivier (2023). AudioSegmentor: A tool for disseminating archival recordings online. Studia Musicologica Norvegica. ISSN 0332-5024. 49(1), p. 92–101. doi: 10.18261/smn.49.1.7. Full text in Research Archive.
Much of the holdings of Norwegian folk music archives has already entered or will soon enter the public domain and can then be distributed freely to users through online catalogues. But for many archives, the challenge arises that digitized tapes are not indexed with the start times and durations of the parts that should be associated with catalogue posts, such as recordings of single songs and tunes. The music information retrieval project MIRAGE has now developed tools that can help this process through automation. The tools will be available to all others who face similar challenges in their online presentation of sound recordings.
Lartillot, Olivier; Johansson, Mats Sigvard; Elowsson, Anders; Monstad, Lars Løberg & Cyvin, Mattias Storås (2023). A Dataset of Norwegian Hardanger Fiddle Recordings with Precise Annotation of Note and Beat Onsets. Transactions of the International Society for Music Information Retrieval. ISSN 2514-3298. 6(1), p. 186–202. doi: 10.5334/TISMIR.139.
Juslin, Patrik N.; Sakka, Laura S.; Barradas, Gonçalo T. & Lartillot, Olivier (2022). Emotions, mechanisms, and individual differences in music listening: A stratified random sampling approach. Music Perception. ISSN 0730-7829. 40(1), p. 55–86. doi: 10.1525/mp.2022.40.1.55. Full text in Research Archive.
Lartillot, Olivier; Elovsson, Anders; Johansson, Mats Sigvard; Thedens, Hans-Hinrich & Monstad, Lars Alfred Løberg (2022). Segmentation, Transcription, Analysis and Visualisation of the Norwegian Folk Music Archive. In Pugin, Laurent (Ed.), DLfM '22: 9th International Conference on Digital Libraries for Musicology. Association for Computing Machinery (ACM). ISBN 978-1-4503-9668-4. p. 1–9. doi: 10.1145/3543882.3543883. Full text in Research Archive.
We present an ongoing project dedicated to the transmutation of a collection of field recordings of Norwegian folk music established in the 1960s into an easily accessible online catalogue augmented with advanced music technology and computer musicology tools. We focus in particular on a major highlight of this collection: Hardanger fiddle music. The studied corpus was available as a series of 600 tape recordings, each tape containing up to 2 hours of recordings, associated with metadata indicating approximate positions of pieces of music. We first need to retrieve the individual recording associated with each tune, through the combination of an automated pre-segmentation based on sound classification and audio analysis, and a subsequent manual verification and fine-tuning of the temporal positions, using a home-made user interface. Note detection is carried out by a deep learning method. To adapt the model to Hardanger fiddle music, musicians were asked to record themselves and annotate all played notes, using a dedicated interface. Data augmentation techniques have been designed to accelerate the process, in particular using alignment of varied performances of the same tunes. The transcription also requires the reconstruction of the metrical structure, which is particularly challenging in this style of music. We have also collected ground-truth data, and are conceiving a computational model. The next step consists in carrying out detailed music analysis of the transcriptions, in order to reveal in particular intertextuality within the corpus. A last direction of research is aimed at designing tools to visualise each tune and the whole catalogue, both for musicologists and the general public.
Haugen, Mari Romarheim (2021). Investigating Music-Dance Relationships. A Case Study of Norwegian Telespringar. Journal of Music Theory. ISSN 0022-2909. 65(1), p. 17–38. doi: 10.1215/00222909-9124714.
This article studies the rhythm of Norwegian telespringar, a tradition with an intimate relationship between music and dance that features a nonisochronous meter; that is, the durations between adjacent beats are unequal. A motion-capture study of a fiddler and dance couple revealed a long-medium-short duration pattern at the beat level in both the fiddler's and the dancers' periodic movements. The results also revealed a correspondence between how the fiddler and the dancers executed the motion patterns. This correspondence suggests that the performers share a common understanding of the underlying "feel" of the music. The results are discussed in light of recent theoretical perspectives on the multimodality of human perception. It is argued that the special feel of telespringar derives from embodied sensations related to the dance and how music and dance have developed in tandem over time. The study advocates a holistic view of music and dance, the importance of insider experience, and the role of embodied experience in guiding our understanding of the music as such.
Lartillot, Olivier (2021). Computational Musicological Analysis of Notated Music: a Brief Overview. Nota Bene. ISSN 1891-4829. 15, p. 142–161. Full text in Research Archive.
I present a short overview of computational methods for musicological analysis of notated music. We first need to clarify the various levels of computational representations of music: on one side, notated music, on the other, audio recordings, and in the middle, a note-level representation of music performance where higher-level musical descriptions are absent. The article provides a synthetic and partial panorama of the different types of music analysis that have been systematised and automated using computers. While pioneering works were mainly focused on statistical descriptions of the surface of music, other dimensions of music analysis such as harmony, metre and structure have been taken into consideration since. I conclude by sketching my personal vision of the future of computational music analysis.
Weisser, Stéphanie; Lartillot, Olivier & Sechehaye, Hélène (2021). Investiguer la grésillance. Pour une approche ethno-acoustique du timbre musical. Cahiers d'ethnomusicologie. ISSN 2235-7688. 34, p. 37–58.
Elovsson, Anders & Lartillot, Olivier (2021). A Hardanger Fiddle Dataset with Performances Spanning Emotional Expressions and Annotations Aligned using Image Registration, Proceedings of the 22nd International Society for Music Information Retrieval Conference, Online, Nov 7-12, 2021. International Society for Music Information Retrieval. ISBN 978-1-7327299-0-2. p. 174–181. Full text in Research Archive.
This paper presents a Hardanger fiddle dataset "HF1" with polyphonic performances spanning five different emotional expressions: normal, angry, sad, happy, and tender. The performances thus cover the four quadrants of the activity/valence-space. The onsets and offsets, together with an associated pitch, were human-annotated for each note in each performance by the fiddle players themselves. First, they annotated the normal version. These annotations were then transferred to the expressive performances using music alignment and finally human-verified. Two separate music alignment methods based on image registration were developed for this purpose; a B-spline implementation that produces a continuous temporal transformation curve and a Demons algorithm that produces displacement matrices for time and pitch that also account for local timing variations across the pitch range. Both methods start from an "Onsetgram" of onset salience across pitch and time and perform the alignment task accurately. Various settings of the Demons algorithm were further evaluated in an ablation study. The final dataset is around 43 minutes long and consists of 19 734 notes of Hardanger fiddle music, recorded in stereo. The dataset and source code are available online. The dataset will be used in MIR research for tasks involving polyphonic transcription, score alignment, beat tracking, downbeat tracking, tempo estimation, and classification of emotional expressions.
Lartillot, Olivier; Nymoen, Kristian; Câmara, Guilherme Schmidt & Danielsen, Anne (2021). Computational localization of attack regions through a direct observation of the audio waveform. Journal of the Acoustical Society of America. ISSN 0001-4966. 149(1), p. 723–736. doi: 10.1121/10.0003374.
This article addresses the computational estimation of attack regions in audio recordings. Previous attempts to do so were based on the reduction of the audio waveform into an envelope curve, which decreases its temporal resolution. The proposed approach detects the attack region directly from the audio waveform. The attack region is modeled as a line starting from a low-amplitude point and intersecting one of the local maxima according to two principles: (1) maximizing the slope, while favoring, at the same time, a higher peak if the slope remains only slightly lower and (2) dismissing initial attack regions of relatively low amplitude. The attack start position is fine-tuned by intersecting the attack slope with the audio waveform. The proposed method precisely pinpoints the attack region in cases where it is unambiguously observable from the waveform itself. In such cases, previous methods selected a broader attack region due to the loss of temporal resolution. When attack regions are less evident, the proposed method's estimation remains within the range of results provided by other methods. Applied to the prediction of judgments of P-center localization [Danielsen, Nymoen, Anderson, Câmara, Langerød, Thompson, and London, J. Exp. Psychol. Hum. Percept. Perform. 45, 402–418 (2019)], the proposed method shows a significant increase in precision, at the expense of recall.
Bruford, Fred & Lartillot, Olivier (2020). Multidimensional similarity modelling of complex drum loops using the GrooveToolbox, Proceedings of the 21st International Society for Music Information Retrieval (ISMIR) Conference. McGill-Queen's University Press. ISBN 978-0-9813537-0-8. p. 263–270. Full text in Research Archive.
The GrooveToolbox is a new Python toolbox implementing various algorithms, new and pre-existing, for the analysis and comparison of symbolic drum loops, including rhythm features, similarity metrics and microtiming features. As part of the GrooveToolbox we introduce two new metrics of rhythm similarity and four features for describing the significant properties of microtiming deviations in drum loops. Based on a two-part perceptual evaluation, we show these four new microtiming features can each correlate to similarity perception, and be used with rhythm similarity metrics to improve personalized similarity models for drum loops. A new measure of structural rhythmic similarity is also shown to correlate more strongly to similarity perception of drum loops than the more commonly used Hamming distance. These results point to the potential application of the GrooveToolbox and its new features in drum loop analysis for intelligent music production tools. The GrooveToolbox may be found at: https://github.com/fredbru/GrooveToolbox
Lartillot, Olivier & Bruford, Fred (2020). Bistate reduction and comparison of drum patterns, Proceedings of the 21st International Society for Music Information Retrieval (ISMIR) Conference. McGill-Queen's University Press. ISBN 978-0-9813537-0-8. p. 318–324. Full text in Research Archive.
This paper develops the hypothesis that symbolic drum patterns can be represented in a reduced form as a simple oscillation between two states, a Low state (commonly associated with kick drum events) and a High state (often associated with either snare drum or high hat). Both an onset time and an accent time are associated with each state. The systematic inference of the reduced form is formalized. This enables the specification of a rhythmic structural similarity measure on drum patterns, where reduced patterns are compared through alignment. The two-state representation allows a low computational cost alignment, once the complex topological formalization is fully taken into account. A comparison with the Hamming distance, as well as similarity ratings collected from listeners on a drum loop dataset, indicates that the bistate reduction makes it possible to convey subtle aspects that go beyond surface-level comparison of rhythmic textures.
Elovsson, Karl Anders (2020). Polyphonic pitch tracking with deep layered learning. Journal of the Acoustical Society of America. ISSN 0001-4966. 148(1), p. 446–468. doi: 10.1121/10.0001468.
This article presents a polyphonic pitch tracking system that is able to extract both framewise and note-based estimates from audio. The system uses several artificial neural networks trained individually in a deep layered learning setup. First, cascading networks are applied to a spectrogram for framewise fundamental frequency (f0) estimation. A sparse receptive field is learned by the first network and then used as a filter kernel for parameter sharing throughout the system. The f0 activations are connected across time to extract pitch contours. These contours define a framework within which subsequent networks perform onset and offset detection, operating across both time and smaller pitch fluctuations at the same time. As input, the networks use, e.g., variations of latent representations from the f0 estimation network. Finally, erroneous tentative notes are removed one by one in an iterative procedure that allows a network to classify notes within a correct context. The system was evaluated on four public test sets: MAPS, Bach10, TRIOS, and the MIREX Woodwind quintet and achieved state-of-the-art results for all four datasets. It performs well across all subtasks f0, pitched onset, and pitched offset tracking.
Lartillot, Olivier; Cancino-Chacón, Carlos & Brazier, Charles (2020). Real-Time Visualisation Of Fugue Played By A String Quartet. In Spagnol, Simone & Valle, Andrea (Eds.), Proceedings of the 17th Sound and Music Computing Conference. Axea sas/SMC Network. ISBN 978-88-945415-0-2. p. 115–122. Full text in Research Archive.
We present a new system for real-time visualisation of music performance, focused for the moment on a fugue played by a string quartet. The basic principle is to offer a visual guide to better understand music using strategies that should be as engaging, accessible and effective as possible. The pitch curves related to the separate voices are drawn on a space whose temporal axis is normalised with respect to metrical positions, and aligned vertically with respect to their thematic and motivic classification. Aspects related to tonality are represented as well. We describe the underlying technologies we have developed and the technical setting. In particular, the rhythmical and structural representation of the piece relies on real-time polyphonic audio-to-score alignment using online dynamic time warping. The visualisation will be presented at a concert of the Danish String Quartet, performing the last piece of The Art of Fugue by Johann Sebastian Bach.


Monstad, Lars Løberg (2025). Bandet har millioner av avspillinger på Spotify uten å eksistere: – Problematisk [The band has millions of plays on Spotify without existing: "Problematic"]. [Internet]. NRK.
Artificial intelligence will shape music production in the years ahead, the music industry believes. "It feels pointless to make music manually," says a musician in Brenn.
Lartillot, Olivier (2024). Successes and challenges of computational approaches for audio and music analysis and for predicting music-evoked emotion.
Background: Decades of research in computational sound and music analysis have led to a large range of analysis tools offering rich and diverse descriptions of music, although a large part of the subtlety of music remains out of reach. These descriptors are used to establish computational models predicting perceived or induced emotion directly from music. Although the models can predict a significant amount of the variability of experimentally measured emotions (Panda et al., 2023), further progress seems hard to achieve, probably due to the subtlety of music and of the mechanisms underlying the evocation of emotion from music. Aims: An extensive but synthetic panorama of computational research in sound and music analysis as well as emotion prediction from music is presented. Core challenges are highlighted and prospective ways forward are suggested. Main contribution: For each separate music dimension (dynamics, timbre, rhythm, tonality and mode, motifs, phrasing, structure and form), a synthetic panorama of the state of the art is evoked, highlighting strengths and challenges as well as indicating how particular sound and music features have been found to correlate with rated emotions. The various strategies for modelling emotional reactions to audio and musical features are presented and discussed. One common general analytical approach carries out a broad and approximate analysis of the audio recording based on simple mathematical models, describing individual audio or musical characteristics numerically. It is suggested that such a loose approach might tend to drift away from commonly understood musical processes and to generate artefacts. This vindicates a more traditional musicological approach based on a focus on the score or approximations of it (through automated transcription if necessary) and a reconstruction of the types of traditional representations commonly studied in musicology.
I also argue for the need to closely reflect the way humans listen to and understand music, inspired by a cognitive perspective. Guided by these insights, I sketch the idea of a complex system made of interdependent modules, founded on sequential pattern inference and activation scores not based on statistical sampling. I also suggest perspectives for the improvement of computational prediction of emotions evoked by music. Discussion and conclusion: Further improvements of computational music analysis methods, as well as emotion prediction, seem to call for a change of modelling paradigm. References: R. Panda, R. Malheiro, R. Paiva, "Audio Features for Music Emotion Recognition: A Survey", IEEE Transactions on Affective Computing, 14-1, 68-88, 2023.
Lartillot, Olivier (2024). KI-verktøy for håndtering, transkribering og analyse av musikkarkiver.
I present a range of tools developed in collaboration with the National Library of Norway (Nasjonalbiblioteket). AudioSegmentor automatically splits tape recordings into individual pieces of music. This tool simplified the digitisation of Norsk folkemusikksamling, the Norwegian folk music collection. We use advanced deep learning methods to create a groundbreaking automatic music transcription system, MusScribe, first fine-tuned for Hardanger fiddle and now made available to music archive professionals for a broad range of music. I also discuss our ongoing progress in the automated musicological analysis of folk music pieces and extensive collections.
Ziegler, Michelle; Sudo, Marina; Akkermann, Miriam & Lartillot, Olivier (2024). Towards Collaborative Analysis: Kaija Saariaho's IO.
Lartillot, Olivier (2024). Harmonizing Tradition with Technology: Enhancing Norwegian Folk Music through Computational Innovation.
My work involves developing computational tools to safeguard and elevate the cultural significance of music repertoires, with a focus on a cooperative project with the National Library of Norway related to their collection of Norwegian folk music. Our first phase centered on transforming unstructured audio tapes into a systematic dataset of melodies while ensuring its access and longevity through efficient data management and linking with other catalogues. Our core activity involves transcribing audio recordings into scores, comparing the traditional manual method with our modern attempts towards automation. By providing detailed performance notation, the close alignment between scores and audio recordings will help improve comprehension and overall accessibility, and will enable a more advanced structuring of the collection. Challenges arose when incorporating this music into the International Inventory of Musical Sources (RISM) database, because its 'incipit' concept is ill-suited to genres such as Hardanger fiddle folk music. We suggest innovative generalisations of this concept. Moreover, we are creating techniques to digitally dissect the musical corpus, aiming to extract key features of each tune. This initiative not only serves as an alternative to incipits but also provides novel metadata formats, increasing the usability of the collection and its connectivity, both internally and with other databases.
Monstad, Lars Løberg & Lartillot, Olivier (2024). muScribe: a new transcription service for music professionals.
Lartillot, Olivier (2024). MIRAGE Closing Seminar: Digitisation and computer-aided music analysis of folk music.
One aim of the MIRAGE project is to conceive new technologies that make it possible to better access, understand and appreciate music, with a particular focus on Norwegian folk music. This seminar presents what has been achieved during the four years of the project, leading in particular to the digital version of the Norwegian Catalogue of Folk Music. We are also conceiving tools to automatically transcribe audio recordings of folk music. More advanced musicological applications are discussed as well. To conclude, we introduce the new spinoff project, called muScribe, aimed at the development of transcription services for a broad range of music beyond folk music, tailored in a first stage to professional organisations such as archives, publishers and producers.
Johansson, Mats Sigvard & Lartillot, Olivier (2024). Automated transcription of Hardanger fiddle music: Tracking the beats.
Thedens, Hans-Hinrich & Lartillot, Olivier (2024). The Norwegian Catalogue of Folk Music Online.
Lartillot, Olivier (2024). Real-time MIRAGE visualisation of Bartok's first quartet, first movement.
Lartillot, Olivier (2024). Overview of the MIRAGE project.
Monstad, Lars Løberg & Lartillot, Olivier (2024). Automated transcription of Hardanger fiddle music: Detecting the notes.
Lartillot, Olivier & Monstad, Lars Løberg (2023). MIRAGE - A Comprehensive AI-Based System for Advanced Music Analysis.
Christodoulou, Anna-Maria; Lartillot, Olivier & Anagnostopoulou, Christina (2023). Computational Analysis of Greek Folk Music of the Aegean.
Lartillot, Olivier (2023). Towards a Comprehensive Modelling Framework for Computational Music Transcription/Analysis.
Computational music analysis, still in its infancy and lacking overarching reliable tools, can be seen at the same time as a promising approach to fulfill core epistemological needs. Analysis in the audio domain, although approaching music in its entirety, is doomed to superficiality if it does not fully embrace the underlying symbolic system, requiring a complete automated transcription and scaffolding of metrical, modal/harmonic, voicing and formal structures on top of the layers of elementary events (such as notes). Automated transcription makes it possible to get over the polarity between sound and music notation, providing an interfacing semiotic system that combines the advantages of both domains and surpasses the limitations of traditional approaches based on graphic representations. Deep learning and signal processing approaches for the discretisation of the continuous signal are compared and discussed. The multi-dimensional music transcription and analysis framework (where both tasks are actually deeply intertwined) requires taking into account the far-reaching interdependencies between dimensions, for instance between motivic and metrical analysis. We propose an attempt to build such a comprehensive framework, founded on general musical and cognitive principles, in which music analysis capabilities are built through a combination of simple and general operators. The validity of the analyses is addressed in close discussion with music experts. The potential capability to produce valid analyses for a very large corpus of music would make such a complex system a potentially relevant blueprint for a cognitive modelling of music understanding.
We try to address a large diversity of music cultures and their specific challenges: among others, maqam modes (with Mondher Ayari), Norwegian Hardanger fiddle rhythm (with Mats Johansson and Hans-Hinrich Thedens), djembe drumming from Mali (with Rainer Polak) or electroacoustic music (Towards a Toolbox des objets musicaux, with Rolf Inge Godøy). We aim at making the framework fully transparent, collaborative and open.
Lartillot, Olivier (2023). Music Therapy Toolbox, and prospects.
Lartillot, Olivier & Monstad, Lars Løberg (2023). Computational music analysis: Significance, challenges, and our proposed approach.
Music is something that we mostly all appreciate, yet it remains a hidden and enigmatic concept for many of us. Music notation, in the form of music scores, facilitates practicing and enhances the understanding of the richness of musical works. However, acquiring musical scores for any music performance is a tedious and demanding task (called music transcription) that demands considerable proficiency. Hence the interest of computational automation. But music is not just notes, it is also melody, rhythm, themes, timbre, and very subtle aspects such as form. While many of us may not be consciously familiar with these concepts, they still have a subconscious influence on our aesthetic experience. Interestingly, it often happens that the more we consciously understand the underlying language of music, the more we tend to appreciate and enjoy it. Therefore, there is value in creating computational tools that can automate and enhance these types of analyses. The presenters' past work resulted in the creation of Matlab's MIRtoolbox, which measures a broad range of musical characteristics directly from audio through signal processing techniques. Currently, the MIRAGE project prioritises music transcription (with a particular focus on Norwegian folk music), blending neural-network-based deep learning with conventional rule-based models. Through this project, they highlight the importance of acknowledging the interconnectedness between all musical elements. Additionally, they have crafted animated visualisations to make analyses more accessible to the general public and are aiming to make music transcription technology available to the public, with support from UiO Growth House.
Lartillot, Olivier (2023). MIRAGE Symposium #2: Music, emotions, analysis, therapy ... and computer.
The 2nd MIRAGE Symposium covers a broad range of topics related to the MIRAGE project, mainly related to music and emotion, music cognition in general, music analysis and music therapy. Featuring two keynotes by Patrik Juslin and Didier Grandjean.
Wosch, Thomas; Vobig, Bastian; Lartillot, Olivier & Christodoulou, Anna-Maria (2023). HIGH-M (Human Interaction assessment and Generative segmentation in Health and Music).
Maidhof, Clemens; Agres, Kat; Fachner, Jörg & Lartillot, Olivier (2023). Intra- and inter-brain coupling during music therapy.
Monstad, Lars Løberg & Lartillot, Olivier (2023). Automatic Transcription Of Multi-Instrumental Songs: Integrating Demixing, Harmonic Dilated Convolution, And Joint Beat Tracking.
In the rapidly expanding field of music information retrieval (MIR), automatic transcription remains one of the most sought-after capabilities, especially for songs that employ multiple instruments. Musscribe emerges as a state-of-the-art transcription tool that addresses this challenge by integrating three distinct methodologies: demixing, harmonic dilated convolution, and joint beat tracking. Demixing is employed to isolate individual instruments within a song by separating overlapping audio sources, thus ensuring each instrument is transcribed distinctly. Beat tracking is then run as a parallel process to extract the joint beat and downbeat estimations. These processes result in an output MIDI file, which is then quantized using information derived from the beat tracking. As such, this method paves the way for more accurate and sophisticated analyses, bridging the gap between human and machine understanding of music. Together, these methodologies allow us to produce transcriptions that are not only accurate but also highly representative of the original compositions. Preliminary tests and evaluations showcase the potential in transcribing complex musical pieces with high fidelity, outperforming many contemporary tools on the market. This innovative approach has implications not only for music transcription but also for broader applications in audio analysis, remixing, and digital music production. The model has been instrumental in accelerating the composition process for several Norwegian television shows. Moreover, its efficacy can be observed in the Netflix series "A Storm for Christmas." Renowned composer Peter Baden harnessed this tool to enhance his workflow, demonstrating the demand for innovative tools like this in the professional music industry.
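The quantisation step mentioned in this summary can be illustrated with a minimal sketch. It assumes that note onsets and tracked beat times are already available as lists of seconds. The function name `quantize_onsets` and the `subdivisions` parameter are hypothetical and are not taken from the tool described above, whose actual implementation is not specified here.

```python
from bisect import bisect_right


def quantize_onsets(onsets, beats, subdivisions=4):
    """Snap note onsets (seconds) to a grid derived from tracked beats.

    Returns positions in beat units, e.g. 1.5 means halfway between
    beats[1] and beats[2]. Hypothetical helper for illustration only.
    """
    if len(beats) < 2:
        raise ValueError("need at least two beat times")
    positions = []
    for t in onsets:
        # Find the beat interval containing t, clamped to existing intervals
        # so that onsets before the first or after the last beat are
        # extrapolated from the nearest interval.
        i = min(max(bisect_right(beats, t) - 1, 0), len(beats) - 2)
        start, end = beats[i], beats[i + 1]
        step = (end - start) / subdivisions
        k = round((t - start) / step)  # nearest grid slot within the beat
        positions.append(i + k / subdivisions)
    return positions


# Beats of uneven length: the grid stretches with the local beat duration.
beats = [0.0, 0.5, 1.1, 1.6]
onsets = [0.02, 0.27, 0.52, 0.83]
print(quantize_onsets(onsets, beats))
```

Because the grid is rebuilt inside each beat interval, tempo fluctuations in the performance do not accumulate into quantisation drift, which is one reason to quantise against tracked beats instead of a fixed tempo.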
Christodoulou, Anna-Maria; Lartillot, Olivier & Anagnostopoulou, Christina (2023). Greek Folk Music Dataset.
Lartillot, Olivier; Swarbrick, Dana; Upham, Finn & Cancino-Chacón, Carlos Eduardo (2023). Video visualization of a string quartet performance of a Bach Fugue: Design and subjective evaluation.
Bishop, Laura; Høffding, Simon; Laeng, Bruno & Lartillot, Olivier (2023). Mental effort and expressive interaction in expert and student string quartet performance.
Monstad, Lars Alfred Løberg (2023). KI kan demokratisere musikkbransjen. VG : Verdens gang. ISSN 0805-5203.
Lartillot, Olivier (2023). Computational audio and musical features extraction: from MIRtoolbox to the MiningSuite.
Lartillot, Olivier (2023). Dynamic Visualisation of Fugue Analysis, Demonstrated in a Live Concert by the Danish String Quartet.
Lartillot, Olivier (2023). Towards a comprehensive model for computational music transcription and analysis: a necessary dialog between machine learning and rule-based design?
Lartillot, Olivier; Thedens, Hans-Hinrich; Mjelva, Olav Luksengård; Elovsson, Anders; Monstad, Lars Løberg; Johansson, Mats Sigvard et al. (2023). Norwegian Folk Music & Computational Analysis.
As a prelude to Norway's Constitution Day, this special event celebrated the Norwegian folk music tradition, showcasing our new online archive and demonstrating the richness of Hardanger fiddle music, with live performance. One aim of the project is to conceive new technologies that make it possible to better access, understand and appreciate Norwegian folk music. In this event, we introduced a new online version of the Norwegian Folk Music Archive and discussed the underlying theoretical and technical challenges. A live concert/workshop, with the participation of Olav Luksengård Mjelva, offered a lively introduction to Hardanger fiddle music and its elaborate rhythm. The interests and challenges of automated transcription and analysis were discussed, with the public release of our new software Annotemus. The symposium was organised in the context of the MIRAGE project (RITMO, in collaboration with the National Library of Norway's Digital Humanities Laboratory).
Monstad, Lars Alfred Løberg; Baden, Peter & Wærstad, Bernt Isak Grave (2023). Kan kunstig intelligens brukes i låtskriverprosessen?
Monstad, Lars Løberg (2023). Kunstig Intelligens i kunst og kultur. [TV]. NRK Dagsrevyen.
Monstad, Lars Alfred Løberg (2023). Demonstrasjon av Kunstig Intelligens som verktøy for komponister.
Monstad, Lars Løberg; Silje Larsen, Borgan & Vegard, Waske (2023). AI i musikken: konsekvenser og muligheter.
Lartillot, Olivier & Thedens, Hans-Hinrich (2022). Online Norwegian Folk Music Archive.
Lartillot, Olivier (2022). The MIRAGE project: Unlocking new computational abilities in computational music analysis.
Lartillot, Olivier (2022). Computational music analysis: Application to music & emotion.
Lartillot, Olivier; Godøy, Rolf Inge & Christodoulou, Anna-Maria (2022). Computational detection and characterisation of sonic shapes: Towards a Toolbox des objets sonores.
Computational detection and analysis of sound objects is of high importance both for musicology and sound design. Yet Music Information Retrieval technologies have so far been mostly focusing on transcription of music into notes in a classical sense whereas we are interested in detecting sound objects and their feature categories, as was suggested by Pierre Schaeffer's typology and morphology of sound objects in 1966, reflecting basic sound-producing action types. We propose a signal-processing based approach for segmentation, based on a tracking of the salient characteristics over time, and dually Gestalt-based segmentation decisions based on changes. Tracking of pitched sound relies on partial tracking, whereas the analysis of noisy sound requires tracking of larger frequency bands possibly varying over time. The resulting sound objects are then described based on Schaeffer's taxonomy and morphology, expressed first in the form of numerical descriptors, each related to one type of taxonomy (percussive/sustained/iterative, stable/moving pitch vs unclear pitch) or morphology (such as grain). This multidimensional feature representation is further divided into discrete categories related to the different classes of sounds. The typological and morphological categorisation is driven by the theoretical and experimental framework of the morphodynamical theory. We first experiment on isolated sounds from the Solfège des objets sonores, which features a large variety of sound sources, before considering more complex configurations featuring a succession of sound objects without silence or with simultaneous sound objects. Analytical results are visualised in the form of graphical representations, aimed both for musicology and music pedagogy purposes. This will be applied to the graphical descriptions of and browsing within large music catalogues. The application of the analytical descriptions to music creation is also investigated.
Lartillot, Olivier; Elovsson, Anders; Johansson, Mats Sigvard; Thedens, Hans-Hinrich & Monstad, Lars Alfred Løberg (2022). Segmentation, Transcription, Analysis and Visualisation of the Norwegian Folk Music Archive.
We present an ongoing project dedicated to the transmutation of a collection of field recordings of Norwegian folk music established in the 1960s into an easily accessible online catalogue augmented with advanced music technology and computer musicology tools. We focus in particular on a major highlight of this collection: Hardanger fiddle music. The studied corpus was available as a series of 600 tape recordings, each tape containing up to 2 hours of recordings, associated with metadata indicating approximate positions of pieces of music. We first need to retrieve the individual recording associated with each tune, through the combination of an automated pre-segmentation based on sound classification and audio analysis, and a subsequent manual verification and fine-tuning of the temporal positions, using a home-made user interface. Note detection is carried out by a deep learning method. To adapt the model to Hardanger fiddle music, musicians were asked to record themselves and annotate all played notes, using a dedicated interface. Data augmentation techniques have been designed to accelerate the process, in particular using alignment of varied performances of the same tunes. The transcription also requires the reconstruction of the metrical structure, which is particularly challenging in this style of music. We have also collected ground-truth data, and are conceiving a computational model. The next step consists in carrying out detailed music analysis of the transcriptions, in order to reveal in particular intertextuality within the corpus. A last direction of research is aimed at designing tools to visualise each tune and the whole catalogue, both for musicologists and the general public.
Danielsen, Anne; Câmara, Guilherme Schmidt; Lartillot, Olivier; Leske, Sabine Liliana & Spiech, Connor (2022). Musical rhythm. Behavioural, computational and neurophysiological perspectives.
Dalgard, Joachim; Lartillot, Olivier; Vuoskoski, Jonna Katariina & Guldbrandsen, Erling Eliseus (2021). Absorption - Somewhere between the heart and the brain.
Lartillot, Olivier & Johansson, Mats Sigvard (2021). Automated beat tracking of Norwegian Hardanger fiddle music.
Norwegian Hardanger fiddle music is typically played by a solo fiddler, without rhythmic accompaniment except for the musician's discreet foot stomping. Some of its repertoire features an asymmetrical ternary meter, with an uneven proportion of durations between the three beats of each bar, and with varying degrees of fluctuation of those proportions throughout each piece. In addition, there is often no clear audible onset corresponding to the beat position. As a result, many listeners find it difficult to hear the beats without experience from playing or dancing, and the beat onsets cannot be properly tracked by state-of-the-art beat trackers. The aim of this study is to develop a computational model of beat tracking of Hardanger fiddle music. Due to the rhythmic irregularity of the music, computational approaches relying on the detection of regular periodicities cannot be used. The proposed strategy adopts a cognitive perspective, modeling processes that progressively infer beats while scanning the music sequence chronologically. Each successive note is assigned a tentative metrical position, which is determined based on a set of rules, using various input data such as (1) the ratio of the inter-onset interval (IOI) from the previous beat onset to the current note onset and the preceding inter-beat-onset interval and (2) the ratio of the IOI from the bar onset to the current note onset and the preceding inter-bar-onset interval. Successive repetitions of eighth notes (as well as of eighth-note triplets) induce specific states that also guide the subsequent extension of the sequence. Multiple beat tracking scenarios can coexist at particular moments in the tune for very short periods. In particular, the very first notes at the beginning of the tune may initially imply conflicting metrical structures and tempi. The conflicting beat tracking scenarios are progressively extended in parallel, note after note.
A scenario ends whenever it reaches a dead-end situation where the music is in total contradiction. Multiple scenarios are fused when they are continued exactly the same way, and only the scenario deemed the most congruent is retained. One particularity of Hardanger fiddle music is that beat onsets are not precise points in time but rather diffuse temporal extension, closely related to the notion of beat bin (Danielsen, 2010). Sometimes, multiple successive notes can all be considered as possible onsets for a given beat (Johansson, 2010; Stover et al., 2021). This multiplicity of beat onsets has been integrated into the model. Most of the analysis can be carried out using solely note onset time as input data, although more challenging cases occasionally require taking into account note duration or higher structure such as motivic repetition. This indicates that a proper beat tracker needs to be integrated as a module within a comprehensive music analysis framework, with bidirectional dependencies with the other modules of the framework. The model has so far been tuned and tested on a couple of tunes only. Its application to the automated analysis of a larger corpus is under investigation. Danielsen, Anne (2010). "Here, there, and everywhere. Three accounts of pulse in D'Angelo's 'Left and Right'." In A. Danielsen (Ed.), Musical Rhythm in the Age of Digital Reproduction. Farnham: Ashgate/Routledge, UK. Johansson, Mats (2010). "The Concept of Rhythmic Tolerance – Examining Flexible Grooves in Scandinavian Folk-fiddling." In A. Danielsen (Ed.), Musical Rhythm in the Age of Digital Reproduction. Farnham: Ashgate/Routledge, UK. Stover, Chris; Danielsen, Anne & Johansson, Mats (2021). "Bins, Spans, Tolerance: Three Theories of Microtiming Behavior." [under review in Music Theory Spectrum].
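The two input ratios named in this summary can be written down directly. The sketch below only computes those ratios for a new note, given the beat and bar onsets inferred so far; the rules that map them to a metrical position are not given in the summary and are not reproduced here. The function name `ioi_ratios` and its argument layout are assumptions made for illustration.

```python
def ioi_ratios(note_onset, beat_onsets, bar_onsets):
    """Compute the two IOI ratios described in the summary for one note.

    beat_onsets and bar_onsets are the onsets (seconds) inferred so far,
    in chronological order. Hypothetical helper, not the authors' code.
    """
    if len(beat_onsets) < 2 or len(bar_onsets) < 2:
        raise ValueError("need at least two previous beat and bar onsets")
    # (1) IOI from the previous beat onset to the note, relative to the
    #     preceding inter-beat-onset interval.
    beat_ratio = (note_onset - beat_onsets[-1]) / (beat_onsets[-1] - beat_onsets[-2])
    # (2) IOI from the current bar onset to the note, relative to the
    #     preceding inter-bar-onset interval.
    bar_ratio = (note_onset - bar_onsets[-1]) / (bar_onsets[-1] - bar_onsets[-2])
    return beat_ratio, bar_ratio


# A note falling halfway through the current beat and halfway through the bar.
print(ioi_ratios(2.25, beat_onsets=[1.5, 2.0], bar_onsets=[0.0, 1.5]))
```

Expressing positions as ratios to the preceding interval, instead of absolute durations, keeps the inputs comparable when beat lengths fluctuate, which the summary identifies as typical of this repertoire.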
Thedens, Hans-Hinrich (2021). Archiving representations of a folk music tradition in sound and notation.
Danielsen, Anne (2021). Opening remarks, presentation of RITMO.
Lartillot, Olivier; Guldbrandsen, Erling Eliseus & Cancino-Chacón, Carlos Eduardo (2021). Dynamics analysis, and application to a comparative study of Bruckner performances.
Johansson, Mats Sigvard (2021). Representing meter in traditional fiddle music: Accounting for variability and ambiguities.
Lartillot, Olivier & Johansson, Mats Sigvard (2021). Tracking beats in Hardanger fiddle tunes.
Lartillot, Olivier; Elovsson, Anders & Mjelva, Olav Luksengård (2021). A new software for computer-assisted annotation of music recordings, with a focus on transcription.
Lartillot, Olivier (2021). Presentation of MIRAGE project.
Godøy, Rolf Inge & Lartillot, Olivier (2021). Acoustic substrates of musique concrète features: Towards a Toolbox de l'objet musical?
Elovsson, Anders (2021). Polyphonic transcription and generation of annotated datasets using score alignment.
Tidemann, Aleksander (2021). Exploring Hardanger fiddle performance patterns through interactive tools.
Tidemann, Aleksander & Lartillot, Olivier (2021). Interactive tools for exploring performance patterns in hardanger fiddle music.
Lartillot, Olivier & Weisser, Stéphanie (2021). Roughness, Crackliness, Buzzingness, ...: Characterizations of Sonic Unsteadiness and Application to the Analysis of Traditional Music from Ethiopia, Kenya, Morocco and India.
Elovsson, Anders & Lartillot, Olivier (2021). A Hardanger Fiddle Dataset with Performances Spanning Emotional Expressions and Annotations Aligned using Image Registration.
This paper presents a Hardanger fiddle dataset "HF1" with polyphonic performances spanning five different emotional expressions: normal, angry, sad, happy, and tender. The performances thus cover the four quadrants of the activity/valence-space. The onsets and offsets, together with an associated pitch, were human-annotated for each note in each performance by the fiddle players themselves. First, they annotated the normal version. These annotations were then transferred to the expressive performances using music alignment and finally human-verified. Two separate music alignment methods based on image registration were developed for this purpose: a B-spline implementation that produces a continuous temporal transformation curve and a Demons algorithm that produces displacement matrices for time and pitch that also account for local timing variations across the pitch range. Both methods start from an "Onsetgram" of onset salience across pitch and time and perform the alignment task accurately. Various settings of the Demons algorithm were further evaluated in an ablation study. The final dataset is around 43 minutes long and consists of 19 734 notes of Hardanger fiddle music, recorded in stereo. The dataset and source code are available online. The dataset will be used in MIR research for tasks involving polyphonic transcription, score alignment, beat tracking, downbeat tracking, tempo estimation, and classification of emotional expressions.
Lartillot, Olivier & Lilleslåtten, Mari (2021). Artificial intelligence can help you understand music better. [Internet]. RITMO News.
Algorithms and technology have so far helped listeners to more of the same music. Now, UiO researchers are working on new technology that can get people interested in a greater musical variety.
Lartillot, Olivier & Lilleslåtten, Mari (2021). Olivier Lartillot utvikler verktøy for å forstå musikk bedre. [Internet]. Det humanistiske fakultet UiO YouTube account.
Artificial intelligence can help you understand music better. UiO researcher Olivier Lartillot is working to ensure that new technology can open people's ears to new music.
Haugen, Mari Romarheim (2021). Asymmetrical Meter and Periodic Body Motion in Norwegian Telespringar Performance.
Elovsson, Anders & Lartillot, Olivier (2021). HF1: Hardanger fiddle dataset.
HF1 is a Hardanger fiddle dataset with polyphonic performances spanning five different emotional expressions. The onsets and offsets, together with an associated pitch, were human-annotated for each note in each performance by the fiddle players themselves. The dataset is around 43 minutes long and consists of 19 734 notes of Hardanger fiddle music, recorded in stereo.
Tidemann, Aleksander; Lartillot, Olivier & Johansson, Mats Sigvard (2021). Towards New Analysis And Visualization Software For Studying Performance Patterns in Hardanger Fiddle Music.
Analyzing musical performances is a challenging and emergent field of computational music research, aiming to reveal performance patterns and link them to musical contexts. There exists a modest amount of computational research on Hardanger fiddle performances. The MIRAGE research project is currently contributing to this scientific body, developing advanced MIR frameworks that build on recent musicological research. This paper presents the development and evaluation of two Max/MSP/Jitter software applications for music analysis and data visualization that integrate contemporary research perspectives on the complex rhythmical structuring of springar performances, investigating how we can design user-friendly computational tools that explore performance patterns in Hardanger fiddle music, in collaboration with MIRAGE. Based on a small questionnaire and a few operational tests, the study shows an interest in more effective software tools capable of revealing complex interrelations between musical dimensions in Hardanger fiddle performances. Additionally, the study highlights design considerations for tools aiming to increase the availability of computational music research in the field of musicology, such as cross-compatibility and integrated features that actively facilitate nuanced interpretation processes.
Lartillot, Olivier; Cancino-Chacón, Carlos & Brazier, Charles (2020). Real-Time Visualisation Of Fugue Played By A String Quartet.
We present a new system for real-time visualisation of music performance, focused for the moment on a fugue played by a string quartet. The basic principle is to offer a visual guide to better understand music using strategies that should be as engaging, accessible and effective as possible. The pitch curves related to the separate voices are drawn on a space whose temporal axis is normalised with respect to metrical positions, and aligned vertically with respect to their thematic and motivic classification. Aspects related to tonality are represented as well. We describe the underlying technologies we have developed and the technical setting. In particular, the rhythmical and structural representation of the piece relies on real-time polyphonic audio-to-score alignment using online dynamic time warping. The visualisation will be presented at a concert of the Danish String Quartet, performing the last piece of The Art of Fugue by Johann Sebastian Bach.
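To give an idea of the alignment principle mentioned in this summary, here is a minimal sketch of classic offline dynamic time warping on two pitch sequences. This is a deliberate simplification: the system described above uses an online, polyphonic, audio-to-score variant whose details are not given here, and the function name `dtw_path` and the absolute-difference local cost are assumptions made for illustration.

```python
def dtw_path(a, b):
    """Align two numeric sequences with offline dynamic time warping.

    Returns (total_cost, path), where path is a list of index pairs (i, j)
    matching a[i] to b[j]. Simplified illustration, not the online variant.
    """
    n, m = len(a), len(b)
    if n == 0 or m == 0:
        raise ValueError("both sequences must be non-empty")
    inf = float("inf")
    # cost[i][j] = cheapest alignment of a[:i] with b[:j]
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])  # local distance between elements
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    # Backtrack from the end to recover the warping path.
    path = []
    i, j = n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        _, i, j = min(
            (cost[i - 1][j - 1], i - 1, j - 1),
            (cost[i - 1][j], i - 1, j),
            (cost[i][j - 1], i, j - 1),
        )
    path.reverse()
    return cost[n][m], path


score = [60, 62, 64]                # MIDI pitches in the score
performance = [60, 60, 62, 64, 64]  # performed frames, some notes held longer
total, path = dtw_path(score, performance)
print(total, path)
```

An online variant would have to extend the path incrementally as new audio frames arrive, without access to the full cost matrix, which is what makes the real-time setting harder than this offline sketch.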
Bruford, Fred & Lartillot, Olivier (2020). Multidimensional similarity modelling of complex drum loops using the GrooveToolbox.
The GrooveToolbox is a new Python toolbox implementing various algorithms, new and pre-existing, for the analysis and comparison of symbolic drum loops, including rhythm features, similarity metrics and microtiming features. As part of the GrooveToolbox we introduce two new metrics of rhythm similarity and four features for describing the significant properties of microtiming deviations in drum loops. Based on a two-part perceptual evaluation, we show these four new microtiming features can each correlate to similarity perception, and be used with rhythm similarity metrics to improve personalized similarity models for drum loops. A new measure of structural rhythmic similarity is also shown to correlate more strongly to similarity perception of drum loops than the more commonly used Hamming distance. These results point to the potential application of the GrooveToolbox and its new features in drum loop analysis for intelligent music production tools. The GrooveToolbox may be found at: https://github.com/fredbru/GrooveToolbox
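As a point of reference for the baseline mentioned in this summary, the Hamming distance between two drum patterns can be sketched as follows. The sketch assumes patterns are encoded as equal-length binary onset lists on a shared time grid. The function name `hamming_distance` is hypothetical and this is not the GrooveToolbox API.

```python
def hamming_distance(pattern_a, pattern_b):
    """Count grid positions where two equal-length onset patterns differ.

    Patterns are lists of 0/1 values, one per time step.
    Illustrative helper only, not taken from the GrooveToolbox.
    """
    if len(pattern_a) != len(pattern_b):
        raise ValueError("patterns must have the same number of steps")
    return sum(x != y for x, y in zip(pattern_a, pattern_b))


# Two kick-drum patterns on an eight-step grid.
kick_a = [1, 0, 0, 0, 1, 0, 0, 0]
kick_b = [1, 0, 0, 1, 0, 0, 1, 0]
print(hamming_distance(kick_a, kick_b))  # → 3
```

The measure compares patterns step by step, so an onset shifted by one grid position counts as two mismatches. This surface-level behaviour is the kind of limitation that a structural similarity measure aims to go beyond.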
Lartillot, Olivier & Toiviainen, Petri (2020). Read about the Matlab MIRtoolbox. Young Acousticians Network (YAN) Newsletter. p. 4–10.
MIRtoolbox is a Matlab toolbox dedicated to the analysis of music and sound from audio recordings and to the extraction of musical features such as tonality, rhythm, or structures. It has also been used for non-musical applications, such as in Non Destructive Testing, and with non-audio signals. In this issue of the newsletter, the YAN discusses the MIRtoolbox with Olivier Lartillot (RITMO Centre for Interdisciplinary Studies in Rhythm, Time and Motion, University of Oslo, Norway) and Petri Toiviainen (University of Jyväskylä, Finland). You can also check out the MIRtoolbox website at: shorturl.at/oA038
Lartillot, Olivier & Bruford, Fred (2020). Bistate reduction and comparison of drum patterns.
This paper develops the hypothesis that symbolic drum patterns can be represented in a reduced form as a simple oscillation between two states, a Low state (commonly associated with kick drum events) and a High state (often associated with either snare drum or hi-hat). Both an onset time and an accent time are associated with each state. The systematic inference of the reduced form is formalized. This enables the specification of a rhythmic structural similarity measure on drum patterns, where reduced patterns are compared through alignment. The two-state representation allows a low computational cost alignment, once the complex topological formalization is fully taken into account. A comparison with the Hamming distance, as well as similarity ratings collected from listeners on a drum loop dataset, indicates that the bistate reduction makes it possible to convey subtle aspects that go beyond surface-level comparison of rhythmic textures.
Joachimiak, Grzegorz; Ahrendt, Rebekah & Lartillot, Olivier (2024). Endangered Musical Sources: Strategies for Safeguarding, Digitization, and International Collaboration. Report of Working Group 2 SOURCES, Wrocław, 22–24 May 2024. Zenodo.
Christodoulou, Anna-Maria; Anagnostopoulou, Christina & Lartillot, Olivier (2022). Computational Analysis of Greek folk music of the Aegean islands. National and Kapodistrian University of Athens.

View all works in Cristin

Published May 12, 2019 5:10 PM - Last modified Aug. 9, 2025 9:39 AM

Contact

Head of project:

Olivier Lartillot

Participants

Olivier Lartillot University of Oslo
Anders Elovsson University of Oslo
Lars Alfred Løberg Monstad University of Oslo
Kyrre Glette University of Oslo
Rolf Inge Godøy University of Oslo
Erling E. Guldbrandsen University of Oslo
Carlos Eduardo Cancino-Chacón University of Oslo
Anne Danielsen University of Oslo
Alexander Refsum Jensenius University of Oslo
Mari Romarheim Haugen University of Oslo
Hans-Hinrich Thedens
Mats Sigvard Johansson

Detailed list of participants

Project duration

December 2019 – 2024

Financing

MIRAGE is funded by The Research Council of Norway under the program IKTPLUSS.

News

Music & Science paper: "Video Visualization of a String Quartet Performance of a Bach Fugue: Design and Subjective Evaluation"

We design and test a visualization strategy aimed at explicating to a large audience with diverse backgrounds, especially novices, the multifaceted beauty of the final Contrapunctus in J.S. Bach's The Art of Fugue, performed by the Danish String Quartet. At the surface level of the musical structure, the rich fluctuation of pitch shaped by each musician was depicted in the form of undulating pitch curves. At a deeper structural level, the repetition of pitch curves, distinctive of fugues, was highlighted through vertical alignment, inspired by a technique called paradigmatic analysis, originating from anthropology and music semiology.

Simplified version:

Complete version:

MIRAGE spinoff project: muScribe

We develop a service for music archives to digitize music performance recordings using state-of-the-art deep learning and our own cutting-edge research. Our hybrid approach is markedly original, merging the strengths of machine learning with symbolic AI, rooted in cognitive science and musicology. The project is particularly oriented towards cultural institutions, music publishers, and copyright organizations. By automating transcription, we reduce costs and increase the precision and availability of music scores.

MIRAGE Closing Seminar: Digitisation and computational analysis of folk music

This seminar presents what has been achieved during the four years of the MIRAGE project, leading in particular to the digital version of the Norwegian Catalogue of Folk Music. We are also developing tools to automatically transcribe and analyse audio recordings of folk music, and are launching a transcription service for music professionals.

Highlighted use case of supercomputing services in the Annual Report of Sigma2, the Norwegian research infrastructure services.

"We take substantial pride in the impactful research facilitated by our national e-infrastructure services. We are confident these research projects will produce new and invaluable scientific insights."

TISMIR paper: "A Dataset of Norwegian Hardanger Fiddle Recordings with Precise Annotation of Note and Beat Onsets"

This paper presents a dataset of several hours of recordings of Hardanger fiddle music, with note annotations of onsets, offsets and pitches, provided by the performers themselves. A subset has also been annotated with beat onset positions by the performer as well as three expert musicians. We design a new method for beat annotation in Hardanger fiddle music based on a selection of notes in the note annotation.

MIRAGE Symposium #2: Music, emotions, analysis, therapy ... and computer

The 2nd MIRAGE Symposium covers a broad range of topics related to the MIRAGE project, mainly related to music and emotion, music cognition in general, music analysis and music therapy. Featuring two keynotes by Patrik Juslin and Didier Grandjean.

Special event: Norwegian Folk Music & Computational Analysis

As a prelude to Norway's Constitution Day celebrations, this special event on May 16, 2023 at the National Library of Norway exhibited new technologies for better accessing, understanding and appreciating Norwegian folk music.

Computational Musicology Working Group

The Computational Musicology Working Group is a MIRAGE initiative to foster collaborations between Music Information Retrieval (MIR) and Musicology.

Participation in IMS 2022

IMS Study Group "Digital Musicology":
"Crossing Borders in Computational Musicology"

Participation in DLFM @ IAML 2022

Publication:

"Segmentation, Transcription, Analysis and Visualisation of the Norwegian Folk Music Archive"

Dataset:

Norwegian Catalogue of Folk Music online at OSF

Software release:

SoundSegmentor software
Annotemus software

Both software packages are about to be published. Please register on the MIRAGE mailing list to be notified about the release.

MusicLab 8: Synaesthesia

We explored the synaesthesia of guitarist and artistic researcher Bjørn Charles Dreyer in a multimodal and interactive concert based on new technologies specifically developed for this collaboration. You can watch the concert, followed by a panel discussion and a technical demonstration:

MusicLab Copenhagen

We premiered a new type of music visualisation during a live performance of the last Contrapunctus of J.S. Bach's The Art of Fugue by the Danish String Quartet. The aim was to convey to a non-expert audience both surface and structural aspects of the music. A score-following module was designed by Carlos Eduardo Cancino-Chacón, in the context of a collaboration with the Con Espressione project.

MIRAGE Symposium #1: Computational Musicology

The 1st MIRAGE Symposium, which took place on 8-9 June 2021, was recorded and is available for replay.

New method for computational analysis of sound chosen as JASA highlight

A recently published article has been chosen as a highlight by The Journal of the Acoustical Society of America (JASA).

"Computational Musicological Analysis of Notated Music: A Brief Overview"

A chapter from the newly released book "Notated Music in the Digital Sphere: Possibilities and Limitations", Nota Bene 15