{"id":5792,"date":"2026-05-05T09:00:00","date_gmt":"2026-05-05T07:00:00","guid":{"rendered":"https:\/\/hacnumedia.org\/?p=5792"},"modified":"2026-05-05T09:09:02","modified_gmt":"2026-05-05T07:09:02","slug":"como-the-tiny-model-that-generates-sound-through-movement","status":"publish","type":"post","link":"https:\/\/hacnumedia.org\/en\/como-the-tiny-model-that-generates-sound-through-movement\/","title":{"rendered":"CoMo, the tiny model that generates sound through movement"},"content":{"rendered":"\n<p class=\"is-style-chapo\"><strong>In this new series, HACNUM\u00e9dia invites readers to explore the tools shaping contemporary digital creation. Not only the most visible ones, but also those circulating at the margins of dominant trends. Here, the focus is on CoMo: a web-based environment developed at IRCAM that uses small machine learning models to link gestures to sounds. Technically, it is AI. In practice, it has very little to do with what that term commonly evokes today.<br\/>    <\/strong><\/p>\n\n<p>Following our <a href=\"https:\/\/hacnumedia.org\/en\/creating-with-ai-ecological-alternatives-do-exist\/\">article<\/a> on tiny models and ecological alternatives to generative AI, <a href=\"https:\/\/apps.ismm.ircam.fr\/como\">CoMo<\/a> illustrates another possible approach: frugal, embodied, collective. Developed for nearly ten years by <a href=\"https:\/\/www.ircam.fr\/fr\">IRCAM\u2019s<\/a> <a href=\"https:\/\/www.ircam.fr\/fr\/recherche\/equipes-recherche\/interaction-son-musique-mouvement\">ISMM<\/a> team, the tool remains relatively discreet, known mainly within circles of musical research and experimental sound creation.<br\/>  <\/p>\n\n<h2 class=\"wp-block-heading\"><strong>What is it for? Playing sound through movement <\/strong><\/h2>\n\n<p>\u201cCo\u201d stands for collective. \u201cMo\u201d stands for movement. 
Two words that capture what CoMo aims to do: enable groups of people to play with sound together by moving.<br\/> \u201cThe main idea was to do things collectively. Not just an interface for a single user, but a group of people interacting together,\u201d explains <a href=\"https:\/\/www.ircam.fr\/fr\/people\/frdric-bevilacqua\">Fr\u00e9d\u00e9ric Bevilacqua<\/a>, Research Director at IRCAM and head of the ISMM team (Interaction Sound Music Movement).<br\/><br\/>This focus on gesture as a sonic interface is not new: the ISMM team has been working on gesture-based sound control since the mid-2000s. <a href=\"https:\/\/apps.ismm.ircam.fr\/como\">CoMo<\/a>, whose earliest versions date back to 2017, builds on this lineage. The tool is accessible directly through a web browser, requires no installation, and is open source. For motion sensing, no specialized hardware is needed: a smartphone is enough. It is used not for its screen, which is deliberately diverted from its usual function, but for its built-in accelerometers and gyroscopes. The different versions of the application, mainly developed by <a href=\"https:\/\/www.stms-lab.fr\/person\/benjamin-matuszewski\">Benjamin Matuszewski<\/a> (CoMo Elements, <a href=\"https:\/\/apps.ismm.ircam.fr\/como-elements\">CoMo Vox<\/a>, <a href=\"https:\/\/www.ircam.fr\/fr\/action-culturelle\/en-maternelle\">CoMo.Education<\/a>, and <a href=\"https:\/\/www.stms-lab.fr\/projects\/pages\/sonification-du-mouvement-pour-la-reeducation\">CoMo Rehabilitation<\/a>), share the same core engine, with interfaces adapted to different contexts of use.
<\/p>\n\n<p><\/p>\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1800\" height=\"1351\" src=\"https:\/\/hacnumedia.org\/wp-content\/uploads\/2026\/05\/unnamed-2-1800x1351.jpg\" alt=\"\" class=\"wp-image-5817\" srcset=\"https:\/\/hacnumedia.org\/wp-content\/uploads\/2026\/05\/unnamed-2-1800x1351.jpg 1800w, https:\/\/hacnumedia.org\/wp-content\/uploads\/2026\/05\/unnamed-2-900x675.jpg 900w, https:\/\/hacnumedia.org\/wp-content\/uploads\/2026\/05\/unnamed-2-768x576.jpg 768w, https:\/\/hacnumedia.org\/wp-content\/uploads\/2026\/05\/unnamed-2-1536x1153.jpg 1536w, https:\/\/hacnumedia.org\/wp-content\/uploads\/2026\/05\/unnamed-2.jpg 1999w\" sizes=\"auto, (max-width: 740px) 100vw, 740px\" \/><\/figure>\n\n<p><\/p>\n\n<h2 class=\"wp-block-heading\"><strong>What makes it distinctive: interacting with the body, not with prompts<\/strong><\/h2>\n\n<p>CoMo uses machine learning techniques. But calling it \u201cAI\u201d in 2026 is almost misleading.<br\/> \u201cNow, compared to the popular imagination, we can\u2019t really call this AI anymore,\u201d acknowledges Fr\u00e9d\u00e9ric Bevilacqua. <br\/> Unlike <a href=\"https:\/\/fr.wikipedia.org\/wiki\/Grand_mod%C3%A8le_de_langage\">large language models<\/a>, CoMo belongs to what researchers call interactive machine learning: more traditional, lightweight models that can be trained with very little data, sometimes a single gesture, in just a few seconds, directly on a smartphone processor.<br\/><br\/><em>\u201cWithin the realm of small data, these are really the smallest. Tiny, tiny models,\u201d<\/em> Fr\u00e9d\u00e9ric summarizes. The term \u201cAI\u201d was deliberately avoided so as not to put off audiences drawn to gesture, the body, and music. Another key distinction between CoMo and current AI tools lies in the role of training within the process. There is no separate training phase: you record a gesture, associate it with a sound, test it, adjust it. 
All of this unfolds within the same creative gesture.<br\/><br\/><br\/><em>\u201cTraining is fully integrated into the design. At any moment, you can record or modify. It\u2019s extremely flexible,\u201d  <\/em>explains Fr\u00e9d\u00e9ric.<\/p>\n\n<p><\/p>\n\n<figure class=\"wp-block-image size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"1800\" height=\"1200\" src=\"https:\/\/hacnumedia.org\/wp-content\/uploads\/2026\/05\/Design-sans-titre-9.png\" alt=\"\" class=\"wp-image-5788\" srcset=\"https:\/\/hacnumedia.org\/wp-content\/uploads\/2026\/05\/Design-sans-titre-9.png 1800w, https:\/\/hacnumedia.org\/wp-content\/uploads\/2026\/05\/Design-sans-titre-9-900x600.png 900w, https:\/\/hacnumedia.org\/wp-content\/uploads\/2026\/05\/Design-sans-titre-9-600x400.png 600w, https:\/\/hacnumedia.org\/wp-content\/uploads\/2026\/05\/Design-sans-titre-9-768x512.png 768w, https:\/\/hacnumedia.org\/wp-content\/uploads\/2026\/05\/Design-sans-titre-9-1536x1024.png 1536w\" sizes=\"auto, (max-width: 740px) 100vw, 740px\" \/><\/figure>\n\n<p><\/p>\n\n<h2 class=\"wp-block-heading\"><strong>How does it work? <\/strong><\/h2>\n\n<p>In practice, using CoMo comes down to four steps:<br\/>record a gesture<br\/>associate it with a sound<br\/>test<br\/>refine<br\/><br\/>Repeat as many times as needed. The workflow is intentionally minimal\u2014and that is precisely where the tool\u2019s strength lies.<br\/>To get started, there are two scenarios. The simplest is to connect directly to the version hosted by IRCAM, accessible through any browser. No installation, no account required. The available sounds are predefined and the session is temporary, but this is more than sufficient for a first workshop or quick introduction.<br\/>The second scenario, for those wishing to work with their own sounds, involves installing CoMo on a local machine. 
This requires some technical familiarity (among other constraints, modern browsers now require a secure HTTPS connection before a page can access the motion sensors), although a new application currently under development should soon simplify this step. Once CoMo is installed, a simple Wi-Fi router is enough to create a local network to which smartphones can connect. For practitioners already equipped with creative tools, bridges with <a href=\"https:\/\/cycling74.com\/products\/max\">Max<\/a> \/ <a href=\"http:\/\/www.ableton.com\/fr\/live\/max-for-live\/\">Max For Live<\/a>, <a href=\"https:\/\/derivative.ca\">TouchDesigner<\/a>, or other creative development environments are also being prepared.<br\/><\/p>\n\n<figure class=\"wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio\"><div class=\"wp-block-embed__wrapper\">\n<iframe loading=\"lazy\" title=\"CoMo - Element tutorial\" width=\"640\" height=\"360\" src=\"https:\/\/www.youtube.com\/embed\/XbCb_TAMbDA?feature=oembed\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen><\/iframe>\n<\/div><\/figure>\n\n<h2 class=\"wp-block-heading\"><strong>Examples of use<\/strong><\/h2>\n\n<p>Artist-researcher <a href=\"https:\/\/hugoscurto.com\/fr\/\">Hugo Scurto<\/a> followed the development of CoMo from within, working alongside the ISMM team without being directly involved in its design. Their first workshops with the tool, held with children, took place at the Beaux-Arts in Marseille:<br\/> \u201cThe idea was to create sound-based storytelling through movement. 
We recorded sounds in the street, then retold them through gesture.\u201d<br\/>Participants receive smartphones, learn a few gestures, and compose short performative scenes\u2014all without writing a single line of code.<br\/>Since September 2025, this exploration has continued in a more unexpected setting. Supported by the Association R\u00e9gionale pour l\u2019Int\u00e9gration (ARI), Hugo now leads weekly sessions in a child psychiatric care center with four children, alongside a psychologist and a psychomotor therapist. Second-hand smartphones are attached to foam balls, allowing movement without focusing on the screen\u2014so that the object becomes musical.<br\/>In this context, it is often when the application misbehaves that something truly happens.<br\/> \u201cIt\u2019s almost the error that becomes more generative than the perfectly smooth functioning of the algorithm,\u201d Hugo observes.<br\/> <\/p>\n\n<p>In 2019, composer <a href=\"https:\/\/michelleagnes.net\">Michelle Agnes Magalhaes<\/a> pushed the tool in an unexpected direction during a residency at IRCAM with <a href=\"https:\/\/michelleagnes.net\/constellactions\/\">Constella(c)tions<\/a>, a piece that does not rely on gesture recognition. Instead of a defined gesture vocabulary, the work maps raw phone parameters\u2014energy, orientation, speed\u2014directly onto sound filters. Performers wear smartphones on their wrists and interact with large physical ropes, while the audience takes part as well. 
The piece unfolds in varied spaces, off-stage, with a permeability between performers and spectators at the core of the proposal.<br\/>This example shows that CoMo can be diverted far beyond its basic functioning, and that the \u201cco\u201d in its name is more than a promise.<br\/>    <\/p>\n\n<figure class=\"wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio\"><div class=\"wp-block-embed__wrapper\">\n<iframe loading=\"lazy\" title=\"&quot;Constella(c)tions&quot; by Michelle Agnes Magalhaes\" width=\"640\" height=\"360\" src=\"https:\/\/www.youtube.com\/embed\/0ZJtb2QZ6Ac?feature=oembed\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen><\/iframe>\n<\/div><\/figure>\n\n<h2 class=\"wp-block-heading\"><strong>Three tips for getting started with CoMo<\/strong><\/h2>\n\n<p>Start with the online version\u2014don\u2019t rush into installation<br\/>CoMo is accessible directly through a browser, without installation or account creation. This is the best entry point: connect, record a gesture, link it to a sound, and within seconds the principle becomes clear. There is no need to attempt a local installation before exploring what the tool already offers.<br\/>  <\/p>\n\n<p>Choose bold gestures rather than precise ones<br\/>A common beginner\u2019s impulse is to seek perfect recognition. It is better to begin with highly contrasted gestures\u2014for instance, a large sweeping motion versus a still posture. CoMo is not a precision tool; it is a playful one. Accepting imprecision\u2014even error\u2014is often what unlocks its most interesting possibilities.<br\/>   <\/p>\n\n<p>Think collectively from the outset<br\/>CoMo can be used alone, but that is not where it feels most alive. 
The tool was designed for multiple participants to interact simultaneously, share models, and respond to one another. A workshop with two or three people\u2014even informal\u2014reveals dimensions of the tool that solitary exploration rarely uncovers.<br\/>  <\/p>\n\n<p><\/p>\n\n<p class=\"is-style-signature\">Romain Astouric<\/p>\n\n<p><\/p>\n","protected":false},"excerpt":{"rendered":"<p>In this new series, HACNUM\u00e9dia invites readers to explore the tools shaping contemporary digital creation. Not only the most visible [&hellip;]<\/p>\n","protected":false},"author":7,"featured_media":5791,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[37],"tags":[42,38],"class_list":["post-5792","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-tutorial","tag-ai","tag-hybrid-creation","entry"],"_links":{"self":[{"href":"https:\/\/hacnumedia.org\/en\/wp-json\/wp\/v2\/posts\/5792","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/hacnumedia.org\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/hacnumedia.org\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/hacnumedia.org\/en\/wp-json\/wp\/v2\/users\/7"}],"replies":[{"embeddable":true,"href":"https:\/\/hacnumedia.org\/en\/wp-json\/wp\/v2\/comments?post=5792"}],"version-history":[{"count":2,"href":"https:\/\/hacnumedia.org\/en\/wp-json\/wp\/v2\/posts\/5792\/revisions"}],"predecessor-version":[{"id":5819,"href":"https:\/\/hacnumedia.org\/en\/wp-json\/wp\/v2\/posts\/5792\/revisions\/5819"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/hacnumedia.org\/en\/wp-json\/wp\/v2\/media\/5791"}],"wp:attachment":[{"href":"https:\/\/hacnumedia.org\/en\/wp-json\/wp\/v2\/media?parent=5792"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/hacnumedia.org\/en\/wp-json\/wp\/v2\/categories?post=5792"},{"taxonomy":"post_tag","e
mbeddable":true,"href":"https:\/\/hacnumedia.org\/en\/wp-json\/wp\/v2\/tags?post=5792"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}