Microsoft’s latest patent reveals a Copilot able to compose music that matches videos and PowerPoint presentations
The technology has been patented, so who knows?
Updated on October 7, 2024
While the Redmond-based tech giant has started updating Copilot with a brand-new interface that makes the AI model stand out with a sleek look, it seems the company has even bigger plans for it.
A recently published patent shows that Microsoft is developing an artificial intelligence model for composing audio scores, one that can create music or audio to match videos, text, PowerPoint presentations, virtual reality environments, or even video games in development.
The paper, suggestively titled “Artificial intelligence model for composing audio scores,” discusses the methods this Copilot would use to create music.
First, it would gather a large amount of training data, including many audiovisual datasets that contain both video and audio components.
Each of these datasets is analyzed to extract different types of features. For example, it would look at the video’s visual features, such as colors, shapes, movements, and scenes. Any text that appears in the video, such as subtitles or on-screen text, would also be extracted. Lastly, it would extract in-video audio features: sounds and music that are already present in the video and are not part of a musical score.
After extracting these features, Copilot would analyze them and look for correlations between them. For example, certain scenes (like a sunset) often have specific types of music (like calm, soothing tunes).
Copilot would be trained on these features and, using the correlations it has learned, would generate audio scores that match the visual and textual features of new videos.
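To make the idea concrete, here is a minimal sketch in Python of the "extract features, learn correlations, suggest a score" loop described above. It is only an illustration and not the method from the patent: the training clips, the feature tags, the style labels, and the function names `learn_correlations` and `suggest_style` are all hypothetical, and simple co-occurrence counting stands in for whatever model Microsoft would actually train.

```python
from collections import Counter, defaultdict

# Hypothetical toy training set (an assumption for illustration only):
# each clip is (visual features, on-screen text features, score style).
TRAINING_CLIPS = [
    ({"sunset", "beach"}, {"goodbye"}, "calm"),
    ({"sunset", "mountains"}, set(), "calm"),
    ({"car chase", "city"}, {"run"}, "tense"),
    ({"city", "night"}, {"run"}, "tense"),
]


def learn_correlations(clips):
    """Count how often each feature co-occurs with each score style."""
    counts = defaultdict(Counter)
    for visual, text, style in clips:
        for feature in visual | text:
            counts[feature][style] += 1
    return counts


def suggest_style(counts, visual, text):
    """Pick the style most associated with a new clip's features.

    Returns None when none of the features were seen in training.
    """
    votes = Counter()
    for feature in visual | text:
        votes.update(counts.get(feature, Counter()))
    return votes.most_common(1)[0][0] if votes else None


correlations = learn_correlations(TRAINING_CLIPS)
print(suggest_style(correlations, {"sunset", "lake"}, set()))  # prints "calm"
```

In this toy version, a new clip tagged with a sunset is matched to a calm score because sunsets co-occurred with calm music in the training clips, which mirrors the sunset example above. A real system would work on learned audio and video representations rather than hand-written tags.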
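In real life, this technology could be used in various applications, such as scoring videos, PowerPoint presentations, virtual reality experiences, and video games in development.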
By automating the process of composing audio scores, Copilot could also save time and help ensure that the audio complements the visual content.
It’s worth mentioning that Copilot can already create music, at a very rudimentary level, using the Suno plugin, which was released earlier this year.
However, an improvement on that plugin would be more than welcome. It would allow creators to pin down their product’s music concept before pitching it to an actual music composer.
While the question of whether this could replace music composers deserves consideration, giving Copilot the ability to compose music would ultimately streamline productivity down the line. But what are your thoughts on this?
You can read the paper here.
Flavius Floare
Tech Journalist
Flavius is a writer and a media content producer with a particular interest in technology, gaming, media, film and storytelling.
He’s always curious and ready to take on everything new in the tech world, covering Microsoft’s products on a daily basis. His passion for gaming and hardware feeds his journalistic approach, making him a great researcher and news writer who’s always ready to bring you the bleeding edge!