Microsoft’s latest patent reveals a Copilot able to compose music that matches videos and PowerPoint presentations

The technology has been patented, so who knows?

3 min. read

Updated on October 7, 2024


While the Redmond-based tech giant has started updating Copilot with a brand-new interface that makes the AI model stand out with a sleek look, it seems the company has even bigger plans for it.

In a recently published patent, Microsoft describes an artificial intelligence model for composing audio scores that can create music or audio to match videos, text, PowerPoint presentations, virtual reality environments, or even video games in development.

The paper, aptly titled “Artificial intelligence model for composing audio scores,” discusses the methods this Copilot would use to create music.

First, the system would collect a large amount of training data, made up of many audiovisual datasets that contain both video and audio components.

Each of these datasets is analyzed to extract different types of features. For example, the model would look at the video’s visual features, such as colors, shapes, movements, and scenes. Any text that appears in the video, such as subtitles or on-screen text, would also be extracted. Lastly, it would extract in-video audio features, meaning sounds and music that are already present in the video and are not part of a musical score.

After extracting these features, Copilot would analyze them and look for correlations between them. For example, certain scenes (like a sunset) often come with specific types of music (like calm, soothing tunes).

Copilot would be trained on these features and, using the correlations it has learned, would generate audio scores that match the visual and textual features of new videos.
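To make the idea of correlating features with music more concrete, here is a minimal Python sketch. It is not the method described in the patent: it simply counts how often hand-written feature tags appear alongside a music style label, and then suggests a style for a new clip instead of generating any audio. Every tag, style label, and function name in it is a hypothetical chosen for illustration.

```python
from collections import Counter, defaultdict

# Hypothetical training clips: every tag and style label here is invented.
TRAINING_CLIPS = [
    {"visual": ["sunset", "beach"], "text": ["goodbye"], "score_style": "calm"},
    {"visual": ["sunset", "mountains"], "text": [], "score_style": "calm"},
    {"visual": ["car chase", "city"], "text": ["run"], "score_style": "tense"},
    {"visual": ["city", "crowd"], "text": ["run"], "score_style": "tense"},
    {"visual": ["beach", "crowd"], "text": ["party"], "score_style": "upbeat"},
]


def clip_features(clip):
    """Flatten visual and textual tags into one list of (kind, tag) pairs."""
    visual = [("visual", tag) for tag in clip["visual"]]
    text = [("text", tag) for tag in clip["text"]]
    return visual + text


def learn_correlations(clips):
    """Count how often each feature co-occurs with each score style."""
    counts = defaultdict(Counter)
    for clip in clips:
        for feature in clip_features(clip):
            counts[feature][clip["score_style"]] += 1
    return counts


def suggest_style(counts, new_clip):
    """Add up the counts for the new clip's features and return the top style."""
    votes = Counter()
    for feature in clip_features(new_clip):
        votes.update(counts.get(feature, Counter()))
    if not votes:
        return None  # none of the clip's features were seen in training
    return votes.most_common(1)[0][0]


correlations = learn_correlations(TRAINING_CLIPS)
new_video = {"visual": ["sunset", "beach"], "text": []}
suggested = suggest_style(correlations, new_video)
print(suggested)
```

A real system would work with learned features and produce actual audio, but the toy version shows the basic logic: features that tend to appear with a certain kind of music in the training data pull the suggestion toward that kind of music.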

In real life, this technology can be used in various applications, such as:

By automating the process of composing audio scores, Copilot could also save time and help ensure that the audio complements the visual content.

It’s worth mentioning that the AI model can already create music at a very rudimentary level using the Suno plugin, which was released earlier this year.

However, an improvement on that plugin would be more than welcome. It would allow creators to pin down their product’s music concept before pitching it to an actual music composer.

While the issue of actually replacing a music composer should be considered, ultimately, giving Copilot the ability to compose music would only streamline productivity down the line. But what are your thoughts on this?

You can read the paper here.


Flavius Floare

Tech Journalist

Flavius is a writer and a media content producer with a particular interest in technology, gaming, media, film and storytelling.

He’s always curious and ready to take on everything new in the tech world, covering Microsoft’s products on a daily basis. The passion for gaming and hardware feeds his journalistic approach, making him a great researcher and news writer that’s always ready to bring you the bleeding edge!
