Cloudinary Blog

Enhance Audio With APIs Only

Dolby’s Enhance API Effectively Spiffs Up Audio Quality

Now that telecommuting for work will likely continue to be the norm, people are communicating more and more by audio and video. In fact, many professional creators are now running projects from their home setup.

Not all digital content is created equal, however. Audio quality, for one thing, is a key factor that distinguishes between mediocrity and excellence. Noisy and imbalanced audio is distracting, often causing the audience to click away. As a remedy, you need professional-standard microphones and sound booths, which are, unfortunately, rare finds in a home configuration.

So what can you do? Just keep producing poor audio?

Luckily, no. Modern technology is here to help.

One solution is Dolby’s Enhance API, which improves, transforms, and corrects audio. Besides describing that API’s features along with a real-world example, this article also explains how to leverage them alongside Cloudinary to enhance video as a whole.

Who knows? After reading this piece, you might be inspired to spruce up your audio with the tips. Every success originates from a single step.

Features of Enhance API

Dolby’s Enhance API helps businesses augment the audio of their media, streaming, and communications with the following capabilities:

  • Removal of unwanted background noise, which eradicates ambient noise: wind, traffic, air conditioners.
  • Correction of loudness, which balances the volume of the overloud or overquiet voices on budget microphones.
  • **Isolation of adaptive speech, which makes primary sounds stand out from the background, e.g., by enhancing a speaker’s voice while minimizing the crowd’s chatter.
  • Reduction of sibilance, which lowers, with four different intensities, the natural hissing sounds that tend to arise when people say such words as “sip” or “zip.”
  • Removal of plosives, which eliminates wind and popping sounds.
  • Balance of talkers, which raises the voice of those who talk softly and moderates the volume if multiple people are talking.

Listen to this noisy and unbalanced audio of Teddy Roosevelt and then compare it to this enhanced version, which makes use of most of the Enhance API’s attributes. You’ll immediately detect the difference: The processed version, significantly quieter and more even keeled, delivers a much more pleasant listening experience.

You can create high-quality audio with the Enhance API for countless purposes—from podcasts to EdTech solutions to social media and live events—and you don’t need an audio engineering background to do that. For details, check out Dolby’s extensive documentation.

Enhanced Live Audio In Action

Have a look at a real-life example.

Cloudinary’s video podcasts, MX Matters and DevJams, which focuses on media trends worldwide, features interviews with international creators who share their expertise. Because we aren’t all audio professionals, creators talk with whatever devices we have, often ones with subpar built-in microphones. The result is a poor listening experience for the listeners.

On the flip side, those speakers with top-grade audio equipment are a stark contrast. Thanks to the superior audio setup, their presentations sound fantastic: no balance issues, no background noise, no inappropriate loudness.

After researching solutions to improve the listener experience, Cloudinary found that they could run MX Matters through the Enhance API and generate audio quality that’s comparable to that produced by the guests with professional equipment. The refined tracks are clean, balanced, and more polished and presentable than the original—a pleasure to listen to.

The Cloudinary Accompaniment

Cloudinary helps businesses generate captivating visuals and user-generated content (UGC) with APIs. If you’re already leveraging Cloudinary for those ends, why not also adopt Dolby’s audio-enhancing capabilities? Subsequently, you’ll reap an enjoyable audio-visual experience for your audience without investing in sophisticated equipment.

Here’s a workflow to consider for using the Enhance API with Cloudinary-hosted content:

  1. Extract the audio from the video.
  2. Clean, enhance, and transform the audio with Dolby’s Media Processing APIs.
  3. Stitch the enhanced audio back into the video before applying Cloudinary’s features.

By storing audio, video, and image files on Cloudinary, you have two options for optimizing the audio without impacting your application:

  • Write code for a new upload to process the audio file through the Enhance API and then upload the enhanced version to Cloudinary.
  • Build a function to fetch the audio, process the audio file through the Enhance API, and then reupload the enhanced version to Cloudinary.

Closing Thoughts

Audio quality can make or break projects. To clearly distinguish your sound recording from the background noise, turn up your audio game. An easy and economical way to do so is to give your audio tracks a professional makeover with Dolby’s Enhance API, which has proven to be inordinately helpful for businesses and projects that rely on UGC, for creators on a limited budget, and even for companies with remote employees.

Combine the Enhance API with Cloudinary, and you’ll be able to master your digital brand and position your content up there with the pros. That’s because Cloudinary not only serves as a repository for your imagery, audio, and video, but also seamlessly integrates them across all the platforms you’re on.

Bottom line: Cloudinary offers endless possibilities for enhancing digital content, requiring no audio or video engineering expertise whatsoever. Visit Cloudinary for details.

Recent Blog Posts

Our $2B Valuation

By
Blackstone Growth Invests in Cloudinary

When we started our journey in 2012, we were looking to improve our lives as developers by making it easier for us to handle the arduous tasks of handling images and videos in our code. That initial line of developer code has evolved into a full suite of media experience solutions driven by a mission that gradually revealed itself over the course of the past 10 years: help companies unleash the full potential of their media to create the most engaging visual experiences.

Read more
Direct-to-Consumer E-Commerce Requires Compelling Visual Experiences

When brands like you adopt a direct–to-consumer (DTC) e-commerce approach with no involvement of retailers or marketplaces, you gain direct and timely insight into evolving shopping behaviors. Accordingly, you can accommodate shoppers’ preferences by continually adjusting your product offering and interspersing the shopping journey with moments of excitement and intrigue. Opportunities abound for you to cultivate engaging customer relationships.

Read more
Automatically Translating Videos for an International Audience

No matter your business focus—public service, B2B integration, recruitment—multimedia, in particular video, is remarkably effective in communicating with the audience. Before, making video accessible to diverse viewers involved tasks galore, such as eliciting the service of production studios to manually dub, transcribe, and add subtitles. Those operations were costly and slow, especially for globally destined content.

Read more
Cloudinary Helps Minted Manage Its Image-Generation Pipeline at Scale

Shoppers return time and again to Minted’s global online community of independent artists and designers because they know they can count on unique, statement-making products of the highest quality there. Concurrently, the visual imagery on Minted.com must do justice to the designs into which the creators have poured their hearts and souls. For Minted’s VP of Engineering David Lien, “Because we are a premium brand, we need to ensure that every single one of our product images matches the selected configuration exactly. For example, if you pick an 18x24 art print on blue canvas, we will show that exact combination on the hero images in the PDF.”

Read more
Highlights on ImageCon 2021 and a Preview of ImageCon 2022

New year, same trend! Visual media will continue to play a monumental role in driving online conversions. To keep up with visual-experience trends and best practices, Cloudinary holds an annual conference called ImageCon, a one-of-a-kind event that helps attendees create the most engaging visual experiences possible.

Read more