Cloudinary Blog

DAM for Developers: AI-Based Content Analysis

DAM for Developers: AI-Based Content Analysis

Content analysis on images based on artificial intelligence (AI) is yet another powerful feature of Cloudinary’s digital asset management (DAM) solution. This article explains the types of analysis Cloudinary DAM can perform for you.

To run an analysis on an image in your Media Library, click the image from the dashboard and then the Manage button near the top right corner. A pane then opens on the left with three tabs at the top: Summary, Metadata, and Analysis, as shown below:

Analysis Tab for an Image Analysis Tab for an Image

Click the Analysis tab for the related display: ai based analysis

Cloudinary already runs some basic, AI-based analysis on the images you’ve uploaded to your Cloudinary account’s Media Library. As the above screenshot shows, you must register for add-ons before certain types of analysis can occur.

Following are the analyses you can run for your images with Cloudinary DAM.

Auto-Tagging

Thanks to deep-learning AI models, which categorize images’ object types, you can automate the process of marking each of those categories as tags for your images.

First, click Register under Free Registration for Amazon Rekognition, as shown here: Amazon Rekognition

Now run the analysis to show the objects in the image. See this example: Amazon analysis

You can then assign all those categories as tags to your images. What a time-saver!

Detection of Faces and Celebrities

Face Detection

By leveraging AI’s face-detection algorithms, Cloudinary can name the celebrities in an uploaded image. Each face- or celebrity-detection option displays in the preview pane on the right—in color-coded frames—the coordinates of the faces it finds. To show or hide those frames, click the Show or Hide button.

Also displayed is the accuracy and confidence level of the celebrities in the image.

To activate this feature, click Register under Free Registration for Amazon Rekognition: Celebrity Detection, like this:

Celebrity-Detection Activation of Celebrity-Detection Add-On

Now run the analysis to find out who the celebrity in the image is.

President Obama Detection of Celebrity

Sure enough, Cloudinary detected the celebrity as Barack Obama with 100-percent accuracy. Bravo!

Detection of Optical Characters

Cloudinary DAM can read the text in uploaded images by running an analysis of optical-character recognition (OCR) on them.

OCR Image With Text Before Cloudinary DAM’s OCR Analysis

To activate that feature, click Register under Free Registration for OCR Optical Character Recognition. Afterwards, run the OCR analysis. Here’s an example of the result:

text detection

Color Analysis

Subsequent to a color analysis that yields information about the main colors in an image, Cloudinary DAM can automatically add background padding to it in an attractive color.

Additionally, with the premium feature Image Analysis, an advanced Media Library search capability, you can search for images based on their predominant colors.

Color Analysis Color Analysis of an Image on Cloudinary DAM

Conclusion

Cloudinary DAM’s AI-based content analysis capability serves innumerable use cases for developers. Do share the creative ways you’re applying it to your projects. We can all learn from one another’s ideas. Check out my next post in the DAM for developers series about access control.


DAM for Developers: Series


Further Reading on Digital Asset Management

Recent Blog Posts

Our $2B Valuation

By
Blackstone Growth Invests in Cloudinary

When we started our journey in 2012, we were looking to improve our lives as developers by making it easier for us to handle the arduous tasks of handling images and videos in our code. That initial line of developer code has evolved into a full suite of media experience solutions driven by a mission that gradually revealed itself over the course of the past 10 years: help companies unleash the full potential of their media to create the most engaging visual experiences.

Read more
Direct-to-Consumer E-Commerce Requires Compelling Visual Experiences

When brands like you adopt a direct–to-consumer (DTC) e-commerce approach with no involvement of retailers or marketplaces, you gain direct and timely insight into evolving shopping behaviors. Accordingly, you can accommodate shoppers’ preferences by continually adjusting your product offering and interspersing the shopping journey with moments of excitement and intrigue. Opportunities abound for you to cultivate engaging customer relationships.

Read more
Automatically Translating Videos for an International Audience

No matter your business focus—public service, B2B integration, recruitment—multimedia, in particular video, is remarkably effective in communicating with the audience. Before, making video accessible to diverse viewers involved tasks galore, such as eliciting the service of production studios to manually dub, transcribe, and add subtitles. Those operations were costly and slow, especially for globally destined content.

Read more
Cloudinary Helps Minted Manage Its Image-Generation Pipeline at Scale

Shoppers return time and again to Minted’s global online community of independent artists and designers because they know they can count on unique, statement-making products of the highest quality there. Concurrently, the visual imagery on Minted.com must do justice to the designs into which the creators have poured their hearts and souls. For Minted’s VP of Engineering David Lien, “Because we are a premium brand, we need to ensure that every single one of our product images matches the selected configuration exactly. For example, if you pick an 18x24 art print on blue canvas, we will show that exact combination on the hero images in the PDF.”

Read more
Highlights on ImageCon 2021 and a Preview of ImageCon 2022

New year, same trend! Visual media will continue to play a monumental role in driving online conversions. To keep up with visual-experience trends and best practices, Cloudinary holds an annual conference called ImageCon, a one-of-a-kind event that helps attendees create the most engaging visual experiences possible.

Read more