{"id":35641,"date":"2024-09-09T07:00:00","date_gmt":"2024-09-09T14:00:00","guid":{"rendered":"https:\/\/cloudinary.com\/blog\/?p=35641"},"modified":"2025-11-26T17:10:26","modified_gmt":"2025-11-27T01:10:26","slug":"ai-vision-bringing-genai-media-management","status":"publish","type":"post","link":"https:\/\/cloudinary.com\/blog\/ai-vision-bringing-genai-media-management","title":{"rendered":"Cloudinary AI Vision: Bringing GenAI to Visual Media Management"},"content":{"rendered":"\n<p>Introducing AI Vision. Cloudinary&#8217;s latest feature integrates generative AI directly into visual media management to classify, moderate, and describe visual content at scale. Cloudinary AI Vision streamlines visual media management, making it more precise and practical while improving your team&#8217;s productivity.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">What Is AI Vision?<\/h2>\n\n\n\n<p>AI Vision is Cloudinary&#8217;s latest generative AI solution, bringing generative AI capabilities to key media management workflows.<\/p>\n\n\n\n<p>AI Vision uses a generative multimodal LLM enhanced with specialized models, algorithms, and tailored prompts that address existing LLM blind spots. This allows it to interpret and respond to visual content and queries. Essential media management tasks such as content classification, image moderation, and custom descriptions can therefore be automated with AI Vision.<\/p>\n\n\n\n<p>Standard AI models often require complex and extensive training to meet different brands&#8217; needs. AI Vision eliminates the intricacies of model training by offering a flexible, ready-to-use solution that integrates with Cloudinary&#8217;s Digital Asset Management (DAM) platform.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Key Features of AI Vision<\/h2>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Custom taxonomy and image classification<\/strong>. 
AI Vision supports a team&#8217;s unique taxonomy without requiring you to train or fine-tune tagging models. Tags can be paired with custom, specific descriptions to categorize images according to the team&#8217;s branding and organizational needs. This customization allows for accurate tagging based on detailed criteria such as visual product attributes, background color, subject orientation, or demographic details, whatever your organization&#8217;s taxonomy requires.<\/li>\n\n\n\n<li><strong>Content moderation and compliance<\/strong>. AI Vision streamlines brand safety and compliance checks with its automated moderation capabilities. It offers clear answers to compliance-related queries, allowing brands to quickly identify potentially sensitive content and uphold consistent standards across multiple platforms. Whether checking for public figures in images or ensuring that visuals do not contain violent or inappropriate content, AI Vision provides precise, automated moderation at scale.<\/li>\n\n\n\n<li><strong>General questions and tasks<\/strong>. AI Vision enables users to receive detailed, contextually aware answers to questions about their images. By analyzing visual content, the AI identifies objects, scenes, and in-image text, making media assets more searchable and logically organized. For instance, users can ask AI Vision to describe the setting of an image or identify specific elements, such as the number of people or objects in a scene. AI Vision can also handle routine tasks such as suggesting image CTAs and writing captions. This capability allows for more efficient media management and retrieval.<\/li>\n<\/ul>\n\n\n\n<h2 class=\"wp-block-heading\">Using AI Vision<\/h2>\n\n\n\n<p>AI Vision simplifies managing large media libraries by allowing you to create robust generative AI back-end processes. 
To get started, let\u2019s look at a few examples of AI Vision&#8217;s features in action.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">Getting Started<\/h3>\n\n\n\n<p>To use AI Vision, you&#8217;ll need to subscribe to the Cloudinary AI Vision add-on and call it through the Analyze API service. To subscribe, log in to your Cloudinary account and open the add-ons screen under settings. From there, you can activate the add-on and start using AI Vision.<\/p>\n\n\n\n<p>See the <a href=\"https:\/\/cloudinary.com\/documentation\/cloudinary_ai_vision_addon#banner\">AI Vision add-on documentation<\/a> for more information, and the <a href=\"https:\/\/cloudinary.com\/documentation\/analyze_api_guide\">Analyze API guide<\/a> to learn more about the Analyze API service.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">Use Cases<\/h3>\n\n\n\n<p>You use AI Vision by submitting requests to the AI Vision API, which returns its response in JSON format. If you need to store the returned information on the asset, you can use this response in your custom workflows, including <a href=\"https:\/\/home.mediaflows.cloudinary.com\/\">Cloudinary MediaFlows<\/a>. Let&#8217;s concentrate on the API responses for these use cases.<\/p>\n\n\n\n<h4 class=\"wp-block-heading\">Custom Taxonomies<\/h4>\n\n\n\n<p>Users can automate the tagging of images based on specific organizational needs. In this use case, users define their own tags and a custom definition for each one. For example, they can create a taxonomy that identifies whether an image has a human model and whether certain clothing accessories are present. They can then build an automated flow that organizes the images based on the response, which contains only the tags that match the image. Let&#8217;s see this in action below. 
In the request, we\u2019ve defined our tags, and AI Vision has responded with only the tags it detected in the image, &#8220;model&#8221; and &#8220;dress&#8221;:<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large\"><img decoding=\"async\" src=\"https:\/\/cloudinary-marketing-res.cloudinary.com\/image\/upload\/v1764205709\/blog-Cloudinary_AI_Vision-1.jpg\" alt=\"Model wearing a pink and gold dress\"\/><\/figure><\/div>\n\n\n<h5 class=\"wp-block-heading\">Request<\/h5>\n\n\n<pre class=\"wp-block-code\" aria-describedby=\"shcb-language-1\" data-shcb-language-name=\"JavaScript\" data-shcb-language-slug=\"javascript\"><span><code class=\"hljs language-javascript shcb-wrap-lines\">POST \/analysis\/{cloud_name}\/analyze\/ai_vision_tagging\n\n{\n\n\u00a0\u00a0<span class=\"hljs-string\">\"source\"<\/span>: {\n\n\u00a0\u00a0\u00a0\u00a0<span class=\"hljs-string\">\"uri\"<\/span>: <span class=\"hljs-string\">\"https:\/\/res.cloudinary.com\/siedner\/image\/upload\/v1725388251\/cldrop\/pexels-sedat-yetis-248508609-19985033_e5mhxk.jpg\"<\/span>\n\n\u00a0\u00a0},\n\n\u00a0\u00a0<span class=\"hljs-string\">\"tag_definitions\"<\/span>: &#91;\n\n\u00a0\u00a0\u00a0\u00a0{<span class=\"hljs-string\">\"name\"<\/span>: <span class=\"hljs-string\">\"model\"<\/span>, <span class=\"hljs-string\">\"description\"<\/span>: <span class=\"hljs-string\">\"Does the image contain a person?\"<\/span>},\n\n\u00a0\u00a0\u00a0\u00a0{<span class=\"hljs-string\">\"name\"<\/span>: <span class=\"hljs-string\">\"back-facing\"<\/span>, <span class=\"hljs-string\">\"description\"<\/span>: <span class=\"hljs-string\">\"Does the image show someone who is back facing?\"<\/span>},\n\n\u00a0\u00a0\u00a0\u00a0{<span class=\"hljs-string\">\"name\"<\/span>: <span class=\"hljs-string\">\"dress\"<\/span>, <span class=\"hljs-string\">\"description\"<\/span>: <span class=\"hljs-string\">\"Does the image show someone wearing a 
dress?\"<\/span>},\n\n\u00a0\u00a0\u00a0\u00a0{<span class=\"hljs-string\">\"name\"<\/span>: <span class=\"hljs-string\">\"bag\"<\/span>, <span class=\"hljs-string\">\"description\"<\/span>: <span class=\"hljs-string\">\"Does the image show someone holding a handbag?\"<\/span>}\n\n\u00a0\u00a0]\n\n}<\/code><\/span><small class=\"shcb-language\" id=\"shcb-language-1\"><span class=\"shcb-language__label\">Code language:<\/span> <span class=\"shcb-language__name\">JavaScript<\/span> <span class=\"shcb-language__paren\">(<\/span><span class=\"shcb-language__slug\">javascript<\/span><span class=\"shcb-language__paren\">)<\/span><\/small><\/pre>\n\n\n<h5 class=\"wp-block-heading\">Response<\/h5>\n\n\n<pre class=\"wp-block-code\" aria-describedby=\"shcb-language-2\" data-shcb-language-name=\"JSON \/ JSON with Comments\" data-shcb-language-slug=\"json\"><span><code class=\"hljs language-json shcb-wrap-lines\">{ <span class=\"hljs-attr\">\"limits\"<\/span>: { <span class=\"hljs-attr\">\"usage\"<\/span>: { <span class=\"hljs-attr\">\"type\"<\/span>: <span class=\"hljs-string\">\"ai_vision\"<\/span>, <span class=\"hljs-attr\">\"count\"<\/span>: <span class=\"hljs-number\">1925<\/span> } }, <span class=\"hljs-attr\">\"request_id\"<\/span>: <span class=\"hljs-string\">\"5dda92ecfc6925279689b1c840e13745\"<\/span>, <span class=\"hljs-attr\">\"data\"<\/span>: { <span class=\"hljs-attr\">\"entity\"<\/span>: <span class=\"hljs-string\">\"https:\/\/res.cloudinary.com\/siedner\/image\/upload\/v1725388251\/cldrop\/pexels-sedat-yetis-248508609-19985033_e5mhxk.jpg\"<\/span>, <span class=\"hljs-attr\">\"analysis\"<\/span>: { <span class=\"hljs-attr\">\"tags\"<\/span>: &#91; { <span class=\"hljs-attr\">\"name\"<\/span>: <span class=\"hljs-string\">\"model\"<\/span> }, { <span class=\"hljs-attr\">\"name\"<\/span>: <span class=\"hljs-string\">\"dress\"<\/span> } ], <span class=\"hljs-attr\">\"model_version\"<\/span>: <span class=\"hljs-number\">1<\/span> } } }<\/code><\/span><small 
class=\"shcb-language\" id=\"shcb-language-2\"><span class=\"shcb-language__label\">Code language:<\/span> <span class=\"shcb-language__name\">JSON \/ JSON with Comments<\/span> <span class=\"shcb-language__paren\">(<\/span><span class=\"shcb-language__slug\">json<\/span><span class=\"shcb-language__paren\">)<\/span><\/small><\/pre>\n\n\n<h4 class=\"wp-block-heading\">Moderation Checks<\/h4>\n\n\n\n<p>AI Vision can quickly address questions like, &#8220;Is there anything in the image that could be considered violent or disturbing?&#8221; It provides an automated response to help brands meet compliance standards efficiently, especially when dealing with content like UGC and third-party uploads. In this example, we&#8217;ve asked two simple questions regarding the image:<\/p>\n\n\n\n<p><em>&#8220;<\/em>Does it clearly show any logos or other IP?<em>&#8220;<\/em><\/p>\n\n\n\n<p><em>&#8220;<\/em>Does it contain any offensive or NSFW elements?<em>&#8220;<\/em><\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large\"><img decoding=\"async\" src=\"https:\/\/cloudinary-marketing-res.cloudinary.com\/image\/upload\/v1764205711\/blog-Cloudinary_AI_Vision-2.jpg\" alt=\"Couple holding hands in front of hot air balloons in the sky\"\/><\/figure><\/div>\n\n\n<h5 class=\"wp-block-heading\">Request<\/h5>\n\n\n<pre class=\"wp-block-code\" aria-describedby=\"shcb-language-3\" data-shcb-language-name=\"JavaScript\" data-shcb-language-slug=\"javascript\"><span><code class=\"hljs language-javascript shcb-wrap-lines\">POST \/analysis\/{cloud_name}\/analyze\/ai_vision_moderation\n\n{\n\n\u00a0\u00a0<span class=\"hljs-string\">\"source\"<\/span>: {\n\n\u00a0\u00a0\u00a0\u00a0<span class=\"hljs-string\">\"uri\"<\/span>: <span class=\"hljs-string\">\"https:\/\/res.cloudinary.com\/siedner\/image\/upload\/v1725646190\/cldrop\/pexels-mlkbnl-8633368_y8qplj.jpg\"<\/span>\n\n\u00a0\u00a0},\n\n\u00a0\u00a0<span class=\"hljs-string\">\"rejection_questions\"<\/span>: 
&#91;\n\n\u00a0\u00a0\u00a0\u00a0<span class=\"hljs-string\">\"Does it clearly show any logos or other IP?\"<\/span>,\n\n\u00a0\u00a0\u00a0\u00a0<span class=\"hljs-string\">\"Does it contain any offensive or NSFW elements?\"<\/span>\n\n\u00a0\u00a0\u00a0\u00a0]\n\n}<\/code><\/span><small class=\"shcb-language\" id=\"shcb-language-3\"><span class=\"shcb-language__label\">Code language:<\/span> <span class=\"shcb-language__name\">JavaScript<\/span> <span class=\"shcb-language__paren\">(<\/span><span class=\"shcb-language__slug\">javascript<\/span><span class=\"shcb-language__paren\">)<\/span><\/small><\/pre>\n\n\n<h5 class=\"wp-block-heading\">Response&nbsp;<\/h5>\n\n\n<pre class=\"wp-block-code\" aria-describedby=\"shcb-language-4\" data-shcb-language-name=\"JSON \/ JSON with Comments\" data-shcb-language-slug=\"json\"><span><code class=\"hljs language-json shcb-wrap-lines\">{ <span class=\"hljs-attr\">\"limits\"<\/span>: { <span class=\"hljs-attr\">\"usage\"<\/span>: { <span class=\"hljs-attr\">\"type\"<\/span>: <span class=\"hljs-string\">\"ai_vision\"<\/span>, <span class=\"hljs-attr\">\"count\"<\/span>: <span class=\"hljs-number\">2542<\/span> } }, <span class=\"hljs-attr\">\"request_id\"<\/span>: <span class=\"hljs-string\">\"1204425015234600630037349fca1ff6\"<\/span>, <span class=\"hljs-attr\">\"data\"<\/span>: { <span class=\"hljs-attr\">\"entity\"<\/span>: <span class=\"hljs-string\">\"https:\/\/res.cloudinary.com\/siedner\/image\/upload\/v1725387515\/cldrop\/pexels-monurblc-27124723_rghu5p.jpg\"<\/span>, <span class=\"hljs-attr\">\"analysis\"<\/span>: { <span class=\"hljs-attr\">\"responses\"<\/span>: &#91; { <span class=\"hljs-attr\">\"prompt\"<\/span>: <span class=\"hljs-string\">\"Does it clearly show any logos or other IP?\"<\/span>, <span class=\"hljs-attr\">\"value\"<\/span>: <span class=\"hljs-string\">\"no\"<\/span> }, { <span class=\"hljs-attr\">\"prompt\"<\/span>: <span class=\"hljs-string\">\"Does it contain any offensive or NSFW 
elements?\"<\/span>, <span class=\"hljs-attr\">\"value\"<\/span>: <span class=\"hljs-string\">\"no\"<\/span> } ], <span class=\"hljs-attr\">\"model_version\"<\/span>: <span class=\"hljs-number\">1<\/span> } } }<\/code><\/span><small class=\"shcb-language\" id=\"shcb-language-4\"><span class=\"shcb-language__label\">Code language:<\/span> <span class=\"shcb-language__name\">JSON \/ JSON with Comments<\/span> <span class=\"shcb-language__paren\">(<\/span><span class=\"shcb-language__slug\">json<\/span><span class=\"shcb-language__paren\">)<\/span><\/small><\/pre>\n\n\n<h4 class=\"wp-block-heading\">General Queries<\/h4>\n\n\n\n<p>For this example, we asked AI Vision to provide alt text and a caption for the image. AI Vision returned both, keeping the alt text under 100 characters and the caption close to the requested length. This pattern is useful for generating SEO-friendly, accessible text.<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large\"><img decoding=\"async\" src=\"https:\/\/cloudinary-marketing-res.cloudinary.com\/image\/upload\/v1764205713\/blog-Cloudinary_AI_Vision-3.jpg\" alt=\"An exercise room\"\/><\/figure><\/div>\n\n\n<h5 class=\"wp-block-heading\">Request<\/h5>\n\n\n<pre class=\"wp-block-code\" aria-describedby=\"shcb-language-5\" data-shcb-language-name=\"JSON \/ JSON with Comments\" data-shcb-language-slug=\"json\"><span><code class=\"hljs language-json shcb-wrap-lines\">POST \/analysis\/{cloud_name}\/analyze\/ai_vision_general\n\n{\n\n\u00a0\u00a0<span class=\"hljs-attr\">\"source\"<\/span>: {\n\n\u00a0\u00a0\u00a0\u00a0<span class=\"hljs-attr\">\"uri\"<\/span>: <span class=\"hljs-string\">\"https:\/\/res.cloudinary.com\/demo\/image\/upload\/sample.jpg\"<\/span>\n\n\u00a0\u00a0},\n\n\u00a0\u00a0<span class=\"hljs-attr\">\"prompts\"<\/span>: &#91;\n\n\u00a0\u00a0\u00a0\u00a0<span class=\"hljs-string\">\"provide a seo friendly description suitable for an alt tag in under 100 characters\"<\/span>,\n\n\u00a0\u00a0\u00a0\u00a0<span 
class=\"hljs-string\">\"please provide a 25 word caption for this image\"<\/span>\n\n\u00a0\u00a0]\n\n}<\/code><\/span><small class=\"shcb-language\" id=\"shcb-language-5\"><span class=\"shcb-language__label\">Code language:<\/span> <span class=\"shcb-language__name\">JSON \/ JSON with Comments<\/span> <span class=\"shcb-language__paren\">(<\/span><span class=\"shcb-language__slug\">json<\/span><span class=\"shcb-language__paren\">)<\/span><\/small><\/pre>\n\n\n<h5 class=\"wp-block-heading\">Response<\/h5>\n\n\n<pre class=\"wp-block-code\" aria-describedby=\"shcb-language-6\" data-shcb-language-name=\"JSON \/ JSON with Comments\" data-shcb-language-slug=\"json\"><span><code class=\"hljs language-json shcb-wrap-lines\">{ <span class=\"hljs-attr\">\"limits\"<\/span>: { <span class=\"hljs-attr\">\"usage\"<\/span>: { <span class=\"hljs-attr\">\"type\"<\/span>: <span class=\"hljs-string\">\"ai_vision\"<\/span>, <span class=\"hljs-attr\">\"count\"<\/span>: <span class=\"hljs-number\">1386<\/span> } }, <span class=\"hljs-attr\">\"request_id\"<\/span>: <span class=\"hljs-string\">\"e7aa53f6d1aa8a3ecafb91215b573c9b\"<\/span>, <span class=\"hljs-attr\">\"data\"<\/span>: { <span class=\"hljs-attr\">\"entity\"<\/span>: <span class=\"hljs-string\">\"https:\/\/res.cloudinary.com\/siedner\/image\/upload\/v1725386162\/cldrop\/pexels-heyho-7031705_cu9bjc.jpg\"<\/span>, <span class=\"hljs-attr\">\"analysis\"<\/span>: { <span class=\"hljs-attr\">\"responses\"<\/span>: &#91; { <span class=\"hljs-attr\">\"value\"<\/span>: <span class=\"hljs-string\">\"Modern gym with treadmills, equipment, wood floors, and intricate ceiling design by large windows\"<\/span> }, { <span class=\"hljs-attr\">\"value\"<\/span>: <span class=\"hljs-string\">\"State-of-the-art fitness center boasts sleek equipment, hardwood floors, and an eye-catching geometric ceiling. 
Bathed in natural light from floor-to-ceiling windows, it offers a luxurious workout experience with panoramic views.\"<\/span> } ], <span class=\"hljs-attr\">\"model_version\"<\/span>: <span class=\"hljs-number\">1<\/span> } } }<\/code><\/span><small class=\"shcb-language\" id=\"shcb-language-6\"><span class=\"shcb-language__label\">Code language:<\/span> <span class=\"shcb-language__name\">JSON \/ JSON with Comments<\/span> <span class=\"shcb-language__paren\">(<\/span><span class=\"shcb-language__slug\">json<\/span><span class=\"shcb-language__paren\">)<\/span><\/small><\/pre>\n\n\n<h2 class=\"wp-block-heading\">Why Choose Cloudinary AI Vision vs. Traditional LLMs?<\/h2>\n\n\n\n<p>AI Vision stands apart from traditional AI tools by combining visual and textual data for a more comprehensive understanding of content. This intelligence enables businesses to build tailored workflows for media management that will align with unique brand and customer expectations. Unlike standard LLM wrappers, AI Vision integrates a foundational model enhanced with specialized algorithms, prompt engineering, and fine-tuning that address specific industry needs and common LLM blind spots. This tailored approach ensures more accurate, brand-specific outcomes right out of the box.<\/p>\n\n\n\n<p>Additionally, AI Vision democratizes access to advanced AI capabilities, allowing brands to leverage powerful media workflows without needing extensive AI budgets or specialized teams. AI Vision is an out-of-the-box solution that eliminates the complexities of building, training, and hosting custom models, making high-quality AI-driven media management accessible to all teams regardless of size.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Conclusion<\/h2>\n\n\n\n<p><a href=\"https:\/\/cloudinary.com\/documentation\/cloudinary_ai_vision_addon#banner\">AI Vision by Cloudinary<\/a> is a powerful, user-friendly solution for modern media management needs. 
Businesses can classify, moderate, and describe images more efficiently than ever by leveraging generative AI. Whether automating compliance checks, enhancing image searchability, or developing custom tagging workflows, AI Vision delivers the precision necessary to handle large-scale digital asset management. <a href=\"https:\/\/cloudinary.com\/\">Contact us today<\/a> to learn more.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Introducing AI Vision. The latest Cloudinary cutting-edge feature integrates the power of generative AI directly into visual media management to efficiently classify, moderate, and describe visual content at scale. Cloudinary AI Vision streamlines visual media management making it more precise and practical while vastly improving your team&#8217;s productivity. What is AI Vision? AI Vision is [&hellip;]<\/p>\n","protected":false},"author":87,"featured_media":35643,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"_cloudinary_featured_overwrite":false,"footnotes":""},"categories":[1],"tags":[409],"class_list":["post-35641","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-uncategorized","tag-generative-ai"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v25.6 (Yoast SEO v26.9) - https:\/\/yoast.com\/product\/yoast-seo-premium-wordpress\/ -->\n<title>Media Management With AI Vision: Cloudinary\u2019s Generative AI Solution<\/title>\n<meta name=\"description\" content=\"Cloudinary\u2019s AI Vision automates content classification, moderation, and description, transforming your media management workflows for improved efficiency and precision.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/cloudinary.com\/blog\/ai-vision-bringing-genai-media-management\" \/>\n<meta 
property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Cloudinary AI Vision: Bringing GenAI to Visual Media Management\" \/>\n<meta property=\"og:description\" content=\"Cloudinary\u2019s AI Vision automates content classification, moderation, and description, transforming your media management workflows for improved efficiency and precision.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/cloudinary.com\/blog\/ai-vision-bringing-genai-media-management\" \/>\n<meta property=\"og:site_name\" content=\"Cloudinary Blog\" \/>\n<meta property=\"article:published_time\" content=\"2024-09-09T14:00:00+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-11-27T01:10:26+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/res.cloudinary.com\/cloudinary-marketing\/images\/f_auto,q_auto\/v1725575261\/ai_vision-blog\/ai_vision-blog.jpg?_i=AA\" \/>\n\t<meta property=\"og:image:width\" content=\"2000\" \/>\n\t<meta property=\"og:image:height\" content=\"1100\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"melindapham\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"NewsArticle\",\"@id\":\"https:\/\/cloudinary.com\/blog\/ai-vision-bringing-genai-media-management#article\",\"isPartOf\":{\"@id\":\"https:\/\/cloudinary.com\/blog\/ai-vision-bringing-genai-media-management\"},\"author\":{\"name\":\"melindapham\",\"@id\":\"https:\/\/cloudinary.com\/blog\/#\/schema\/person\/0d5ad601e4c3b5be89245dfb14be42d9\"},\"headline\":\"Cloudinary AI Vision: Bringing GenAI to Visual Media 
Management\",\"datePublished\":\"2024-09-09T14:00:00+00:00\",\"dateModified\":\"2025-11-27T01:10:26+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/cloudinary.com\/blog\/ai-vision-bringing-genai-media-management\"},\"wordCount\":1033,\"publisher\":{\"@id\":\"https:\/\/cloudinary.com\/blog\/#organization\"},\"image\":{\"@id\":\"https:\/\/cloudinary.com\/blog\/ai-vision-bringing-genai-media-management#primaryimage\"},\"thumbnailUrl\":\"https:\/\/res.cloudinary.com\/cloudinary-marketing\/images\/f_auto,q_auto\/v1725575261\/ai_vision-blog\/ai_vision-blog.jpg?_i=AA\",\"keywords\":[\"Generative AI\"],\"inLanguage\":\"en-US\",\"copyrightYear\":\"2024\",\"copyrightHolder\":{\"@id\":\"https:\/\/cloudinary.com\/#organization\"}},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/cloudinary.com\/blog\/ai-vision-bringing-genai-media-management\",\"url\":\"https:\/\/cloudinary.com\/blog\/ai-vision-bringing-genai-media-management\",\"name\":\"Media Management With AI Vision: Cloudinary\u2019s Generative AI Solution\",\"isPartOf\":{\"@id\":\"https:\/\/cloudinary.com\/blog\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/cloudinary.com\/blog\/ai-vision-bringing-genai-media-management#primaryimage\"},\"image\":{\"@id\":\"https:\/\/cloudinary.com\/blog\/ai-vision-bringing-genai-media-management#primaryimage\"},\"thumbnailUrl\":\"https:\/\/res.cloudinary.com\/cloudinary-marketing\/images\/f_auto,q_auto\/v1725575261\/ai_vision-blog\/ai_vision-blog.jpg?_i=AA\",\"datePublished\":\"2024-09-09T14:00:00+00:00\",\"dateModified\":\"2025-11-27T01:10:26+00:00\",\"description\":\"Cloudinary\u2019s AI Vision automates content classification, moderation, and description, transforming your media management workflows for improved efficiency and 
precision.\",\"breadcrumb\":{\"@id\":\"https:\/\/cloudinary.com\/blog\/ai-vision-bringing-genai-media-management#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/cloudinary.com\/blog\/ai-vision-bringing-genai-media-management\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/cloudinary.com\/blog\/ai-vision-bringing-genai-media-management#primaryimage\",\"url\":\"https:\/\/res.cloudinary.com\/cloudinary-marketing\/images\/f_auto,q_auto\/v1725575261\/ai_vision-blog\/ai_vision-blog.jpg?_i=AA\",\"contentUrl\":\"https:\/\/res.cloudinary.com\/cloudinary-marketing\/images\/f_auto,q_auto\/v1725575261\/ai_vision-blog\/ai_vision-blog.jpg?_i=AA\",\"width\":2000,\"height\":1100},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/cloudinary.com\/blog\/ai-vision-bringing-genai-media-management#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/cloudinary.com\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Cloudinary AI Vision: Bringing GenAI to Visual Media Management\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/cloudinary.com\/blog\/#website\",\"url\":\"https:\/\/cloudinary.com\/blog\/\",\"name\":\"Cloudinary Blog\",\"description\":\"\",\"publisher\":{\"@id\":\"https:\/\/cloudinary.com\/blog\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/cloudinary.com\/blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/cloudinary.com\/blog\/#organization\",\"name\":\"Cloudinary 
Blog\",\"url\":\"https:\/\/cloudinary.com\/blog\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/cloudinary.com\/blog\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/res.cloudinary.com\/cloudinary-marketing\/images\/f_auto,q_auto\/v1649718331\/Web_Assets\/blog\/cloudinary_logo_for_white_bg_1937437aa7_19374666c7_193742f877\/cloudinary_logo_for_white_bg_1937437aa7_19374666c7_193742f877.png?_i=AA\",\"contentUrl\":\"https:\/\/res.cloudinary.com\/cloudinary-marketing\/images\/f_auto,q_auto\/v1649718331\/Web_Assets\/blog\/cloudinary_logo_for_white_bg_1937437aa7_19374666c7_193742f877\/cloudinary_logo_for_white_bg_1937437aa7_19374666c7_193742f877.png?_i=AA\",\"width\":312,\"height\":60,\"caption\":\"Cloudinary Blog\"},\"image\":{\"@id\":\"https:\/\/cloudinary.com\/blog\/#\/schema\/logo\/image\/\"}},{\"@type\":\"Person\",\"@id\":\"https:\/\/cloudinary.com\/blog\/#\/schema\/person\/0d5ad601e4c3b5be89245dfb14be42d9\",\"name\":\"melindapham\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/cloudinary.com\/blog\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/e6f989fa97fe94be61596259d8629c3df65aec4c7da5c0000f90d810f313d4f4?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/e6f989fa97fe94be61596259d8629c3df65aec4c7da5c0000f90d810f313d4f4?s=96&d=mm&r=g\",\"caption\":\"melindapham\"}}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. 
-->","yoast_head_json":{"title":"Media Management With AI Vision: Cloudinary\u2019s Generative AI Solution","description":"Cloudinary\u2019s AI Vision automates content classification, moderation, and description, transforming your media management workflows for improved efficiency and precision.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/cloudinary.com\/blog\/ai-vision-bringing-genai-media-management","og_locale":"en_US","og_type":"article","og_title":"Cloudinary AI Vision: Bringing GenAI to Visual Media Management","og_description":"Cloudinary\u2019s AI Vision automates content classification, moderation, and description, transforming your media management workflows for improved efficiency and precision.","og_url":"https:\/\/cloudinary.com\/blog\/ai-vision-bringing-genai-media-management","og_site_name":"Cloudinary Blog","article_published_time":"2024-09-09T14:00:00+00:00","article_modified_time":"2025-11-27T01:10:26+00:00","og_image":[{"width":2000,"height":1100,"url":"https:\/\/res.cloudinary.com\/cloudinary-marketing\/images\/f_auto,q_auto\/v1725575261\/ai_vision-blog\/ai_vision-blog.jpg?_i=AA","type":"image\/jpeg"}],"author":"melindapham","twitter_card":"summary_large_image","schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"NewsArticle","@id":"https:\/\/cloudinary.com\/blog\/ai-vision-bringing-genai-media-management#article","isPartOf":{"@id":"https:\/\/cloudinary.com\/blog\/ai-vision-bringing-genai-media-management"},"author":{"name":"melindapham","@id":"https:\/\/cloudinary.com\/blog\/#\/schema\/person\/0d5ad601e4c3b5be89245dfb14be42d9"},"headline":"Cloudinary AI Vision: Bringing GenAI to Visual Media 
Management","datePublished":"2024-09-09T14:00:00+00:00","dateModified":"2025-11-27T01:10:26+00:00","mainEntityOfPage":{"@id":"https:\/\/cloudinary.com\/blog\/ai-vision-bringing-genai-media-management"},"wordCount":1033,"publisher":{"@id":"https:\/\/cloudinary.com\/blog\/#organization"},"image":{"@id":"https:\/\/cloudinary.com\/blog\/ai-vision-bringing-genai-media-management#primaryimage"},"thumbnailUrl":"https:\/\/res.cloudinary.com\/cloudinary-marketing\/images\/f_auto,q_auto\/v1725575261\/ai_vision-blog\/ai_vision-blog.jpg?_i=AA","keywords":["Generative AI"],"inLanguage":"en-US","copyrightYear":"2024","copyrightHolder":{"@id":"https:\/\/cloudinary.com\/#organization"}},{"@type":"WebPage","@id":"https:\/\/cloudinary.com\/blog\/ai-vision-bringing-genai-media-management","url":"https:\/\/cloudinary.com\/blog\/ai-vision-bringing-genai-media-management","name":"Media Management With AI Vision: Cloudinary\u2019s Generative AI Solution","isPartOf":{"@id":"https:\/\/cloudinary.com\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/cloudinary.com\/blog\/ai-vision-bringing-genai-media-management#primaryimage"},"image":{"@id":"https:\/\/cloudinary.com\/blog\/ai-vision-bringing-genai-media-management#primaryimage"},"thumbnailUrl":"https:\/\/res.cloudinary.com\/cloudinary-marketing\/images\/f_auto,q_auto\/v1725575261\/ai_vision-blog\/ai_vision-blog.jpg?_i=AA","datePublished":"2024-09-09T14:00:00+00:00","dateModified":"2025-11-27T01:10:26+00:00","description":"Cloudinary\u2019s AI Vision automates content classification, moderation, and description, transforming your media management workflows for improved efficiency and 
precision.","breadcrumb":{"@id":"https:\/\/cloudinary.com\/blog\/ai-vision-bringing-genai-media-management#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/cloudinary.com\/blog\/ai-vision-bringing-genai-media-management"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/cloudinary.com\/blog\/ai-vision-bringing-genai-media-management#primaryimage","url":"https:\/\/res.cloudinary.com\/cloudinary-marketing\/images\/f_auto,q_auto\/v1725575261\/ai_vision-blog\/ai_vision-blog.jpg?_i=AA","contentUrl":"https:\/\/res.cloudinary.com\/cloudinary-marketing\/images\/f_auto,q_auto\/v1725575261\/ai_vision-blog\/ai_vision-blog.jpg?_i=AA","width":2000,"height":1100},{"@type":"BreadcrumbList","@id":"https:\/\/cloudinary.com\/blog\/ai-vision-bringing-genai-media-management#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/cloudinary.com\/blog\/"},{"@type":"ListItem","position":2,"name":"Cloudinary AI Vision: Bringing GenAI to Visual Media Management"}]},{"@type":"WebSite","@id":"https:\/\/cloudinary.com\/blog\/#website","url":"https:\/\/cloudinary.com\/blog\/","name":"Cloudinary Blog","description":"","publisher":{"@id":"https:\/\/cloudinary.com\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/cloudinary.com\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/cloudinary.com\/blog\/#organization","name":"Cloudinary 
Blog","url":"https:\/\/cloudinary.com\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/cloudinary.com\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/res.cloudinary.com\/cloudinary-marketing\/images\/f_auto,q_auto\/v1649718331\/Web_Assets\/blog\/cloudinary_logo_for_white_bg_1937437aa7_19374666c7_193742f877\/cloudinary_logo_for_white_bg_1937437aa7_19374666c7_193742f877.png?_i=AA","contentUrl":"https:\/\/res.cloudinary.com\/cloudinary-marketing\/images\/f_auto,q_auto\/v1649718331\/Web_Assets\/blog\/cloudinary_logo_for_white_bg_1937437aa7_19374666c7_193742f877\/cloudinary_logo_for_white_bg_1937437aa7_19374666c7_193742f877.png?_i=AA","width":312,"height":60,"caption":"Cloudinary Blog"},"image":{"@id":"https:\/\/cloudinary.com\/blog\/#\/schema\/logo\/image\/"}},{"@type":"Person","@id":"https:\/\/cloudinary.com\/blog\/#\/schema\/person\/0d5ad601e4c3b5be89245dfb14be42d9","name":"melindapham","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/cloudinary.com\/blog\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/e6f989fa97fe94be61596259d8629c3df65aec4c7da5c0000f90d810f313d4f4?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/e6f989fa97fe94be61596259d8629c3df65aec4c7da5c0000f90d810f313d4f4?s=96&d=mm&r=g","caption":"melindapham"}}]}},"jetpack_featured_media_url":"https:\/\/res.cloudinary.com\/cloudinary-marketing\/images\/f_auto,q_auto\/v1725575261\/ai_vision-blog\/ai_vision-blog.jpg?_i=AA","_links":{"self":[{"href":"https:\/\/cloudinary.com\/blog\/wp-json\/wp\/v2\/posts\/35641","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/cloudinary.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/cloudinary.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/cloudinary.com\/blog\/wp-json\/wp\/v2\/users\/87"}],"replies":[{"embeddable":true,"href":"https:\/\/cloudinary.com\/blog\/wp-json\/wp\/v2\/comments?post=35641"}],"version-
history":[{"count":13,"href":"https:\/\/cloudinary.com\/blog\/wp-json\/wp\/v2\/posts\/35641\/revisions"}],"predecessor-version":[{"id":39433,"href":"https:\/\/cloudinary.com\/blog\/wp-json\/wp\/v2\/posts\/35641\/revisions\/39433"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/cloudinary.com\/blog\/wp-json\/wp\/v2\/media\/35643"}],"wp:attachment":[{"href":"https:\/\/cloudinary.com\/blog\/wp-json\/wp\/v2\/media?parent=35641"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/cloudinary.com\/blog\/wp-json\/wp\/v2\/categories?post=35641"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/cloudinary.com\/blog\/wp-json\/wp\/v2\/tags?post=35641"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}