{"id":38821,"date":"2025-10-18T08:52:11","date_gmt":"2025-10-18T15:52:11","guid":{"rendered":"https:\/\/cloudinary.com\/blog\/?p=38821"},"modified":"2025-10-21T13:41:57","modified_gmt":"2025-10-21T20:41:57","slug":"how-to-perform-ocr-on-images-using-java-sdk","status":"publish","type":"post","link":"https:\/\/cloudinary.com\/blog\/questions\/how-to-perform-ocr-on-images-using-java-sdk\/","title":{"rendered":"How to perform OCR on images using Java SDK"},"content":{"rendered":"\n<p>Developers often run into images that contain valuable text: invoices, receipts, scanned forms, ID cards, dashboards, or screenshots. But how do folks find different ways to extract that text cleanly and reliably?&nbsp;<\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Question:<\/h2>\n\n\n\n<p><em>Hi all,<\/em><br><em>I need to extract text from images in a Java application and I am looking for a reliable approach and code examples. Specifically, how to perform OCR on images using Java SDK, what preprocessing steps help accuracy, and how to handle multiple languages. I will be processing images from URLs and from user uploads, sometimes low quality. Tips for scaling this in production would be great too. Thanks!<\/em><\/p>\n\n\n\n<h2 class=\"wp-block-heading\">Answer:<\/h2>\n\n\n\n<p>Great question! OCR quality depends on two things: the OCR engine and the quality of the input image. Here\u2019s how you can do it:<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">1) Pick an OCR engine for Java<\/h3>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Tesseract via Tess4J:<\/strong> open source, runs locally, good for many Latin scripts and more with trained data.<\/li>\n\n\n\n<li><strong>Hosted OCR APIs:<\/strong> Google Cloud Vision, AWS Textract, Azure Computer Vision. Difficult documents and tables usually see higher accuracy, which requires a subscription.<\/li>\n<\/ul>\n\n\n\n<h3 class=\"wp-block-heading\">2) Preprocessing matters<\/h3>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Convert to a suitable format.<\/strong> For text-heavy images, PNG is often better for sharp lines; high-quality JPG can also work. See format tradeoffs here: <a href=\"https:\/\/cloudinary.com\/guides\/image-formats\/jpeg-vs-png\">JPEG vs PNG<\/a>.<\/li>\n\n\n\n<li><strong>Increase contrast, denoise, deskew, and binarize if needed.<\/strong> These steps can dramatically boost OCR precision. See a helpful overview of enhancement ideas: <a href=\"https:\/\/cloudinary.com\/guides\/image-effects\/image-enhancement\">Image Enhancement<\/a>.<\/li>\n\n\n\n<li><strong>Use sufficient resolution.<\/strong> 300 DPI for scans is a common baseline. If you are unsure what DPI means or how it works, check out <a href=\"https:\/\/cloudinary.com\/guides\/image\/dpi-vs-pixels\">DPI vs pixels<\/a>.<\/li>\n<\/ul>\n\n\n\n<h3 class=\"wp-block-heading\">3) Example with Tess4J (local OCR)<\/h3>\n\n\n\n<p>Add Tess4J to your build and make sure you have Tesseract trained data files available. Then:<\/p>\n\n\n<pre class=\"wp-block-code\" aria-describedby=\"shcb-language-1\" data-shcb-language-name=\"PHP\" data-shcb-language-slug=\"php\"><span><code class=\"hljs language-php shcb-wrap-lines\">&lt;dependency&gt;\n\u00a0 &lt;groupId&gt;net.sourceforge.tess4j&lt;\/groupId&gt;\n\u00a0 &lt;artifactId&gt;tess4j&lt;\/artifactId&gt;\n\u00a0 &lt;version&gt;<span class=\"hljs-number\">5.10<\/span><span class=\"hljs-number\">.0<\/span>&lt;\/version&gt;\n&lt;\/dependency&gt;\n\nimport net.sourceforge.tess4j.Tesseract;\nimport net.sourceforge.tess4j.TesseractException;\n\nimport javax.imageio.ImageIO;\nimport java.awt.image.BufferedImage;\nimport java.io.File;\nimport java.net.URL;\n\n<span class=\"hljs-keyword\">public<\/span> <span class=\"hljs-class\"><span class=\"hljs-keyword\">class<\/span> <span class=\"hljs-title\">OcrExample<\/span> <\/span>{\n\u00a0 <span class=\"hljs-keyword\">public<\/span> <span class=\"hljs-keyword\">static<\/span> void main(String&#91;] args) throws <span class=\"hljs-keyword\">Exception<\/span> {\n\u00a0 \u00a0 <span class=\"hljs-comment\">\/\/ 1) Load image from disk<\/span>\n\u00a0 \u00a0 BufferedImage img = ImageIO.read(<span class=\"hljs-keyword\">new<\/span> File(<span class=\"hljs-string\">\"invoice.png\"<\/span>));\n\n\u00a0 \u00a0 <span class=\"hljs-comment\">\/\/ 2) Configure Tesseract<\/span>\n\u00a0 \u00a0 Tesseract tesseract = <span class=\"hljs-keyword\">new<\/span> Tesseract();\n\u00a0 \u00a0 tesseract.setDatapath(<span class=\"hljs-string\">\"\/path\/to\/tessdata\"<\/span>);\u00a0 <span class=\"hljs-comment\">\/\/ folder containing .traineddata files<\/span>\n\u00a0 \u00a0 tesseract.setLanguage(<span class=\"hljs-string\">\"eng\"<\/span>);\u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 <span class=\"hljs-comment\">\/\/ e.g., \"eng\", \"spa\", \"eng+deu\"<\/span>\n\n\u00a0 \u00a0 <span class=\"hljs-comment\">\/\/ 3) Run OCR<\/span>\n\u00a0 \u00a0 <span class=\"hljs-keyword\">try<\/span> {\n\u00a0 \u00a0 \u00a0 String text = tesseract.doOCR(img);\n\u00a0 \u00a0 \u00a0 System.out.println(text);\n\u00a0 \u00a0 } <span class=\"hljs-keyword\">catch<\/span> (TesseractException e) {\n\u00a0 \u00a0 \u00a0 e.printStackTrace();\n\u00a0 \u00a0 }\n\u00a0 }\n}<\/code><\/span><small class=\"shcb-language\" id=\"shcb-language-1\"><span class=\"shcb-language__label\">Code language:<\/span> <span class=\"shcb-language__name\">PHP<\/span> <span class=\"shcb-language__paren\">(<\/span><span class=\"shcb-language__slug\">php<\/span><span class=\"shcb-language__paren\">)<\/span><\/small><\/pre>\n\n\n<p>Reading from a URL is similar:<\/p>\n\n\n<pre class=\"wp-block-code\" aria-describedby=\"shcb-language-2\" data-shcb-language-name=\"JavaScript\" data-shcb-language-slug=\"javascript\"><span><code class=\"hljs language-javascript shcb-wrap-lines\">BufferedImage img = ImageIO.read(<span class=\"hljs-keyword\">new<\/span> URL(<span class=\"hljs-string\">\"https:\/\/example.com\/receipt.jpg\"<\/span>));\n<span class=\"hljs-built_in\">String<\/span> text = tesseract.doOCR(img);<\/code><\/span><small class=\"shcb-language\" id=\"shcb-language-2\"><span class=\"shcb-language__label\">Code language:<\/span> <span class=\"shcb-language__name\">JavaScript<\/span> <span class=\"shcb-language__paren\">(<\/span><span class=\"shcb-language__slug\">javascript<\/span><span class=\"shcb-language__paren\">)<\/span><\/small><\/pre>\n\n\n<h3 class=\"wp-block-heading\">4) Practical accuracy tips<\/h3>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Clean input: <\/strong>crop borders, remove stamps or heavy watermarks, and deskew tilted scans.<\/li>\n\n\n\n<li><strong>Binarize and denoise:<\/strong> thresholding can make text crisper and suppress background patterns.<\/li>\n\n\n\n<li><strong>Use the right language pack:<\/strong> for multilingual docs, combine languages like &#8220;eng+fra&#8221;.<\/li>\n\n\n\n<li><strong>Work in batches: <\/strong>normalize files to consistent dimensions and formats before OCR.<\/li>\n<\/ul>\n\n\n\n<h3 class=\"wp-block-heading\">5) Speed and scaling<\/h3>\n\n\n\n<ul class=\"wp-block-list\">\n<li><strong>Cache results: <\/strong>if an image does not change, persist extracted text and skip repeated OCR.<\/li>\n\n\n\n<li><strong>Parallelize:<\/strong> run OCR in worker threads or microservices. Limit concurrency to available CPU cores.<\/li>\n\n\n\n<li><strong>Preprocess once:<\/strong> keep a normalized copy alongside the original to avoid repeating transforms.<\/li>\n<\/ul>\n\n\n\n<h3 class=\"wp-block-heading\">6) Improve preprocessing and delivery with Cloudinary<\/h3>\n\n\n\n<p>If you already manage <a href=\"https:\/\/cloudinary.com\/blog\/\">media assets<\/a> at scale, you can use Cloudinary to normalize images on the fly before feeding them to your OCR code. For example: fetch a remote image, convert to PNG, apply grayscale, sharpen, strong contrast, and threshold to boost text clarity, then pipe the transformed image into your OCR.<\/p>\n\n\n<pre class=\"wp-block-code\" aria-describedby=\"shcb-language-3\" data-shcb-language-name=\"PHP\" data-shcb-language-slug=\"php\"><span><code class=\"hljs language-php shcb-wrap-lines\">&lt;dependency&gt;\n\u00a0 &lt;groupId&gt;com.cloudinary&lt;\/groupId&gt;\n\u00a0 &lt;artifactId&gt;cloudinary-http44&lt;\/artifactId&gt;\n\u00a0 &lt;version&gt;<span class=\"hljs-number\">1.39<\/span><span class=\"hljs-number\">.0<\/span>&lt;\/version&gt;\n&lt;\/dependency&gt;\n\nimport com.cloudinary.Cloudinary;\nimport com.cloudinary.Transformation;\nimport com.cloudinary.utils.ObjectUtils;\n\nimport javax.imageio.ImageIO;\nimport java.awt.image.BufferedImage;\nimport java.net.URL;\n\nCloudinary cloudinary = <span class=\"hljs-keyword\">new<\/span> Cloudinary(ObjectUtils.asMap(\n\u00a0 <span class=\"hljs-string\">\"cloud_name\"<\/span>, <span class=\"hljs-string\">\"YOUR_CLOUD_NAME\"<\/span>,\n\u00a0 <span class=\"hljs-string\">\"api_key\"<\/span>, <span class=\"hljs-string\">\"YOUR_API_KEY\"<\/span>,\n\u00a0 <span class=\"hljs-string\">\"api_secret\"<\/span>, <span class=\"hljs-string\">\"YOUR_API_SECRET\"<\/span>\n));\n\n<span class=\"hljs-comment\">\/\/ 1) Build a preprocessing URL for a remote image<\/span>\nString preppedUrl = cloudinary.url()\n\u00a0 .type(<span class=\"hljs-string\">\"fetch\"<\/span>)\n\u00a0 .transformation(<span class=\"hljs-keyword\">new<\/span> Transformation()\n\u00a0 \u00a0 .fetchFormat(<span class=\"hljs-string\">\"png\"<\/span>)\u00a0 \u00a0 \u00a0 \u00a0 \u00a0 <span class=\"hljs-comment\">\/\/ lossless for crisp text<\/span>\n\u00a0 \u00a0 .quality(<span class=\"hljs-string\">\"auto\"<\/span>) \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 <span class=\"hljs-comment\">\/\/ sensible default optimization<\/span>\n\u00a0 \u00a0 .effect(<span class=\"hljs-string\">\"grayscale\"<\/span>) \u00a0 \u00a0 \u00a0 \u00a0 <span class=\"hljs-comment\">\/\/ reduce color noise<\/span>\n\u00a0 \u00a0 .effect(<span class=\"hljs-string\">\"sharpen\"<\/span>) \u00a0 \u00a0 \u00a0 \u00a0 \u00a0 <span class=\"hljs-comment\">\/\/ sharpen edges<\/span>\n\u00a0 \u00a0 .effect(<span class=\"hljs-string\">\"contrast:30\"<\/span>) \u00a0 \u00a0 \u00a0 <span class=\"hljs-comment\">\/\/ boost contrast<\/span>\n\u00a0 \u00a0 .effect(<span class=\"hljs-string\">\"threshold:200\"<\/span>) \u00a0 \u00a0 <span class=\"hljs-comment\">\/\/ strong binarization<\/span>\n\u00a0 )\n\u00a0 .generate(<span class=\"hljs-string\">\"https:\/\/example.com\/receipt.jpg\"<\/span>);\n\n<span class=\"hljs-comment\">\/\/ 2) Feed the transformed image into your OCR<\/span>\nBufferedImage img = ImageIO.read(<span class=\"hljs-keyword\">new<\/span> URL(preppedUrl));\nString text = tesseract.doOCR(img);\nSystem.out.println(text);<\/code><\/span><small class=\"shcb-language\" id=\"shcb-language-3\"><span class=\"shcb-language__label\">Code language:<\/span> <span class=\"shcb-language__name\">PHP<\/span> <span class=\"shcb-language__paren\">(<\/span><span class=\"shcb-language__slug\">php<\/span><span class=\"shcb-language__paren\">)<\/span><\/small><\/pre>\n\n\n<p>This pattern centralizes file retrieval, consistent transforms, and delivery. You can also normalize formats upfront based on your needs, drawing on background knowledge like <a href=\"https:\/\/cloudinary.com\/guides\/image-formats\/jpeg-vs-png\">JPEG vs PNG<\/a> and enhancement techniques from <a href=\"https:\/\/cloudinary.com\/guides\/image-effects\/image-enhancement\">Image Enhancement<\/a>.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">TL;DR<\/h3>\n\n\n\n<ul class=\"wp-block-list\">\n<li>Use Tess4J for local OCR or a hosted OCR API for tougher documents and higher accuracy.<\/li>\n\n\n\n<li>Preprocess images: consistent format, higher contrast, grayscale, denoise, binarize, deskew.<\/li>\n\n\n\n<li>Pipeline tip: generate a preprocessed image URL with Cloudinary and pass that into your OCR code for more consistent results at scale.<\/li>\n\n\n\n<li>Mind resolution and readability. See <a href=\"https:\/\/cloudinary.com\/guides\/image\/dpi-vs-pixels\">DPI vs pixels<\/a> to avoid undersampling.<\/li>\n<\/ul>\n\n\n\n<h2 class=\"wp-block-heading\">Learn More<\/h2>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a href=\"https:\/\/cloudinary.com\/tools\/image-to-jpg\">Image to JPG<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/cloudinary.com\/tools\/png-to-webp\">PNG to WebP<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/cloudinary.com\/tools\/avif-to-jpg\">AVIF to JPG<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/cloudinary.com\/background-remover\">Background Remover<\/a><\/li>\n<\/ul>\n\n\n\n<p>Ready to streamline your OCR pipeline with consistent preprocessing, storage, and delivery? <a href=\"https:\/\/cloudinary.com\/users\/register_free\">Create a free Cloudinary account<\/a> and start optimizing today.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Developers often run into images that contain valuable text: invoices, receipts, scanned forms, ID cards, dashboards, or screenshots. But how do folks find different ways to extract that text cleanly and reliably?&nbsp; Question: Hi all,I need to extract text from images in a Java application and I am looking for a reliable approach and code [&hellip;]<\/p>\n","protected":false},"author":88,"featured_media":38822,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"_cloudinary_featured_overwrite":false,"footnotes":""},"categories":[1],"tags":[423],"class_list":["post-38821","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-uncategorized","tag-questions"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v25.6 (Yoast SEO v26.9) - https:\/\/yoast.com\/product\/yoast-seo-premium-wordpress\/ -->\n<title>How to perform OCR on images using Java SDK<\/title>\n<meta name=\"description\" content=\"Developers often run into images that contain valuable text: invoices, receipts, scanned forms, ID cards, dashboards, or screenshots. But how do folks\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/cloudinary.com\/blog\/questions\/how-to-perform-ocr-on-images-using-java-sdk\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"How to perform OCR on images using Java SDK\" \/>\n<meta property=\"og:description\" content=\"Developers often run into images that contain valuable text: invoices, receipts, scanned forms, ID cards, dashboards, or screenshots. But how do folks\" \/>\n<meta property=\"og:url\" content=\"https:\/\/cloudinary.com\/blog\/questions\/how-to-perform-ocr-on-images-using-java-sdk\/\" \/>\n<meta property=\"og:site_name\" content=\"Cloudinary Blog\" \/>\n<meta property=\"article:published_time\" content=\"2025-10-18T15:52:11+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2025-10-21T20:41:57+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/res.cloudinary.com\/cloudinary-marketing\/images\/f_auto,q_auto\/v1760802695\/how_to_perform_ocr_on_images_using_java_sdk_featured_image\/how_to_perform_ocr_on_images_using_java_sdk_featured_image.jpg?_i=AA\" \/>\n\t<meta property=\"og:image:width\" content=\"2000\" \/>\n\t<meta property=\"og:image:height\" content=\"1100\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"damjanantevski\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"NewsArticle\",\"@id\":\"https:\/\/cloudinary.com\/blog\/questions\/how-to-perform-ocr-on-images-using-java-sdk\/#article\",\"isPartOf\":{\"@id\":\"https:\/\/cloudinary.com\/blog\/questions\/how-to-perform-ocr-on-images-using-java-sdk\/\"},\"author\":{\"name\":\"damjanantevski\",\"@id\":\"https:\/\/cloudinary.com\/blog\/#\/schema\/person\/43592e43c12520a1e867d456b1e8cf7e\"},\"headline\":\"How to perform OCR on images using Java SDK\",\"datePublished\":\"2025-10-18T15:52:11+00:00\",\"dateModified\":\"2025-10-21T20:41:57+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/cloudinary.com\/blog\/questions\/how-to-perform-ocr-on-images-using-java-sdk\/\"},\"wordCount\":580,\"publisher\":{\"@id\":\"https:\/\/cloudinary.com\/blog\/#organization\"},\"image\":{\"@id\":\"https:\/\/cloudinary.com\/blog\/questions\/how-to-perform-ocr-on-images-using-java-sdk\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/res.cloudinary.com\/cloudinary-marketing\/images\/f_auto,q_auto\/v1760802695\/how_to_perform_ocr_on_images_using_java_sdk_featured_image\/how_to_perform_ocr_on_images_using_java_sdk_featured_image.jpg?_i=AA\",\"keywords\":[\"Questions\"],\"inLanguage\":\"en-US\",\"copyrightYear\":\"2025\",\"copyrightHolder\":{\"@id\":\"https:\/\/cloudinary.com\/#organization\"}},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/cloudinary.com\/blog\/questions\/how-to-perform-ocr-on-images-using-java-sdk\/\",\"url\":\"https:\/\/cloudinary.com\/blog\/questions\/how-to-perform-ocr-on-images-using-java-sdk\/\",\"name\":\"How to perform OCR on images using Java SDK\",\"isPartOf\":{\"@id\":\"https:\/\/cloudinary.com\/blog\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/cloudinary.com\/blog\/questions\/how-to-perform-ocr-on-images-using-java-sdk\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/cloudinary.com\/blog\/questions\/how-to-perform-ocr-on-images-using-java-sdk\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/res.cloudinary.com\/cloudinary-marketing\/images\/f_auto,q_auto\/v1760802695\/how_to_perform_ocr_on_images_using_java_sdk_featured_image\/how_to_perform_ocr_on_images_using_java_sdk_featured_image.jpg?_i=AA\",\"datePublished\":\"2025-10-18T15:52:11+00:00\",\"dateModified\":\"2025-10-21T20:41:57+00:00\",\"description\":\"Developers often run into images that contain valuable text: invoices, receipts, scanned forms, ID cards, dashboards, or screenshots. But how do folks\",\"breadcrumb\":{\"@id\":\"https:\/\/cloudinary.com\/blog\/questions\/how-to-perform-ocr-on-images-using-java-sdk\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/cloudinary.com\/blog\/questions\/how-to-perform-ocr-on-images-using-java-sdk\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/cloudinary.com\/blog\/questions\/how-to-perform-ocr-on-images-using-java-sdk\/#primaryimage\",\"url\":\"https:\/\/res.cloudinary.com\/cloudinary-marketing\/images\/f_auto,q_auto\/v1760802695\/how_to_perform_ocr_on_images_using_java_sdk_featured_image\/how_to_perform_ocr_on_images_using_java_sdk_featured_image.jpg?_i=AA\",\"contentUrl\":\"https:\/\/res.cloudinary.com\/cloudinary-marketing\/images\/f_auto,q_auto\/v1760802695\/how_to_perform_ocr_on_images_using_java_sdk_featured_image\/how_to_perform_ocr_on_images_using_java_sdk_featured_image.jpg?_i=AA\",\"width\":2000,\"height\":1100},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/cloudinary.com\/blog\/questions\/how-to-perform-ocr-on-images-using-java-sdk\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/cloudinary.com\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"How to perform OCR on images using Java SDK\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/cloudinary.com\/blog\/#website\",\"url\":\"https:\/\/cloudinary.com\/blog\/\",\"name\":\"Cloudinary Blog\",\"description\":\"\",\"publisher\":{\"@id\":\"https:\/\/cloudinary.com\/blog\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/cloudinary.com\/blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/cloudinary.com\/blog\/#organization\",\"name\":\"Cloudinary Blog\",\"url\":\"https:\/\/cloudinary.com\/blog\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/cloudinary.com\/blog\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/res.cloudinary.com\/cloudinary-marketing\/images\/f_auto,q_auto\/v1649718331\/Web_Assets\/blog\/cloudinary_logo_for_white_bg_1937437aa7_19374666c7_193742f877\/cloudinary_logo_for_white_bg_1937437aa7_19374666c7_193742f877.png?_i=AA\",\"contentUrl\":\"https:\/\/res.cloudinary.com\/cloudinary-marketing\/images\/f_auto,q_auto\/v1649718331\/Web_Assets\/blog\/cloudinary_logo_for_white_bg_1937437aa7_19374666c7_193742f877\/cloudinary_logo_for_white_bg_1937437aa7_19374666c7_193742f877.png?_i=AA\",\"width\":312,\"height\":60,\"caption\":\"Cloudinary Blog\"},\"image\":{\"@id\":\"https:\/\/cloudinary.com\/blog\/#\/schema\/logo\/image\/\"}},{\"@type\":\"Person\",\"@id\":\"https:\/\/cloudinary.com\/blog\/#\/schema\/person\/43592e43c12520a1e867d456b1e8cf7e\",\"name\":\"damjanantevski\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/cloudinary.com\/blog\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/3b40c995531fe4d510212a06c9d4fc666d2cb8efbfebc98a94191701accf4817?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/3b40c995531fe4d510212a06c9d4fc666d2cb8efbfebc98a94191701accf4817?s=96&d=mm&r=g\",\"caption\":\"damjanantevski\"}}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"How to perform OCR on images using Java SDK","description":"Developers often run into images that contain valuable text: invoices, receipts, scanned forms, ID cards, dashboards, or screenshots. But how do folks","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/cloudinary.com\/blog\/questions\/how-to-perform-ocr-on-images-using-java-sdk\/","og_locale":"en_US","og_type":"article","og_title":"How to perform OCR on images using Java SDK","og_description":"Developers often run into images that contain valuable text: invoices, receipts, scanned forms, ID cards, dashboards, or screenshots. But how do folks","og_url":"https:\/\/cloudinary.com\/blog\/questions\/how-to-perform-ocr-on-images-using-java-sdk\/","og_site_name":"Cloudinary Blog","article_published_time":"2025-10-18T15:52:11+00:00","article_modified_time":"2025-10-21T20:41:57+00:00","og_image":[{"width":2000,"height":1100,"url":"https:\/\/res.cloudinary.com\/cloudinary-marketing\/images\/f_auto,q_auto\/v1760802695\/how_to_perform_ocr_on_images_using_java_sdk_featured_image\/how_to_perform_ocr_on_images_using_java_sdk_featured_image.jpg?_i=AA","type":"image\/jpeg"}],"author":"damjanantevski","twitter_card":"summary_large_image","schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"NewsArticle","@id":"https:\/\/cloudinary.com\/blog\/questions\/how-to-perform-ocr-on-images-using-java-sdk\/#article","isPartOf":{"@id":"https:\/\/cloudinary.com\/blog\/questions\/how-to-perform-ocr-on-images-using-java-sdk\/"},"author":{"name":"damjanantevski","@id":"https:\/\/cloudinary.com\/blog\/#\/schema\/person\/43592e43c12520a1e867d456b1e8cf7e"},"headline":"How to perform OCR on images using Java SDK","datePublished":"2025-10-18T15:52:11+00:00","dateModified":"2025-10-21T20:41:57+00:00","mainEntityOfPage":{"@id":"https:\/\/cloudinary.com\/blog\/questions\/how-to-perform-ocr-on-images-using-java-sdk\/"},"wordCount":580,"publisher":{"@id":"https:\/\/cloudinary.com\/blog\/#organization"},"image":{"@id":"https:\/\/cloudinary.com\/blog\/questions\/how-to-perform-ocr-on-images-using-java-sdk\/#primaryimage"},"thumbnailUrl":"https:\/\/res.cloudinary.com\/cloudinary-marketing\/images\/f_auto,q_auto\/v1760802695\/how_to_perform_ocr_on_images_using_java_sdk_featured_image\/how_to_perform_ocr_on_images_using_java_sdk_featured_image.jpg?_i=AA","keywords":["Questions"],"inLanguage":"en-US","copyrightYear":"2025","copyrightHolder":{"@id":"https:\/\/cloudinary.com\/#organization"}},{"@type":"WebPage","@id":"https:\/\/cloudinary.com\/blog\/questions\/how-to-perform-ocr-on-images-using-java-sdk\/","url":"https:\/\/cloudinary.com\/blog\/questions\/how-to-perform-ocr-on-images-using-java-sdk\/","name":"How to perform OCR on images using Java SDK","isPartOf":{"@id":"https:\/\/cloudinary.com\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/cloudinary.com\/blog\/questions\/how-to-perform-ocr-on-images-using-java-sdk\/#primaryimage"},"image":{"@id":"https:\/\/cloudinary.com\/blog\/questions\/how-to-perform-ocr-on-images-using-java-sdk\/#primaryimage"},"thumbnailUrl":"https:\/\/res.cloudinary.com\/cloudinary-marketing\/images\/f_auto,q_auto\/v1760802695\/how_to_perform_ocr_on_images_using_java_sdk_featured_image\/how_to_perform_ocr_on_images_using_java_sdk_featured_image.jpg?_i=AA","datePublished":"2025-10-18T15:52:11+00:00","dateModified":"2025-10-21T20:41:57+00:00","description":"Developers often run into images that contain valuable text: invoices, receipts, scanned forms, ID cards, dashboards, or screenshots. But how do folks","breadcrumb":{"@id":"https:\/\/cloudinary.com\/blog\/questions\/how-to-perform-ocr-on-images-using-java-sdk\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/cloudinary.com\/blog\/questions\/how-to-perform-ocr-on-images-using-java-sdk\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/cloudinary.com\/blog\/questions\/how-to-perform-ocr-on-images-using-java-sdk\/#primaryimage","url":"https:\/\/res.cloudinary.com\/cloudinary-marketing\/images\/f_auto,q_auto\/v1760802695\/how_to_perform_ocr_on_images_using_java_sdk_featured_image\/how_to_perform_ocr_on_images_using_java_sdk_featured_image.jpg?_i=AA","contentUrl":"https:\/\/res.cloudinary.com\/cloudinary-marketing\/images\/f_auto,q_auto\/v1760802695\/how_to_perform_ocr_on_images_using_java_sdk_featured_image\/how_to_perform_ocr_on_images_using_java_sdk_featured_image.jpg?_i=AA","width":2000,"height":1100},{"@type":"BreadcrumbList","@id":"https:\/\/cloudinary.com\/blog\/questions\/how-to-perform-ocr-on-images-using-java-sdk\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/cloudinary.com\/blog\/"},{"@type":"ListItem","position":2,"name":"How to perform OCR on images using Java SDK"}]},{"@type":"WebSite","@id":"https:\/\/cloudinary.com\/blog\/#website","url":"https:\/\/cloudinary.com\/blog\/","name":"Cloudinary Blog","description":"","publisher":{"@id":"https:\/\/cloudinary.com\/blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/cloudinary.com\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/cloudinary.com\/blog\/#organization","name":"Cloudinary Blog","url":"https:\/\/cloudinary.com\/blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/cloudinary.com\/blog\/#\/schema\/logo\/image\/","url":"https:\/\/res.cloudinary.com\/cloudinary-marketing\/images\/f_auto,q_auto\/v1649718331\/Web_Assets\/blog\/cloudinary_logo_for_white_bg_1937437aa7_19374666c7_193742f877\/cloudinary_logo_for_white_bg_1937437aa7_19374666c7_193742f877.png?_i=AA","contentUrl":"https:\/\/res.cloudinary.com\/cloudinary-marketing\/images\/f_auto,q_auto\/v1649718331\/Web_Assets\/blog\/cloudinary_logo_for_white_bg_1937437aa7_19374666c7_193742f877\/cloudinary_logo_for_white_bg_1937437aa7_19374666c7_193742f877.png?_i=AA","width":312,"height":60,"caption":"Cloudinary Blog"},"image":{"@id":"https:\/\/cloudinary.com\/blog\/#\/schema\/logo\/image\/"}},{"@type":"Person","@id":"https:\/\/cloudinary.com\/blog\/#\/schema\/person\/43592e43c12520a1e867d456b1e8cf7e","name":"damjanantevski","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/cloudinary.com\/blog\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/3b40c995531fe4d510212a06c9d4fc666d2cb8efbfebc98a94191701accf4817?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/3b40c995531fe4d510212a06c9d4fc666d2cb8efbfebc98a94191701accf4817?s=96&d=mm&r=g","caption":"damjanantevski"}}]}},"jetpack_featured_media_url":"https:\/\/res.cloudinary.com\/cloudinary-marketing\/images\/f_auto,q_auto\/v1760802695\/how_to_perform_ocr_on_images_using_java_sdk_featured_image\/how_to_perform_ocr_on_images_using_java_sdk_featured_image.jpg?_i=AA","_links":{"self":[{"href":"https:\/\/cloudinary.com\/blog\/wp-json\/wp\/v2\/posts\/38821","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/cloudinary.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/cloudinary.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/cloudinary.com\/blog\/wp-json\/wp\/v2\/users\/88"}],"replies":[{"embeddable":true,"href":"https:\/\/cloudinary.com\/blog\/wp-json\/wp\/v2\/comments?post=38821"}],"version-history":[{"count":1,"href":"https:\/\/cloudinary.com\/blog\/wp-json\/wp\/v2\/posts\/38821\/revisions"}],"predecessor-version":[{"id":38823,"href":"https:\/\/cloudinary.com\/blog\/wp-json\/wp\/v2\/posts\/38821\/revisions\/38823"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/cloudinary.com\/blog\/wp-json\/wp\/v2\/media\/38822"}],"wp:attachment":[{"href":"https:\/\/cloudinary.com\/blog\/wp-json\/wp\/v2\/media?parent=38821"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/cloudinary.com\/blog\/wp-json\/wp\/v2\/categories?post=38821"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/cloudinary.com\/blog\/wp-json\/wp\/v2\/tags?post=38821"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}