{"id":1337,"date":"2021-01-22T16:12:13","date_gmt":"2021-01-22T23:12:13","guid":{"rendered":"https:\/\/nisus.com\/blogs\/?p=1337"},"modified":"2021-01-25T09:32:47","modified_gmt":"2021-01-25T16:32:47","slug":"extract-text-from-images","status":"publish","type":"post","link":"https:\/\/nisus.com\/blogs\/extract-text-from-images\/","title":{"rendered":"Extract Text from Images"},"content":{"rendered":"<p>Nisus Writer recently added a feature that allows you to extract editable text from your photos, scans, PDFs, and other images. This process is often called <a href=\"https:\/\/en.wikipedia.org\/wiki\/Optical_character_recognition\">optical character recognition<\/a> (aka OCR).<\/p>\n<p>Let&#8217;s see how text extraction works using a COVID relief notice I recently received from the United States government:<\/p>\n<div style=\"text-align: center; margin-bottom: 20px\"><a href=\"https:\/\/nisus.com\/blogs\/wp-content\/uploads\/2021\/01\/ocr-photo.jpg\"><img decoding=\"async\" src=\"https:\/\/nisus.com\/blogs\/wp-content\/uploads\/2021\/01\/ocr-photo-small.jpg\" style=\"max-width: 90%\"\/><\/a><\/div>\n<p>Once the image is in Nisus Writer Pro document, select it and use the <i>Extract Text From Image<\/i> command to generate an editable text version of the image:<\/p>\n<div style=\"text-align: center; margin-bottom: 20px\"><a href=\"https:\/\/nisus.com\/blogs\/wp-content\/uploads\/2021\/01\/ocr-screenshot.png\"><img decoding=\"async\" src=\"https:\/\/nisus.com\/blogs\/wp-content\/uploads\/2021\/01\/ocr-screenshot.png\" style=\"max-width: 95%\"\/><\/a><\/div>\n<p>Most of the text is correct and in sequence. There are a few minor errors and text misplacements, like the number 6 appearing before the title\u2013 perhaps caused by the Treasury Department&#8217;s seal alongside the main textual content.<\/p>\n<p>Let&#8217;s try a few others images, like this paperback book and store receipt:<\/p>\n<div style=\"text-align: center; margin-bottom: 20px\"><a href=\"https:\/\/nisus.com\/blogs\/wp-content\/uploads\/2021\/01\/ocr-cosmos.jpg\"><img decoding=\"async\" src=\"https:\/\/nisus.com\/blogs\/wp-content\/uploads\/2021\/01\/ocr-cosmos.jpg\" style=\"max-width: 95%\"\/><\/a><\/div>\n<div style=\"text-align: center; margin-bottom: 20px\"><a href=\"https:\/\/nisus.com\/blogs\/wp-content\/uploads\/2021\/01\/ocr-food.jpg\"><img decoding=\"async\" src=\"https:\/\/nisus.com\/blogs\/wp-content\/uploads\/2021\/01\/ocr-food.jpg\" style=\"max-width: 95%\"\/><\/a><\/div>\n<p>Overall pretty good! Usually editing extracted text is a better starting point than retyping something entirely.<\/p>\n<p>The accuracy of the extraction will depend on a variety of factors including the quality of the image, whether text is slanted or rotated, the language and words in the text, and your system version. Nisus Writer uses <a href=\"https:\/\/developer.apple.com\/machine-learning\/\">Apple&#8217;s machine learning capabilities<\/a> to accomplish this task, and requires at least macOS 10.15 Catalina.<\/p>\n<p>Hopefully you&#8217;ll find a good use for this new feature.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Nisus Writer recently added a feature that allows you to extract editable text from your photos, scans, PDFs, and other images. This process is often called optical character recognition (aka&hellip;<\/p>\n","protected":false},"author":6,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[18,17,14],"tags":[],"class_list":["post-1337","post","type-post","status-publish","format-standard","hentry","category-express","category-pro","category-tips"],"_links":{"self":[{"href":"https:\/\/nisus.com\/blogs\/wp-json\/wp\/v2\/posts\/1337","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/nisus.com\/blogs\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/nisus.com\/blogs\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/nisus.com\/blogs\/wp-json\/wp\/v2\/users\/6"}],"replies":[{"embeddable":true,"href":"https:\/\/nisus.com\/blogs\/wp-json\/wp\/v2\/comments?post=1337"}],"version-history":[{"count":22,"href":"https:\/\/nisus.com\/blogs\/wp-json\/wp\/v2\/posts\/1337\/revisions"}],"predecessor-version":[{"id":1366,"href":"https:\/\/nisus.com\/blogs\/wp-json\/wp\/v2\/posts\/1337\/revisions\/1366"}],"wp:attachment":[{"href":"https:\/\/nisus.com\/blogs\/wp-json\/wp\/v2\/media?parent=1337"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/nisus.com\/blogs\/wp-json\/wp\/v2\/categories?post=1337"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/nisus.com\/blogs\/wp-json\/wp\/v2\/tags?post=1337"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}