Hi! This is Shalin Shah — creator of Voice. I’m a senior studying computer science at the University of California, Berkeley. Voice has been a project of mine since I was in high school, and I’ve been working hard to make it better over the last 6 years.
Voice was hand-crafted for people like you. It can help you quickly read items like product labels and magazine pages in your day-to-day. But you can also use it for more advanced reading like mail or books.
My goal with Voice was to create the most simple and intuitive interface to help you read things. Here are some features that I hope are helpful.
Voice’s OCR engine is perhaps one of the best in the world. You no longer need to worry about low lighting and bad focus, Voice corrects it automatically and gives you pixel-perfect accuracy every time. Voice can even read scribbles and handwritten text with incredible accuracy.
Standard Photo Capture:
The most basic way to use Voice is by simply tapping the button labeled by VoiceOver as “Camera. Button” This will take a picture. Then simply tap the button labeled “Next. Button” and Voice will perform OCR on your image and read it aloud.
Alternatively, you can control the app using your voice if you find that tapping buttons shake your camera. Simply say “capture” to snap a picture, and “read” to start processing the image.
Batch mode is enabled by default. To read more than one page, just keep taking photos using the “Camera. Button” or by saying the word “capture” many times. Voice will read all the documents one after another.
Good OCR detection does not depend on the corners of a document to be visible. But if corner detection is important to you, Scan Tone plays a tone when it sees all 4 corners visible. The tone gets louder or softer depending on the placement, tilt, and orientation of your phone relative to your document. A louder scan tone means better visibility of your document.
Voice also supports real-time scanning. Toggle this on, then simply hold your phone in front of any document with text and Voice will read it out loud in real-time. Voice also automatically turns on flash when it detects sub-par lighting and turns it off for objects that would glare.
Reading Voices & Languages:
Voice supports 47 languages and offers 180 reading voices. 52 voices are the standard iOS voices, and 128 of them are premium AI-generated voices with extremely fluent intonations. You can adjust your language, reading voice, and speaking rate in Settings.
The photo library picker lets you pick multiple images at a time from any of your albums.
Voice now fully works without wifi. If privacy is a concern or you don’t care for that extra bit of OCR quality, then feel free to use Voice in offline mode. You can turn it on in settings.
Saving & Exporting:
Once your document has been scanned, it takes one tap to copy your detected OCR text to your clipboard, export it as an accessible PDF, export it as a Plain Text file, or simply export all the images you captured.
Voice allows you to import both images and PDFs from other apps. It automatically detects the document format and performs OCR.
The entire app was crafted with VoiceOver in mind, so everything is fully accessible.
Voice has only a 6.9-megabyte app size.
You get 20 free scans per month. You can use those scans in either Short Text Mode or Standard Capture Mode. Once those 20 scans are up, you must purchase the Elite plan for $9.99 per month or the Believer plan for $99.99 per year. You save $20 a year, or 17%, by upgrading to the Believer plan. We are committed to keeping Voice OCR open for scholars and others lacking financial stability. Fill out this short form to tell us about your situation at http://bit.ly/VoiceOCR.
Feel free to reach out to my personal email, firstname.lastname@example.org, for any feedback.
- New gestures to control the app when using VoiceOver.
- Two-finger Double-tap to Play and Pause.
- 3-finger swipe right to read the next sentence.
- 3-finger swipe left to read the previous sentence.
- 2 finger scrub gesture to close out of pages.
- Option in settings for making OCR recognition either really fast or really accurate.
- Added "4 corners detected" feature back into the app, and removed Scan Tone.
- Fixed bugs on the sign-up screen, and made it a lot simpler.
- Voice OCR's audio cues like "4 corners detected" were translated into your native language, not just English.
- Fixed audio issue when opening the app with Siri.
- Better accessibility labels that make the app more intuitive.
- Overall, tons of bug fixes and massive performance improvements.
- Feel free to email me at email@example.com for any questions or feedback!
Ratings and ReviewsSee All
To me not worth it
Problem I have yes voice OCR. And I want to know why I have to delete this app off my phone, and order for it to work? And I paid for it to see if it will work properly. I am thinking about deleting it completely. Until everything is correct! I am not happy with this app. LJ
Impressive OCR Capabilities
I downloaded this last night because I had been trying to read a Ramen packet that I had gotten in a subscription box. Well, not only was I able to read the info about the nutritional facts and the ingredients, but this engine was also able to read the Chinese information on the package as well. I have since been able to read information on a package of sour gummy candies, a bottle of kombucha, and a package of tuna burgers that I received from my community supported fishery. This app has been consistent and reliable and has provided the clearest scans thus far. Additionally, I am grateful for the voice-activated ability to capture pictures and have them read. This allows for a hands-free experience. I'm glad to have found out about Voice OCR and will definitely be joining up as a Believer as soon as I am able to do so. Thank you for creating this extremely accurate and reliable app. As an aside, thanks for the cute and quirky retro videogame music when the magic is happening after the photos have been taken! It's icing on the cake for me.
Amazing OCR App!
I like how accurate this app reads documents that it scans. I have had really good results and I like that I can use my voice to do most of the actions on this app. It comes with some really awesome features and I love the different invoices I can use so I definitely recommend this app for many people. Thank you for making such an amazing app for me.
Data Linked to You
The following data may be collected and linked to your identity:
- Contact Info
- Usage Data
Privacy practices may vary, for example, based on the features you use or your age. Learn More
- Shalin Shah
- 11.4 MB
- Requires iOS 13.0 or later.
- iPod touch
- Requires iOS 13.0 or later.
- Requires macOS 11 or later and a Mac with Apple M1 chip.
- Age Rating
- © 2020, Shalin Shah.
- In-App Purchases
- Elite $9.99
- Believer $99.99
With Family Sharing set up, up to six family members can use this app.