Seeing AI 4+

Talking Camera for the Blind

Microsoft Corporation

    • Free

Screenshots

Description

Seeing AI is a free app that narrates the world around you. Designed with and for the blind and low vision community, this ongoing research project harnesses the power of AI to open up the visual world by describing nearby people, text and objects.

Seeing AI provides tools to assist with a variety of daily tasks:
• Short Text - Speaks text as soon as it appears in front of the camera.
• Documents - Provides audio guidance to capture a printed page, and recognizes the text, along with its original formatting.
• Products - Scans barcodes, using audio beeps to guide you; hear the name, and package information when available.
• People - Saves people’s faces so you can recognize them, and get an estimate of their age, gender, and expression.
• Currency - Recognizes currency notes.
• Scenes - Hear an overall description of the scene captured. Explore the photo by moving your finger over the screen to hear the location of different objects.
• World - An Audio Augmented Reality experience to explore an unfamiliar environment, including hearing objects announced around you with Spatial Audio (requires a device with a LiDAR, and iOS 14+).
• Indoor Navigation - Available on the World Channel, enables you to create routes through a building, like "entrance to classroom", and navigate by following the sound (requires a device with an A9 or later processor, and iOS 14+).
• Colors - Identifies colors.
• Handwriting - Reads handwritten text like in greeting cards (available in a subset of languages).
• Light - Generates an audible tone corresponding to the brightness in the surroundings.
• Images in other apps - Just tap “Share” and “Recognize with Seeing AI” to describe images from Mail, Photos, Twitter, and more.
• Browse Photos - Hear descriptions of photos saved on your device.

Seeing AI continues to evolve as we hear from the community, and AI research advances.

Check out tutorials with this YouTube playlist: http://aka.ms/SeeingAIPlaylist.

Questions, feedback or feature requests? Email us at SeeingAI@Microsoft.com.

What’s New

Version 5.3

• Find My Things - This new feature, available via the World channel on devices running iOS 14+, enables you to teach Seeing AI to recognize your personal objects. Most AI systems focus on large datasets of generic objects, so we built this system that provides an audio-first experience to teach new objects yourself. You can then follow the audio cues to locate your things.
• Ask questions about photos (English only) - You can now ask Seeing AI questions about photos on the Scene channel, in Browse Photos, or those shared from another app. For example, after hearing the rich description, you may wish to hear more about a specific aspect of the image.
• Rich image descriptions, when you tap "More Info", have been improved to include even more details. Additionally, the names of people you have taught Seeing AI to identify are now included. And, this is now available in all 19 languages in which Seeing AI is localized.
• Handwriting recognition is now also available in Japanese and Korean.
• Document orientation - After scanning a document, you can find out which way round it is - useful when organizing multiple pages for stapling, for example. Tap "More", then "Page Info".
• Plus, various bug fixes under the hood.

App Privacy

The developer, Microsoft Corporation, indicated that the app’s privacy practices may include handling of data as described below. For more information, see the developer’s privacy policy.

Data Not Linked to You

The following data may be collected but it is not linked to your identity:

  • Identifiers
  • Usage Data
  • Diagnostics

Privacy practices may vary based on, for example, the features you use or your age. Learn More

More By This Developer

Microsoft Word
Productivity
Microsoft Teams
Business
Microsoft Outlook
Productivity
Microsoft Excel
Productivity
Microsoft 365 (Office)
Productivity
Microsoft PowerPoint
Productivity

You Might Also Like

Envision AI
Productivity
OrCamMyEye
Productivity
NaturalReader - Text To Speech
Productivity
VDScan
Productivity
Voice Aloud Reader
Productivity
Adobe Aero
Productivity