AVSpeechSynthesis
AVSpeechSynthesis
WWDC 2018
AVSpeechSynthesis
- API for computer synthesized speech
- Uses
- Announcements
- Non-sighted interfaces
- Education apps
- Many more… - Example — Audio updates during a workout
AVSpeechSynthesis and Accessibility
- Powerful tool for helping many users
- Congnitive
- Speech vocalization
- Non-sighted - Not a replacement for VoiceOver
- Speech can overlap with VoiceOver’s
- Won’t be available to Braille devices
- Make your app accessible instead usingUIAccessibility
Basics
- Create
AVSpeechSynthesizer
2. Ensure it’s retained until speech is done ⚠️
( Speech will be cancelled in the synthesizer is deallocated )
data:image/s3,"s3://crabby-images/aee95/aee952d6452c23364009a01f141569433b352486" alt=""
3. Create an utterance
4. Dispatch to synthesizer
data:image/s3,"s3://crabby-images/ec6c7/ec6c7ad6d6c7e653376593c87e7f562fb620bdf9" alt=""
AVSpeechSynthesis and Audio Sessions
AVAudioSession
automatically activated onspeak()
- To mix with other audio, use
data:image/s3,"s3://crabby-images/cd302/cd3027fb419480139cfedf6ae3b7372d43a54060" alt=""
- To duck other audio, use
data:image/s3,"s3://crabby-images/f3218/f3218a2e436cfee76fc6095a13c114569bb3a249" alt=""
Callbacks
- Delegate methods inform about the life cycle of an utterance
AVSpeechSynthesizerDelegate
defines optional methods
- Speech started
- Speech finished
- Character range will be spoken
- Speech paused
- Speech continued
data:image/s3,"s3://crabby-images/40e1a/40e1ae09e8f76b31295fcd42038f7ff5306f6b0a" alt=""
data:image/s3,"s3://crabby-images/e31cb/e31cb1c78c61c570810d9b3896cbbbd9091cad19" alt=""
Choosing the Right Voice
- Built in voices (1 for each supported language)
- Siri voices not available through API
- Users can download ⏬ higher quality voices
- Select with an identifier or a language
- Selecting by language will select the users default voice
data:image/s3,"s3://crabby-images/926bc/926bcc147e98b6815dc8e8e22b8901d7ab2052e6" alt=""
data:image/s3,"s3://crabby-images/5cd1d/5cd1de592bbe0ea94a1de65f01edbb4e839d4bb1" alt=""
Speech Rate
- Controls the rate of speech [ 0 ~ 1.0 ]
- Speaking Rate is scaled from ≅ 0x ➜ 1x with values [ 0, 0.5 ]
- Speaking Rate is scaled from 1x ➜ 4x with values [ 0.5, 1 ]
data:image/s3,"s3://crabby-images/48ca4/48ca477d9ea733d817bd76c3033377f760470f35" alt=""
Pitch and Volume
Set properties on AVSpeechUtterance
- Pitch [ 0 ~ 1.0 ]
- Volume [ 0 ~ 1.0 ]
data:image/s3,"s3://crabby-images/abeb7/abeb7118fffb60a76c6e90c6b8db0287ed124de3" alt=""
Attributed Strings
- Customize behavior of generated speech with attributes
- IPA Notation
- International Phonetic Alphabet
- Uses — Specialized names, proper nouns, business names…
- Available — en-US, en-AU, en-GB, de-DE, es-ES, es-MX, fr-CA, fr-FR, it-IT, ja-JP
- Example — iPhone = ˈa͡ɪ.ˈfo͡ʊn
IPA Notation
- How do you generate IPA notation?
- Settings > General > Accessibility > Speech > Pronunciations
data:image/s3,"s3://crabby-images/1fc26/1fc2629686f39ceeeca0168aa006945e2170d42f" alt=""
- Use attributed string API with IPA pronunciation
data:image/s3,"s3://crabby-images/6d70d/6d70dedfcfc4f493a764af70c40269ec6af09452" alt=""