TLDR:
Google just filed for a patent to make it's AI train on radio streams, this will make the AI learn natural voice, and has likely unlimited voice audio to learn from. I find it ingenious!
Long Story:
Google found it's unlimited resources of audio data, it's free, it's streamed live non-stop, you can learn natural voice, you can learn rare languages, you can learn all the languages, all you need to do is just listen to radio.
Google just filed for a patent where they describe entirely the procedure of training their own AI models on radio data. Reminder if you want to know why exactly google needs audio sources in the first place you can read a super simple explanation here.
I honestly find this pretty ingenious, you have all this raw data of voices, which are streamed over internet a lot of the time, and all you need to do, is just feed them to an algorithm that's gonna break it into computer understandable and distinguishable state. What computers understand are numbers, so that's exactly what they are going to do.
Audio is a audio wave, which means it has a unique fingerprint. All of these fingerprints can then later be grouped and aligned into a database.
Now with tons of trial and error, until this is polished up properly, Google will eventually have a vast source of audio profiles in most languages.
AI will be able to synthesize any text into audio, literally speak as a radio broad caster 📻 🎙️