A text-to-speech model can preserve a speaker's emotional tone and acoustic environment.
Everyone thought it was cool to take selfies doing crimes until the FBI got all their data from Google and said hello.
https://www.vox.com/recode/22867000/january-6-fbi-search-facebook-google-insurrection