That would be a hell of large add-on for how many sounds it is going to have for each word.
It can certainly be done.It'd use a ton of sounds and datablocks though. So that's a reason not to have it.