MAI released models that can transcribe voice into text as well as generate audio and images after the group's formation six ...