Microsoft is doubling down on AI models that aren’t large language models. The company announced on Thursday that it’s releasing three new models: brand-new models for voice and text transcription, and the second generation of its in-house image model.
The voice and text transcription models are the first of their kind from Microsoft. The transcription model can transcribe recordings into text in 25 different languages. It’s built for video captioning, meeting transcription and voice agents. The voice…
