Skip to content
Industry News

Microsoft Launches Three New AI Models to Challenge Google and OpenAI

Microsoft's MAI Superintelligence team has released three foundational AI models — MAI-Transcribe-1, MAI-Voice-1, and MAI-Image-2 — signalling a bold push to build its own multimodal AI stack and compete directly with Google and OpenAI.

A

Admin

· 2 min read · Updated

Microsoft AI, the tech giant's research division, announced the release of three new foundational AI models on April 2, 2026, as it accelerates its push to build an independent AI stack — even while maintaining its long-standing partnership with OpenAI.

The Three New Models

The three models were developed by Microsoft's MAI Superintelligence team, led by CEO of Microsoft AI Mustafa Suleyman, and are now available on Microsoft Foundry:

  • MAI-Transcribe-1 — A speech-to-text model supporting 25 languages, 2.5× faster than Microsoft's Azure Fast. Starts at $0.36/hour.
  • MAI-Voice-1 — An audio generation model. Generates 60 seconds of audio in one second. Supports custom voice creation. Starts at $22 per 1M characters.
  • MAI-Image-2 — Image and video generation model, previously soft-launched on MAI Playground March 19. Starts at $5 per 1M tokens (text) / $33 per 1M tokens (image).

Why This Matters for Indian Businesses

  1. Lower cost AI models — Cheaper alternatives to Google and OpenAI.
  2. Multimodal capabilities — Opens new use cases for WhatsApp Commerce, support bots, and multilingual marketing.
  3. Azure integration — Plug directly into existing Azure workflows.

Microsoft's AI Independence Strategy

Despite investing over $13 billion into OpenAI, Microsoft is building its own AI stack in parallel. Suleyman: "At Microsoft AI, we're building Humanist AI — putting humans at the center, optimizing for how people actually communicate."

What's Next

Suleyman: "You'll see more models from us soon in Foundry and directly in Microsoft products and experiences."

Source: TechCrunch — April 2, 2026

By The Numbers

MAI-Transcribe-1 is 2.5x faster than Microsoft Azure Fast
Source: Microsoft Press Release, April 2026
MAI-Voice-1 generates 60 seconds of audio in one second
Source: Microsoft Press Release, April 2026
Microsoft has invested over $13 billion into OpenAI
Source: TechCrunch, April 2026

Share this article

A

Admin

News Editor at OG Marka

Covering AI, CRM systems, and digital transformation news for Indian growth brands.

More news →