multimodal models
About this tag
Multimodal models are a key focus of Microsoft's AI strategy: the company plans to build its own state-of-the-art multimodal models by 2027 to reduce its dependence on OpenAI. These models process and generate multiple data types, such as text and images, which enables advanced AI applications. On WindowsForum.com, discussions cover Microsoft's frontier compute investments and the use of multimodal AI tools like ChatGPT, Gemini, and Grok for creative tasks such as generating bespoke Holi greeting images. The tag tracks the shift toward integrated AI capabilities in consumer and enterprise software, with an emphasis on practical applications and the evolving generative AI landscape.
Microsoft’s push to build its own cutting-edge AI models by 2027 marks one of the clearest signs yet that the company no longer wants to be defined as merely OpenAI’s biggest distributor. The strategy is not subtle: build frontier-scale compute, train state-of-the-art multimodal models, and...
Holi’s riot of color meets the new ritual of personalisation: this year, you can make a bespoke Holi greeting in minutes using modern AI image tools — whether you choose ChatGPT’s Images, Google’s Gemini (Nano Banana 2), or xAI’s Grok Imagine — and walk away with a shareable, high‑resolution...