AI on IBM z17, Meta’s Llama 4 and Google Cloud Next 2025

Episode 50: AI on IBM z17, Meta’s Llama 4 and Google Cloud Next 2025

IBM z17™ is here! In episode 50 of Mixture of Experts, host Tim Hwang is joined by Kate Soule, Shobhit Varshney and Hillery Hunter to debrief the launch of a new mainframe with robust AI infrastructure. Next, Meta dropped Llama 4 over the weekend. How is it going? Then, Shobhit, recording live from Google Cloud Next in Las Vegas, covers the event along with Gemini 2.5 Pro. What are some of the most exciting announcements? Finally, the Pew Research Center has released research on the perception of AI. How does this impact the industry? All that and more on today’s 50th Mixture of Experts.

Key takeaways:

  • 00:00 – Intro  
  • 00:55 – IBM z17
  • 11:42 – Llama 4
  • 25:02 – Google Cloud Next 2025 
  • 34:29 – Pew’s research on perception of AI

The opinions expressed in this podcast are solely those of the participants and do not necessarily reflect the views of IBM or any other organization or entity.

Explore more episodes

OpenAI goes open, Anthropic on interpretability, Apple Intelligence updates and Amazon AI agents

In episode 49, our experts unpack Altman's open source push, Anthropic’s AI insights, Apple’s AI race and Amazon’s new AI agents. What’s next in AI? Tune in to Mixture of Experts for the full scoop.

DeepSeek-V3-0324, Gemini Canvas and GPT-4o image generation

What’s the best open-source model? In episode 48 of Mixture of Experts, we discuss a new release of DeepSeek V3, Google’s Gemini 2.5 and Canvas, Extropic’s thermodynamic chip and OpenAI’s GPT-4o image generation.

NVIDIA GTC, Baidu reasoning models and Gemini AI image generation

In episode 47 of Mixture of Experts, we discuss NVIDIA GTC announcements, Baidu reasoning models, chain of thought flaws and Gemini image generation.

IBM z17 makes more possible
 

A full stack AI solution with IBM z17

Learn how IBM z17 processes up to 5 million inference operations per second with less than 1 millisecond response time.

Transforming and simplifying the mainframe for greater productivity and efficiency with AI on IBM z17

Find out why 88% of IT execs say that app modernization is key, and 78% see mainframes as central to transformation. Learn how IBM helps clients boost value, AI productivity, and efficiency across key systems.

IBM® watsonx Assistant™ for Z

Unlock new levels of productivity on the IBM Z platform with a generative AI assistant.

Learn more about AI

What is artificial intelligence (AI)?

Applications and devices equipped with AI can see and identify objects. They can understand and respond to human language. They can learn from new information and experience. But what is AI?

What is fine-tuning?

Fine-tuning has become a fundamental deep learning technique, particularly in the training process of foundation models used for generative AI. But what is fine-tuning and how does it work?

How to build an AI-powered multimodal RAG system with Docling and Granite

In this tutorial, you will use IBM’s Docling and open-source IBM® Granite® vision, text-based embeddings and generative AI models to create a retrieval augmented generation (RAG) system.
