---
Transcript
**** · The brightest minds in AI don't necessarily agree on some of the very fundamental things about the industry about definitions of AGI about whether or not large language models are the path forward and are going to get us where we want to go or if we need world models Yan Lun would said whether or not jobs are going to go away or more jobs are going to be created whether we should sell chips to China nobody knows but the fact that we're in a place in society where we're having these conversations openly to me is fantast Fantastic.
**** · Welcome to the artificial intelligence show, the podcast that helps your business grow smarter by making AI approachable and actionable. My name is Paul Ritzer. I'm the founder and CEO of Smarter X and Marketing AI Institute and I'm your host. Each week, I'm joined by my co-host and Smarter X Chief Content Officer, Mike Kaput, as we break down all the AI news that matters and give you insights and perspectives that you can use to advance your company and your career. Join us as we accelerate AI literacy for all.
**** · Welcome to episode 210 of the Artificial Intelligence Show. I'm your host, Paul Ritzer, along with my co-host, Mike Kaput.
**** · going a little makeshift this week.
**** · So, I have been traveling. I was in Washington DC with my daughter's class the last three days.
**** · I got back late Sunday night. Then we normally record this on Monday mornings, but I happen to be flying to Las Vegas this morning for the Google Next Conference this week. So, we are doing this as an unusual time. I honestly am not even sure what time it is now.
**** · I think it's what you're Mike is Eastern now.
**** · Okay.
**** · So, it is end of the day Eastern time. I am all sorts of out of sorts here in terms of what is going on. But as I was doing the final prep what because Wi-Fi on the plane of course isn't going to work. but it ends up as we're delaying this, we get the last minute notification that Tim Cook's out as CEO of Apple, which is shows you how real time this is because Mike and I are scrambling 10 minutes ago.
**** · what is what is going on? What did they say? And as of p.m. on Monday, the 20th, they haven't said much. So, I'm sure by the time this airs, you'll have heard this news and it'll be all over. and when Wall Street opens, when the stock market opens Tuesday morning, I would imagine we're going to see some movement one way or the other. so, more to come on the Tim Cook thing, I guess. But that's that's how real time this is. And then Mike and I were joking. I if any of you ever have to do virtual meetings when you're traveling or do podcasts or things that when you're traveling, we all know the makeshift setups that we have to do. And I have to admit, I have a first in this case, I have a recycling can on my lap that my microphone is sitting on top of. And I had to take a picture of it to so I could show Mike what in the world I'm doing here.
**** · So, I'm overlooking the Vegas airport.
**** · we're going to make this work one way or the other. We got to figure out next week, Mike, because you and I are both traveling on Monday. We need to finish that conversation. I'm also out next Monday.
**** · So anyway, episode 210.
**** · As I said, this is a pretty real time for us. We, if you're new to the podcast because we get new listeners every week. This is how we roll.
**** · we do this all in one take all the time. I think in the entire history of the podcast, we've had one or two stops and it was usually for coughing fits. So, we just do this. Claire on our team edits it and it drops the next morning. So, here we go. Mike, So, this episode is brought to us by AI Academy by Smarter X, which helps individuals and businesses accelerate their AI literacy and transformation through personalized learning journeys and an AI powered learning platform. New educational content is added weekly so you're always up to date with the latest AI trends and technologies. The AI for Industries collection features seven course series and certificates designed to jumpstart AI understanding and adoption. Those seven series are professional services, healthcare, software and technology, insurance, financial services, retail and CPG and manufacturing was which just dropped last Friday. Mike, if I'm if I'm correct. Okay. Correct. And then Mike's going to give us a little rundown some key takeaways from the AI for manufacturing series toward the end today. So these series are an ideal launchpad for organizations that want to level up their teams and accelerate AI adoption and impact. And today we're going to share some insights. As I said with AI for mark manufacturing course series, you can get individual or business accounts. both plans are available now. You can go to academy.smartx.ai to learn more. Pricing is totally transparent. So when you go there, you can see the exact pricing for both individual and business plans. So again, go to academy.smarterx.ai to learn more about that. Okay. We've I don't know how many models are going to drop by the time we do this. This comes out on Tuesday, but I we did have one already today. we might not even get into that one until next week. But there is there's rumors that more models are coming. And then I said, I'm at the Google Next conference. While I don't necessarily expect any major models to necessarily drop this week, that's more usually the IO conference where they would do that. I do expect lots of announcements from Google. So we'll definitely have lots to talk about on next week's episode when we start getting into the updates from Google.
**** · But we've got a big new report to talk about to start off though, Mike.
**** · Yes, we do, Paul. So Stanford's Institute for Human-Centered AI just released their 2026 AI index report.
**** · This is an annual benchmark where they try to map the entire state of AI across research, performance, investment, jobs, public opinion, energy, and policy. And boy, I I haven't looked at every report out there, but this might be the biggest one. It weighs in at over 400 pages this year. I think that's maybe the longest it's been. It is worth at least skimming or dropping into Notebook LM because it does represent one of the top industry reports out there for getting a perspective on what's happening at a macro level in AI. So some highlights that jumped out to us and obviously we're not going to be able to cover all 400 pages, but a couple things that are important to note. So first they find that the US China AI performance gap has effectively closed.
**** · So US and Chinese models have been trading the top performance ranking multiple times since early 2025. Deepseek R1 briefly matched the best US model in February 2025. And as of March 2026, Anthropic's top model leads top Chinese models by just 2.7%. China is now also ahead of the US in study publication research, publication volume, citations, patent output and industrial robot installations.
**** · They also found that AI capabilities are outpacing benchmarks but still suffer from this term we use sometimes in AI called jagged intelligence. So what that means is frontier models now meet or exceed human baselines on PhD level science questions, multimodal reasoning and competition mathematics.
**** · And they're saturating evals that have been intended to be challenging for years, but they're doing it in a matter of months. So for instance, the SWE verified coding test on that one performance skyrocketed from roughly 60% to near 100% in a single year. However, while these models can do things win math Olympiads, they also have big blind spots. They still fail to do things read the time correctly about half the time.
**** · They also found model convergence is happening at the frontier. So performance among the be very best models is becoming indistinguishable.
**** · So the top four models on the arena leaderboard from anthropic, XAI, Google and OpenAI are now clustered within just 25 ELO points of each other which is how they measure their effectiveness. so raw capability is not really serving now as a clear differentiator and that's why the competitive pressure is rapidly shifting towards cost, reliability and domain specific performance. So a few more things they highlighted. Generative AI adoption is outpacing if you can believe it the adoption rate of the internet. So generative AI reached 53% global population adoption within 3 years which is faster than either the personal computer or the internet. At the same time AI's physical and environmental footprint seems to be expanding drastically. For instance, Gro 4's training emissions equaled about 72,000 tons of CO2 equivalent, which is comparable to driving 17,000 cars for a year. And a AI data center power capacity reached almost 30 gawatt, which is comparable to the peak demand of the entire state of New York. On top of all this, global corporate AI investment more than doubled in 2025 to 581.7 billion. Private investment reached 344.7 billion as well. Couple final points here, more related to younger people.
**** · AI's labor market effects are hitting junior workers hard. Employment for software developers aged 22 to 25 dropped nearly 20% since 2024. That pattern was mirrored in customer service and other highly AI exposed roles. And the public and AI experts as a result of this view the future of the technology quite differently. 73% of AI experts expect AI to have a positive impact on how people do their jobs. That's compared to just 23% of the US public.
**** · So quite a bit of a 50 point gap.
**** · Furthermore, 64% of Americans expect AI to lead to fewer jobs, and the US reports the lowest trust, 31%, in its own government to regulate AI responsibly, out of any country surveyed. And lastly, four out of five US high school and college students now use generative AI for school work, but formal school guidance is failing to keep up. Only half of middle and high schools have AI policies in place. A mere 6% of teachers report that those policies are clear. So Paul, lots of different threads to pull on here, but I will say I just look at this data and I find it very hard to take seriously the argument that AI is overhyped at this point.
**** · There's a ton of good stuff in here and you said, it is massive. I can't imagine anyone's going to sit down and read this whole thing.
**** · But I thought they did a nice job of breaking it into chapters. And when you go to the site, and we'll put the links into the show notes. You can just go by chapter, and that's how I processed most of it. I didn't have time to read the whole thing, but the nine chapters are R&D, so research and development, technical performance, responsible AI, economy, science, medicine, education, policy and governance, and public opinion. So just even that alone Mike starts to show the breadth of the impact AI is having that we have all these different areas we have to look at a macro level of what is going on. they do a great job with the research.
**** · Obviously they have incredible research partners and some amazing people that are contributing to this and writing this report. So one of the ways you can do it is just go chapter by chapter on the website and it calls out usually it's between five and 10 highlights from each of those chapters. And so that's a great way to go through and do it. And so you did, Mike, I went through and pulled out the ones that were jumping out to me that might be worth a little bit more context. I'll just highlight a couple of additional ones.
**** · So one I you might have touched a little bit on this one, but this idea of the evals and the benchmarks that we normally use to measure progress. And what I what jumped out to me here is this continued need that we talk a lot about for companies and individuals to have their own evaluations of these models. You can't rely on these reports that are published by the AI frontier labs or even these organizations that are cropping up that are specifically designed to just measure the intelligence of these models. you have to have the standard tasks or workflows that you perform internally and so when a new model comes out you can determine the impact of it. So, as Mike was going through this, I was glancing down to see if there's any updates on the Tim Cook thing, and I saw somebody post something about did they make an update to Claude 4.7 this morning? It's sassier than it was last week, and I can't use it again. And so, the models truly do change all the time. And when they do these updates, they should incrementally be better, but sometimes something gets broken or something changes in the process. And so with the rate of change and the rate of model releases, it really is becoming more and more important. And it's I flagged something Mike will talk about on next week's episode around these eval. I saw a new company or tool that was created. Oh, was it Zapier? I think it was Zapier. So again, we'll talk about it next week cuz I haven't had time to prep on it, but there was a new thing that was more about actual work being done in developing eval. So eval is really important. they highlighted and I'm under the technical performance section now. video generation models are starting to capture how objects behave, not just produce realistic content. This goes back to the thing Mike and I have talked a lot about over the last 6 months or so. This idea of understanding the physics of the world and being able to develop world models which is going to be very important to especially robotics but video generation, interactive game generation, things that. So we're seeing progress being made. you'd highlighted a number of the ones in the economy side, Mike, that I was looking at the labor market effects are showing up unevenly. it does seem to be affecting the youngest workers most at this point. And then one-third of organizations expect AI to reduce their workforce in the coming year. even though we're not necessarily seeing it in the jobs data yet. I think you'd mentioned this one, Mike, about the four out of five high school and college students now use AI for school work. but the school policies aren't keeping up. So that's a major problem. Another one I thought was interesting under policy and governance was US public investment in AI remains modest compared to private sector. And the reason I flagged this one, Mike, is because I think this is about to change dramatically. Yeah.
**** · I don't know what the numbers where the numbers are going to come out to, but we'll have to go back and Mike maybe you can find the show notes. There was an episode where I did a breakdown of the Manhattan Project. this is probably last year where I broke down how much of GDP was going to the Manhattan project back in the day.
**** · And at the time I said we were going to need something that from the government.
**** · And I do think what's about to happen is going to be the government subsidizing the build out of energy, the build out of data centers, the building of chips. I think they're going to start to get into subsidizing the training and education of American workforces. I think that's going to have to happen. And so I could see sometime in the next one to two years where the government is spending well over a trillion dollars a year on AI infrastructure and talent and all the hard and soft costs associated with it. if they don't take over one of the labs. So, and I'm not a proponent of that happening, but I think we're just getting to the point where AI is so fundamental to the future of to democracy, future of the republic that I don't think they're going to be able to just sit back and hope that the three to five private companies figure this all out. So, I think they're going to go.
**** · I just don't know exactly what that looks yet. And then under public opinion, AI optimism is rising, but so is anxiety. It's generally people are pretty excited about it, but they're also really not sure. There was one stat that the one thing that jumped on me I thought was weird. It was people, in their confidence of understanding AI. It was 67% of people said they're confident in their understanding of AI.
**** · I was okay. I you could give me the hundred smartest CEOs in the world and I'm not sure I'd get to 67% who understand AI. So, I'm not sure who exactly they were asking that question of, but I'm thinking different students maybe.
**** · Yeah.
**** · it's either they're either really overconfident in their ability or they're asking technical people. because that is not by any means a representative of the average worker or business leader. twothirds of Americans expecting AI to lead to fewer jobs over the next 20 years, which I thought was a really weird timeline to throw into this. It's what? Two years maybe, but 20 years, who knows? that's crazy. And then one other one, the companionship thing, they talked about it still being niche, but that I was just noting I don't think that's going to be niche for long. they're still treating this people using this in a relationship or companionship. I feel that is just skyrocketing beneath our eyes we're it's probably happening and most people just aren't aware it's going on. And then the last two was the United States reported the lowest trust in its government to regulate AI responsibly.
**** · that is accurately placed mistrust. I would say I don't think the US government is currently on a path to responsibly regulate AI. And then across all 50 states, concerns about too little AI regulation outweighs concerns about too much. And then just a final note on their methodology, I thought it was interesting. They said AI index is written by a team of human researchers and they do a nice job of highlighting those people up front. And they said the authors used Chad GPT and Claude to help refine and copy edit drafts. And then all images in the publication were generated with AI by Johanna Freriedman. So they recognized the human but said hey we did this with AI and then they said the two specific models that they used. And then the final thing is public data and tools. One of the nice things about this report is they open source a lot of this. So you can go in, they have a Google Drive link to get all the public data and the images that they created, all the charts, and you can use all of those things. So if you're doing presentations on this stuff or business research, they just tell you how to properly site the report. And then they also have a global AI vibrancy tool that compares the AI ecosystem across 36 countries. So I, you and I, Mike, love a good research report. And this is extremely well done, very professionally done.
**** · It's the research we're happy to shine a spotlight on this podcast because I think there's a lot of really good information in here and they obviously put a ton of effort and it was a very thoughtful presentation of the report but it's their seventh year doing it I think so it's they got it down to a science at this point.
**** · So, next up, big thing happening this week is that OpenAI is going through a very public reorging reorg. And this week brought together a few separate threads that tell this overall story of the company speedrunning through some serious growing pains. So, first we had reported last week on some executive level shakeups. We've got a few more this week. So, Kevin Wild, OpenAI's former chief product officer has announced he is leaving. He joined OpenAI in 2024. He became CPO. Then he stepped out recently to launch OpenAI for science. On the same day, Bill Peebles who led OpenAI Sora short form video app also announced his exit. We have previously talked about OpenAI rolling Sora into ChatgPT/ sunsetting it as its own thing. So that makes perfect sense. Peoples might be out. Open AAI says it's also now decentralizing that Open AI for science initiative back into the core research teams. And so the idea here that we've talked about a couple weeks in a row is OpenAI is shedding what their leadership internally calls these side quests. So things that are just no longer a fit for the company's core mission as they pivot especially hard towards the enterprise. second, there's some internal drama around the company's IPO timing. The Wall Street Journal reported Sam Alman wants an IPO as early as Q4 2026. CFO Sarah Frier has told colleagues the company may not be ready this year. Frier has raised concerns that OpenAI's financial exposure given its computing infrastructure spending is a problem because that could reach $121 billion in 2028 alone. Sources also have said Sam Alman has excluded Frier from investor conversations and major financial decisions. Unfortunately, a pattern of behavior he has exhibited in some of the other articles we've discussed. Third, there's some internal drama or at least competitive animosity towards Anthropic. The Verge got its hands on a leaked internal memo from OpenAI's chief revenue officer, Denise Dresser, who sent this to employees, taking direct shots at anthropic.
**** · Dresser accuses them of inflating their run rate by roughly $8 billion through what she calls accounting treatment that makes revenue look bigger than it is, specifically by grossing up revenue sharing agreements with Google and Amazon rather than using net figures. She also writes that Anthropic is built on fear restriction and the idea that a small group of elites should control AI.
**** · And last but not least, there's also external drama around the company's valuation. The Financial Times reports that OpenAI investors are starting to question the current $852 billion valuation as the company shifts towards enterprise. So the idea here is that all these recent deals and abandoned projects are all part of defending Chad GPT's consumer dominance while going after anthropic in higher margin corporate markets. So the issue here is that one early backer told Financial Times that despite Chat GPT having almost a billion users, OpenAI is a deeply unfocused company. So Paul, this also comes as OpenAI and Sam Alman are sounding the alarm on AI's possible impact on jobs. They even released an AI transformation plan for jobs this week. but honestly, I don't know, do they have enough drama on their plates without all that? it seems they're really dealing with a lot of these core issues to the business.
**** · Yeah. And the Musk trial, I think, is Yeah.
**** · later this month still.
**** · Yeah.
**** · Yeah.
**** · Yeah.
**** · and Fiji Simu is still out and the CEO of AGI deployment. I don't remember when exactly they changed that title. It was applications, And then they updated it recently to AGI deployment. So, she's on medical leave.
**** · yeah, I saw I don't remember what day it was when those resignations started happening, but they were back to back on my ex feed and I took a screenshot and I put it in our sandbox for the podcast and I was hm. And then an hour later one more came and then the articles started pouring in.
**** · So it does seem to just be a byproduct. obviously reorgs happen a lot in the AI lab space. Well at least here anthropic seems to be pretty stable. but this is there's been a lot of moving pieces at as at OpenAI for a number of years now and this one seems pretty significant, but it does seem the byproduct of this renewed focus and the need to constrain compute to the newer models.
**** · there's talk that we're going to get their Spud model. It's been cenamed Spud, which I think is 5.5. That is lots of increasing rumors that is coming this week. so, yeah, I don't know. It's just a really interesting time. I don't I try not to over speculate too often on what could happen with these things. I usually constrain it to internal conversations, but the I will throw one out here that I'll just say publicly.
**** · The most logical thing that happens to me with OpenAI is Brett Taylor takes over as the CEO before the end of the year. So what by that is Sam Alman through his own comments in the media does not seem enthusiastic about the idea of being the CEO of OpenAI when it's a publicly traded company.
**** · There's lots of things going on. It's a very stressful job. I can't even imagine what Sam deals with personally and professionally. Obviously, he's had issues recently with attacks on his home, attacks on the office, personal threats to his safety, his family's safety, and I just don't ever get a sense when I listen to Sam talk anymore that he's having fun, that this is yeah, what he wanted and so when you start adding all those things up, you just start to look around be I just can't see him long term doing this. I could see him being the face of the company for fundraising and the infrastructure and things that, but at some point I just don't know that he doesn't burn out from being the real CEO day-to-day of the company and having to deal with this. And Brett Taylor is just the most obvious thing because Brett's built Sierra, an agent company built on top of chat GPT capabilities. He's the chairman of the board. he's a co-CEO of Salesforce prior to this. He was a former CTO of Facebook. He started his career at Google. He co-created Google Maps. Brett is a legend in Silicon Valley circles. And if you were going to look and say, "Well, who should be running this company and maybe would want to be running a company this into the future when we're a multi-trillion dollar company?" Again, I just don't know that it's going to be Sam. And Brett seems a really ob obvious choice. So, I wouldn't even say this is a prediction. I would say I'm more public narrative that I would usually reserve for conversations Mike and I have when I'm just throwing things around. But every once in a while, something just seems so obvious that it's worth at least sharing with our audience of just something to think about. I can I if I had to put odds on is Sam the CEO of this company 12 months from now. I would I don't think he would be. he certainly could be, but I just I'm starting to see more and more science that I don't I don't think it makes sense for him to be in that role, but we'll see. I have no inside information on that whatsoever. There's no sources I'm getting this from. It's just piecing stuff together. The jobs report, Mike, I thought was pretty interesting. So, along the lines of the Stanford one, I liked the effort they made here. So, they've been putting more and more work into their thoughts around the impact AI is going to have on jobs, trying to do more to project out. And this is what we've been calling for the last 2 years was we should at least be studying this. We should at least be considering that maybe this doesn't go smoothly in the near term and that we could lose a bunch of jobs. But the main way that it's often talked about is exposure levels. So, it's how I built jobs GPT. you look at the exposure of tasks that make up jobs and you say, "Can AI do this thing?" but what they're looking at is goes, layers on top of exposure and they look at demand elasticity, which is something I talk a lot about on the podcast, and then human necessity, which is an interesting one. So, in this report, they say they look at 900 occupations and which they're claiming is 153 million jobs or 99.7% of US employment. Now, that would have to include some part-time jobs cuz the full-time jobs in the US is around 136 million. So, I'm not sure exactly where all those are coming from, but they're looking at a bunch of jobs. And then they're trying to figure out how exposed are these occupations?
**** · And then does that exposure mean an impact on actual work? And so they, they look at this thing the demand elasticity where saying how much demand changes when price changes and is it connected to productivity and the impact on employment. So they're trying to look at in a simple way I think about this okay so AI makes us cheaper to deliver legal services for example because now we can use AI to do it. We can be way more productive.
**** · Does the demand for legal services increase enough to justify that we keep hiring more and more people? So now the argument a lot of these leaders are making that we talked about in last week's episode is software developers and engineers. So what they're saying is the demand for software is so significant that as the cost to create that software goes down, yes, we'll need fewer people to create the same amount of software, but we're just going to keep seeing more demand for software. So we'll keep hiring more people. And my argument has been all along that doesn't necessarily translate over to industries or professions where there just isn't a ton of demand to see. And so, if you're in a company where the demand isn't going to grow no matter what happens to your price, then you're not going to need as many people. So, that's what this report tries to take a look at. They're looking at this job transitions framework they created and they're asking four fundamental questions. Can AI do a meaningful share of the work if AI lowers the effective cost of providing the service or the product? Is demand likely to expand enough to absorb the productivity gain?
**** · Meaning cost goes down, is there enough demand for us to still need all these people? The third question is if it can for the remaining tasks, is a person still central to the work's delivery, judgment, accountability, or physical execution? And so they're not just looking at knowledge work tasks, they're looking at all tasks and then they're saying, is AI already being used meaningfully for these tasks. So then they use this approach to critically assess these 157 million jobs or 153 million jobs. And then they break the occupations into four different possible outcomes. So one is jobs with less immediate change are those where the current combination of exposure, necessity, and elasticity does not yet point clearly to one dominant near-term outcome. So there that is almost half.
**** · It's 46% or 70 million they're saying see less immediate change. Then they go into jobs that will reorganize that have high exposure and strong human necessity but demand is not elastic enough to absorb the productivity gains. That's 25% they say. And then 18% is jobs at higher automation risk. have high exposure, weak human necessity, and insufficient or ambiguous demand offset.
**** · So that's 27 million jobs in their breakdown that are at high automation risk, meaning these are the ones that could go away the fastest. And then jobs that will the fourth category is jobs that will grow with AI, have high exposure, but enough demand response to lower the cost may increase utilization, affordability, access, and quality adjusted output. but then one other note they did say our ability to forecast far into the future is limited and it is very difficult to project how much labor how much the labor market might evolve over the long term. On a shorter horizon this framework should help us envision how the labor market may evolve and what policy responses we can consider and implement to facilitate a smoother and more people- centered AI transition. So again, my point with highlighting some of this is I'm really happy that they're analyzing this at a deeper level. I won't I would love to see all labs doing this. Anthropic has been leading the way on this. So they're certainly doing this already. this is the research that needs to be happening at all the different labs and it's what the stuff that economists should be, aggressively pushing on. But even with all this analysis they did, they came out to we really just don't know. It's if you want the two long read version of this, we don't know, but we don't think just exposure alone is enough, which they're 100% on. So the need for humans in the loop for interpersonal communication between humans is critical. and then the need for understanding demand. So they couple examples I'll finish with. They said the least elastic occupations include firefighters and home health aids. And what they mean by that is that is needed no matter what the cost. we we have to have it. somewhat elastic occupations are physical therapists, editors, dental hygienists.
**** · You might just go less if it costs more.
**** · but if the cost goes down, the question, do I buy more of it if the cost goes down? And then the final some of the most elastic occupations include graphic designers and software developers. so yeah, it's it's really interesting stuff. If you're interested in the economic side of all this, it's a good read, good thing to throw in notebook LM. But I think my main takeaway again is a lot of people are starting to think about this at a deeper level, which is great. And no one has an answer. no one can say any what I always say is anyone who tells you confidently without debate that they know what's going to happen to jobs in the next ones two years is either misleading themselves or intentionally that there's a reason there's they have something to gain by you believing that.
**** · We try and sit in the middle and saying, "Listen, I hope that's what happens, that it doesn't in affect jobs." But I'm also trying to be very realistic that every scenario I look at where I think about this and say, "Yeah, but what about if demand doesn't go up for a company or a role? H how could you possibly create more jobs that fast?"
**** · and that's the part I just no one has the answer to. So, yeah, I think that's the big takeaway. Lots of change at OpenAI, lots more to come. the innovation is going to keep moving forward no matter what and they're starting to branch out and trying to understand the impact that their models are creating and it stuff they've thought about for a while. they did that GPTs or GPT study back in 2023 when GPT4 first came out.
**** · So it's not this is new. It's the first time they're thinking about it but they're starting to expand the way they look at it.
**** · So our third big topic this week is we saw several stories that weave together to paint a picture of the implications that AI agents might have on business at large. So we wanted to talk about a handful of these that we've been tracking that may give us some clues as to what AI agents could mean for business leaders. So first up, Open AI launched what they call codecs for almost everything. This is just an update to OpenAI's Codeex coding agent, but it's a pretty big one because it gives Codeex background computer use, meaning it can see, click, and type across any application now on your Mac while you continue working in parallel. OpenAI also released more than 90 new plugins for different types of software that your computer using agent can now operate.
**** · Second, we've heard more commentators and companies talk about this slang term going around called token maxing. This is the practice of maximizing AI token consumption inside a company. It's being used as a proxy in some cases for AI adoption. So, for instance, writer CEO Mayh Khabib, who we've talked about before, told the Wall Street Journal that stuff token maxing to drive internal AI usage is existential for her company. So writer and several others are in the camp that you need to be token maxing. However, there are critics at some other companies that call this an empty metric and an invitation to waste. Now on top of that, interestingly, token maxing can have some unintended consequences. So we also had a story where Uber CTO Previne Naga told the information that the company burned its entire 2026 AI budget in just 4 months driven primarily by claude code and cursor. So Uber engineers were actively encouraged to use these tools. The company ranked them based on internal leaderboards based on AI usage and they just blew through the available budget. Now, a couple other stories that hammer home all the nuances people are dealing with. So, Business Insider reported that Microsoft executive Rajesh Ja argues that AI agents may eventually need their own software licenses just human employees. His logic is that as AI agents start to act as full employees, companies will naturally have to give them login accounts, email inboxes, permissions, and access to tools that turns each agent into a potential paid software seat. On top of all of this, Box CEO Aaron Levy posted a widely shared thread this week on what it takes to keep up with all this AI architecture. He argues that companies building on AI have to accept that they will be dramatically upgrading their AI architecture over and over again. He cited how there are infrastructure patterns rag graph rag and orchestration frameworks that we're all talking about as state-of-the-art 2 years ago but are already obsolete. To end all this, Amazon is living a version of this problem firsthand. Business Insider reports the company has a pretty serious AI sprawl problem. They have duplicate internal tools, disconnected data, and a ton of growing operational challenges around AI agents. So Paul, the reason we mentioned this is there are these threads coming together here in these very broad strokes. So first we're seeing these agentic coding platforms codeex and quad code develop into generalpurpose agentic platforms for nontechnical knowledge workers. But as these agents begin to permeate the enterprise, they're raising all these really difficult questions and totally unanticipated consequences around budgets, permissions, and architecture. And I'm just curious, it sounds some of the top companies in the world do not have this figured out. Does anyone have this figured out?
**** · No, I don't think anybody has this figured out. I think it's presented as though people have a grasp on what is happening, but every day that goes by, you realize just how early we are in the integration of agents into workflows and businesses. And it can feel you're way behind because people are out on the frontiers trying all this stuff. but they're also the ones who are going to find all the potholes and take on a lot of risk. So, a few notes here. Mike, I think you did a great job just summarizing. we probably could have honestly just broke that entire recap you just gave into the whole episode today and just talked about each of those.
**** · But at a high level, here's a few observations I have. So, one, the accelerated capabilities, it's every week there's just new capabilities of these agents and when you connect the agents together and it is overwhelming. even for someone me who's paying attention to this stuff, reading about it constantly, anor analyzing it every day, it's it's moving so fast.
**** · almost it's almost impossible to just keep track of what is going on. The second is increased risks and unknowns especially as the power of coding is in the hands of non-coders and technical staff. What by that is there was one that popped up today that really caught my attention. So I have talked a lot recently about lovable. So it's an example of an app building tool, non-coding app building tool that I've mentioned on the podcast that I've used a few times to build some things. It's really cool. if you've never used it, it functions chat GPT. You just go in and give it a prompt, but then it builds something. And so, in my case, I built an org chart was an example. I built an assessment tool. And when you build these, it stays private by design, but then you can share it to allow people to have access to the app. So, any non-coder would assume if I'm sharing a link, I'm just sharing a direct link to that app and so people can experience it. I'm not thinking I'm giving them access to everything else behind it. So, I see a tweet this morning as I'm standing in line for my plane to Vegas and it's this random user. Impulsive is the name of the X account. Weezer OS int is the actual handle. So, when you first see something this, I would never retweet this stuff away. I was who is this person? Is this legit? you got to do a little homework. So, I'll read you what it says. It says, "Loveable has a mass data breach affecting every project created before November 2025.
**** · I made a Lovable account today and was able to access another user's source code, database credentials, AI chat histories, and customer data are all readable by any free account. NVIDIA, Microsoft, Uber, and Spotify employees all have accounts. The bug was reported 48 days ago. It is not fixed. They marked it as duplicate and then left it open." And I was "What?" "That cannot be possible." And then sure enough, 5 hours later, while I'm now on my connecting flight, lovable tweets without acknowledging the original. We were made aware of concerns regarding the visibility of chat messages and code on lovable projects with public visibility settings. To be clear, we did not suffer a data breach. Our documentation of what public quote unquote means implies what public implies was unclear. and that's a failure on us. Specifically for public projects, chat messages used to be visible. This is now no longer possible.
**** · importantly for enterprise customers, being able to set visibility to public for new projects has been disabled since May 2025. So, I was I read that I'm okay, maybe I'm just not smart enough to understand what that post means. But our documentation of what public implies was unclear. It's okay, well, first that seems super condescending, but again, maybe I'm just reading more into this than I thought because this isn't cyber security isn't necessarily my area of expertise at all.
**** · So then I was oh no, I'm not an idiot. That 2 hours later, they have to now post again and correct themselves because I think other people also were what the hell are you talking about? So this is their follow-up. We're sorry our initial statement didn't properly address our mistake. Here's what a public project on Lovable means and how we got to where we are today. In the early days, people didn't know what Lovable was capable of. So, we wanted to make it easy to explore what others were building as a way to spark ideas and lower the barrier to getting started. scrolling GitHub or Dribble, which again non-coders don't use. You browse projects to see what's possible, then go build. When you create a project on GitHub, it explains how it's default to public, whatever. And then they said, "Over time, we realized this was confusing. Many users thought public just meant others could see their published app. I wonder why because that is the most logical assumption, not the chat of an unpublished work. So in essence, you go in, you build something lovable. You're "this is cool. I'm going to share it with somebody. I let's say I'm doing it." I send it to Mike "Hey, dude, check this out. Check out this quick demo." Mike gets on there.
**** · Mike can see my entire chat history. He can see everything I've done in the thing. And it's that is not at all what I intended to have happen. No. So again, I highlight that one as an example of we just don't know. And Lovable is a really good company with tons of funding and the things you would normally do to say "Okay, is this a legit company I should be building something with?" They would check the boxes. And yet they made this apparently intentional choice to it certainly seems deceitful to me, but I guess they didn't think it was. And then the other one, and maybe Mike, we'll go into this one more next week. but this Versel hack is insane. So, I'll just read this quickly, and again, I'm trying to highlight all the unknowns around the use of agents is my whole point here. But there was this very high-profile thing that happened with Versell. And so, here's the CEO's post.
**** · He said, "Here's an update to the broader community about the ongoing incident investigation. I want to give you the rundown of the situation directly. A Verscel employee got compromised via the breach of an AI platform customer called context.ai that he was using. So employee has access to this things connected to it. The details are being fully investigated through a series of maneuvers that access that employee giving access to context.ai from the colleagues compromised Verscell Google Workspace account. So they connected to their Google Workspace account. The attacker got further access to Verscell environments. We believe the attacking group to be highly sophisticated and I strongly expect significantly accelerated by AI. They moved with surprising velocity and in-depth understanding of Verscell. So again, these are this is a very knowledgeable company that you would think would be better than most at avoiding unforced errors when it comes to cyber security and they let it happen. So my thing here is again there's no debating the impact agents are going to have in enterprises it is very obvious but there's just there's so many growing pains we have to go through and so I would just think very deeply as a as a leader or maybe as someone who's pushing for these kind this access within your company you see the potential and you want to set up co-work and claude code and open claw you want to do all the things you got to understand what it is you're doing or the organization has to understand all the risks that come with doing these kinds of things. And if you're relying on outside consultants or agencies to do this work for you, you got to know what they're setting up, what access they're you're giving them and in turn what access and technology they're integrating into your systems. And this is why AI is not just tools. It is change management. And it gets into impact on org structure and tech stacks and the roles within companies. And that's when I say I don't if you ask me just point blank how many companies let's just take enterprises let's take companies 250 employees or more do I think fully understand generative AI let's just keep a genic AI on the side for a second fully understand the capabilities of generative AI platforms and have properly integrated into their companies to where they can really scale transformation how what percentage do we think that is it's single digits you can't convince me otherwise at this point it's low single digits I Think if you then extend to that say okay which ones are prepared to integrate a gentic technology and scale it within their company in a responsible way it's well under 1%.
**** · Easy.
**** · Yeah.
**** · So my point is we do our best to highlight where this is today and where it is going. Do not feel if you're not racing into agents you are way behind. you're not. It's okay. if you're a Y Combinator company or a VC backed AI native company, this is expected. you're not getting funding without this stuff. But if you're a legacy enterprise that's trying to drive transformation in a responsible way, agent stuff this is going to be sandsbox through a technical team for the next 12 to 24 months before you start seeing this stuff really living in the wild. So yeah, it's just I don't know.
**** · I said, it's moving so fast, Mike. And I really do each week have a hard time comprehending some of the things that are happening.
**** · Same. And I keep wondering to myself too, I wonder what happens when we finally have the clawed code moment for something co-work or another one of these tools for non-technical knowledge workers. And by that all this stuff we've been talking about since the beginning of the year with AGI and people wondering something big is happening. That's all been driven by the fact that Claude Code was out for a year before this combination of factors late last year meant it got really good and everyone started positively and negatively freaking out. I think that's going to happen for something whether it's co-work or another tool for non-technical knowledge workers. People are going to have meltdowns andor move 37 moments we've talked about and I don't think anyone's prepared for whenever that happens.
**** · Yeah.
**** · And it is, I think there's also just this the technical side of how these models perform and how the harnesses are built around them in terms of allowing the capabilities that they're going to have. And I saw a tweet from was Ethan Malik, I think, was saying it doesn't make any sense. if you look at Gemini's model from Google, it's so good through the API, but it's not even close to competing with Claude when you're in the app. it's just a totally different beast because anthropic excelled at putting the harness around those agent capabilities and the model capabilities and then bringing them to the actual user interface where Google has been really struggling to keep up and the things we're seeing now I think I put the information article in for next week that there's a code red again internally at Google where Sergey Brand is taking the lead and because all the people at Google are using cloud code because their own coding capabilities are on not on are.
**** · So, yeah, again, the labs themselves are struggling to keep up. We talked last week, I think about meta using cloud code internally and take this from me, I'm quitting. I need these capabilities. So, wild. It's really incredible. Yeah. Nobody's got it all figured out.
**** · Not even close. Paul, before we dive into rapid fire, a quick announcement. This episode is also brought to you this week by our AI for writer Summit. So, the future of storytelling is being rewritten thanks to AI, which is why we're super excited to be hosting our annual AI for Writer Summit on Thursday, May 7th. This is a half-day virtual event for writers, editors, content teams, anyone who does any type of writing or content creation as part of their job. And during it, we'll have some incredible speakers breaking down exactly how AI can help you create smarter and faster. and importantly do all that without losing the heart and soul of your writing. This event has a free registration option. So go check out your registration options today. We've got the full agenda live.
**** · Go to a writerssummit.com.
**** · That's a writerssummit.com.
**** · Paul. First breaking rapid fire happened just before we jumped on today. Apple announced that Tim Cook is stepping down as CEO effective September 1st, 2026 after serving in the role since 2011. John Turnis, Apple's current senior VP of hardware engineering, is taking over as CEO and joining the company's board of directors. Cook is transitioning to the role of executive chairman, while Apple's current non-exec chairman, Arthur Levenson, will become the lead independent director. So, Turnis holds a bachelor's degree in mechanical engineering from the University of Pennsylvania. In a press release, Cook expressed his full confidence in the transition, stating that Turnis had the mind of an engineer, soul of an innovator, and the heart to lead with integrity and with honor. The transition marks the end of a nearly 15-year run as CEO for Cook, who took over directly from Steve Jobs. Under Cook's leadership, Apple's revenue almost quadrupled to over 400 billion.
**** · His tenure saw the company push into wearables with the Apple Watch and AirPods, as well as the Vision Pro. And Paul, I think a big thing we don't know much yet, but something everyone's going to be watching is how Turnis is going to handle Apple's transition finally hopefully into the full AI era. What do you think is likely to happen here?
**** · Yeah, hopefully. I I think the timing of this maybe comes as a surprise, but this has been rumored for well over a year that Tim Cook was, probably in the final stages of his career here. So, not necessarily shocking, I wouldn't say. And I think Turnis was rumored to be the main candidate here. So, Bloomberg did have the internal memos. So, I'll just read a quick excerpt from Tim Cook's memo. So, today we have a truly extraordinary road map. I've never been more optimistic about Apple's future.
**** · That is why I've decided that now is the time for me to transition to a new role of executive chairman. I'm thrilled to announce that John Turnis will be our new CEO. Throughout the many years I've worked with him and our many conversations about his becoming Apple's next CEO, John's passion and love for Apple shine through. He is a visionary in his own a man of remarkable integrity and the person we can all be proud to follow. he's going to, Cook's going to remain as co through the summer and work very closely with John as they transition roles. and then they said they would have a town hall in the Steve Jobs Theater a.m. Tuesday morning, which a.m. Pacific, I assume. So, I'm sure we'll be hearing plenty more about this in the coming days. And I just glanced at after hours trading and the stock is flat. It's nothing's happening after hours that Yeah. So yeah, I don't know to be continued, but I do think, from our perspective and what we do on this podcast, what this means to their AI roadmap, my guess is that's pretty locked in for the next 12 to 18 months knowing Apple, they don't just do things change on a dime necessarily.
**** · So I'm sure whatever decisions Tim Cook has made leading up to this point, John was heavily involved in those decisions and there's continuity in terms of their AI roadmap, but we'll definitely keep everyone informed what we learn about that in the days and weeks ahead. Next up, Anthropic has launched a product or feature capability called Claude Design, which is a new collaborative visual design tool powered by Claude Opus 4.7.
**** · This lets users build designs, prototypes, slides, and marketing materials just through conversation. it automatically applies team brand guidelines by reading code bases and design files. It accepts text prompts or uploaded documents and exports designs to Canva, PDF, PowerPoint, and standalone HTML. It is available now for Claude Pro, Max team, and Enterprise subscribers. Interesting drama here.
**** · This launch was telegraphed a couple days earlier because Anthropic CPO Mike Kger the information found out they published some reporting on the fact that Anthropic was preparing both Opus 4.7 which we'll talk about at the end in our product updates and a new AI design tool. And on that same day, Kger resigned from Figma's board of directors, which is a seat he'd only had for about a year. Figma being a design and prototyping tool had previously collaborated with Anthropic to integrate its AI models into Figma's products. And that's really interesting because Creger has said in the past that the largest AI labs will come to dominate software businesses and that this thesis alone has rocked public markets at times this year which we've talked about because we've talked about Wall Street has been undergoing a bit of a SAS apocalypse lately where major AI companies are supplanting established software businesses by just building their capabilities directly on the model layer. This appears to now be happening in design. So, Figma, Canva, which is not publicly traded, Adobe, Wix, all of these emerging design startups. Paul, this seems this just straight in the cross. I don't know how much more blatant this can get. if something Claw Design does in fact work as advertised, what does that mean for these software design companies, for designers even at large?
**** · again with the the roles and the giving designers superpowers I do think it's what we've always said great designers who learn to work with these tools are going to become even greater they're going to have superpowers to be more creative more innovative more productive things that and designers who don't it's going to be really hard to compete from a pricing perspective from an output perspective from a turnaround perspective the expectation is just going to be instant I can honestly say this I feel this myself when something is taking a really long time internally. I'm what why what is our reason for something taking a long time internally?
**** · because I feel those barriers are just gone and if it's design related or content related my patience is very thin because I know what's possible whether that's working with outside design teams or whatever I was I'll do it myself I'll just go into cloud and I do it myself this is taking too long. So, I feel more and more people are going to get that you're going to realize what you're capable of doing on your own.
**** · And if people aren't doing it, you're going to get really annoyed and you're going to want things to move a lot faster. Now, that being said, there's also times as a CEO when I want deeper human involvement and I want it to take time. I want to have the patience of allowing the human involvement in a process when that's better. But when it's just an output we just need, you just want to go. Yeah.
**** · Figma's stock, many SAS stocks, is down, I'm just glancing now, last three months, 29%. I think it's going to be hard for them to rebound. I think it's going to be really hard for a lot of these software companies to rebound. And we've talked extensively about this. I think it was episode 197 was the SAS apocalypse. We went into great detail about this one.
**** · yeah. And for us, Mike, we were just in a meeting, was it last Monday, a week ago, where you were talking about you and Taylor were using a claude skill to do design of slides. And as soon as I saw this, I was "Oh, the skill probably just got obsoleted." now we could probably just do it within the thing. I don't know.
**** · Unlike Figma, I'm ecstatic about that.
**** · The more you can obsolete me having to do anything with this, the happier.
**** · And Claire on our team was already throwing into our Zoom chat something she built. She said would have taken her hours. She did it in 20 minutes using this already. And then I'm sure we'll drop a Gen AI app review soon in our AI Academy because this is this is the exact scenario of why with our AI Academy we're so focused on real time education. It's not we have this 3month roadmap and that's all that's going to come out. It's if a new tool comes out and it's hot and it's something that we're all excited about boom, let's get a Jenna app review put up in the next one to two weeks. So, I'm sure our team is already working on getting a gen app review out of cloud design because these are the kinds of things that we want to be able to do. It's let's talk about it on a Tuesday and have a review of it on a Friday. I don't know if I'm not promising that we're going to have that out on Friday, but that's the concept here that we feel our education has to move at the speed of what these AI models are enabling, but yeah, it's very challenging time to run a tool. And I what does Sam always say, Mike? It's if updated models don't make your company or product better.
**** · Yeah.
**** · then you don't have a company anymore. just assume that they're going to make everything better. And if that is going to obsolete what you're doing, then just switch gears now and do a different company.
**** · you need to be building something that gets better. So to your point, Mike, you don't care in your job as chief content officer if something comes out and takes your skill from one hour of doing it to 2 minutes. Awesome. That just made me better at my job. So the thing you built fine. we weren't monetizing it. It was an internal tool.
**** · Yeah.
**** · So yeah.
**** · Yeah.
**** · It's interesting too. It's probably a whole other topic or an ongoing one. Just that idea around expectations. if you're if I had one piece of takeaway advice for anybody and what I take away constantly from these types of conversations is just you ignore changing expectations at your peril. Whether you're an employee, an entrepreneur, a leader, a practitioner, whatever, you got to realize the game has changed and people are having these conversations behind closed doors or on this podcast of hey, that timeline or that limitation, I think we can do better than that now based on what we have available to us.
**** · Yeah.
**** · And there's two scenarios now. I'm there's the you're a listener and all this stuff and your boss doesn't yet. And so you are a superhero. You get stuff done so fast. It's always amazing. They're always "Oh, you're just overd delivering. You're so incredible."
**** · And then there's you have the AI forward leader who's "Why is that taking so long?" Because I know it's possible. And so it's a it's a tough environment, but you're probably in one or the other now. everything you do seems so fast to everybody or it's not fast enough. And you could be in either one in the same week. I can tell you that.
**** · That's for sure. Because I might not know something is possible and then Michael will show me something's possible and it's wait a second, why doesn't everyone else know that's possible?
**** · **** · So, next up, after months at this point of open conflict, we might have seen this week a real thaw between Anthropic and the Trump administration. So, Bloomberg is reporting that the White House was moving to give US federal agencies access to Anthropic's new mythos model.
**** · And the very next day, Anthropic CEO Dario Ammedday met at the White House with chief of staff Susie Wilds and Treasury Secretary Scott Bessant was also present. So if you recall, things even just a few weeks ago were very bad. Anthropic was blacklisted by the government, called a national security risk. They're suing the Pentagon over that blacklisting. the White House, however, has called Friday's meeting productive and constructive. It seems Mythos is at the center of this. So according to Anthropic and what we've talked about, Mythos can identify weaknesses and security flaws in software. And Anthropic has only released it to a small group of tech and financial firms as part of its project Glass Wings, its cyber security initiative, because they're worried that hackers could use this model for very malicious purposes. So apparently Gregory Barbakia, who is the federal CIO at the OM, emailed agency officials this week saying his office is setting up protections that would allow agencies to go access mythos. And it turns out based on a separate scoop by Axios, the NSA is already using Mythos despite the blacklist. So Anthropic at the same time is also spending heavily on some Washington influence. Bloomberg government reported Anthropics hired Ballard Partners, which is described as a very big shop in Washington with very strong ties to Trump to lobby what the administration now calls DOW procurement, meaning Department of War.
**** · So, long story short, Paul, you predicted that the White House and Enthropic have to make a deal given how integral the company's tech is to both how the federal government works and national security at large. Is this the beginning of that? Gotta love politics.
**** · No kidding.
**** · So predictable sometimes.
**** · Yeah.
**** · I think I said last week that mythos just demonstrated how absurd this whole government effort to blacklist them or blacklisting them was and that they were always going to have to come back around, but they were never going to admit they were at fault. So they're going to have to find some negotiated offramp that can make it look the administration got what they wanted out of it and proved their point and they weren't wrong. and as long as Daario is willing to allow that to happen, then, I think it's only a matter of time before that court case just magically falls to the back of the back pages, I guess.
**** · Yeah.
**** · yeah. I again I just go back to it's such an insane thing that they're trying to do this to one of the three companies in the world that if you were to list the most important companies to the US government anthropic is easily in the top five now. so yeah they have to have an offramp.
**** · They have to find a way I'm shocked that we're 4 days from this meeting and both sides have managed to keep it relatively quiet what was discussed. He was there for a while. I thought for sure by Friday night we would have something leaked and it they've kept this under wraps which is pretty impressive for this government and for Anthropic has every motivation to keep it quiet but yeah so I don't know I hope again regardless I hope they find a way it's anything else regardless of what you think about politics and different this administration previous administration at the end of the day if you're American you want the most valuable American companies playing an important role especially when it affects cyber security of every citizen and every business and so you want a deal done we want this over we want them to move forward and find a path to work together despite their differences yeah it's interesting one of the quotes from the Axio article that a Trump adviser told them they said quote this is a big problem everyone's complaining there's all this drama so this got elevated to Suzie to hear Daario out determine what is BS and start to plot a way forward so I think behind closed doors they're not as unified as you would think yeah I that's probably pretty safe to say I can't imagine the NSA is super ecstatic that they're supply chain risk for them they know how important they are so next up we've got an interesting story developing and a conversation around Nvidia so for the last few years the US government has restricted which advanced AI chips Nvidia is allowed to sell to China. The goal here is slowing Chinese AI progress by limiting access to the best hardware.
**** · Nvidia has responded by designing downrated chips specifically for the Chinese market. The latest one is called the B30. And it's been this ongoing debate whether the US should be selling these chips at all, whether Nvidia is helping or hurting American AI leadership by doing it. And this week that debate blew up a little bit in AI circles because there was an interview from Dwar Patel who hosts one of the more influential shows in AI where he had Nvidia CEO Jensen Wong on for a wide-ranging conversation as Dwarvesh typically does but everything was dominated by this segment about China. So in it, Dwarvesh played devil's advocate and pushed Jensen on why selling chips to China is a good idea. And Jensen pushed back uncharacteristically hard enough that this spent the next several days being dissected across X. So Jensen said, look, people assume China is far behind or US policy makers do and it is not. China manufactures 60% or more of the world's mainstream chips.
**** · Roughly half the world's AI researchers are Chinese, hearkening back to that Stanford report. And the country has abundant energy. And so he said, look, victimizing China, turning it into an enemy isn't the best answer likely. They're an adversary. We want the US to win, but having a research dialogue is probably the safest thing to do. And so Daresh pushes back and says look if you're these AI models themselves he's pointing to anthropic mythos and saying look these are weapons Anthropic is not deliberately is deliberately not releasing these to the public because of cyber security risks. So if Chinese labs, which they have said publicly are bottlenecked on compute, doesn't go selling them more chips, let them build more of these weapons. And he framed it as enriched uranium, material for nuclear weapons. And Jensen called that analogy lousy and illogical.
**** · And he accused Dark Cesh of a loser mindset, arguing that the idea that Nvidia would inev inevitably lose the Chinese market is defeatism, not strategy. He says, "Look, we should be able to compete in China aggressively. Keep America advancing the American tech stack and make sure that Chinese AI developers keep building on Nvidia's architecture rather than, national champions in China Huawei." So his fear is that if the US forces Nvidia out of China, Chinese developers will spend the next few years optimizing models for non-American hardware which will not benefit the US long term. So Paul, the reason we want to talk about this is it's getting a lot of attention in AI circles. It shows there are some really different opinions on how AI infrastructure might or should play out.
**** · What's important to take away from this?
**** · first just shout out to Daresh for pushing on this. that you don't you don't see these kinds of hard-hitting interviews of these tech leaders very often.
**** · If you do, it's the last time that they're talking to that person.
**** · But Darkh has a very strong reputation in in the AI industry in Silicon Valley. He's extremely knowledgeable about the topic, so he can grill on very specific technical details of these chips. and he did and he didn't back down. it got uncomfortable. It's one of the first times I' ever seen Jensen flustered he was pissed, it seemed.
**** · Now, I'm sure he probably appreciated the intellectual challenge of the conversation, but he was not taking kindly to the insinuations that Daresh was making. And so, it just made for really fascinating conversation and I was I got to be honest, I wasn't really fully understanding the full conversation. I follow the space pretty closely, but there was definitely just technical stuff they were getting into where I was "Wow, yeah, I got to I have to go do some homework on this one to really understand how, what he's challenging him on here." But at a high level, it is another good example that the brightest minds in AI don't necessarily agree on some of the very fundamental things about the industry about definitions of AGI about whether or not large language models are the path forward and are going to get us where we want to go or if we need world models Yan Lun would said whether or not jobs are going to go away or more jobs are going to be created whether we should sell chips to China nobody knows but the fact that we're in a place in society where we're having these conversations openly to me is fantastic.
**** · that's what should be happening. there should be dialogue about really important issues and then listening to each other and allowing those conversations to happen.
**** · So to me more than anything I don't have a personal opinion I I tend to side with Darkh when I listen to the arguments and I listen to Jensen's responses there's something missing from his responses. It's when he gets asked about jobs or when, some of the other labs leaders. I just feel there's some nuance missing either because they're intentionally leaving it out or because they really believe that they're **** · and I feel that way about the chips in China. And if you remember the government wasn't allowing this, this was not something. And then Trump didn't even know who Jensen was when he retook office. we covered that on the first episode. He had no idea who Jensen or Nvidia was when his this current term started and they're "Well, you should know him. He's one of the 10 richest people in the world and he's probably the most important company in America now." and then he got to know him and then they re they removed the restrictions after that. So, yeah, I don't know. I this is a really challenging complex environment. Gavin Baker, somebody we talked about that we really and follow. Yeah.
**** · he's with Jensen on this and I again I just the people I follow who I trust on topics and I try and process what they're saying and I just I don't know that there's a clean answer to this one. And honestly I don't know we're going to know who was until it's too late. maybe Jensen's and Jensen's convinced the administration they should sell them and they're going to sell them and then a year from now we're going to realize that was a bad idea.
**** · Yeah.
**** · I don't know whether they admitted it at that point or not. I don't know. To your point though, it did show a different path from what we typically have. look, I get it. CEOs don't want to go on podcasts and get into huge arguments that drop their stock price or whatever, but it was eye opening because you look back and you're we're not getting a lot of this push back and debate on a lot of these big podcasts. Understandably so. They probably don't get you burn your sources 100%. So, it's not a, judgment against any of these people, but you're man, I wish we had more of this.
**** · Yeah.
**** · And I do the one other one that came to mind that I had a similar feeling with was on the when Gersonner was questioning Sam Alman.
**** · Yeah.
**** · And it was that same uncomfortable wow, he's pushing hard on this one and Sam's about to walk out of the room thing. And I've seen that with Elon Musk a few times where he just got pushed on something. He's screw you, man. I don't need to sit here and take this. So again, I kudos to these leaders who are willing to sit down and have the hard conversation.
**** · There's one Sundar Pachai recently where he was doing one I don't remember the name of the podcast, but it was a weird one cuz they're sitting there drinking Guinnesses together and stuff.
**** · Oh, it's the one with the Collisons, from Stripe, I think.
**** · Okay. Is that what it is?
**** · I think it's one of them. The Collison brothers that does I forget what it's called.
**** · They ask them hard questions. Yeah. And I I to me I when leaders take the hard questions and provide thoughtful answers even if they're they're not fully baked answers. just the fact that it's listen this is a hard one. We're working through this. but I know how hard it is for PR and comms people to book that interview because if you don't know how this stuff works you go through the PR and comms people, you try and convince them to do the interview probably have to tell them what the questions are going to be. it's not easy to get those interviews unless you're just buddies with them and they go around the PR and comm's team. And if that's what happened here, that's the last time that's happening. That was that was the other thing I took away.
**** · It's either he was willing to get into this debate or the PR and comm's team did not tell him what was coming or Daresh went around what he told he was going to cover cuz some one of those had to be true in this environment.
**** · Yeah.
**** · So, next up, the web analytics firm Similar Web published its latest Genai traffic share update this week. So, we've covered this a few times in the past. They measure the actual share of traffic across major Genai consumer products, and it's really interesting to see how quickly these can reshuffle. So, over the past year, Chat GPT is still in first place. but its dominance has eroded significantly.
**** · Gemini, Claude, and Deep Seek have all picked up meaningful share. 12 months ago, Chad GPT had just over 77% of Genai traffic. A month ago though, however, again, year-on-year, that number was just over 56%. In the same time frame, Gemini went from 6% to over 25%. That's more than quadrupling. Grock went the other way, going from just over 7% to almost 4%. Perplexity was flat at 1.6%. Copilot went from 1.38% to 1.99%.
**** · Claude had a pretty dramatic short-term jump, though it's still pretty far behind. 12 months ago, it was at 1.4% of traffic. 3 months ago, that was about 2.22%.
**** · And then it nearly doubled in a single month to cross the 6% mark. So that lines up with all this interest we have started to see in things cloud code and the latest Opus 4.5 through 4.7 models. So Paul just really interesting to see how quickly in a year things can change here. Obviously it's not a perfect benchmark of how much usage is happening but interesting to see Gemini really eating into that share.
**** · Yeah, there's a deja vu if you go back, you've probably seen it, Mike, but those that's this awesome interactive buy bar chart where it shows the browser wars through the years and Chrome non-existent and then all of a sudden boom and Chrome just comes to dominate.
**** · You start getting that feeling here where they're they're late to the party and then just they just eat away over time. And I'm not saying Chrome's going to end up with 92% of the market share here, but yeah, it's definitely balancing out. It's it's fascinating to watch these things move and to see the jump specifically with Gemini and Claude.
**** · Well, just overall too, even us then also on the Converse side talking about the new Google, Code Red, it's never sit here and say because of a headline, someone is dead in the water or someone is winning now. It's that'll change next week.
**** · Yeah, for sure.
**** · Paul. So, next up, we're gonna do our AI use case spotlight we've started to do every week. So, as a quick reminder, we hear from listeners all the time that one of their favorite parts of our discussions is when we talk about how we're using AI ourselves, both sometimes at Smarter X and in our own lives. So, each week we're going to try when we have time to give you a dedicated look under the hood at some real AI use cases we're exploring or deploying in our own work. So, Paul, this week first up, I can share some use cases that jumped out to me quickly this past week. if you have anything you've been working on, we'd love to hear it, too.
**** · Yeah, sounds good. Go for it.
**** · Awesome. So, first use case that I found that was pretty impressive was using Claude code to prep a bit for a talk I'm giving at a conference this week.
**** · So, I talked I think last week about how I had used it strategically to prep, but this was something much more mundane. So in this talk I'm going through 40 different AI tools in 40 minutes which is a talk I've given before super fun but you have to make it from scratch every time you do this. To make that talk work visually I needed dozens of product images, screenshots and logos. It is a nightmare to pull these manual. I've done it for years now. It takes hours. It sucks. It's stupid. And obviously you can't reuse stuff because screenshots change. So there's very little you can reuse from past talks. So I had Claude Code try to fire up multiple agents in parallel in a few different waves. So the first wave was four sub agents. Each of them were charged with pulling product screenshots, press kit photos, and hero images in order. So you're going through and saying, "Do these images exist online that you can find?" as the second wave, it starts looking up URLs with the forward slashpress or forward slashbrand because sometimes it just doesn't find images and there's a press page or something. And then the third wave was a final fallback which is hey, go just grab the logo from Wikipdia comments, which is pretty publicly available. Took 15 straight minutes, which is longer on the longer side for some of the stuff I do. And I got a lot of really good usable images. not remotely perfect, but just the fact that it could help me conceive this was super cool. And I just enjoyed the experiment of oh, this thing I never would have thought about. cuz you think oh, maybe it could go generate images. no, just go find me the files I need and it's downloading, dozens of files to my home computer just being here's the images. And some of them are good, some of them are bad, but definitely saved me time, which is near and dear to my heart. Mike, you and I have both spent months of our lives going online looking for logos and product shots for present.
**** · I don't even want to quantify how much time I have waste.
**** · The logo and the logos is probably the one to really highlight because that's a lot easier to standardize and find online. It was dropping the ball sometimes with different types of images just because it didn't have that artistic sense of what I was looking for, but even then it was still really helpful. one other quick thing which I will not advocate everyone goes crazy with, but this was a personal one where I use an accountant for our federal and state taxes, but we typically file our city taxes on our own. And it's super straightforward Kudos to Lakewood, Ohio. It's pretty simple. You go online, there's a few menus you go through as long as you have all your other stuff done. But it's taxes. I don't know what I'm doing. I am super prone to making mistakes. So, I fired up Cloud Code. I gave it all our tax returns and docs.
**** · And then I just screenshotted every menu that I was working through and said "Walk me through exactly how to do this so I don't screw anything up." It made it way faster. It was way less scary and frustrating if than if I'd had to do this on my own, which is nice. But here's the kicker. It caught something I almost certainly would have missed. So, one of our W TWS has a tax credit from a city outside of our home city. Long story short, you got to add this because otherwise you pay the wrong amount in city taxes. Claude Code caught this and if I had made this mistake, this would have cost me $1,100. So, the Claude Max plan paid for 6 months of itself. That's awesome. It's taxes.
**** · So, that's all I got this week. But, I'm not advocating you replace an accountant or anything. The accountant has been the best money I've ever spent in my life. Go pay accountants lots of money. They're helpful. But it was nice as a double check on something I had to do myself.
**** · That's really cool.
**** · the one I was going to share is just a quick simple one, but again, sometimes I think it's just good to hit these simple ones.
**** · I also have one related to a keynote I'm doing. So I'm doing the Movable Inc. Think Summit on June 16th in New York. I'm doing the opening keynote for that event. And I did this event last year. And I did the state of AI for business and marketing, five things every professional should know. And it's my standard keynote. And I do that talk 30 times a year at different conferences and private events. And then it's always customized for that event. But it's this Think Summit has a lot of returning people. So you want to go give the same talk again, but they also want an updated state of it's okay, what's new? Where are we at? What's changed since last year?
**** · So I don't want to just give the same talk again. But I'm what?
**** · What we really need to focus on is of that talk. The big thing that's evolving is really the onedimensional progress being the agentic. I just listen to this podcast. You hear it over and over and over again. So I came up with a session that I was calling state of AI agents and the augmented marketer. So the whole premise is giving marketers superpowers because it's for a marketing audience. And so I went into chatbt which I've used to draft abstracts before. I gave it the abstract for my standard state of AI talk. And I said, "Here's where I want to go with this one. I want to focus in on agents.
**** · I want to talk about it being, augmented marketers and how it's complementing what they're capable of. Can you use the format for my previous state of talk and write me an abstract for this one?" So again, if I was going to sit down and do this, I'm probably one and a half hours, Mike, between drafts, write the first one, tweak it, shorten it.
**** · It's not insignificant, but not a massive lift, but I didn't have time to do it. And they needed it that day. we had the, great call and it's "let's get the agenda live." So here's what it wrote. AI is entering a new era, one defined by agents. These systems can do more than generate content. They can reason, take action, and support work across research, planning, creation, analysis, and execution. For marketers, that means a fundamental shift in how teams operate and how value is created.
**** · In this keynote, Paul Ritzer explores what the rise of AI agents means for marketers and business leaders, where the technology is headed, and how organizations can prepare for a future in which marketers are increasingly augmented by intelligent systems. So it's perfect. That's it. So, I'll still Yeah, I'll still go through the high level of the things that are fundamentally happening in the space, but I'm going to then probably 80% of the talk is going to zoom in on the agent stuff and probably cover a lot of the things we've been covering the last couple months here on the podcast. So, again, it doesn't have to be automating 50 hours of work. sometimes it's just just give me that one hour because this is a heavy mental a cognitive load for me to write a fresh one.
**** · But I've said before on the podcast and I said it in my AI Academy courses, AI is better than me at writing abstracts. I'm and I'm okay with that. it is not something I fully and find fulfillment in writing. H plus this chat GBD it's trained on everything I've created all my courses all my stuff. So it knows generally how to write this stuff really well and honestly I probably would have turned it over to the marketing team otherwise say hey could you all take a shot at drafting this and I didn't need to just have Jad JPD do it.
**** · Yeah I it's been so valuable for abstracts. I love it.
**** · Yeah.
**** · next up, Paul. Each week, we are also trying to spotlight one of the courses in AI Academy and give people a real actionable takeaway from that course, whether or not you decide to ever take it. And this week's course, we talked about at the top of the episode, is AI for manufacturing. So, I figured I'd take a quick minute to go through what we've got going on in this course. eventually we'll be interviewing the instructors who do the courses. This one was by Taylor Raidy, our director of research, who offline I interviewed and talked through what was going on in the course and what we need to learn from it. So, I figured I'd dive into that real quick before we wrap up with some AI product and funding updates.
**** · So, AI for manufacturing really there's a core reframe that the course argues that no other function really faces in the same way.
**** · So there's this idea that manufacturing has a unique speed of reality problem so to speak. So you have things happening on the shop floor on the factory floor in seconds and minutes not hours or days. So there could be a small deviation in how a machine works some type of slight drift on how a tool works. And the key is by the time a human engineer has figured out what's going on, thousands of more units at scale often have already shipped perhaps with defects or something wrong that's degrading the machinery. And this is why you have endofline inspection that exists as a catch-all. It's why equipment gets run until calendarbased maintenance or until it breaks.
**** · So this idea is there's a loop.
**** · Something went wrong and then someone knew about it. Historically, that loop has always been too slow. And every downstream decision in manufacturing has been shaped around this delay. And here's the key insight is there's all these classic really useful frameworks that manufacturing leaders grew up on. Things lean six sigma or Kaizen. They're genuinely good at what they were built for, but they're built, and this is back to that expectations thing we talked about, Paul. They're built for a world where these things are measured in days, in weeks, in quarters. They structurally cannot keep up with failure modes that unfold in seconds. And that is unfortunately the reality of a modern connected plant. So that's overall the structural reason that AI is hitting manufacturing much differently. We're not really seeing your best engineers being outclassed by AI. It's more that AI with access to telemetry can watch 10,000 simultaneous signals in real time in the way that humans never could in the first place. So that's a really important highle point and how this shows up in terms of an actionable takeaway is the goal is to find the single longest feedback delay in your operation and that is the gap between when a problem occurs whether that's a defect deviation etc or even a supplier issue or an equipment anomaly and the gap between that and when a human becomes aware of it. So two final things I'll leave you with that are specific questions from the course that are worth sitting with as you consider how can AI be integrated into my manufacturing operations is one what percentage of your defects are detected during production versus after and which of your experienced engineers spends the most time manually reconciling data across all your different systems before they can even start troubleshooting.
**** · That is the strategic bottleneck that this course kicks off teaching you how to understand, diagnose, and resolve. So, go on over to academy.smarterx.ai. We'll also include the exact link to manufacturing in the show notes AF for manufacturing that you can go check out the individual course series or an AI Academy membership to learn more from Taylor who has extensive experience in the manufacturing industry about how to resolve these issues and how to move forward from here.
**** · Paul. So, we've got to end up here a bunch of different AI product and funding updates that I'm going to blitz through as fast as possible. And you said, I think last week or the week before, any of these probably could have been a main or rapidfire topic. We've got tons of them going on.
**** · Gets longer and longer every week. I say it really does.
**** · I'm going to say this. I think at some point we might have to just start doing these product and funding updates as a standalone podcast episode. It might be. I don't know, man. They're just they're every week it gets longer and we're cutting them too. It's let's just stop here this is enough 100%. Well until we phase this out we'll do one more this week but we'll keep going. I'm not saying we do it but it's getting there.
**** · But even this first one a small little thing called Claude Opus 4.7 launched this past week. So this is the new anthropic flagship model with meaningfully improved coding and computer use vision. It's now available across claude products. Claude Code at the same time also launched something called routines which puts Claude code on autopilot via saved configurations that run on schedules, API calls or GitHub events from Anthropic's cloud infrastructure. So it enables unattended work nightly backlog maintenance, alert triage and PR reviews. At the same time, Anthropic has restructured Claude enterprise pricing from a flat fee of up to $200 per user per month to a $20 base fee plus usagebased billing. Claude Code is unbundled and moving to per token pricing in that shift as well. At the same time, Anthropic has attracted investor offers at an $800 billion valuation. They've received multiple investor offers at roughly 800 billion.
**** · The company has so far resisted those, but that's more than double February's $350 billion pre- money valuation.
**** · Anthropic has also reported that they are automating alignment research using Claude. They published research showing that Claude Opus 4.6 can autonomously propose and run alignment experiments, though the study also flagged there were risks with reward hacking as part of that. That's it for anthropic news at the moment. Some other updates, Harvey has launched agents for endtoend legal work. So legal AI platform Harvey launched something called Harvey agents.
**** · They autonomously produce memos, redline contracts, due diligence reports, and slide decks across 14 plus different practice areas. Deepseek, the Chinese AI startup, is in talks to raise at least $300 million at a $10 billion valuation. its first ever funding round that's timed with the launch of its new V4 flagship model. Meta has hired AI engineer Joshua Gross from Mera Marott's thinking machines lab. Another person leaving that lab.
**** · They've hired him into their super intelligence lab. That makes him the fifth founding member to defect from Thinking Machines to Meta. AI chip maker Sarahass has filed for an IPO on NASDAQ under the ticker CRBS at a 22 to$25 billion valuation target. They're reporting about $510 million in revenue in 2025 which is up 76%. They have major customers that include AWS and Open AI.
**** · The Open AI announcement related to that is a pretty big deal there.
**** · Yeah, I don't think Jensen was happy about that one.
**** · Yeah.
**** · Microsoft has a couple other things going on this week as well. So they're apparently developing open claw inspired co-pilot features via an internal team called ocean 11. That's building a version of three of co-pilot that runs in the background and takes autonomous actions on emails, calendars and ro specific tasks. We are expecting a debut of that at Microsoft build in June. They also added new co-pilot inword capabilities for legal, finance, and compliance professionals, including track changes toggled from co-pilot, automatic tables of contents, and real-time visibility into Copilot's multi-step edits.
**** · Cambridge philosopher Henry Chevlin announced that Google DeepMind has recruited him for a new philosopher role focused on machine consciousness, human AI relationships, and AGI readiness. That's the first dedicated full-time philosophy position at a major AI lab. We'll see if that's the last. Google also launched Gemini for Mac, a free native Apple Silicon Desktop app that lets any user share any window for contextual help and includes integrated image, video, and music generation.
**** · Google is also bringing AI mode into Chrome on desktop and mobile, opening web pages alongside the AI interface instead of forcing tab switching and letting users add multiple tabs, images, and PDFs as context. And last, on the Google front, Google introduced skills in Chrome, a feature that lets users save reusable AI prompts as one-click Gemini workflows for frequently used AI tasks. One last Apple update here. Bloomberg reports Apple is working on a display free smart glasses called N50 for late 2026 or early 2027 and former AI chief John Gandria officially exits this past week after a year of resting investing following the disappointments with Apple Intelligence.
**** · And last but not least, Salesforce unveiled something called Headless 360, exposing its entire platform as APIs, MCP tools, and CLI command line interface commands. So AI agents can operate Salesforce without opening a browser.
**** · That was 17 or 18. I'm counting **** · That was 18 product openings or something.
**** · 18 product. And we definitely cut some of these. I promise you. And again, there's so many of these that I would love to riff on, but yes, we are we are past and I probably have some where I'm supposed to be, Same.
**** · Yeah, you're staying late at the office and I'm I'm probably supposed to be at a party somewhere.
**** · You're at an event or something, **** · one last announcement here, Paul. As always, we are running our AI pulse survey. So, go to smarterx.ai/pulse to see this week's survey and contribute to it. It is related to some of the top things we have talked about this week.
**** · So, we're going to ask a question about AIdriven search and we're also going to ask a question seeing if AI agents are starting to change at all how your team works or if you're still mostly using chatbased AI. Should be interesting to check those out.
**** · Paul, thanks again.
**** · Yeah, that was Thanks for doing this on the road, too.
**** · Yeah, man. Every week, dude. Well, and next week, I was looking at the calendar as we were going through some of those. We'll find a way to do next week's episode. Mike is at Experience Inbound Milwaukee, **** · Yeah. Yep.
**** · and I'm at Aquia Engage in Colorado. So, if either if you're going to be at either of those events, hit me and Mike up. Love to say hi in person to people. but yeah, we're we are both traveling. So, we'll figure it out. We'll we'll find a way to thread the needle on that one, too, somehow.
**** · Mike. Well, I will see you when I get back to the office later this week.
**** · sounds good, Paul. See you, everyone. Thanks for listening to the artificial intelligence show. Visit smarterx.ai to continue on your AI learning journey. And join more than 100,000 professionals and business leaders who have subscribed to our weekly newsletters, downloaded AI blueprints, attended virtual and in-person events, taken online AI courses, and earned professional certificates from our AI academy, and engaged in the Smarter X Slack community. Until next time, stay curious and explore AI.