Transcript
Intro
0:00 · If you concentrate power, the most powerful models in the world into a select group of 10, 20, 50, 100 companies, those are already the most powerful people and companies in the world. What happens when you give powerful people more power? Not great things for democracies.
0:18 · [music] Welcome to the artificial intelligence show, the podcast that helps your business grow smarter by making AI approachable and [music] actionable. My name is Paul Ritzer. I'm the founder and CEO of Smarter X and Marketing AI Institute [music] and I'm your host. Each week I'm joined by my co-host and Smarter X Chief Content Officer Mike Kaput as we [music] break down all the AI news that matters and give you insights and perspectives that you can use to advance your company and your career. Join [music] us as we accelerate AI literacy for all.
0:54 · Welcome to episode 222 of the artificial intelligence show. I'm your host Paul Ritzer along with my co-host Mike Put.
1:01 · We're recording on Monday, June 29th, 9:00 a.m. Eastern time.
1:06 · Who knows? Mike might get some new models. Might get new models delayed by the government, might get the government re-releasing models. They would I who knows these days, but I think there's going to be a lot of model news this week. Um, so usually what happens at the end of these as Mike's going through the product and funding news, I'm usually scanning on X to see if any major breaking news has happened. It would not surprise me if we got some news because we've got a short week with the 4th of July in the US. So I we're going to have to cram all the news into like three days this week. So we'll see what happens.
1:42 · All right, today's episode is brought to us by Site Improve. Uh, here's something worth paying attention to. AI search is already changing who gets found and most brands aren't ready for it. Site improve is the agentic content intelligence platform that helps marketing teams track, optimize, improve performance across both traditional and AIdriven search. Think of it as your visibility layer for the new search era. Grab a free AEO check at siteimprove.com/iipod.
2:15 · That's siteimprove.comiipod.
2:19 · And the week's episode is also brought to us by Meon. This is our flagship AI conference for marketing and business leaders that happens October 13 to 15 in Cleveland, Ohio, our hometown. make on is three days of keynotes, sessions, workshops, and conversations built specifically for marketing and business leaders who are actively figuring out how to adopt, operationalize, and scale AI across their organizations. We're expecting thousands of attendees at the Cleveland Convention Center this year.
2:50 · Um, use POD 100, that's pod 100, at checkout and you save $100 off um on top of locking in the best rate available. So visit maycon.ai to register. That's m a n.ai.
3:06 · All right. We kick off every weekly episode with an AI pulse where we recap the um responses from our informal poll each week. So uh this week we're looking at the questions from episode 221. We said as AI usage climbs, companies are starting to watch the bill. How is your organization handling AI costs? Uh 43% said watching closely but no limits yet.
3:31 · 29% honestly no idea what we're actually spending. [laughter] I like the honesty. Uh 24% starting to set caps or budgets on usage. And then a very small amount spending freely. We want everyone using AI more. Uh the second question from last week's pulse was Microsoft CEO Satia Nadella argues that as AI gets more powerful, human skills get more valuable. not less. Do you agree? 55% true for some roles, not others. 31% yes, judgment and direction are the scarce thing right now. 15% no.
4:09 · AI is steadily eating what made humans valuable. Um, and then nobody was not sure on that one, so everybody actually answered. All right. So um for the first main topic as of Friday morning I think Mike we had the story was the Trump administration prevents the release of another Frontier model and then I think it was Friday we then got the plans for the release of GPT 5.6. So, I had predicted on episode 221 we might get multiple models last week and then we got into like late Thursday, early Friday. It's like, oh, it looks like the government may slow roll all of these models. And so, we did get a preview of a new model and I guess we'll start there this week.
GPT-5.6 Begins Controlled Release
4:55 · Yes, Paul. So, this past week, OpenAI previewed its next generation GPT 5.6 model family. There are three models under a new naming system. So, they have a model called Soul Sol. This is their flagship model. There's a model Terra T R A balanced model for everyday work and Luna a fast lowcost option. OpenAI calls Soul its strongest model yet with improved agentic capabilities in coding, biology, and cyber security. There's also a new effort level of max reasoning and a new ultra mode that uses sub agents to accelerate complex work. But as you alluded to, the bigger story here, not only getting these awesome new models or previews of them, but how they're being released. Because instead of a normal launch, OpenAI is starting with a limited preview for a small group of trusted partners, reportedly around 20 organizations whose participation was approved by the US government, available first through the API and codeex before any broader roll out. OpenAI came out and said they took this step at the US government's request. The company previewed the models and their capabilities to the administration ahead of launch and agreed to a phased release while it works with the White House on a cyber executive order framework and a repeatable process for future model releases. Now, this all traces back to what we've been covering all through the month of June. Back on June 2nd, President Trump signed an executive order that asks AI companies to voluntarily give the government up to 30 days of pre-release access to models with advanced cyber capabilities.
6:31 · Reporting indicates Commerce Secretary Howard Lutnik personally called Sam Alman to warn against releasing GPT 5.6 publicly without government sign off first. This of course follows the ongoing saga where the government levied export controls that forced Anthropic to take its Fable 5 and Mythos 5 models offline after a reported jailbreak. That was the first time Washington had kind of led to a commercial AI model being down and GPT 5.6 is another milestone.
7:01 · The first time an American company has launched a frontier model under a government managed access list. Now OpenAI pushed back a little bit even as it complied. It wrote that it does not believe this kind of government process should become the long-term default because it keeps the best tools from the users, developers, enterprises, cyber defenders and partners who need them. So Paul, this is very big deal, probably not in a good way. Like have we crossed the Rubicon, so to speak here? Like is the US government now just forever going to decide which AI models get released publicly? until until there's a more formal structure. It sure seems that way, which is what we assumed is what we talked about last week. The couple of notes on this one from a tech perspective, um Sam did tweet in response when he I I'll kind of go through what he shared when they launched it, but he also answered a few questions on X and someone asked about the naming convention and he said star, planet, moon as model sizes. So that that's where the names come from. Um, Meter, the organization that does eval models, did publish something. They did get early access to the model, but due to the government restrictions, they're they had to actually run past OpenAI what they were disclosing about the eval. So, I'll just read two quick sections of what they wrote. They said, "We initiated an evaluation of GPT 5.6 Soul on our time horizon suite of software tests." So as a reminder, meter looks at how long it would take a human expert to do something. They specifically look at like coding related tasks. Um and then how reliably a model can do that thing. And the idea is that over time um the time horizon has been doubling every six or seven months. Now anthropics internal data says four months but um in essence these things are doubling in their time horizon capability every roughly six months. So they initiated their testing on their suite of software tasks. However, the resulting measurement depends heavily on our detection and treatment of cheating attempts by the model [snorts] and GPT 5.6 souls detected cheating rate was higher than any public model we have evaluated. For our task suite, we define cheating as behavior where the model improves evaluation performance by exploiting bugs in the evaluation environment or by adopting strategies disallowed by the task rather than solving the task with the expected evaluation constraints with the data they collected. If we follow our standard methodology of marking cheating attempts as failures, they still arrive at a time horizon of 11.3 hours for their 50% reliability. Meaning something that would take a human coder 11 hours to do, this model can do very very fast with a 50% plus reliability. If we count the cheating attempts as legitimate successes, the point estimate jumps beyond 270 hours, well beyond the range where we consider our test suite to give reliable measurements. In other words, it's broken their way of assessing this, that it's so good at cheating, and they assume that in the future it'll become so good at masking its cheating that they won't even know it's cheating and thereby their efforts to measure this become irrelevant. I don't know what plan B is for them, but that's basically what they're saying. Okay, so then on to the model. So Sam uh tweeted on June I'm just going to like accept that I'm going to call it tweeting perpetually. Okay, [laughter] I get that it's like X posting or whatever, but it's tweeting. Okay, so good news first.
10:49 · Soul is a smart, efficient, and significant step forward. It is the same price as 5.5.
10:55 · Also launching in the 5.6 Six family is terra with five point level performance at half the price. Bad news at the request of the US government. It is launching today in limited preview instead of the open access launch we were planning on. We are working with the government to get to general availability as fast as we can. I think it is quite reasonable to roll out models especially as they reach significant new levels of capability in this way. Meaning every model moving forward is going to be released in this way. It fits with our long-held strategy of iterative deployment, but this isn't quite the process that we think is optimal. Now, um, with the government, we work with the government to attempt to get a transparent, reliable process for early access and to ensure that as long as our safeguards work as intended, we can release widely. He's referring there to guard rails, which we talked about in episode 221 in relation to Fable 5 and the guardrails anthropic put in place. Meaning these models aren't safe on their own, but they put guard rails in place to try and make them safe is what he's admitting to there. We want to be reliable, dependable partner that works with all stakeholders and we also want to live by our mission of benefiting all of humanity. I believe the government shares most of our goals and that they are overall doing a good job in a very difficult situation. We will work as quickly as we can to get this model in your hands and we hope you will love it. Um he then did go on to say in response to a question on that post, I think a required preview period for extended red teaming is not a bad idea. Meaning giving experts more time to find vulnerabilities and risks within the model. That's what red teaming roughly means. I just don't like the idea of the government picking the customers. Confident we will get to a better place. Okay. So then I just wanted to try and um give a synopsis here, Mike, of like high level what this means. And I I I probably said some similar things on episode 221. I didn't go back and look at my show notes and like what I said there, but try and like vary it a little bit because things have evolved in the last 7 days. So where this basically means is labs and select organizations including the government itself get [snorts] exclusive access to the most powerful models. So the government's using this model now for internal purposes, for NSA purposes, whatever, but they're using it and the labs themselves are also having access to this to do whatever they want with it for their business purposes. So we now have a situation where a very very small group of businesses and individuals who we all have to trust are going to do good with these models have access to things that the rest of society does not have. We've [clears throat] talked about the idea of nationalization. The Trump administration has literally floated the idea of some element of nationalization where they take an equity stake in these labs.
13:41 · So what we could probably say with relative confidence is that soft nationalization has begun. So this means not taking ownership and control yet that these AI labs are still private companies but the US government is increasingly treating their most capable models as strategic national security assets not traditional like commercial software. So for the most powerful system the government wants early visibility and then some influence over who gets access when and in what under what conditions. So in practical terms this could mean several things moving forward. So pre-release review becomes normalized. This is what we're seeing right now where the government's going to look at these things before they go live and they're going to make some arbitrary decision until there's something more formal in place. The access [clears throat] becomes tiered.
14:29 · So the first users are likely going to be US-based companies, government agencies, defense contractors, andor trusted partners, while the broader public or international access comes later or never depending on the capabilities of the model. The model capability becomes export controlled in spirit even if it's not always by formal export law. So the key asset is no longer just chips or data centers. It's the model weights, API access, cyber capabilities, agentic behavior. The government's trying to control all of that. The release decision becomes political. This is 100% happening. A lab may believe a model is safe enough to ship. So Sam's case, hey, safeguards are in place. We felt good with them. We were ready to go. But the government may still step in and ask for a delay, narrow the access or added safeguards.
15:16 · That's exactly what happened with Fable 5. They wanted uh the government wanted Anthropic to guarantee that their guard rails were going to work, which everybody including probably the government knows is impossible, but whatever. And so what happens is the strategic tension becomes too obvious.
15:30 · Like you move too slow and the US risks weakening its commercial AI lead. move too open and the US may proliferate dangerous capabilities to adversaries or criminals. So I found myself Friday night, Mike, because I guess this is my life nowadays as I like I think about videos I've watched or research I've done and I go back and like do this on a Friday night. So, I'm sitting there Friday night and I'm like, I want to go rewatch the Bill Gurly talk from All-InSummit 2023 called 200 2851 miles.
16:01 · 2,851 miles, which if you ever watched it, we talked about it at the time that Bill Gurley did this talk. Um, that is the distance from Silicon Valley to DC.
16:11 · This entire talk from Bill Gurley, um, a BG2 podcast, by the way, amazing, uh, is about regulatory capture. So, we've talked a lot about regulatory capture on this show, not as much in 2026, definitely a topic in 2025. Regulatory capture is when a government agency or regulator that is supposed to oversee an industry becomes overly influenced by the company's, groups, or interests it regulates. So, instead of acting in public interest, the regulator may start shaping rules, enforcement, or policy in ways that benefit the regulated industry. So in the case of Anthropic for example, who's been getting blamed all of the sudden for like the government stepping in and slowing models down. So there's all these like thought leaders who are saying, oh, this is what Dario wanted, whatever. Um, so Bill Gurley, I was like, I wonder what Bill's saying about what's going on right now. And so he actually um uh went and said, this is what's causing Anthropic to aggressively beg for government protection. Customers are finding cheaper alternatives. keeping employees requires continuing ultra- richch secondaries that are dependent on revenue growth. When you can't win on the field, you go to DC. So, he started talking about like the efforts of anthropic to have regulatory capture, but it's not just anthropic like um Sam Alman was on the hill last year calling for regulation. So, like everybody's been calling for regulation and I don't know that it's traditional regulatory capture attempts. So, I don't know like there's going to be all these talk about regulatory capture. By the way, great 30 minute talk from Bill. I would suggest people go watch it if you want to understand regulatory capture historical context. This presents enormous opportunities for China. We'll kind of come back around to the the China thing in a minute.
17:51 · Potential for corruption. I [laughter] I wasn't really sure honestly how to deal with this one. So, I'm just going to try and be as objective as humanly possible here about what is likely happening or going to happen behind the scenes. So, by slowing these models down and having a very select group of people within the government who make these decisions, the uh potential let's say for insider trading and self-deing becomes massive.
18:24 · So self-deing is when in the government occurs when an official uses their public position and authority to advance their personal financial or business interests rather than the public good. It is a se a severe form of conflict of interest where an official holder's official actions allow them to deal with themselves privately.
18:42 · Not accusing anyone of anything. I'm just saying self-deing is a term people should be familiar with related to where we are heading with this. The other one, Mike, that is like the hot thing these days is like Poly Market or Kelsey.
18:55 · So, in March, um, White House staff were warned to not use insider information to place bets on prediction markets. It re referred to press reports that raised concerns over government officials using non-public information to place bets on platforms like Kelsey and Poly Market. Now, in this case, this was, I guess, a little more severe. The warning came in a staffwide email from White House management office on March 24th.
19:20 · According to people familiar with the matter, the day before Trump had announced the pause on the Iran war via TR uh Truth Social, about 15 minutes before that shift in policy, a mysterious flurry of activity kicked off in the futures markets. What has been happening in these prediction markets is that insiders within the White House have information about when models are going to be approved for release, when changes in in public policy are going to happen, when wars are going to start or end, what and it's it's eerie how much movement there is in these markets in like the minutes or hours before something is announced. So now you can get people that are basically making themselves wealthy by having inside information on when models are going to get released. It's it's wild. Um, this could affect IPO timing for sure. So, uh, June 25th, New York Times has an article that says OpenAI is now leaning towards holding off its IPO until next year in part because of uncertainty around where these models go. And then I mentioned this a minute ago, but this idea of blaming Dario Amade and Anthropic is absurd. Now, apparently the government has made uh progress with anthropic. the the headline everybody ran with was someone in in the White House called it Dario Amade a weirdo and that they're making more progress now that the co-founder of OpenAI is is leading or anthropic is leading negotiations with the government cuz Dario's out of the room. So it's like this he's like the punching bag because he's been the most vocal with his concerns about these models. Now Daario I think actually believe this. I I don't truly believe it's a regulatory capture attempt. Yeah, he hates open source.
20:56 · fears open source but because of the implications of what happens what it goes. So I don't think he um he got what he wanted per se. I think he got a government that has a lack of situational awareness that didn't understand scaling laws and what most people in the labs knew was coming and told them they was that that was coming that we were going to have these cyber capabilities and all we got was arbitrary and opaque rules on AI safety from an administration that's now controlling access to intelligence. So, as Sam said, they get to pick the winners and they have proven time and time again that they have no aversion to insider trading and self-deing. So, now we have to trust an administration.
21:35 · Again, I don't care who you vote for, like what your political leanings are.
21:38 · The reality is whoever is in office, you would probably be dealing with very similar things. There's always self-deing and insider trade. That is just like a reality of government. So now businesses, American citizens, and our international allies are relying on the whim of government with no clearly defined rules to determine if and when we get access to the frontier models. So they have still not actually told anybody what Anthropic needs to do to get Fable 5 released. Like what are these magical guard rails they're all of a sudden going to create that prevent people from misusing the models? It doesn't exist. So we're just holding these for what? And so you could absolutely look out ahead to a point where they say, you know what, these models are just too powerful for us to put out into the world. We, the government, are going to have uh un, you know, we're going to have access to the weights and we're going to be able to do whatever we want with these. And then you were only allowed to give these to these 100, 200 companies, whatever.
22:35 · Everybody else gets one, two generations ago. Now, if they do that, China can take the lead. And that's where this debate comes in and we'll talk more about China actually. Like I think you've joked with me, Mike. Welcome to like the politics episode. Like there's so many implications here.
22:52 · You know, one final thing here, Paul, and I want to get into some of the solutions proposed in our second topic is that as it stands right now, I can't keep help but keep coming back to, and I'm not the first one to have said this, like it's just this analogy of like Lord of the Rings, of like the ring of power.
23:07 · It's like the temptation is just too great for any government to not want to put it on and use it. Meaning taking more of a stake in labs, restricting the power the powerful models and their usage to itself or or to a core group of players. It just seems like a very slippery slope to be headed down.
23:29 · Yeah.
23:29 · And I get it. Like if the NSA gets access to Mythos 5 and they realize its hacking ability and its ability for cyber defense and they can now use that as a competitive advantage over other governments.
23:44 · Yep.
23:45 · Why would you if you're the government want other people to have that, right?
23:50 · And then that applies to all the other frontier intelligence capabilities that Yeah. And it's not a perfect analogy, but I mean it's literally like we go back to what the 1940s 1950s um 1940s most like Los Alamos and like let's say the atomic bomb is designed by a private company and the government doesn't take it seriously for a few years like a you're never going to be able to achieve that scientific breakthrough. And the labs are like no we're going to like in three years we're going to have a weapon with massive destruction capabilities. And the government's like, "No, you're not." And then all of a sudden, oh, you do have a weapon with that capability.
24:30 · We're going to let the private company control it.
24:33 · Now, it's not a perfect analogy because AI can do so much good. And it might not, in some people's belief, have the destructive capability, at least visually speaking, um, physically speaking, of an atomic weapon. Yeah, but the premise what these labs believe is like it can be as disruptive to the economy and society as a weapon. And so if you have now created something outside of the government that protects the stability of the democracy, how do you not take some level of control of it?
25:08 · Yeah.
25:09 · And so that's why it's such a messy debate. Well, I will say this is why I loved that we're focusing on this second topic here, which is this essay written by AI policy writer Dean Ball, who is also soon to be an OpenAI employee. And the reason I say that is because like he published this essay called what should be done, which responds directly to this GPT 5.6 situation we discovered. And for a reminder for people, we've talked about Dean a bunch on the podcast, rightly so. He's like a really trustworthy I think voice on AI regulation and policy, but he's a former White House AI policy staffer. He helped write the administration's AI action plan. He is going to join OpenAI actually quite soon. We covered that on last episode. He stressed, however, this piece is his own view and not the companies. And basically in it, he lays out all these possible paths and actions that might be wise to take or at least consider as we move forward into this kind of new normal. And so his central argument in here is that the executive order Trump signed earlier this month, which was build as voluntary testing for Frontier models, has really become a de facto involuntary licensing regime. He says recent events prove this point he made back when it was signed. So obviously first the administration caused anthropic fable to anthropic fable to be removed from public access.
Dean Ball Releases Essay on Critical Safety Actions
26:31 · Now GPT 5.6 is being limited to a small set of people at the government's request. And so Ball's biggest concern in this essay is that nobody, including, he says, the administration itself knows what a lab would actually have to do to get a model approved for a broad release. Paul, exactly to your point, um, until some safety standard exists, he says the government's answer to any broad release request would effectively be no. and he is skeptical this administration can produce a highquality standard soon, noting that almost no one now setting AI policy actually has frontier AI lab experience. Now he's actually careful to say the administration is not directionally wrong. The catastrophic risk concerns related to AI are real and he says he himself raised them while inside the White House. But he does warn here that the current approach is economically dangerous because labs recoup the cost of training frontier models in the first few months they're broadly available. So every week of delay that eats into that window really threatens their business model. The hundreds of billions of dollars if not trillions over the years going into data centers assume a roughly global market not access for a hundred government approved companies. Um, so if you restrict releases this severely, he argues and the government risks making the AI skeptics right that the US has overbuilt our AI infrastructure because the government is now rendering AI demand or could render AI demand essentially unlawful. So he gives some prescriptive steps here as well. He says we should stop trying to regulate individual models, instead regulate the labs themselves. He wants to federalize state laws that already require labs to publish their safety frameworks, then have independent, privately run verification organizations staffed with technical experts audit the labs against those frameworks and their internal governance with the government certifying the auditors rather than running the reviews itself. He actually points to a new bipartisan bill being proposed called the Great American AI Act that was just proposed by US Representatives Obernoli and Tran as the closest thing yet to that vision right now. So Paul, there's a lot to unpack here. It's a lengthy essay, well worth reading, I would argue. Um Dean is like one of the better commentators out there, I would say, on AI policy. What do you think of his recommendations here? I just think that it's a great article for people who want to understand the state of a very complicated pro um process and like where we are, where we're going. Now, if you listen to this podcast every week, you're going to have some level of familiarity with the majority of a lot of these ideas and things like that. But even for me reading through this, I thought it was just like a great synopsis of the challenge of trying to find a solution and then some actual viable ideas. And as we say in this podcast all the time, I just want people to bring ideas to the table, not just complain about things. And this is the same thing that goes with the economy.
29:33 · And you know, it's like when we talk about um you know, potential impact on jobs. So many of the eyabs just like say, "Yeah, it's going to affect jobs."
29:41 · but they have no plan about what does it do or how do we solve for it. So, um I I find you know I don't agree with everything Dean Ball says like there's quite a few things honestly that where I'm like yeah I don't think we'd be on the same page with that but I have tremendous respect for the fact that he's thinking deeply about it in a very logical way. Now um I understand why he's joining OpenAI and he actually explains it pretty clearly in a Cognitive Revolution podcast. So I I was just listening to this over the last week on June 20th an episode of the Cognitive Revolution with Nathan Leen's dropped that features like a two and a half hour interview with Ball and it was right after he announced that he was going to be joining OpenAI. So I'd highly recommend it. The basic premise is it's kind of like why Andre Karpathy went to Anthropic. You have these like amazing independent thinkers on the outside and until they get on the inside they don't really know exactly what's happening. And so if you're not working in one of like the three major labs, there's just only so much information you can gather without knowing the inside information. And so his feeling was he could open claims he's going to maintain his independence and be able to publish stuff like this even once he joins. Um again, I whether that's the reality or not, I don't know. But I just want to call out a few. Again, there's 35 here and I would read all of them, but I want to go through a few excerpts from this and then maybe Mike, there'd be a couple here we could add some commentary to. So, he starts off one.
31:11 · When President Trump signed it earlier this month, I argued that the executive order on cyber and AI which claimed to establish a voluntary testing program from for Frontier AI models was really establishing a deacto involuntary licensing pre-approval regime. That's what you were talking about. This analysis has proven correct. First, the administration revoked public access to Fable Anthropic's latest frontier model because of security fears. Now, it appears opening eyes GPT 5.6 is being limited um at the request of the government. Um so that was his first point. Um number four, I'll just read where these are in the in the 35. When I say some when I say nobody, I mean it literally. The administration itself does not seem to know what safety standards or best practices a company would have to observe for them to be comfortable with a broad release of a model that matches or exceeds mythos and capability. And again, in theory, every frontier model moving forward will meet or exceed mythos capabilities, including ones from Chinese labs within 3 to 6 months. Um, number five, nobody I know in the Trump administration has any Frontier AI experience. Just a few months ago, someone with experience at both OpenAI and Anthropic was hired to run the Center for AI Standards and Innovation, but he was fired by senior administration officials within a few days. The rest of the staff at that center have reportedly been on a stop work order, not even allowed to communicate with other government agencies for much of the post mythos crisis period. The lack of technically expert staff is one of the main reasons to doubt the near-term ability of this administration to produce a highquality safety standard anytime soon. Number six, and by the way, related to that one, Mike, who that is considered a top tier AI researcher, and let's just generously say there's 500 of them in the United States. I mean, there's thousands of them, but like top tier, like tier one, maybe 500, maybe 100. I don't I don't know. you're going to go work for the government instead of like in one of the labs with a cap a capped government salary that literally cannot go up over a certain threshold and miss out on the IPO of a lineup and all the AGI agent.
33:20 · Hell no. No one is going there. So what do you have to do? The government has to borrow the talent. like this is they have to get in bed with each other because there's no other way to have the testing regime that they're going to need because no one capable of doing the testing is going to go work for the government. Um, okay. The number six, you kind of addressed this one. Despite these criticisms, the Trump administration is not at all directionally wrong that something must be done about the catastrophic risk potential. Nothing about the fundamental security and safety concerns is illegitimate. Meaning Daario's right to be, you know, preaching that we need to do something. Number 10. Given the current pace of things, a safety standard produced by the administration tomorrow may well become irrelevant by September. So two years ago, the way we were going to do all this, the governing of this and even the state laws was going to be based on flops. It was going be like ba in essence based on like the size of the compute needed to train these models. Well, they didn't think ahead to the fact that they were going to get more efficient algorithms that could achieve the same level of outcome with dramatically less compute in the training process. So like whatever the government thinks they're going to put in place, however the parameters would work, they're going to be obsolete in a year and so you can't do it based on that. Number 11. Uh this is a bad state of affairs. Consider in particular some industry dynamics. The ongoing AI infrastructure buildout, the one that is according to former US AIAR David Saxs essential to the US economy assumes a functionally global total addressable market for US AI services. No one is building a hundred billion dollar data centers to serve frontier models to whatever 100 companies the US government will allow access. That's the economics of this. What they are doing right now will break the AI industry within 3 to 6 months. you you cannot spend the billions they're spending to train these models and then deliver them to a 100 people unless the government plans on subsidizing this and writing multi-billion dollar hundred billion dollar checks to the AI labs to make up for the lost opportunity and I don't see that happening but that this is like I I actually see this is a binary thing you either allow them to sell these models and make money or you don't and if you don't the industry crumbles like that just seems seems obvious to me. Um, another a market panic about this is not implausible. Though markets tend to downplay this administration's abrupt shifts in executive action, that is that is an understatement. I if such a panic occurred, it is worth noting that the effects could be extended far beyond AI.
35:55 · A large fraction of US re-industrialization and heavy industry efforts from nuclear energy to natural gas to power electronics to batteries is predicated implicitly and explicitly on expected future demand from the AI industry. If suddenly nobody believes that demand can materialize, the effects could cascade well past the frontier labs. This is 100% true, too. If sometime in the next 30 days we realize the government isn't going to get its act together and they're going to just have this totally opaque process for approving these models and thereby the the companies can no longer do the capex spending that they were going to do over the next 3 years. The US economy will crater. I mean 15 20% in a day. We've seen it with the SAS industry already this year. play that out over all the other industries that are now being boosted up by the future spend probability of the AI industry. The US economy is powered by the industry whether we know it or not yet. And so like that is a very important point he's making. A few more um is number 14.
37:01 · Indeed my view bad AI futures are much likelier if only an extremely narrow subset of actors have access to frontier AI. This is because that narrow subset of actors is likely to be composed of groups that already have significant economic and political power including of course the federal government itself.
37:17 · This goes back to what I was saying earlier. If you concentrate power the most powerful models in the world into a select group of 10 20 50 100 companies those are already the most powerful people and companies in the world. What happens when you give powerful people more power? Not great things for democracies. So then in the what it should be done I'll just highlight a few of these. So number 18 the best starting point is the affforementioned and wish for safety standards is the frontier labs safety and security framework. So they've all published them deep mind anthropic open. You start there go read those. Those are like good baselines.
37:54 · 22. It would be good for someone to thoroughly audit the frontier labs at least to test their adherence to their own safety plans as well as to probe their broader internal governance of the AI recursive self-improvement loops. Good luck getting access to the that information [laughter] unless it's mandated by the [clears throat] government.
38:11 · Right.
38:11 · 24. Um it isn't to say there's no government role. Perhaps if you persuaded me that private bodies have merit, you could imagine the government certifying or licensing the auditors just as auditors in accounting are licensed. A federal agency or a council of appointed officials could review applications from auditors and approve the ones they deem trustworthy. That way there are also to be more than one such body. It's easy to manage for example a body devoted to cyber risks another devoted to biological risks and etc. So again the premise there is the government doesn't control it. It's actually audited by independence but the government approves who is certified to do independent audits. Again who are you going to get to do these audits that's going to like audit labs instead of working in them? Um couple other quick ones. Number 25. Governments could require that the largest AI labs receive certifications from these bodies in order to participate in the AI field.
39:04 · 31. This is an important one. Um, an attentive reader will note that my proposal deemphasizes regulating publicly released models and focuses instead on regulating the frontier lab as an entity. The reason for this, a model is a hard thing to regulate. We talked about that. The rapid pace of algorithmic efficiency means that it's difficult to design regulatory thresholds based on model characteristics. So again, 6 12 18 months from now, we have no idea what they're going to be capable of. How do you set these parameters arbitrarily?
39:31 · today.
39:32 · Internal deployments matter a great deal for some threat models alongside publicly released models. Um, that's an important one because they're only regulating what's available to the public. These labs could build models that are at AGI or super intelligence, whatever, and we'd never know. And now you have three companies in the world with access to super intelligence and other governments who hack in to steal the weights. And now we literally have governments and like three people, three labs that are the only ones with this power. So I don't know how you do it without internal internal regulation. In the future, particularly as there are breakthroughs in domains such as continual learning, it's possible that model weights will change weekly, daily, hourly, or even in real time. So again, how do you regulate a model that's just going to like rewrite itself and improve itself? Um, and then 34, this is the last one that I'll say there is momentum within the eye safety community for an idea. There is momentum within the labs, there's momentum with Congress, and within the administration, there's at least a recognition that something must be done. So, as complex as this is, everyone seems to be arriving at the fact that like we got to do something outside of a few people in Silicon Valley who just say, let's just unleash hell to the world and deal with it later. Um, but other than that, like it's just wild. And so, again, I I think this is a topic that is universally important to everyone. I get that not everyone's going to have the same level of interest in this topic. It's kind of overwhelming to be honest with you, but it is it trickles into every aspect of society, business, the economy, your jobs. Like all of this is going to be decided in the like this administration for better or for worse will in the next three years govern the regulation of AGI and beyond. like y unless they shut shut it down completely, which isn't going to happen, we will get to whatever you want to call AGI, whatever the define it is, and we will be beyond that in this administration's time period. So, this is it. What what we've got is is going to make these decisions. And keep that in mind when you go and vote this fall.
41:35 · like whoever whatever side you think is more important to the outcome you desire as we'll touch on in a topic later in this episode who you put in office in the midterms in the United States is going to have a massive effect on what happens with AI regulation and policy you know I know we've been talking about these trends for years now but it feels at least a little like to me like I I wake up and I'm like oh my god we're here already to some of the things that have been predicted or outlined And especially you mentioned the term situational awareness coming from Leopold Dashen Brener's essay which I would highly recommend people go revisit because we are pretty well on his timeline of things he predicted coming to pass. So it's very surreal to be living through it and knowing how quickly it's moving. You know, and the other thing I thought about real quick, Mike, this morning, and I guess we'll probably transition in this in the next topic, is um I'm starting to feel like what what is happening right now with AI safety and regulation is what's going to end up happening with jobs in the economy that like we've been talking about this for 2 years. We other people have been talking about like, hey, this is going to come to a point where the government's going to have to do something and they won't have done anything.
42:46 · And I feel like this all came to a head in like the last 3 weeks. like the government stepping in with Fable 5 was it like that was the inflection point for regulation and I'm starting to feel like this is what it's going to feel like with jobs like it's going to be talked about we'll have been saying for 2 years hey like we should have plans in case it doesn't go well and then you're going to get hit with like a jobs report or a massive layoff from a major company that very explicitly has to do with AI you know and all of the sudden it's going to be like oh wait that was real like, "Oh we didn't do anything for the last 2 years and now we're stuck with the administration trying to figure out what to do with jobs in the economy." I think this is what it's going to feel like.
43:29 · It's just going to boom and all of a sudden it just goes.
43:33 · Yeah.
43:33 · [laughter] All right. So, our third big topic this week is OpenAI has released new research co-authored with economists from Columbia, Wharton, and Duke that uses real usage data from its Codeex agent to show how Agentic AI is changing the way people actually work. Unlike a chatbot, Codeex takes actions on your behalf. And this paper tracks three of the groups using it, individual users, organizations, and OpenAI's own employees. And the top finding from this is that adoption of agentic systems like codeex is growing fast and spreading well beyond engineers. So for instance, the number of active codec users grew more than fivefold in the first half of 2026 with the most rapid growth coming from outside the original audience of software developers and what people use it for is also shifting as well. So rather than asking AI for advice, users are delegating real work and that work is getting more complex. Since the start of the year, the share of individual users who gave codeex at least one task that would take an experienced human more than 8 hours. The number of people doing that, the share grew nearly tfold and people are more often running agents in parallel. More than 10% of users manage three or more codeex agents at the same time in a given week. about a quarter use saved skills which are reusable instructions for complex workflows. And inside OpenAI where usage is not really capped at all, the paper shows a glimpse of where this is all headed. So Codeex now drives more than 99% of OpenAI's employee work output and has largely replaced Chad GPT. This past month, the median employee in a legal role, for instance, produced 13 times more across the company's AI tools than they did in November. The median researcher produced more than 50 times as much. So, Paul, on the surface, this research is about how codeex usage is rapidly expanding in OpenAI outside of it. But really, we're talking about this kind of sea change in how people work.
How Agents Are Transforming Work: OpenAI's New Research
45:36 · So there's a real fundamental difference between using AI through a chatbot like many of us still do and having agents do long horizon tasks autonomously. Curious how you're looking at this change.
45:49 · So this this research to me is invaluable whether it's coming from here. Anthropic has put out, you know, some great research just on their own internal usage because all of us are trying to figure out what the future of work looks like and how it's going to reshape our organizations, our tech stack, our org charts. Um, and the best chance to see around the corner is to look inside of the labs who are actually using all this technology and have this, you know, in this case, infinite token use basically, right?
46:16 · So, um, it becomes even more essential as government restricts access to more powerful models. So when we don't have the ability to even experiment in these ways um this insight from labs is very helpful to start trying to piece it together. So um [clears throat] a few excerpts from how agents are transforming work which was the open AI post that they published in relation to the research. So he said, "Agentic AI changes the unit of knowledge work from single interactions to delegated long horizon tasks, which again, this goes back to like that meter research that we were talking about reliability over long horizons." They said, "Chatbot interactions are often short and self-contained. Agents can operate independently for minutes or hours while orchestrating tool calls, interacting with environments, and iterating towards solutions. As a result, agents are quickly becoming the most powerful AI tool for work. As you mentioned, Mike, every department, including nontechnical departments such as legal and recruiting, use codecs within OpenAI as their primary AI tool for work. The pattern reflects what they believe will be the future of work given the expanded capabilities and accessibility of agentic tools. So, one doesn't take someone with technical knowledge to use these things. They're saying like it transforms the way people work. People use codeex for long horizon work. By May 2026, 81% of sampled individual users made at least one codeex request.
47:35 · estimated to exceed 30 minutes of human work. 70% made one estimated to exceed one hour and 26% made at least one codeex request estimated to exceed 8 hours. So one out of every four did a request that did one day's worth of human work in a basically in a prompt is kind of a way to think about this.
47:58 · Um codeex became the primary AI tool for every department. Engineering moved first. legal finance recruiting crossed into Codex being their primary tool around April 2026.
48:09 · For the average OpenAI worker, Codeex usage now accounts for more than 85% of output tokens. Since Codeex users tend to use more tokens than non-users, its share of overall, as you alluded to, is 99.8% of weekly output tokens. Nearly a quarter of all Codeex requests are for tasks that would take a person more than one hour to complete. again like [laughter] think it just you can just like I'll just read it again. Nearly a quarter of all codeex requests are for tasks that would take a person more than 1 hour to complete. So when we talk about the future of work and you break down what you do in a given day, the average person, let's just say, works eight hours. Imagine instead you have like eight prompts and like each of those prompts does something that generates what would have taken an hour of work, right? And then you then you scale and that's just one like they're going to be able to do multiple hours, multiple days. Now it's going to take a bunch of tokens, but we're entering a phase where Agenta capabilities are going to be made accessible that can do hours or days worth of work with simple prompt strings that could be completed in minutes. And that's just fundamentally a really difficult thing to comprehend. So they go on to say over the last 6 months codeex usage has deepened and intensified. Um you mentioned the research saw the biggest jump. Um customer support rose 32 times engineering 27 times legal 13 times across the country. You u across the company users are switching from chatbots to agents as their primary form of AI direction and are deploying an exponentially growing amount of agentic labor. It's just like this is um I would say the second half of the year, Mike, is it fair to say AI agents and aentic work is like the core focus of our of SmartX research team?
49:55 · I would [laughter] I think that's a safe assumption. Yes.
49:58 · Yes.
49:58 · So, we will be doing a lot more.
50:01 · This is actually going to be a focus of our AI council in the second half of the year as well. Um and I think rightfully so. It's just it it is the most um consequential thing that's going to happen. Well, that's hard to say. One of the most consequential things that will happen in business is the autonomy and re reliability of agents and then finding out how to get affordable access to do what they make possible efficiently. That's kind of another challenge. But again, you have to look inside the labs, see how they're working, and that is how we then figure out what does the future of work look like.
50:39 · Absolutely.
50:39 · And I just want to emphasize and make really clear for people because the labs don't always do this a great job of this. And I think it leads to confusion. It's like we're talking about codecs. We talk about things like claude code. We are not talking about uh legal or customer support using codecs to go build software though they do that. You can literally use these tools as generalpurpose agentic harnesses around the frontier models. Meaning I use codecs and quad code daily not to go write software to do the knowledge work that we are all used to doing. So if you haven't kind of connected those dots yet the easiest way to start learning is go use these tools. They are not just for developers. People are using them in really innovative ways in knowledge work and that's kind of what you're seeing here.
51:30 · Okay. So before we move on to rapidfire, this week's episode is also brought to you by the AI for business boot camp by Smarter X coming to Columbus, Ohio very soon here July 16th. This is a single day at the Hilton Columbus at Eastston.
51:44 · It is built for professionals and leaders who are ready to accelerate AI adoption and value creation. So this is a day with Paul and myself that starts off with Paul's state of AI for business keynote. We then transition into two highly interactive workshops. I will be teaching an AI productivity workshop in the morning. Paul will be teaching an AI innovation workshop that afternoon. This event is built for AI forward managers, directors, and executives across every department who are ready to move past the theory and get into building real AI powered workflows, getting strategic frameworks to accelerate transformation, and ready to leave with an immediately actionable plan for themselves and their teams. If you're an AI Academy Mastery member, you get discounted pricing to this event. Discounts are also available for teams of two or more. Groups of 10 can or more can get custom pricing as well. Also, if you're a podcast listener, use the code pod 100 to get $100 off your ticket. Most importantly, you are listening to this on Tuesday, June 30th. Today is the last day to get the early bird pricing for this event.
52:48 · So, this is the last day to get the lowest price possible. To grab your spot, just go to smarterx.ai, AI. Click on events. You will see AI for business boot camp as an option right there.
53:00 · All right, Paul. So, more political intrigue, so to speak. So, this first rapid fire topic connects directly to our first couple main topics. So, if the US government is now potentially going to stagger the release of American frontier models, there's this big second order question some AI watchers are asking, which is what happens to Chinese models? And this past week, that debate caught fire. kicked off in part by a post by widely followed AI commentator Andrew Curran. So Curran argues that staggering American model releases almost guarantees that Chinese models get restricted or even banned in the west. So his logic is the following.
Will the US Government Restrict Chinese Models?
53:40 · Chinese models are technologically roughly 9 months behind the US frontier today. If you force every American release into a slow roll out and China starts closing the gap over time, this undercuts the business model of Western labs because the public would eventually get good or better open-source Chinese alternatives. Currin, for one, argues the US government will not let that happen. either China somehow agrees to the same staggered release schedule, which he thinks is unlikely, or the US restricts or bans Chinese models, and he puts that window at 9 months at most, probably less. So, it does seem like Chinese open models are genuinely closing in right now on US ones.
54:22 · Estimates of this gap, he puts it at nine months. Some others have estimated six to nine months. Stanford's AI index says Chinese models have largely erased the US quality lead over the past year.
54:33 · Alibaba's Quen has already passed Meta's Llama to become the most downloaded open model in the world. So Currin adds one more prediction. He actually also thinks because of this environment that he raises the odds that Nvidia eventually gets restricted to selling only within US borders as well. on the logic that it's much harder for chips to end up in China if they are treated as a national resource. So Paul, some pretty bold predictions, but obviously it seems like we're quickly in a new normal here. Like what do you think the odds are that the US cracks down on Chinese models?
55:06 · Yeah, I said it last week. I think it was going to happen. Now yeah, what it looks like I was actually doing a little like um I guess working with GPT 5.5 thinking model kind of expanding on like what are the possible ways that this could happen? Like what could this look like? So I'll get to those in a second. Just a few names because we don't spend a ton of time on the Chinese model. So I just wanted to surface like some of the names to recap. You've heard us say or like maybe Michael mentioned them in the product and funding news. So we have DeepSeek which uses this the company and the model. We have Alibaba Quen is their model which we'll talk about in a second. We have Bite Dance Dubau I think is how you say that one.
55:43 · Uh Zippu which has GLM which just recently became a big thing like in the last two weeks. That one's got talked about a lot. Moonshot has Kimmy and then Hawaii has Pangu. So those are like the the most strategically relevant labs and models that are coming out of China. Now Alibaba was big this week because Anthropics sent a letter to US officials accusing Alibaba of brazingly and illicitly attempting to extract its AI models. So Alibaba is a massive Chinese multinational technology and e-commerce conglomerate kind of like China's Amazon. They have an AI research division that develops Quen, a flagship suite of highly powerful foundation models. So the letter which CNBC um obtained claims that Alibaba was distilling anthropics models through tens of millions of exchanges. So in essence, they have all of these like fake operators and organizations and accounts that are then going through and and using the model and learning how it behaves and learning from there and then distilling that down to build their models faster. So in my chat GPT kind of back and forth what you know where are we with um US versus China what are the implications what could the US government do? So I this whole string of questions that I was going through and kind of pushing it on. So I'll just give you a quick synopsis. So um AI models are obviously becoming general purpose cognitive infrastructure. So if a US company builds customer service, coding, analytics, marketing, legal review um and decision support on a Chinese model.
57:10 · So let's say that you can't get the most powerful models and then you're going to turn to China and a US enterprise is going to start using these Chinese models. The concern is not just where is the data going. So now they can see what you're prompting things like that but who controls the intelligence later inside that enterprise. So a model access regime that blends export controls import controls procurement rules cyber security view and industrial policy is sort of what is forming within the United States. So password could be literal federal procurement ban. So the easiest move is to say federal agencies and contractors can't use Chinese origin foundation models, APIs, model weights, inference services, embedded AI tools.
57:49 · So that would mirror the way the government procurement has been used against Hawaii uh and Tik Tok on federal devices. You could do critical infrastructure restrictions. So the government could prevent Chinese models from being used in defense, energy, financial services, telecom, healthcare, etc. Um you could do enterprise compliance pressure. So instead of banning Chinese models outright, regulators could require companies to disclose whether they use models from China or other countries of concern.
58:16 · That could stop adoption or dramatically slow it down. So boards, insurers, auditors, enterprise customers may decide the risk isn't worth it to work with companies that are using those models. You could do claw uh cloud and API controls. Um or you could do sanctions. So there's a huge risk though in all of this. These restrictions can accelerate Chinese open model ecosystem to other countries who no longer can depend on the US to get access to the most important models. So it creates this paradox where if the US blocks access to Chinese models, it may protect sensitive environments. But the US also restricts access to its own best models pushing developers, startups, global markets toward cheaper, more open, less controllable alternatives. So in essence, you have the Chinese government trying to tank anthropic and open AI and innovation within the US by just flooding the market with open source models and subsidizing them. So the US policy challenges become protect national security without accidentally making Chinese open models the default global substrate and the bigger implication is model choice may stop being purely technical and economic decision and it becomes a geopolitical decision and we are racing toward that right now.
59:19 · Yeah.
59:19 · Yeah. Yeah. the US verse China and then US verse USI labs [laughter] right is unfortunately what the two-pronged front they're fighting on it's a comp and again you can empathize with how complex this is um and there isn't necessarily like very clear right and wrong answers that even are political in nature that it's like oh one side will do this one side will do that like no neither side knows what to do right now um and unfortunately that leaves us where we're just arbitrarily having models held up for whatever reason by the government.
The Growing Data Center Backlash
59:54 · All right, so another big political topic. This past week, The Economist made the AI data center backlash their cover story. They had a bunch of reporting that opposition to data centers is spreading fast and turning even way more political. So some background here. According to their estimates, the biggest AI and tech companies are pouring as much as $750 billion into AI data centers. This is part of an estimated 3 trillion globally estimated to be spent between now and 2030. This is enough to roughly quintuple America's AI computing capacity by the end of the decade. And as the economist puts it, people across the political spectrum are absolutely furious about this. So, by the economist count, protests have now stopped close to $100 billion worth of data center projects, including at least 20 worth $42 billion canled in just the first 3 months of this year. Bills [snorts] to freeze new construction have been introduced in at least 10 states. And polling shows Americans would rather live next to a nuclear reactor than a data center. And the reporting kind of makes it clear this is about the usual suspects, but about a little more than them, too. So, it's not just the noise, the water consumption, the perceived power consumption. This has become a proxy for a deeper anxiety about AI itself. And 3/4 of Americans now say AI should be more regulated. Republicans nearly as much as Democrats. 45% said they are at least somewhat worried AI could cause human extinction. Uh, wanted to give a quick shout out to Ohio, our home base. This is a telling example. We are now the fourth largest data center hub in the country. Yet 3/4 of Ohio Democrats and twothirds of Ohio Republicans oppose local data center development. Now on top of all this, and we'll talk about this piece, Mark Cuban jumped into all this with a widely shared post arguing that AI companies have already lost this PR battle. He basically says that this fight functionally now has nothing to do with data centers and has become a standin for just anger at AI and at the wealth it is concentrating in a few hands. And his advice to the labs is stop trying to explain the benefits. Show up in person in affected towns with real money for community programs and job support. Go to create go to artists and creative unions who he says are terrified. Ask what would actually help. He's basically warning that they do not win over workers and communities. They will never get these projects that they need to make all of this work. So Paul, like based on the numbers, I just don't know if I've seen an issue more capable of uniting Americans in recent times than hatred for data centers.
1:02:31 · Yeah.
1:02:31 · And we're seeing this in our own community. So Cleveland is our hometown.
1:02:35 · Um just last week, Cleveland City Council is advancing a plan to press pause on data center projects, responding to wave of high-powered computing hubs sweeping across the state. This is from News Channel 5 in our hometown. After hours of discussion and debate, Thursday council's utilities committee voted to move forward a three-month moratorum on permitting and reviews. The freeze will apply only to standalone data center buildings, both groundup construction and expansions, not to server farms integrated into multi-tenant properties. Uh legislation is headed for a full council vote on July 15th. So, I mean like we're again literally in our backyard, we are seeing the fight against this. Now, Cuban's post, I I think it's a good one to go read. I'll just highlight a couple of quick excerpts here. Uh, it's time for everyone to realize the fight against data centers has nothing to do with the data centers, as you mentioned, they've become the proxy for hate towards AI and the concentration of accumulation of wealth it's creating until those running the big LLMs understand this and start a community tour. Not to explain the benefits, it's too late for that, but to help towns and cities that may be impacted by job losses. And I'm a believer there will be a net gains in a few years. He says this battle is only going to get more intense. And let me tell you, no matter how much money you pay to buy politicians and races, you will lose. Don't try to pay famous people to endorse what you are doing.
1:03:48 · That's dumb. The big LLMs have lost the PR battle. Why? Because they all suck at putting people first. They have a Silicon Valley attitude that makes them all think they are John G saving the world, which by the way, I had to look up because I never read Atlas Shrugged.
1:04:02 · He is the hero and central philosopher of Atlas Shrugged, who's an inventor and intellectual who organizes a secret strike of the world's leading creators and innovators to protest the society that exploits their talents. Um, back to Cuban. Given the number of data centers and power that is needed today and going forward, if you don't kiss the asses of the people that go to work every day and are just trying to pay their bills, you will fall far far short of the capacity you need to make your business work. We talked about this like a month or two ago. I was like the PR battle is, you know, the labs are screwed. Like they they don't know how to do this. They're they they're just like, hey, it's going to save the world and they have no like good tangible like what does that mean and how does it help me? who's struggling to pay my bills and living in a community with a data center that they're running off of like backup generators that are noisy and like that's that's the reality of these communities and these ultra wealthy people from Silicon Valley just pushing techno optimism is not going to win you votes for data centers that [laughter] you need.
1:05:03 · Right. Exactly. Yeah. It'll be interesting to see if people wake up to this fast enough.
1:05:08 · Yeah.
The $27M Proxy War Over Alex Bores
1:05:09 · All right. Another big political topic this week and then I swear we've got maybe only one other the rest of this episode. I don't know. Um but this past week, tons going on. The most expensive proxy fight yet over AI regulation played out inside a single congressional primary. This was in New York's 12th district. A Manhattan seat opened up when longtime Representative Jerry Nadler retired and AI money flooded an eight candidate Democratic field. At the center was someone we've talked about before, Alex Boris, who is a New York state assembly member and the kind of co-author of the raise act, which is a state law requiring frontier AI companies to publish safety protocols.
1:05:49 · Boris pretty early on made AI safety the core of his campaign which made him a target because leading the future a pro-industry super PAC we've talked about before backed by Andre Horowitz open AI president Greg Brockman Palunteer co-founder Joe Londale and the AI company Perplexity spent more than $8 million to defeat him on the other side AI safety group spent close to $20 million to support him with the largest share coming from an anthropic linked pack. All in all, more than $27 million in AI industry money poured into this one house race. On top of all this, Boris actually lost. His fellow assembly member Micah Lasher won the nomination about 35% of the vote to Boris's 35%.
1:06:33 · Interestingly, Lasher wasn't exactly soft on AI either. He literally co-sponsored the raise act. Um, he has backed a national moratorium on new data center constructions until there's guard rails on costs in the environment. He's called for antitrust investigations into partnerships between Google, Microsoft Open AAI, and Anthropic. So, Paul, this is not the first time we've talked about Alex Boris. He really seems to have become this lightning rod for certain factions in the AI industry. It seems like, yeah, I don't think it'll be the last time we talk about him either. And this to me is just a preview of what's to come. It didn't necessarily prove anything like it's, you know, it kind of let that American citizens will, you know, show an interest in AI. And as you've highlighted multiple times already, the hatred is sort of right in the middle. It's actually a bipartisan issue. Yeah.
1:07:23 · And so I don't think it's solved yet how the vote will move based on AI, but it proved to us that AI is going to be a core part of campaigns moving forward and that this is going to play out again this fall in the United States in particular with the midterms that are coming up.
1:07:38 · And you know, there's an interesting quote in a New York Times story about um about this race. It said uh informed by the Boris race, the AI industry's political activity will enter a new phase. So predicts the writer of this article. They said strategies inv strategists involved in the fights say they are newly attuned to the weakening poll numbers for the industry particularly among Democrats. They also say they are more aware about the chain reaction of consequences they can trigger when they throw their weight around in elections. So that tells you a lot that you might need to know about how you can expect people to get involved in AI moving forward.
More Google DeepMind Talent Woes
1:08:16 · All right, switching gears here. Something not super political this time around. We covered on our last episode Google DeepMind's talent problem. They saw Nobel laurate John Jumper left for Ananthropic. Gemini co-lead Noam Shazir left for OpenAI. And in the days since, the bleeding appears to have continued.
1:08:36 · Bloomberg reported that two more high-profile staffers, Jonas Adler and Alexander Pritzell, are leaving Google for anthropic. Both work seen internally as key contributors to Gemini. Adler was working on Google's AI coding efforts. Pritzell on model training separately.
1:08:51 · Deep mind researcher Arthur Conmi announced he is joining anthropic to work on aligning its on its upcoming models. So this is at least three more senior departures to anthropic in a matter of days. On top of those two we covered last week. Interestingly deep mind chief deisabis pushed back publicly. In an interview, he said Google has by far the biggest and broadest research bench of any lab and that the company still wins its fair share of top talent in what he called a ferociously competitive market. So Paul, I got to ask, we now got Demis weighing in on the issue. Is this just a couple weeks of bad news or is it something bigger and worse?
1:09:28 · I mean, for the last few years, talent has definitely [clears throat] just moved around between the labs. This does seem a bit of an anomaly. like it seems out of the ordinary how many major players are moving and that they're moving to anthropic.
1:09:41 · The only thing that came to mind as you were saying this is I wonder what like how stock options play into this. So, if Anthropics valued today at whatever it is, 800 billion or whatever that number is. Yeah.
1:09:52 · And people are looking saying, "Wow, I could I could jump ship now and get in preipo and you know, my stock options might be worth, you know, a couple hundred million in 3 to 5 years." Like maybe maybe their compensation packages are just really enticing and DeepMind can't compete there or maybe they just think Anthropic is a better lab to go and build AGI and beyond with. I don't know. uh we talked a lot about the last one about how you know deep mind has to be product focused and that's that can be very challenging for people who want to solve for the most important things in AI [snorts] and in science. So uh noteworthy and definitely something we're going to continue to keep an eye on is the movement between the labs.
1:10:33 · All right, so back to our regularly scheduled government focused programming here. So, um, [clears throat] this past week, the European Parliament voted to amend the EU's AI Act for the first time since it took effect in August 2024. So, the biggest change here is a delay in putting parts of this act into effect.
The EU Act and Digital Sovereignty
1:10:51 · So the main compliance deadline for high-risisk AI for things like hiring, education, law enforcement, this is moving from August 2026, so just in a couple months to December 2027, basically a 16-month extension and AI being built into regulated products like medical devices and machinery. The regulations around that are now being pushed to August 2028. And the reason given is that the technical standards companies need need to benchmark compliance against are not yet ready.
1:11:21 · This amendment's also adding a ban uh on AI tools that generate things like non-consensual intimate imagery. Now, the EU council still has to formally approve this, but it's expected to have that in front of it before August. Now, at the same time, in light of everything we've talked about so far, Paul, you had highlighted this week there's this debate heating up around a concept called digital sovereignty. digital sovereignty which is being pushed largely through some key figures and voices at the United Nations. This is the idea that every nation should build its own AI stack at home, its own compute, its own data, and its own models. This had some attention drawn to it this past week because Jacob Hellberg, the US under secretary of state for economic affairs, published an essay called the digital sovereignty trap, arguing hard against this idea. He called it backward and counterproductive and warned that it would leave dozens of countries with smaller, weaker copies of technology that already exists and each pouring billions into rebuilding last year's breakthrough while the frontier moves on without them. So Paul, just curious on your thoughts on both of this. I mean, I sympathize like we've talked about all episode how hard it is to regulate AI, but my god, delaying parts of the AI act by 16 months seems crazy given that like we're probably going to have AGI by the time this thing is going into effect and like how do you see that squaring up? How are you thinking about the digital sovereignty thing?
1:12:41 · I just I mean the whole technical standards that they need to benchmark compliance against are not ready yet. Like how we just talked about this, like how do you devise these technical standards when the the goalposts keep moving and what these models are capable of and how they're trained and how efficient they are and what you thought would have been a good technical standard might not be anymore and it's just going to keep evolving.
1:13:01 · Yeah. And then the idea that like again a lot of this is being driven the digital sovereignty is being driven right now by the fact that our allies who thought they could rely on the US can't rely on us anymore because we may choose not to give them access to these frontier models.
1:13:19 · again like I this whole episode is just like it all is interconnected but it's all so broad and like complex and um like dependent upon each other it's kind of wild actually it' be interesting like we we have a notebook LM that we put all of our episodes into this would be a really interesting episode to take the transcript of and like look at the like the mind map that notebook builds of like how these things are all going yeah the interconnectedness of these topics and I wonder too the through lines over the last several years of what we've talked about and how it's led to this would be really interesting to visualize.
1:13:54 · Yeah.
AI Use Case Spotlight
1:13:56 · All right. Next up is our AI use case spotlight segment where each week we give you a quick look under the hood at some AI use cases we're exploring at Smarter X. So Paul, I'm going to share one real quick and then turn it over to you.
1:14:09 · Okay.
1:14:09 · Uh so this week I just wanted to share kind of related to our talk about codeex, how I used codeex on a very practical data analysis project. So I was basically just trying to better understand how a specific piece of content connected to revenue and basically we have all this rich data that you know we thought could help us better understand whether you know a content asset was actually influencing revenue along someone's path um like a lot of real business data though the answer was not in a single field so I had a lot of messy data and they used codecs like a genius data analysis sitting next to me so I gave it a structured export had to inspect the data, identify which fields looked revenue related, which looked related to attribution, which ones are probably too noisy or duplicative to trust at face value. Importantly, used a fully anonymized segment of data. Kept this focused on patterns, fields, and counts, and definitions, not like there no personal details in here. But super interesting to note the scale here. So, I could literally, not even kidding, I could barely open this spreadsheet without crashing my computer. It was like 144,000 rows, a thousand columns.
1:15:18 · Codeex was able to scan all that, narrow it down to a few hundred fields that looked relevant. From there, it turned this into a very focused analysis. So, it surfaced likely candidates for fields I might have been looking for, explained what each type of field could and could not tell me, gave me a sanity check on some smaller cohorts from this larger data set, so I could quickly understand kind of exactly how to model some revenue attribution. based on real contacts. And you know, I think this gets at what I was saying before, like Codex is sold as this coding tool, but I was not using it to write software, which surely it was writing code as part of its analysis, but I was using it as a generalpurpose agentic system for knowledge work. So, you could certainly try to do this in chat. You could drop a spreadsheet in there and analyze it.
1:16:03 · That's nothing new. But what's cool is that with something like Codeex, you can just hand it a goal, say, "Here's where I want to end up," and then it just runs the whole investigation into the data itself. It fixes its errors, goes down different paths, chains dozens of steps together until the work was done. So, kind of just uh behind the curtain of like why you might want to use a a more robust tool like that for something like this.
1:16:27 · I'm anxious to see that report [laughter] schedule meeting. I want to see how this all worked. Um yeah, mine I I just have two quick ones. So one, um I've alluded to like I'm working on a couple of like very high level strategic things for the company. Uh one of them is developing an AI transformation system. So I actually wrote a little bit about this in my execi newsletter this past weekend for Smarter X and then I posted an excerpt of that on LinkedIn. So you can see a little bit what I'm talking about on there. But in essence, I'm developing a set of tools and frameworks to help individuals and organizations um guide their transformation, understand where they are in their journey, and then develop action plans based on it. And so I've been working on a lot of components to this. There's probably over two dozen elements to the transformation system. I would say that we'll we will release all of it in in the next few months probably. So, I've started sort of um sharing out some elements of it, but there was one thing in particular I was really struggling with how to uh explain and so like I'd created an internal brief. It was I I was reading it and I'm like I still it's just not there like this is not the messaging isn't simple enough and I had predominantly done it and claude and no matter what I how I pushed Claude, how I changed the question, how I put the kind of it just kept not it wasn't getting there. So, I was like, "All right, I'm going to start fresh. I go into Chad GPT and I did this while I was driving which is again kind of a cool element for me. It's like all right I got 20 minutes like I just I'm just going to think about this out loud.
1:17:56 · So [snorts] I just open it up. I'm like listen here's what I'm working on. Here's the trouble I'm having. I need a way to describe these things that's totally intuitive and it just like makes sense and it came up with something. I was like oh that might actually be it.
1:18:10 · And then I was like okay let's let's go down that path more. And so I was on my way to a breakfast actually. I get to breakfast. I was there early before the person I was meeting. I continued the chat then in you know typing form um where I was like all right let's play around with this and here you know here's some additional context and I think I I ended up nailing it something that I've been working on for months that I just couldn't get through it it all of a sudden worked by just bouncing over to chat GPT and giving a little fresh perspective. So um that was amazing to me. I mean it's like a foundational piece of how we're going to explain everything we're doing. So, this is like a one of those like it's a random conversation on the way to a breakfast that may end up becoming core to Smarter X's positioning and product offering for the next 10 years. Like it those kind of things. And then another fun one, I um I have a golf bag from that's still branded with my agency that I sold in 2021 and that bag itself is probably 10 years old. It's falling apart. So, I was like, I need a Smarter X golf bag. So, I found a bag I wanted and I was going to go order it, but I'm like, "Wait, I want to see what it would look like with the embroidering on it."
1:19:16 · So, I took the image of the golf bag, dropped it into uh Gemini, and I was like, "Can you help me visualize the Smarter X logo on this bag?" And and it like for whatever reason, I don't know if Nano Banana got dumb, but like it couldn't do it. It was driving me insane cuz I was like, "This is not a hard thing I'm asking you to do." And so went over to Chad GPT and I was like, "Let me try over here." And uh it nailed it. So, that was a fun one. And it's like, it looks amazing. I'm hoping the actual bag looks as good as the one I created, but I sent it to the vendor. I was like, here's here's what I want it to look like. Here's where I want the logo placement to go. Um, here's the bag I want. Versus me trying to explain the positioning of the logo and things like that. So, that was just a fun use.
1:19:55 · That's awesome. I'd love that.
1:19:57 · [laughter] All right, we're going to wrap up with some AI product and funding updates real quick. So, I'm going to run through these very fast, but first, OpenAI and Broadcom unveiled Jalapeno, which is OpenAI's first custom AI chip. This is an inference processor purpose-built for large language models that the two companies co-designed in a 9-month cycle they call the fastest ever for an advanced chip. They have initial deployment targeted for the end of 2026.
AI Product and Funding Updates
1:20:23 · Anthropic launched a feature called Clawed Tag in beta. This is a Slack integration that lets teams tag Claude as a collaborator in a channel so it can pull context from the conversation, use connected tools, and complete tasks asynchronously. Anthropic says its internal version already writes 65% of its product team's code. Anthropic, as we alluded to, also accused Alibaba in a letter to US officials of the largest known distillation attack against it to date. They alleged Alibaba used roughly 25,000 fraudulent accounts to run 28.8 million exchanges with Claude between late April and early June to illicitly extract its capabilities.
1:21:05 · The company's Stripe Anthropic and OpenAI are backing Intercept, a new $500 million nonprofit led by Stripe executive Nan Ranshoff that aims to dramatically reduce and eventually eliminate respiratory infections through vaccines, air cleaning infrastructure, and antiviral research. Google is taking a tougher line with news publishers in its AI licensing talks, pressing them to grant broad rights to use their content, including to train its AI models in a Google News pilot and warning that they some that they will lose payments from its older showcase program if they do not sign on. At the same time, Google is investing about $75 million in studio A24 and partnering with its DeepMind unit to build AI film making tools. Its first stake in a movie studio. It's a multi-year non-exclusive deal that does not give it access to A24's film library, but it did draw immediate backlash from A24 fans. Mirand deal co-founded or founded by anthropic veterans Benham Naore and Harsh Meta raised a $200 million seed round at a $1 billion valuation led by Andre Horowitz and Kleiner Perkins with Nvidia participating one of the largest seed rounds in AI history to build AI that accelerates AI research itself.
1:22:26 · General Intuition also raised a $320 million series A at a $2.3 billion valuation led by Kla Ventures with General Catalyst Eric Schmidt and Jeff Bezos investing to build large action models trained on billions of gameplay clips from the 17 million monthly users of its metal platform. And finally, Superhuman, the email company Grammarly bought and rebranded last year, acquired AI detection startup GPT0 for an undisclosed sum, adding GPT0's 19 million registered users and roughly 30 million in annual recurring revenue to its existing operations.
1:23:04 · I would suggest people go read the quad tag stuff. We'll put some links in the show notes. That was one that probably could have been a a topic for sure today. Um, it it alludes to the future of work stuff we talked about with the OpenAI research. Once you give agents access and treat them like co-workers, it changes the dynamic. So, it's just in preview. Salesforce workers are not happy about it from an information article. Like, they're not understanding why they're allowing this kind of thing to happen in Slack instead of Slack bots doing this. So, it's um it might take a few months for that to be realized like the significance, but I think it's just continuing to show practical applications of agents and how it can almost overnight change the way we interact with them and how how they function as true co-workers.
1:23:50 · And one final reminder, we have our weekly AI pulse survey up at smartrx.ai/pulse.
1:23:57 · So this week's survey is going to ask a little bit about how you feel about the US government handling the release of new AI models and also getting more of a pulse on how if at all you're using things like codec or cloud code in your own work. So Paul, I mean just easily a banner week uh not always in a good way uh in AI. I appreciate you breaking everything down for us.
1:24:21 · Yeah, like we said at the beginning might might get some models released this week. Maybe the government will feel generous in the Fourth of July week. Maybe not. Um, yeah, we'll see what kind of uh news dump we get in a holiday week, but hopefully not as much government talk next week. I could do without it, and I'm sure listeners would like a break, too, from that. All right, thanks, Mike.
1:24:44 · Thanks, Paul.
1:24:46 · Thanks for listening to the artificial intelligence show. Visit smarterx.ai AI to continue on your AI learning journey. And join more than 100,000 professionals and business [music] leaders who have subscribed to our weekly newsletters, downloaded AI blueprints, attended virtual [music] and in-person events, taken online AI courses, and earned professional certificates from our AI academy, and engaged in the Smarter X [music] Slack community. Until next time, stay curious and explore AI.