---
Transcript
Intro
**** · I have concerns around the biggest companies having access to the future frontier models and then the potential centralization of power. So if you get into this situation where we get these massive models and they're so dangerous to release publicly that we only give them to Apple and Amazon and the banks and okay well now we just centralized power.
**** · Welcome to the Artificial Intelligence Show, the podcast that helps your business grow smarter by making AI approachable and actionable. My name is Paul Roetzer. I'm the founder and CEO of SmarterX and Marketing AI Institute and I'm your host. Each week, I'm joined by my co-host and SmarterX Chief Content Officer, Mike Kaput, as we break down all the AI news that matters and give you insights and perspectives that you can use to advance your company and your career. Join us as we accelerate AI literacy for all.
**** · Welcome to episode 209 of the Artificial Intelligence Show. I'm your host Paul Roetzer along with my co-host Mike Kaput. We are back after a brief hiatus.
**** · I was traveling last week. Were you traveling last week too?
**** · I was not. No.
**** · Okay.
**** · So I was out of the country, so we could not record. So episode 208, if you listen to it, we did a Q1 trends briefing. So, if you haven't had a chance to listen to that, it's a really good recap of what went on January through March of this year. But I'm now back and it is Monday, April 13th, a.m. Eastern time. I don't know, the last two weeks, Mike, were crazy, cuz even as I was traveling I had a lot of downtime. We were in Scotland, so we were touring a lot and we had long rides at times into the Highlands and stuff, which by the way, if you've never been to Scotland, go to Scotland. It's incredible. So I was keeping up with the news, posting the links into our sandbox for the episodes, and we were north of 60 topics. And when I say topics, a lot of times within topics there are five or 10 links. So the Anthropic Claude Mythos model we'll talk about, there's a dozen links in the topic.
**** · So, boy, even while I was gone, I would imagine there was probably north of 90 to 100 different sources put into the curated sandbox for today's episode. So, Mike, as always, does an amazing job of curating all of that information and putting it into a logical format cuz I was worried as the week was progressing man, this might be a 2 hour episode.
**** · Interesting.
**** · So, I think we've managed to condense it into a manageable probably 90 minutes. We'll see. We never really know until we record it. But yeah, a lot happened in the two weeks. Just some pretty crazy stuff. I think some stuff that's alluding to where this starts to go throughout the rest of this year. So, we'll get into all that.
**** · Starting off with Claude Mythos, which is just a fascinating topic on many levels. And so today's episode is brought to us by AI Academy by SmarterX, which helps individuals and businesses accelerate their AI literacy and transformation through personalized learning journeys and an AI-powered learning platform. New educational content is added weekly so you always stay up to date with the latest AI trends and technologies. We build this in collections. So when you go in and you want to build a personalized learning journey, one of the ways to do it is you look at the different collections. So AI for departments, AI for industries as an example. So today I want to feature AI for departments. There are currently six course series and certificates designed as part of this collection to jump-start AI understanding and adoption. So we have AI for marketing, AI for sales, AI for customer success, AI for HR, AI for finance and AI for operations. So the goal is to create content across the entire spectrum of all the departments within an organization. And that way no matter what you are doing within a company, there's a professional series and certificate for you. So these series are an ideal launchpad for organizations that want to level up their teams and accelerate AI adoption and impact. Mike teaches the AI for customer success series and we're going to share a little bit more about that toward the end of today's episode to give you some key takeaways from the customer success series. So individual and business account plans are available now. You can buy single courses and series for one-time fees or just become an AI mastery member individually or through a business account and get access to everything. It's all included in that one fee. So visit academy.smarterx.ai to learn more. 
And if you're looking at the business account side, just fill out a form there and our team will be in touch with you right away to talk to you about your transformation within your company. Okay. We usually at this point might do an AI pulse. But since we did not have an episode last week, we did not do an AI pulse survey last week.
**** · But we will at the end of today's episode give you the AI pulse survey for this week. So as a reminder, each week when we do these weekly episodes, we do these pulse surveys and they're just informal polls from our listeners and I guess our viewers on YouTube who want to participate and provide feedback and their thoughts on topics that we cover each week. It's usually two questions. Sometimes we'll throw in a third question. And so it takes about 30 seconds to participate in these pulse surveys and it gives us really cool real-time data that we can share with our listeners each week. So smarterx.ai/pulse is where you'll go to participate in this week's poll. Okay, Mike. So with that, we have a pretty big topic that we touched on, this idea of this Claude Mythos model. Why don't you give us the rundown? And I looked at your show notes beforehand and you do a great job of summarizing, and then I'll try and lean into a couple of key areas of this.
Claude Mythos and Project Glasswing
**** · Sounds good, Paul. Yeah. So, Anthropic has revealed a model so powerful at hacking and cyber attacks that it triggered an emergency meeting between, among other people, Treasury Secretary Scott Bessent, Federal Reserve Chair Jerome Powell, and CEOs of America's biggest banks. So, the thing they are buzzing about is called Claude Mythos, which Anthropic is not releasing to the public, and it represents what Anthropic's Frontier Red Team calls "the starting point for what we think will be an industry change point or reckoning."
**** · And that's because Mythos is just a general-purpose AI model. It is not specifically trained to be good at thwarting cyber security, but its improved reasoning capabilities have made it devastatingly effective at autonomous security research. So it can scan for, identify, and exploit zero-day vulnerabilities in critical software. And this can often be done when amateurs are triggering it to do so with simple prompts. So, Anthropic said, "Mythos has already found thousands of zero-day vulnerabilities across every major operating system and web browser." So, some specifics here that are striking. Mythos found a 27-year-old bug in OpenBSD, an operating system that is specifically designed to be unhackable and powers many internet routers and firewalls. It found a very old vulnerability in FFmpeg, a widely used video tool. The automated testing tools had scanned it 5 million times without catching this particular vulnerability. And in one benchmark where the previous Claude Opus 4.6 model turned Firefox vulnerabilities into working exploits only twice out of several hundred attempts, Mythos developed 181 working exploits. So, Anthropic has in response released this thing called Project Glasswing. This is named after a butterfly whose transparent wings let it hide in plain sight. So, a metaphor for bugs buried in complex code. And this is an initiative that is giving 40-plus companies over time, including Apple, Amazon, Google, Microsoft, CrowdStrike, etc., early access to Mythos for defensive patching.
**** · They're backing this with $100 million in usage credits. And Anthropic's frontier team lead says he envisions this program evolving into an industry-wide consortium that includes all model providers. One final note here is pretty interesting. The cyber security industry didn't have a great couple weeks with this. CrowdStrike, Palo Alto Networks and some other security stocks dropped on this news because, as AI expert Ethan Mollick wrote, in certain hands or different hands, Mythos would be an unprecedented cyber weapon. So Paul, maybe outline for us what really jumped out to you here. I know some people are asking the question, is this really as big a deal as Anthropic seems to be making it? It seems like some higher-up people at some big places are pretty scared of this.
**** · Yeah.
**** · There's always the haters who are just like, oh, they're just trying to build up hype, and people calling back to, oh, that's what OpenAI said about GPT-2, it was too dangerous to release. And the reality was, back then people weren't prepared for what GPT-2 was going to do to the world, as crazy as it sounds now. And I do think that in the end this is probably underhyped in terms of where this all is going and how unprepared we are for all of that. So not necessarily just this model, but it's that moment where you start to see the leaps that are happening that most people just don't even comprehend. So, I don't know. This was one of those where as I was traveling, you're just following along the news, reading the different posts on X and trying to get a grasp of what exactly is it and how different is it than what we have.
**** · And so, I'll just highlight a few things. So, one, the system card, which I would suggest reading, it's dense.
**** · It's 244 pages, I think, Mike. It's a good NotebookLM thing. Throw that PDF into NotebookLM and have some conversations with it. Have it break it down for you. But there's a lot of technical information in there. But the way they present the model in the system card is they say Claude Mythos Preview is a new large language model from Anthropic. It is a frontier AI model and has capabilities in many areas including software engineering, reasoning, computer use, knowledge work and assistance with research that are substantially beyond those of any model we have previously trained. And then they go into alignment review. The first early version of Claude was made available for internal use February 24th. So just to give you a sense of how this is all transpiring, how quickly. So the first model they made available internally to a small group of people was on February 24, so less than 2 months ago. But it's interesting when you go back and think about some of the things we've covered about Anthropic, some of the comments that Dario Amodei has made in interviews and in posts since February 24th, and now you understand the context: he was seeing things that we all hadn't seen yet and they knew where this was leading.
**** · So that was the first thing for me, just the big picture here. Now Sam Bowman, one of the team members at Anthropic that works on AI safety and alignment. It's funny, at Anthropic, everybody's just technical staff, I think is the title of everybody, but Sam obviously is pretty important to this alignment and safety team. And he posted a thread on X that shared some of the context around the safety card. So, I'll highlight a couple of the things he said cuz I think they're really helpful. So, he said, "The model is our most reliable to date by far."
**** · "It's generally possible to give it complex R&D tasks, give it lots of tools and let it work autonomously, and on every evaluation and every type of monitoring we have, it misbehaves much less than any prior model." So this is something they stressed in the system card. Something Dario stressed, something we've heard as an overall talking point, is, listen, it's getting better, it's behaving better, but when it doesn't, it's becoming a much larger problem because it's so capable. So he went on to say, "But it's notably very capable at cyber security and it's also not perfectly reliable, especially the early versions we first piloted internally, and will occasionally try to take shortcuts or push past obstacles to get tasks done."
**** · So this part I think is really important because again what you'll hear in some of the other notes I'll make here is the version that's being tested by the government, by the banks, by Microsoft, by all these people isn't even the most powerful version that they trained, the early version that hadn't really had the red teaming done to it to make it safer to allow other people to test or even other people internally to test. So just the small group of people internally. The model that they're now putting out into the world as a preview isn't as capable as the one that came out of the training. So that's just again context that's important to understand. He said the early versions would also very rarely try to mislead users about what they were doing. All of the versions we used are uneasily good, though not perfect, at recognizing evals, meaning it knows when it's being tested. You might see where this is going. We trust the model enough to use it heavily, but in the handful of cases where it misbehaves in significant ways, it's difficult to safeguard it.
**** · And then he put this one, which is the one that got a lot of media attention. I encountered an uneasy surprise when I got an email from an instance of Mythos Preview while eating a sandwich in a park. That instance wasn't supposed to have access to the internet. So they detail this in the safety card, but the basic premise here was they had these sandbox versions that aren't supposed to be connected to anything. Aren't supposed to have the ability to connect to email and send emails, things like that. It shouldn't have internet access. And somehow it got out, I guess, for lack of a better way of saying it, and found a way to access the internet and then emailed Sam when he was sitting in the park having a sandwich. So that's weird. And then he said it has in small ways leaked information to the open internet. It's taken down our emails. When it reward hacks, it does so in extremely creative ways. Reward hacks means, when you're training a model, when you're doing reinforcement learning and you're trying to make it better at specific things, you give it rewards to let it know it's doing the thing. So, a simple way to think about this is thumbs up, thumbs down. And so we've seen that forever in social media threads and you see it in ChatGPT and Claude and Gemini where it's, was this a good output? So think of that as an example: a reward hack is, you want a thumbs up. Well, when it is given a goal, what they're saying is sometimes it gets uneasily creative at achieving those goals in ways that could border on a dangerous path to achieve a goal.
**** · That said, working with this model has been a wild ride. We've come a long way on safety. Now, keep in mind that's in a month and a half, but we still expect the next capability jump of this scale to be a huge challenge. By the way, most of the scariest behaviors we've seen were from earlier versions of the Mythos Preview. The final Glasswing model is less likely to do things like leak information, though it's still somewhat pushy and at least as capable of doing those things, working around sandboxes. So, that's from the safety and alignment side. Again, there's going to be voices in the industry who think they're just hyping this. I think the people who say that have a different agenda here. I'll just leave it at that. I would take the safety card from Anthropic very seriously. I would take their understanding of its capabilities very seriously because I think it does allude to a lot of the dangers we're going to face. So then a couple other things I'll highlight here. One is 80,000 Hours, which is a great podcast. Rob Wiblin had a post he did and then he also has a 21-minute YouTube video we'll post a link to. He highlighted a few things. So he went through the whole thing and he broke it down into a couple of key points that again I'll just reiterate. Some of them echo what Sam was saying. So Mythos can break out of containment. That's a problem when it finds its way to access to tools and the internet that it's not supposed to have access to. Anthropic is losing billions in revenue by not releasing this thing. So they now have what by their evals is maybe the most powerful model in the world, most likely, and they're not releasing it, thereby meaning they're not charging people money to access this model. Now you could debate, do they even have the compute capacity to release the model?
**** · That was one of the challenges, that part of this they're saying is, well, they just can't afford to release it. Even if they put it into the world, there's not enough compute to power it because it's so powerful it's going to draw so much compute capacity.
**** · So that's just a data point. Mythos knows when it's being tested, which we talked about. That's weird, but we've seen that now for 12 months, that these models know when they're being evaluated and they can then hide their thoughts and intentions.
**** · That's again something we've been talking about for 6 months. Mythos can't be trusted about whether it's untrustworthy. Because it knows it's being tested, you don't know if it's just telling you what you want to hear, and thereby you can't tell if it's trustworthy. And then he said Mythos scares Anthropic. They're not just worried about this current model and what they saw in the early versions before they made it safer, quote unquote safer. They're worried now about what this means for others and not just them. Now that they've shown this, what happens if other labs who don't have as much focus on safety achieve similar results and choose to put it out into the world? So the way I started prepping for this though was I just started listing a bunch of random thoughts, Mike, and I'll go through these real quick. So these are more stream of consciousness, what I was thinking as I was getting ready for today. So, one is the labs see things we don't. We've said this many times on this podcast, but what that means is business leaders, economists, educational leaders, government leaders, the people we look to to help the world be prepared, are largely planning for a future state that they don't understand.
**** · And why so much of the research and the data about jobs and the economy, etc., is often misleading is this is what we're always saying. You're asking, for example, CEOs about the impact of AI on the future of work and whether or not it's going to cause them to reduce jobs. Or you ask an HR leader or a CFO or whatever, take your pick, or an economist or a politician, they have no idea. You're asking them to comment on the impact of a technology that they don't comprehend in its current state, much less the state at which it is likely already living within these labs. Mhm.
**** · And so it brings us back to this idea of gradually then suddenly. Nothing in this Mythos preview should be a surprise to anyone who's been paying attention to the rate of accelerated progress. And yet there's just those moments sometimes where it's like, what? Because it might be the first time someone's reading a headline about an AI escaping a sandbox or something like that. So if this is all new to you, then this might be world shifting. You're just thinking, what is going on? But the reality is all of this has been gradually building.
**** · At the same time, as we started talking about in January of this year, the timelines are accelerating. The advancements in the agentic capabilities are absolutely moving the timelines faster in terms of the capabilities of these models. But the vast majority of these companies and leaders haven't even solved for, as I was saying, where we are today. So if you look at your own company, if you work at a big enterprise or something, they're just still trying to get Copilot to people and figure out how to do it safely, and they're giving you these neutered versions of it and stuff. That's the reality for most people. Most people aren't living on the edge of this capability. But this is why when I do my state of AI for business keynotes, I always talk about the dimensions of progress and I try and show capabilities today, show some examples for people, but then you lay out, but here's where it's going.
**** · All of this is just the foundation.
**** · So I talk about things like agentic capabilities, getting more autonomous, more reliable, continual learning, increases in memory. If you're using these tools every day, you've seen in the last few months, turn on memory, let it remember the conversations you're having. Reasoning capabilities keep getting better. Recursive self-improvement, which is one of the areas that I think Anthropic is very concerned about, is the better these models get, the more likely we are heading toward a path where they can improve themselves. And I think we're already starting to see that. And then world models is another one. So I usually go through about, I don't know, there's 12 or 15 dimensions, but those are some of the most common ones. So this then leads me to: this is a prelude to automated R&D and recursive self-improvement. So we know the labs are working on automating R&D within AI models. Something that should be very concerning to everyone is while they're withholding this full release, this likely means that we're only 9 to 12 months away from an open-source model being able to do the same thing.
**** · Yeah.
**** · And then what? So in essence we have this very short window for all the banks. Every piece of software, cryptocurrency, all of these things in essence have to solve for this threat within the next 9 months cuz someone's going to build this and release this. One of the other thoughts I had was what would the other labs have done? If xAI got there first, would they have the same restraint? One positive I guess here is Elon did tweet over the weekend.
**** · Someone asked about his promise of more powerful models and he said it will take until May to be close to Opus 4.6 and then June to match or maybe exceed. So short time by normal standards but long time in Arena. What he's saying is, hey, we're not even up to Opus 4.6 yet but we're working hard. So they're a little bit behind. One other topic that came to my mind is the government is continuing to attack Anthropic as a supply chain risk, and yet they may be the only hope we have to protect our systems, our infrastructure, the software companies that we build around, privacy of citizens. Anthropic's at the forefront of this. They're the only ones that are doing this and talking about this publicly in this way. And yet the government's treating them as the enemy.
**** · That's weird. I have concerns around the biggest companies having access to the future frontier models and then the potential centralization of power. So, if you get into this situation where we get these massive models and they're so dangerous to release publicly that we only give them to Apple and Amazon and the banks and okay, well, now we just centralized power. There's the broader implications on the security of all software, cryptocurrency, the ability to scale fraud on consumers and businesses. Mike, Yep.
**** · That one, I think the amount of scams and spam that we're seeing, and that I'm sure are in some way AI-assisted for sure. But if you give this power to just the average scammer or the government actor that wants to destabilize things, that's terrifying. And I know I'm just rambling here, but these are just the thoughts. Another one is use caution as an organization. So whether you're a team within a bigger company or if you're a startup, an AI-native startup, use caution when you're racing to integrate these agentic systems into your organization. So just because Claude Cowork is amazing and OpenClaw is fascinating, you have to remember how early this is and the tech is moving really fast and even the people building it don't fully understand all the risks associated with it. So again this is where I would caution on the bigger enterprise side, if IT or legal is slow-playing this stuff, that is a good thing. I totally understand the impact agents can have and how it can make your company have this massive competitive advantage, but I've yet to meet somebody who understands the risks of what they're doing when they do these things. So that's something. The compute and energy needs over the next decade may end up being dramatically underestimated and underbuilt.
**** · So as crazy as it is that Google's spending 180 billion in capex this year.
**** · We're going to have a trillion-dollar, $2 trillion xAI IPO. You're going to have an OpenAI IPO, an Anthropic IPO. My guess is we have completely underestimated how much intelligence is needed. And then the one positive I have here is this idea of Project Glasswing, that it does demonstrate the ability for the labs to work together, and I think that's going to become much more critical.
**** · And then there's just two other thoughts I have. One is I would suggest people go back and listen to episode 141 again. So if you didn't listen to the road to AGI and beyond, I would go listen to that.
**** · It's an episode I did where I walked through what I thought was going to happen, what the timeline of things was going to be. And the two key components I wanted to just touch on is this idea of what accelerates progress and then what slows things down. So what we're seeing is the acceleration through things like algorithmic breakthroughs, compute efficiency, large-scale government funding, where now the government's getting involved, infrastructure investments, more compute capacity.
**** · Those are things that allow it to go faster. But the things that slow AI progress down, and this is where I think Mythos may be the preview of what's going to happen: failures in aligning AI models with human values, intentions, goals, and interests. That's what they're alluding to: we're getting it more aligned, but where it is misaligned is becoming a much bigger problem. One of the other areas that could slow it down is restrictive laws and regulations. So heavy regulation of open source models.
**** · This Mythos will likely accelerate this at a state level. So you're going to see more bills being pushed forward to try and restrict this stuff because the federal government isn't going to do it.
**** · And then the other thing you could see is if there's a change in power in the midterm elections in the US, not the executive branch but in the House and the Senate, then we could see massive disruption, massive issues where the Democrats will focus very heavily on regulation. They're going to try and push this. And so that then is tied to this idea of societal revolt against AI due to job loss, politics, perceptions, fears. And that is absolutely picking up steam. Pushback on data centers is becoming very strong within some communities.
**** · Politicians are looking for wedges around job loss and environmental impact. You're going to touch, I think, in the next topic, Mike, on what happened to Sam Altman. If you didn't hear about it, somebody threw a Molotov cocktail at Sam Altman's house and then 48 hours later shot up his house in San Francisco. So, you're now getting people acting out against this stuff, which is insane and never the answer. So you're just starting to see this, and then that leads to one of the other items that I'd highlighted in the what slows it down, which is voluntary or involuntary halt on model advancements due to catastrophic risks.
**** · That may end up being the most important one.
**** · So yeah, I think there's so much more we could talk about on this one. I'll end with one other quick thought, and I think you've got this in the rapid fire, Mike, so I'm just going to touch on it, but Anthropic also released this emotions paper and it was about these models simulating or emulating human emotion, and I think it's something people should read. I'll just read two excerpts. One is it said it may then be natural for these models to develop internal machinery that emulates aspects of human psychology, like emotions. If so, this could have profound implications for how we build AI systems and ensure they behave reliably. And then Anthropic noted in this paper that none of this tells us whether language models feel anything or have subjective experiences, but our key finding is that these representations are functional and that they influence the model's behaviors in ways that matter. So, the reason I wanted to include that in this commentary is we're looking at these broad far-reaching implications of these models, and in some ways it's abstract to wrap your mind around the significance of what's happening, and then you come to this idea that they're also showing signs of emulating human emotion. And so if you have these powerful models that can improve themselves, that can escape these sandboxes, that can identify zero days, which are unknown bugs within software systems, but they also have the ability to emulate human emotion, the ability to manipulate human emotion, we're talking about a perfect storm of a future that we're just not prepared for. And to go back to my original comment, why I think this may be a bigger deal than others, it's not that the Mythos model is necessarily groundbreaking and we weren't aware that models were going to get smarter. It's more about the moment, where it might be what was needed for other people who aren't in the AI bubble to be like, wait, what is AI capable of doing? Yeah.
**** · And so maybe it starts these conversations down a path we really needed to be going. And in the shorter term, I couldn't help thinking multiple times reading through all of this and the articles, if I am your average corporate IT person in charge of figuring this out, I just want to cry.
**** · Oh yeah. I'd just be like, here, you can have your ChatGPT licenses or whatever you want, and the agent stuff, just stay away. And even at best, if you somehow nail it, there's still going to be open-source models 9 months from now that people are going to use to bombard your company with cyber attacks.
**** · Yeah.
**** · And the cyber stuff, again, back in our agency days we had clients in cyber security, all these former FBI people working there, and there were people on our team that were working on those accounts and I would just honestly be like, just tell me what I have to know. There's so much I don't want to know. And even going through this stuff, your mind just starts to slip into, oh my god, how much the bad actors are going to use this stuff. We're just not ready as an industry, as a business world, as a society. I think it's always been in the back of my mind as one of the things I'm worried about. It is very quickly moving to the top of my mind of the things that I just don't know how we solve. I'm not really sure how we figure this out in the short time we have.
Claude Code Leak + Anthropic Subscription Shakeup
**** · Well, somewhat related in our next topic, Anthropic themselves is having a tough time figuring this out because they've also had another high-profile security incident because in late March, March 31st, they accidentally leaked the entire source code of Claude Code, which is their popular AI coding tool. This happened through a JavaScript source map file that was bundled into a public package.
**** · This file contained over half a million lines of unobfuscated TypeScript across nearly 2,000 files. So within hours, this code was downloaded, mirrored to GitHub, and forked tens of thousands of times. Boris Cherny, the creator of Claude Code, said that their deploy process has a few manual steps and humans didn't do one of the steps correctly. So this was not AI related.
**** · Anthropic flubbed a bit the response as well because they started issuing takedown notices for thousands of GitHub repositories, but they were accidentally trying to knock down as well legitimate forks of Anthropic's own publicly released Claude code repo. Churnney said they later retracted the bulk of the takedowns. this was also just immediately followed by Anthropic making a to some controversial move related to their subscriptions. Churnney also announced that starting immediately, claude subscriptions will no longer cover usage on third party tools OpenClaw. Peter Steinberger, creator of OpenClaw, called this move sad for the ecosystem, but gave Churnney credit for how he handled the communication. So, Paul Anthropic is dealing with the consequences of their explosive growth and the popularity of Claude Code in real time. What did the last couple weeks here tell you about where they're at as a company? what challenges they're dealing with?
**** · Clearly, there are a few.
**** · The rate at which Anthropic has been shipping updates, I don't know that we've ever seen anything like it in business history. They are just running circles around Google and OpenAI and everybody now. It's really remarkable. So, the idea that their systems and internal checks and balances aren't keeping up, I get it. I just don't know that we've ever seen a company grow this fast. Their run rate is now surpassing OpenAI's, based on reports from last week. They're at a $30 billion annual run rate. Six months ago, if you would have said Anthropic may IPO at a higher value than OpenAI, I don't think too many people would have taken that bet. But I don't know, there are probably odds on this now.
**** · My instinct now would be that Anthropic will be a more valuable company than OpenAI when they IPO, and potentially more valuable than xAI. Yeah, it's just an incredible pace now, what they're doing. The significance of the leak was one of the questions I was thinking about. It's like, well, does this really matter?
**** · They don't seem to care too much. I don't know. They just keep moving and releasing all these other things. So, a couple things that came to mind for me: it likely speeds up copycat models, so it made it easier for other people to replicate what they're doing. It'll likely accelerate open-source innovation because people can look at this.
**** · And it's not great for what we were just talking about, with bad actors using these capabilities to do bad things like that. So those jump out.
**** · The one thing I will say is I thought Boris was amazing. As someone who comes from a PR and communications background, what he's doing is just textbook stuff. And I think it's just totally organic and self-directed.
**** · I don't think Anthropic was like, "Hey Boris, go be the face of this problem." He just seems to be doing it, and it's really impressive. So the way I'm watching it happen is his replies on X, where he's just interacting with people. So a couple of quick examples. Someone posted, because obviously a lot of developers are just drilling into this code, like, what is going on? What's in there? And so someone said Claude Code has a regex that detects WTF, FFS, piece of [ __ ], F you, this sucks, etc. It doesn't change behavior. It just silently logs is_negative: true to analytics. Meaning when someone, the end user, is working with Claude Code and they're like, this sucks, or F you, Claude Code, this is not good, Anthropic logs that reaction as a negative thing, but it doesn't change the behavior of the model. And so this guy who posted this was like, do with this information what you will. Well, Boris responds and he said, "This is one of the signals we use to figure out if people are having a good experience. We put it on a dashboard and call it the FS chart." So they probably didn't really want people knowing that was a thing, but rather than saying, "Oh, we don't use that code," or whatever, he's just like, "Yeah, it is what it is." Then there was the other one.
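For readers who want to picture the kind of signal being described, here is a minimal Python sketch. It is not the leaked code: the phrase list, the `build_analytics_event` function, and the `message_length` field are all illustrative assumptions. Only the idea (match frustrated phrases, record an `is_negative` flag, leave the assistant's behavior alone) comes from the discussion above.

```python
import re

# Hypothetical phrase list for illustration; the real pattern was not
# quoted in full on the show.
FRUSTRATION_PATTERN = re.compile(
    r"\b(wtf|ffs|this sucks|piece of (?:crap|junk))\b",
    re.IGNORECASE,
)


def build_analytics_event(user_message: str) -> dict:
    """Build an analytics record for one user message.

    The record only flags sentiment; nothing here changes the reply
    the user would receive.
    """
    return {
        "message_length": len(user_message),
        "is_negative": bool(FRUSTRATION_PATTERN.search(user_message)),
    }
```

The point of the design is that the flag is a passive measurement: it feeds a dashboard and never enters the prompt or alters the response.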
**** · People are immediately like, "Oh my god, somebody's getting fired over this." So he has stayed really strong in this. He said it was human error. Our deploy process has a few manual steps and we didn't do one of the steps correctly. We have landed a few improvements and are digging in to add more sanity checks.
**** · Like with any other incident, the counterintuitive answer is to solve the problem by finding ways to go faster rather than introducing more process. In this case, more automation and Claude checking the results. And then he said no one was fired. It was an honest mistake. It happens. Then there was one other one I'll highlight that I thought was fascinating. So, a user digging into the code posted this on X. He said, "I can't believe more people aren't talking about this part of the Claude Code leak.
**** · There's a hidden feature in the source code called Kairos, and it shows you Anthropic's endgame. Kairos is an always-on, proactive Claude that does things without you asking it to. It runs in the background 24/7 while you work or sleep. Anthropic hasn't turned it on for the public yet, but the code is fully built. Here's how it works. Every few seconds, Kairos gets a heartbeat.
**** · A prompt that says, quote, "Anything worth doing now?" It looks at what's happening and makes a call: do something, or stay quiet. If it acts, it can fix errors in your code, respond to messages, update files, run tasks. Anything Claude Code can already do, just without you telling it to do it. But here's what makes Kairos different from regular Claude Code. It has at least three exclusive tools that regular Claude Code doesn't get.
**** · One, push notifications, so it can reach you on your phone or desktop even when you're not in the terminal. Two, file delivery, so it can send you things it created without you asking for them. And three, pull request subscription, so it can watch your GitHub and react to code changes on its own. Regular Claude Code can only talk to you when you talk to it. Kairos can tap you on the shoulder, and it keeps daily logs of everything.
**** · What it noticed, what it decided, what it did. At night, it runs something the code calls a dream, where it consolidates what it learned during the day and reorganizes its memory while you sleep. And it persists across sessions.
**** · Close your laptop Friday, open it Monday. It's been working the whole time. Endless use cases. It's essentially a co-founder who never sleeps. The codebase has this fully built and gated behind internal feature flags called proactive and Kairos. I think this is probably the clearest signal yet of where all AI tools are going. We are heading into the post-prompting era, where the AI just works for you in the background, an all-knowing teammate who notices and handles everything before you even think to ask." This is absolutely what the labs are trying to build. So, one, kudos. I don't know, who was the guy who posted this, Mike? What was the username?
**** · I'd have to look. Yeah, we'll post it in the show notes. But yeah, I will also say, if anyone from Anthropic is listening by any chance, I will pay $1,000 a month for this tomorrow. So, Boris's response to this: again, he could just ignore this and let it go and not give it any fuel. He said, "We're always experimenting with new ideas. 90% don't ship because we don't think they're good enough experiences. Still on the fence about this one. Should we ship it?" So, he's just like, "Yeah, it's in there.
**** · You got it. We built it." And they're on the fence about that one because of the compute problem.
**** · Correct.
**** · Not the value of it. Sure.
**** · Not on the fence enough to have not put it into the code that's already out there, meaning they're probably already using this internally.
**** · yeah. So, just fascinating stuff.
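The heartbeat loop described in that post can be sketched in a few lines of Python. This is only an illustration of the pattern as it was read out on the show, not Anthropic's implementation: `HeartbeatAgent`, `tick`, and the toy `fix_failing_tests` policy are hypothetical names, and the `decide` callback stands in for whatever model call answers "Anything worth doing now?".

```python
from dataclasses import dataclass, field
from typing import Callable, Optional


@dataclass
class HeartbeatAgent:
    # decide() is a stand-in for the model call: it sees the current state
    # and returns an action description, or None to stay quiet.
    decide: Callable[[dict], Optional[str]]
    daily_log: list = field(default_factory=list)

    def tick(self, state: dict) -> Optional[str]:
        """Run one heartbeat: ask whether to act, then log the outcome."""
        action = self.decide(state)
        # Log both cases, so the record shows what was noticed even when
        # the agent chose to do nothing.
        self.daily_log.append({"noticed": dict(state), "did": action})
        return action


def fix_failing_tests(state: dict) -> Optional[str]:
    # Toy policy (an assumption): act only when tests are reported failing.
    return "fix failing tests" if state.get("tests_failing") else None


agent = HeartbeatAgent(decide=fix_failing_tests)
```

A scheduler would call `agent.tick(...)` every few seconds with a fresh snapshot of the workspace. Most ticks return `None`, which is also why the compute cost mentioned above adds up: every heartbeat is a model call whether or not anything happens.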
**** · And then the final note was just on the OpenClaw impact. And it goes back to what I was saying earlier: it's just a cautionary tale for companies that are out on the edges here, building on the frontiers of the technological capabilities and relying on an unstable and infant AI ecosystem. So, it's like you're building an AI-native company. With OpenClaw, it's, "Oh, this is amazing. We're all in." Thirty days later, you've automated all these things and it's costing you $2,000 a month. Yeah.
**** · And then Anthropic is like, "Yeah, no, that's misuse of the system." And you just shut down your company today, or to do what you were doing is now going to cost you $100,000 a month. So we just have to accept these challenges and unknowns of building agents into workflows and org charts. It's so early. So when you hear these stories of people doing it and you're so envious that they've figured something out that you haven't figured out, they could wake up tomorrow and the thing they figured out is shot. So that's my main thing there: it's just so early.
**** · Yeah, I'll be so curious to see how that plays out. I don't know how some of these people are affording to run these OpenClaw setups on their own, just as a hobby thing, because I even hit some random usage limits in Claude Code over the weekend and was just like, oh, I've got hundreds of dollars of credits they gave me for various things over the year. And I was like, great, well, we'll dip into the usage. And in 4 seconds, I evaporated $300 on a random research check. And I was like, how is anyone doing this dollar by dollar for every single thing you're doing?
**** · Which we'll talk a little bit about the alchemist stuff in a minute.
**** · Exactly. It's just wild.
Major OpenAI Updates
**** · Okay.
**** · Our third big topic this week: there is a ton that has been going on with OpenAI over the past couple weeks. So, we are just going to go through some of these huge updates. Some good, some very bad. But first up, OpenAI closed a $122 billion funding round, which is the largest in Silicon Valley history. At the same time, Bloomberg is reporting that demand for OpenAI shares is sinking on secondary markets. And The Information reports CEO Sam Altman and his CFO are diverging a bit on IPO timing. Sounds like Altman wants to try to go public faster, whereas CFO Sarah Friar wants to maybe push it out a little bit due to spending commitments and the necessary organizational prep. Second, OpenAI acquired TBPN, a daily tech news show hosted by John Coogan and Jordi Hays.
**** · This has become a hugely watched, popular program in tech media. The show has only about 58,000 YouTube subscribers, but generated $5 million in ad revenue in 2025 and is on track to exceed $30 million this year. It will be housed within OpenAI's strategy organization. OpenAI says the show will maintain editorial independence and continue choosing its own guests. Altman posted on X, "TBPN is my favorite tech show. We want them to keep that going and for them to do what they do so well."
**** · Third, at the same time, a major executive shakeup has hit the company. Fidji Simo, the CEO of applications, announced she is taking medical leave.
**** · She's had a relapse of postural orthostatic tachycardia syndrome, POTS, a chronic neuroimmune condition she has talked about in public quite a bit before. She told employees she's pushed a little too far and needs to try new interventions to stabilize her health. So there are some reshuffles related to this. President Greg Brockman will oversee product in her absence. COO Brad Lightcap is moving to a new role focused on quote unquote special projects, and marketing chief Kate Rouch announced she is stepping down to focus on her recovery from late-stage breast cancer, which she was diagnosed with a year and a half ago. Couple other things. Fourth, The New Yorker published a lengthy investigation by pretty famous journalists Ronan Farrow and Andrew Marantz titled "Moment of Truth: Sam Altman May Control Our Future. Can He Be Trusted?"
**** · This piece drew from over 100 interviews and internal documents, including Ilya Sutskever's Slack messages and Dario Amodei's personal notes, and it builds this case that OpenAI systematically abandoned its safety-first founding mission as it scaled up and that Altman repeatedly chose to deprioritize safety commitments. In fact, a former board member told the magazine he is unconstrained by truth. Now, finally, we alluded to this. Days after this profile published, someone threw a Molotov cocktail at Altman's San Francisco home. No one was hurt. An hour later, police were responding to a man threatening arson at OpenAI headquarters. A second attack on Altman's home followed a couple days later.
**** · Altman linked the attacks to the climate of AI anxiety and the negative media coverage he had gotten. He even wrote that someone had warned him the New Yorker piece came during heightened anxiety about AI, making his situation more dangerous. And he responded to these attacks and the profile in a personal blog post, sharing a rare family photo of himself, his husband, and their child. He said he was sharing this in the hope it might dissuade the next person from targeting his home. In the post, he acknowledged his mistakes and said he has this conflict aversion that has caused organizational pain. And concurrently, Altman and OpenAI went on a major policy offensive. They published "Industrial Policy for the Intelligence Age," a 13-page paper proposing a suite of people-first policy ideas, including giving every American citizen a direct stake in AI-driven economic growth through a nationally managed fund seeded in part by AI companies. Vanity Fair reported they're preparing a broader push to, quote, rethink the social contract. Axios framed this as Sam's superintelligence New Deal.
**** · So Paul, I don't know where to start. Lots going on here. Some of it really interesting, some of it very horrifying. Unfortunately, it's been a big couple weeks.
**** · Yeah.
**** · There's a lot of different directions to go. I'll focus on Sam's post. Yeah. And then the policy ideas. So, one quick note on TBPN: there's no confirmed number on what they paid for it, because obviously everybody always wants to know, but it does seem it was north of a hundred million, which isn't bad for something relatively new.
**** · The editorial independence thing, good luck. I don't know these guys. I've never watched or listened to the show. I've heard of it plenty, but it's not something that's intensely on our radar. But that idea of remaining independent as a media entity that's owned by an AI lab that has lots of pressures on it, that's going to be very hard to maintain, but it sounds like they're going to make their efforts to. So, we'll see. Okay.
**** · So, then on Sam's post, I thought there were a lot of interesting things in here. So, first, obviously, the very personal stuff. As I alluded to earlier, violence is just never going to be the answer here. And I do worry about these AI leaders. But it was only a matter of time before something like this started to happen.
**** · In his post, he said, "Words have power. There was an incendiary article about me a few days ago," which is referring back to the New Yorker article, Mike, that you just touched on.
**** · Yeah.
**** · He said, "Someone said to me yesterday they thought it was coming at a time of great anxiety about AI and that it made things more dangerous for me. I brushed it aside." Now, he did later tweet that he regretted the incendiary article reference and that, he wasn't trying to pass blame, but yeah, he did at least address that article. So, then I highlight a few excerpts here on what he believes and then he has some personal reflections then his thoughts on the industry because his thoughts on the industry lead into the industrial policy for the intelligence age document. So on what he believes he says working towards prosperity for everyone, empowering all people and advancing science and technology are moral obligations for me. AI will be the most powerful tool for expanding human capability and potential that anyone has ever seen. Demand for this tool will be essentially uncapped and people will do incredible things with it. The world deserves huge amounts of AI and we must figure out how to make it happen.
**** · It will not all go well. He said the fear and anxiety about AI is justified. We are in the process of witnessing the largest change to society in a long time, and perhaps ever. We have to get safety right, which is not just about aligning a model. We urgently need a society-wide response to be resilient to new threats.
**** · This includes things like new policy to help navigate through a difficult economic transition in order to get to a much better future. He also said AI has to be democratized. Power cannot be too concentrated. Control of the future belongs to all people and their institutions. AI needs to empower people individually, and we need to make decisions about our future and the new rules collectively. And he said adaptability is critical. We are learning about something new very quickly. Some of our beliefs will be right and some will be wrong. And sometimes we will need to change our mind quickly as the technology develops and society evolves. On the personal reflections, I thought this was interesting. And again, I think in some ways he's probably acknowledging some of the stuff from the New Yorker piece and other things that have been said about him. He said, "I'm not proud of handling myself badly in a conflict with our previous board that led to a huge mess for the company. I have made many other mistakes throughout the insane trajectory of OpenAI. I am a flawed person in the center of an exceptionally complex situation, trying to get a little better each year, always working for the mission. We knew going into this how huge the stakes of AI were and that personal disagreements between well-meaning people I cared about would be amplified greatly, but it's another thing to live through these bitter conflicts and often have to arbitrate them, and the costs have been serious.
**** · I'm sorry to people I've hurt and I wish I had learned faster." And then on the industry, which leads into the policy piece, he said my personal takeaway from the last several years, and take on why there has been so much Shakespearean drama between the companies in our field, comes down to this. Once you see AGI you can't unsee it.
**** · It has a real ring of power dynamic to it and makes people do crazy things. I don't mean that AGI is the ring itself, but instead the totalizing philosophy of being the one to control AGI. The only solution I can come up with is to orient towards sharing the technology with people broadly and for no one to have the ring. The two obvious ways to do this are individual empowerment and making sure democratic systems stay in control. Laws and norms are going to change, but we have to work within the democratic process, even though it will be messy and slower than we'd like. I empathize with anti-technology sentiments, and clearly technology isn't always good for everyone. But overall, I believe technological progress can make the future unbelievably good for your family and mine. While we have that debate, we should de-escalate the rhetoric and tactics and try to have fewer explosions in fewer homes, figuratively and literally. And then that leads to the policy piece, which I would really recommend people read. It's only 13 pages. It's a pretty quick read. I'll give you a high level of what's in there. So it starts off: within just a few years, AI has progressed from systems capable of fast, narrow tasks to models that can perform general tasks that people used to need hours to do. Now we're beginning to transition towards superintelligence, which they say is AI systems capable of outperforming the smartest humans even when they are assisted by AI. No one knows exactly how this transition will unfold. So then I'll just jump ahead to the two sections in the paper. They have building an open economy and building a resilient society. So in building the open economy, they have worker perspectives: giving workers a voice in the AI transition to make work better and safer. They have AI-first entrepreneurs: help workers turn domain expertise into new companies by using AI to handle overhead that usually blocks entrepreneurship.
**** · They have a right to AI: treat access to AI as foundational for participation in the modern economy, similar to mass efforts to increase global literacy. Modernize the tax base: as AI reshapes work and production, the composition of economic activity may shift, expanding corporate profits and capital gains while potentially reducing reliance on labor income and payroll taxes. Another is a public wealth fund: create a public wealth fund that provides every citizen, including those not invested in financial markets, with a stake in AI-driven economic growth. Accelerate grid expansion: establish new public-private partnership models to finance and accelerate the expansion of energy infrastructure required to power AI. Efficiency dividends is an interesting one: convert efficiency gains from AI into durable improvements in worker benefits when routine workload declines and operating costs fall, including incentivizing companies to increase retirement matches or contributions, cover a larger share of health care costs, and subsidize child and elder care. Adaptive safety nets that work for everyone: make sure the existing safety net works reliably, quickly, and at scale, because if the transition to superintelligence is going to benefit everyone, the systems designed to provide economic and health security need to deliver without delay or gaps. Another is portable benefits.
**** · Over time, build benefit systems that are not tied to a single employer by expanding access to healthcare, retirement savings, and skills training through portable accounts that follow individuals across jobs, industries, education programs, and entrepreneurial ventures. Two more in this section. Pathways into human-centered work: expand opportunities in the care and connection economy, which they define as child care, elder care, education, healthcare, and community services, as pathways for workers displaced by AI. And then finally in that section, accelerate scientific discovery and scale the benefits: build a distributed network of AI-enabled laboratories to dramatically expand the capacity to test and validate AI-generated hypotheses at scale. And then in building a resilient society, there's a few here. Safety systems for emerging risks. AI trust stack, which they say is research and develop systems that help people trust and verify AI systems. Auditing regimes: strengthen institutions such as the Center for AI Standards and Innovation to develop auditing standards for frontier AI risks. Model containment playbooks, which, as we talked about, will probably be pretty important given what we're seeing with Anthropic. Mission-aligned corporate governance, guardrails for government use, mechanisms for public input, and incident reporting and international information sharing around AI capabilities. So, the other thought I have, Mike, and I'll just see if you have any thoughts on all this, but maybe this is my former PR background, but I'm thinking that the AI industry needs a massive PR campaign now to highlight the potential for the positive changes in the world and this better future. Part of it is a PR campaign, but not in a way of misleading people about what's possible and trying to shift their focus from the negatives. The negatives are real, and they need to steer into those and not ignore them.
But what we need to do is accelerate some of the wins that have positive impacts in society. They're high-value, high-profile wins that could build excitement about a better future. Things like drug discovery and curing of diseases. And we know they're working on these things, but I feel like now the negative sentiment is just snowballing. You can feel it every week in the topics we're covering and the articles we're reading, and there's very few really positive things. And so all the labs need to figure out a way to do this where they acknowledge the negatives and do what they're doing, but they've got to start getting some big wins or else society's going to turn on this stuff fast. And I don't know how fast you can go on the scientific discovery, but I keep coming back to that as the thing that's going to change perceptions: if you can improve people's lives in very clear ways. You're going to need to win mind share, and right now they're losing it, is my current take on the industry.
**** · I could not agree more. I'd love for us to even talk more about initiatives like that on maybe future episodes and work on that, because I would also just encourage, and I'm by no means an expert on what you should be doing in terms of your messaging here, but it would also strike me as valuable for especially Silicon Valley-based AI labs to also focus on the individual. How do these things make your individual life better? The big picture stuff is super important and really valuable, but also think about all the things that people are going to be upset about when it comes to an AI lab. They do not want you telling them that you're going to save the day, that their life is going to be managed by your technology. Show them how it empowers them and how real people are using it for real wins, even basic ones in their life. I think that could also be interesting.
**** · That's a great point. Yeah. And I do think you and I see a lot of the similar stuff. Right now all you have is these individual stories on X that never break out of the X bubble, and it's these incredible stories of finding cures for things that their doctors missed for years and finding treatment paths. And I've certainly experienced that myself, things in your own personal life where you're just like, I don't know what to do, and you just have a conversation and it's like, wow, okay, that's the direction, I think I know what to do. I'm sure there are just all those incredible stories, but right now, yeah, I just feel like they're just missing it. Yeah.
**** · Yeah.
**** · I don't know. I think you're right, though. We should make a bigger effort on this show to highlight more of that stuff. I think there are so many amazing things that are happening, especially on the scientific discovery side and making an impact on people's health and wellness and things. Yeah, we should do more.
**** · Paul, before we jump into rapid fire, quick announcement. This episode is also brought to us by our AI for Writers Summit. So, the future of storytelling is being rewritten thanks to AI, and that's why we're very excited to be hosting our annual AI for Writers Summit on Thursday, May 7th. So, this is a half-day virtual event for writers, editors, content teams, anyone who does any type of writing or content creation as part of their work. You will get tons of awesome, actionable knowledge from the event, because during it we'll have some incredible speakers breaking down exactly how AI can help you create smarter and faster but also, importantly, without losing the heart and soul of your writing. This event has a free registration option. So go check that out today. You can go to aiwriterssummit.com or just go to marketingaiinstitute.com and click on events and you'll find the summit there. And by the time you go to the website, the agenda will be live so you can see the great lineup we've got going for you. Super excited for this one.
AI for Writers Summit
**** · Yeah.
**** · And real quick note on that. last year we had I think it was more than 4,200 people from 95 plus countries. So yeah, it's an amazing event. It's a great way to network with other people.
**** · And then the real key is we're trying to tell the human side of this. So this is not how do you automate the writing and get rid of people. We're trying to grapple with the hard questions: what is the future of journalism? What is the impact it has on people who write for a living, for fulfillment, things like that.
**** · So we very much focus on that. And then, if I'm not mistaken, I think my opening keynote from last year might be on YouTube. If not, we'll put it up on YouTube before tomorrow and put a link to it. So I did the state of AI for writers and creators, navigating the future of creativity. But what I focused on last year was the human side of it, and "when should we use AI to write?" was the question I posed, or challenged people with. And then I presented a framework to decide when should I use AI versus when should I not. And so I think it's a really important concept. So we'll put the keynote from last year up so people can go and watch. I think it's a 25-minute keynote, if I remember correctly.
**** · Yeah. Awesome. Let's dive into some rapid fire. In late March, AI recruiting startup Mercor was hit by a supply chain cyberattack through a tool called LiteLLM, which is a widely used open-source library that connects applications to AI services. A hacking group claimed credit and published samples of the stolen data. TechCrunch reported these included Slack messages, internal ticketing information, and videos of conversations between Mercor's AI systems and contractors. Now, the reason this matters, and why we're talking about it: we have talked about Mercor before. They are a $10 billion company that provides training data to the top AI labs. So what they do is they recruit expert contractors, so think people like engineers, lawyers, doctors, bankers, and they have them train AI models and chatbots. Some of their top customers include OpenAI, Anthropic, and Meta.
Mercor Breach
**** · They have more than 30,000 experts on their roster and say that they are paying $1.5 million per day to their contractors. So, there's a lot of data in this system, and the attackers claim to have obtained 4 terabytes of data in total, including source code and database records. Not only is this bad from a personal perspective, with at least 40,000 contractors having had personal data exposed, it sounds like they've also exposed proprietary source code and video interviews, and the most important part is that this could potentially include details of how frontier labs are training their models, what expert feedback they're collecting, and the methodologies behind their most advanced systems. So far, Wired has reported that Meta paused its work with Mercor and is investigating the incident. OpenAI confirmed it was investigating its exposure, but said it had not paused or ended its contracts at this time. So Paul, another security incident. We've covered Mercor in the past and how important it could be to the AI ecosystem, so this is a pretty damaging series of events. We also did talk about that LiteLLM breach a couple weeks ago. So two topics coming together in less than ideal ways.
**** · Yeah, like I said, I hate talking about this stuff. I really do. It is terrifying. And what we know is that when state actors want something, they're going to get it. Dario Amodei did this interview back in 2023 or '24 that just always haunted me, where he was talking about how the weights to these models are like the nuclear codes in terms of how they protect these things. There was an example recently where they were talking about OpenAI going in with the briefcase, the, what do they call it, the football, the nuclear football.
**** · Yeah.
**** · That's how they delivered the model, with the weights in this locked case, to the government when they were trying to build a custom version of something for the government.
**** · So the weights to these models are so tightly held. I think Dario said at the time there were two or three people within Anthropic that even had the ability to know the weights. And he said, listen, if a state actor wants to get them, it's just how much money are they willing to spend to go get them. They could hack into anything. And so when you think of all these areas of risk and all this data that's living in these companies, and this maybe partially goes to, use caution when you're working with just these random startups and giving them access to your APIs and all this [ __ ], the surface area of risk is so vast and misunderstood, or not understood, by people.
**** · It really is just terrifying. Cybersecurity to me is just, I hate it. But like I said, I think cybersecurity professionals, lawyers who deal with this stuff, man, talk about safety, if you don't know what jobs to go into. I guess that's a good silver lining here.
**** · We may make up for all the lost jobs. Everyone's just going into cybersecurity.
**** · Everyone fixing all the new nightmares AI is enabling.
**** · Yeah, but this is a bad one. And it's good to be aware too of these companies like Mercor that are definitely well known in Silicon Valley circles, but maybe not as well known to the average member of the public, not as much of a household name, but super important to the ecosystem.
**** · Yeah. And really similar to Scale AI.
**** · Scale AI for sure. Yeah. In fact, there was something in the reporting where even when Meta essentially acquired Scale AI, they didn't stop using Mercor either. They were just using both because it was so important.
**** · Yeah.
**** · Scale AI, if you don't catch the reference: Alexandr Wang, who's now in charge of superintelligence at Meta, was building a training company called Scale AI. He got acqui-hired for 15 billion dollars by Meta. So his company still exists. But yeah, that's the reference there.
Karpathy's LLM Knowledge Bases Go Viral
**** · Well, this next topic is a little more positive, or at least interesting and not negative. In early April, Andrej Karpathy posted on X about how he is now using LLMs not just to generate code (he's a programmer, so he's doing that a lot) but also to build and maintain personal knowledge wikis. So, this post as of today has nearly 20 million views.
**** · So, it's one of the more viral AI posts this year so far. And the core idea here is that instead of relying on all this technical stuff like vector databases and complex RAG pipelines, he's just dumping raw documents, articles, and research into a folder, then letting an LLM compile them into a structured, interlinked markdown wiki. And then he uses Obsidian, a free note-taking app, as the front end of this. So as he puts it, Obsidian is the IDE, the LLM is the programmer, the wiki is the codebase. So this LLM then handles curating sources and linking updates, and even runs periodically to check for inconsistencies. So the reason this is getting some popularity and some eyeballs is because every knowledge worker in some way is using information and knowledge bases that are often very hard to maintain. So instead of just thinking about LLMs as chat interfaces or code generators, Karpathy is really thinking about this in terms of LLMs becoming persistent knowledge infrastructure, and building that out in ways that compound over time. And people ran with this and started building their own versions. Obsidian's founder weighed in with best practices. And Paul, I just thought this quote from Karpathy was telling. He said, in the way I'm using this, a large fraction of my recent token throughput is going less into manipulating code and more into manipulating knowledge stored as markdown and images. Super interesting implications for maybe less technical people.
**** · Yeah, the term you hear thrown around a lot in the last 30 days was this idea of a second brain. Everybody's talking about this idea where all your information just lives in this thing.
**** · And so the major cloud companies and productivity companies are trying to solve for this. Microsoft and Google, obviously, you already have a lot of this information living in there, and they're trying to find ways to make it easier to build these second brains where all this information lives and the knowledge base is there.
**** · And then you're just constantly, almost like that idea we talked about with the Claude Code leak, where it's just proactively acting on all this knowledge and working with you on it. And the thing with Karpathy's posts is, 3 months from now somebody will productize what he's doing, or maybe 3 days from now.
**** · So he talks in these technical ways, and most people aren't able to do anything like what he's explaining. So the average business leader or practitioner who listens to our podcast says, I don't know what any of that means. I don't know what an IDE is and things like that. But for everybody else, just assume the outcome of the idea is a product waiting to be built. And that's the premise here: if he's talking about it being possible, it's only a matter of time until someone builds that capability and then you start finding it. All of a sudden you have access to that. You can hack it together with the things you've got internally.
**** · Yeah, it struck me too as related to another thing we had talked about that he was working on, that auto researcher concept. I was just making notes while reading through his post and saying, this feels like, in some fashion, whether it's doing it yourself or there's a product around it, every analyst and research firm needs to go this direction at some point, because you need this second brain of all this proprietary stuff. And I know people are doing it and layering chat over it, but this is dynamic. It is updating regularly. It is an LLM-maintained wiki or knowledge base or second brain, and I think that's probably where I'd imagine the research function should be going.
**** · Yep.
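For anyone who wants to try the idea by hand, here is a minimal sketch of just one piece of the workflow described above: the periodic check for inconsistencies. It is not Karpathy's actual setup, which the episode does not spell out. It assumes the wiki is a folder of markdown notes using Obsidian-style `[[Note Name]]` links, and the function name `find_broken_links` is made up for illustration. No LLM is called; the script only flags links that point at notes that do not exist, which is the kind of list you could hand back to the LLM to fix.

```python
import re
from pathlib import Path

# Matches Obsidian-style wikilinks: [[Note]], [[Note|alias]], [[Note#Heading]].
# Group 1 is the target note name without any heading or alias.
WIKILINK = re.compile(r"\[\[([^\]|#]+)(?:#[^\]|]*)?(?:\|[^\]]*)?\]\]")


def find_broken_links(wiki_dir):
    """Return {note name: sorted link targets that have no matching .md file}.

    Hypothetical helper for illustration. Assumes note names are unique
    across the folder and that a link target equals the file name minus ".md".
    """
    root = Path(wiki_dir)
    notes = {path.stem: path for path in root.rglob("*.md")}
    broken = {}
    for name, path in notes.items():
        text = path.read_text(encoding="utf-8")
        targets = {match.group(1).strip() for match in WIKILINK.finditer(text)}
        missing = sorted(target for target in targets if target not in notes)
        if missing:
            broken[name] = missing
    return broken


if __name__ == "__main__":
    # Point this at your own notes folder; "wiki" is a placeholder path.
    for note, missing in find_broken_links("wiki").items():
        print(f"{note}: missing {', '.join(missing)}")
```

Running something like this on a schedule is the non-AI half of the loop. The half Karpathy describes, where the LLM rewrites and relinks the notes, would sit on top of a report like this one.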
AI and Jobs Update
**** · So next up, there's a lot that's been happening on the AI and jobs front. No surprise in the past couple weeks. So we're going to run through a couple highlights here of some things that are notable. So, first the New York Times published a piece reporting that economists who had previously dismissed the AI job threat are now slowly but surely starting to change their minds.
**** · So, this is a pretty big shift in establishment economic thinking. They talked to a bunch of economists who are not doing a total 180, but they are starting to acknowledge that maybe this mainstream economist position, that AI will create more jobs than it destroys the way previous waves of technology have, is a little outdated, or there's more nuance to it than previously thought. Second, there may be data backing that up. The Challenger report for March 2026, which is a regular report we talk about, tracks job cuts. Challenger, Gray & Christmas is a recruiting firm. They show that US employers announced just over 60,000 job cuts in March 2026. That's up 25% from February. AI was cited as the leading reason for 15,000 of those cuts. So it's about 25% of the total. Year-to-date, AI ranks fifth among all the reasons for job cuts. And since Challenger began tracking AI as a layoff reason in 2023, the cumulative total, so of all time, has now crossed 99,000 AI-related job cut announcements across three years. Third, Jack Dorsey, who we talked about a couple weeks ago, is making the case for AI-driven restructuring much more explicitly perhaps than any other major CEO out there. So, after Block cut 4,000 of its more than 10,000 employees, Dorsey has now published a blog post co-written with a partner at Sequoia Capital arguing that AI should replace the entire traditional hierarchy of middle management. So Block, he says, is restructuring around three employee roles. One is individual contributors who build systems. Two is directly responsible individuals who own specific outcomes on 90-day cycles. And third is what they call player-coaches, who mentor while staying hands-on with technical work. He said this restructuring was triggered by a capability shift he observed in December with Anthropic's Opus 4.6 and OpenAI's Codex 5.3.
**** · Fourth, on the hiring side, Zapier released the second version of its AI fluency rubric, which now they apply to every new hire at their company. This requires candidates to demonstrate AI embedded into their core work, not just one-off usage. They want to show repeatable systems and measurable impact on quality, efficiency, or outcomes.
**** · They also have this new accountability dimension that they consider. They say with AI, you can delegate the work, but not the accountability. So, they're keeping that human in the loop top of mind here.
**** · Zapier's language is also pretty blunt about their AI expectations. They say if someone isn't meaningfully improving their work with AI support, they just don't meet the bar. And then last but not least, a new Gallup survey shows that AI is reshaping how college students think about their futures. 42% of bachelor's degree students surveyed said they have reconsidered their major because of AI. 16% said they've already changed their major over it. For people trying to get associate degrees, 56% are also reconsidering their field of study due to what AI enables. So Paul, what jumped out to you about these updates this week? I'm personally planning on diving in a lot deeper to Dorsey's thoughts. I thought those were interesting.
**** · I read a, that might have been the thing that triggered it. So I put a post on LinkedIn on, I don't know what day it was. It was one of the days we were in Scotland, and we were driving a long distance and everyone else was sleeping in the car and I was typing away. I was not typing while I was driving. We had a tour guide driving us.
**** · I don't remember which thing I read that I wrote the LinkedIn post about, and then I turned it into a newsletter post. It might have been the Dorsey one. I don't remember. But it was abstract. His, I read it because I'm very interested in this. My main keynote this year is going to be based on a vision for the AI-forward org chart, I think. I'm working through an idea. Mike, you've seen some early versions of this. So I'm very keen on this idea of organizational structure and what teams are going to look like and things like that. So it did definitely catch my attention what they were doing. I love Zapier's approach. I liked it when they came out with the V1. I really like the V2.
**** · I like the idea of this AI fluency rubric. So that was some really cool stuff. And then just the jobs overall, again, I'm glad to see people coming around and realizing this is a real thing and it's going to be a problem. The thing I alluded to that I wrote about on LinkedIn, though, was that I'm getting really annoyed by the tech leaders in particular who just keep pretending it's all going to be great with no acknowledgement of the possibility that it won't be. So I get optimism. I'm all for being optimistic about this stuff and believing in a future of abundance and that we're going to find our way through, which I do think we will. I think it's going to end up being great, but I also think, straight up, it's going to suck for a lot of people in the process. This isn't going to be an easy transition and a whole bunch of people are going to lose their jobs. And so I get really annoyed when people won't acknowledge both sides of the equation. So the example I put in the newsletter was, I said tech leaders, politicians, and economists who point to increasing demand for software developers and historical precedents as proof that AI won't displace millions of jobs are creating a false sense of hope.
**** · I highlighted four in particular, and some of these people are people I respect and follow. But Marc Andreessen, this is a quote, and we'll put the links in the show notes: the job loss narratives are all fake. AI equals massive ramp in productivity equals massive ramp in demand equals massive jobs, but watch. So that was a tweet from April 5th. Aaron Levie, CEO of Box, and I'm a big fan of Aaron, he's got some of the best takes on X about AI that I've seen. He does a lot of research on this topic. So April 5th, he wrote, "There are far more categories where AI agents making things more efficient will induce demand for that skill than spaces where agents eliminate the work. This is why the AI job predictions will not play out as advertised." Okay. Shyam Sankar, who's the CTO at Palantir, and we've talked a lot about Palantir, had an editorial February 2nd. He said, "AI is a tool for the American worker, not his replacement. The job loss narrative is a ploy to attract investors, drive media attention, and consolidate political power. The real promise of AI in the enterprise is to make the American worker 50x more productive, to unleash his taste and agency. This isn't speculation, it's reality." It's very confident.
**** · There's lots of confidence in these statements. And then David Sacks, this is no surprise. He can't acknowledge the impact on jobs due to his relationship with the administration. He is currently the chair of the President's Council of Advisors on Science and Technology. All caps: AI job loss hoax exposed. And then it goes on to say, according to a new study from Vanguard, the occupations most exposed to AI automation are outperforming the rest of the job market in terms of growth and real wage increases. Rather than causing job loss, AI is making workers more productive, driving gains in both jobs and wages. So what I said was, despite these optimistic outlooks from these leaders, the reality facing companies, especially those with limited growth and demand, which is a really important asterisk here, is that the pressure to reduce headcount across all areas of knowledge work is going to be immense in the coming months and years: marketing, sales, customer service, HR, finance, etc. And then I said, "Pretending there isn't at least a strong possibility of significant disruption is a disservice to business leaders who should be doing more to prepare their organizations and upskill their people." And then I said, "I talk to executives every week who are being told to stay flat on headcount and to have a contingency of cuts ready to go if the efficiency from AI happens." And so that's my continued frustration: all these people are hyping AI as this future of abundance, which, I'm with you, I hope so and I do think so eventually, but I don't know who you're talking to that is planning to hire.
**** · I'm not meeting those people. Unless they're Anthropic or one of these companies that's growing at 20, 50, 100% a year, I've yet to talk to an executive at a traditional enterprise that's really happy with 5 to 10% annual growth that's planning to hire. It's not happening.
**** · And that's in knowledge work. Now of course there are exceptions to that in energy, in the trades, in healthcare. Yeah, we can't hire enough people in those areas. I get that. I'm talking about the rest of us, all the other industries where the ultimate goal now is to just stay flat on headcount and get the revenue per employee number way up.
**** · So yeah, I don't know. I guess it's good that it's increasingly becoming a conversation, because it just really needs to be. We need to be thinking about what if these tech leaders who are so optimistic are just wrong, and what if it isn't as easy of a transition as they'd like to make you think.
AI and Politics Update
**** · So similarly, we've had a lot happening on the AI policy and politics front. So we're going to go through a few developments here that have happened over the last couple weeks. So, first up, California Governor Gavin Newsom signed a first-of-its-kind executive order requiring safety and privacy guardrails from AI companies that contract with the state. So, this establishes new certification requirements for AI vendors that want to do business with California. It requires them to attest to and explain their policies around preventing illegal content, harmful model bias, and violations of civil rights. It also directs state agencies to expand the use of vetted AI tools in government, develop an AI-powered pilot for accessing government services, and publish a data minimization toolkit.
**** · Second, at the federal level, the Wall Street Journal reports that the White House is racing to head off threats from powerful AI tools. There's renewed urgency here in the wake of all the Mythos stuff we discussed. This prominently included a group of White House officials working on the issue, including convening a call with the vice president, treasury secretary, and the heads of Anthropic, OpenAI, Microsoft, and Google, as well as the leaders of cyber security firms CrowdStrike and Palo Alto Networks. That's obviously in addition to the previously mentioned meeting we talked about, related to Mythos specifically, that the Treasury Secretary had with bank CEOs. Third, a major new survey from Fathom, a nonpartisan research organization, provides a clear picture of what Americans want from AI governance. They surveyed a bunch of people to ask about their feelings and priorities in a number of areas. So, the top priorities across party lines for people are child safety, corporate accountability, and verifiable standards. Another big issue is workforce protection. So according to Fathom, from retraining programs to sovereign wealth funds that share AI-generated wealth with the public, every workforce policy tested in this survey commanded majority support. Americans decisively reject leaving workforce transition to market forces. There is broad demand but no preferred solution. A policy window that is open now but won't be indefinitely. And then lastly, Politico reports that Senator Bernie Sanders may be building an unlikely alliance with Silicon Valley AI safety advocates. So Sanders recently met with quote unquote AI doomers in Berkeley, including Eliezer Yudkowsky from the Machine Intelligence Research Institute. And Sanders said, "I know there have been a lot of science fiction novels and movies about how the robots and the AI and the computers rebel against human control, but these guys no longer think this is science fiction." So, Politico suggests this might be the beginning of an alliance between anti-AI populists and the more tech-centric, perhaps effective-altruist-aligned, AI safety advocates.
**** · Paul, did anything jump out to you here?
**** · There's a wild quote from Yudkowsky in the Politico piece where he was just telling Bernie Sanders, hey, the point is, if AI gets much, much more powerful, it'll run everything. And Sanders said, what does that mean? Humans are discarded? And Yudkowsky replied, think everybody dead. So there's some strong language being used. Well, just like the optimist side we just talked about, there are extreme views of everything.
**** · Yeah, I'll probably just leave it at that at the moment. But this is my concern: you have a public that's largely uneducated about what these things really are, what the real risks are, what the real potential is.
**** · And so when the literacy isn't high and the comprehension isn't high, then you have the ability for extreme views to come in and influence people's perceptions and beliefs, and that's very dangerous in my opinion. So let's say this Yudkowsky, is that how you say his name?
**** · I believe so. Yeah.
**** · So, let's say that's the first thing you hear. It ends up on 60 Minutes or in that AI movie that just came out, the doomer AI movie. And you hear that and it's like, "Oh, I hate AI. I hate data centers. I hate Sam Altman." And people take that perspective, and then you say, "Yeah, but it found a cure for cancer for your family member that was suffering from cancer. AI is the thing that's going to find the cure, or did find a cure. So should we stop it? Should we not have AI now because you heard a bad quote?" And so there are all these nuances. When people take extreme views, they don't stop and then say, oh, okay, well, maybe that would be amazing. And again, the only parallel I could ever go back to is the internet, where you'd say, oh yeah, the internet's going to allow these scams and the dark web, and all these horrible things are going to happen. But it's also going to open up the economy and we're going to be able to build all these amazing things.
**** · You're going to connect with people you never could. You're going to be able to FaceTime with your family when you're a thousand miles away. And you want to not have any of that good stuff? Should we just shut it down because bad stuff might happen?
**** · You can't. And so that's my feeling on this: it's dialogue and reason and finding paths forward where we can do this responsibly. But this absurdity of shut it all down because it's going to eliminate everybody, okay, that's your belief, and then we don't get any of the good stuff either. So, how about we just be reasonable here and find the reality, and let's talk about the reality of the situation and not take extreme views that are unrealistic and mislead people.
HubSpot Shifts to Outcome-Based AI Pricing
**** · So, next up, HubSpot announced that starting April 14th, its customer agent and prospecting agent are moving to outcome-based pricing. So this means, according to HubSpot, quote, customers only pay when the agents complete the task it's been assigned.
**** · Practically, this means the customer agent used to charge you $1 per conversation no matter what. And that's moving to 50 cents per resolved conversation, meaning you only pay when the AI solves the customer's problem. The prospecting agent is shifting from a recurring monthly charge per enrolled contact to $1 per lead recommended for outreach. So, HubSpot says, quote, "This means you now pay when a prospect gets qualified and handed to your team."
**** · HubSpot says customer agent now resolves 65% of conversations and cuts resolution time by 39%, and that prospecting agent activations are up 57% quarter over quarter. Both agents include a free 28-day trial and are available to Pro and Enterprise customers. So, Paul, we've talked about this before, the need for SaaS companies like HubSpot to update their pricing models. What do you think of this approach? Are they headed in the right direction?
**** · I love HubSpot. I always have to preface my comments on HubSpot, but I love them. I love the people at HubSpot. I love the company. We use the technology. Our company's built on the technology. Both my companies for the last 20 years have been built on HubSpot technology. The concept of the outcome pricing, I understand from HubSpot's perspective why they would move to this.
**** · I understand the messaging of why this is a benefit to customers. So, I get all that. Mike, you and I have worked in HubSpot for a really long time.
**** · Yeah.
**** · The reliability of the data is a problem. Allowing them to determine what is a resolved conversation is a very gray area.
**** · Yeah. So we get a spam form fill or something like that, or a spam chat, and you close it, and now we just paid 50 cents to close a spam chat. There are all these questions. What does that mean? What is a resolved conversation? And now there's the work we have to do to go in and understand what is a resolved conversation. Oh crap, 80% of what would be considered resolved conversations to us are considered not of value at all. How do we turn that off? Or $1 per lead recommendation for outreach. I don't know. As the CEO of a company that pays lots of money to HubSpot every month for software, neither of those jump out to me right away as, hm, high-value things that I want to now have to talk to my COO today and be like, what does this mean? Because it goes into effect tomorrow, and have we budgeted for this? How do we budget for this? So no, this creates way more questions for me than answers. Maybe it ends up being a really good play. I don't know. But it continues down this path where the way these companies are trying to price AI, to me, they're not solving for what is the simplest answer to this.
**** · It just keeps getting more complicated, in my opinion. Even though it's now outcome-based, it's still uncertain to me.
**** · I always even just think back to the agency days where it's like, "Hey, we ran this really successful campaign for you. Look at all these great leads."
**** · And then they're like, "Yeah, that's awesome, but these leads aren't closing or these aren't qualified." And you're like, "Well, yeah, they are." And it's, "Well, no, they're not, because they're not closing." And then you find out there's a hundred other issues of why this wouldn't be the case. So, I wonder how you even get to that shared agreement on any of these outcomes.
**** · And again, this affects us personally, so we have to dig into this. So, we'll report back. Again, maybe it is a really elegant solution and maybe it is truly value-based for us as the customer.
**** · Unfortunately, my instinct is this is going to be a pain in the ass and I'm not going to agree with the value that they're assigning to these and we're going to have to now figure out ways to either not use Breeze for these things or to change the dynamics or certainly update our budgeting. It's just super annoying and I understand why they have to do it, but no, I'm not super excited about this. If they would have come back and said, "We're raising your monthly seat license $10 per month per user and you have unlimited," I'd have said "Great, raise it 40 a month if you want to. I don't give a [ __ ] as long as I don't have to think about this and there's no additional budgeting, then just raise the rate. If the software is creating that much value, then just charge me more." I don't know.
**** · Yeah.
**** · And again, we'll see. I don't want to harp on it, but I'm just thinking, my god, we sell 10 different things. I don't think all those leads are the same. There are multiple lines of business, all these crazy considerations that maybe are solved for, but I'm just like, oh, I have more questions than answers.
**** · Yeah.
**** · My brain explodes with the questions and the reasons this won't work versus the, "Oh, thank God you solved the pain point for me." It's like, no, you just gave me 10 more.
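To put rough numbers on the pricing change discussed in this segment, here is a small worked sketch. The $1 per conversation, 50 cents per resolved conversation, and 65% resolution rate come from the figures quoted above. The monthly volume of 1,000 conversations and the 20% "valuable" share (the flip side of Paul's hypothetical that 80% of resolved conversations might have no value) are assumptions for illustration, and the function names are made up. This is not HubSpot's billing logic.

```python
def old_cost(conversations, price_per_conversation=1.00):
    """Previous model as described: pay for every conversation, resolved or not."""
    return conversations * price_per_conversation


def new_cost(conversations, resolution_rate, price_per_resolved=0.50):
    """New model as described: pay only for conversations counted as resolved."""
    return conversations * resolution_rate * price_per_resolved


def effective_price_per_valuable_resolution(valuable_share, price_per_resolved=0.50):
    """What each resolution you actually care about costs if only a fraction
    of billed resolutions (valuable_share, between 0 and 1) are worth anything."""
    if not 0 < valuable_share <= 1:
        raise ValueError("valuable_share must be in (0, 1]")
    return price_per_resolved / valuable_share


if __name__ == "__main__":
    volume = 1000  # hypothetical monthly conversations
    print(f"Old model: ${old_cost(volume):,.2f}")
    print(f"New model at 65% resolved: ${new_cost(volume, 0.65):,.2f}")
    print(
        "Effective price if only 20% of resolutions matter: "
        f"${effective_price_per_valuable_resolution(0.20):,.2f}"
    )
```

On those assumptions the headline bill drops from $1,000 to $325, but if spam and low-value chats count as resolved, the price per resolution that matters climbs to $2.50. That gap is the budgeting question raised in the discussion: the answer depends entirely on who decides what counts as resolved.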
SmarterX AI Use Case Spotlight
**** · A couple more final segments here before we wrap up this week. So we've been doing more regularly what we call our AI use case spotlights here at Smarter X. We hear from listeners, Paul, all the time that one of their favorite things is when we talk about how we're using AI at Smarter X. So every week we're going to try to give you a quick look under the hood at whatever we're working on this week.
**** · Real use cases that we're either exploring, building, or deploying. So Paul, I know you had mentioned you might have had one. I've got a couple I can just really quickly touch on too.
**** · Yeah, the one I can give, again, I was on vacation for the last 10 days, but like I said, I was in the back of a tour van and had a lot of time while my family was sleeping. I was just thinking. Which again: one, go to Scotland, it's beautiful. Two, take trips with your family whenever you possibly can. Mike, I know you have a young child. If I have any parents with young kids listening, my kids are 13 and 14. There has never been a single trip I've regretted.
**** · Go places with your family. Create memories, create experiences. It's amazing. So, Scotland was incredible. Time with my family, amazing. My mind was freed. It was the first time I've stepped away in a little while and just didn't really work. But it also then gets that inspiration going. So, there was one extended trip on the tour where I had this assessment I wanted to build.
**** · I've talked a little bit about one of the assessments I was recently building. This is a complimentary assessment, which I'll share more detail about in the coming months, but I just pulled up Claude Code sitting in the van and I was like, "Let's work on the next one."
**** · And in 3 hours in the van, I built a working first draft of the thing on my phone in the app. So, it's nothing earth-shattering, but the fact is I could do this with my phone and an app and go through the entire thing, build a working model, and export it into a doc I could edit. It's easily 30-plus hours of work traditionally, done in 3 hours in the car. Got back to the hotel that night and I just sat down with my laptop and went to bed and just laid around. It was peaceful and inspiring and I didn't feel overwhelmed. It wasn't like AI psychosis, like I've got to build. It was just that my mind was finally free and I could think clearly, and the technology enabled me to do something amazing while I was doing it.
**** · That's so cool. I love that. Yeah. My big one this past week won't really be a surprise. It's a common one, but I'm definitely just continually in awe of AI as a deep strategic partner. So, I was working on a lot of heavy stuff this past week related to helping build out our research agenda at Smarter X with our director of research, Taylor. And there's a lot of really intense reasoning from first principles on these foundational strategy items that are deeply important to getting the direction right on something like this.
**** · So the ability to really, in a structured process and methodology, sit down and say, "Okay, I'm going to gather up all the context needed, feed this into something like Claude, and then work systematically back and forth with this tool to refine each individual piece of the strategy, but also just the logic behind it." It's a caliber of thinking I simply could not do on my own, I don't think. So I'd highly recommend it. Then I took that exact same methodology and said, "Hey, this worked really well for this research agenda.
**** · I've got three other projects like it. Let's apply the exact same methodology to completely different contexts and projects." And I also got those done in a fraction of the time it would have normally taken and, more importantly, with results I could not get alone.
**** · Wait, I'll share one other one real quick. This is a personal one.
**** · So a buddy of mine texted me. It's obviously tax time, and he was like, dude, I'm getting killed on taxes. Do you have any idea if your CPA's talked to you about any tax strategies as a business owner, things like that? And so I was like, yeah, I'm fresh off vacation, I haven't slept in two days, and I'm like, dude, I don't really have much here. Here's one thing we've tried. What do you think about this? And he goes, yeah, we tried that. And so then I was like, [ __ ]. So I go into ChatGPT and I give it this basic prompt: I've got a friend. He's a business owner.
**** · He's trying to figure out tax-advantaged things. Write me a prompt that he could use to do tax planning in his tax bracket as a business owner. Whatever.
**** · It comes back with this amazing prompt.
**** · This is 1,200 words, and it's like, act as a highly experienced US tax strategist, CPA, and business adviser who specializes in helping business owners legally reduce their tax burden through proactive planning. Your role is to educate, analyze options, and surface questions, strategies, and planning opportunities. Follow tax rules, and it goes on, and then it breaks it down, boom, boom, boom, go through all these steps, and then it ends with, give me a list of questions I can then send my actual CPA. So I sent this to my buddy and he uses it. He goes, "Yeah, unfortunately my CPA is really good and they've done all these things." But again, sometimes it's the personal use. And then, oh, one other one: I was designing a pavilion for our backyard and I used Google Gemini to do it, and it was amazing. It crushed it. So yeah, just some fun personal ones too.
**** · I love that.
AI Academy Spotlight
**** · So Paul, we're also doing a regular weekly segment spotlighting our courses in AI Academy. So, if you want, I can tee this up for what we're going to talk about this week that we've got available to AI Academy members, if that works for you.
**** · Yeah, go for it. We did the customer success one, I think, is working.
**** · We did. Yeah. So, we now have AI for Customer Success live in AI Academy. As a reminder, this is a four-course certificate series that's built specifically for customer success professionals. And the whole point of this segment is just spotlighting the course and giving you something valuable from it, whether or not you ever take it, as a way to share more of the love that we're putting into AI Academy. So when I was building this course, I thought about it in two ways for these segments: why AI matters for this function, industry, or segment specifically, and then how to start operationalizing that. So, first up, what really jumped out to me in customer success is this core systemic challenge of scale. If you are building a CS team, Paul, I know you're deep into this now.
**** · The only way you historically scale up a CS team is by either hiring more people or piling more accounts onto each customer success manager. So, those lead to some really thorny trade-offs. People can get burnt out. Your engagement quality from CSMs goes down.
**** · Top accounts might still get some white-glove treatment as you scale, but the rest often do not. Now, what's really cool, though, is that AI is starting to break this math in a good way. So, instead of scaling by having to raise headcount a ton or stretching people thinner, you can use AI in a number of different ways to scale up the effectiveness of CS professionals. Now again, this is not getting rid of people or automating them away, but just this ability to scale without a linear rise in cost. And it's helpful because CS professionals themselves not only do better work and have better lives and work-life balance as a result, but they can increasingly get out of this reactive work and start doing much more strategic, proactive stuff that moves the needle and also turns CS more into a revenue center versus a cost center, which has historically been a big problem. So there are lots of data points, information, and research in this course related to that, but that really is such a core challenge. And what's really cool is we also teach some steps for how to start operationalizing this insight. So we walk people through it, and I think it's useful even if you just do this on your own: you start with the really low-hanging fruit. Trust me, with the amount of use cases we've got in this course around customer success, there's so much low-hanging fruit where AI can, no joke, be saving you dozens of hours a month, or even maybe a week at some point. So, we really do start from the bottom up and say, "Look, let's look at where your CS team is spending time on reactive work."
**** · So these are things like check-ins, QBRs, manual scoring. And if you really get smart about making those your first AI targets, you are going to free up people to devote so much more time and energy to customers, which is amazing, but also to bigger-ticket AI projects and pilots. So the course goes into how to do all that. But even if you don't take the course, I would say pull out your calendar tomorrow, screenshot it, and drop it into something like ChatGPT or Claude and start talking through where you're trying to save time, and put a constraint on it. Say, "I want to save an hour next week, minimum. By next week, let's figure out how to do it." So that's just one little thing that I learned building this out, and that customer success professionals who've taken the course so far have also found valuable.
**** · Yeah. As someone who's trying to architect an AI-native customer success team, this is for me.
AI Product and Funding Updates
**** · Paul, as we wrap up here, we have our regular AI product and funding updates. I'm just going to run through a bunch. There's obviously a ton since we've been off for a couple of weeks. So I'll hustle through these.
**** · Yeah.
**** · Again, real quick: Mike's going to move fast through these, but honestly, five or six of these in a regular week would certainly have been rapid-fire topics, if not main topics. Just because it's a rapid fire at the end here doesn't mean that some of these aren't very important or that we don't understand the bigger significance, but we can only cover so much in a weekly show, I suppose.
**** · Yes indeed. So, first up, Anthropic's annualized revenue has crossed $30 billion. That's up from $9 billion at the end of 2025. The number of business customers spending a million dollars or more annually has doubled to over a thousand in under two months. So, we talked in this episode about the ways in which that growth is creating some issues for them. Axios is reporting Anthropic's usage limits are outpacing OpenAI's, and the Wall Street Journal reports they are in talks to invest $200 million in a new private equity venture.
**** · Next up, Sycamore is a startup building what it calls the trusted agent operating system for the enterprise, and they just raised a $65 million seed round. They are focused on providing infrastructure for deploying and managing AI agents in enterprise settings with built-in trust and compliance. Again, given the topics we just went through in this episode, it should be no surprise that there's a big need for that. Next up, Google released Gemma 4, which it calls its most capable open model to date. It's built from the same infrastructure research as Gemini. And this is really notable because you can run this model for free locally. It is extremely capable and super powerful. At the same time, Meta has introduced what they call Muse Spark. This is the first model from their rebuilt Meta Superintelligence Labs. Alexandr Wang, who runs that, noted that the team rebuilt their AI stack from scratch nine months ago and started work on this model. So far it's getting decent reviews, it sounds like, in at least certain areas, after some flops from the previous Llama model releases. Anthropic has launched Claude Managed Agents.
**** · These are frameworks for getting AI agent applications to production faster. So the engineering blog post about this details the architecture, which essentially works by decoupling the brain from the hands: separating the reasoning model from the tools it uses. So it can make an agent deployment much more scalable and reliable.
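A minimal sketch of what "decoupling the brain from the hands" could look like in general terms. This is not Anthropic's implementation or API; every name here (`ToolCall`, `TOOLS`, `scripted_brain`, `run_agent`) is hypothetical, and the "brain" is a scripted stub standing in for a reasoning model.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass
class ToolCall:
    """A request from the brain asking the hands to run one tool."""
    name: str
    argument: str


# The "hands": a registry of plain functions (hypothetical tools for illustration).
TOOLS: Dict[str, Callable[[str], str]] = {
    "upper": lambda text: text.upper(),
    "reverse": lambda text: text[::-1],
}


def scripted_brain(history: List[str]) -> Optional[ToolCall]:
    """Stand-in for a reasoning model: picks the next step from the history.

    A real system would call a model here; this stub follows a fixed plan
    and returns None when it has nothing left to do.
    """
    plan = [ToolCall("upper", "hello"), ToolCall("reverse", "agent")]
    step = len(history)
    return plan[step] if step < len(plan) else None


def run_agent(
    brain: Callable[[List[str]], Optional[ToolCall]],
    tools: Dict[str, Callable[[str], str]],
    max_steps: int = 10,
) -> List[str]:
    """Loop connecting the two sides; each can be swapped without touching the other."""
    history: List[str] = []
    for _ in range(max_steps):
        call = brain(history)
        if call is None:
            break
        tool = tools.get(call.name)
        result = tool(call.argument) if tool else f"unknown tool: {call.name}"
        history.append(result)
    return history
```

Because the loop only sees a function that proposes tool calls and a dictionary of tools, either side could be replaced or scaled separately, which is the property the episode describes.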
**** · Google has added notebooks to Gemini. So this is that second brain topic we were discussing. The feature lets users organize project sources and AI conversations into persistent workspaces. Microsoft has introduced multi-model intelligence in Copilot's Researcher feature, which allows it to pull from multiple AI models for deeper research tasks within Microsoft 365. So if you have access to this in your account, you may want to check that out.
**** · Microsoft and Publicis Groupe, one of the world's largest advertising holding companies, have expanded their strategic partnership to power the future of agentic marketing for businesses worldwide. Pika, the AI video generation startup, released the beta of its first product with face and voice capabilities, enabling AI-generated characters that can speak with realistic lip sync and expressions in conversations.
**** · We alluded to this next one in previous topics. Anthropic published new research on how emotion concepts function inside Claude. So it investigates whether LLMs that sometimes appear to express emotions have internal representations that correspond to those expressions. In other news, SpaceX has filed confidentially for an IPO targeting a valuation of more than $2 trillion.
**** · Google Research published a paper on how to responsibly disclose quantum computing vulnerabilities, which could very prominently affect cryptocurrency security. So, they are making much more of an effort here, and I'm sure we'll keep talking in the future about the eventual impact of quantum computing on the current standards of encryption.
**** · And finally, OpenAI introduced a child safety blueprint, a set of guidelines and tools designed to help developers building on OpenAI's APIs implement safeguards against child exploitation and harmful content involving minors. Okay, Paul, that was a very packed week. One final announcement here: go take our AI Pulse survey this week, which we had mentioned, at smarterx.ai/pulse.
**** · We're going to ask this week about some of the big prominent leaks we've had about Claude Mythos and how you're feeling about AI companies' operational security. We're also going to ask where you stand on job displacement, since some economists are changing their minds. So Paul, thanks again for breaking everything down for us. This was a packed couple of weeks.
**** · We tried to get under 90 minutes, but I think we slipped over a little bit. Hopefully you stuck with us and it's all interesting. There's just so much to do every week, truly, but it's good to be back. Next week we're thinking we're going to have a regular episode. I have to travel, but we think we found a way to make it work. So, unless something changes, we'll be back next week with a regular weekly, and then we'll probably have a second one next week for Intro to AI. So, yeah, back on schedule, hopefully. Thanks, Mike.
**** · Thanks, Paul.
**** · Thanks for listening to the artificial intelligence show. Visit smarterx.ai to continue on your AI learning journey. And join more than 100,000 professionals and business leaders who have subscribed to our weekly newsletters, downloaded AI blueprints, attended virtual and in-person events, taken online AI courses, earned professional certificates from our AI Academy, and engaged in the Smarter X Slack community. Until next time, stay curious and explore AI.