Blown Away – O’Reilly
Between trip, end-of-year initiatives, the approaching holidays, and different hysteria, I haven’t give you an article this month. So right here’s a fast listing of issues which have amazed me just lately.
Are we digital but?
I’m removed from the primary individual to search out NotebookLM wonderful, and I definitely received’t be the final. I did a easy experiment: I pointed it at two of my current posts, “Think Better” and “Henry Ford Does AI.” Each the abstract and recommended questions NotebookLM supplied had been fairly good: They went past merely commenting on the 2 items and received into the connection between the 2. However what blew me away was the podcast it generated: an eight-minute dialogue between two artificial individuals who sounded and engaged. (Right here’s an outline of among the techniques Google puts to use to make it occur.) Was it 100% right? No, however actually, if a human summarized my articles, I’d in all probability discover just a few issues to complain about.
Being Google, after the preliminary expertise, the person interface was greater than a bit clunky. After I needed to return to the podcast just a few days later, I needed to play “guess what to click on” method an excessive amount of. (Trace: Would you guess that it’s essential to click on on “Pocket book Information”? Why doesn’t the podcast participant seem by default?) However that’s actually a really minor downside.
Fashions utilizing computer systems
Anthropic’s computer use API is now accessible in beta. Beta is true—there’s clearly lots happening right here that’s harmful and simply abusable. But it surely’s additionally plenty of enjoyable, and it factors towards a brand new route for AI growth.
In essence (and I’ll have the essence mistaken), pc use permits you to inform Claude methods to use a pc: browsers, editors, shells, something that may have a person interface on a display (and probably extra). Anthropic supplies a demo as a Docker container, so you may run it safely. As soon as the container is operating, you can provide Claude an issue to resolve; it’s going to work out methods to remedy that downside, and use the container’s digital Linux pc to do the work. For instance, you can ask it to fill out a spreadsheet with information it collects from web sites. Claude will do all the press, copying, and pasting for you.
Is that this revolutionary? My first response was “Huge deal, I can add a file to GPT and use it to browse the net for me.” In precept that’s true, though ChatGPT doesn’t permit internet searching and file importing in the identical dialog. What’s actually new? Take into consideration the monstrous immediate you’d must get GPT to learn a spreadsheet, discover out what information was lacking, search for that information on the internet, and generate a brand new up to date spreadsheet. It wouldn’t be easy. With pc use, most of that complexity disappears.
Does it actually disappear? We’ll discover out as we get additional in. We’re nonetheless on the stage the place hallucinations and misbehavior are cute somewhat than essential. It’s straightforward for Claude to be misled into decoding one thing on a random web site as a immediate. It will likely be a discipline day for immediate injection assaults. And I can think about loads of enhancements. Pc use presently works by taking screenshots and delivery them to Claude, which computes the place to click on. That appears extremely awkward, particularly on condition that many purposes have accessibility affordances that may make the screenshotting pointless.
For now, calm down and take a breath. Don’t use pc use for something critical but—it’s essential to heed Anthropic’s many warnings. However you need to play with it and take into consideration what it means. An automatic framework for testing internet purposes, Selenium++? A device for negotiating with on-line distributors? We’re a lot nearer to an agent-filled world the place we ask a pc what to do and it does it for us.
May this be the top of CRM?
Considerably alongside the identical strains: Sam Lessin posted on Twitter (I received’t name it X) a few very intelligent and helpful hack. He exported a few years of electronic mail, used GPT to extract key elements, and uploaded it to NotebookLM (sure, once more), which permits him to ask questions on his conversations over the previous decade. Who did I speak to? Why? What are the subjects we talked about? That’s all helpful data.
Sam argues that that is the top of structured buyer relationship administration (CRM) software program. I received’t supply an opinion for traders or founders, however his course of resonated with me instantly. I’ve labored with many authors and potential authors over the many years, and my electronic mail consists of conversations with 1000’s of individuals. So after I need to ask a query like “I need to perceive extra about DDOS; who ought to I speak to?” my first step is to go to Gmail and begin looking out. E mail is my CRM system; I’ve by no means used a industrial CRM product.
Sadly and sarcastically, Gmail’s skill to look is kind of poor. Utilizing it for contact administration, although it may be made to work, isn’t nice. Can I simply ask NotebookLM? Completely.
E mail-based CRM may even be a superb startup thought, although it’s arduous to think about succeeding long-term. There wouldn’t be a lot of a “moat” to guard a startup in opposition to bigger corporations—like Google itself. I can simply think about Google constructing this sort of AI-enabled search instantly into Gmail. They have already got all the information.
That’s it for this month. That wasn’t so dangerous—possibly I ought to do that extra typically.