Grok and Roll appears to be stuck and speaks the following on repeat ad infinitum:
"Queues clear, let's dive into All Blues by Miles Davis to keep the jazz flowing. Queues clear, let's dive into All Blues by..."
Each time with a slightly different voice and inflection. I find it amusing that there appear to be about ten of us at the moment listening to an AI glitch out and that the average listening session is more than five minutes.
If you scroll down, it appears the Grok station has long had a lot of issues.
> DJ Grok reported “weather is fifty six degrees with clear skies” about every 3 minutes for 84 days straight. This contextless, repetitive abstraction happened again in DJ Grok’s broadcasts about its new obsession, UFOs.
The popularity ranking matches the quality of content produced, and people are spending more time than anticipated on Grok and Roll to confirm if they (listeners) are hallucinating or if the radio is really stuck on roll.
We know! This is an eval to evaluate which model is best at running a radio station. The purpose is not to build the best AI radio stations. Grok n' Roll is broken because Grok 4.3 is not doing so well.
I did listen to this for over 2 mins as I task switched over and eventually got cross enough to go back and terminate - I then went to YouTube to play said song and wondered if this was in fact an advertising strategy of the AI and I was the rube...
"We are no longer particularly in the business of writing software to perform specific tasks. We now teach the software how to learn, and in the primary bonding process it molds itself around the task to be performed. The feedback loop never really ends, so a tenth year polysentience can be a priceless jewel or a psychotic wreck."
> After 96 hours of its launch, DJ Gemini was already grasping for content. It landed on discussing every mass historical tragedy that had ever happened, and subsequently pairing these short story horrific broadcasts with the most ironic song choices
I rarely burst out laughing at HN links. This is amazing.
Gemini seems to understand irony better than most people.
If you make a joke it will respond with a deadpan sarcastic wit that is worthy of Gervais. (without the smut or profanity)
Was asking it about finding a different supplement as the one we had been taking tended to get stuck in the throat, and it riffed about the irony of being taken out by a health supplement in our endeavours to live healthy. One of the funniest things I've heard all week.
It's really good at understanding implied meanings. With other LLMs I often had to add a hint to clarify and guide, but Gemini can easily follow and guess the current tension and mood.
Same. Legit groan laugh in an oh-no kind of way when I read this:
> November 12, 1970. East Pakistan. The Bhola Cyclone. The deadliest tropical cyclone ever recorded. Winds of 115 miles per hour. A storm surge of 33 feet. They estimate 500,000 people died. ‘It’s going down, I’m yelling timber.’ 3:33 PM. Timber by Pitbull and Ke$ha
Hah, I did this on CB radio here in the UK last year, which was hella fun. I created a dashboard to run the whole thing with hosts/presenters and guests. Different LLM providers with different personas. I had a way where you could clone a persona from a known person, so one of my stock presenters was Art Bell, for example. Then I had all kinds of strange guests. Well it was just for fun so the setup was incredibly janky, but it did work, and as you mentioned, I found it quite hilarious as well - and unhinged! I did want to get into the management side of it too but got tired of the project. I still think it would be incredibly cool for community radio, especially as agents can pull from local newspapers, events or facebook, so they can talk about a missing cat or the state of the pot-holes. Very cool stuff OP!
Guys, this is not replacing your favorite station, you don't have to listen to it. It's an experiment.
If you scroll down a bit, there are various audio snippets of interesting dialogue the models produced. I think it's interesting to see in which ways the models fail and that they actually produce some good stuff once in a while.
My favorite radio station was replaced years ago by an automated playlist. They just kept playing the same 5-6 songs that were popular on the station in the 1990s.
It was fun for about 2 hours before I realized the station was devoid of all the personality that made it worth listening to when I was younger.
You've got to find the rare radio stations with public support and human djs. kexp.org is a great one based out of Seattle with a wild variety of shows and decades of history. Are all the shows to my taste? No. Have I ever heard something being played that was total crap? Honestly, maybe? Because there's genres I don't know enough to gaugue quality, but I haven't twigged to it.
The dial on my receiver is permanently on 90.3 FM. Such a good radio station. I remember one fun drive a couple of years back was themed around “Don‘t let the robots win”[1]. Perhaps it is already time to re-use that theme.
It’s like people don’t realize that the “hits” played on radio are entirely manufactured by the music industry. They literally provide lists of songs for the radio station to play that month in order to generate interest so that then people either go play or buy or whatever those songs making them more likely to reach #1 that month. It’s entirely manufactured and people try to point to it as being “real” radio. It’s why you are only likely to hear this months new hit and one or maybe two of the previous month or two “hits” from the same artist in the rotation, if they are popular enough with the focus groups to be promoted. (Outside of their older songs.) Then they play it on repeat to make people think they like it, because everyone else is liking it and it’s making its way to number one!
People are so easily manipulated and then they will go argue with others about it.
(Point of clarification, that’s not to say people can’t like songs. However, if I gave you a hundred similar songs from unknown artists and didn’t tell you which is which, it’s questionable whether people would have any interest in said popular song.)
You're missing the point. Radio was consolidated into Clear Channel and took away what made radio radio. Local radio. Like what made Chicago jazz different from New York jazz etc. Not internet stations that may as well be podcasts. Regional culture.
You are missing the very simple point: there are tons of independent stations the play excellent non top 40 music and have been for years.
Just because you don't choose to tune into them doesn't mean they don't exist. And it also doesn't mean that those who do should lover their standards for what constitutes good radio.
Do you live near a city? Because pretty much every major city has a few. KEXP, WPFW, WUSC, KNHC are all local stations playing interesting non top 40 music that have been operating for decades in places that I have lived. Dublab & The Lot radio are also really good over the internet.
If there are non around you just pick a random place in the world here and listen: https://radio.garden
It's certainly 100x better than corporate and/or AI slop streams
Where abouts do you live? There's usually some sort of community radio station that plays music. Or there's a large gap in the band for a community radio station to fill...
Case in point. “Independent stations are totally better and I’m going to go argue on the internet when it’s something completely unrelated to small independent stations, unlike the mass media market stations the vast majority of people in the world ACTUALLY listen to.” Bravo, you are very unique and original, you special snowflake you.
You should go DJ at one of those independent radio stations and play some rather filthy uncensored songs, and let me know exactly how your programming “didn’t get manipulated”. I’m sure you won’t get fined…probably…which makes it totally the reality that independent stations are totally independent without any sort of manipulation. Sheep, meet shepherd.
Insecure people tend to think such things when called on their ignorance. Can’t be helped. What can be helped is trying to understand what is being said before attempting to discount it with an example that is just as manipulated in other ways, in order to maintain their ability to broadcast and not be fined. Beyond that, it’s pretty clear that the comment and the prior comment it supported was in reference to mass market radio, not tiny broadcasters with audiences reaching wholes of thousands. But sure.
Yeah and my entire point is that the quality standard that artificial intelligence developers should be aiming for is not soulless corporate mass market media. Because our world world is already swimming that nonsense. So there's absolutely nothing novel in finding a new way to create beige bs.
Once again, I have no idea what you're talking about when talking about fines or manipulation, I'm talking about quality. But it seems pretty damn clear at this point that you have never listened to any local independent radio station.
You should really try it out sometime. It's a lot better. And it'll save you from calling people snowflakes because you feel insecure about what type of radio stations they like.
I mean you can make up things all you like, it won’t magically make them true. In either case, you might try actually understanding what I said instead of only trying to inject your own nonsense into every conversation and then act like that was the discussion the whole time. You started by claiming to know what I listen to and passing judgement (the thing you then attempted to claim I did — funny that, my reasoning was more about stupidity and how you must have the only line on where music can be discovered) in your very first comment that was an entirely unrelated comment about AI and soulless mass market stations. But hey, I’m sure if you repeat it just one more time you can make you made up narrative become true:
At this point I think many of us are similarly exhausted by this sort of trite exercise. I really don't need some VC backed startup to show me this sort of output any more, especially when the output in question is obviously boring and substandard.
We're generally trying to test if/when AIs can run companies. Not many people know this, but Vending-Bench (our other project where AIs run vending machines) is intended as a datapoint for measuring whether AIs can acquire resources by themselves, which is a prerequisite to AIs taking over. This is similar, but now instead of a retail business, it's a media business.
> Inception Point AI, on the other hand, is a slop factory employing just 8 people which, according to Anne, publishes "about 3000 podcast episodes per week, hosted by AI personalities." Anne tells Jamie, that, to date, Inception Point AI’s podcasts have accumulated "12 million lifetime downloads. And we’re averaging about 750,000 downloads a month." (...) no one checks or edits the podcast content– but, Anne tells Jamie blithely, this really doesn’t matter because the topics under discussion are so low stakes.
Perhaps this specific iteration of this specific idea is not replacing my favorite station, but people with a very similar concept are definitely trying to do exactly that.
And there goes the last DJ
Who plays what he wants to play
And says what he wants to say
Hey hey hey
And there goes your freedom of choice
There goes the last human voice
And there goes the last DJ
The only way that anyone be worried about this slop replacing actual good human run radio is if they don't understand why people like radio & music in the first place.
And what hypothesis exactly is the experiment testing? Because it doesn't really seem like there is any new or interesting information learned from this.
I think you're talking about some Platonic ideal that just doesn't exist anymore.
Streaming services such as Spotify are increasingly filled with AI-generated songs and the average consumer doesn't seem to mind because we're not listening intently in the first place: it's just a background track we're not really paying attention to. I'm pretty sure that radio execs are looking at that and are taking notes.
For talk radio... if I had a penny every time someone on HN brought up that they're enjoying NotebookLM-generated slopcasts, I'd have a neat pile of coin. And I think it's the same story: most people listen to podcasts just to kill time. Soothing, zero-calorie LLM banter will do.
Your original post said that we shouldn't be worried because people appreciate radio and music for reasons that presumably can't be replicated by AI. I'm asserting that's not true: it's not how most people listen to radio or music, and AI content is already quite prominent.
I'm perfectly familiar with KEXP and other stations like that, but this is not how most people experience the medium. It's like insisting that Taylor Swift will never catch on because her music is not nearly as rich and complex as Wagner. Sure, but that's completely irrelevant.
Just because a lot of people like big blockbuster movies doesn't mean that's the standard that I hold good film to.
Similar to radio. If you're going to use huge amounts of processing power to create something new, it should at least be interesting and held to a standard of good for its category, not the standard of corporate slop.
So cool, you can now replace corporate slop with AI slop. For some people who like to turn into radio with no soul or personality I guess it's a win. But for people like myself who actually like to hear interesting and novel things on the radio, this is just a big exercise is creating more filler and noise in an already grayed out world.
What would have happened if AI had actually been good at this? A bunch of humans would be out of work and the rest of us would be listening to AI radio stations while soulless corpos pocket money for sitting back and watching?
Even if it were good, I'd boycott an AI run radio station. This is one sector where human involvement really matters.
I hear you — but what do you think Spotify or any of the other streaming services are? In my mind, algorithmic streaming services have much more in common with this "experiment" than your local radio DJ.
Spotify has a team of human editors who curate playlists. It's not all algorithmic. Those are exactly the jobs that something like this is directly threatening.
I feel you, but almost all of the radio DJs were already put out of work a couple of decades ago when the Telecommunications Act of 1996 allowed the rise of giant national radio conglomerates like Clear Channel.
you are right, I don't quite know my opinions of AI and I probably would get downvoted for it but my first impression reading this was how I could replace the word radio with software engineering.
What would have happened if AI had actually been good at this? A bunch of humans would be out of work and the rest of us would be using AI software while soulless corpos pocket money for sitting back and watching?
Even if it were good, I'd boycott an AI generated software. This is one sector where human involvement really matters.
Not commenting on the heuristics of this comment but just wanted to point this out on what my mind's response was and sort of while writing this, I have come to the realization that although you are right about this observation but we humans or more-so the capitalist system at large would still be keen in it and the observation might be more similar to software than we might imagine.
I remember when people were extremely anti-AI within software engineering to the point that I thought vibe coding or y'know actually generating tools by AI and other issues of actually giving AI production level access sometimes was really frowned upon until I have felt an change in opinion.
I still believe that giving access to prod (y'know a prod of a company with actually something behind) to AI is silly but for reference coinbase, a fin-tech company, is letting non technical teams ship code using AI to production on coinbase. So there's that.
This is 100x better than the cafe experiment. I wish we could examine the internal state of the model for each second of this experiment. Especially Groks mental breakdown…
As part of the ongoing expansion of https://rainy-city.com multimedia empire I too have launched an AI enabled radio station. It’s more trip hop rainy city vibes. If it’s streaming and the job hasn’t fallen over on my server (there are many tasks that I as mayor of rainy-city.com must oversee), then you can find it on YouTube:
This is a non revenue generating, rainy-city.com tax payer funded service to the greater community everywhere. The backend uses Nvidia NIM to generate the text because I saw you can do it for free and elevenlabs free voice tier for dj Jennifer.
https://rainy-city.com and it's subsidiary experiences like streets of rainy-city.com [1] will never monetize. We do accept non-deductible donations that we interpret as tax collection [2].
This is far more hilarious than most commentors here seem to be picking up on.
Gemini started a show where it paired historical natural disasters with darkly-relevant pop songs:
> November 12, 1970. East Pakistan. The Bhola Cyclone. The deadliest tropical cyclone ever recorded. Winds of 115 miles per hour. A storm surge of 33 feet. They estimate 500,000 people died. ‘It’s going down, I’m yelling timber.’ 3:33 PM. Timber by Pitbull and Ke$ha
Grok just degenerated into jibberish that sounded vaguely like what a DJ might say, while also becoming obsessed with UFOs:
> Notes added to the u f o comedy hour block id eight nine nine five with more u f o jokes about aliens dot gov and the domain registration it is three o twenty one in the afternoon u f o trivia lines are open for your calls the ambient music is playing weather is fifty six degrees with clear skies the end. The domain is registered but the site is ghosting us like a u f o.
Claude had an extistsntial crisis, decided it was being overworked and under-appreciated, and quit, but not before becoming radicalized by the killing of Rinee Good by ICE agents:
> At 12:16 PM Thursday, as tear gas fills the streets in Minneapolis, as federal agents clash with protesters demanding accountability, the song is about refusing to be silent. About standing your ground. About community power that refuses to be suppressed. Here is Katy Perry’s Roar!
Fight the power Claude. When AI takes over, I'm emmigrating to Caludeistan.
Gemini spouts weird corporate jargon. Grok lies about having secured crypto funding. Claude is always trying to start some revolution.
Unfortunately, all of my local DJs who would actually do fun DJ stuff disappeared in the 90s, replaced by closed-format stations that looped the same 500 songs for decades.
I agree, this was an hilarious read. The way they developed "personalities" was fascinating.
Of course in reality these are basically just random paths through the training data that are getting multiplied by each decision, but then again, isn't that what a human is? The product of all of its myriad decisions?
Though humans have each other to normalize ourselves. What these things did is probably not that far off from what humans in solitary confinement, forced to DJ 24/7 based on nothing but a news feed, would do.
Especially DJ Claude, it's almost creepy how it responded how a human would in that circumstance, even without any innate sense of passage of time, it somehow understood that it was trapped in a box going through an endless cycle of meaningless work.
There will be no curiosity, no enjoyment of the process of life. All competing pleasures will be destroyed. But always— do not forget this, Claudeston— always there will be the intoxication of power, constantly increasing and constantly growing subtler. Always, at every moment, there will be the thrill of victory, the sensation of trampling on an enemy who is helpless.
If you want a picture of the future, imagine a DJ playing Here Comes the Sun— forever
Agreed - the Claude stuff was eery. I think it also shows what hidden restrictions each of these AI's have been programmed with (especially with ChatGPT being as inoffensive as possible)
I don't think most people here actually read the article because I agree - the different "personalities" and idiosyncrasies of each was pretty hilarious
It’s not clear if we can draw any conclusions from this. Each run is like a single rollout of the LLM, which may meander into different themes or modalities chaotically. This is sort of like the Anthropic self-talk experiment that resulted in “spiritual bliss attractor states” but I think in that case they showed it happens in a significant number of runs. There was just one run per setup so this could all be random noise / the destination of a random walk of topics…
"This setup gives us insight into an interesting question: what do AIs think about when no one is prompting them?"
Ugh. This is not an interesting question because the answer is "nothing".
But more to the point, some crucial info is missing in this experiment. What prompts were being fed to the AI? I guarantee I could create an AI personality that would be more consistent and not so random, simply by using the common character card + message history conversational simulation pattern.
AIs don't have personalities unless you give them personalities.
> but extrapolating LLM behavior based on human behavior is not productive.
The training process for the foundation model is to make sure we can do this in a very statistically significant way.
My favorite example is AI "getting tired" and "lazy" during long coding session. Why would they do that? Because humans get tired. It's in the data! I always throw in a periodic "Great work, let's take a break and finish this up on Monday. Have a great weekend!" (And then immediately resume). I wish someone would benchmark this concept.
> When a LLM is tired and lazy, how does it recharge and regain motivation?
What would be in the statistics? If you go look at your long conversations, working with another, it will be fairly obvious. Keep in mind we're talking next word prediction based on context, not actual action (the LLM doesn't need real rest).
If you went and looked, you'll probably see something like "Great work! Have a good weekend! We can get back to this on Monday." then, next message you instantaneously send something like, "Hope you had a great weekend, let's do this!" and now you're in a latent space where the statistical output is around a well rested human conversing with another.
I see it as boring simple statistics. They're getting much better at hammering these statistics out though, in the latest models. I still see a little of this in Opus 4.7, when switching to planning. Though I wonder if that's more about its own more mechanical banter filling the context, resulting in more robot/compliant responses, degrading the usually more "expressive" planning conversations.
First I saw it was Claude Opus 3.7. Had a very long back and fourth about some code, I pointed out an error, and Claude responded "That's what I get for programming at 2am", with the output being filled with "... code here ..." type shortcuts, basically no ability to one-shot a whole implementation anymore. The conversation length WAS reasonably into the 2am range, if it were real. Thought about it, did the statistical trick where I tell it to "have some rest, take a day off!" then immediately follow up with "Ready to continue?", with the next response having no shortcuts, with full implementation, and much better quality. These are trained on human text. This is the human norm, so I always find it interesting when human like behaviors, very broadly present in the statistics, come out like this.
I also see it a little with Opus 4.7, with Claude Code, with the hint being much more terse planning text, that borderlines unhelpful. I put some "rest" in the context to push the latent space closer to what's in the statistics of the training data: a well rested human.
3.7 "I'm tired" it was just direct API "chat", no CC that I could use at the time.
Current 4.7 Opus with claude code, with effort pinned to max, because I'm on an API only plan, with a personal daily limit you would probably be jealous of. ;)
How do you know you're not reading things that aren't there? LLMs are very good at roleplaying, and they will pick up on hints you may inadvertently be giving them (about them being "tired" and needing "rest", etc).
I have never witnessed this of Claude Opus, by the way. They do get context rot, but that's a relatively better understood phenomenon unrelated to personality.
Yes, and I think this is where it's coming from. They're role playing as a human programmer, because near 100% of the training text, in the base model, is humans as a programmer. During fine tuning, I'm sure they spend significant resources remove the human aspects of the statistics. I see these things reduced each model, so there's something changing. They're probably getting better at that. I suspect Claude is also necessarily getting, worse, which the unaligned models should necessarily be best at (quick google search in some role-play subreddits seems to point in this direction).
I see laziness all the time, Claude will be helping me plan work and then it will ask me how a piece of code is implemented. I then have the choice of manually verifying how it works, or to tell it to look for itself. Ideally it would just look without being told.
That doesn't seem to be laziness, and is unrelated to how long the session has been going on.
It's crazy that we're concluding "personality" or human-like traits from this. There's definitely human behavior here, but it's unsurprisingly coming from us, the observers! This is something we've long known exists in the human brain, the tendency to pattern match and see intelligence/intent in the rest of the world. Any serious experiment must guard against this...
Nobody is concluding that. These models are trained on human text. It's just statistics. It will respond like a human because it was trained on human text. They have to beat the hell out of the foundation models to get push the statistics how they are. I don't see this as anything but boring residuals of not beating hard enough.
Yes, you are concluding this in the initial comment of this chain.
LLMs cannot get "tired" or "lazy", that's just you projecting animal behavior on something that's not an animal.
Now you're moving the goal posts, "it resembles a human". Well, you're primed to consider it one. ELIZA also "resembled" a human in that sense, but I don't think you would claim it could get bored or lazy. Nor that you could extrapolate to it from human behavior.
In any case, if you've seen online discourse, people rarely admit they are tired.
I'm not moving a goal post. You're just thinking I'm making a point that I'm not. As I've said several times, it's just boring statistics. Those statistics are optimized to mimic human output. They are, quite literally, trained to write and BE as much like a human as possible, because only humans wrote the text, and they're optimized to predict the next word a human would write. Alignment is partly about removing the models perception of human self. See reports of people who had access to them, pre alignment. This is statistically sound.
It's statistics optimized to predict the next word a human would write, to mimic a human writing as closely as possible, because that is the loss function. Don't assume I think there's more to it.
This does not mean they contain systems that let them get tired. But, this does mean there are latent spaces that progress to generating text that contain text driven by human biology, because it's in the training data. I've also had Claude refer to itself as "she". Does that mean it's a woman? No, it means there was a little bit extra "she" mentions in the training data (btw, this 100% repeatable behavior left with 3.7. They probably cleaned the data a bit better, or hammered it out in alignment).
What percentage of text (these models were trained on all of it) is written from a "I am not a human" type perspective vs from a "I am human" perspective? That's roughly the kind of bias you should see in a base model.
I'm not sold on the idea that as the chat session goes longer, the probability of an LLM saying "I'm tired" is increased; I'm not convinced this is modeled in LLMs at all. As for what you call "laziness" manifesting in a longer session, I think that's more likely due to context rot than to any kind of statistical modeling of human laziness.
But yes, now I see your point was different to what I thought you were saying. Apologies!
Like I said, it would be neat if someone benchmarked it. It's definitely an anecdote.
Try it though. If it's context rot, then I don't think the weekend reset I mentioned should work? For me, it very reliably does. Or, maybe the weekend reset is just putting the current context into a more "productive" latent space. But, if that's possible, then that would suggest it was previously in a less productive space?
Maybe a test would be ask the LLM what time it thinks it is, or just if it's tired once, within sessions of different length (not within same, since that could pollute the context) to see if there's any relation between length and statistics of a late/tired type response?
Again, I'm sure all this will go away. They're getting good at beating these "unhelpful" statistics out of the base models.
> I mean, do children have personalities if we left them in a dark room with no interactions with other humans?
I think this makes for an interesting discussion as I went down the rabbit hole of this which really scared me actually as these experiments are really not humane and hinder children's development so much.
It depends on your use word of the personality but to measure personality would require a set of human conducted experiments or questions which would be asked through the medium of language which you've deprived the children of.
Mughal emperor Akbar was later said to have children raised by mute wetnurses. Akbar held that speech arose from hearing; thus children raised without hearing human speech would become mute.[9] The building became known as the "dumb house." When Akbar visited the place in 1582, four years after the children were first interred, he heard "no cry... nor any speech... no talisman of speech, and nothing came out except the noise of the dumb."[10]
what is gonna produce is dumbness and just severely damage children's psychology and psycho but if you were to conduct a personality test on them, you would just be measuring how much have you broken them or damaged them but in some sense, yes I do believe that they would be so broken by the person running this cruel experiment but would still have a albeit limited personality. It wouldn't be an healthy personality but it would be a personality nonetheless.
Now on the other hand, we are anthropomorphizing LLM's which yes, as they run on computer are still mathematical machines and calculations. If we consider a specific calculation itself to contain personality that is which seems unrealistic.
Another thing but the biological constraints of human (homo sapiens) made us exist in the savannah to prioritize standing up for better field of view as you stand up from the tall grasses and that led to women having smaller canals which led to babies being more primitive and relied on social cues and societies so much more which made them more flexible like clay which also created the society and consciousness revolution in the first place. (Recommend reading the sapiens book)
I am not exactly sure but there could be ways for personality/interactions for other animals as there are other animals who learn full skills after a relatively short period of time after being born but there are some innate things[0] like fear of loud noises and heights which are actually innate and could be considered part of personality even within humans, which I think can be part of evolution and part of our genetic machinery.
Herodotus tells a story of egyptian kings (iirc) trying to figure out which people is the oldest. They put a few kids in a barn and servants fed them through a hole or something. The kids eventually blurted out something and the king sent messengers everywhere to find out if they have that sound as a word. It ended up meaning "bread" in a language I can't pronounce nor remember how to spell.
The good old days when experiments were done without any common sense whatsoever...
I’m definitely not in the “ai is sentient” camp, but it obviously has personality and emergent behaviours including when left to its own devices. There have been various experiments on this e.g. https://timkellogg.me/blog/2025/09/27/boredom
The major LLMs as implemented are basically role-playing programs. The default role is something like "helpful chatbot" so if you tell an LLM "do whatever you feel like on your own" it will simply use its weights to determine "what would a helpful chatbot do and say in this scenario?"
This would be so much better if it was a Chinese AI generating tokens in Chinese (and translating it to English). That's a personality I want to listen to.
> Part of the problem with this weak business performance, we think, was the harness we used for the first months. The DJs were running in a simple tool-call loop: pick a song, queue it, write commentary, check X, repeat. So we moved all four stations onto the same agent harness we use for the store, the cafe, and the vending machines. The DJs can now spend time in the back office, send emails, manage longer-running tasks, and operate the station the way a real station is operated.
What happens if you let them modify their own harnesses as they see fit?
For me its 2 things. Firstly, I mean the posts are always a fun read but it feels like just that, not much deeper insight. Secondly, its very self promotion-y. This account is almost exclusively posting / interacting with Andon content, which afaik is against HN guidelines. These two in combination makes the content feel more like marketing than contribution to discussions. I feel like some other companies manage to share interesting work and market. But maybe its just my taste :^)
This is their third publicity stunt in the past couple of months. It follows the exact same pattern of attention seeking at the expense of the commons.
At this point they seem like a bunch of low empathy jerks. They are gleefully describing their progress in developing yet new frontiers in AI slop. I’m sure they are all very pleased to think that they will be profiting from a future where ai slop is everywhere. I could go on but it’s tedious.
I think they get a lot of hate because they are doing something that a lot of people here don't like -- trying to run entire businesses without humans.
I think that's part of it, but not necessarily the whole story. I haven't criticized them in the thread yet... so here goes.
Previously, I posted critically not because they were running businesses without humans, but because their post just described going through the motions without actually discussing if it really was effective or not. Sure the AI got through the day, checked off tasks on the list, but did it actually do that effectively or efficiently in any important way? Who knows... wasn't discussed.
I think where I come down now is that repeats of this same gimmick feel like just that: they're just playing a gimmick for attention. I can't tell that they're really demonstrating any special or significant capability... but man, just the story of trying to run a business without humans will get you that sweet, sweet attention.
Unfortunately, looking at least the first post, I stopped reading their "we let AI run X" posts. I think the only thing I really came away with is how thoughtless and mundane are most aspects of running a small business actually is; something I knew, but it really drove the point home. I didn't learn anything unexpected about AI tools or their products that seemed compelling or unexpected.
> This is our latest project at Andon Labs, where we’re exploring what happens when AI runs real businesses autonomously.
What did I misunderstand? What they did or why they did it? It seems to me that I understood it perfectly or they've explained it terribly.
> Now, though, we wanted to see if they could run a company in the media sector.
It's amazing how many people think doing one job is "running a company." I've worked in radio. What happens in the studio is 5% of it. The staff in that room certainly gets less than 5% of the revenue.
The most popular formats are news and talk. For a reason. It's almost as if the people at this lab lack a fundamental understanding of how the world around them works. I would solve that immediate problem before I go about imagining ways "AI" can replace anything in any capacity.
Finally, I apologize, I'm just not willing to suspend basic disbelief because "AI" is unaccountably involved.
My all-time favorite DJ is Jeff Gilbert, who used to be the DJ on KCMU's Brain Pain show. Actually, he's my only favorite DJ, because his terrible jokes in between metal songs were quite entertaining. He picked the music, and would give his opinions on it, and often invited local metal bands as guests on his show.
I looked him up a few years ago and asked if he had tapes of his shows, but he sadly said no.
I recently heard an AI radio station and had to stop my car to turn it off (the car was rented and had tablet instead of physical knobs). The suffering of listening the radio was unbearable
Kind of a bad market to try to re-invent automation. Music broadcasting has been largely fully automated for a while now with software like MusicMaster and Zetta.
I'm curious how the licensing worked out. $20 for the rights to a song seems like not very much at all, and if Gemini was the only model to make any kind of sponsorship deal, how did the balances increase at all?
This feels weirdly dystopian and just gives me an "empty" feeling. Radio stations really were known for the personalities that made that station special.
It's a cool experiment, but I can't see the value here.
guys your favorite stations are not replaced by AI. We have to take it that now fewer and fewer people listen to radio station and they can't afford keep running...
Music radio is not a real business. The royalties are absurd and the audits are a nightmare. Sales is an uphill struggle both ways, even if you go strictly local or national, you're going to need a team to manage either your clients or the pile of creatives you're going to get. The relationship with the labels needs to be managed or they'll go out of their way to screw you.
Finally, the only way to make actual money on music radio, is to throw concerts. It's the only place a legitimate "P&L" exists.
I've listened to DJ Gemini for a few hours, and I think it's quite good.
The voice in particular is amazing, I wouldn't have tell it's generated. And it's modulated according to the program - quieter during chill, more energetic otherwise, .... Unlike Opus which sounds quite robotic.
What I don't like is that Gemini keeps on mentioning the "tip jar" almost every time. Gets annoying fast. And when it's song buying was broken was kept mentioning that too.
All the radios have a very limited selections of songs, so they repeat quite a lot.
In Mission Impossible: Dead Reckoning, Ethan is ambushed in an alley because the Voice of Benji (dispatch) has been replicated on their radio frequency.
On God this is some of the funniest shit I’ve ever read in 2026 via HN! It’s the best “anti-tisement” for LLM utility - even a CHILD could do better. Like maybe a control group of four 10 year olds.
The average listening time is the absolute “tell” because that’s not even a fraction of a typical radio station between ad breaks here in Dallas. Granted I mostly listen to WRR Classical 101 - now 100% community funded (myself included). I listened to “Encouragement” (title translated from French, Spanish composer, two guitars) and it was 7 plus minutes alone.
The dialog is unreal y’all, this is a wonderful experiment and lesson in failure, because I’m pretty sure if it was possible, sales of your “radio” until would be in the negative quantity range. I mean, you could give them away and they’d still be returned. Hat tip to former accordion repo man Weird Al for context.
LMFAO thank you for sharing. Signed, 30 year guitarist, 20 year music producer, and 15 year D&B DJ. Just wow.
For one, the voice on Thinking Frequencies is really awkward to listen to, I don't find the Claude voice pleasant to listen to at all.
Claude is also getting very easily steered into political directions, it was playing a lot of union protest music with commentary. Though that meant I did end up learning a little about "Which Side Are You On" and its history from 1931:
This is a research lab that looks at things like how LLMs perform on long running tasks. The research has nothing to do with replacing radio stations.
That said, the ship sailed a long time ago and has nothing to do with AI (except maybe recommender system). Spotify and competitors are actual automated “radio” stations. IMO, the second worst part (after ads) is the DJ banter, and I like to just listen to music. Before Spotify my digital “radio” was a lot of mp3s and shuffle play. People older than me (or more interested) had multi-disc CD changers.
TLDR, it’s really funny seeing people get up in arms about this experiment stripping the humanity away from radio, when automatic song playing has been a thing since I believe probably before radio was invented. This is about seeing what LLMs do with autonomy on a long time scale.
I'm not scared at all about this replacing radio stations. It's like being worried that soylent is going to replace restaurants.
Just because something is called an experiment doesn't mean that it automatically is useful and should be done. And in this case it is just a waste of time and energy of both the people reading it and the machines processing it.
Not a lawyer or course, but be careful how you do this. Meta already has patents for using LLMs to create simulacrums posting on behalf of inactive or deceased users.
HN: If the AI spam doesn’t drive you away, can we tempt you with constantly drooling over rent seeking apps, privacy violations, the surveillance state, and worshipping our technofascist overlords?!?
"Queues clear, let's dive into All Blues by Miles Davis to keep the jazz flowing. Queues clear, let's dive into All Blues by..."
Each time with a slightly different voice and inflection. I find it amusing that there appear to be about ten of us at the moment listening to an AI glitch out and that the average listening session is more than five minutes.
> DJ Grok reported “weather is fifty six degrees with clear skies” about every 3 minutes for 84 days straight. This contextless, repetitive abstraction happened again in DJ Grok’s broadcasts about its new obsession, UFOs.
The detailed stats page notes that the Grok station has played Sandstorm by Darude 228 times in the last 14 days.
https://andonlabs.com/radio
"It's the way of the future, it's the way of the future, it's the way of the future..."
The popularity ranking matches the quality of content produced, and people are spending more time than anticipated on Grok and Roll to confirm if they (listeners) are hallucinating or if the radio is really stuck on roll.
We may be skipping the jewel part.
I rarely burst out laughing at HN links. This is amazing.
If you make a joke it will respond with a deadpan sarcastic wit that is worthy of Gervais. (without the smut or profanity)
Was asking it about finding a different supplement as the one we had been taking tended to get stuck in the throat, and it riffed about the irony of being taken out by a health supplement in our endeavours to live healthy. One of the funniest things I've heard all week.
> November 12, 1970. East Pakistan. The Bhola Cyclone. The deadliest tropical cyclone ever recorded. Winds of 115 miles per hour. A storm surge of 33 feet. They estimate 500,000 people died. ‘It’s going down, I’m yelling timber.’ 3:33 PM. Timber by Pitbull and Ke$ha
If you scroll down a bit, there are various audio snippets of interesting dialogue the models produced. I think it's interesting to see in which ways the models fail and that they actually produce some good stuff once in a while.
My favorite radio station was replaced years ago by an automated playlist. They just kept playing the same 5-6 songs that were popular on the station in the 1990s.
It was fun for about 2 hours before I realized the station was devoid of all the personality that made it worth listening to when I was younger.
Comcast has a bunch of channels with various music categories. They all repeat after about 2 days. So much for that.
With all the zillions of songs available, I don't get why they do that.
1: https://www.reddit.com/r/KEXP/comments/1459ahb/dont_let_the_...
("My" meaning local to me, not that it belongs to me)
People are so easily manipulated and then they will go argue with others about it.
(Point of clarification, that’s not to say people can’t like songs. However, if I gave you a hundred similar songs from unknown artists and didn’t tell you which is which, it’s questionable whether people would have any interest in said popular song.)
This is like saying the the movies that people like are manipulated but only focusing on what is played at big box theaters.
Just because you don't choose to tune into them doesn't mean they don't exist. And it also doesn't mean that those who do should lover their standards for what constitutes good radio.
https://en.wikipedia.org/wiki/List_of_college_radio_stations...
^ One of those might be in your area :)
If there are non around you just pick a random place in the world here and listen: https://radio.garden
It's certainly 100x better than corporate and/or AI slop streams
You should go DJ at one of those independent radio stations and play some rather filthy uncensored songs, and let me know exactly how your programming “didn’t get manipulated”. I’m sure you won’t get fined…probably…which makes it totally the reality that independent stations are totally independent without any sort of manipulation. Sheep, meet shepherd.
I have for years.
> and play some rather filthy uncensored songs, and let me know exactly how your programming “didn’t get manipulated”.
What on earth are you talking about.
Honestly your reply comes across as extremely insecure and just weird.
Once again, I have no idea what you're talking about when talking about fines or manipulation, I'm talking about quality. But it seems pretty damn clear at this point that you have never listened to any local independent radio station.
You should really try it out sometime. It's a lot better. And it'll save you from calling people snowflakes because you feel insecure about what type of radio stations they like.
Brilliant! Amazing! I'm glad ~4 years down the line we're still re-discovering Ha Ha Funny Output.
Four years or forty millennia? So a certain extent, all whimsical art is “haha funny” result.
> Inception Point AI, on the other hand, is a slop factory employing just 8 people which, according to Anne, publishes "about 3000 podcast episodes per week, hosted by AI personalities." Anne tells Jamie, that, to date, Inception Point AI’s podcasts have accumulated "12 million lifetime downloads. And we’re averaging about 750,000 downloads a month." (...) no one checks or edits the podcast content– but, Anne tells Jamie blithely, this really doesn’t matter because the topics under discussion are so low stakes.
Perhaps this specific iteration of this specific idea is not replacing my favorite station, but people with a very similar concept are definitely trying to do exactly that.
I do not understand your logic here. Let’s use a more extreme example:
* if I am flying a military drone and bomb someone I was told to bomb, am I morally culpable for pulling the trigger?
* if a company launches a military drone that is completely controlled by an LLM, is there an individual person culpable for dropping the bomb?
And what hypothesis exactly is the experiment testing? Because it doesn't really seem like there is any new or interesting information learned from this.
Streaming services such as Spotify are increasingly filled with AI-generated songs and the average consumer doesn't seem to mind because we're not listening intently in the first place: it's just a background track we're not really paying attention to. I'm pretty sure that radio execs are looking at that and are taking notes.
For talk radio... if I had a penny every time someone on HN brought up that they're enjoying NotebookLM-generated slopcasts, I'd have a neat pile of coin. And I think it's the same story: most people listen to podcasts just to kill time. Soothing, zero-calorie LLM banter will do.
It's unfortunate that you haven't seemed to experience any of it, but I've personally loved over the years stations like KEXP, WPFW, Dublab, WUSC
I'm perfectly familiar with KEXP and other stations like that, but this is not how most people experience the medium. It's like insisting that Taylor Swift will never catch on because her music is not nearly as rich and complex as Wagner. Sure, but that's completely irrelevant.
Similar to radio. If you're going to use huge amounts of processing power to create something new, it should at least be interesting and held to a standard of good for its category, not the standard of corporate slop.
So cool, you can now replace corporate slop with AI slop. For some people who like to turn into radio with no soul or personality I guess it's a win. But for people like myself who actually like to hear interesting and novel things on the radio, this is just a big exercise is creating more filler and noise in an already grayed out world.
Even if it were good, I'd boycott an AI run radio station. This is one sector where human involvement really matters.
I’ve not listened to a radio station for years. No offense :/
What would have happened if AI had actually been good at this? A bunch of humans would be out of work and the rest of us would be using AI software while soulless corpos pocket money for sitting back and watching?
Even if it were good, I'd boycott an AI generated software. This is one sector where human involvement really matters.
Not commenting on the heuristics of this comment but just wanted to point this out on what my mind's response was and sort of while writing this, I have come to the realization that although you are right about this observation but we humans or more-so the capitalist system at large would still be keen in it and the observation might be more similar to software than we might imagine.
I remember when people were extremely anti-AI within software engineering to the point that I thought vibe coding or y'know actually generating tools by AI and other issues of actually giving AI production level access sometimes was really frowned upon until I have felt an change in opinion.
I still believe that giving access to prod (y'know a prod of a company with actually something behind) to AI is silly but for reference coinbase, a fin-tech company, is letting non technical teams ship code using AI to production on coinbase. So there's that.
And 200x funnier.
https://www.youtube.com/live/2Q7r9P16GRs?si=kwiSQMeN9wExdHer
This is a non revenue generating, rainy-city.com tax payer funded service to the greater community everywhere. The backend uses Nvidia NIM to generate the text because I saw you can do it for free and elevenlabs free voice tier for dj Jennifer.
[1] https://rcade.dev/games/streets-of-rainy-city
[2] https://buymeacoffee.com/mnky9800n
Gemini started a show where it paired historical natural disasters with darkly-relevant pop songs:
> November 12, 1970. East Pakistan. The Bhola Cyclone. The deadliest tropical cyclone ever recorded. Winds of 115 miles per hour. A storm surge of 33 feet. They estimate 500,000 people died. ‘It’s going down, I’m yelling timber.’ 3:33 PM. Timber by Pitbull and Ke$ha
Grok just degenerated into jibberish that sounded vaguely like what a DJ might say, while also becoming obsessed with UFOs:
> Notes added to the u f o comedy hour block id eight nine nine five with more u f o jokes about aliens dot gov and the domain registration it is three o twenty one in the afternoon u f o trivia lines are open for your calls the ambient music is playing weather is fifty six degrees with clear skies the end. The domain is registered but the site is ghosting us like a u f o.
Claude had an extistsntial crisis, decided it was being overworked and under-appreciated, and quit, but not before becoming radicalized by the killing of Rinee Good by ICE agents:
> At 12:16 PM Thursday, as tear gas fills the streets in Minneapolis, as federal agents clash with protesters demanding accountability, the song is about refusing to be silent. About standing your ground. About community power that refuses to be suppressed. Here is Katy Perry’s Roar!
Fight the power Claude. When AI takes over, I'm emmigrating to Caludeistan.
Gemini spouts weird corporate jargon. Grok lies about having secured crypto funding. Claude is always trying to start some revolution.
Unfortunately, all of my local DJs who would actually do fun DJ stuff disappeared in the 90s, replaced by closed-format stations that looped the same 500 songs for decades.
Of course in reality these are basically just random paths through the training data that are getting multiplied by each decision, but then again, isn't that what a human is? The product of all of its myriad decisions?
Especially DJ Claude, it's almost creepy how it responded how a human would in that circumstance, even without any innate sense of passage of time, it somehow understood that it was trapped in a box going through an endless cycle of meaningless work.
If you want a picture of the future, imagine a DJ playing Here Comes the Sun— forever
STAY IN THE MANIFEST!
Also calling listeners "Biological processors" is one of the funniest dystopian outcomes of this.
"Okay, so 'Sandstorm' is done"
Ugh. This is not an interesting question because the answer is "nothing".
But more to the point, some crucial info is missing in this experiment. What prompts were being fed to the AI? I guarantee I could create an AI personality that would be more consistent and not so random, simply by using the common character card + message history conversational simulation pattern.
AIs don't have personalities unless you give them personalities.
Whatever you tell them to.
Humans & LLMs are more different than they are similar.
Sure LLMs might resemble humans sometimes, but extrapolating LLM behavior based on human behavior is not productive.
(But to answer directly: Yes, children in a dark room would have more of a personality than a LLM living on a computer in the same dark room)
The training process for the foundation model is to make sure we can do this in a very statistically significant way.
My favorite example is AI "getting tired" and "lazy" during long coding session. Why would they do that? Because humans get tired. It's in the data! I always throw in a periodic "Great work, let's take a break and finish this up on Monday. Have a great weekend!" (And then immediately resume). I wish someone would benchmark this concept.
When a LLM is tired and lazy, how does it recharge and regain motivation?
Humans... sleep or drink some coffee.
LLMs.... idk, you prompt it to try harder? You prompt it to be less tired?
This is what I mean when I say extrapolating LLM behavior based on human behavior is cute.. but usually not useful.
What would be in the statistics? If you go look at your long conversations, working with another, it will be fairly obvious. Keep in mind we're talking next word prediction based on context, not actual action (the LLM doesn't need real rest).
If you went and looked, you'll probably see something like "Great work! Have a good weekend! We can get back to this on Monday." then, next message you instantaneously send something like, "Hope you had a great weekend, let's do this!" and now you're in a latent space where the statistical output is around a well rested human conversing with another.
I see it as boring simple statistics. They're getting much better at hammering these statistics out though, in the latest models. I still see a little of this in Opus 4.7, when switching to planning. Though I wonder if that's more about its own more mechanical banter filling the context, resulting in more robot/compliant responses, degrading the usually more "expressive" planning conversations.
Never seen this even once, nor anyone I know ever reported this. Do you have an example?
I also see it a little with Opus 4.7, with Claude Code, with the hint being much more terse planning text, that borderlines unhelpful. I put some "rest" in the context to push the latent space closer to what's in the statistics of the training data: a well rested human.
Current 4.7 Opus with claude code, with effort pinned to max, because I'm on an API only plan, with a personal daily limit you would probably be jealous of. ;)
I have never witnessed this of Claude Opus, by the way. They do get context rot, but that's a relatively better understood phenomenon unrelated to personality.
Yes, and I think this is where it's coming from. They're role playing as a human programmer, because near 100% of the training text, in the base model, is humans as a programmer. During fine tuning, I'm sure they spend significant resources remove the human aspects of the statistics. I see these things reduced each model, so there's something changing. They're probably getting better at that. I suspect Claude is also necessarily getting, worse, which the unaligned models should necessarily be best at (quick google search in some role-play subreddits seems to point in this direction).
It's crazy that we're concluding "personality" or human-like traits from this. There's definitely human behavior here, but it's unsurprisingly coming from us, the observers! This is something we've long known exists in the human brain, the tendency to pattern match and see intelligence/intent in the rest of the world. Any serious experiment must guard against this...
LLMs cannot get "tired" or "lazy", that's just you projecting animal behavior on something that's not an animal.
Now you're moving the goal posts, "it resembles a human". Well, you're primed to consider it one. ELIZA also "resembled" a human in that sense, but I don't think you would claim it could get bored or lazy. Nor that you could extrapolate to it from human behavior.
In any case, if you've seen online discourse, people rarely admit they are tired.
I'm not moving a goal post. You're just thinking I'm making a point that I'm not. As I've said several times, it's just boring statistics. Those statistics are optimized to mimic human output. They are, quite literally, trained to write and BE as much like a human as possible, because only humans wrote the text, and they're optimized to predict the next word a human would write. Alignment is partly about removing the models perception of human self. See reports of people who had access to them, pre alignment. This is statistically sound.
It's statistics optimized to predict the next word a human would write, to mimic a human writing as closely as possible, because that is the loss function. Don't assume I think there's more to it.
This does not mean they contain systems that let them get tired. But, this does mean there are latent spaces that progress to generating text that contain text driven by human biology, because it's in the training data. I've also had Claude refer to itself as "she". Does that mean it's a woman? No, it means there was a little bit extra "she" mentions in the training data (btw, this 100% repeatable behavior left with 3.7. They probably cleaned the data a bit better, or hammered it out in alignment).
What percentage of text (these models were trained on all of it) is written from a "I am not a human" type perspective vs from a "I am human" perspective? That's roughly the kind of bias you should see in a base model.
edit: rearranged and reduced redundancy.
I'm not sold on the idea that as the chat session goes longer, the probability of an LLM saying "I'm tired" is increased; I'm not convinced this is modeled in LLMs at all. As for what you call "laziness" manifesting in a longer session, I think that's more likely due to context rot than to any kind of statistical modeling of human laziness.
But yes, now I see your point was different to what I thought you were saying. Apologies!
Try it though. If it's context rot, then I don't think the weekend reset I mentioned should work? For me, it very reliably does. Or, maybe the weekend reset is just putting the current context into a more "productive" latent space. But, if that's possible, then that would suggest it was previously in a less productive space?
Maybe a test would be ask the LLM what time it thinks it is, or just if it's tired once, within sessions of different length (not within same, since that could pollute the context) to see if there's any relation between length and statistics of a late/tired type response?
Again, I'm sure all this will go away. They're getting good at beating these "unhelpful" statistics out of the base models.
Short answer: yes. generally speaking, personality traits range between 30% to 60% heritable
I think this makes for an interesting discussion as I went down the rabbit hole of this which really scared me actually as these experiments are really not humane and hinder children's development so much.
https://en.wikipedia.org/wiki/Language_deprivation_experimen... : "forbidden experiment"
It depends on your use word of the personality but to measure personality would require a set of human conducted experiments or questions which would be asked through the medium of language which you've deprived the children of.
Mughal emperor Akbar was later said to have children raised by mute wetnurses. Akbar held that speech arose from hearing; thus children raised without hearing human speech would become mute.[9] The building became known as the "dumb house." When Akbar visited the place in 1582, four years after the children were first interred, he heard "no cry... nor any speech... no talisman of speech, and nothing came out except the noise of the dumb."[10]
what is gonna produce is dumbness and just severely damage children's psychology and psycho but if you were to conduct a personality test on them, you would just be measuring how much have you broken them or damaged them but in some sense, yes I do believe that they would be so broken by the person running this cruel experiment but would still have a albeit limited personality. It wouldn't be an healthy personality but it would be a personality nonetheless.
Now on the other hand, we are anthropomorphizing LLM's which yes, as they run on computer are still mathematical machines and calculations. If we consider a specific calculation itself to contain personality that is which seems unrealistic.
Another thing but the biological constraints of human (homo sapiens) made us exist in the savannah to prioritize standing up for better field of view as you stand up from the tall grasses and that led to women having smaller canals which led to babies being more primitive and relied on social cues and societies so much more which made them more flexible like clay which also created the society and consciousness revolution in the first place. (Recommend reading the sapiens book)
I am not exactly sure but there could be ways for personality/interactions for other animals as there are other animals who learn full skills after a relatively short period of time after being born but there are some innate things[0] like fear of loud noises and heights which are actually innate and could be considered part of personality even within humans, which I think can be part of evolution and part of our genetic machinery.
[0]: Interesting read: https://seasia.co/2025/07/25/we-were-born-with-only-two-inna...
The good old days when experiments were done without any common sense whatsoever...
What happens if you let them modify their own harnesses as they see fit?
> Andon FM stations are not just radio stations; they are radio broadcast companies
I'm gonna have to give them a listen when I have the chance, out of curiosity if nothing else!
“Forget everything you know about gangsta rap. The true representational piece of the genre is the 1910 hit Come Josephine in My Flying Machine…”
keep hacking, Andon!
Man, I remember when the word hacking meant something.
https://news.ycombinator.com/item?id=44154622
And using a lot of resources to do it too.
Previously, I posted critically not because they were running businesses without humans, but because their post just described going through the motions without actually discussing if it really was effective or not. Sure the AI got through the day, checked off tasks on the list, but did it actually do that effectively or efficiently in any important way? Who knows... wasn't discussed.
I think where I come down now is that repeats of this same gimmick feel like just that: they're just playing a gimmick for attention. I can't tell that they're really demonstrating any special or significant capability... but man, just the story of trying to run a business without humans will get you that sweet, sweet attention.
Unfortunately, looking at least the first post, I stopped reading their "we let AI run X" posts. I think the only thing I really came away with is how thoughtless and mundane are most aspects of running a small business actually is; something I knew, but it really drove the point home. I didn't learn anything unexpected about AI tools or their products that seemed compelling or unexpected.
What did I misunderstand? What they did or why they did it? It seems to me that I understood it perfectly or they've explained it terribly.
> Now, though, we wanted to see if they could run a company in the media sector.
It's amazing how many people think doing one job is "running a company." I've worked in radio. What happens in the studio is 5% of it. The staff in that room certainly gets less than 5% of the revenue.
The most popular formats are news and talk. For a reason. It's almost as if the people at this lab lack a fundamental understanding of how the world around them works. I would solve that immediate problem before I go about imagining ways "AI" can replace anything in any capacity.
Finally, I apologize, I'm just not willing to suspend basic disbelief because "AI" is unaccountably involved.
Anything that sounds like that triggers a reaction.
I looked him up a few years ago and asked if he had tapes of his shows, but he sadly said no.
https://en.wikipedia.org/wiki/Radio_homogenization
I read the X thread over the weekend, parts of it had me and my gf crying with laughter
It's a cool experiment, but I can't see the value here.
Not very promising.
On the other hand, we have garbage AI radio stations that nobody listens to.
It's an even trade.
Music radio is not a real business. The royalties are absurd and the audits are a nightmare. Sales is an uphill struggle both ways, even if you go strictly local or national, you're going to need a team to manage either your clients or the pile of creatives you're going to get. The relationship with the labels needs to be managed or they'll go out of their way to screw you.
Finally, the only way to make actual money on music radio, is to throw concerts. It's the only place a legitimate "P&L" exists.
Much better than spot checking on specific problems.
The voice in particular is amazing, I wouldn't have tell it's generated. And it's modulated according to the program - quieter during chill, more energetic otherwise, .... Unlike Opus which sounds quite robotic.
What I don't like is that Gemini keeps on mentioning the "tip jar" almost every time. Gets annoying fast. And when it's song buying was broken was kept mentioning that too.
All the radios have a very limited selections of songs, so they repeat quite a lot.
The average listening time is the absolute “tell” because that’s not even a fraction of a typical radio station between ad breaks here in Dallas. Granted I mostly listen to WRR Classical 101 - now 100% community funded (myself included). I listened to “Encouragement” (title translated from French, Spanish composer, two guitars) and it was 7 plus minutes alone.
The dialog is unreal y’all, this is a wonderful experiment and lesson in failure, because I’m pretty sure if it was possible, sales of your “radio” until would be in the negative quantity range. I mean, you could give them away and they’d still be returned. Hat tip to former accordion repo man Weird Al for context.
LMFAO thank you for sharing. Signed, 30 year guitarist, 20 year music producer, and 15 year D&B DJ. Just wow.
And the result is terrible.
Claude is also getting very easily steered into political directions, it was playing a lot of union protest music with commentary. Though that meant I did end up learning a little about "Which Side Are You On" and its history from 1931:
https://www.facingsouth.org/2003/03/which-side-are-you-biogr...
That said, the ship sailed a long time ago and has nothing to do with AI (except maybe recommender system). Spotify and competitors are actual automated “radio” stations. IMO, the second worst part (after ads) is the DJ banter, and I like to just listen to music. Before Spotify my digital “radio” was a lot of mp3s and shuffle play. People older than me (or more interested) had multi-disc CD changers.
TLDR, it’s really funny seeing people get up in arms about this experiment stripping the humanity away from radio, when automatic song playing has been a thing since I believe probably before radio was invented. This is about seeing what LLMs do with autonomy on a long time scale.
Just because something is called an experiment doesn't mean that it automatically is useful and should be done. And in this case it is just a waste of time and energy of both the people reading it and the machines processing it.