Various LLM Smells

(shvbsle.in)

62 points | by speckx 2 hours ago

14 comments

Planktonne 38 minutes ago
> The LLM generated writing obviously felt significantly better than my own writing.
A general pattern for LLMs is that they look really good at things you are bad at. What that means is that if you find yourself thinking of its output as significantly better than yours in a particular domain, there's a high chance that you are not equipped to judge that quality effectively.
[-]
- flatline 0 minutes ago
  I don't disagree about the probability, but the current frontier models are not completely useless for writing even in areas where I have significant knowledge. I would not have said that a year ago. You have to watch them like a hawk -- they are good at spitting out plausible sounding nonsense that is hard even for an expert to discern. But the dice roll going on behind the scenes is continually more biased towards being correct/useful than not.
- dvt 34 minutes ago
  Honestly, I can't fathom thinking that LLM writing is even remotely passable. People that think this should honestly read more. One book a month is hardly an aspirational goal. You don't even have to read Melville or Hemingway or Chaucer or Shakespeare, just pick up any popular NYT best seller, and it'll be significantly better than anything an LLM can generate.
  [-]
  - gchamonlive 8 minutes ago
    Really hard to take your comment serious when the only post on dvt.name is a hello world page, because at least OP is trying to publish and you are lacking moral high ground to judge him thinking LLM writing is good.
    [-]
    - dvt 2 minutes ago
      Lol my blog was hacked recently and I've been lazy about moving my backed-up mySQL DB to the new WP installation. Not sure where moral high ground enters the picture. If I really wanted to be an asshole, I'd cite a book I co-wrote and another I edited.
  - xienze 16 minutes ago
    > I can't fathom thinking that LLM writing is even remotely passable. People that think this should honestly read more.
    This makes me think you're only exposing yourself to high quality writing online and from an intelligent circle of friends and coworkers. The average person's reading and writing abilities are _atrocious_ and only getting worse. LLM prose is significantly better than what the average person can produce.
    [-]
    - sublinear 9 minutes ago
      Are we also saying it's acceptable to feed people junk because it's better than what they would cook?
      At some point you're just making bad excuses for false scarcity.
- skydhash 25 minutes ago
  I dabble in drawing and I find LLM images (and maybe some non LLM one) abhorrent. As for why, I can think are no consistency (perspective, small details, and color theory) and too much details making it a visual noise. In most painting, the artist will have a subject that is most detailed (to draw the eyes) and from there, the lost of details will follow some kind of logic. This is how you pinpoint what the artist is most interested in. LLM looks like a filter applied to a montage of pictures.
  [-]
  - gchamonlive 14 minutes ago
    It's like a gross looking slice of pizza, it's mindbending because at first it looks good, after all it's pizza, but something in it makes it really disgusting
- bell-cot 35 minutes ago
  Mnemonic: geLL-Mann amnesia effect
rimeice 16 minutes ago
Scrolling down a LinkedIn feed is hilarious at the moment.
My favourite one today from today:
“The tax isn't the problem. The mindset is.”
1970-01-01 12 minutes ago
The LLM doesn't smell like authentic writing but it does a great job for fast and cheap words. We've gained something similar to fast food. Words made very cheap, very fast, easily digestible, but they have no emotion. In short stints it does have a place in the world.
spdustin 18 minutes ago
- “(The) honest caveat:” (or “genuine caveat:”, both with the colon)
- “(The) honest answer:” (again, with colon)
- “The thing to internalize:”
- “The smoking gun:”
(really, sentences that start with “The <tag suggesting the next clause is the key point>:” are a strong tell, but those four are the most prolific)
- “load bearing” (when not talking about architecture)
- “blast radius” (when not talking about actual explosives, but rather the effect of an event/action)
- “smoke test” (esp. when “sanity check” is more apropos)
- Lists of three clauses/adjectives where the third is really just a combination of the first two
- Referring to the “shape” of things figuratively
- Social media posts that end with “Curious if anyone…”
- Stories or anecdotes using. “Oh. Oh.” (where the second “oh” is italicized)
Edit: Yes, some of those last ones are terms that we often use as devs...but I would argue about the actual frequency of their use. Plus, these tells live on in prose generated by the latest models.
n42 51 minutes ago
```
  No ___, no ____. Just _____
```
or using "honest" to describe an approach.
[-]
- GrinningFool 2 minutes ago
  Jab, jab, thrust is how I think about that pattern. Or tap tap whack, if you prefer. And it shows up for for positives too:
  "Smooth. Effortless. A perfect fit for your needs".
  In any style of informal or persuasive writing this shows up , as if it has to drive the point in.
  I kind of wish we'd stop talking openly about what the tells are. It's nice to be able to determine with fair accuracy - but it couldn't last forever.
KronisLV 40 minutes ago
> The "JetBrains Mono" font
Thought for sure we'd get a critique of Inter overuse. JetBrains Mono is a lovely font, though.
[-]
- fortyseven 24 minutes ago
  It's my daily driver, so I kind of twitched a bit saying that list in here. I never noticed because I was using it anyway, I guess.
dvt 40 minutes ago
It's kind of interesting how genuinely hard it is to get models to deviate from basically all of these tropes. You can straight up tell it "I hate that card design, do something different, get creative!" and it'll do something either (a) ugly as sin (clearly just essentially a random walk through parameters) or (b) some same-y derivation of that card.
In coding, I've noticed a few tropes as well: everything is a "contract" or an "artifact" (clearly trained on like three decades of Java lol), everything is constantly "backwards-compatible" or "versioned" (even if working on a brand new greenfield project), and a few others.
[-]
- jkdufair 36 minutes ago
  If claude says "load bearing" once more, I think I'll vomit.
  [-]
  - dieselgate 15 minutes ago
    That's a funny one. I don't use LLMs at all but "load bearing" is such a common/over-used internet joke for DIY building projects and stuff like "load bearing caulk". Have never heard it in a software sense really so am slightly perplexed
  - dvt 15 minutes ago
    Hah, ChatGPT constantly says "that's real" or "less about X, more about Y."
docheinestages 25 minutes ago
You are right to push back.
danielodievich 44 minutes ago
All of those are included in the bulk of the documents passing my work input these days. It is infuriating. Out of principle I maintain 100% me in all my writing but I don't know if it matters. Well maybe it does... an interviewee recently complimented me on the "nicest and most human resume" they saw recently. That felt good
[-]
- exe34 24 minutes ago
  Do you send your resume to people before you interview them?
mil22 39 minutes ago
Those cards, so familiar! Exactly what Opus produced for me.
Did Anthropic and/or OpenAI deliberately train their models to produce websites with a specific design language, or did these stylistic preferences emerge naturally as some kind of LLM-selected optimum?
manoDev 25 minutes ago
Welcome to the future of fast-food software. Taste of deep frying and preservatives.
dionian 43 minutes ago
KPI cards, purple gradients
poszlem 19 minutes ago
What I find amazing is how HARD it is to make the LLM produce a piece of text that does not sound like slop. I have had dozens of sessions where I tried to make it write like a human would, and yet it still uses those tired writing phrases. I don't understand why neither openai, nor anthropic are able to do anything to make it better, and in some cases it feels like we are actually going backwards.
nikhilpareek13 2 minutes ago
[dead]