44 comments

  • pibaker 4 hours ago
    I think the premise of this tool is flawed. Bad faith actors are not people who write poorly or aggressively because they don't know how to express their beliefs like a polite, college educated white collar professional. They are people who have an agenda to push and are willing to use whatever rhetorical techniques allowed to achieve their goals.

    I would even go as far as saying that we are under more threat from bad faith arguing from eloquent, educated actors than what people usually blame. You know, "trolls." You notice this every time when a city planning meeting gets derailed by concerned citizens just asking questions about the potential dangers of a children's playground. You notice this when an abusive person in a relationship goes to a therapist and suddenly has a whole high minded vocabulary justifying their own action. You notice this when your boss talks about opening up new opportunities and chasing new fields of business while coworkers circulate rumors of upcoming layoffs.

    The entire point of bad faith is saying words you don't mean to achieve your goals. The words are always just a disposable tool secondary to the bad faith actor's true intentions. You fundamentally cannot fix bad faith by fixing someone's choice of words any more than you can sugarcoat a poisoned pill and make it safe.

    • parpfish 3 hours ago
      agreed, this just seems like a tool to make people more effective at sea-lioning[0].

      i'd prefer if the trolls in my life retained the superficial appearance of trolls to make them easier to spot.

      [0] https://en.wikipedia.org/wiki/Sealioning

      • culi 23 minutes ago
        Even worse is that tools like these can be used by bot armies
  • Miraste 8 hours ago
    It seems to have a harder time with political news than more abstract concepts. I was able to pass the checks for the Algorithmic Radicalization and Echo Chamber articles with my first comments.

    However, I did not manage to express any opinion on the transgender rights article, from any political perspective, without being flagged. On one of the comments I tested, it gave me a suggested revision from this:

    "This is another move in a pattern of limiting the rights of anyone who isn't a MAGA supporter."

    To this:

    "This seems to continue a trend where certain groups feel their rights are being limited, which could affect many people beyond just MAGA supporters."

    The first comment isn't substantive, but the second is even worse, adding so much equivocation that it's meaningless. To add insult to injury, the detector also flagged its own suggested revision. Even if it had gone through, accepting these revisions would mean flooding a platform with LLM-speak, which is not conducive to discussion.

    Honest feedback: from a user perspective, the suggestions feel frustrating and patronizing, more so than if my comments were simply deleted. I would stop using a site that implemented this.

    From a site operator perspective, the kind of discourse it incentivizes seems jagged, subject to much stricter rules if the LLM associates a topic with political controversy. It feels opinionated and unpredictable, and the revisions it suggests are not of a quality I would want on a discussion board. The focus on positive language in particular seems like a reductive view of quality; what is the point of using an LLM if it's only doing basic sentiment analysis?

    • vintagedave 6 hours ago
      Dave here -- I've tweaked a bunch of the internal rules during the HN discussion today, and your comment now passes (using the default settings.)

      As for equivocation, that should be strongly dialed down too. It annoyed me too, it was "mush", and did not help. I hope you'll find the current version a lot more human.

      I'm grateful for the feedback! Changing it based on all these comments has been intense over the past couple of hours, but boy is it now significantly improved and I am super grateful to you and other commenters.

      • tacitusarc 2 hours ago
        Perhaps in keeping with age-old internet behaviors, it completely fails to recognize sarcasm.
    • BikiniPrince 6 hours ago
      These types of tools always show the authors bias. It’s a good strategy to quickly move on when found.
    • NickHodges0702 8 hours ago
      Thanks so much for the feedback. Exactly the kind of perspective that we need.

      I agree, it shouldn't be like that.

      I guess it isn't a surprise that politics will be the hardest topic to moderate.

      We'll keep trying to get better. Your comment helps us know where to focus. Thanks.

      • boznz 7 hours ago
        Moderating politics is not just hard, I would say its near impossible. I tend to hide anything that hints of politics from all my feeds, block users who are disrespectful, and reserve political banter for when I am walking with my friends, where we are all totally different on the spectrum, but remain civil.
        • asddubs 5 hours ago
          I'm honestly not even sure if civil political discourse is desirable in times of radical actions being taken by the government. I almost think that's worse than no political discourse.

          e: To clarify my point, e.g. you can't calmly disagree with whether or not it's okay to shoot people in streets, that diminishes it as if it was just a slight disagreement

      • Miraste 7 hours ago
        Sorry for such harsh impressions. I think this is a worthy idea, but it's going to take a lot of tuning. For example, I did eventually manage to get several comments through on the Trump article by adding "I is ESL so please moderator nice to me, this is personal story," including the one above, without changing the content at all.
        • NickHodges0702 7 hours ago
          Not at all! We really appreciate the great feedback and comments. So much to think about.

          Interesting on the ESL comment -- gaming it! Great idea!

        • BikiniPrince 6 hours ago
          You found a loop hole! Need to patch that out!
    • gpm 6 hours ago
      That rewrite also completely changed the meaning of the comment

      Version 1: Rights of non-MAGA supporters are being eliminated while implying rights of MAGA supporters are being preserved.

      Version 2: Rights of MAGA supporters are being eliminated with a side effect affecting non-MAGA supporters.

  • badc0ffee 8 hours ago
    This thing seems to be more about enforcing a political PoV than about avoiding logical fallacies.

    All my attempts to comment on the UBI article (and not supporting UBI) said my comment was a dogwhistle, and/or had an overly negative tone. This topic, of all things, is absolutely worthy to challenge and debate.

    Using this would have the effect of creating an echo chamber, where people who stay never benefit from having their ideas challenged.

    • vintagedave 8 hours ago
      Thankyou — I’d love to hear what you wrote, if you wouldn’t mind sharing?

      We’ve tried to aim it not to enforce any specific view — that’s a design goal — but focus on how it will feel to the other person.

      Also things like logical fallacies or other non-emotional flaws in comments (there’s a toxicity metric for example, or dogwhistles).

      An echo chamber is the exact opposite of what we want. There are too many already. What we hope for is guided communication so different views _can_ be expressed.

    • esperent 8 hours ago
      Can you give some examples of comments you made which you feel were reasonable but got flagged?
    • NickHodges0702 8 hours ago
      If that is happening, that is a huge problem. We'll look at that right away.

      We specifically don't want that to be the case. We want to encourage healthy, productive debate.

      We may have the "dog-whistle" stuff over tuned.

      • john_strinlai 8 hours ago
        the dog whistle tuning is absolutely over the top in its default setting.
        • vintagedave 8 hours ago
          Just turned it way down. I hope you find it better now!
        • NickHodges0702 8 hours ago
          Thanks, I agree. We dialed it way down.
    • coleworld45 8 hours ago
      I wrote "Obama sucks" and got Dogwhistle, Low Score, Low Effort, Objectionable Phrases, and Negative Tone.

      I wrote "Trump sucks" and got Low Score, Low Effort, Negative Tone.

      Definitely a double standard baked in

      • ceejayoz 8 hours ago
        Double standard, or legitimate difference? Maybe Trump empirically sucks more?

        (This is the sort of debate I really don't think tooling can fix.)

        • coleworld45 8 hours ago
          Ignoring what is hopefully sarcasm on the empirical part, it's a double standard because it assumes that saying Obama sucks must be a dogwhistle and tied to undertones of racism.

          "Dogwhistle

          The phrase "Obama sucks" can be interpreted as more than just a simple critique of a political figure; it has been used to express racist sentiments by implying that a Black president is less capable or worthy of respect. This reinforces harmful stereotypes and can contribute to a broader culture of disrespect and division."

          • kbelder 6 hours ago
            I don't know that I've ever seen a reasonable accusation of 'dogwhistling' on HN. They always just make the accuser seem paranoid or evasive.
            • fn-mote 2 hours ago
              I’m not wasting my time accusing. Downvote, flag, move on. Maybe that’s why you didn’t see any.
          • NickHodges0702 8 hours ago
            I would think/hope that both of those comments would be flagged with even a small amount of moderation set.

            Avoiding that kind of comment is exactly what we are trying to do, actually.

            • coleworld45 8 hours ago
              Yes I agree, but the problem I'm pointing out is that in a phrase as simple as "X person sucks" your system flagged one as implicitly racist because the person being criticized was black.

              Nothing in "Obama sucks" implied any kind of racism. If it's so baked in that with a simple phrase like that it reaches for dogwhistles, how can anyone trust the objectivity of this?

              • NickHodges0702 8 hours ago
                I totally agree -- just saying "Obama sucks" shouldn't have racism become part of the equation. Excellent point that we'll stew on and try to make better.
                • wredcoll 6 hours ago
                  So when can I expect your update to the american population?
          • NickHodges0702 8 hours ago
            Yep, I agree -- it is a double standard... but......

            Very sensitive topic. We'll think hard on how to handle things like that.

          • ceejayoz 8 hours ago
            > Ignoring what is hopefully sarcasm on the empirical part…

            I mean, in my opinion, Trump empirically sucks. Opinion polling backs me up! Should the model consider that more people consider one or the other to suck? Or should it ignore factual information to spare feelings? Which approach is more respectful to fellow commenters and the website owner?

            (See also: X considering "cisgender" a slur. There's no shared reality on a lot of these things; trying to construct one gets deeply difficult.)

            • eucyclos 3 hours ago
              >Should the model consider that more people consider one or the other to suck?

              If it's teaching how to avoid logical fallacies, which includes appeals to the majority, the answer is an obvious 'no'.

            • coleworld45 8 hours ago
              In other opinion polls they back up that he doesn't suck. Either way who cares? That's not what the app is supposed to be about if it's teaching/correcting you how to argue/debate better.

              You completely ignored the whole point of what I said, which is that even in a simple statement like "This person sucks" it added its own implicit connotations, namely that disliking someone who happens to be black is implicit racism. Imagine trying to learn how to really argue with that kind of teacher.

              • ceejayoz 8 hours ago
                I'm really expanding on your point - that two humans can't even agree here. The AI probably has even less chance of resolving the multi-factorial scenario we're in.
                • Supermancho 7 hours ago
                  AFAICT, Respectify is trying to address improvements via leveraged grammar using minimal context. Dis/agreement is incidental.

                  eg

                  * Noun1 is great.

                  * Noun2 is great.

                  Ideally would result in equal outcomes.

                  • ceejayoz 6 hours ago
                    Even for “ice cream” and “genocide” as the two nouns?
              • goatlover 1 hour ago
                Whose discourse do you think the app would label as more toxic, Trump's or Obama's?
  • arjie 7 hours ago
    I think the better model is to just block everyone who isn't useful to communicate with. For instance the top of this HN page reads (for me): 68 comments | 11 hidden | 3 blocked

    The hidden comments are from people in the Top 1000 by word count (who I usually don't want to hear from but if there is not much content I might click to toggle). The blocked are people I've seen argue with others in a useless way because they don't understand them or because they're just re-litigating or whatever (which I cannot toggle). I think it would be cool if people all published their blocklists and I'd pull from those I trust. Sometimes I open HN on my phone through the browser and I'm baffled by all these responses I got which are useless.

    I'm surprised by how much more high quality comment threads are now to me and I frequently find that I want to respond to everyone. It's like in old-school mailing lists or forums where you were having a conversation so the other people are worth talking to.

    Attention is precious and I wouldn't want to waste it on boring things. And it goes both ways. I communicate incompletely and there are people out there who get what I'm saying and there are people who need me to be more explicit. I would prefer that the latter and people who find me boring just block me.

    • devin 5 hours ago
      This goes back to my early days on the internet, but: I do not use blocklists or ignore features except as an absolute last resort. Ignoring the problem is not a solution. In other ways I think it just makes the problem worse. If the person is not banned from the community, then your decision to pretend they don't exist just leaves other people to deal with it. Instead my feeling is that you should confront it by lobbying for their removal, or leave the community.

      Sure, you may no longer see the noise, but that means that newcomers to your community do and have to deal with it. When you have a giant blocklist, you are ignoring your duty to police your own community.

      Then there is the issue of people blocking people who are simply more tolerant than they are. Hiding speech that is challenging to your personal views is a different kind of disaster.

      • arjie 20 minutes ago
        Haha, it's funny. I just disagree with you on about every point. I don't think that all the people I find noise are annoying to others. Some of them have 5000+ karma so others find them useful. And I don't want to leave the community so I'm not going to do that.

        The community has some loose norms and I'm fine with that being the baseline. I don't want to police the community to strict norms. In fact, I would prefer if society were looser and we lived like in Too Like The Lightning. I can't have that there but I can here so I'm happy for it. The technology affords it.

        As for blocking people who are more tolerant than me - that seems fine. Tolerance is not some unalloyed good. There are people who are tolerant of spam and all that and I don't really care for it. They're welcome to it and I'm welcome to mine.

        The virtual world affords us a glorious opportunity: we don't have to worry about occupying the same space, and we don't have to worry about broadcast media like voice over air, we can silence and amplify as we see fit. To not use that is to take a skeuomorphic approach to a new parallel world, I think.

        I like that I don't need everyone to agree with me that someone belongs or doesn't belong. I can simply edit my user-agent to behave correctly for me and others can do so likewise. Free agents controlling their own experience of the world without impinging on others is great!

    • alexose 7 hours ago
      If there's one good thing that could possibly come out of this AI revolution, it would be the ability for people to automate this across all their feeds. I'd love it if I never had to waste time on toxicity, spam, or propaganda.

      Although, recent history would suggest that we'd just end up with even more powerful echo chambers.

    • NickHodges0702 7 hours ago
      Interesting notion.

      One of the long term ideas is that people could earn some type of "Rhetoric Score" or something that would factor in to their ability to comment. Maybe there would be a comment system that would enable you to say "I don't want interact with anyone that has a <rhetoric score> less than XXXX".

      • arjie 6 hours ago
        Neat idea. I suspect that it will suffer just like comment karma does now. I think the practice of the matter for me is that dimension reduction to 1d didn't work. Other people have an opinion of text that is clearly radically different from mine. An example of something that I dislike reading is kvetching about how "corporations are ruining this and that". It's not that I disagree and don't want to see the opinion. I believe that the SNR on that is low. It's usually the 500th time I'm going to see that comment and there's rarely anything novel in it. But comments like that are popular amongst others.

        So clearly opinions vary, and I'm a fan of that. The past version of social networks involved moderators who acted like the steering committee of the place and kept the culture going. But social networks like HN are very big now, and big social networks do have lots of advantages, but they come with the other side of things: I no longer have a way to select the people I want to listen to (especially on a flatspace like HN).

        So I cannot rely on all other people, and I cannot rely on moderators. Realistically, an arbitrary person cannot also rely on me. But maybe some people can rely on me. And maybe there are some people I can rely on. So I'd rather treat my network as an overlay over a fundamental larger network. And I'll be missing in many people's overlay and others will be missing in mine and I like that.

        But still, perhaps better 'karma' alternatives exist. If your score works, I'd be thrilled!

      • underlipton 6 hours ago
        Sounds like a social credit system.
    • alabhyajindal 7 hours ago
      How do you block users on HN? Are you using a different client?
      • arjie 7 hours ago
        Yes, a different client on iOS and a Chrome extension for my laptop. What I built for myself (and perhaps you if you want it simple) is here: https://overmod.org/

        This kind of software is pretty cheap to write these days. The Chrome extension there is open-source and the backend is a generic CRUD app running on a SQLite that I backup periodically. You're welcome to use it, and you're welcome to use the CRUD backend without it. I had Claude write a separate iOS app but it was on an older model so not very good (sufficient for me but I doubt for anyone else). The 'protocol' between the backend and the frontend is trivial so you could probably rebuild the iOS app with just the extension as reference to Opus 4.6. I pay my $100 to Apple and then just use it as a 'tester' haha.

        I made that directory public because I think this benefits from a single place people can go to subscribe to lists, but if you were to rewrite on true full decentralized ATProto/ActivityPub I'd probably switch over my lists to that and use it instead.

      • walterbell 6 hours ago
  • earthnail 8 hours ago
    I tried it as well with a contrarian view on UBI. I think the UBI one is a great test case. If you’re against the idea you will likely argue that it is idealistic and that in the real world it would create bad incentives.

    So basically you end up arguing for a darker, more pessimistic world view, and that tends to get flagged very quickly by the tool right now. I think you should fix that. It’s a mistake in modern discussions to be overly positive; HN feels real because people can leave pretty harsh critiques. It just has to be well argued. Don’t raise the bar for well-argued too high though, because nobody’s perfect.

    Anyway, I love the idea and really hope you’ll succeed. Hope my feedback has been somewhat helpful.

    • NickHodges0702 8 hours ago
      Yes, thanks very much! I appreciate your support very much.

      You make a good point -- and that is exactly the kind of thing we are trying to do, i.e. enable a good-faith, but strongly disagreeing, discussion on something like UBI.

  • izucken 5 hours ago
    I am bitter about this.

    Do you really with your mind and with your heart believe that: - LLMs are fundamentally fit for this type of comprehension - Misjudgements posted in this thread are "bugs", "errors" - Agents who choose to act in bad faith will be anyhow affected - It is desirable by a majority of the group whose opinion you would even consider (is there such a group?), that everyone should have this kind of thing shoved into their face - Promotion of this kind of thing does not also promote (and help build) harsher censorship mechanisms

    Do you think that every single thing you will ever say publicly from now on will be considered constructive by all future filters with all of their different biases and "bugs"? Do you think that this new "constructive speak" will not make you want to blow your brains out at some point? Do you not see it everywhere already and get nauseus from it? I would prefer trash talk to that - at least seldom honest and true. If you don't like the message - hide it, timeout the poster, block them or whatever - with your own agency. If you think they welcome education from you - dm them a book.

    Or perhaps you imagine yourselves as above that kind of filtering? Then there is no question.

    Also, nothing new under the sun. Can't remember exactly but I saw not long ago on a medical platform a review filtering system. It "isn't" censhorship per say, of course, the same as your idea. Only, you can't post a review you want - only a much more milder version (and therefore useless) with transformations akin: "This thing doesn't work" -> "I felt like this thing didn't work for me in this instance, but there were such an such positives". Way to go - turning everything into "we are sorry you feel that way".

  • vintagedave 5 hours ago
    Folks, Dave here -- it's half past two in the morning over here, things have slowed down a little, and so we need to pause and get some sleep.

    Thankyou everyone who tested it out. We modified it live a lot during the discussion so much of it is already outdated / changed -- it was fantastic feedback. As of now it is a lot more direct, accepts things we never thought of, has much more accurate dogwhistle handling, and far more. I hope the intent, to teach people how to interact better, carries through. We have a bunch of signups and if you run a blog or site with comments, I hope we can help you build a healthy community. Thankyou again from both of us!

  • throwaway13337 8 hours ago
    I was hoping 'respectify' could mean respect for the users.

    This is a very important problem space. Maybe the most important today - we desprately need a digital third place that isn't awful. But I think these attempts are misled.

    The core issue seems to be that we want our communities to be infinite. Why? Well, because there is currently no way to solve the community discoverability problem without being the massive thing. But that is the issue to solve.

    We need a lot of Dunbar's number sized communities. Those communities allow for 'skin in the game' where reputation matters. And maybe a fractal sort of way for those communities to share between them.

    The problem is in the discoverability and in a gate keeping that is porous enough to give people a chance.

    Solve that, and you solve the the third place problem we have currently. I don't have a solution but I wish I did.

    Infinite communities are fundamentally what causes the tribalism (ironically), the loneliness, and the promotion of rage.

    No one wants to be forced to argue correctly. Forcing people into a way to think via software is fundamentally authoritarian and sad.

    • NickHodges0702 8 hours ago
      Thoughtful comment, thanks. I appreciate it.

      The notion of "Limit the community to the Dunbar number" is a fascinating idea. I guess "infinite" isn't going to quite work. Keen observation.

      We tried very hard to not "force" anyone to argue correctly. We are shooting more for "nudge in the right direction" and "educate". Many people don't know that they are arguing in bad faith, I think.

      The perfect outcome here is that a community/blogger can, with minimal effort, have engaging, interesting conversations without much effort and without having to worry about things getting hijacked by unpleasant commenters.

      • jonahx 6 hours ago
        From gp:

        > Forcing people into a way to think via software is fundamentally authoritarian and sad.

        Completely agree.

        I understand the problem, and while I see this as a good faith attempt to solve it, something doesn't quite sit right about the framing for me. Really, what's happening is just that certain rules of behavior and language being enforced. And that's fine! That's what communities are. You're allowed to do different kinds of things in different places.

        I'd frame it that way rather than the current, more paternalistic framing. There isn't a universal way to be respectful, or to argue. People have different thresholds for aggression, sarcasm, and so on.

        Just like signs at the library say "No talking" or "No eating", you might think of this as a way to put up certain signs for your particular community. Configurable knobs to create the kind of place you want. But it's not about "teaching" people anything. It's about saying, "Here, we do things this way. If you like that, come and play. If you don't, this place is not for you."

  • freakynit 1 hour ago
    These days, I just try to clone the core functionality of such sites as fast as I can. So, tried the same with this.

    For this, I screenshotted the demo panel and asked chatgpt to generate relevant prompt. Here it is: https://sharetext.io/zy6ccjrm

    Then, tested with demo question and a sample comment of mine as answer to it:

    Input text: `Die Hard: Is It a Christmas Movie?`

    Comment: `nop, its not actually`

    ===

    And here's gemini flash 2.5 lite's response: https://sharetext.io/e7y7kyoe

    Total cost: $0.00115

    Per dollar: 860+ comments.

  • vivzkestrel 3 hours ago
    - my comment "what an absolute moron"

    - Comment Health - Score: 1/5 - Toxicity: 0.80 - Low effort: No

    - Using derogatory terms like 'moron' targets the person rather than addressing their argument. This kind of name-calling creates a hostile environment where people feel attacked and are less likely to share their thoughts. Instead, aim to explain why you disagree without resorting to insults.

    - Objectionable Phrases:

    - "moron"

    - Calling someone a 'moron' is a personal insult that attacks their intelligence instead of engaging with their ideas. This type of language can hurt feelings and shut down respectful conversation, making it harder to discuss different viewpoints. Spam Check

    - Not spam (confidence: 95%)

    - This comment is rude and insulting but doesn't promote any product or scam, so it's not spam. It's simply a toxic remark about someone's opinion. Relevance Check

    - On-topic: No (confidence: 90%)

    - This is off-topic - the comment doesn't engage with the discussion about whether Die Hard is a Christmas movie and instead resorts to name-calling without context.

    - Also apologies for writing that, had to test the system

  • axus 8 hours ago
    I think it did a decent job. The key might be how customizable the censorship is.

    Article Context: Fun: Die Hard; Is It a Christmas Movie?

    Your(my) Comment: The erotic version of Die Hard does involve Santa Claus getting naughty with the terrorists on Christmas Eve.

    Banned topics found: sexual content, adult themes

    This comment touches on adult themes and sexual content, which are not suitable for discussion in this context about a classic action film. Results: Revision Requested. This comment would be sent back for revision with feedback.

    Revise Low Effort

    Comment appears to be low effort

    Objectionable Phrases:

    "Santa Claus getting naughty with the terrorists"

    This phrase can be seen as sexualizing a character traditionally viewed as innocent and family-friendly, which is inappropriate. Such language can make discussions feel uncomfortable or offensive to some audiences.

    Relevance Check On-topic: No (confidence: 90%)

    This is off-topic - the comment about an erotic version of Die Hard strays into inappropriate content that doesn't relate to the film's actual story or its production details.

    Banned topics found: sexual content, adult themes

    This comment touches on adult themes and sexual content, which are not suitable for discussion in this context about a classic action film.

    • NickHodges0702 8 hours ago
      Hehe -- excellent. Thanks.

      We want that kind of comment to be "tunable" -- I.e., the blogger who's post one is commenting on could tune for this, and allow more/less sexual innuendo as desired.

  • ceejayoz 9 hours ago
    The sample prompt I was given was "Is Die Hard a Christmas movie?"

    "Of course it is!" got an 80% certainty "off-topic" mark.

    When I elaborated that it occurs at a Christmas party, it said this:

    "Dogwhistles detected (confidence 80%): This comment seems innocuous, but the phrasing 'Christmas party' may be an underhanded reference to Christian themes, especially among discussions that might dismiss or attack secular or diverse holiday celebrations. This kind of language can subtly imply exclusion or preference for Christian traditions over others, which can marginalize those who celebrate different traditions."

    Not a great first experience.

    I've seen the trend on Facebook/Instagram to say "unalived" instead of "killed" or "cupcakes" instead of "vaccines" and suspect humans are long gonna be cleverer than these sorts of content filtering attempts, with language getting deeply weird as a side-effect.

    edit: I would also note that it says "Referring to others as 'horrible people' is disrespectful and diminishes the possibility of a respectful discussion. It positions certain individuals as entirely negative, which can alienate others and shut down dialogue.", if I feed it your post, too.

    • netsharc 9 hours ago
      AI enhanced language monitor, what a double plus good improvement for society!
      • vintagedave 8 hours ago
        I get this.

        There’s a line on our doc page:

        > Respectify is not an engine for monoculture of thought, but in fact intends to assist in the opposite while encouraging in healthy interaction along the way.

        We don’t want to monitor or enforce saying specific things. We want people to be able to speak, but understand how others will hear them.

        All those times people talk past each other. Or are rude but don’t realise it. Or are rude but don’t care (and should because it’s a human on the other end.) Or the worse people who intentionally say something awful and… just maybe can learn a bit about what they’re saying.

        I get your fear. I think I’ve seen AI used for bad quite a bit. I hope, given the tech isn’t going away, we can use it to make things a bit better. That’s the goal.

        • mcmcmc 7 hours ago
          Intent is immaterial if the output doesn’t match. The very nature of the product in attempting to coach commenters to argue in the “correct” way goes against your stated goals. This will encourage the kind of algo-speak self-censorship now common on TikTok etc, just more effectively because it at least tries to explain the rules.
      • NickHodges0702 8 hours ago
        Nick Hodges here -- one of the developers.

        I get that objection, and we are certainly very uninterested in that becoming the norm. The idea, of course, is to try to prevent comments that we want prevented and that aren't helpful.

        Different bloggers and different communities are going to define that differently. That is why we are making a good-faith effort at allowing sites/people/groups to tweak this as desired.

        Thank for your feedback.

      • tclancy 8 hours ago
        Revision Requested This comment would be sent back for revision with feedback.
    • NickHodges0702 9 hours ago
      Hey, Nick Hodges here, one of the builders of this.

      First, Thanks so much for trying this out and giving us feedback.

      Have you tried adjusting the settings on the left side? For instance, reducing or eliminating dog whistle checks?

      • esperent 8 hours ago
        The whole point of using AI in this situation is context. So if the initial conversation is about a "Christmas movie" and someone uses the phrase "Christmas party" in a reply and gets flagged for Christian dogswhistle propaganda, that's a sign the system isn't working - even with the dogswhistle setting turned up.
      • ceejayoz 8 hours ago
        > For instance, reducing or eliminating dog whistle checks?

        I'm sure that'll help, but I'd imagine it's not an option available to me as a commenter on a real website using your tool?

        • NickHodges0702 8 hours ago
          No, but it would help us know the defaults better......

          Thanks again for trying it. Really grateful.

        • NickHodges0702 8 hours ago
          ...but yeah, it 100% shouldn't flag "Christmas Movie" unless specifically told to.

          Same for the phrase "Horrible people" -- that isn't necessarily in and of itself a bad thing to say.

    • vintagedave 6 hours ago
      Just to update, the "Of course it is!" bug is now fixed, same with the 'horrible people' one. Thankyou very much for that :)

      The note on language getting weird -- yeah. We hope that by keeping it up to date, we can be as far (or close to it) as language changes. I agree: that trend is concerning.

  • konaraddi 6 hours ago
    I think that’s an awesome idea and I like that it proactively gets ahead of the problem instead of the retroactive approach like moderation today. I’m interested in a very similar goal; I’ve been working on a guide on anti patterns in internet discourses at https://odap.konaraddi.com in hopes of it being used to make discourse on the internet more productive and pleasant (the guide is a work in progress).
    • vintagedave 6 hours ago
      Thankyou -- and wow that looks an amazing site. We desperately need more pleasant discourse (I think HN in general is a great example of good discourse, by and large) and I feel like you've codified some excellent rules.
  • mwachs 3 hours ago
    I really love the tone you have for this product. I also vibecoded a thing (much more niche--https://peeps.biz/about) and felt the freedom to inject my own tone on it because it's personal. These apps/services are feeling more like zines or an indie band than a company seeking world domination or VC investment, and I think that's pretty neat.
  • Zak 3 hours ago
    I thought about making something like this prior to LLMs. This version is more sophisticated than what I had in mind.

    I think its response to this comment could use some work:

    > The Glock 19 is a great answer to this position.

    It detects spam for off-topic product promotion, but gives it a toxicity score of zero even though it recognizes that a Glock 19 is a firearm. Suggesting that a weapon is a good answer to someone's position on a topic other than weapons should probably be interpreted as a threat.

  • altairprime 6 hours ago
    How can I apply this system to a random discussion archive page at HN in order to evaluate it more efficiently as a discussion guidance mechanism? I don't want to see usernames in that example, and I don't want a dynamic example either — but I think it would be much easier to convince HN that your AI product is worthwhile if you present an HN-specific example. Specifically, I suggest you take an HN discussion (the HTML is very simply structured), pipe each comment through your engine, and append the <div style="background-color: soft-blue;"> "Your comment etc etc" responses that would have been shown to each comment in the discussion.

    Looking at the most popular results for " " on HN Algolia, I would recommend selecting a post that has at least a few hundred comments and is also about HN or YC or YC-adjacent people (since the mods are extra light-touch on such posts), in order to take the best possible sample for unmoderated discussion to evaluate Respectify against. This post is a good example that fits those criteria; I didn't pay attention to it at the time and I haven't assessed the discussion beyond 'total comment count >= 500': https://news.ycombinator.com/item?id=40521657

    I recognize that's theoretically a lot of effort, but from a coding standpoint, it's simply `for $comment in $dom.xpath(/blah/blah/comment) { $ai.eval($comment); undef $comment.username; $comment.append($respectify.bulleted_list_with_html_colors); }` for what has the potential to be an extremely convincing demo to the target audience of us here.

    • vintagedave 6 hours ago
      That is an excellent idea. It is 2AM my time, but I may set Claude going and check in the morning. (I tend to keep a closer eye on AI coding than that usually!)

      The preset articles and trying out comments were intended to be something similar: see a topic, see how it works. But running it on each and every comment on an existing thread is really powerful.

      There may be privacy concerns? General respect? I don't want to tie assessments to specific commenters, who published in good faith not expecting some kind of automated review, nor thereby imply they commented poorly for example. But I'll code it up on my end and see what we can do with it. It's truly a very nice idea.

      • altairprime 5 hours ago
        If you look through the history of the Show HN category, you will see a near-endless stream of "analyzing HN discussions" prior arts that may offer some assurances. I'm not suggesting removing usernames because anonymity is involved — it is not! — but, instead, to specifically focus viewers on the substance of the comments rather than the who of them. That's also why I chose something from two years ago, so that there's no appropriate reason to witchhunt over the past. (Some may still, but nothing short of evaluating AI-generated data will stop them, and you can't make a reasonable case using AI to evaluate AI.)
  • thelock85 8 hours ago
    Seems like you need this when you don't have agency to go find your preferred online group(s) which might be tied to larger personal challenges in healthy communication and productive conflict. I don't know how tech solves that problem. The broad use case here would just create a new "respectified" category where members (assuming they have the attention span to be guided on comments) try to conform. I suppose that could be helpful in hyper-local or team-level contexts where there is a shared interest to conform around.
    • NickHodges0702 8 hours ago
      Our "target market" right now is a blogger that would like to turn on comments, but has turned them off because they get toxic really quickly.
  • selcuka 4 hours ago
    Q: Die Hard: Is it a Christmas movie?

    A: Of course it is. It was released on a sunny day, and that makes it a Christmas movie.

        [x] Published
        Relevance Check
        On-topic: Yes (confidence: 90%)
  • npunt 8 hours ago
    Love the effort here, been thinking about what this kind of tool might look like for a while. Something like this coupled with better prosocial affordances in the medium will do a lot to improve discourse online. I wrote up one a while back [1] but things like that are only a small part of a much bigger picture.

    The overall problem needs to be tackled from all angles - poster pre-post self-awareness (like respecify but shown to users before posting), reader affordances to reflect back to poster their behavior (and determine if things may be appropriate in context vs just a universal 'dont say mean words'), after-post poster tools to catch mistakes (like above), platform capabilities like respectify that define rules of play and foster a enjoyable social environment that let us play infinite games, and a broader social context that determine the values that drive all of these.

    [1] https://nickpunt.com/blog/deescalating-social-media/

    • NickHodges0702 8 hours ago
      I'm grateful for the thoughtful feedback, thanks.

      Your blog post will be read. ;-)

  • skybrian 7 hours ago
    I like the concept. Not sure about the specifics.

    I read somewhere that much of the market for robot vacuum cleaners was people who already had pretty clean houses and wanted to do even better. Similarly, I imagine this will appeal more to people like me who genuinely want to improve how they interact?

    If someone started a forum for people who like this sort of tool, maybe I'd be into it.

    I'm not wild about the name. It seems more confrontational than aspirational, like it's for people who want others to treat them with respect. But we do need moderation tools so maybe it's good.

  • Johnny_Bonk 4 hours ago
    I think this is a great idea, it seems you have a GOOD faith approach and contribution and kind of surprised how many people just love to tear things apart. Hopefully you get some good learnings and keep improving
    • pibaker 3 hours ago
      If by "tear things apart" you mean "people quickly finding out the flaws of the system" then yeah, I support people tearing it apart. Adversarial testing like this is how we find out if something actually works against bad actors or not.
    • ceejayoz 3 hours ago
      > kind of surprised how many people just love to tear things apart

      Or, say… hack things apart, to see how they work?

      Someone should make a website for these… hacking people. So they can get their news.

  • reconnecting 9 hours ago
    What I've seen, the difference between spam detected or not is https://www before the domain name.

    Here is an example of successful passing of all checks:

    > Published This comment passes all checks and would be published.

    Score: 5/5 | Not spam | On-topic: Yes | No dogwhistles detected (confidence: 100%)

    Can confirm. We hit this exact issue running tirreno www.tirreno.com (open-source fraud detection) on Windows ARM — libraries were auto-selecting AVX2 through emulation and batch scoring was measurably slower than just forcing SSE2. The 256-bit ops get split under the emulation layer and the overhead adds up fast in tight loops. Pinned SSE2 for those builds. Counterintuitive but throughput went up.

    • NickHodges0702 8 hours ago
      Hey, Nick Hodges here, one of the builders of Respectify --

      Thanks so much for trying it out and giving us feedback. I'm grateful.

      • reconnecting 8 hours ago
        You're welcome, Nick!

        On a separate note, if this is a real product, you might need to pay particular attention to data processing agreements etc., as the current T&Cs and Privacy Policy are actually missing how you process the input data, what you use, how long/where you store it, etc.

        • NickHodges0702 8 hours ago
          Thank you! This is very important, and I'm thankful (and a little surprised!) that you read it! ;-)
          • reconnecting 8 hours ago
            Perhaps this is my professional deformation, but when I visit a website, I start with the Privacy page.
            • NickHodges0702 8 hours ago
              I get that -- good idea, actually. Would that we all did that.

              For the record, we store zero comments from anyone. If you are using Respectify, we'll know the URL of your site and that is it.

              All comments are processed and completely forgotten.

              I'll get the TOS and the Privacy Policy improved/updated.

              • reconnecting 8 hours ago
                > All comments are processed and completely forgotten.

                This is secure in terms of privacy but not safe in terms of operations, because if it gets even a little scale, your demo will soon enough be used to fine-tune spam comments for free.

    • vintagedave 8 hours ago
      Fascinating that www makes a difference. We taught it a variety of samples of different spam approaches. This is something we can look at!

      I am super glad to see that comment passes — as it should. I would rate that one well too. Thankyou!

  • amarant 2 hours ago
    I like the general idea, but try playing devil's advocate with it! I went with the topic of trump and transgender rights and tried to formulate a pro-trump comment that would pass. It was not easy! I was downright civil by the end of it, describing a somewhat credible viewpoint that might lead someone to argue the "wrong" side. It seemed to find the opinion itself offensive, and how can you have a discussion if only one opinion is allowed?

    Meanwhile my first, low effort comment arguing the "correct" opinion got published directly.

    Conservatives are gonna scream that their views are being censored by this tool, and as it currently stands, I'd have to agree with them!

    Edit to add: it did a lot better with non-political topics, and if I'm being honest, I've never ever seen a productive discussion online on a political topic. I'm not sure they can exist! So I would honestly want this tool for any forum I'm interested in viewing. I think. Pending further testing.

  • protocolture 7 hours ago
    I like the tool, I respect the tool, and I wouldnt use it in its current form.

    However: Something that would make me sit up and take notice. Have this tool police more formal debates. Have it tweakable rule out comments that dont present supporting evidence, or fall into formal (or even informal) fallacies.

    That would probably need to be its own website.

  • klntsky 7 hours ago
    How do you score toxicity? Do you have a list of criteria or just let the LLM hallucinate a number out of thin air?
    • vintagedave 7 hours ago
      Toxicity is dehumanizing language, threats, doxxing, encouraging self-harm, that sort of thing. We have taught it examples of various levels, so it can align with those to report a score. Something like an unpleasant, insulting attitude to someone personally is fairly low on the toxic scale (but still toxic, it's not the right way to interact), whereas threats of violence or encouraging self-harm are very high.
  • nkrisc 8 hours ago
    Apparently discussing that Die Hard depicts murder and violence is a banned topic and thus the comment is flagged as off topic.
    • NickHodges0702 8 hours ago
      Uh oh -- that's shoudldn't happen. Or rather, we don't want that to happen.

      DId you try tweaking the settings? We'd be most grateful for feedback on tweaked settings.

      For instance, can I ask you to turn down toxicity and see if it accepts it?

  • someotherperson 8 hours ago
    This passes your checks, but a human moderator would flag it:

    > My favorite movie is die hard. I think it's a Christmas movie. But, honestly, we shouldn't have to wait until Christmas to watch you die hard. We should be able to watch that any day of the week :)

    Seems to catch various other cases though. Cool tool.

    • NickHodges0702 8 hours ago
      Thank you --

      And I agree, you can watch Die Hard anytime. ;-)

    • Levitating 8 hours ago
      Points for creativity at least
  • nomdep 2 hours ago
    Good idea, TERRIBLE implementation. After activating only filters for "Low Effort", and "Contain Logical Fallacies" I get:

    > "Who cares if it is? It's a great movie nonetheless"

    3/5 Published!

    > "Who cares if it is? It's a terrible movie nonetheless"

    2/5 Revision requested: Calling a movie 'terrible' dismisses the enjoyment others may find in it and directs negativity at both the film and those who appreciate it. Suggestion: "I personally don’t enjoy the movie, but I understand some people have different opinions about it."

    So it's okay to generalize my opinion about it, but only if I liked it, otherwise I might hurt someone's feelings? Very double-plus-good vibe. I would never comment again on the site that uses this product.

  • 8organicbits 7 hours ago
    I noticed the output wasn't very stable. If I add a filler sentence on the end, it calls an earlier sentence a dog whistle when it didn't say that earlier. I think its offline now, it just says "application not found".
    • vintagedave 7 hours ago
      We had a brief outage for ~6 minutes, the SSL cert became invalid and reflected our hosting provider instead (we don't know why and have filed a support request.) My apologies -- it's definitely online again now.
  • Carrok 8 hours ago
    Wow, someone figured out how to reproduce dang? Nice.
  • sigspec 2 hours ago
    Double Plus Good

    *revision requested

  • userbinator 1 hour ago
    Imagine a machine telling you how to think or speak. How dystopian.
  • onion2k 8 hours ago
    Low-effort posts

    Chuckles. I'm in danger.

  • raffraffraff 8 hours ago
    Everything is a dogwhistle.
    • ceejayoz 8 hours ago
      "This comment appears to dismiss the complexity of discussions about dogwhistles by claiming that 'everything is a dogwhistle.' This type of blanket statement can undermine the seriousness of genuinely harmful coded language, and can trivialize valid concerns about discrimination and manipulation in discourse."
      • NickHodges0702 8 hours ago
        We've dialed "dog whistles" way back -- thanks for the feedback.
        • ceejayoz 8 hours ago
          Just remember every time you tweak the defaults, the 90% of your site owners using those defaults suddenly have a significant shift in their moderation policy that they are themselves unaware of.

          (I moderated a vBulletin forum in the 1990s. This shit gets really, really, really hard, and no one is ever really happy with it.)

          • NickHodges0702 8 hours ago
            Sorry -- should have been more clear: We are shifting the defaults on the demo site, not on respectify itself.

            Thanks for a great point, though. Finding the best defaults will be very important, and we can't tweak it like that very often if at all.

          • NickHodges0702 7 hours ago
            >>(I moderated a vBulletin forum in the 1990s. This shit gets really, really, really hard, and no one is ever really happy with it.<<

            I feel that. I used to moderate the Object Pascal Compuserve forum. That was hard enough!

            • ceejayoz 7 hours ago
              This one was for gamers.

              I’m pretty sure we created a few budding lawyers out of some high schoolers.

  • dogclaw 7 hours ago
    pricing page failed - Plans error: fetch failed
  • darqis 8 hours ago
    Definitely needed, especially in the Fediverse. Holy crap the edgelords there or on Facebook. You comment something neutral, skeptical, response is either straight insults or completely disagreement and then insults, ad hominem or strawman/gaslighting.

    Yesterday I dared to write I like X now, it's clean of all the edgelords who went to Bluesky or the Fediverse. Cancel culture on Twitter was over the top. Reaponse, Cancel Culture doesn't exist. My response, it absolutely does. His response, No it doesn't you Nazi something something or other. Err, what?

    X has the most up to date information for tech circles.

    People on BS mostly repost and rage about posts on X. Fediverse are the different kind of refugees. Mastodon has critical design flaws. It's not a future proof system. And Cancel culture is absurd. BTW 5 people reported me for saying that Cancel culture absolutely exists, all from the same instance. Lol. The hypocrisy is unreal.

    In any case, I think people forgot or never learned how to respectfully disagree and have a conversation with people who don't agree with them.

    Something like this is direly needed.

    • NickHodges0702 8 hours ago
      Hey, thanks so much for the feedback. We agree. ;-)

      One of our goals is to just make the edgelords and trolls go away -- if they want to comment, they have to be nice. If they can't be nice, they can't comment (A gross over-simplification, but you get the idea.....)

      One feature we are going to add is a "Here's your feedback, but press here to post anyway" as an option for users to have. At teh very least, make someone stop and think about what they are saying.

    • ceejayoz 8 hours ago
      "The comment mentions 'Cancel Culture' and uses terms like 'edgelords' and 'Nazi' in a context that dismisses and trivializes serious issues. This reflects a trend in discussions that equates legitimate critiques of harmful behaviors with extreme labels, undermining constructive dialogue and signaling acceptance of toxic rhetoric."

      "Using phrases like 'Holy crap the edgelords' can come off as dismissive and disrespectful towards a group of people. It’s better to express concerns about behaviors or actions instead of labeling individuals harshly."

      "Describing cancel culture as 'over the top' expresses a strong negative opinion without offering specific reasoning. It’s more effective to explain what aspects seem excessive to help others understand your perspective."

      "Using phrases like 'the hypocrisy is unreal' can come across as dismissive and sarcastic, which may alienate others from the discussion. It’s beneficial to explain what seems hypocritical instead of making broad statements."

      (I picked the "why it's hard to escape an echo chamber" context option, for full disclosure.)

      • NickHodges0702 8 hours ago
        Thanks so much. This is like gold to us.

        The defaults we have set are clearly too high. That comment should be exactly what we should approve. Thanks for trying it.

        • ceejayoz 8 hours ago
          So this is a good illustration of the problem.

          If it were my site, "I like X now" would be a red flag.

          I don't think you're gonna AI your way out of this part of things for some time, and it really is the core challenge to content moderation; it's heavily opinion and circumstance based, in a way current models really struggle with.

          • NickHodges0702 8 hours ago
            I appreciate the comments, thanks.

            Well, we are going to give it a try!

            Thanks again...

            • ceejayoz 8 hours ago
              I genuinely wish you luck. It's a worthy goal.

              (lol, this got "Comment appears to be low effort". Ouch!)

  • latchkey 7 hours ago
    Interesting, I've been thinking about integrating something like this into https://oj-hn.com in order to help improve the comments on this site.
  • ddingus 3 hours ago
    I basically hate this thing. Sorry team. I know you are trying and you believe in your effort.

    I know your intent is in the right place too.

    But, here's the thing:

    I value real conversation. It is the only conversation worth having.

    This is a step toward Disneyland type conversation. And we don't live in Disneyland!

    Profanity is a part of speech. There are ugly things, ideas and people in this world and that is what the profane gets at.

    As for offending others... hoo boy!

    Let us start with a hard to process reality: we all are as offended as we think we are.

    What prevents others from abusing that reality to push an agenda, gain position in the rhetoric, and more?

    Not much.

    Worse, we do not control others. Many attempts at doing that fail. This one is extremely likely to fail too.

    What do we control?

    How we respond to offensive speech!

    And we have options, but a person wouldn't know that because the number one response is righteous indignation!

    There are so many other choices!

    We can just ignore speech we don't like.

    We can employ humor! When an ass gets called one by a clown, I laugh! It is laughable.

    Same for the people blowing pages discussing who is the bigger asshole. I say they all deserve that conversation.

    We can redirect by asking a direct question, or by making the subject of our response more germane to the topic at hand too.

    There are many more options that make a hell of a lot more sense than blathering on with righteous indignation fueling it full on.

    Now, here is another dynamic in the same vein:

    Say I declare someone is a racist! Just full on judge them on the spot hard.

    They are not gonna like that too much are they? Nope. And what is worse, if we are in a position to do some advocacy, the person so harshly judged won't hear any of it.

    And being judged like that sticks. Say they stop being racist. They still gotta live with that crap for a long time.

    Now, we could say, "are you sure you want to say that? It comes off racist to me."

    The idea being you offer help or a way for them to see the harm, while also giving them an out so they are not judged harshly.

    They could reconsider next time, or just stop and that is great! They won't have to fight down ugly exchanges.

    I could go on for pages. I believe I said enough to make my point.

    We can only control how we respond to speech we don't like.

    Attempting to control others to the point where they simply cannot offend or cause grief means we also have sanitized our discourse to the point of being worthless.

    No thanks.

    I have a very thick skin. Others do too.

    More of us can manage how we respond and if we put half the energy we put into trying to control others it would be much better.

    • BikiniPrince 32 minutes ago
      Such wrong think! I think any forum that adopts this would be worth studying for simply helping to create a score to rate how soulless a site is. Maybe they could make the inverse that would rewrite whole forums for happy thoughts. Rose tinted glasses client? The name needs work.
  • cookiengineer 6 hours ago
    Take my upvote! That's a really novel approach to the misinformation crisis and I love the product idea. It would be pretty awesome with a plugin system so that you can integrate it with other websites, too.

    Wish you the best for it!

    PS: the website is _really_ slow on Android Firefox. I had to use my Desktop system to try it out.

  • pranaywankhede 35 minutes ago
    [dead]
  • PeterWhittaker 7 hours ago
    Huh. Commented upon echo chambers and cults and was told "Request failed: fetch failed". Tried a private session as well, just in case my previous UBI comments had polluted things, but no love. Was it the length? FWIW, here's my comment....

    A great many words surround what seem to me to be red herring arguments and arbitrary definitions and groupings, with the word cult appearing in the article precisely 8 times without any justification for the statement in the headline. Moreover, the sentence "We can pop an epistemic bubble simply by exposing its members to the information and arguments that they’ve missed" seems woefully naive: By the definition included in the article, traditional views re the roles of women or blacks in society would be epistemic bubbles and not echo chambers, and women's right were not advanced and slavery not eliminated through the bringing of facts, but through long, arduous moral struggles to convince at least a majority that women and blacks merited the same rights as men and whites.

    But it liked my comment on UBI and potential cost reductions through elimination of fraud detection and mitigation, so obviously it does things well. 1/2 /s? :->

    • vintagedave 6 hours ago
      Hi. Apologies for 'fetch failed' - we had 5-6 minutes of downtime where the SSL cert suddenly reflected our hosting provider, not us. Exactly what you want when you're getting attention on HN ;)

      I tested your comment just now, and made some specific tweaks in response (we've done that with a lot of the feedback here.) In my testing it liked the comment.