3 comments

  • rootnod3 1 hour ago
    Sorry, so the tool is now even circumventing human review? Is that the goal?

    So the agent can now merge shit by itself?

    Just the let damn thing push nto prod by itself at this point.

    • baxtr 5 minutes ago
      I’m not saying this is, but if I were a malicious state actor, that’s exactly the kind of thing I’d like to see in widespread use.
    • ljm 56 minutes ago
      Someone’s gonna think about wiring all this up to Linear or Jira, and there’ll be a whole new set of vulnerabilities created from malicious bug reports.
    • literalAardvark 56 minutes ago
      In some workflows it's helpful for the full loop to be automated so that the agent can test if what's done works.

      And you can do a more exhaustive test later, after the agents are done running amok to merge various things.

    • danenania 27 minutes ago
      I don’t think “ready to merge” necessarily means the agent actually merges. Just that it’s gone as far as it can automatically. It’s up to you whether to review at that point or merge, depending on the project and the stakes.

      If there are CI failures or obvious issues that another AI can identify, why not have the agent keep going until those are resolved? This tool just makes that process more token efficient. Seems pretty useful to me.

  • joshuanapoli 49 minutes ago
    This looks nice! I like the idea of providing more deterministic feedback and more or less forcing the assistant to follow a particular development process. Do you have evidence that gtg improves the overall workflow? I think that there is a trade-off between risk of getting stuck (iteration without reaching gtg-green) versus reaching perfect 100% completion.
  • mcolley 7 hours ago
    Super interesting, any particular reason you didn't try to solve these prior to pushing with hooks and subagents?
    • dsifry 7 hours ago
      I did! The issue however, is having a clear, deterministic method of defining when the code review was 'done'. So the hooks can fire off subagents, but they are non-deterministic and often miss vital code review comments - especially ones that are marked in an inline comment, or are marked as 'Out of PR Scope' or 'Out of range of the file' - which are often the MOST important comments to address!

      So gtg builds all of that in and deterministically determines whether or not there are any actionable comments, and thus you can block the agent from moving forward until all actionable comments are thoroughly reviewed, acted upon or acknowledged, at which point it will change state and allow the PR to be merged.

      • blutoot 1 hour ago
        I thought hooks are always fired if you use it as a PreToolUse event. Wouldn’t that work for the GitHub action tools from the GitHub mcp?