19 comments

  • barlines 34 days ago
    Claude Code now does this natively without need for a 3rd party browser like Comet: https://support.claude.com/en/articles/12012173-getting-star...
  • DougN7 30 days ago
    I didn’t realize AI could interact with browsers like this already (guess I’m naive). Isn’t this setting up for the scenario where the AI is duped into logging into your bank account and transferring your money away? Not sure I have enough trust to allow an AI to touch a browser.
    • observationist 30 days ago
      People are already going full Leroy Jenkins with this stuff, and OpenAI, other labs are snarfing up their usage data. Hopefully with their brave sacrifice, they can figure out all the security pitfalls before it becomes common enough that someone with a clever jailbreak ends up pulling of a billion dollar heist, or orders pizza for half the country.

      It's 100% absolutely not safe yet. You can effectively copy and paste Pliny prompts and pwn any of the frontier lab models. Anyone with a little time and creativity can tailor a unique one and set hidden text traps for AI browsers or agents, and depending on what access you've given the software it could be very dangerous.

      • rvz 30 days ago
        Great time to be an offensive security researcher specialising in researching LLM adversarial attacks.
        • observationist 30 days ago
          Yeah - the red team folks probably have one of the most fun jobs in the world right now.
          • luckman212 30 days ago
            Depends on your definition of "fun"
    • joenot443 30 days ago
      There are folks on X running vibe-coded Polymarket arbitrage bots playing with hundreds of thousands of dollars. Some people have pretty wild risk tolerances!
    • tbocek 30 days ago
      That's a valid concern. I took a more constrained approach for web searches for exactly this reason. Instead of giving the LLM full browser control, I built a Firefox extension that only handles web search client-side.

      When my local LLM (llama.cpp) needs to search, it opens DuckDuckGo in a new window, loads the result pages in tabs, extracts content with Readability.js, and feeds it back. You stay in the loop - can see what's loading, solve captchas if needed. Less autonomous than Comet/Playwright, with a narrower use-case, but also less risk.

      Its still a prototype though: https://github.com/tbocek/llm-local-web-search

    • wat10000 30 days ago
      It's totally setting up for exactly that scenario. You need to ensure the browser that it uses is totally unprivileged if you're going to do this, or at the very least that it can only access a small set of trusted destinations.
  • EMM_386 30 days ago
    "claude --chrome" does this out of the box and works pretty well.
    • lostmsu 29 days ago
      Is it included in "--yolo"?
  • desireco42 30 days ago
    I tried it. My Perplexity premium expired, maybe that is it, but it barely did anything.

    When I put prompt you suggested, it did open Perplexity in Comet and then I guess didn't get response even though Perplexity did research, so it used regular search mcp to get results...

    It is cool idea, this is what I would like to have, something to automate boring stuff. Find all LinkedIn connections that are not active and remove them from my network for example.

    I don't think it is your mcp or code, as tech is just not there yet. It is much easier to accomplish this through other automations.

  • robertheadley 32 days ago
    I was going to ask what makes this better than just using Playwright and this largely answers that question. I will have to try it out and see how it compares.

    I haven't really had luck with MCP in general for quite a while though. I have just been using Google Antigravity for most of my vibe coding needs.

  • prettyblocks 30 days ago
    I've used chrome devtools mcp successfully to do all kinds of advanced in browser tasks, agents like claude code can write js and inject it into the context in a live browser and do all kinds of neat tricks. I've used this extensively in gemini-cli.
  • 404softwarelabs 33 days ago
  • nurettin 30 days ago
    I noticed that /plugins

    Doesn't lost plugins anymore. I'm sure I installed playwright using that menu, but now it lists no plugins (and the plugin can't be found locally)

    However, claude add mcp and /mcp still works.

  • bpavuk 30 days ago
    a brittle MCP that connects a brittle (unless using Opus 4.5) CLI to a brittle browser? (see: Scamlexity, an actual vulnerability name)

    I trust Claude in Chrome a lot more, and I trust my own hands and eyes most.

  • daliusd 30 days ago
    There is Browser MCP that works reasonably well: https://browsermcp.io/

    What's the difference?

  • giancarlostoro 30 days ago
    I was just thinking to myself this morning, I wonder if I can make Claude Code and Comet work together... Now I have my answer!
  • nickpeterson 30 days ago
    Anyone know of any good articles around having claude code build playwright test suites for a given website and parameters?
  • mrcwinn 30 days ago
    Claude in Chrome is excellent - as is Claude in Excel. I was shocked at how useful the latter is.
  • dbbk 30 days ago
    They literally already have Chrome integration… sorry you wasted your time
  • ramon156 30 days ago
    I just used puppeteer for this until it came natively
  • jMyles 30 days ago
    Any notions of how this differs from Vibium?
  • thedudeabides5 30 days ago
    nice, literally had claude tell me it couldnt do browser today.
  • nubskr 30 days ago
    Another day, another MCP server. Wake me up when we stop needing a new protocol for every AI tool to talk to every other AI tool.