The future of code search is not regex – 100x faster than ripgrep

(fff.dmtrkovalenko.dev)

26 points | by neogoose 3 hours ago

10 comments

  • kristopolous 2 hours ago
    I ran across this fascinating tool a few days ago researching embedding models on hugging face.

    Advertised as "ColGREP Semantic code search for your terminal and your coding agents",

    I haven't put it in any harness yet but I probably should.

    https://github.com/lightonai/next-plaid/tree/main/colgrep

    I've also tried astgrep (also known as sg) but llms really mess up on them. I think you'd need to fine tune.

    If anyone has cracked that case I'd love to hear about it

  • genewitch 57 minutes ago
    considering that ripgrep has marginal overhead over just reading the files to /dev/null, how exactly does this achieve 100x speedup?

    I have a lot of use for something that can search ~1GB of text "instantly", but so far nothing beats rg/ag after the data has been moved into RAM.

    • anilakar 53 minutes ago
      The trick to optimization is not "doing faster" but "doing less". I already feel rg is missing a ton of results I want to see because it has a very large ignore list by default.
  • swiftcoder 31 minutes ago
    Is there a write up of the underlying approach? The summary on the repo mentioned SIMD, but not a whole lot else.
  • pjmlp 14 minutes ago
    It has never been ripgrep for decades for those of us on IDEs.
  • neogoose 3 hours ago
    I have open sourced the fastest code search implementation. Comprehensive SDK for both file finder and grep file search that is over 100x faster than ripgrep
    • siva7 1 hour ago
      I don't get this submission title. Your tool uses regex but the title claims the future is not about regex.
      • molszanski 37 minutes ago
        I think it is about input. Before I had to type regex, now I just type text and fuzzy finds more, regex style. Awkward wording, but code seems cool.
    • MaxMonteil 3 hours ago
      This looks cool!

      You should add a link to the GitHub repo for the project itself, at first I wasn't even sure what it was called.

      I found this link https://github.com/dmtrKovalenko/fff.nvim

  • dig1 35 minutes ago
    ctags, GNU Global and even "ugrep -Q" would like to have a few words with you ;)
  • globular-toast 1 hour ago
    Why is it "for neovim"? Surely such a thing would be useful in many applications?
    • ramon156 50 minutes ago
      Because it's being dishonest from multiple angles.

      - it has regex, so the title is weird - it definitely wouldn't be 100x faster than rg - its an sdk, so its apples to oranges anyway

  • asdfadsfaf 39 minutes ago
    I don't get it how can I search anything but the file name?
  • schrodinger 1 hour ago
    How's it work? Embed tokens and use euclidean distance or something?
  • forrestthewoods 7 minutes ago
    Websites that don’t tell me what they’re doing are infuriating. I’m on mobile. This landing page experience is awful.