Five Practical Lessons for Serving Models with Triton Inference Server

(talperry.com)

23 points | by talolard 24 days ago

1 comments

talolard 24 days ago
Hi HN,
I’m wrapping up a role where I spent a significant amount of time writing Triton kernels. It’s a fantastic tool, but the learning curve has some sharp edges. I wanted to share a few practical "notes from the field" for anyone moving beyond the very opaque docs.
- greatgib 19 days ago
  It is probably very interesting, but your website is broken on mobile and so unreadable: the text column on my phone is at most 5 characters wide...
  - kelipso 19 days ago
    Lol yeah, potentially interesting article since I've programmed with Triton before, so I'll bookmark it, but each line is one word long!
  - LTL_FTC 19 days ago
    FYI, I found that rotating my phone to landscape makes it readable.
- bigdict 19 days ago
  Huh? Triton Inference Server and Triton the kernel language are two distinct, very different things… Is this AI-generated?
  - LTL_FTC 19 days ago
    Just yesterday I was reading through this five-year-old post on Triton by its creator. Triton was their PhD thesis, and they coined the name before the inference server was renamed to it… if this is what you are referring to?
    Here is the Reddit thread:
    https://www.reddit.com/r/MachineLearning/comments/otdpkx/n_i...