There is No Spoon. A software engineers primer for demystified ML

(github.com)

83 points | by jmatthews 7 hours ago

8 comments

janalsncm 6 hours ago
I personally think it is much more important to have strong statistical intuitions rather than intuitions about what neural networks are doing.
The latter isn’t wrong or useless. It’s simply not something a typical software engineer will need.
On the other hand, wiring up LLMs into an application is very popular and may be an engineer’s first experience with systems that are fundamentally chaotic. Knowing the difference between precision and recall and when you care about them will get you a lot more bang for your buck.
I would suggest the gateway drug into ML for most engineers is something like: we have a task and it can currently be done for X dollars. But maybe we can do it for a tenth of the price with a different API call. Or maybe there’s something on Huggingface that does the same thing for a fixed hourly cost, hundreds of times cheaper in practice.
[-]
- jmatthews 6 hours ago
  I'm just trying to develop the lens where I can see a problem and know what properties of it are meaningful from an ML standpoint.
  Coming from a specific domain where I have a sharpened instinct for how things are haven't really given me the ability to decompose the problem using ML primitives. That's what I'm working on.
bonoboTP 5 hours ago
Just read a good textbook instead of this LLM-written stuff. For example those by Murphy or Prince or Bishop. Or one of many YouTube lecture series from MIT or Stanford. There are many primer 101 tutorials and Medium posts. But if you actually want to learn instead of procrastinating, pick up a real textbook or work through a course.
[-]
- mememememememo 3 hours ago
  Or just train some NNs. If more time write the code and understand the tensor operations.
- jmatthews 5 hours ago
  I've bounced off of many good textbooks. Even Karpathy's YouTube series was too dense for me. I'm trying to come in at a more palatable level.
  This was a two day exploration where I provided the syllabus and ran through it with Claude Code, asking questions, trying to anchor it to stuff I understand well. I feel like the artifact has value.
  [-]
  - bonoboTP 5 hours ago
    I think chatting with an llm alongside a textbook can be helpful but producing learning material when you yourself are a novice is not really that valuable.
    [-]
    - NewsaHackO 3 hours ago
      Yes, and it is borderline irresponsible to even make this.
  - antonvs 4 hours ago
    It's AI slop. You're letting a machine gaslight you.
hilliardfarmer 1 hour ago
Please stop trying to trick us into reading AI generated text.
"This isn't a textbook or a tutorial. It's a mental model — the abstractions you need to reason about ML systems the way you already reason about software systems."
[-]
- thirtygeo 42 minutes ago
  I got to that and just stopped reading!
whoamii 5 hours ago
Feature request for HN, Adblocker, etc: please block pages with the text “it isn’t X, it is Y”.
[-]
- thirtygeo 40 minutes ago
  Careful, you'll just make the output generators harder to spot! Www We need to keep their 'Tells' hidden from them..
- ggambetta 1 hour ago
  Can we also ban anything where the second line is "let that sink in"? And anything claiming that "X is a masterclass in Y" (especially for (tweet, empathy))?
zar1048576 5 hours ago
Nice weekend project! Even though there are copious resources out there (textbooks, videos, etc.), those may not appeal to everyone. People have different preferred modalities for consuming information and there is always value in (correctly) reframing concepts in a way that can be better understood by people who don’t resonate with traditional textbooks and YouTube videos. I’m glad you found a formulation that works for you, and judging by the number of upvotes, it resonated with others as well. At the very least, I’m sure that working on this improved your understanding as well!
jmatthews 7 hours ago
This is my weekend project. I am building up my pattern recognition in machine learning. By that I mean see X problem, instantly think of Y solution. The primer markdown file is the artifact of that exploration.
read it from top to bottom or better have your favorite language model read it and then explore the space with a strong guided syllabus.
[-]
- janalsncm 6 hours ago
  Framing a business problem in terms of ML is indeed important. Where does classification come in, where does regression come in, when to use retrieval, when to use generative solutions. Would be a good section to add imo.
  [-]
  - jmatthews 6 hours ago
    I tried to tackle that under Topology for the problem but it may not be well named https://github.com/dreddnafious/thereisnospoon/blob/main/ml-...
- TheTaytay 2 hours ago
  I quite liked this. It feels approachable and to-the-point.
oleggromov 7 hours ago
Thank you for sharing! Saved to bookmarks to read on my free time. Hopefully I'll get some soon :)
[-]
- jmatthews 6 hours ago
  thanks Oleg
eddie-wang 4 hours ago
[dead]