Whenever a new piece of technology comes out (these days, mostly AI) I go to some effort to understand it. Usually I end up writing a post about it, so I can be confident that I do understand1.
Whatโs the point of doing this? Obviously my explainers about diffusion models are shallow: certainly they arenโt detailed enough to do useful research on diffusion models. Whatโs the point, then?
In my view, good engineering requires having reliable shallow intuitions about how things work. You donโt need a full understanding of how things work, or even a good enough understanding to work usefully in that area of the stack. But itโs still useful to try to minimize the number of technologies in your stack that are purely black boxes.
continue reading on seangoedecke.com
โ ๏ธ This post links to an external website. โ ๏ธ
If this post was enjoyable or useful for you, please share it! If you have comments, questions, or feedback, you can email my personal email. To get new posts, subscribe use the RSS feed.