top of page



How do we tackle noisy recognition?
Something I've been thinking about a lot lately is how humans handle noisy recognition. Maybe you recognize the image above, if not you...
Ethan Smith
6 days ago13 min read
107 views
0 comments


The Hard Problem of Psychiatry
The Psychiatrist's Office Once upon a time, I was set on becoming a psychiatrist. I had always been deeply interested in psychology....
Ethan Smith
Apr 828 min read
24 views
0 comments

On Vibe Coding
The Distillery - a look under the hood of vibe coding Introduction Vibe coding may be one of the best and worst things 2025 has had to...
Ethan Smith
Mar 289 min read
112 views
0 comments

Boneless Attention and Low Rank Attention Layers
I’ve seen a lot of convoluted tutorials on attention but nothing really made it click for me more as understanding as mixing a projected...
Ethan Smith
Mar 238 min read
414 views
0 comments


There are probably a lot of special people.
One conviction I hold very strongly is that "special" people are possibly much more common than we may be lead to believe. While sure, we...
Ethan Smith
Mar 2113 min read
126 views
0 comments


The Need for Relative Optimizers | Hypothesis on Muon
Presently, most optimizers used in deep learning do not explicitly accommodate their updates with respect to the expected range of...
Ethan Smith
Mar 1811 min read
453 views
0 comments


Minimum Faith
Within the study of machine learning, you'll often hear that the objective is to find the solution that maximizes likelihood . We have a...
Ethan Smith
Mar 147 min read
55 views
0 comments

Softmax Attention is a Fluke
Calibrated Attention Calibrated Attention NanoGPT Attention is the magic ingredient of modern neural networks. It is the core of what has...
Ethan Smith
Mar 1310 min read
2,671 views
1 comment

Kolmogorov Complexity
Mandelbrot function: Zn+1 = (Zn)^2 + C | Location: -1.4732524061369524549 + -0.0058138265122775765014 i , Radius:...
Ethan Smith
Mar 114 min read
148 views
0 comments

To create something new, you need to make some noise.
One of the most interesting things about the development of AI was the order of achieved milestones. Relatively small models can create...
Ethan Smith
Feb 127 min read
286 views
0 comments

How I like to think about diffusion
It's a bit hard to see in the diagram but in addition to being convolved with a gaussian, these points are also drifting towards zero....
Ethan Smith
Jan 262 min read
157 views
1 comment

Classifier free guidance and reinforcement learning
https://sweet-hall-e72.notion.site/Classifier-Free-Guidance-to-Approximate-RL-9f78c02801c6434da61f37c8d843c5bf
Ethan Smith
Jan 261 min read
70 views
0 comments

The Tough Case for Free Will
"Would someone without free will do this?" is one of the most common responses I've heard to the proposition that free will might not...
Ethan Smith
Jan 1311 min read
94 views
0 comments


How absurd it is, that anything is at all.
Art from "In the Aeroplane over the Sea" by Neutral Milk Hotel There are several dimensions by which we may feel small. You may feel...
Ethan Smith
Jan 1210 min read
251 views
1 comment


AI should help humans understand each other.
Empathy through technology
Ethan Smith
Oct 18, 202410 min read
421 views
0 comments

Why are Modern Neural Nets the way they are? And Hidden Hypernetworks.
https://sweet-hall-e72.notion.site/Why-are-Modern-Neural-Nets-the-way-they-are-And-Hidden-Hypernetworks-6c7195709e7b4abbada921875a951c54
Ethan Smith
Oct 6, 20241 min read
163 views
0 comments

Do Diffusion Transformers Deserve The Hype?
https://sweet-hall-e72.notion.site/Do-Diffusion-Transformers-Deserve-The-Hype-9b9ca7bead374b47aac96558714c203b
Ethan Smith
Jul 28, 20241 min read
238 views
0 comments

Automated LoRA Discovery and Teaching Neural Networks to make Neural Networks
https://sweet-hall-e72.notion.site/Automated-LoRA-Discovery-and-Teaching-Neural-Networks-to-make-Neural-Networks-22aa3b5ad66e4bc985ff2c93...
Ethan Smith
May 26, 20241 min read
263 views
0 comments

Diffusion and Autoregressive Models for Learning to Solve Mazes
https://sweet-hall-e72.notion.site/Diffusion-and-Autoregressive-Models-for-Learning-to-Solve-Mazes-c3bc4bcdfa304ecd9531ee5445a4da66
Ethan Smith
May 21, 20241 min read
361 views
0 comments

I Learn to Diffuse, or Data Alchemy 101: a Mnemonic Manifesto
https://arxiv.org/pdf/2208.03998
Ethan Smith
May 12, 20241 min read
162 views
0 comments
bottom of page