Linear Algebra Intuition
Every AI model is just matrix math wearing a fancy hat.
Vectors, Matrices & Operations
Every neural network is just matrix multiplication with extra steps.
Matrix Transformations & Eigenvalues
A matrix is a machine that reshapes space. Learn what it does to every point, and you understand the whole transformation.
Calculus for ML: Derivatives & Gradients
Derivatives tell you which way is downhill. That is all a neural network needs to learn.
Chain Rule & Automatic Differentiation
The chain rule is the engine behind every neural network that learns.
Probability & Distributions
Probability is the language AI uses to express uncertainty.
Bayes' Theorem & Statistical Thinking
Probability is about what you expect. Bayes' theorem is about what you learn.
Optimization: Gradient Descent Family
Training a neural network is nothing more than finding the bottom of a valley.
Information Theory: Entropy, KL Divergence
Information theory measures surprise. Loss functions are built on it.
Dimensionality Reduction: PCA, t-SNE, UMAP
High-dimensional data has structure. You find it by looking from the right angle.
Singular Value Decomposition
SVD is the Swiss Army knife of linear algebra. Every matrix has one. Every data scientist needs one.
Tensor Operations
Tensors are the common language between data and deep learning. Every image, every sentence, every gradient flows through them.
Numerical Stability
Floating point is a leaky abstraction. It will bite you during training, and you will not see it coming.
Norms & Distances
Your distance function defines what "similar" means. Choose wrong and everything downstream breaks.
Statistics for ML
Statistics is how you know if your model actually works or just got lucky.
Sampling Methods
Sampling is how AI explores the space of possibilities.
Linear Systems
Solving Ax = b is the oldest problem in mathematics that still runs your neural network.
Convex Optimization
Convex problems have one valley. Neural networks have millions. Knowing the difference matters.
Complex Numbers for AI
The square root of -1 is not imaginary. It is the key to rotations, frequencies, and half of signal processing.
The Fourier Transform
Every signal is a sum of sine waves. The Fourier transform tells you which ones.
Graph Theory for ML
Graphs are the data structure of relationships. If your data has connections, you need graph theory.
Stochastic Processes
Randomness with structure. The math behind random walks, Markov chains, and diffusion models.