- Published on
Math Behind Cross-Entropy Loss
In this blog, we will learn about the math behind Cross-Entropy Loss with a step-by-step numeric example.
In this blog, we will learn about the math behind Cross-Entropy Loss with a step-by-step numeric example.
In this blog, we will learn about the math behind gradient descent with a step-by-step numeric example.
In this blog, we will learn about the math behind backpropagation in neural networks.
In this blog, we will learn about why we scale the dot product attention by √dₖ in the Transformer architecture with a step-by-step numeric example.
In this blog, we will learn about the math behind Attention - Query(Q), Key(K), and Value(V) with a step-by-step numeric example.