All Blogs

AI Agent Memory

AI Agent Memory

In this blog, we will learn about AI Agent Memory - why agents need it, the memory stack, the four core operations (write, read, update, forget), how memory flows at runtime, and the common mistakes.

AI Agent Explained

AI Agent Explained

In this blog, we will learn about the AI Agent - what it is, how it is different from a plain LLM, its five core parts, how it works end to end, the main types, and when to use one.

RMSNorm (Root Mean Square Layer Normalization)

RMSNorm (Root Mean Square Layer Normalization)

In this blog, we will learn about RMSNorm, a faster and simpler alternative to Layer Normalization that powers most modern Large Language Models like Llama, Mistral, Gemma, Qwen, PaLM, and DeepSeek.

Decoding DeepSeek-V4

Decoding DeepSeek-V4

In this blog, we will learn about DeepSeek-V4, the new family of open Mixture-of-Experts language models that natively supports a one-million-token context with dramatically lower inference cost.

LoRA - Low-Rank Adaptation of LLMs

LoRA - Low-Rank Adaptation of LLMs

In this blog, we will learn about LoRA - Low-Rank Adaptation of Large Language Models.

Math Behind RoPE (Rotary Position Embedding)

Math Behind RoPE (Rotary Position Embedding)

In this blog, we will learn about the math behind Rotary Position Embedding (RoPE) and why it is used in modern Large Language Models.

Grouped Query Attention

Grouped Query Attention

In this blog, we will learn about Grouped-Query Attention (GQA) and how it differs from Multi-Head Attention (MHA).

Math Behind Cross-Entropy Loss

Math Behind Cross-Entropy Loss

In this blog, we will learn about the math behind Cross-Entropy Loss with a step-by-step numeric example.

Math Behind Gradient Descent

Math Behind Gradient Descent

In this blog, we will learn about the math behind gradient descent with a step-by-step numeric example.