Deep Learning

Deep Learning is a subset of machine learning that uses multi-layered neural networks to learn complex…

Deep Learning

Dropout

Dropout is a regularisation method that randomly ignores some neurons during training to improve generalisation.

Deep Learning

Neural Network

Neural Network is a model loosely inspired by the brain, made of layers of connected nodes…

Deep Learning

Activation Function

Activation Function is a function applied to a neuron's output that lets neural networks model non-linear…

Deep Learning

Softmax

Softmax is a function that turns a set of numbers into probabilities that sum to one,…

Deep Learning

Weights

Weights is the learned parameters that determine how strongly inputs influence a neural network's output.

Deep Learning

Bias Term

Bias Term is an extra learnable value added in a neuron that shifts the activation, helping…

Deep Learning

Transformer

Transformer is a neural network architecture based on attention that powers most modern language and multimodal…

Deep Learning

Attention Mechanism

Attention Mechanism is a technique that lets a model weigh which parts of the input matter…

Deep Learning

Parameter

Parameter is an internal value a model learns during training; large models have billions of them.

Deep Learning

Gradient Descent

Gradient Descent is an optimisation method that iteratively adjusts model parameters to reduce prediction error.

Deep Learning

Backpropagation

Backpropagation is the algorithm that calculates how to adjust each parameter in a neural network to…

Deep Learning

Loss Function

Loss Function is a measure of how far a model's predictions are from the correct answers,…

Deep Learning

Epoch

Epoch is one full pass of the training algorithm over the entire training dataset.

Deep Learning

Batch Size

Batch Size is the number of training examples processed before the model updates its parameters.

Deep Learning

Learning Rate

Learning Rate is a setting that controls how big each step is when a model updates…

Deep Learning

Recurrent Neural Network

Recurrent Neural Network is a neural network designed for sequential data, where each step's output depends…

Deep Learning

Overparameterization

Overparameterization is using a model with far more parameters than strictly needed, common in modern deep…