
Diffusion Models

Generative models that produce realistic, high-resolution images by learning to reverse a gradual noising process.

Diffusion Models are a class of generative models designed to produce realistic, high-resolution images. They work by iteratively applying Gaussian noise to the original data in a forward diffusion process and then learning to recover the data by reversing that noising process. After training, the diffusion model can generate new data by passing randomly sampled noise through the learned denoising process. In addition to achieving strong image quality, Diffusion Models offer several practical advantages: they do not require adversarial training, and they are scalable and parallelizable.

Let's understand the diffusion process in detail:

Denoising diffusion modelling involves a two-step process:

  • Forward Diffusion Process: A Markov chain of diffusion steps is performed in which Gaussian noise is gradually and systematically added to the original data. This simulates a diffusion process in which noise accumulates over time, producing a sequence of increasingly noisy data points.
  • Reverse Diffusion Process: The reverse diffusion process learns to undo the forward diffusion. By iteratively removing the added noise in a controlled manner, it aims to reconstruct the original data from its noised version, effectively restoring it to its initial state.

Forward Diffusion Process

In the forward diffusion process, Gaussian noise is incrementally introduced to the input image x₀ over a sequence of T steps. The process begins by sampling a data point x₀ from the real data distribution q(x) (represented as x₀ ~ q(x)). Subsequently, Gaussian noise with a variance parameter βₜ is added to the previous latent variable xₜ₋₁, generating a new latent variable xₜ. This newly generated variable follows a distribution q(xₜ | xₜ₋₁), reflecting the conditional distribution of xₜ given xₜ₋₁. The gradual addition of noise over the T steps simulates the diffusion process, transforming the original input image into a sequence of progressively noised data points.

(Image credit: https://lilianweng.github.io/posts/2021-07-11-diffusion-models/)

Here, q(xₜ ∣ xₜ₋₁) is a Gaussian whose mean and covariance are defined as:

q(xₜ ∣ xₜ₋₁) = N(xₜ; μₜ = √(1 − βₜ) · xₜ₋₁, Σₜ = βₜ · I)

where I is the identity matrix, so Σₜ is always a diagonal matrix of variances. As the number of steps T approaches infinity, the final latent x_T converges to an isotropic Gaussian distribution.
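As a concrete illustration, here is a minimal sketch of a single forward-diffusion step in PyTorch. The function name forward_step, the linear βₜ schedule, and the tensor shapes are illustrative assumptions, not part of any specific implementation:

```python
# A minimal sketch of one forward-diffusion step, assuming a PyTorch tensor `x_prev`
# holding x_{t-1} and a scalar variance `beta_t` taken from an assumed noise schedule.
import torch

def forward_step(x_prev: torch.Tensor, beta_t: float) -> torch.Tensor:
    """Sample x_t ~ q(x_t | x_{t-1}) = N(sqrt(1 - beta_t) * x_{t-1}, beta_t * I)."""
    noise = torch.randn_like(x_prev)                     # epsilon ~ N(0, I)
    return (1.0 - beta_t) ** 0.5 * x_prev + beta_t ** 0.5 * noise

# Example: progressively noise a batch of 64x64 "images" with a linear beta schedule.
T = 1000
betas = torch.linspace(1e-4, 0.02, T)                    # assumed schedule, not prescribed
x = torch.randn(8, 3, 64, 64)                            # stand-in for real data x_0
for t in range(T):
    x = forward_step(x, betas[t].item())                 # x drifts toward pure noise
```

Running this loop one step at a time for all T steps is exactly the cost that the reparameterization trick below avoids.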

Reparameterization trick

The reparameterization trick addresses the computational cost of sampling from q(xₜ | xₜ₋₁) repeatedly, which becomes expensive when the number of steps is large. Instead of sampling xₜ step by step, the trick expresses the sampling operation in a form that separates the randomness from the parameters. Defining αₜ = 1 − βₜ and ᾱₜ = α₁ · α₂ ⋯ αₜ, we can write xₜ = √ᾱₜ · x₀ + √(1 − ᾱₜ) · ε with ε ~ N(0, I), which makes it possible to sample xₜ efficiently at any time step directly from x₀.

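The closed-form expression above translates directly into code. Below is a minimal sketch, reusing the same assumed linear βₜ schedule as the previous snippet (all names are illustrative):

```python
# Sketch of the reparameterization trick: jump from x_0 to x_t in one shot using
# alpha_t = 1 - beta_t and alpha_bar_t = alpha_1 * ... * alpha_t.
import torch

T = 1000
betas = torch.linspace(1e-4, 0.02, T)                    # assumed linear schedule
alphas = 1.0 - betas
alpha_bars = torch.cumprod(alphas, dim=0)                # cumulative products over steps

def q_sample(x0: torch.Tensor, t: int):
    """Sample x_t = sqrt(alpha_bar_t) * x_0 + sqrt(1 - alpha_bar_t) * eps directly."""
    eps = torch.randn_like(x0)                           # the separated-out randomness
    a_bar = alpha_bars[t]
    return a_bar.sqrt() * x0 + (1.0 - a_bar).sqrt() * eps, eps

x0 = torch.randn(8, 3, 64, 64)                           # stand-in for real images
x_t, eps = q_sample(x0, t=500)                           # no 500-step loop required
```

Because xₜ now depends on x₀ only through a deterministic scaling plus independent noise, gradients can also flow through this sampling operation during training.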

Reverse Diffusion Process

The reverse diffusion process trains a neural network to reconstruct the original data by undoing the noise introduced during the forward pass. The true reverse distribution q(xₜ₋₁ | xₜ) is intractable to estimate, since it would require knowledge of the entire data distribution. To overcome this, a parameterized model p_θ (a neural network) is employed to approximate it. When βₜ is sufficiently small, each reverse step is approximately Gaussian, so it can be parameterized by just a mean and a variance. This allows the neural network to effectively learn how to reverse the noise-induced changes and recover the original data.

(Image credit: https://lilianweng.github.io/posts/2021-07-11-diffusion-models/)

Each reverse step is modeled as p_θ(xₜ₋₁ ∣ xₜ) = N(xₜ₋₁; μ_θ(xₜ, t), Σ_θ(xₜ, t)), and the neural network is trained to predict the mean μ_θ(xₜ, t) and the covariance matrix Σ_θ(xₜ, t) for each time step.
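A minimal training-step sketch is shown below. It assumes a hypothetical network model(x_t, t) (typically a U-Net) and reuses q_sample, T, and alpha_bars from the previous snippet. Following the common DDPM simplification, the network predicts the added noise ε rather than the mean directly; μ_θ(xₜ, t) can then be recovered from this prediction up to known constants:

```python
# Sketch of one reverse-process training step (noise-prediction parameterization).
import torch
import torch.nn.functional as F

def training_step(model, x0: torch.Tensor, optimizer) -> float:
    t = torch.randint(0, T, (1,)).item()                 # random time step for this batch
    x_t, eps = q_sample(x0, t)                           # noised input and the true noise
    eps_pred = model(x_t, torch.tensor([t]))             # network's estimate of the noise
    loss = F.mse_loss(eps_pred, eps)                     # simplified DDPM objective
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```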

Once trained, new images are generated by running the learned reverse chain starting from pure Gaussian noise, as in the sketch below. The generated samples can then be used for a variety of applications, such as data augmentation, simulation, and creative content generation.
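Here is a minimal sampling sketch, again assuming the hypothetical model and the schedules (betas, alphas, alpha_bars, T) defined in the earlier snippets, and using the simple choice σₜ² = βₜ for the reverse-step variance:

```python
# Sketch of generation: start from pure Gaussian noise and apply the learned
# denoising step T times, following the DDPM posterior-mean formula.
import torch

@torch.no_grad()
def sample(model, shape=(1, 3, 64, 64)) -> torch.Tensor:
    x = torch.randn(shape)                               # x_T ~ N(0, I)
    for t in reversed(range(T)):
        eps_pred = model(x, torch.tensor([t]))           # predicted noise at step t
        coef = (1.0 - alphas[t]) / (1.0 - alpha_bars[t]).sqrt()
        mean = (x - coef * eps_pred) / alphas[t].sqrt()  # mu_theta(x_t, t)
        noise = torch.randn_like(x) if t > 0 else torch.zeros_like(x)
        x = mean + betas[t].sqrt() * noise               # one reverse step: x_t -> x_{t-1}
    return x
```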

Some popular diffusion models include GLIDE and DALL·E 3 from OpenAI, Imagen from Google, and Stable Diffusion.
