Variational Autoencoder • VAE

A Variational Autoencoder (VAE) is a type of artificial neural network used in the field of machine learning for the purpose of generating new data. It's a type of autoencoder, a neural network used for learning efficient codings of input data.

While a traditional autoencoder learns to compress data from the input layer into a short code, and then uncompress that code into the original data, a VAE turns the problem around. Instead of producing a single output (the compressed code), the VAE produces a statistical distribution of possible outputs. It then samples from this distribution to generate new data.

Here's a simplified explanation of how VAE works:

  1. Encoder: First, an encoder network turns the input data into two parameters in a latent space of representations.

  2. Random Sampling: We randomly sample similar points from the latent normal distribution that is assumed to generate the data.

  3. Decoder: Finally, a decoder network maps these random samples back to the original input data.

The parameters of the VAE are trained via two loss functions: a reconstruction loss that pushes the decoded samples to match the initial inputs, and a regularization loss, which helps learn well-formed latent spaces and reduce overfitting to the training data. The end result is a model that can generate new data that looks like the data it was trained on.

VAEs have been applied in a variety of applications such as generating human faces, handwriting styles, digital art, and more.

 
Basic structure of a variational autoencoder.

The basic scheme of a variational autoencoder. The model receives x as input. The encoder compresses it into the latent space. The decoder receives as input the information sampled from the latent space and produces x’ as similar as possible to x.

 

Videos to learn more about VAEs

VAEs are typically of higher interest in China and interest levels have been pretty steady for the past 20+ years.

Articles about VAEs

Artificial Intelligence Blog

The AI Blog is a leading voice in the world of artificial intelligence, dedicated to demystifying AI technologies and their impact on our daily lives. At https://www.artificial-intelligence.blog the AI Blog brings expert insights, analysis, and commentary on the latest advancements in machine learning, natural language processing, robotics, and more. With a focus on both current trends and future possibilities, the content offers a blend of technical depth and approachable style, making complex topics accessible to a broad audience.

Whether you’re a tech enthusiast, a business leader looking to harness AI, or simply curious about how artificial intelligence is reshaping the world, the AI Blog provides a reliable resource to keep you informed and inspired.

https://www.artificial-intelligence.blog
Previous
Previous

Accelerating Change

Next
Next

Recurrent Neural Network • RNN