One of the most common problems in Computer Vision is a shortage of images when training ML models. In deep learning, a large amount of data is required for neural networks to learn the relevant characteristics of their inputs and then perform inference correctly: when models are trained on limited samples, they are not able to generalise to unseen data. Even when pre-trained models (transfer learning) are used, the images available for the particular use case are often insufficient and the model is not trained correctly.
At Keepler we have faced this challenge in projects involving object detection in images, and more specifically in anomaly detection projects. In this situation, we needed methods for generating synthetic images (data augmentation) so that projects with a reduced image dataset remain viable. Specifically, we have researched two techniques:
- Generation of images using classical data augmentation procedures: applying distortions, rotations, colour changes, etc. to the original images.
- Generation of images with GANs (Generative Adversarial Networks); specifically, using CycleGANs to apply a context change (style transfer) to the original images and generate new ones.
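The first technique can be sketched in a few lines. The snippet below is a minimal illustration using only NumPy, not the pipeline used in our projects; in practice, libraries such as Albumentations or torchvision.transforms offer richer, GPU-friendly versions of these operations. The `augment` function and its parameters are hypothetical names chosen for this example.

```python
import numpy as np

def augment(image, seed=0):
    """Produce simple augmented variants of an image array of shape (H, W, C)."""
    rng = np.random.default_rng(seed)
    variants = []
    # Horizontal flip (mirror along the width axis)
    variants.append(np.fliplr(image))
    # 90-degree rotation in the image plane
    variants.append(np.rot90(image))
    # Random brightness shift, clipped back to the valid 0-255 range
    shift = int(rng.integers(-40, 41))
    shifted = np.clip(image.astype(np.int16) + shift, 0, 255).astype(np.uint8)
    variants.append(shifted)
    return variants

# Example: a tiny 4x4 RGB "image"
img = np.arange(48, dtype=np.uint8).reshape(4, 4, 3)
augmented = augment(img)
print(len(augmented))  # one variant per transformation above
```

Each original image thus yields several training samples, which is often enough to noticeably reduce overfitting on small datasets.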
The generation of images, or of any type of data, is very common in projects where data is limited. Increasing the variability of the training data allows models to generalise better; it can also reduce the cost of data collection and labelling.
In the white paper below, available for download, we look in detail at the methods we used, some simple and some more complex, to produce the synthetic images needed to train computer vision models.