Generating synthetic time series data in case of data scarcity

From ISLAB/CAISR
Title Generating synthetic time series data in case of data scarcity
Summary Generating synthetic time series data in case of data scarcity
Keywords
TimeFrame
References - Using generative adversarial networks (GAN) to simulate central-place foraging trajectories

https://besjournals.onlinelibrary.wiley.com/doi/full/10.1111/2041-210X.13853

- Data-Driven Crowd Simulation with Generative Adversarial Networks http://rainbow-doc.irisa.fr/pdf/2019_amirian_casa.pdf

Prerequisites
Author
Supervisor Alexander Galozy, Peyman Mashhadi
Level Master
Status Open


This project addresses generating synthetic sequential data in case of real data scarcity. The idea is that there exists statistics about the real data but only very few data points. The goal of the project is to develop a generative method (for example GANs, transformers, variational autoencoder) to generate synthetic data with as similar as possible statistics to the real data. Then, the small sample from real data will be utilized to generate the trajectory of sequential data.

In many of the real-world industry data problems, we face we are only provided with a small number of data samples of time-series data, but data statistics are readily available. It is of high interest to diversify the little data we have to train better machine learning models. Further, we would like to generate realistic synthetic data beyond mere data augmentation, that is task specific, thus resulting in better sample complexity of our developed algorithms.