This is so interesting : I trained convolutional frame-interpolation model (FILM from google) on synthetic blue / red boxes image dataset and ran it on real image.
feedsImage
197