Known limitations and important notes

As of September 2020

  • No TensorFlow integration
  • Currently only supports ImageNet
  • Unknown effect on model accuracy of transcoding from various JPEG formats to H.265
  • Current transcoding filters failed on 81 images of the ImageNet 2012 dataset forcing them to be excluded. More information can be found in the dataset’s README.
  • Current transcoding filters required 111 images of the ImageNet 2012 dataset to first be transcoded to PNG prior to the final H.265 format. More information can be found in the dataset’s README.
  • High resolution images stored in the bzna_input track of the input samples are currently not available through the Dataloader. Their varying size prevent them from being decoded using a single hardware decoder configuration. The selected solution is to represent the images in the HEIF format which will be completed in future development.
  • It is currently not possible to compose transformations like you can with torchvision.transforms.Compose but SimilarityTransform should cover most of the necessary images transformations.