Idea for next machine learning series - Keras with large datasets using generators/augmentation - nobody currently has a good guide for this online

by: jphoward, 6 years ago


Dear sentdex,

Love your videos.

Like you, I previously used tflearn/tensorflow for my ML and am migrating to Keras.
One thing I've found is that nowhere on the net shows how to use Keras with a huge dataset (many times larger than GPU memory, e.g. ImageNet) together with data augmentation etc.

Keras has generators which can read from HDF5 datasets for this, but all of the examples I've found rely on infinite loops, are messy, and make data augmentation very tricky.

This post is one of many examples of people struggling with it: https://github.com/keras-team/keras/issues/1627
Another: https://github.com/keras-team/keras/issues/68
And another: https://stanford.edu/~shervine/blog/keras-how-to-generate-data-on-the-fly.html
One user even asks about data augmentation and the response notes:
"I just checked the docs and it looks that Keras' ImageDataGenerator class is an end-to-end black box. No worries though as there is still the possibility to add these functionalities manually to DataGenerator."
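
To make the request concrete, here is a rough sketch of the kind of thing I mean: a finite, index-based batch loader with augmentation done by hand, no infinite loop. It follows the same `__len__` / `__getitem__` / `on_epoch_end` protocol as `keras.utils.Sequence`, but the class name `ArrayBatchLoader`, its parameters, and the flip-only augmentation are all made up for illustration. It is written against plain NumPy so it runs on its own; I am assuming an h5py dataset could be passed in place of the arrays, since it only needs `len()` and contiguous slicing.

```python
import numpy as np


class ArrayBatchLoader:
    """Hypothetical finite batch loader using the keras.utils.Sequence protocol."""

    def __init__(self, images, labels, batch_size=32, augment=True, seed=0):
        # `images` and `labels` only need len() and slicing. Assumption: an
        # on-disk h5py dataset would work here too, so the full dataset
        # never has to fit in memory.
        self.images = images
        self.labels = labels
        self.batch_size = batch_size
        self.augment = augment
        self.rng = np.random.default_rng(seed)
        self.n_batches = int(np.ceil(len(images) / batch_size))
        # Shuffle the order of batches, not individual samples, so every
        # read stays one contiguous slice.
        self.order = np.arange(self.n_batches)

    def __len__(self):
        # Number of batches per epoch.
        return self.n_batches

    def __getitem__(self, idx):
        b = self.order[idx]
        start = b * self.batch_size
        stop = min(start + self.batch_size, len(self.images))
        # np.array copies, so augmentation never touches the source data.
        x = np.array(self.images[start:stop], dtype=np.float32)
        y = np.array(self.labels[start:stop])
        if self.augment:
            # Random horizontal flip on (N, H, W, C) images, about half of them.
            flip = self.rng.random(len(x)) < 0.5
            x[flip] = x[flip, :, ::-1]
        return x, y

    def on_epoch_end(self):
        # Visit the batches in a new order next epoch.
        self.rng.shuffle(self.order)
```

To use this with Keras itself, the class would subclass `keras.utils.Sequence` and an instance would be passed to `model.fit`. Shuffling whole batches is a trade-off: it keeps disk reads sequential but mixes the data less than per-sample shuffling would.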

After considerable searching I have yet to find a good tutorial, and I suspect one would drive serious traffic to your excellent YouTube channel.

A big fan,

James






I would love to see a generative model like a GAN

-ali.fakhir33 6 years ago
