Perhaps you can start with an over-specified architecture, use weight decay, and use early stopping immediately. In November I commented on another post of yours about checkpoints; my main objective at that time (and still) is hyperparameter optimization. My point is: I want to be able to reload the same model and continue training until the minimum loss, AND change the model hyperparameters (architecture, batch size, number of epochs) and retrain with the same data split as before (or at least the same dataset, if I am forced to change the split because of a different batch size).
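One way to keep the same data split across retraining runs is to derive it from a fixed seed rather than from the batch size or any other hyperparameter. The sketch below is framework-independent and uses only the Python standard library; `split_indices`, `val_fraction`, and `seed` are illustrative names, not part of Keras.

```python
import random

def split_indices(n_samples, val_fraction=0.2, seed=42):
    """Return (train_idx, val_idx); the result depends only on the arguments."""
    indices = list(range(n_samples))
    # A dedicated Random instance keeps the shuffle independent of global state.
    random.Random(seed).shuffle(indices)
    n_val = int(n_samples * val_fraction)
    return indices[n_val:], indices[:n_val]

# Two separate runs (for example, with different batch sizes) get the same split.
train_a, val_a = split_indices(100)
train_b, val_b = split_indices(100)
print(train_a == train_b and val_a == val_b)
```

The index lists can then be used to select rows from the dataset before training, so changing the batch size or the architecture does not change which samples are held out.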

The question may be: is there a wrapper for hyperparameter tuning in Keras, aside from the sklearn one (which doesn't work too well with Keras checkpoints)?
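If no ready-made tuner fits, a plain loop over hyperparameter combinations is one fallback. This is a minimal sketch, not a Keras or sklearn API: `grid_search` is a hypothetical helper, and `toy_evaluate` is a stand-in for "build the model, train it, and return the validation loss".

```python
from itertools import product

def grid_search(evaluate, grid):
    """Try every combination in `grid`; return the one with the lowest score."""
    names = list(grid)
    best_params, best_score = None, float("inf")
    for values in product(*(grid[name] for name in names)):
        params = dict(zip(names, values))
        score = evaluate(params)  # assumed to return a loss (lower is better)
        if score < best_score:
            best_params, best_score = params, score
    return best_params, best_score

# Hypothetical stand-in for training a model and returning its validation loss.
def toy_evaluate(params):
    return abs(params["batch_size"] - 32) + abs(params["epochs"] - 50)

grid = {"batch_size": [16, 32, 64], "epochs": [10, 50, 100]}
best_params, best_score = grid_search(toy_evaluate, grid)
print(best_params, best_score)
```

Because the loop is your own code, each call to `evaluate` can save or load checkpoints however it needs to.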

It was always different every time.

First build the model architecture. Then load the saved weights (or model; I am using the ...). I can share the GitHub link so that you can have a look at my code, if everything is fine.

Hi Jason, thank you very much for your post, it is very useful.

Hi Jason, thanks for the clarification.

As of now I am doing it in a single program. I will write a separate script to create the model architecture again.

I have a question. I am ready to use the final model externally. Should I just take one of the trained models from one of the folds? Or would you train the final model on all of the data?

With a validation set, you have an indication of when the model starts to overfit, while training on all of the data means the model gets to see more data. The problem is that if you train on all of the data, there is no held-out set left to tell you when to stop.

Hi Jason, I was wondering if there is any hard-and-fast rule to use minimization of validation loss for early stopping.
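To make the stopping rule concrete, here is a small framework-independent sketch of patience-based early stopping on a list of per-epoch validation losses. The function name and the `patience` and `min_delta` parameters are illustrative choices that mirror common usage; this is not the Keras implementation.

```python
def early_stopping_epoch(val_losses, patience=2, min_delta=0.0):
    """Return (best_epoch, stop_epoch), both 0-based.

    Training stops once the validation loss has failed to improve by more
    than `min_delta` for `patience` consecutive epochs.
    """
    best_loss = float("inf")
    best_epoch = -1
    waited = 0
    for epoch, loss in enumerate(val_losses):
        if loss < best_loss - min_delta:
            best_loss, best_epoch, waited = loss, epoch, 0
        else:
            waited += 1
            if waited >= patience:
                return best_epoch, epoch
    # Ran out of epochs without triggering the stop condition.
    return best_epoch, len(val_losses) - 1

history = [0.90, 0.70, 0.60, 0.62, 0.65, 0.50]
best_epoch, stop_epoch = early_stopping_epoch(history, patience=2)
print(best_epoch, stop_epoch)  # 2 4
```

One option when training a final model on all of the data is to reuse the epoch count found this way (here `best_epoch + 1` epochs), since no validation set remains to trigger the stop. Note that with `patience=2` the later improvement at the last epoch is never seen, which is the usual trade-off when choosing a patience value.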

What are the pros and cons of this approach, in your opinion?

Thank you for all the amazing notes. I have a question regarding the training/testing data split. I want to use training, testing, and validation data sets.

I also want a random split into training and testing data sets for each epoch. Is that possible in Keras? Or, in simpler words, can I do it like this:

1. Split the data into training and testing sets.
2. Split the training data into training and validation sets.
3. Fit a model on the training data, predict on the validation data, and get the model accuracy.
4. If the model accuracy is below some required number, go back to step 3, reshuffle, and draw a new random combination of training and validation sets. Keep the previous model and weights and continue updating the weights from that state.
5. Repeat until a decent validation accuracy is achieved.
6.
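The numbered steps above can be sketched as a loop, under some assumptions: the test set is fixed once, only the training/validation split is reshuffled, and the model state is carried from round to round. `reshuffle_until_good` and `train_and_score` are hypothetical names; `train_and_score` stands in for "continue training from the previous weights and return the validation accuracy".

```python
import random

def reshuffle_until_good(n_samples, train_and_score, target=0.9,
                         test_fraction=0.2, val_fraction=0.2,
                         max_rounds=10, seed=0):
    rng = random.Random(seed)
    indices = list(range(n_samples))
    rng.shuffle(indices)
    n_test = int(n_samples * test_fraction)
    # Step 1: hold out the test set once; it is never reshuffled.
    test_idx, rest = indices[:n_test], indices[n_test:]
    state, score = None, 0.0  # `state` stands in for the model weights
    for round_no in range(1, max_rounds + 1):
        # Step 2: draw a new random training/validation split.
        rng.shuffle(rest)
        n_val = int(len(rest) * val_fraction)
        val_idx, train_idx = rest[:n_val], rest[n_val:]
        # Step 3: continue training from the previous state and score it.
        state, score = train_and_score(train_idx, val_idx, state)
        # Steps 4 and 5: stop once the validation score is good enough.
        if score >= target:
            break
    return test_idx, state, score, round_no

# Hypothetical stand-in: the "model" just counts how many rounds it was trained.
def toy_train_and_score(train_idx, val_idx, state):
    rounds_done = (state or 0) + 1
    return rounds_done, 0.5 + 0.25 * rounds_done

test_idx, state, score, rounds = reshuffle_until_good(100, toy_train_and_score)
print(rounds, score)
```

One caveat with this design: samples used for validation in one round can end up in the training set in a later round, so the validation score becomes optimistic over time. The fixed test set is what remains as an unbiased check.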



