How to Create Efficient Training Pipelines with TensorFlow data.Dataset (Tensorflow Datasets)

Рет қаралды 10,440

Greg Hogg

Күн бұрын

Пікірлер: 37

@GregHogg Жыл бұрын

Take my courses at mlnow.ai/!

@user-wr4yl7tx3w 2 жыл бұрын

the way I understood it, is that on the first epoch, the processed data gets cached. and therefore, subsequent epochs, we don't need to preprocess, since we can now use the cached data instead. since they are the same anyway. no need to repeat it for each epoch.

@GregHogg 2 жыл бұрын

Yes, that's right. In my phrasing, I may have dumbed it down slightly while still providing what's important to understand

@vasudevraghav2109 Жыл бұрын

What an awesome tutorial man! Keep up the great work, really learned a lot!

@GregHogg Жыл бұрын

Thanks I really appreciate that!

@ShoaibKhanZ 3 ай бұрын

F!!! its a functional lazy eval. Argh, was clear as mud before, now crystal clear. Thanks!

@datapro007 Жыл бұрын

Great video Greg, thank you.

@hussainsalih3520 Жыл бұрын

very well document , I hope you make a video on best practice neural network when it faced overfitting

@freya291 Жыл бұрын

Hello, and thank you for this great content. What may happen if you have multi-input models and large datasets? Would you use the for loop to separate the input features and the labels?

@Amir-gi5fn 4 ай бұрын

9:41 Oh so it turns data from uint8 to float32 only when needed and this actually saves a lot of memory I've already preprocessed my data but can now do resizing in Pipeline

@inakitodc6816 Жыл бұрын

OMG this was so fcking useful, really good and deep material. Thanks!

@GregHogg Жыл бұрын

Glad to hear it!!

@iciukas 7 ай бұрын

Excellent! Clear and helpful

@GregHogg 7 ай бұрын

Super glad to hear it!

@oranuzrad8832 Ай бұрын

I Loved it!

@fiknthapis6231 Жыл бұрын

Good stuff

@harshitjami4771 Ай бұрын

In the video when your doing the first 'for' loop after the mapping the example_dataset, the loop broke after the first iteration so it must have only cached the first sample in the example_dataset and then when you shuffled the samples and then iterated through the first sample (in the newly shuffled dataset) it must have mapped and then cached that first time iterated sample in the examples_dataset because it was not previously iterated through. Did i understand it correctly?

@user-wr4yl7tx3w 2 жыл бұрын

what would be also cool is to see a video on vision transformer?

@GregHogg 2 жыл бұрын

Sounds good.

@Live_draw_today Жыл бұрын

Please make a video on how to train voice model stem by step in pycharm, please sir make video on this

@anwerhassan154 Жыл бұрын

thank you very much, but how can I print the result from the model and visualization the actual data and prediction data

@arsheyajain7055 2 жыл бұрын

Wonderful

@GregHogg 2 жыл бұрын

Thank you 😊

@elafakceramicmagasin3705 2 жыл бұрын

hi greg we need a road map for sql or at least some course suggestions , or a tutorial from you will be great

@GregHogg 2 жыл бұрын

You might want to check this Specialization out: kzbin.info/www/bejne/jGfTi5JjabCDj6M

@user-wr4yl7tx3w 2 жыл бұрын

Is there a way to see what you have already included in the pipeline?

@GregHogg 2 жыл бұрын

That's a great question. I actually don't know the answer.

@user-wr4yl7tx3w 2 жыл бұрын

may be PyTorch also has something like this? is it very different?

@GregHogg 2 жыл бұрын

I'm just learning it now. When I feel I understand everything well enough, I will teach it back :)

@koji4059 2 жыл бұрын

Could you make a review on HyperSkill Python course. I’ve heard it’s very good but not sure.

@GregHogg 2 жыл бұрын

Never heard of it.

@koji4059 2 жыл бұрын

@@GregHogg It’s also known as JetBrains Academy. It’s entire structure is project based learning and they have a Python Core track and a new Machine Learning in Python course.

@user-wr4yl7tx3w 2 жыл бұрын

doesn't cache simply mean, take that data and put it in local memory for reuse again? not sure what point I am missing.

@GregHogg 2 жыл бұрын

Their caching process is rather complex. I would just think of it as remembering.

@user-wr4yl7tx3w 2 жыл бұрын

it seems like a really unintuitive way to create a pipeline. wouldn't it be better to create a list of first class functions, where each function within the list does a step in the pipeline.

@GregHogg 2 жыл бұрын

You could make a function where it does a list of those functions