Пікірлер
@Aman-Satat
@Aman-Satat 3 күн бұрын
thank you! got what i was looking for
@anantharamaniyer9135
@anantharamaniyer9135 5 ай бұрын
Watching this again reveals even more ideas! Many thanks for this Dan. Also do you have similar ones using Datashader and hvplot using pandas / polars /dask for plotting line,bar charts etc?
@4096fb
@4096fb Жыл бұрын
Thanks for the video! One question, what to do when I have z as pd.Series and not as a matrix? Not sure what would be the right way to convert it to matrix. I can use reshape, but I'm not sure it will shape the matrix as required.
@agracian1
@agracian1 Жыл бұрын
Hi Dan, got the following error in cell [7] when trying to run your script in Colab: --------------------------------------------------------------------------- TypeError Traceback (most recent call last) <ipython-input-7-f76c603a8d2f> in <cell line: 1>() ----> 1 feature_matrix_customers, features_defs = ft.dfs(entities=entities, 2 relationships=relationships, 3 target_entity="customers") 1 frames /usr/local/lib/python3.10/dist-packages/featuretools/utils/entry_point.py in function_wrapper(*args, **kwargs) 30 # call function 31 start = time.time() ---> 32 return_value = func(*args, **kwargs) 33 runtime = time.time() - start 34 except Exception as e: TypeError: dfs() got an unexpected keyword argument 'entities'
@-mwolf
@-mwolf Жыл бұрын
awesome intro
@apratimdey6118
@apratimdey6118 Жыл бұрын
Hi Dan, this was a great introductory video. I am learning Dask and this was very helpful.
@ecemgungor6208
@ecemgungor6208 Жыл бұрын
Thanks for the videos. I have a question about multivariate data. I have three independent variables and would like to see their occurrences by coloring the data based on their probability densities (plot type can be contour, surf etc.) Which function should I use? Could you please help me with this?
@saidmkc3069
@saidmkc3069 Жыл бұрын
Hey Dan, can you explain how to change the color scale in green - yellow - red?
@spadecake
@spadecake Жыл бұрын
Great video, worth trying the notebook after reading the paper
@joseleonardosanchezvasquez1514
@joseleonardosanchezvasquez1514 2 жыл бұрын
Great
@FabioRBelotto
@FabioRBelotto 2 жыл бұрын
I've added dask delayed to some functions. When I visualize it, there are several parallel works planned, but my cpu does not seems to be affected (its only using a small percentage of it)
@FabioRBelotto
@FabioRBelotto 2 жыл бұрын
Thanks man. It's really hard to find some information about dask
@parikannappan1580
@parikannappan1580 2 жыл бұрын
4:12 ,visualize() , where can in get documetation? I tried to use for sorted() did not work
2 жыл бұрын
Hi 😀 Dan. Thank you for this video. Do you an example which uses the apply() function? I want to create a new column based on a data transformation. Thank you!
@felixtechverse
@felixtechverse 2 жыл бұрын
How to schedule a dask task? for example how to put a script to run every day at 10:00 with dask for example
@2false637
@2false637 2 жыл бұрын
Saved my ass, man. Thanks!
@vegaalej
@vegaalej 2 жыл бұрын
Many thanks for this excellent video! It is really clear and helpful! I just have one question, I tried to run the notebook, and ran pretty well after some minor updates. Just the last line I was not able to make it run: # never run out-of-memory while training "model.fit_generator(generator=dask_data_generator(df_train), steps_per_epoch=100)" Gives me an error message: InvalidArgumentError: Graph execution error: TypeError: `generator` yielded an element of shape (26119,) where an element of shape (None, None) was expected. Traceback (most recent call last): TypeError: `generator` yielded an element of shape (26119,) where an element of shape (None, None) was expected. [[{{node PyFunc}}]] [[IteratorGetNext]] [Op:__inference_train_function_506] Any recommendation on how I should modify it to make it run? Thanks AG
@piotr780
@piotr780 2 жыл бұрын
very good ! thank you ! :)
@obamaengineer4806
@obamaengineer4806 2 жыл бұрын
fantastic video must say,,keep going sir...ur really really great...teaching such a complex thing in such short video ythat too so clear...thanks a lottt again
@JimmyGelhaar
@JimmyGelhaar 2 жыл бұрын
Great job on this video, Dan! Datashader is pretty sweet!
@_XY_
@_XY_ 2 жыл бұрын
👏👏
@alikoko1
@alikoko1 2 жыл бұрын
Great video! Thank you man 👍
@淘宝买的会员
@淘宝买的会员 2 жыл бұрын
Thank you. You should come back and bring more content.
@pritommojumder367
@pritommojumder367 2 жыл бұрын
Nice demo. Thank you for sharing. Keep up the good work.
@aydoganavcoglu2675
@aydoganavcoglu2675 2 жыл бұрын
Many thanks your very sophisticated steps by steps instructions! I would like to ask you how we can reach these kind of data set like more than a millition being able to use datashader?
@vegaalej
@vegaalej 2 жыл бұрын
Great Video an Explanation! Thank you very much for it! IT is really helpful! I tried to run the notebook, and it ran pretty well after some minor updates. I just had problems to run the latest part. "never run out-of-memory while training", seems the generator or steps per epoch part is giving some prblem I cant fogure hout how to solve. Any possible suggestion on how to fix the code? Thanks! InvalidArgumentError: Graph execution error: TypeError: `generator` yielded an element of shape (26119,) where an element of shape (None, None) was expected. Traceback (most recent call last): TypeError: `generator` yielded an element of shape (26119,) where an element of shape (None, None) was expected.
@AlejandroGarcia-ib4kb
@AlejandroGarcia-ib4kb 2 жыл бұрын
Interesting, I have the same problem in the last part of the notebook. Seems it is related to IDE, it needs and update.
@krocodilnaohote1412
@krocodilnaohote1412 2 жыл бұрын
Man, great video, thank you!
@r.m10234
@r.m10234 2 жыл бұрын
Thanks Man!
@sripadvantmuri89
@sripadvantmuri89 2 жыл бұрын
Great explanations for beginners!! Thanks for this...
@marialuisaargaezsalcido4957
@marialuisaargaezsalcido4957 2 жыл бұрын
Hi, trying to do your excerise code , but an error appears : TypeError: dfs() got an unexpected keyword argument 'entities'.
@danbochman
@danbochman 2 жыл бұрын
Hi Maria, It has been 2 years, so they probably changed their dfs function arguments. Looking in the documentation for dfs: featuretools.alteryx.com/en/stable/generated/featuretools.dfs.html It seems this function now expects an argument called entityset And this entityset is: "entityset (EntitySet) - An already initialized entityset. Required if dataframes and relationships are not defined." EntitySet seems to be a new class : featuretools.alteryx.com/en/stable/generated/featuretools.EntitySet.html Perhaps my guide is outdated in terms of syntax but the concept should stay the same!
@summerxia7474
@summerxia7474 2 жыл бұрын
Very nice video!!!! Thank you so much!!! Pls make more about this hands-on video! You explain them very clear and helpful!!!!
@ГерманРыков-ъ6в
@ГерманРыков-ъ6в 2 жыл бұрын
amazing. Very interesting theme.
@samable9585
@samable9585 2 жыл бұрын
wonder what is difference between encoding and mapping. For example if STATE_CD goes from 1 to 50 say, now its numeric - can it be used in AI learning without resorting to one hot encoding?
@danbochman
@danbochman 2 жыл бұрын
If you map states to numbers 1 to 50, it can be used in ML, but you inserted an internal relationship that doesn’t exist (state 1 is more similar to state 2, very far from state 50)
@samable9585
@samable9585 2 жыл бұрын
@@danbochman thanks for the reply. understood
@bodenseeboys
@bodenseeboys 2 жыл бұрын
Chapeau, well explained and healthy usage of memes!
@datadiggers_ru
@datadiggers_ru 2 жыл бұрын
Great intro. Thank you
@asd222treed
@asd222treed 2 жыл бұрын
Great video! Thank you for sharing.But I think your code would have some incorrect codes in machine learning with dask part. There is no X in the code (model.add(..., input_dim=X.shape[1], ... ) and when I training model.fit_generator, the tensor flow saids model.fit_generator is deprecated.. and finally displayed error - AttributeError: 'tuple' object has no attribute 'rank'
@danbochman
@danbochman 2 жыл бұрын
Hey! Whoops, I must've changed the variable name X to df_train and wasn't consistent in the process, it probably didn't pop a message to me because X was still in my notebook workspace. You can either change df_train to X or change X to df_train X <==> df_train. Just be consistent and it should work!
@gholamrezadar
@gholamrezadar 3 жыл бұрын
Great video with good examples. Loved the MiniNet part. Thank you.
@madhu1987ful
@madhu1987ful 3 жыл бұрын
Hey thanks for the awesome video and the explanation. I have a use case. I am trying to build a Deep learning tensorflow model for time series forecasting. For this I need to use multinode cluster for parallelization across all nodes. I have a single function which can take data for any 1 store and predict for that store. Likewise I need to do predictions for 2 lakh outlets. How can I use dask to parallelize this huge task across all nodes of my cluster? Can you please guide me. Thanks in advance.
@danbochman
@danbochman 3 жыл бұрын
Hi Madhu, Sorry, wish I could help, but node cluster parallelization tasks are more dependent on the framework iteself (e.g. Kubernetes), than Dask. You have the dask.distributed module (distributed.dask.org/en/stable/), but handling the multi-worker infrastructure is where the real challenge lies...
@cristian-bull
@cristian-bull 3 жыл бұрын
I really appreciate the batch-on-the-fly example with keras.
@markp2381
@markp2381 3 жыл бұрын
Great content! One question, isn't it strange to use function in this form: function(do_something)(on_variable) instead of function(do_something(on_variable)) ?
@danbochman
@danbochman 3 жыл бұрын
Hey Mark! Thanks. I understand what you mean, but when a function returns a function this makes sense, as opposed to a function which outputs the input to the next function. delayed(fn) returns a a new function (e.g. "delayed_fn"), and this new function is then called regularly delayed_fn(x). So its delayed(fn)(x). All decorators are functions which return callable functions. In this example they are used quite unnaturally because I wanted to keep both versions of the functions. Hope the explanation helped!
@guideland1
@guideland1 3 жыл бұрын
Really great Dask introduction and the explanation is so easy to understand. That was useful. Thank you!
@Единыймир.Переводыисубтитры
@Единыймир.Переводыисубтитры 3 жыл бұрын
Many thanks. Now I understand why the file was not read
@rashidsyed
@rashidsyed 3 жыл бұрын
Nice
@vitorruffo9431
@vitorruffo9431 3 жыл бұрын
Good work sir, your video has helped me to get started with Dask. Thank you very much.
@unathimatu
@unathimatu 3 жыл бұрын
This is really greAT
@DanielWeikert
@DanielWeikert 3 жыл бұрын
Great. I would also like to see more on DASK and Deep Learning. How exactly would this generator be used in pytorch? Instead of the DataLoader. Thanks for the video(s)
@chenzakaim3
@chenzakaim3 3 жыл бұрын
יש משהו דומה על למידת מכונה?
@dwivedys
@dwivedys 3 жыл бұрын
Excellent!
@pizonshetu39
@pizonshetu39 3 жыл бұрын
Thank you so much for your explanation, I learned more in this one video than reading multiple articles where my mind felt bogged and bored each time I read a line but this is so digestible and easy to understand
@fish3485
@fish3485 3 жыл бұрын
Dan, I just found your videos. They’re great! Will you be making any more?
@danbochman
@danbochman 3 жыл бұрын
Hey Benjamin, glad you liked it! Unfortunately, I don't think I'll be making new videos soon... Just out of curiosity, what kind of videos/topics would you be interested in?