Thank you so much for the explanation! But I wanted to ask you something: what should I do when I have for instance a and b close, b and c close, but a and c distant? should I still put the three of them together? or should I do them separately, and in this case, which group should I agregate first?
@PegaAnalytics5 күн бұрын
That's an interesting question. We first aggregate the points that are closest - so that would be a and b. You then say a and c are distant. Relative to what? We really need a point d do decide that, I think. Then it's a question of whether c is closer to a&b or closer to d. But there are a couple of further details that we need to think about, and that weren't discussed in this video (I hope I will address these in some new year videos!). First, when we ask about distance of c from a&b what do we mean? We have to define distance more carefully. In the video I used the average location (think of a centre of mass). So we are not comparing the distance of c to a, but the distance of c to the midpoint of a and b. There are alternative distance metrics that we can use - we could take the closest point in a cluster, in which case we would look at the distance of c to b; or we could take the furthest point in a cluster, in which case we would look at the distance of c to a. Different distance metrics allow us to describe different shapes of clusters (think of spherical clusters versus elongated string-like clusters). In all these cases we would compare these distances against the distance from c to d. Any finally, perhaps we want to stop the agglomeration process and keep d and or c separate from a&b because we think that we have reached the optimal number of clusters. To do that we need to define what we mean by "optimal" - but a rule of thumb is to look for the point where we transition from small distances to large distances - and this is something I will discuss in my next video (probably early January because I am decorating the room that I use for recordings!).
@NehaSingh-kx5xe9 күн бұрын
Thanks
@PegaAnalytics8 күн бұрын
You're welcome :)
@NilkanthChapole-s4d27 күн бұрын
Thanks for practical insights.
@PegaAnalytics27 күн бұрын
You're welcome!
@kanchandatta4668Ай бұрын
What software you are using for such analysis?
@PegaAnalyticsАй бұрын
I was using JMP Software.
@kanchandatta4668Ай бұрын
@PegaAnalytics thank you so much for your reply. Is this software available in online?
@PegaAnalyticsАй бұрын
It's commercial software, produced originally by SAS: www.jmp.com/
@kanchandatta4668Ай бұрын
Not clear.
@PegaAnalyticsАй бұрын
Sorry that it wasn't helpful for you.
@kanchandatta4668Ай бұрын
@PegaAnalytics it was my short comings not your.
@PegaAnalyticsАй бұрын
Well I can see that you also produce educational videos. One of the joys of teaching is to try and come up with new ways of explaining concepts. Different explanations work well for different people - ourselves included!
@carbonsoot4254Ай бұрын
I click the link but error pop up. could you please share jsl. document of lesson 6 to download again? Make Column Switch Handler doesnt work in my code.
@PegaAnalyticsАй бұрын
Congratulations on getting to the last part of the series. I'm packaged the videos and the associated code into a Udemy course. You'll need to enrol for the course the get access to the files, but it doesn't cost you anything. Plus also, it should give you direct access to me if you have any future JSL questions. Here's the link: www.udemy.com/course/jsl-decoded/
@BunlornSunАй бұрын
What type of data should we do normality test?
@PegaAnalyticsАй бұрын
For me personally, I do a lot of process capability analysis, which is very sensitive to the assumption of normality of the data. For many other statistical methods (regression, two-sample t-test, etc) we construct a t-test and derive a p-value; these statistical methods assume that our error terms (residuals) are normally distributed.
@shankarduvvuri37742 ай бұрын
Thank you, David. This is the best explanation of dendrogram that I got. I am so grateful to you for not overloading this video with math concepts. Selecting alphabets to represent data points, made it even easier to understand. God bless you sir.
@PegaAnalytics2 ай бұрын
This you so much for taking the time to leave this comment, and for your kind words. This type of feedback gives me huge motivation to make further videos.
@oleboeffel3 ай бұрын
Great video!
@someboredkid79333 ай бұрын
Thank you! This helped a lot
@nike0adidas24 ай бұрын
Great explanation!
@gilbertomoreno93924 ай бұрын
Excellent💯 video thank you for sharing!!
@KingsleyIkpa-agodo4 ай бұрын
Thank you, sensei. This is the most clear explanation I've got about hierarchical clustering. Please I'm interested in learning more about how to know the optimum clusters for a dataset. Thank you.
@davidb416x4 ай бұрын
Thanks for your positive feedback. My next video will discuss the optimal number of clusters
@kellynguyen49345 ай бұрын
This video is great. Thanks.
@PegaAnalytics4 ай бұрын
Thank you!
@Alison_Li5 ай бұрын
Thanks,it`s very useful for me! Look forward to seeing you in live stream. I like 2 pm.
@nsnsna5855 ай бұрын
Thank you very insightful 🌹🌹
@PegaAnalytics5 ай бұрын
That's great to hear. Thanks!
@freidmannguyen34306 ай бұрын
Very useful content. Thank you very much
@PegaAnalytics6 ай бұрын
Glad it was helpful!
@markchahl64086 ай бұрын
David: two more use cases: 1) teaching JMP: a journal is a great way to package a training class into one file 2) documenting problem solving: i support manufacturing facilities and am often involved in problem solving. as i work on the problem i save notable reports to a journal. when done, i export the journal to MS Word and then add narration to explain the data, problem solving steps, root causes, and recommendations, etc..
@PegaAnalytics6 ай бұрын
Hi Mark, thanks for your contribution. I totally agree, these are two perfect use-cases for journals. It's interesting that you export a problem-solving journal to MS Word, once you have completed the work. I think that's probably a very helpful step. Something I will look try-out myself!
@markchahl64086 ай бұрын
@@PegaAnalytics Why MS Word? Journal Text Sledgehammer add-in is an improvement, but still nowhere near the features/ease of use of MS Word. Also, customers may not have JMP. MS SharePoint can't index JMP files, but can index Word files.
@sobha4046 ай бұрын
Thanks a bunch for this simplified and clear explanation, it would be a pleasure if you could share with us how could we make dendrograms from Pulsed field electrophoresis Gel , thank you :)
@PegaAnalytics6 ай бұрын
Funny you should ask that ... the following paper is next on my reading list: "Pulsed-field gel electrophoresis (PFGE) analysis of Listeria monocytogenes isolates from different sources and geographical origins and representative of the twelve serovars" www.academia.edu/111312006/Pulsed_field_gel_electrophoresis_PFGE_analysis_of_Listeria_monocytogenes_isolates_from_different_sources_and_geographical_origins_and_representative_of_the_twelve_serovars
@PegaAnalytics6 ай бұрын
I've looked at this in a bit more detail, and to be honest, handling these type of data is beyond my area of expertise. I did find some general information that I found helpful: A guide to interpreting electrophoresis gels: bento.bio/resources/bento-lab-advice/interpreting-electrophoresis-gels-with-bento-lab/#:~:text=The%20smallest%20bands%20are%20at,is%20up%20the%20ladder%20scale. (pulsed-field addressed larger DNA molecules but I presume the principles on interpretation remain the same). Any analytical technique requires digital data. I found this: Data processing of pulsed-field gel electrophoresis images www.ncbi.nlm.nih.gov/pmc/articles/PMC6940661/ The data processing would seem to me the critical step, which will ultimately result in the generation of tabulated data that would be amenable to cluster analysis. The columns of this tabulation would correspond to metrics that describe the banding, which each sample being represented by a row. I would guess that this data processing is integrated into most laboratory systems that produce pulsed-field electrophoresis gel?
@biancamangion77506 ай бұрын
Thank you for helping me understand dendrograms!
@PegaAnalytics6 ай бұрын
Happy to hear the video helped :)
@Helenalovedog6 ай бұрын
Thank you for the wonderful video! I had a very vague understanding of this concept before watching it. However, after going through the video, everything became crystal clear, and I experienced a profound moment of enlightenment. Your exceptional teaching skills and ability to break down complex ideas into understandable components have truly been an eye-opener for me. I am deeply grateful for your efforts in creating such an informative and insightful resource.
@PegaAnalytics6 ай бұрын
Thank you so much!
@刘洋-o7x6 ай бұрын
how can choose the calculation(simulation)ways will influence the simulation result, so that means must base on the actually status and the experience to choose the perfect mode to simulation, right?
@jupjup20237 ай бұрын
Thanks for the great video! It would be very appreciated if you will discuss how to select the optimum number of clusters in future videos. 🙂
@PegaAnalytics7 ай бұрын
I appreciate your feedback. I'll make a note to make a video about identifying the optimal number of clusters - thanks for the suggestion.
@МиленМилтанов7 ай бұрын
Very helpful video. Congratulation.
@PegaAnalytics7 ай бұрын
Thanks!
@PatrickGaller-g1w8 ай бұрын
This is great. Thanks!
@PegaAnalytics8 ай бұрын
Glad you liked it!
@markchahl64088 ай бұрын
David: Great video! I've been using this platform for about ten years and even I learned a bunch of things from you! For solving manufacturing problems, this platform is outstanding. Thanks for creating this.
@PegaAnalytics8 ай бұрын
Thanks Mark, I always value your feedback.
@Mike-s4n5p8 ай бұрын
thank you for video this is amazing content, you are a saint and a scholar cheers
@PegaAnalytics8 ай бұрын
Thank you so much, that is very generous of you!
@statslikejazz8 ай бұрын
Great content #PegaAnalytics! Never thought of using arrange in rows in the fit model platform! When would you not use fit separatly?
@PegaAnalytics8 ай бұрын
Hi, thanks for your feedback. I can't think of a good reason to not use the 'fit separately' option. Perhaps I might use it if I am doing a demonstration and want to focus on model visualisation and interpretation without dwelling on the details of model building. But in a real-world scenario, I would always want to spend my time building the best model I can for each response, and for that I would use 'fit separately'. The 'fit separately' option allows you to refine model terms individually for each response, whereas if not selected you refine model terms en-masse across the responses. Prior to version 14 this was the only way of handling multiple responses - apart from running Fit Model separately for each model. And the benefit of specifying multiple responses rather than running multiple Fit Models, is that you can view all the models simultaneously in the prediction profiler and contour profiler, as well as perform numerical optimisations that take account of all the responses.
@chrisjeson8 ай бұрын
Can you share the .jsl document of lesson 6 to download?
@PegaAnalytics8 ай бұрын
You should be able to access it here: www.pega-analytics.co.uk/blog/how-to-write-a-jsl-script/
@chrisjeson8 ай бұрын
@@PegaAnalytics Thank you!
@SnowCat19978 ай бұрын
Thank you so much this is really easy to understand
@PegaAnalytics8 ай бұрын
Glad it was helpful!
@Nanoscientist28 ай бұрын
Great video! thank you
@PegaAnalytics8 ай бұрын
You are welcome!
@franciscoruiz28138 ай бұрын
I have a JMP question. I am getting only Ppk values, not Cpk values for a non-normal distribution. In your example you shown Cpk. Where is the setting for it? I have search in the JMP preference and I have not found the correct one. Please let me know when you have a moment. Thanks
@PegaAnalytics8 ай бұрын
I had the preference "Ppk Capability Labeling" turned off. This is one of the preferences for the Distribution Platform( File> Preferences> Platforms> Distribution ).
@franciscoruiz28138 ай бұрын
@@PegaAnalytics Thank you
@steffenbugge47369 ай бұрын
So simple, yet so powerful! I will never look at a fit model the same way again! Thanks for sharing!
@franciscoruiz28139 ай бұрын
Thanks David, How do you get the 84 median? and also is the 5.8 factor dependent of the data set?
@PegaAnalytics9 ай бұрын
Thanks for your questions. The median value of 84 is based on fitting a Lognormal distribution to the data then taking the median of fitted distribution. That median calculation can either be done using a simple script based on the shape and scale parameters for the distribution (see my video on using the script editor as a calculator) or directly from JMP (after fitting a distribution, from the associated red triangle select Profilers> Quantile Profiler, and enter 0.5 for the quantile). Yes, you are correct, the value of 5.8 is specific to these data. Whereas for a Normal distribution we can always say that the width is +/- 3 sigma, that is not true for other distribution types. The value of 5.8 was determined by calculating the interval from the 50% percentile (the median) to the (100-0.135)% percentile. Similar to the median calculation, this can be done with a simple JSL script or by getting the numbers from the Quantile Profiler for the fitted distribution.
@franciscoruiz28139 ай бұрын
Thank you@@PegaAnalytics
@markchahl64089 ай бұрын
Thank you, David! This 30 year JMP user continues to learn from you.
@PegaAnalytics9 ай бұрын
Thanks for your feedback Mark.
@rlwsu9 ай бұрын
Thank you very much great video.
@PegaAnalytics9 ай бұрын
I'm glad you liked it. I appreciate the feedback. Thanks!
@tufailkhan646310 ай бұрын
Great tips
@PegaAnalytics10 ай бұрын
Thanks!
@steffenbugge473610 ай бұрын
Thanks for a instructive and simple presentation of the Z-Score method and the Percentile method. I will definitively try the Z-Score method next time I do capability analysis!
@PegaAnalytics10 ай бұрын
Thanks for your positive feedback :)
@Nanoscientist210 ай бұрын
Very nice to tutorial. Thank you.
@PegaAnalytics10 ай бұрын
Glad you liked it
@Alison_Li10 ай бұрын
Hi ,I like all your vedios! Can you make some vedios to introduce the app builder? I want to learn how only use a little script to accomplish a program.
@PegaAnalytics10 ай бұрын
I'm glad you are enjoying the videos. It's really helpful to get requests, so I know what is of interest to people. I'll add the app builder to my list of topics!
@Alison_Li10 ай бұрын
@@PegaAnalytics Thanks!
@Alison_Li10 ай бұрын
👍
@Alison_Li11 ай бұрын
👍
@markchahl640811 ай бұрын
Nice video, David! Going to share it with my team that does global manufacturing support.
@PegaAnalytics11 ай бұрын
That's wonderful. Thanks Mark.
@Alison_Li11 ай бұрын
Thank you. I learned a lot.
@PegaAnalytics11 ай бұрын
Glad it was helpful!
@steffenbugge473611 ай бұрын
Excellent stuff David! Please keep the videos coming!
@PegaAnalytics11 ай бұрын
Thanks Steffen. I appreciate the encouragement!
@Alison_Li11 ай бұрын
Great vedio!
@PegaAnalytics11 ай бұрын
Thanks for the feedback. Part 2 tomorrow!
@madhuacharyya696311 ай бұрын
Hello. Thank you for the video. It is interesting. I have downloaded the dataset from JMP Help menu. However, I am interested to know how have you grouped the variables under several sub-groups such as milling, blending, compression, spraying, raw materials
@PegaAnalytics11 ай бұрын
That's a good point. My version of the table was slightly different to the one that you can find in the sample data. On the left of the table, where it lists all the column names, you can select multiple column names and then right-click: you will then see an option to "group columns". JMP will assign a default name to the group e.g. "Mill Time etc.", but you can click on the name and edit it to be something more descriptive. ( You will also find differences in the order of the rows. For illustration purposes I wanted a condition where a control chart had recently gone out of control. To achieve this I think I sorted the data by descending order of mill time. )
@Mike-s4n5p11 ай бұрын
awesome stuff please keep posting
@PegaAnalytics11 ай бұрын
Thanks! I really appreciate your feedback.
@PegaAnalytics Жыл бұрын
For 2024 I am planning some new video content relating to DOE. What specific topics would be of interest? Please let me know if the comments :)
@gauravdhumal128716 күн бұрын
RSM CCD design and its analysis to predict the response.
@PegaAnalytics Жыл бұрын
In 2024 I will post videos for the complete build of this oneway advisor ... unless anyone can suggest an alternative use-case ... ? :) The other possibility is to do some of this as a live stream. What are your thoughts, please let me know in the comments.