29:01 What is Positivity? Positivity means that for every combination of confounders (or covariates) C in the population, there is a non-zero probability of receiving each level of the treatment T. In other words: There must be individuals in the data who have some chance of being treated and some chance of not being treated for all relevant subgroups defined by C. Mathematically, this is often expressed as: P(T=t, W=w) > 0, for all t and w
@karannchew25342 сағат бұрын
16:05 The statement doesn't make sense. To make Average Treatment Effect equal to associational difference (in another words, to make the difference observed between the two groups the same as the casual difference), we need to ensure Ignorability. And Exchangeability. Ignorability: ( Y(1), Y(0) ) ∟ T 16:05 But this statement doesn't make sense. "Potential outcome Y(1) and potential Y(0) are independent of the Treatment." How is that possible?? The outcome is surely dependent on the treatment?? Isn't that the purpose of treatment? "Having headache and not having headache is independent of "wearing shoes"" - doesn't make sense. May be it tries to say: The grouping of the subjects or the characteristics of the subjects being assigned the treatment are independent of the treatment? The potential outcome is independent of any other factors associated to the treatment other than the treatment itself? --
@karannchew25343 сағат бұрын
14:30 In this example, is confounding the reason the groups being unequal?
@jiaxinyuan340818 сағат бұрын
Can someone tells me why are all edges of the essential graph are undirected in 0:42? Isn't there immorality at node C?
@karannchew2534Күн бұрын
25:26 ".. because you *can* observe that, you *can* compute.." 😵
@karannchew2534Күн бұрын
Association can be due to: 1. Confounding 2. Casual Correlation is a type of association measurement. There can be other measurements. Correlation definition = LINEAR statistical dependence -- 26:28 Average treatment effect = Average of the difference between the outcome with treatment and outcome without treatment in the population = E[Y(1) - Y(0)] = (assuming linearity of expectation) Difference between average of outcome with treatment and average of outcome without treatment among the population = E[Y(1)] - E[Y(0)] ≠ (if there's any confounding factor) Difference between *conditional expectations* ie. the average value based on simply grouping of or conditioning on treatment and non-treatment data, because that's a man-made mathematical/statistical operation that only results in statistical meaning ie. association/correlation that includes both confounding and the natural/true casual effects = E[Y|T=1] - E[Y|T=0] = (If there is no confounding factor) E[Y|T=1] - E[Y|T=0] -- The expected outcome of sub-population under treatment T = E[ Y(t) | W=w ] = E[ Y | do(T=t), W=w) ] = E[ Y | t, w ] *Marginalizing Over W* How to calculate the overall expected outcome across the entire population under treatment , not just for a specific sub-population, of small w? This is achieved by marginalising over W, meaning we take the expectation across all possible values of W, weighted by their probability distribution. E[Y(t)] = E[Y|do(T=t)] = E_sub_W E [Y|t, W] and the W in the last bit of the equation is a random variable. E_sub_W := The expectation over all possible values of W, treating W as a random variable. This accounts for the fact that different sub-populations (with different values of W) may have different probabilities of occurring in the population. *Why Marginalization?* 1. We want to compute the treatment effect E[Y(t)] averaged over the entire population, not just for a specific subgroup. 2. W might influence both the treatment assignment T and the outcome Y, so we must account for its distribution when aggregating over the population. Marginalisation ensures that the calculated Y reflects the combined contributions of all sub-populations, weighted appropriately.
@jiaxinyuan3408Күн бұрын
How does faithfulness derives that X1 depends on X3 in 0:52? Doesn't Faithfulness tells us from data to graph?
@jiaxinyuan34082 күн бұрын
I'm a little be confused at the definition of Instrumental unconfoundedness Assumption now. Here it is applied in 2:32, where backdoor paths are considered as from U->...->Z. Cuz in previous course 8.1, it's clearly written as un unblocked backdoor paths to Y. Does the Y refer to the outcome at certain path we care about?
@FerencChan10 күн бұрын
I have a question about the second step: why are ACD, ACE, and BCE not immorality,don't they also meet two conditions?
@jiaxinyuan3408Күн бұрын
How could they meet the second condition that C should not be in the conditioning set that makes {A,D}, {A,E}, {B,E} conditionally independent?
@chongsun787213 күн бұрын
I think the biggest confusion for me is that why X causes Y but X and Y can also be statistically independent for LMA.
@faanross24 күн бұрын
These lectures are amazing, thanks Brady
@SvetlanaBondarevaАй бұрын
Reading "The Book of Why" and watching the lectures! Love it! Thank you for the course Brady!
@DataTranslatorАй бұрын
How do we condition on the empty set ?
@chongsun7872Ай бұрын
I remembered I have read many many materials to try to understand Simpson's paradox and understand when to combine groups. This is THE most clear one I have ever listened to!
@joe_hoeller_chicagoАй бұрын
Prof Pearl is the best!
@Al-MahiАй бұрын
Diving into causal learning.!
@BenediktKnopp-z6y2 ай бұрын
great explanation, helped me a lot to understand this!
@iffatarasanzida77042 ай бұрын
Very well described. Thank You!
@what_a_new_world2 ай бұрын
Thank you for your video. give me the answer please.... I have an exam tomorrow... okay bye....
@andreicristea9972 ай бұрын
Great content! I am studying in the university now Causality subject and your youtube videos just give me the right amount of information to understand the topics that were more abstract to me (given just by the definitions). I just got confused though by the slide on the timestamp 0:25. You talk there about association (red dotted arrow), and it seems like it should two a two sided arrow, since association is purely statistical notion and doesn't have a direction. So I guess its either a typo or I misinterpreted something 😁
@rukman-sai3 ай бұрын
Question: Hi, I don't know if this question is too late but here it goes. On slide 9, how does having the additional observational data increase the number of interventions from (N-1) to N instead of decreasing it. I understand that one of those 'N' interventions is the observational data. So is this observational data useless if we still need (N-1) more single interventions. I mean more data should decrease the number of additional single node interventions necessary right?
@choutycoh58284 ай бұрын
ASSUME A TABLE CAN BLOCK the impact from THE C to X(7::45)
@TegegnBergene4 ай бұрын
could you be kind to to share stata sytax for counterfactual prediction in the Multinomial endogenous switching model- you can use my email that i will send if I learn you are willing to help me
@yogeshsingular4 ай бұрын
Really great work by you to make a complex set of topics in casual inference accessible to everyone
@peasant123454 ай бұрын
Thanks for the video. It looks like in the 8:36 example the difference between counterfactual and potential outcome is that in counterfactual U is random but in potential outcome case , where things could be resolved by do calculus, U is an unknown predetermined parameter. Am I correct?
@RayRay-yt5pe4 ай бұрын
For real man, thank you so much for everything you have done!!. You have made this stuff really accessible for folks like me!
@RayRay-yt5pe4 ай бұрын
I think the middle notation is more intuitive to read.
@TheProblembaer25 ай бұрын
How would the immorality change if we were to condition on X2? Or is the argument at 29:33 about conditioning on X2? I thought If we were to condition on X2, P(x1) and P(x2) were to be dependent?
@peasant123455 ай бұрын
the gunshot treatment. lmao
@TheProblembaer25 ай бұрын
I SAW THE DEATH STAR!
@moienr41045 ай бұрын
Can someone explain at 2:00 why Brady wrote P(x_3) instead of P(x_3 | x_1)? Did he use the fact that x_1 and x_3 are independent? Isn't he already trying to prove that they are independent by getting to 2:47? Thank you!
@moienr41045 ай бұрын
I got it! At 2:00 he used local Markov assumption!
@jiaxinyuan34087 күн бұрын
P(x1)P(x3)P(x2|x1,x3) is there because of Bayesian Network Factorization
@JTedam5 ай бұрын
Neal, I would like to propose an alternative explanation to yours using a scientific realism perspective on causal inference. You said, In scenario 1, treatment has an effect on the condition which has an effect on the probability of the outcome. And in scenario 2: the condition has an effect on the treatment which has an effect on the probability of the outcome. The treatment is the mechanism and the condition, the context or circumstances. I would argue the mechanism is the cause and the condition is the trigger. In scenario one, the outcome is considered in the basis of the mechanism alone and in scenario 2, the outcome is deduced on the basis of the trigger condition of the mechanism. Mechanisms are always triggered in context to create outcomes. Mechanism are causes but their outcomes are always shaped by context. I would also add that some mechanisms may be hidden, meaning there are other unobservable mechanisms - which under experimental conditions (!closed systems) can be controlled. In open systems, the outcome may not be easily predictable because of these unobservable mechanisms. In you Mr example, these mechanisms could be environmental - co-morbidities, contra-indications, emotional state, age related etc. So I would be cautious about relying on data driven inferencing alone.
@JTedam5 ай бұрын
Neal, I would like to propose an alternative explanation to yours using a scientific realism perspective on causal inference. You said, In scenario 1, treatment has an effect on the condition which has an effect on the probability of the outcome. And in scenario 2: the condition has an effect on the treatment which has an effect on the probability of the outcome. The treatment is the mechanism and the condition, the context or circumstances. I would argue the mechanism is the cause and the condition is the trigger. In scenario one, the outcome is considered in the basis of the mechanism alone and in scenario 2, the outcome is deduced on the basis of the trigger condition of the mechanism. Mechanisms are always triggered in context to create outcomes. Mechanism are causes but their outcomes are always shaped by context. I would also add that some mechanisms may be hidden, meaning there are other unobservable mechanisms - which under experimental conditions (closed systems) can be controlled. In open systems, the outcome may not be easily predictable because of these unobservable mechanisms. In you Mr example, these mechanisms could be environmental - co-morbidities, contra-indications, emotional state, age related etc. So I would be cautious about relying on data driven inferencing alone.
@bethelosuagwu80215 ай бұрын
Thanks, I got so much intuition in the course! It must have been a lot of work for you to put it together!
@bethelosuagwu80215 ай бұрын
I note that from the python code on github, if you condition on Age alone the exact effect is recovered i.e 1.05. Is this because proteinuria is a collider? Does this mean that we should not include colliders in the linear regression?
@johnmcintire36846 ай бұрын
Was there a followup question to “Assistance to the poor or welfare question?” … something like …. “Would your opinion change if you were told that “Welfare” is just another term for “Assistance to the poor” ?
@董园6 ай бұрын
Is there 5 other possible graphs in the Markov Equivalence class of the final question?
@peasant123456 ай бұрын
very interesting examples
@ear_w0rm6 ай бұрын
Turn on auto-subtitles please
@ear_w0rm6 ай бұрын
Please tell me if I understood the idea correctly - the syntax using do describes the same thing as the syntax using potential outcome. It's just a different way of presenting the same problem (identification)
@ear_w0rm6 ай бұрын
it's amazing
@ear_w0rm6 ай бұрын
At 2:53, we can prove all this using the minimality assumption, right?
@ear_w0rm7 ай бұрын
We can model the mathematical expectation with different models, right? Even those that cannot be interpreted. Will the calculation of the casual effect be correct in this case?
@sameershah1417 ай бұрын
Basic clarification question - X is subset is W? And can be same as W as well, correct? W is set of all covariates and X is set of variables on which treatment effect is conditioned.
@the_kid7777 ай бұрын
Here I was, excited to learn about the Rule 1 of do-calculus, and then seeing a bunch of symbols I don't understand in the next moment. I think a whole lot of people will understand better if you gave them the intuition first and not the symbols and notations, Brady.
@the_kid7777 ай бұрын
Thank you.
@dhananjaypoddar74867 ай бұрын
Is the answer to the question at the end 1? After drawing the essential graph, I found only one edge that is undirected (A-B), and if I am not wrong, since we know from the essential graph that there is an edge between A and B, intervening for either one of them should reveal the direction of the edge.
@jiaxinyuan340820 сағат бұрын
From my understanding at 1:44, essential graph only contain skeletons and immoralities, therefore, D-E, D-H, E-G are still undirected. But for me it's not so clear why we need to take immoralities into consideration rather than skeletons solely, cuz only undirected edges are considered when calculating clique, no? I got log2(4)+1=3 in the end, where A-B-C-D are the largest clique. (I'm not sure neither)
@ranyiwei76197 ай бұрын
very nice work
@tifaleong54307 ай бұрын
thank you so much for delivering this course!
@HanhNguyen-xx8qb8 ай бұрын
You explained complex concepts with such great clarity. Thank you for the invaluable series!