Edited, memorised or added to reading queue

on 14-Jun-2022 (Tue)

Do you want BuboFlash to help you learning these things? Click here to log in or create user.

#causality #statistics

𝔼[π‘Œ | do(𝑑), 𝑍 = 𝑧] vs 𝔼[π‘Œ | 𝑍 = 𝑧]

𝔼[π‘Œ | 𝑍 = 𝑧] simply refers to the expected value in the (pre-intervention) population where individuals take whatever treatment they would normally take ( 𝑇 ). This distinction will become important when we get to counterfactuals...

statusnot read reprioritisations
last reprioritisation on suggested re-reading day
started reading on finished reading on


Parent (intermediate) annotation

Open it
he intervention do(𝑑) occurs. For example, 𝔼[π‘Œ | do(𝑑), 𝑍 = 𝑧] refers to the expected outcome in the subpopulation where 𝑍 = 𝑧 after the whole subpopulation has taken treatment 𝑑 . In contrast, <span>𝔼[π‘Œ | 𝑍 = 𝑧] simply refers to the expected value in the (pre-intervention) population where individuals take whatever treatment they would normally take ( 𝑇 ). This distinction will become important when we get to counterfactuals in <span>

Original toplevel document (pdf)

cannot see any pdfs




Flashcard 7093198392588

Tags
#DAG #causal #edx #has-images
[unknown IMAGE 7093198916876]
[unknown IMAGE 7093193149708]

statusnot learnedmeasured difficulty37% [default]last interval [days]               
repetition number in this series0memorised on               scheduled repetition               
scheduled repetition interval               last repetition or drill

pdf

cannot see any pdfs







#DAG #causal #inference
In this book, we stress the need to take the causal question seriously enough to articulate it, and to delineate the separate roles of data and assumptions for causal inference. Once these foundations are in place, causal inferences become necessarily less casual, which helps prevent confusion. The book describes various data analysis approaches to estimate the causal effect of interest under a particular set of assumptions when data are collected on each individual in a population. A key message of the book is that causal inference cannot be reduced to a collection of recipes for data analysi
statusnot read reprioritisations
last reprioritisation on suggested re-reading day
started reading on finished reading on

pdf

cannot see any pdfs




#DAG #causal #inference
This is not a philosophy book. We remain agnostic about metaphysical concepts like causality and cause. Instead, we focus on the identification and estimation of causal effects in populations, that is, numerical quantities that measure changes in the distribution of an outcome under different interventions.
statusnot read reprioritisations
last reprioritisation on suggested re-reading day
started reading on finished reading on

pdf

cannot see any pdfs




Flashcard 7095673556236

Tags
#causality #has-images #statistics


Question
In defining the backdoor criterion (Definition 4.1) for the backdoor adjustment (Theorem 4.2), not only did we specify that the [...] set π‘Š blocks all backdoor paths, but we also specified that π‘Š does not contain any descendants of 𝑇
Answer
adjustment

statusnot learnedmeasured difficulty37% [default]last interval [days]               
repetition number in this series0memorised on               scheduled repetition               
scheduled repetition interval               last repetition or drill

Parent (intermediate) annotation

Open it
In defining the backdoor criterion (Definition 4.1) for the backdoor adjustment (Theorem 4.2), not only did we specify that the adjustment set π‘Š blocks all backdoor paths, but we also specified that π‘Š does not contain any descendants of 𝑇

Original toplevel document (pdf)

cannot see any pdfs







Flashcard 7095675653388

Tags
#causality #statistics
Question
Definition 3.2 (What is a [...]?) A variable 𝑋 is said to be a cause of a variable π‘Œ if π‘Œ can change in response to changes in 𝑋
Answer
cause

statusnot learnedmeasured difficulty37% [default]last interval [days]               
repetition number in this series0memorised on               scheduled repetition               
scheduled repetition interval               last repetition or drill

Parent (intermediate) annotation

Open it
Definition 3.2 (What is a cause?) A variable 𝑋 is said to be a cause of a variable π‘Œ if π‘Œ can change in response to changes in 𝑋

Original toplevel document (pdf)

cannot see any pdfs







Flashcard 7095678274828

Tags
#causality #statistics
Question

𝔼[π‘Œ | do(𝑑), 𝑍 = 𝑧] vs 𝔼[π‘Œ | 𝑍 = 𝑧]

[...] simply refers to the expected value in the (pre-intervention) population where individuals take whatever treatment they would normally take ( 𝑇 ). This distinction will become important when we get to counterfactuals...

Answer
𝔼[π‘Œ | 𝑍 = 𝑧]

statusnot learnedmeasured difficulty37% [default]last interval [days]               
repetition number in this series0memorised on               scheduled repetition               
scheduled repetition interval               last repetition or drill

Parent (intermediate) annotation

Open it

Original toplevel document (pdf)

cannot see any pdfs








#causality #has-images #statistics
Consider the following general kind of path, where β†’ Β· Β· Β· β†’ denotes a directed path: 𝑇 β†’ Β· Β· Β· β†’ 𝑍 ← Β· Β· Β· ← π‘Œ . Conditioning on 𝑍 , or any descendant of 𝑍 in a path like this, will induce collider bias
statusnot read reprioritisations
last reprioritisation on suggested re-reading day
started reading on finished reading on


Parent (intermediate) annotation

Open it

Original toplevel document (pdf)

cannot see any pdfs




Flashcard 7095681944844

Tags
#causality #statistics
Question

Assumption 3.2 (Minimality Assumption)

1. Given its parents in the DAG, a node 𝑋 is independent of all its non-descendants (Assumption 3.1).

2. Adjacent nodes in the DAG are [...].

Answer
dependent

statusnot learnedmeasured difficulty37% [default]last interval [days]               
repetition number in this series0memorised on               scheduled repetition               
scheduled repetition interval               last repetition or drill

Parent (intermediate) annotation

Open it
Assumption 3.2 (Minimality Assumption) 1. Given its parents in the DAG, a node 𝑋 is independent of all its non-descendants (Assumption 3.1). 2. Adjacent nodes in the DAG are dependent.

Original toplevel document (pdf)

cannot see any pdfs







Flashcard 7095683779852

Tags
#causality #has-images #statistics


Question
If we condition on a descendant of 𝑇 that isn’t a [...], it could unblock a path from 𝑇 to π‘Œ that was blocked by a collider. For example, this is the case with conditioning on 𝑍 in Figure 4.13. This induces non-causal association between 𝑇 and π‘Œ , which biases the estimate of the causal effect
Answer
mediator

statusnot learnedmeasured difficulty37% [default]last interval [days]               
repetition number in this series0memorised on               scheduled repetition               
scheduled repetition interval               last repetition or drill

Parent (intermediate) annotation

Open it
If we condition on a descendant of 𝑇 that isn’t a mediator, it could unblock a path from 𝑇 to π‘Œ that was blocked by a collider. For example, this is the case with conditioning on 𝑍 in Figure 4.13. This induces non-causal association between 𝑇 a

Original toplevel document (pdf)

cannot see any pdfs







Flashcard 7095685614860

Tags
#Data #GAN #reading #synthetic
Question

[...] GAN - Summary

There are many ways of dealing with this problem using different levels of granularity. We presented one that can be easily extended to handle temporal sequences. Its main advantages are modularity and scalability as it can be extended to multiple granularity levels.

The main drawback of this approach is that it may become difficult to implement if the database contains too many one to many relations and nested hierarchical levels.

Answer
hierarchical

statusnot learnedmeasured difficulty37% [default]last interval [days]               
repetition number in this series0memorised on               scheduled repetition               
scheduled repetition interval               last repetition or drill

Parent (intermediate) annotation

Open it
hierarchical GAN - Summary There are many ways of dealing with this problem using different levels of granularity. We presented one that can be easily extended to handle temporal sequences. Its main

Original toplevel document (pdf)

cannot see any pdfs







[unknown IMAGE 7095689547020] #ML_in_Action #has-images #learning #machine #software-engineering
statusnot read reprioritisations
last reprioritisation on suggested re-reading day
started reading on finished reading on

pdf

cannot see any pdfs




#ML_in_Action #learning #machine #software-engineering
Machine Learning Engineering in Action
statusnot read reprioritisations
last reprioritisation on suggested re-reading day
started reading on finished reading on

pdf

cannot see any pdfs




#ML_in_Action #learning #machine #software-engineering
I had some successes but many failures, and generally left a trail of unmaintainable code in my wake as I moved from job to job. It’s not something that I’m particularly proud of. I’ve been contacted by former colleagues, years after leaving a position, to have them tell me that my code is still running every day. When I’ve asked each one of them why, I’ve gotten the same demoralizing answer that has made me regret my implementations: β€œNo one can figure it out to make changes to it, and it’s too important to turn off.” I’ve been a bad data scientist. I’ve been an even worse ML engineer. It took me years to learn why that is. That stubbornness and resistance to solving problems in the simplest way created a lot of headaches for others, both in the sheer number of cancelled projects while I was at companies and in the unmaintainable technical debt that I left in my wake. It wasn’t until my most recent job, working as a resident solutions architect at Data- bricks (essentially a vendor field consultant), that I started to learn where I had gone wrong and to change how I approached solving problems.
statusnot read reprioritisations
last reprioritisation on suggested re-reading day
started reading on finished reading on

pdf

cannot see any pdfs




#ML_in_Action #learning #machine #software-engineering
Part 1 (chapters 1–8) is focused primarily on the management of ML projects from the perspective of a team lead, manager, or project lead. It lays out a blue- print for scoping, experimentation, prototyping, and inclusive feedback to help you avoid falling into solution-building traps
statusnot read reprioritisations
last reprioritisation on suggested re-reading day
started reading on finished reading on

pdf

cannot see any pdfs




#ML_in_Action #learning #machine #software-engineering
Part 2 (chapters 9–13) covers the development process of ML projects. With examples (both good and bad) of ML solution development, this section carries you through proven methods of building, tuning, logging, and evaluating an ML solution to ensure that you’re building the simplest and most maintainable code possible
statusnot read reprioritisations
last reprioritisation on suggested re-reading day
started reading on finished reading on

pdf

cannot see any pdfs




#ML_in_Action #learning #machine #software-engineering
Part 3 (chapters 14–16) focuses on β€œthe after”: specifically, considerations related to streamlining production release, retraining, monitoring, and attribution for a project. With examples focused on A/B testing, feature stores, and a passive retraining system, you’ll be shown how to implement systems and architectures that can ensure that you’re building the minimally complex solution to solve a business problem with ML
statusnot read reprioritisations
last reprioritisation on suggested re-reading day
started reading on finished reading on

pdf

cannot see any pdfs




#ML_in_Action #learning #machine #software-engineering
The vast majority of the time, a project that fails to make its way to production for sustained utility has issues that are rooted in the very early phases. Before even a single line of code is written, before a serving architecture is selected and built out, and long before a decision on scalable training is made, a project is doomed to either cancellation or unused obscurity if planning, scoping, and experimentation is not done properly.
statusnot read reprioritisations
last reprioritisation on suggested re-reading day
started reading on finished reading on

pdf

cannot see any pdfs