Edited, memorised or added to reading queue

on 05-Jul-2022 (Tue)

Do you want BuboFlash to help you learning these things? Click here to log in or create user.

#ML_in_Action #learning #machine #software-engineering
I had some successes but many failures, and generally left a trail of unmaintainable code in my wake as I moved from job to job. It’s not something that I’m particularly proud of. I’ve been contacted by former colleagues, years after leaving a position, to have them tell me that my code is still running every day. When I’ve asked each one of them why, I’ve gotten the same demoralizing answer that has made me regret my implementations: “No one can figure it out to make changes to it, and it’s too important to turn off.” I’ve been a bad data scientist. I’ve been an even worse ML engineer. It took me years to learn why that is. That stubbornness and resistance to solving problems in the simplest way created a lot of headaches for others, both in the sheer number of cancelled projects while I was at companies and in the unmaintainable technical debt that I left in my wake. It wasn’t until my most recent job, working as a resident solutions architect at Data- bricks (essentially a vendor field consultant), that I started to learn where I had gone wrong and to change how I approached solving problems.
statusnot read reprioritisations
last reprioritisation on suggested re-reading day
started reading on finished reading on

pdf

cannot see any pdfs




#ML_in_Action #learning #machine #software-engineering
Part 2 (chapters 9–13) covers the development process of ML projects. With examples (both good and bad) of ML solution development, this section carries you through proven methods of building, tuning, logging, and evaluating an ML solution to ensure that you’re building the simplest and most maintainable code possible
statusnot read reprioritisations
last reprioritisation on suggested re-reading day
started reading on finished reading on

pdf

cannot see any pdfs





#causality #has-images #statistics
This is known as M-bias
statusnot read reprioritisations
last reprioritisation on suggested re-reading day
started reading on finished reading on


Parent (intermediate) annotation

Open it
This is known as M-bias due to the M shape that this non-causal association flows along when the graph is drawn with children below their parents.

Original toplevel document (pdf)

cannot see any pdfs




Flashcard 7101970517260

Tags
#causality #has-images #statistics


Question
In defining the backdoor criterion (Definition 4.1) for the backdoor adjustment (Theorem 4.2), not only did we specify that the adjustment set 𝑊 blocks [...], but we also specified that 𝑊 does not contain any descendants of 𝑇
Answer
all backdoor paths

statusnot learnedmeasured difficulty37% [default]last interval [days]               
repetition number in this series0memorised on               scheduled repetition               
scheduled repetition interval               last repetition or drill

Parent (intermediate) annotation

Open it
In defining the backdoor criterion (Definition 4.1) for the backdoor adjustment (Theorem 4.2), not only did we specify that the adjustment set 𝑊 blocks all backdoor paths, but we also specified that 𝑊 does not contain any descendants of 𝑇

Original toplevel document (pdf)

cannot see any pdfs







#causality #statistics
“Potential outcomes” are sometimes referred to as “counterfactual outcomes,” but we will never do that in this book because a potential outcome 𝑌(𝑡) does not become counter to fact until another potential outcome 𝑌(𝑡 0 ) is observed
statusnot read reprioritisations
last reprioritisation on suggested re-reading day
started reading on finished reading on


Parent (intermediate) annotation

Open it

Original toplevel document (pdf)

cannot see any pdfs




Flashcard 7101975497996

Tags
#causality #statistics
Question
An “estimator” is a function that takes a dataset as input and outputs [...].
Answer
an estimate

statusnot learnedmeasured difficulty37% [default]last interval [days]               
repetition number in this series0memorised on               scheduled repetition               
scheduled repetition interval               last repetition or drill

Parent (intermediate) annotation

Open it
An “estimator” is a function that takes a dataset as input and outputs an estimate.

Original toplevel document (pdf)

cannot see any pdfs







Flashcard 7101977070860

Tags
#causality #statistics
Question
More generally, the potential outcome 𝑌(𝑡) denotes what your outcome would be, if you were to take treatment 𝑡 . A potential outcome 𝑌(𝑡) is distinct from the observed outcome 𝑌 in that not all potential outcomes are observed. Rather all potential outcomes can potentially be observed. The one that is actually observed depends on the value that the [...] takes on
Answer
treatment 𝑇

statusnot learnedmeasured difficulty37% [default]last interval [days]               
repetition number in this series0memorised on               scheduled repetition               
scheduled repetition interval               last repetition or drill

Parent (intermediate) annotation

Open it

Original toplevel document (pdf)

cannot see any pdfs