Scientific Memo

Preamble

David Hume (Wikipedia)

Experimental design is not a new concept and randomised control trials (RCTs) are our solid gold standard of doing quantitative research, when no apparent physical laws are available to validate observations. However, it is very expensive to design RCTs, not ethical or either not possible due to logistical reasons in some cases. Then we fall into Causal Inference's heuristic frameworks, such as potential outcomes, matching, and time-series interventions in imagining counterfactuals and interventions. These methods provide immensely successful toolbox for quantitative scientist where by systems do not have any known physical laws. DAGs and do-calculus, differentiates from all these approaches that try to move away from full heuristics. In this post we try to postulate this formally in mathematical terms in the context of causal inference over observational data framework. We established that DAGs and do-calculus bring mathematically more principled way of practicing causal inference akin to theoretical physics attitude.

Definition of Heuristic Causal Inference (HeuristicCI) : Observational Data

Heuristics in general implies an algorithmic approximate solution, usually appear as numerical and statistical algorithms in causal inference whereby full RCT is not available. This can be formalised as follows,

Definition (HeuristicCI) Given dataset of $n-$dimensions $\mathscr{D} \in \mathbb{R}^{n}$ observation, having variates of $X=x_{i}$, with each having different sub-sets (categories within $x_{i}$), having at least one category of observations. We want to test causal connection between two distinct subsets of $X$, $\mathscr{S}_{1} , \mathscr{S}_{2}$, given an interventional versions or imagined counterfactual where by at least one of the subset is available, $\mathscr{S}_{1}^{int} , \mathscr{S}_{2}^{int}$. Using an algorithm $\mathscr{A}$ that processes dataset to test an effect size $\delta$ using a statistic $\beta$, as follows, $$ \delta= \beta(\mathscr{S}_{1} , \mathscr{S}_{1}^{int})-\beta(\mathscr{S}_{2} , \mathscr{S}_{2}^{int})$$ statistic $\beta$ can be result of a machine learning procedure as well and difference in $\delta$ is only a particular choice, i.e., such as Average Treatment Effect (ATE). The algorithm $\mathscr{A}$ is called HeuristicCI.

Many of the non-DAGs and do-calculus methods directly falls into this category, such as potential outcomes, uplift, matching and synthetic controls. This definition could be quite obvious to practitioners that has a good handle in mathematical definitions. Moreover, HeuristicCI implies solely data-driven approach to causality inline with Hume's pure-empirical view-point.

Primary distinction in practicing DAGs that it brings causal ordering naturally [suezen23pco] with scientist's cognitive process encoded, where by HeuristicCI search for statistical effect size that has a causal component in fully data-driven way. However, a HybridCI would entails using DAGs and do-calculus in connection with data driven approaches.

Conclusion

In this short exposition, we introduced HeuristicCI concept that category of methods that do not use DAGs and do-calculus explicitly in causal inference practice. However, we do not put a well designed RCTs in this category. Because, as a gold standard approach whereby properly encoded experimental design generates full interventional data reflecting scientist's domain knowledge.

References and Further reading

Looper repo : A resource list for causality in statistics, data science and physics
[suezen23pco] Practical Causal Ordering: Why weighted DAGs are powerful for causal inference?
Related Wikipedia articles

Please cite as follows:

@misc{suezen23hci,

title = {Mathematical Definition of Heuristic Causal Inference: What differantiates DAGs and do-calculus?},

howpublished = {\url{https://science-memo.blogspot.com/2023/11/heuristic-causal-inference.html},

author = {Mehmet Süzen},

year = {2023}

}

Postscript A: Why Pearlian Causal Inference is very significant progress for empirical science?

Judea Pearl's framework for causality sometimes referred to as “mathematisation of causality”. However, “axiomatic foundations of causal inference” is fair identification, Pearl's contribution to the field is in par with Kolmogorov's axiomatic foundations of probability. Key papers of this axiomatic foundations are published in 1993 (back-doors) [1] and 1995 (do-calculus) [2].

Original works of Axiomatic foundation for causal inference:

[1] Pearl, J., “Graphical models, causality, and intervention,” Statistical Science, Vol. 8, pp. 266–269, 1993.

[2] Pearl, J., “Causal diagrams for empirical research,” Biometrika, Vol. 82, Num. 4, pp. 669–710, 1995.

Scientific Memo

Friday, 10 November 2023

Mathematical Definition of Heuristic Causal Inference:
What differentiates DAGs and do-calculus?

Mehmet Suzen

Related

(c) Copyright 2008-2024 Mehmet Suzen (suzen at acm dot org)

Friday, 10 November 2023

Mathematical Definition of Heuristic Causal Inference: What differentiates DAGs and do-calculus?

(c) Copyright 2008-2024 Mehmet Suzen (suzen at acm dot org)

Mathematical Definition of Heuristic Causal Inference:
What differentiates DAGs and do-calculus?