Journal of Data and Information Science ›› 2023, Vol. 8 ›› Issue (1): 21-28.doi: 10.2478/jdis-2023-0006

• Opinion • Previous Articles     Next Articles

Causal inference using regression-based statistical control: Confusion in Econometrics

Fan Chao, Guang Yu()   

  1. School of Management, Harbin Institute of Technology, Harbin 150001, China
  • Received:2022-10-24 Revised:2023-01-10 Accepted:2023-01-30 Online:2023-02-20 Published:2023-02-22
  • Contact: Guang Yu (Email: yughit@126.com).

Abstract:

Regression is a widely used econometric tool in research. In observational studies, based on a number of assumptions, regression-based statistical control methods attempt to analyze the causation between treatment and outcome by adding control variables. However, this approach may not produce reliable estimates of causal effects. In addition to the shortcomings of the method, this lack of confidence is mainly related to ambiguous formulations in econometrics, such as the definition of selection bias, selection of core control variables, and method of testing for robustness. Within the framework of the causal models, we clarify the assumption of causal inference using regression-based statistical controls, as described in econometrics, and discuss how to select core control variables to satisfy this assumption and conduct robustness tests for regression estimates.

Key words: Causal Inference, Regression, Observational Studies, Econometrics, Causal Model