Pooled Time-Series Cross-Sectional Analysis
Dr Robert Walker, Atkinson Graduate School of Management, Willamette University
10 - 21 August (two week course / 35 hrs)
Dr. Robert W. Walker is Associate Professor of Quantitative Methods in the Atkinson Graduate School of Management at Willamette University (2012-). He earned a Ph. D. in political science from the University of Rochester in 2005 and has previously held teaching positions at Dartmouth College, Rice University, Texas A&M University, and Washington University in Saint Louis. His current research develops and applies semi-Markov processes to time-series, cross-section data in international relations and international/comparative political economy. He teaches courses in quantitative methods/applied statistics and microeconomic strategy and previously taught four iterations in the U. S. National Science Foundation funded Empirical Implications of Theoretical Models sequence at Washington University in Saint Louis.
- This course is designed for students who already have training in basic statistics and knowledge of linear regression analysis. The course deals with problems arising from combining the time and space dimensions in statistical data analysis. We will work with aggregated time series cross-sectional data e.g. countries/firms/individuals over time. This data structure has the advantage of allowing for testing highly general theories with a wide scope but renders data analysis more complicated because one has to consider the time series aspects (dynamics) and cross-sectional aspects (spatial correlation/unit heterogeneity) at the same time. The course examines the problems arising from this complex data structure and provides techniques to control and account for specific complications. We will start out by discussing characteristics and types of pooled data and underlying assumptions of basic statistical models for panel data. We then address specification problems such as complex error structures, different kinds of heterogeneity (e.g. unit and slope), dynamic specification issues (lag structures), missing data, spatial heterogeneity and dependency, time invariant and rarely changing variables in panel data analysis with correlated unit specific effects among others. Furthermore, we will look at different data generating processes and adequate estimation procedures for limited dependent variables. The course combines a more theoretical introduction with practical analysis of diverse data sets using STATA. Students are encouraged to bring their own data sets and present their research puzzles as motivating examples.
- The course requires knowledge of inferential statistics and considerable linear algebra (matrices) and is designed to further develop the understanding of statistical problems arising from the complex structure of pooled data. The course mostly deals with questions of specification and model choice; it is a practical course that enables students to more tightly link empirical models with their theoretical arguments and make model choices that are adequate for the data structure at hand. The course materials are designed to help participants to solve their own estimation problems and increase the reliability and efficiency of their statistical results. The course is targeted to social scientists, business academics with average (or better) statistical skills with a strong interest in applied empirical research and data analysis. The focus lies on practical problems of macro panel data analysis.
- The course requires average or better skills and knowledge in inferential statistics, including basic understanding of maximum likelihood and generalized linear estimation methods. In addition, participants should have an understanding of matrix algebra. In addition, participants need a basic familiarity with STATA. The course is designed to build on a good working knowledge of cross-section multiple regression models and basic multivariate time-series models. This includes knowledge of the underlying assumptions of basic linear models (principally stationarity) and how to deal with violations (heteroskedasticity, autocorrelation) of Gauss-Markov assumptions. Participants should be able to interpret regression coefficients, standard errors and significance tests.
Notes on Readings
- Most basic and introductory econometrics textbooks include a chapter on panel-data and pooled models. Wooldridge (2002), Hsiao (2003) and Baltagi (2008) are more advanced and technical treatments, the latter two with exclusive emphasis on panel data. Beck and Katz (1995) develop the work-horse model for political scientists. Beck (2001) summarizes the discussion of pooled analysis in political science. Specification and conceptual issues are discussed in Plümper et al. (2004). Wawro (2002) and Wilson et al. (2007) introduce issues arising from the dynamic dimension of panel data and compare different models/specifications for dynamic panel data. Beck and Katz (2007) discuss slope heterogeneity and the application of random co-efficients models. Plümper and Troeger (2007) raise the issue of time-invariant and rarely changing variables in panel data analysis with correlated unit specific effects and suggest a solution to the problem
Representative Backround Reading
- Baltagi, Badi H. 2008: Econometric Analysis of Panel Data, Wiley and Sons Ltd.
- Beck, Nathaniel and Jonathan Katz 1995: What to do (and not to do) with Time-Series Cross-Section Data, American Political Science Review 89: 634-647.
- Beck, Nathaniel 2001: Time-Series-Cross-Section Data: What Have We Learned in the Past Few Years? Annual Review of Political Science 4: 271-293.
- Plümper, Thomas and Vera E. Troeger 2007: Efficient Estimation of Time-Invariant and Rarely Changing Variables in Finite Sample Panel Analyses with Unit Fixed Effects: Political Analysis 15: 124-139.
- Plümper, Thomas, Troeger, Vera E. and Philip Manow 2005: Panel Data Analysis in Comparative Politics. Linking Method to Theory: European Journal of Political Research 44: 327-354.
- Wawro, Gregory 2002: Estimating Dynamic Panel Data Models in Political Science: Political Analysis 10: 25-48.
- Wilson, Sven E. and Danial M. Butler 2007: A Lot More to Do: The Sensitivity of Time-Series Cross-Section Analyses to Simple Alternative Specifications. Political Analysis 15: 101-123.
- Wooldridge, Jeffrey M. 2002: Econometric Analysis of Cross Section and Panel Data, MIT Press, Cambridge.