1E Introduction to Multilevel Models with Applications

Paul Lambert, University of Stirling
9 - 20 July (two week course / 35 hrs) 

Detailed Course Outline [PDF]

THIS COURSE IS NOW FULLY BOOKED AND WE ARE OPERATING A WAITING LIST

Course Content

Many social science datasets feature the ‘clustering’ or ‘hierarchical nesting’ of individual cases within larger units of analysis for example in household surveys there may be several individual respondents clustered within the same household. Multilevel models (also known as hierarchical random effects models) are a class of statistical models which provide analytical tools for dealing with data of this nature. They provide a convenient means to undertake regression analysis which takes account of, and can help to summarise, patterns of clustering. As such multilevel models are an important tool in social statistics, and of potential relevance to almost any study using complex data.

This course provides an applied introduction to multilevel modelling for social science datasets. It will introduce the statistical features of multilevel models, deal with approaches to handling data which has clustered or hierarchical elements, and provide training in specifying multilevel models for linear and categorical outcome variables in a variety of survey data scenarios. The course has an emphasis on the practical application of multilevel models, and will seek to convey both the attractions and limitations of a multilevel modelling approach.

Course Objectives

The course seeks to provide participants with a solid grounding in the application of multilevel models. This involves combining a strong understanding of how multilevel models are formulated (and their relationship to other types of statistical model); a fluency in handling clustered and hierarchical structures in social science data; and an ability to specify, and critically interpret the results of multilevel models in popular statistical analysis packages.

The course will provide training in handling data and specifying multilevel models in the packages Stata, SPSS/PASW, R and MLwiN. Stata features a wide range of options for both handling complex data, and specifying multilevel models, and accordingly in practical sessions we will use Stata more often than any other package. SPSS has many comparable facilities for organising data, but a more limited range of options for multilevel modelling itself. R has a particularly wide range of analysis options, but is a relatively challenging package to use for most social science data projects. MLwiN is specialist software designed explicitly for estimating multilevel models, which also provides an attractive interface for understanding the specification and properties of statistical models. Training covers examples in all of these packages, and using several different social survey datasets: this is an ambitious objective which seeks to provide participants with important operational skills which are not widely taught.

There are three benefits to studying the practical application of multilevel models. Firstly, multilevel models are important devices for exploring the character of clustered or hierarchical structures within a dataset (for example, to compare the scale of pupil-level and class-level influences in educational research on data which features pupils clustered within classes). Secondly, they are often used simply in order to adequately control for hierarchical structural features within data (that is, when a pattern of clustering is not substantively important, but does need to be controlled for). Finally, a thorough introduction and review of the practical implementation of multilevel models also serves as an effective means of refreshing understanding of the implementation and interpretation of statistical models in the social sciences more generally.

Course Prerequisites

This is an introductory course, but participants will benefit from having moderate levels of previous statistical training, and relevant experience in statistical software applications. In addition, prior to attending the course, all participants would benefit from reading a research article which uses a multilevel model, and an introductory article or chapter on the methodology (suggestions given below).

The course is suitable for participants who have received statistical training at least to the level of understanding the application of conventional regression modelling approaches (e.g. multiple regression and logistic regression), and who are fluent in popular descriptive analytical techniques and the statistical tests behind them (e.g. chi-square tests and correlation calculations). Most participants are likely to benefit from preparatory study or revision of material which covers generating and interpreting the outputs from conventional regression analyses, such as on co-efficient effects and indicators of model fit (e.g. Allison 1999; Tarling 2009). The course will take conventional regression models as its starting point, and build onwards to multilevel models and other related extensions topics in statistical modelling.

The course is also most suitable for participants with a basic level of fluency in using statistical software packages for social science data analysis. The course features lab material spanning several packages (Stata, SPSS, R and MLwiN, with Stata used most often) and using several different social science datasets. Previous exposure to the syntax languages of Stata or SPSS will be an advantage, since the practical materials involve programming in such languages. The course should be accessible to people who have little previous experience in this area, since background materials on the software packages will be made available, but students without some background in the programming of software using syntax should be prepared that extra effort will be required during the opening periods of the course in order to follow the lab exercises. In addition, in advance of the course, participants will be expected to read the extended software guide prepared for the course, which discusses and illustrates modes of operation of the packages involved (to be circulated in advance of the session).

Reading

The following text will be included in the coursepack and used throughout the course:

Hox, J.J. 2010. Multilevel Analysis: Techniques and Applications , 2nd Edition. London: Routledge.

Background Reading

Rabe-Hesketh, S. and Skrondal, A. 2008. Multilevel and Longitudinal Modelling Using Stata, Second Edition. College Station, Tx: Stata Press.

[top of page]