Research Group

Data science

Data science unifies statistics, data analysis and their related methods in order to understand and analyse phenomena in applications ranging from healthcare to finance.

It is an interdisciplinary research area that employs techniques and theories drawn from many fields. In our department our main areas of research interest are statistics and actuarial science, and operational research.

Many of our academics are also members of the university's Institute for Analytics and Data Science, and work closely with the Institute for Social and Economic Research, the School of Computer Science and Electronic Engineering, and the School of Life Sciences.

Statistics and actuarial science

Statistics is concerned with modelling and analysing data, designing experiments, and making inferences, predictions and decisions in the presence of risk.

Operational Research

Operational Research deals with the application of mathematical optimisation to help make better decisions and arrive at optimal or near-optimal solutions to complex decision-making problems.

At Essex our work has an emphasis on human-technology interaction and focuses on practical applications that overlap with other disciplines, notably industrial engineering and operations management.

You can find out more about our academic's areas of research interest using the staff list below.

Essex Data Science Seminar Series

Our group runs a regular research seminar series throughout the academic year. Along with hosting talks from our academics and research students, we also invite experts from other institutions to present their latest work.

Our seminars are open to anyone at the University of Essex who may be interested in the topic being discussed.

Upcoming research seminars

4th March 2021 - "Sampling-Assisted Inference of Intractable Models" - Bo Zhang, University of Essex.

25th February 2021 - "Symmetric measures of variability induced by risk measures" - Dr Tolulope Fadina, University of Essex.

18th February 2021 - "Using A.I. and street-view images for estimating socio-economic indicators" - Dr Mario Gutiérrez-Roig, University of Essex.

28th January 2021 - "Selection bias, missing data and causal inference" - Professor Kate Tilling, University of Bristol.

21st January 2021 - "Linear Algebra and Neural Approaches for Representation Learning" - Dr Tingting Mu, University of Manchester.

Highlights of Autumn 2020 seminars

EpiViz: an implementation of Circos plots for epidemiologists

Matt Lee, a PhD student from the University of Bristol, delivered a talk on the use of Circos plots in epidemiology.

Biological pathways involve numerous processes, but epidemiology studies predominantly focus on single exposure and single outcome associations. This is primarily because identifying meaningful intermediate associations that can be taken forward for further analysis is complex.

In his talk, Matt discussed how tools like EpiViz can be used to produce simple and efficient Circos plots for those new to programming and data visualisation. By giving people a tool that makes data visualisation easier to produce, epidemiologists can gain a better understanding of the results of complex epidemiological studies. Greater insight in to the results can help increase the impact of such studies.

Related papers

Matthew A Lee, George McMahon, Ville Karhunen, Kaitlin H Wade, Laura J Corbin, David A Hughes, George Davey Smith, Debbie A Lawlor, Marjo-Riitta Jarvelin, Nicholas J Timpson, Common variation at 16p11.2 is associated with glycosuria in pregnancy: findings from a genome-wide association study in European women, Human Molecular Genetics, Volume 29, Issue 12, 15 June 2020, Pages 2098–2106

A Statistician’s Botanical Garden - The Ideas behind Trees, Model-Based Trees and Random Forests

Classification and regression trees, model-based trees and random forests are powerful statistical methods from the field of machine learning. However, while individual trees are easy to interpret, random forests are "black box" prediction methods. Despite this, they provide variable importance measures, that are being used to judge the relevance of the individual predictor variables.

In this seminar, Professor Carolin Strobl introduced the rationale behind trees, model-based trees and random forests, and illustrated their potential for high-dimensional data exploration, while also pointing out limitations and potential pitfalls in their practical application.

Related papers

Fokkema, M., & Strobl, C. (2020). Fitting prediction rule ensembles to psychological research data: An introduction and tutorial. Psychological Methods, 25(5), 636–652.

Detecting the hierarchical structure of the cell nucleus

Chromatin consists of DNA wrapped around histones and forms complex three-dimensional structures within the cell nucleus with various degrees of compaction.

Genes have been shown to be repressed by their proximity to the nuclear periphery or activated by being in contact with special regulatory regions called enhancers. Thus the relative positioning of genes and their interactions with other regions are very important in determining whether they are expressed or not.

In this talk, Iona Olan from the University of Cambridge discussed her work on cellular senescence, a phenotype associated with dramatic changes in its chromatin interactions network relative to normal cells. Senescence corresponds to permanent cell cycle arrest and has been shown to act as a protective barrier against tumourigenesis.

Related papers

Kosuke Tomimatsu, Dóra Bihary, Ioana Olan, Aled Parry, Stefan Schoenfelder, Adelyne Chan, Guy Slater, Yoko Ito, Peter Rugg-Gunn, Kristina Kirschner, Camino Bermejo-Rodriguez, Masako Narita, Tomomi Seko, Hiroyuki Kugoh, Ken Shiraishi, Koji Sayama, Hiroshi Kimura, Peter Fraser, Shamith Samarajiwa, Masashi Narita, Locus-specific induction of gene expression from heterochromatin loci during cellular senescence, Nature Research, pre-print.

Our academics

Dr Joseph Bailey

Lecturer in Environmetrics

Department of Mathematical Sciences, University of Essex

Research area: Statistics

Dr Yanchun Bao

Lecturer in Data Science and Statistics

Department of Mathematical Sciences, University of Essex

Research area: Statistics

Dr Hongsheng Dai

Reader in Statistics

Department of Mathematical Sciences, University of Essex

Research area: Statistics

Dr Tolulope Fadina

Lecturer in Actuarial Science and Finance

Department of Mathematical Sciences, University of Essex

Research area: Actuarial science.

Dr Mario Gutierrez-Roig

Lecturer in Data Science and Statistics

Department of Mathematical Sciences, University of Essex

Research area: Statistics

Dr Stella Hadjiantoni

Lecturer in Data Science and Statistics

Department of Mathematical Sciences, University of Essex

Research area: Statistics

Dr Andrew Harrison

Senior Lecturer in Data Science

Department of Mathematical Sciences, University of Essex

Research area: Statistics

Dr Junlei Hu

Lecturer in Actuarial Science

Department of Mathematical Sciences, University of Essex

Research area: Actuarial science

Professor Berthold Lausen

Professor of Data Science

Department of Mathematical Sciences, University of Essex

Research area: Statistics.

Dr Peng Liu

Lecturer in Actuarial Science and Finance

Department of Mathematical Sciences, University of Essex

Research area: Actuarial science.

Dr Osama Mahmoud

Lecturer in Data Science and Statistics

Department of Mathematical Sciences, University of Essex

Research area: Statistics.

Dr Fanlin Meng

Lecturer in Data Science

Department of Mathematical Sciences, University of Essex

Research area: Statistics

Dr Yassir Rabhi

Lecturer in Data Science and Statistics

Department of Mathematical Sciences, University of Essex

Research area: Statistics.

Professor Abdellah Salhi

Professor of Operational Research

Department of Mathematical Sciences, University of Essex

Research area: Operational research

Dr Spyridon Vrontos

Senior Lecturer in Actuarial Science

Department of Mathematical Sciences, University of Essex

Research area: Actuarial science

Dr Jackie Wong Siaw Tze

Lecturer in Actuarial Science

Department of Mathematical Sciences, University of Essex

Research area: Statistics, actuarial science.

Dr Xinan Yang

Senior Lecturer in Operational Research

Departmental of Mathematical Sciences, University of Essex

Research area: Operational research