Summer Schools

Analytics, Data Science and Decision Making

Essex Analytics, Data Science and Decision Making Summer School


Monday 24 July - Friday 28 July 2023

The Institute for Analytics and Data Science presents its annual Summer School, bringing you cutting-edge courses across the fields of data science, analytics and decision making. 

Our Summer School provides a unique opportunity to attend state-of-the-art courses delivered by world-leading academics and industry partners from around the globe and network with peers in an invaluable forum for knowledge exchange.

Applications for our 2023 summer school have now closed. For information about future summer schools and any enquiries, please contact: iadssum@essex.ac.uk


Institute of Analytics and Data Science on social media:

Overview

Data driven policymaking is central to ensuring a stable future. At Essex, research in data science and artificial intelligence is part of our DNA.

In 1967, Tony Brooker became the founding Chair of Computer Science at Essex, having followed in Alan Turing’s footsteps at the Computing Machine Laboratory in Manchester, in the 50s.

In 2016, the first UNESCO Chair in Analytics and Data Science was conferred on Essex academic Professor Maria Fasli with the key objective of highlighting the critical role that data plays in promoting equality, sustainable development and how it can enhance people’s lives.

Now, whether you're interested in the curation and management of big data, advanced techniques and methods including artificial intelligence and statistical methods, or applications of big data in fields ranging from business and finance to bioinformatics, you'll find a set of relevant courses for you.

Teaching programme

Whether you’re exploring data science, or have advanced practice, we have a range of introductory and academic courses to suit your needs.

Some courses may have prerequisites, or preparation may be required before attending. All this information is detailed in the course outline, which you will be able to download from spring 2023.

There are typically 4 parallel sessions running each day, mainly six hours in length, and delegates are able to create a tailored programme of study by selecting from a range of different focuses. The 2023 schedule will resemble the following course outline*:

Course Outline* 

Focus Topic
Focus 1: Machine Learning
  • Introduction to Machine Learning
  • Tree-based Models for Machine Learning in Data Analytics
  • Machine Learning for Causal Inference from Observational Data
Focus 2: Advanced R Topics
  • Introduction to Data Visualisation Using R. 
  • GIS Analysis in R
  • Bayesian Analysis in R
Focus 3: Optimisation Topics
  • Synergy of Optimisation and Machine Learning 
 Focus 4: Brain Computer Interfaces
  • A Hands-On Introduction to Non-Invasive Neural Interfaces
 Cross Themes
  • Best Practice Analytics
  • Empowerment
  • Ethics, Data Protection, Security, and Liability in the Age of Big Data
  • Federated Learning
  • Introduction to Network Science
  • Introduction to Python
  • Introduction to R

2023 Course topics

More details on course topics being offered in 2023*

There will be a detailed course pack sent to delegates closer to the summer school date with further information and any changes, as well as access to a shared drive for materials on these courses.

Introduction to Python

This Introduction to Python course is for beginners. We aim to introduce fundamental programming concepts using Google Colab. We will introduce variables, data types, casting, string, Booleans, operators, lists, tuples, loops, conditions, functions, and a bit of NumPy. This course is designed for those who are coming from a non-technical background.

Data Protection, Security, Ethics and Liability in the Age of Big Data

This session aims to introduce the current EU and UK data protection regime and the changes to be brought in by the General Data Protection Regulation applicable since May 2018, in spite of Brexit. Furthermore, the session will present and allow for discussion of the specific challenges big data analytics bring, especially in light of the reports published by various data protection regulators on big data both at UK and EU levels. Special attention will be given to security requirements in data protection law.

The last two hours (with both speakers) will introduce the ethical issues arising from Big Data and present the correlative legal issues that may arise in light of Data Protection legislations and of criminal law. Torts and contracts will not be covered.

Introduction to R

R is an interactive computing environment and programming language designed for statistical analysis and graphics. Extensions to the basic capabilities of R are straightforward to produce and share with others. It is widely and increasingly used in many Big Data fields of research including bioinformatics.

Because of its power and flexibility, R is more demanding to learn than traditional statistical packages but rewards some initial effort. This course is based on tested material that we have been using for several years to help research students, postdocs and faculty get started in their own data analysis, and is refined each time based on feedback. It is aimed at people who may have little or no programming experience.

Introduction to Machine Learning

The aim of this course is to provide an introduction to Machine Learning and a discussion of the types of problems it is suitable for. The course will then introduce Kernel Machines and show how they can provide robust but flexible classifiers when the number of training points is limited.

A Hands-On Introduction to Non-Invasive Neural Interfaces

The course will introduce the basic neuroscientific principles and engineering concepts required to understand and participate to this exciting field. Participants will gain insight into the generation of electrophysiological signalsin the nervous systemand the corresponding methods of signal processing and feature extraction. Through practical exercises participant will learn how record electrophysiological signals, and design and develop Brain-Computer interfaces (BCIs) and peripheral nervous interfaces(PNIs). PNIs are devices that directly interact with the peripheral nervous system while BCIs translate patterns of brain activity into messages for artificial devices.

Introduction to TensorFlow and Deep Learning

The course introduces Tensorflow as a programming language from scratch and shows how to use it to build simple neural networks and perform backpropagation. Students are encouraged to program along with the tutor. The basic underlying workings of TensorFlow and neural networks are taught without resorting to higher-level black box packages, so that students can gain a fundamental understanding of how deep learning works. The course also gives an introductory overview of popular deep learning models, including convolutional neural networks and recurrent neural networks.

Introduction to Data Visualisation Using R

In the era of misinformation and fake news, producing data visualisations that are clear and interpretable to an audience is essential in engaging people with data. Whilst there are many software packages available to produce data graphics, many offer limited customisation of graphics, or are not easily reproducible. This course will explore tools to produce high-quality graphics using the R programming language, focussing on the “ggplot2” package. The “ggplot2” package allows almost endless customisation of data visualisations, has a number of excellent extension packages that add further flexibility, and, being in R, is entirely script-based and therefore highly reproducible. This course will equip attendants with the skills to produce high-quality data visualisations using the “ggplot2” package and extensions, and would be beneficial to people working in any field where data visualisation is important. The course will be suitable for those with little to intermediate prior programming experience.

Tree-based Models for Machine Learning in Data Analytics

In this course, participants will learn how to work with tree-based models to solve data science problems in Python. Everything from using a single tree for regression or classification to more advanced ensemble methods will be covered. Participants start learning about basic CARTs (classification and regression trees) followed by implementation of bagged trees, Random Forests, and boosted trees using the Gradient Boosting Machine, or GBM. The course will include dedicated practical sessions for these techniques and allow the participant to create high performance tree-based models for a real-world dataset.

Synergy of Optimisation and Machine Learning

In the first part of the course, we will discuss modern optimisation approaches that do not require significant investment of expertise and time in algorithm development, but still allow to tackle real-world problems.We will cover the following topics: the meaning of optimisation and the relevance to decision support/making, off-the-shelf solvers, algorithm complexity, simple exact algorithms, simple heuristics, metaheuristics, algorithm configuration and tuning. The second part of the course will provide foundations for exploiting the strong connections between optimisation and data science. In a series of exercises, you will see how the techniques studied in the first part of the course are used in machine learning. The aim is to enhance understanding, and so the usage, of optimisation within machine learning. Conversely, it is being increasingly recognised that the control of optimisation algorithms would itself benefit from application of data science techniques. We will present methods, with exercises, that are being developed for data science to improve the performance of existing optimisation methods in many real-world problems.

Overall, the course presents the close interactions between data science and optimisation. You will gain deeper understanding of the optimisation within machine learning and decision support systems, and so how to make more effective use of them.

Recurrent neural networks with Keras

This course teaches a deep understanding of how recurrent neural networks work, what they are used for, and how to implement them efficiently using Keras and Tensorflow. The day culminates with unique advanced recurrent neural network examples applied to control problems. Note that natural-language processing examples will not be covered.

Bayesian Analysis in R

Bayesian statistics are increasingly popular in many scientific disciplines. In this course, you will learn the theoretical underpinnings of Bayesian approaches and the differences between Bayesian and frequentist statistics. You will also learn how to implement, plot, and interpret Bayesian models in R. Finally, you will learn more about some of the advanced options for statistical modelling in this framework, including multi-level modelling and generalised linear approaches.

Machine learning for Causal Inference from Observational Data

This course will introduce the basic principles of causal modelling (potential outcomes, graphs, causal effects) while emphasising the key role of design and assumptions in obtaining robust estimates. It will also cover the basic principles of machine learning and the use of machine learning methods to do causal inference (e.g. methods stemming from domain adaptation and propensity scores). Lastly it will show how to implement these techniques for causal analysis and interpret the results in illustrative examples. By the end of this course participants should: understand the distinction between causal effects and associations and appreciate the key role of design and possibly untestable assumptions in the estimation of causal effects, understand the role of training and testing models on data and the use of regularization to avoid overfitting, and be able to position machine learning within the causal tool chain.

Learning Under Different Training and Testing Distributions

Systems based on machine learning methods often suffer a major challenge when applied to the real-world datasets. The conditions under which the system was developed will differ from those in which we use the system. Few sophisticated examples could be email spam filtering, stock prediction, health diagnostic, and brain-computer interface (BCI) systems, that took a few years to develop.

Will this system be usable, or will it need to be adapted because the distribution has changed since the system was first built? Apparently, any form of real-world data analysis is cursed with such problems, which arise for reasons varying from the sample selection bias or operating in non-stationary environments. This tutorial will focus on the issues of dataset shifts (e.g. covariate shift, prior-probability shift, and concept shift) and will cover transfer learning for managing to learn a satisfactory model.

Best Practice Analytics

This course will look at methods and tools that can help us create high-quality analytics and reproducible results. We will also look at how to move from a single analyst, spreadsheet driven approach to collaborative analytics that follows a best practice governance model. Adopting practices from test driven software development, we will look at how to establish an analytics process based on documentation, versioning, testing, peer review, collaboration and risk evaluation. We will use examples in R, Shiny, Python and Jupyter Notebooks to illustrate the ideas taught in the course.

The aim is to give you an understanding of the challenges you will face when running your own real-world data analytics project and introduce you to a number of principles you can follow to achieve high-quality reproducible results.

GIS Systems in R

Geographic information systems (GIS) software form a powerful tool in the analysis of many types of spatial data, from understanding political trends in different areas, mapping the spread of infectious diseases, or understanding the impacts of climate change across the globe. This course will focus on the “sf” package, and will explore the merits and functionality of working with “simple features” based objects in geographical analyses. This one-day course will familiarise users with the array of GIS packages available in R, and enable users to carry out basic GIS operations on a variety of different geographical data formats.

Introduction to Network Science

This one-day introductory course on network science will give a broad overview of the different concepts and methods commonly applied in social network analysis. We will first consider different kinds of network data and their representation and discuss the basics of network visualisation, including a hands-on example using the free software visone as an example. We will also discuss different kinds of applications and usage scenarios of network science in business and social contexts. The second part of the course will introduce exploratory and descriptive methods for the analysis of networks, at three levels of granularity: at the node level, the subgroup level, and the network level. The third part of the course will introduce inferential or statistical network analysis, including the basic ideas behind a range of models like the exponential random graph model and its various extensions, latent space models, the quadratic assignment procedure, and related techniques. We will cover the implementation of these methods in a very cursory way using R, but the focus is on the methods, not their implementation. Overall, this course is an introductory-level teaser for interested academics, practitioners, and data scientists who would like to explore what they can possibly do with their relational data in the way of exploration and prediction.

Empowerment

Empowerment measures how much an agent is in control of the world it itself perceives. This information-theoretic intrinsic motivation measure was originally conceived to understand the development of more complex behaviours via simple gradients, but has since spread to many other domains. In this course we will first cover the core idea of Empowerment, introducing basic concepts of information theory, intrinsic motivation and the perception action loop. After a short practical exercise we will then take a look at some neural network approximation, and the application of empowerment to fields such as deep learning, robot control and multi-agent reinforcement learning.

Introduction to Federated Machine Learning

This course will explore the range of training and networking challenges in relation to model exchange among client and server nodes. We will also cover the recent developments in the field and some well know solutions to enhance privacy and security in federated machine learning. The second half of the day will entail hands-on experiments using state-of-the-art open-source European federated learning platform, FEDn, which will be hosted over the Network Convergence Laboratory (NCL) in UEssex. Based on the proposed theoretical and practical sessions, the participates will have a good understanding of the topic and it will provide the foundation required to further explore the field.

* Please note: Our courses are subject to change, and if changes come through updates will be made to this list.

Social Events, Keynotes and Networking

Throughout the summer school we will be providing a variety of events to enhance your experience.

  • 24 July - Welcome Evening Reception
  • 25 July - Keynote Speaker (TBC)
  • 27 July - Formal Dinner Evening at Wivenhoe House Hotel

We are currently revising our schedule for 2023 so more information with regards to these social events will be updated in the coming months.

In the past we have had representation from a range of leading industry partners and academic institutions worldwide including BT, Fraunhofer, Microsoft, MSXInternational, Celtra, Micro Focus and Micro Focus Vertica.

A fanned out pile of booklets with an orange cover and "Transforming the World" in white text on a grey banner.
"The Big Data Summer School was a great experience! It was the most varied timetable I've seen that covers data science topics and I'm really glad I attended."
John Tipple Summer school delegate, 2018
          

Eligibility

There are no specific eligibility requirements to book a place on the Summer School. We welcome attendees (students, researchers, academics and professionals) in the emerging fields of big data, data science and analytics who would like to find out more or be updated on the current trends and developments in this exciting and fast-developing field.

We accept applicants from all levels and experience from around the world and offer a range of introductory and academic courses to suit your needs.

Whilst we don't restrict access to courses, when selecting the courses that make up your programme please pay attention to our recommended prerequisites. These prerequisites are stipulated to ensure that all attendees make the best out of their time with us.

If you are unsure whether a certain published course is right for you, please do not hesitate to contact us at iadssum@essex.ac.uk and a member of the IADS team will be happy to advise you on whether a course is suitable for your requirements and skillsets.

Fees

What is included in the fees

  • A tailored programme of courses and access to preparatory and bonus materials from our providers using a secure drive.
  • Expert providers and business industry leaders to provide you with high level content, hands-on experience and professional insight. 
  • Keynotes from experts in the field. 
  • A delegate pack full of resources, information, and memorabilia.
  • Evening social activities and networking events. 
  • Daily lunches and snacks from our on campus caterer. 
  • Certification upon completing your course, endorsed by the University of Essex and the Institute for Analytics and Data Science.

Please note: Accommodation is not included in the fees and instructions on how to book discounted accommodation on campus will be provided once your place on the summer school is confirmed. 

Tuition fee discounts are available for students at partner universities - please get in touch for details.

The fees for 2023 can be found below: 

Early registration (until 19 May 2023)

Audience Cost
Commercial participants £715
University of Essex students £500
University of Essex staff or alumni £570
Non-University of Essex students £570
Non-University of Essex Academics £610
Public sector partners £610

Standard registration (20 May to 30 June 2023)

Audience Cost
Commercial participants £770
University of Essex students £540
University of Essex staff or alumni £615
Non-University of Essex students £615
Non-University of Essex Academics £655
Public sector partners £655

Late registration (1 to 7 July 2023)

Audience Cost
Commercial participants £825
University of Essex students £580
University of Essex staff or alumni £660
Non-University of Essex students £660
Non-University of Essex Academics £705
Public sector partners £705
"Some brilliant instructors who made extremely complicated methods relatively clear and understandable…I left the course feeling confident to at least attempt to use some methods that were previously completely off-limits"
IADS Analytics, Data Science and Decision Making Summer School delegate, 2022

How to apply

Applications for our 2023 summer school have now closed. For information about future summer schools and any enquiries, please contact: iadssum@essex.ac.uk

Upon completion, you will then be notified of the outcome, and if successful, directed to the payment portals below.  If you have any further queries, please contact iadssum@essex.ac.uk

Self funding

You can pay for your place online via our Webshop which you will be sent a link to after completing the application form.

Paying by Proficio

If you are paying for your course fee using University of Essex Proficio funds, you will need to use the University of Essex Proficio platform. In addition to enrolling via Proficio, applicants should ensure they have completed the online application form. 

Paying by invoice

If your institution requires an invoice to process payment, please email us stating how much the invoice should be for, and the name and postal address of the person making the payment, to iadssum@essex.ac.uk 

After payment

Once you've paid, we will send you our enrolment guidance pack with instructions on how to finalise your place and access our preparatory materials to prepare for your training with us.

“A great networking experience”
Michael Kalu IADS Analytics, Data Science and Decision Making Summer School delegate, 2022
"The Summer School was well organised, the courses I attended were very informative and the keynotes were excellent."
Analytics and Data Science Summer School Delegate 2016

Bringing together
great minds

Over 32 essential training courses
Over 60 providers and keynotes from around the world
6 years of running successful summer training
Over 550 satisfied delegates in the past six years
Lecturer at an executive education course
Lecturer
Summer School certificates
Evening social event
Get in touch
Essex Analytics, Data Science and Decision Making Summer School Institute for Analytics and Data Science