Module Details

CE807-7-SP-CO: Text Analytics

Year: 2016/17
Department: Computer Science and Electronic Engineering
Essex credit: 15
ECTS credit: 7.5
Available to Study Abroad / Exchange Students: No
Full Year Module Available to Study Abroad / Exchange Students for a Single Term: No
Outside Option: No

Supervisor: Professor Massimo Poesio
Teaching Staff: Professor Massimo Poesio
Contact details: School Office, e-mail csee-schooloffice (non-Essex users should add to create full e-mail address), Telephone 01206 872770.

Module is taught during the following terms
Autumn Spring Summer

Module Description

The aim of this module is to provide students with an understanding of text analytics and its applications. Students will be introduced to state of the art methods for extracting structured information (e.g. opinions about products) from unstructured textual data, in particular in social media; and to techniques for summarizing and analyzing this information.

Learning Outcomes:

After completing this module, students will be expected to be able to:

1. Use text classification techniques for a variety of applications
2. Develop systems for identifying the entities mentioned in text, the relations between them, and the opinions expressed about these entities
3. Analyze data extracted from social media such as blogs and tweets
4. Develop systems for summarizing textual information.

Outline Syllabus:

1. Text classification: techniques and applications

2. Sentiment analysis

3. Extracting information from text: entities, relations

4. Summarizing textual information

5. Analyzing social media.

Learning and Teaching Methods

Mode of delivery:

2 hours of lectures per week, 2 hours of laboratory time per week.


40 per cent Coursework Mark, 60 per cent Exam Mark


Assignments: 1. Text classification and sentiment analysis (20% of module, involving the development of a system e.g., for sentiment analysis of Twitter data, assessment by code and report) to be submitted to FASer in week 20. 2. Information extraction (20% of module, involving the development of a system e.g., for disambiguation to Wikipedia of query logs, assessment by code and report) to be submitted to FASer in week 24.


  • Recommended
  • Richert and Coelho - Building Machine Learning Systems with Python (2nd ed) - Pack Press (RC)
  • Manning, Raghavan & Schutze - Introduction to Information Retrieval - Cambridge, 2008 (MRS)
  • Morris - Text Processing in Java - Colloquial Media
  • Other useful references
  • Jurafsky&Martin - Speech and Language Processing, 2nd ed. - Prentice-Hall