Course catalogue
This course catalogue provides you with information about the analytical courses available on the Office for National Statistics’ Learning Hub. You will need a Learning Hub account to access the training materials presented here. More information is available on how to request a Learning Hub account.
Use the links in the table of contents to navigate to a course that you are interested in. For each course you will find the suggested course length, delivery type, skill level, course developer and a link to register.
Table of contents
- Introduction to R
- Introduction to Python
- Introduction to Data Visualisation
- Data Visualisation in R
- Data Visualisation in Python
- Introduction to Natural Language Processing in R
- Introduction to Natural Language Processing in Python
- Natural Language Processing in Python
- Introduction to Machine Learning in R
- Introduction to Machine Learning in Python
- Statistics in R
- Statistics in Python
- Quality Assurance of Predictive Modelling
- Modular Programming in Python and R
- Introduction to Pyspark
- Foundations of SQL
- How to request Learning Hub access
1. Introduction to R
This newly redesigned Introduction to R is a 2-day course with a week between the sessions to allow assimilation of learning and time to practice. This course focuses on applying skills throughout, and building confidence, independence, and resilience so that you can continue your learning beyond the classroom. No prior knowledge is needed to take part in this course.
Length | 2 days |
Course type | Online and face-to-face |
Skill level | Intermediate |
Developed by | Analysis Function |
Register | Learning Hub |
2. Introduction to Python
This newly redesigned Introduction to Python is a 2-day course with a week between the sessions to allow assimilation of learning and time to practice. This course focuses on applying skills throughout, and building confidence, independence, and resilience so that you can continue your learning beyond the classroom. No prior knowledge is needed to take part in this course.
Length | 2 days |
Course type | Online and face-to-face |
Skill level | Intermediate |
Developed by | Analysis Function |
Register | Learning Hub |
3. Introduction to Data Visualisation
This is a theoretical course that covers the best practices in how to present your data in tables and graphs, following the guidelines for official statistics.
Length | 1 hour |
Course type | Online |
Skill level | Beginner |
Developed by | Analysis Function |
Register | Learning Hub |
4. Data Visualisation in R
This course explains how to take the best practice principles from Introduction to Data Visualisation and practically apply these using the programming language of R. It will take you from first principles through to producing production ready visualisation.
Length | 12 hours |
Course type | Online |
Skill level | Intermediate |
Developed by | Analysis Function |
Register | Learning Hub |
5. Data Visualisation in Python
This course explains how to take the best practice principles from Introduction to Data Visualisation and practically apply these using the programming language of Python. It will take you from first principles through to producing production ready visualisation.
Length | 12 hours |
Course type | Online |
Skill level | Intermediate |
Developed by | Analysis Function |
Register | Learning Hub |
6. Introduction to Natural Language Processing in R
In this introductory course we will cover the basics of Natural Language Processing (NLP) topics including the process of ‘cleaning’ a dataset, exploring it and applying simple feature engineering techniques to transform the data which will help you for expanding your analysis capabilities.
You will be able to:
- Understand the fundamental concepts and techniques of natural language processing
- Learn to use a regular expression to extract patterns from text
- Understand and apply the necessary steps to ‘clean’, explore and transform the dataset in the appropriate order
- Implement text analysis (sentiment analysis) with real datasets.
Length | 2 days |
Course type | Online |
Skill level | Intermediate |
Developed by | Analysis Function |
Register | Learning Hub |
7. Introduction to Natural Language Processing in Python
Natural Language Processing (NLP) is a sub-field of Artificial Intelligence. It is used for processing and analysing natural language, and can be scaled up to large amounts of text. This is an Introduction to Natural Language Processing, and thus the main concepts are about text cleaning, processing, exploring datasets, and becoming familiar with working with text based data in Python.
Length | 2 days |
Course type | Online |
Skill level | Intermediate.
It is suggested to have completed the Introduction to Python course. |
Developed by | Data Science Campus |
Register | Learning Hub or contact us |
8. Natural Language Processing in Python
This course adds further topics, building on content from the Introduction to Natural Language Processing (NLP) in Python course. You will learn important components used for modelling text including numerical feature representation for text and basic forms of modelling language. This course assumes the introduction course has been completed, and some machine learning experience.
Length | 4 to 6 hours |
Course type | Online |
Skill level | Intermediate.
You need to have completed Introduction to NLP in Python. You should also have completed Introduction to Machine Learning in Python, or have similar experience in Machine Learning. |
Developed by | Data Science Campus |
Register | Learning Hub |
9. Introduction to Machine Learning in R
This course builds on the fundamentals of programming and statistical concepts to introduce the domain of machine learning in R. The course employs the state of the art “mlr3” machine learning package for implement of classification, regression and cluster analysis experiments.
Length | 3 days |
Course type | Online |
Skill level | Intermediate |
Developed by | Analysis Function |
Register | Learning Hub |
10. Introduction to Machine Learning in Python
Machine learning is becoming more and more integrated in modern data analysis. Before performing advanced prediction, basic concepts in machine learning modelling must be understood and applied. In this newly refreshed course you will learn about a range of machine learning topic areas as well as how to actually perform the modelling operations using the Python programming language and the package scikit-learn. The course covers key aspects of machine learning including:
- Data preparation
- Regression
- Classification
- Dimension reduction
- Clustering
Length | 3 days |
Course type | Online and face-to-face |
Skill level | Intermediate.
You need to have taken Introduction to Python and have experience analysing data in Python, using pandas, Knowledge of basic statistics; probability and distributions and experience using Jupyter Notebooks is required. |
Developed by | Data Science Campus |
Register | Learning Hub or contact us |
11. Statistics in R
This course is for someone interested in learning statistical methods and how to apply those methods in R. It is an even split of theory and coding and covers:
- Exploratory Data Analysis
- Statistical Tests
- Linear Regression
- Model Adequacy and Model Selection
- Generalised Linear Models
Length | 16 hours |
Course type | Online |
Skill level | Intermediate |
Developed by | Analysis Function |
Register | Learning Hub |
12. Statistics in Python
This course introduces the basics of carrying out statistical analysis in Python. It covers exploratory data analysis and constructing and interpreting linear and generalised linear models in Python.
Length | 12 hours |
Course type | Online |
Skill level | Intermediate |
Developed by | Data Science Campus |
Register | Learning Hub |
13. Quality Assurance of Predictive Modelling
As models are used more frequently to make predictions and decisions across industries it becomes important that good practice is followed in model design and usage. In this course we will explore important issues that arise in the field of statistical modelling and machine learning for prediction. The course is a series of modules exploring quality issues related to predictive modelling.
Length | 6 to 8 hours |
Course type | Online |
Skill level | Intermediate.
This course is relevant for both model assurers and analysts. Some statistical experience is expected although no programming experience is required. Analysts would benefit from completing Introduction to Machine Learning in Python, or Machine Learning in R before this course, although it is not required. |
Developed by | Data Science Campus |
Register | Learning Hub |
14. Modular Programming in Python and R
As programming becomes more crucial to analysis, the need for well-structured code has increased. A key component in well written, reproducible code is the idea of “modular” design. This is the splitting up of scripts into different units which have specified processes and outputs.
In this course you will learn important concepts relating to modular design, how to convert code to functions and modules in either R or Python.
Length | 4 to 5 hours |
Course type | Online |
Skill level | Intermediate.
Participants will need:
|
Developed by | Data Science Campus |
Register | Learning Hub |
15. Introduction to Pyspark
This course will give you an understanding of Pyspark, the Python interface to the distributed processing tool Spark. With it, you will be able to handle huge datasets effortlessly, and process, query, and manipulate data which is beyond the reach of traditional programming languages.
Length | 2 days |
Course type | Face-to-face |
Skill level | Intermediate |
Developed by | Analysis Function |
Register | Learning Hub |
16. Foundations of SQL
This course introduces the basic syntax of SQL, which is applicable to any flavour of the language used. The course uses an online platform to complete exercises in SQLite.
The main topics covered are:
- Basic SQL queries
- Manipulating tables and their properties
- Joining tables
- Database management
Length | 6 hours |
Course type | Online |
Skill level | Intermediate |
Developed by | Analysis Function |
Register | Learning Hub |
17. How to request Learning Hub access
To access courses from our catalogue, you will need access to a Learning Hub account.
Access is offered to those working within the Government Analysis Function or working within an analytical role with a collaborating organisation. To check eligibility, for new account requests or if you require help accessing a pre-existing Learning Hub account, please email the Data Science Campus providing the following:
- Your name (given and surname)
- Organisation
- Job title
Please add “Requests for Learning Hub access” as the email subject.
Any information you provide:
- will be strictly confidential
- will only be available to the Data Science Campus Faculty
- will be stored securely on ONS systems.
We collect your data to:
- help administer the creation of your Learning Hub account
- report aggregated, anonymised summary totals of learners to the UK Statistics Authority on a quarterly basis as required by our work programme.
We will not:
- sell or rent your data to third parties.
- share your data with third parties for marketing purposes.
For more information on how the Office for National Statistics processes your data, please refer to the ONS data protection privacy information.