Basic principles of a TRE: The 5 safes

Safe data: data is treated to protect any confidentiality concerns.
Safe projects: research projects are approved by data owners for the public good.
Safe people: researchers are trained and authorised to use data safely.
Safe settings: a secure environment prevents unauthorised use.
Safe outputs: screened and approved outputs that are non-disclosive.

https://ukdataservice.ac.uk/help/secure-lab/what-is-the-five-safes-framework/

Open science and Trusted Research Environments

Dundee Data Meetup 26 November 2024
Simon Li

@manics
@penguinoops.bsky.social

Who am I?

The Health Informatics Centre

Overview

Trusted Research Environments

Sensitive data?

Sensitive or "Special category" data

Example: Can undiagnosed heart failure be detected from routine medical records?

A very rough workflow

1. Obtain approval and funding for the research project

2. What raw data do we have?

3. Extraction and Pseudonymisation of data

4. Data made available to researchers in TRE

5. Researchers analyse data

6. Researchers publish results

→ Statistical disclosure control
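
As a rough illustration of steps 3 and 6, here is a minimal sketch of pseudonymisation and of one simple form of statistical disclosure control (small-count suppression). Everything in it is an assumption for illustration and not taken from the talk: the records are invented, and SECRET_KEY, MIN_CELL_COUNT, pseudonymise and suppress_small_counts are hypothetical names. A real TRE would apply its own approved procedures and thresholds.

```python
import hashlib
import hmac
from collections import Counter

# Hypothetical secret held by the data team only, never shared with researchers.
SECRET_KEY = b"example-key-not-for-real-use"

# Hypothetical threshold: counts below this are withheld from published outputs.
MIN_CELL_COUNT = 5


def pseudonymise(patient_id: str, key: bytes = SECRET_KEY) -> str:
    """Replace an identifier with a keyed hash (HMAC-SHA256).

    The same input always maps to the same pseudonym, so records can still
    be linked, but the original ID cannot be recomputed without the key.
    """
    return hmac.new(key, patient_id.encode("utf-8"), hashlib.sha256).hexdigest()


def suppress_small_counts(counts, threshold=MIN_CELL_COUNT):
    """Return a copy of counts with values below the threshold set to None."""
    return {k: (v if v >= threshold else None) for k, v in counts.items()}


# Invented example records; not real data.
raw_records = [
    {"patient_id": "A001", "diagnosis": "no heart failure"},
    {"patient_id": "A002", "diagnosis": "no heart failure"},
    {"patient_id": "A003", "diagnosis": "heart failure"},
    {"patient_id": "A004", "diagnosis": "no heart failure"},
    {"patient_id": "A005", "diagnosis": "no heart failure"},
    {"patient_id": "A006", "diagnosis": "heart failure"},
    {"patient_id": "A007", "diagnosis": "no heart failure"},
    {"patient_id": "A008", "diagnosis": "no heart failure"},
]

# Step 3: researchers only ever see the pseudonymised records.
tre_records = [
    {"pseudo_id": pseudonymise(r["patient_id"]), "diagnosis": r["diagnosis"]}
    for r in raw_records
]

# Step 5: analysis inside the TRE (here, a simple count per diagnosis).
counts = Counter(r["diagnosis"] for r in tre_records)

# Step 6: outputs are screened before release; small cells are suppressed.
safe_counts = suppress_small_counts(counts)
print(safe_counts)
```

Small-count suppression is only one of several disclosure checks; in practice outputs are also reviewed by a person before they leave the environment.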

How do you design a TRE?

Basic principles of a TRE: The 5 safes

Balancing the 5 safes

What does a TRE look like?

Open Science: what is it?

Open science in TREs

The problem

Part of the solution...

Open infrastructure: open-source isn't enough

Federated analysis

Open collaboration

Standard Architecture for Trusted Research Environments

What do the public think?

2014: Care.data

Public trust is really important!

The NHS has learnt from that

"People in the UK overwhelmingly support the use of their health data, with appropriate safeguards, to benefit themselves and others."

Bedtime reading
