How is Data Linked?

How is Data Linked?

How is data linked for research?

Creation of Linkage IDs

To allow data about the same person to be linked across different data collections, data linkers within a Data Linkage Unit (DLU) create unique Linkage IDs (a random string of numbers and letters).

To do this, the data custodians provide the personal information portion plus the local Record ID of each record in their data collections to the DLU. The data custodian requires approval from an HREC before providing the data. The other portion of the record containing the health, education or other data (known as content information) remains with the data custodians, meaning that the data linkers never have access to this data.

Upon receiving the personal information and Record IDs at the DLU, the data linkers assign a Linkage ID to each person. These Linkage IDs are stored on secure computer servers and can only be accessed by authorised DLU staff.

Data custodians provide regular updates of the personal information and Record IDs to the data linkers. The data linkers then check the new data against the existing personal information to see if they already have Linkage IDs for these records using a statistical probability method.

For each record that is determined to be for a new person in the system, the data linkers create a new Linkage ID which is then added to the DLU's Linkage ID collection.

Provision of linkable data to researchers

Researchers wishing to access the data that data custodians hold must undergo a stringent application process requiring approval from each data custodian and also from a HREC that certifies that the study is valid and in the public interest.

Once a project is approved, the data custodians and the staff at the DLU work together to determine which records are required for the study to ensure the minimum amount of information is provided to the researcher. The data linkers then use the Linkage IDs to create Project Linkage IDs that are specific for the approved study. They then send the Project Linkage IDs along with the Record IDs of the required records to the data custodians.

Using the Record IDs, the data custodians extract the required records from their collections and replace the personal information of each record with its matched Project Linkage ID. The researcher is then provided with the content data of each record and its corresponding Project Linkage ID by each data custodian.

Using the Project Linkage ID, the researcher can determine which records from different datasets belong to the same person without having access to the personal information in order to create a merged dataset for their analysis.

The access to and use of linked datasets is strictly controlled and managed.

shutterstock_586204247 Hospital Theatre.jpg


PHRN is closely monitoring the COVID-19 situation and has taken steps to mitigate the risk of transmission to our staff. In line with advice on social distancing, all staff at PHRN program office have moved to work from home. Staff are currently all working in line with our usual work patterns and our landline numbers are being diverted to staff at home, so will either be answered, or it will be possible to leave a voicemail. Please bear with us as we move to this new way of working. We will be participating in any meetings we need to attend remotely via Zoom. We are reviewing our work plans in light of the current situation and will make changes as required to ensure we are serving our network, researchers and community in this changing situation.

iStock-525755929 - Researcher in Meeting.jpg


WA Department of Health is seeking to establish a contract for the provision of an expert resource which will provide consultation to the Department of Health and the Department of Jobs, Tourism and Industry on a joint initiative. The ‘Strategic Linkage Advisor’, will provide strategic linkage advice, research, analysis, stakeholder management and inter-governmental coordination on matters relating to data linkage.