Ministry of Justice Synthetic Data First Cross-Justice System Linking Dataset, England and Wales, 2011-2023

The Ministry of Justice (MoJ) Data First Synthetic Data Project aims to improve engagement with Data First datasets by making synthetic versions of content available to enable more rapid development of research proposals and to thereby enhance the potential for linked administrative data to improve...

Ausführliche Beschreibung

Gespeichert in:  
Bibliographische Detailangaben
Körperschaft: Großbritannien. VerfasserIn (VerfasserIn)
Medienart: Elektronisch Buch Statistik
Sprache:Englisch
Veröffentlicht: Colchester UK Data Service 2025
In:Jahr: 2025
Online-Zugang: Volltext (Resolving-System)
Verfügbarkeit prüfen: HBZ Gateway
Subito Bestelldienst: Jetzt bestellen.
Schlagwörter:
Beschreibung
Zusammenfassung:The Ministry of Justice (MoJ) Data First Synthetic Data Project aims to improve engagement with Data First datasets by making synthetic versions of content available to enable more rapid development of research proposals and to thereby enhance the potential for linked administrative data to improve understanding and outcomes across justice systems. The project has led the development of two components: a dataset generation platform and an initial release of lo-fidelity, synthetic data tables. This study includes a synthetically-generated version of the Ministry of Justice Data First cross-justice system linking dataset. Synthetic versions of all 43 tables in the MoJ Data First data ecosystem have been created. These versions can be used / joined in the same way as the real datasets. As well as underpinning training, synthetic datasets should enable researchers to explore research questions and to design research proposals prior to submitting these for approval. The code created during this exploration and design process should then enable initial results to be obtained as soon as data access is granted. The cross-justice system linking datasets allows users to join up information from data sources across the justice system (courts, prisons, probation) and should be used in conjunction with other datasets shared as part of the Data First Programme. Records relating to individual justice system users can be linked using unique identifiers provided for people involved. This connects people involved in different parts of the criminal justice system or that have interacted with the civil or family courts. This allows for longitudinal analysis and investigation of repeat appearances and interactions with multiple justice services, which will increase understanding around users, their pathways and outcomes. This dataset does not itself contain information about people or their interactions with the justice system, but acts as a lookup to identify where records in other datasets are believed to relate to the same person, using our probabilistic record linkage package, Splink. The person link table contains rows with references to all records in the individual datasets that have been linked to date plus new identifiers, generated in the linking process, which enables these records to be grouped and linked across the datasets.
DOI:10.5255/UKDA-SN-9394-1