Cambridge Law Corpus, 1550-2023

The Cambridge Law Corpus (CLC) is a corpus designed for legal AI research. It consists of over 250,000 court cases from the UK. Most cases are from the 21st century, but the corpus includes cases as old as the 16th century. Together with the corpus, annotations on case outcomes for 638 cases, done by legal experts, are provided.

Bibliographic Details
Authors: Östling, Andreas (Author) ; Sargeant, Holli (Author) ; Xie, Huiyuan (Author) ; Bull, Ludwig (Author) ; Terenin, Alexander (Author) ; Jonsson, Leif (Author) ; Magnusson, Måns (Author) ; Steffek, Felix 1975- (Author)
Format: Electronic Research Data
Language: English
Published: Colchester: UK Data Service, 2024
Year: 2024
Online Access: Full text (resolving system)
Keywords: Law; legal decisions; Courts; legal records

MARC

LEADER 00000nam a22000002 4500
001 1881550559
003 DE-627
005 20240226072445.0
007 cr uuu---uuuuu
008 240226s2024 xx |||||o 00| ||eng c
024 7 |a 10.5255/UKDA-SN-856927  |2 doi 
024 8 |a 856927  |q SN 
035 |a (DE-627)1881550559 
035 |a (DE-599)KXP1881550559 
040 |a DE-627  |b ger  |c DE-627  |e rda 
041 |a eng 
084 |a 2,1  |2 ssgn 
100 1 |a Östling, Andreas  |e VerfasserIn  |4 aut 
245 1 0 |a Cambridge Law Corpus, 1550-2023  |c Östling, A., University of Uppsala, Sargeant, H., University of Cambridge, Xie, H., University of Cambridge, Bull, L., CourtCorrect, Terenin, A., University of Cambridge, Jonsson, L., Ericsson, Magnusson, M., University of Uppsala, Steffek, F., University of Cambridge 
264 1 |a Colchester  |b UK Data Service  |c 2024 
336 |a Text  |b txt  |2 rdacontent 
337 |a Computermedien  |b c  |2 rdamedia 
338 |a Online-Ressource  |b cr  |2 rdacarrier 
520 |a The Cambridge Law Corpus (CLC) is a corpus designed for legal AI research. It consists of over 250,000 court cases from the UK. Most cases are from the 21st century, but the corpus includes cases as old as the 16th century. Together with the corpus, annotations on case outcomes for 638 cases, done by legal experts, are provided. The Word files were cleaned and transformed into an XML format. PDF files were converted to textual form via optical character recognition (OCR). The resulting text files were then converted to the XML standard format. Because of legal and ethical considerations, the full Cambridge Law Corpus (CLC) is available only for research purposes, under restrictions, via Related Resources. A smaller dataset consisting of 15 selected cases from the CLC is available on the University of Cambridge Apollo Data Repository, which can also be accessed via Related Resources. The corpus was funded by the research project Legal Systems and Artificial Intelligence, which was jointly supported by the UK’s Economic and Social Research Council, part of UKRI, and the Japan Science and Technology Agency (JST), and involved collaboration between Cambridge University (the Centre for Business Research, Department of Computer Science and Faculty of Law) and Hitotsubashi University, Tokyo (the Graduate Schools of Law and Business Administration). 
650 4 |a Law 
650 4 |a legal decisions 
650 4 |a Courts 
650 4 |a legal records 
655 7 |a Forschungsdaten  |0 (DE-588)1098579690  |0 (DE-627)857755366  |0 (DE-576)469182156  |2 gnd-content 
700 1 |a Sargeant, Holli  |e VerfasserIn  |4 aut 
700 1 |a Xie, Huiyuan  |e VerfasserIn  |4 aut 
700 1 |a Bull, Ludwig  |e VerfasserIn  |4 aut 
700 1 |a Terenin, Alexander  |e VerfasserIn  |4 aut 
700 1 |a Jonsson, Leif  |e VerfasserIn  |4 aut 
700 1 |a Magnusson, Måns  |e VerfasserIn  |4 aut 
700 1 |a Steffek, Felix  |d 1975-  |e VerfasserIn  |0 (DE-588)1012084698  |0 (DE-627)660895714  |0 (DE-576)184696313  |4 aut 
856 4 0 |u https://doi.org/10.5255/UKDA-SN-856927  |x Resolving-System 
951 |a BO 
ELC |a 1 
LOK |0 000 xxxxxcx a22 zn 4500 
LOK |0 001 4491047960 
LOK |0 003 DE-627 
LOK |0 004 1881550559 
LOK |0 005 20240226072455 
LOK |0 008 240226||||||||||||||||ger||||||| 
LOK |0 040   |a DE-2619  |c DE-627  |d DE-2619 
LOK |0 092   |o n 
LOK |0 852   |a DE-2619 
LOK |0 852 1  |9 00 
LOK |0 935   |a foda 
ORI |a WA-MARC-krimdoka001.raw