Resources


Database Open Access

I-CARE: International Cardiac Arrest REsearch consortium Database

Edilberto Amorim, Wei-Long Zheng, Jong Woo Lee, Susan Herman, Mohammad Ghassemi, Adithya Sivaraju, Nicolas Gaspard, Jeannette Hofmeijer, Michel J A M van Putten, Matthew Reyna, Gari Clifford, Brandon Westover

The clinical and EEG data for this dataset originates from seven academic hospitals in the U.S. and Europe led by investigators part of the International Cardiac Arrest REsearch consortium (I-CARE).

Published: Dec. 14, 2023. Version: 2.1

Visualize waveforms

Database Credentialed Access

Medication Extraction Labels for MIMIC-IV-Note Clinical Database

Akshay Goel, Almog Gueta, Omry Gilon, Sofia Erell, Amir Feder

Medication extraction NLP labels for 600 discharge summaries in MIMIC-IV-Note dataset.

Published: Dec. 12, 2023. Version: 1.0.0


Database Credentialed Access

CAD-Chest: Comprehensive Annotation of Diseases based on MIMIC-CXR Radiology Report

Mengliang Zhang, Xinyue Hu, Lin Gu, Tatsuya Harada, Kazuma Kobayashi, Ronald Summers, Yingying Zhu

The CAD-Chest dataset provides comprehensive annotations of disease, including disease severity, uncertainty, and location based on the MIMIC-CXR radiologist reports.

chesr x-ray disease label

Published: Dec. 8, 2023. Version: 1.0


Database Open Access

Simulated Obstructive Disease Respiratory Pressure and Flow

Jaimey Anne Clifton, Ella Frances Sophia Guy, Trudy Caljé-van der Klei, Jennifer Knopp, James Geoffrey Chase

Outlined is a pressure, flow, and volume dataset using a using a modular device to simulate the effects of obstructive pulmonary disease in healthy people. 20 healthy subjects were included in this dataset.

Published: Nov. 13, 2023. Version: 1.0.0


Challenge Credentialed Access

BioNLP Workshop 2023 Shared Task 1A: Problem List Summarization

Yanjun Gao, Dmitriy Dligach, Timothy Miller, Majid Afshar

This is the data storage for BioNLP Workshop Shared Task 1A: Problem List Summarization.

bionlp clinical natural language processing electronic health record summarization

Published: Nov. 12, 2023. Version: 2.0.0


Database Open Access

Respiratory dataset from PEEP study with expiratory occlusion

Ella Frances Sophia Guy, Jaimey Anne Clifton, Trudy Caljé-van der Klei, Rongqing Chen, Jennifer Knopp, Knut Moeller, James Geoffrey Chase

Outlined is a pressure, flow, volume, dynamic circumference, and EIT assessed aeration dataset from resting breathing with REO at increasing CPAP PEEP settings. Vapers, asthmatics, smokers, and otherwise healthy people were included in the trial.

Published: Nov. 10, 2023. Version: 1.0.0


Database Credentialed Access

BOLD, a blood-gas and oximetry linked dataset

João Matos, Tristan Struja, Jack Gallifant, Luis Filipe Nakayama, Marie Charpignon, Xiaoli Liu, Jaime dos Santos Cardoso, Leo Anthony Celi, An Kwok Wong

An open-source pulse oximetry and arterial blood gas dataset, derived from MIMIC-III, MIMIC-IV, and eICU-CRD

electronic health records health equity pulse oximetry intensive care unit

Published: Nov. 8, 2023. Version: 1.0


Model Credentialed Access

Characterization of Stigmatizing Language in Medical Records

Keith Harrigian, Ayah Zirikly, Brant Chee, Alya Ahmad, Anne Links, Somnath Saha, Mary Catherine Beach, Mark Dredze

A suite of classifiers for detecting three types of stigmatizing language in electronic medical records. Trained on MIMIC-IV discharge notes.

mimic clinical natural language processing large language models domain transfer bias stigmatizing language

Published: Nov. 6, 2023. Version: 1.0.0


Software Open Access

Transformer-DeID: Deidentification of free-text clinical notes with transformers

Callandra Moore, Lucas Bulgarelli, Tom Pollard, Alistair Johnson

Fine tune transformer-based neural networks to deidentify clinical text data.

deidentification neural networks transformers

Published: Nov. 2, 2023. Version: 1.0.0


Database Contributor Review

CARMEN-I: A resource of anonymized electronic health records in Spanish and Catalan for training and testing NLP tools

Eulalia Farre Maduell, Salvador Lima-Lopez, Santiago Andres Frid, Artur Conesa, Elisa Asensio, Antonio Lopez-Rueda, Helena Arino, Elena Calvo, Maria Jesús Bertran, Maria Angeles Marcos, Montserrat Nofre Maiz, Laura Tañá Velasco, Antonia Marti, Ricardo Farreres, Xavier Pastor, Xavier Borrat Frigola, Martin Krallinger

CARMEN-I is a Spanish corpus of 2,000 clinical records from Hospital Clínic, Barcelona. It covers COVID-19 patients and comorbidities, serving as a resource for training clinical NLP models and researchers in NLP applied to clinical documents.

de-identification anonymization clinical ner

Published: Nov. 2, 2023. Version: 1.0