Database Open Access

MIMIC-III Waveform Database

Benjamin Moody George Moody Mauricio Villarroel Gari D. Clifford Ikaro Silva

Published: April 7, 2020. Version: 1.0


When using this resource, please cite:
Moody, B., Moody, G., Villarroel, M., Clifford, G. D., & Silva, I. (2020). MIMIC-III Waveform Database (version 1.0). PhysioNet. https://doi.org/10.13026/c2607m.

Additionally, please cite the original publication:

Johnson, A. E. W., Pollard, T. J., Shen, L., Lehman, L. H., Feng, M., Ghassemi, M., Moody, B., Szolovits, P., Celi, L. A., & Mark, R. G. (2016). MIMIC-III, a freely accessible critical care database. Scientific Data, 3, 160035.

Please include the standard citation for PhysioNet:
Goldberger, A., Amaral, L., Glass, L., Hausdorff, J., Ivanov, P. C., Mark, R., ... & Stanley, H. E. (2000). PhysioBank, PhysioToolkit, and PhysioNet: Components of a new research resource for complex physiologic signals. Circulation [Online]. 101 (23), pp. e215–e220.

Abstract

The MIMIC-III Waveform Database contains 67,830 record sets for approximately 30,000 ICU patients. Almost all record sets include a waveform record containing digitized signals (typically including ECG, ABP, respiration, and PPG, and frequently other signals) and a “numerics” record containing time series of periodic measurements, each presenting a quasi-continuous recording of vital signs of a single patient throughout an ICU stay (typically a few days, but many are several weeks in duration). A subset of this database contains waveform and numerics records that have been matched and time-aligned with MIMIC-III Clinical Database records.


Background

The MIMIC-III Waveform Database contains thousands of recordings of multiple physiologic signals (“waveforms”) and time series of vital signs (“numerics”) collected from bedside patient monitors in adult and neonatal intensive care units (ICUs).

The MIMIC-III Waveform Database is a companion to the MIMIC-III Clinical Database, which contains detailed clinical information about most of the patients represented in the Waveform Database [1]. Since the contents of each database were collected independently, in partially deidentified form, matching the clinical data with the waveform data is a non-trivial task, and only a subset of Waveform Database records has been matched with Clinical Database records. See the MIMIC-III Waveform Database Matched Subset for more information.


Methods

Unlike the original MIMIC Database, waveforms were collected in a largely automated fashion, from all of the bedside monitors in certain adult and neonatal ICUs. Not all of the ICUs in the hospital were included, and the data archiving process did not run continuously, but while it was running, all waveforms from those ICUs were captured and archived. As a result, these records represent a random sample of patients in those specific ICUs.

Recorded waveforms and numerics vary depending on choices made by the ICU staff. Waveforms almost always include one or more ECG signals, and often include continuous arterial blood pressure (ABP) waveforms, fingertip photoplethysmogram (PPG) signals, and respiration, with additional waveforms (up to 8 simultaneously) as available. Numerics typically include heart and respiration rates, SpO2, and systolic, mean, and diastolic blood pressure, together with others as available. Recording lengths also vary; most are a few days in duration, but some are shorter and others are several weeks long.

The project was approved by the Institutional Review Boards of Beth Israel Deaconess Medical Center (Boston, MA) and the Massachusetts Institute of Technology (Cambridge, MA). Requirement for individual patient consent was waived because the project did not impact clinical care and all protected health information was deidentified.


Data Description

Each recording comprises two records (a waveform record and a matching numerics record) in a single record directory (“folder”) with the name of the record. To reduce access time, the record directories have been distributed among ten intermediate-level directories (listed below). The names of these intermediate directories (30, 31, ..., 39) match the first two digits of the record directories they contain.
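The directory layout described above can be sketched as a small path helper. This is an illustration only: the names `intermediate_dir` and `record_dir` are hypothetical (they are not part of any WFDB tool), and the sketch assumes that record names are seven-digit strings beginning with 3.

```python
import re

# Hypothetical helpers; the names and the validation rule are assumptions
# made for illustration, not part of the WFDB Software Package.
RECORD_NAME = re.compile(r"3\d{6}")


def intermediate_dir(record):
    """Return the intermediate directory (the first two digits of the name)."""
    if not RECORD_NAME.fullmatch(record):
        raise ValueError(f"unexpected record name: {record!r}")
    return record[:2]


def record_dir(record):
    """Return the relative path of the record directory."""
    return f"{intermediate_dir(record)}/{record}"


# The ten intermediate directories named in the text: 30, 31, ..., 39.
INTERMEDIATE_DIRS = [str(n) for n in range(30, 40)]

print(record_dir("3141595"))
```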

In almost all cases, the waveform records comprise multiple segments, each of which can be read as a separate record. Each segment contains an uninterrupted recording of a set of simultaneously observed signals, and the signal gains do not change at any time during the segment. Whenever the ICU staff changed the signals being monitored or adjusted the amplitude of a signal being monitored, this event was recorded in the raw data dump, and a new segment begins at that time.
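The segmentation rule can be illustrated with a toy example. The sketch below is not the extraction software used to build the database; it only shows the idea of starting a new segment whenever the set of signals or any gain changes. The `blocks` data and the function name `split_into_segments` are invented for illustration.

```python
from itertools import groupby

# Each entry stands for one (hypothetical) block of monitor output: the signals
# being recorded and their gains. All values are made up for illustration.
blocks = [
    {"signals": ("II", "PLETH"), "gains": (100.0, 50.0)},
    {"signals": ("II", "PLETH"), "gains": (100.0, 50.0)},
    {"signals": ("II", "PLETH"), "gains": (200.0, 50.0)},  # gain adjusted
    {"signals": ("II", "PLETH", "RESP"), "gains": (200.0, 50.0, 10.0)},  # signal added
    {"signals": ("II", "PLETH", "RESP"), "gains": (200.0, 50.0, 10.0)},
]


def split_into_segments(blocks):
    """Group consecutive blocks that share the same signals and gains."""
    def config(block):
        return (block["signals"], block["gains"])

    return [list(group) for _, group in groupby(blocks, key=config)]


segments = split_into_segments(blocks)
print([len(segment) for segment in segments])
```

Within each resulting group, the signal list and the gains are constant, which is the property that lets every segment be read as a separate record.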

Each composite waveform record includes a list of the segments that comprise it in its master header file. The list begins on the second line of the master header with a layout header file that specifies all of the signals that are observed in any segment belonging to the record. Each segment has its own header file and (except for the layout header) a matching (binary) signal (.dat) file. Occasionally, the monitor may be disconnected entirely for a short time; these intervals are recorded as gaps in the master header file, but there are no header or signal files corresponding to gaps.
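As a rough illustration of how a master header might be read, the sketch below parses a synthetic header. It assumes the multi-segment layout documented in header(5): a first line of the form `name/segments signals frequency samples`, followed by one `segment_name samples` line per segment, with `~` standing for a gap and `#` introducing a comment. The header text, the record name 3999999, and the function `parse_master_header` are invented for this example; consult header(5) for the authoritative format.

```python
# Synthetic master header, invented for illustration (not a real record).
MASTER_HEADER = """\
3999999/4 3 125 6000
3999999_layout 0
3999999_0001 2500
~ 1000
3999999_0002 2500
# Location: nicu
"""


def parse_master_header(text):
    """Parse the record line, segment list, and comments of a master header."""
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    comments = [ln[1:].strip() for ln in lines if ln.startswith("#")]
    body = [ln for ln in lines if not ln.startswith("#")]

    name_part, n_sig, fs, n_samples = body[0].split()[:4]
    name, n_segments = name_part.split("/")

    segments = []
    for ln in body[1:]:
        seg_name, length = ln.split()[:2]
        if seg_name == "~":
            kind = "gap"  # no header or signal file exists for a gap
        elif seg_name.endswith("_layout"):
            kind = "layout"  # header only, no .dat file
        else:
            kind = "data"
        segments.append({"name": seg_name, "samples": int(length), "kind": kind})

    return {
        "name": name,
        "declared_segments": int(n_segments),
        "signals": int(n_sig),
        "fs": float(fs),
        "samples": int(n_samples),
        "segments": segments,
        "comments": comments,
    }


header = parse_master_header(MASTER_HEADER)
gap_samples = sum(s["samples"] for s in header["segments"] if s["kind"] == "gap")
print(header["name"], gap_samples / header["fs"], "seconds of gaps")
```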

The numerics records (designated by the letter n appended to the record name) are not divided into segments, since doing so would save relatively little storage.

Physiologic waveform records in this database contain up to eight simultaneously recorded signals digitized at 125 Hz with 8-, 10-, or (occasionally) 12-bit resolution. Numerics records typically contain 10 or more time series of vital signs sampled once per second or once per minute.

Technical Limitations

Waveforms or numerics missing:
Occasionally, technical limitations of the data acquisition system make it possible to create a physiologic waveform record but not a numerics record, or vice versa.
A given signal may not be available throughout an entire record:
Records in the MIMIC-III Waveform Database vary in length; some are several weeks in duration. It is common for the physiologic signals to be interrupted or changed occasionally during recordings of such long duration. When using a viewer such as LightWAVE, all signals available at any time during a record are listed, although in most cases only a subset is visible at any given time.
Gaps and patient identification:
The waveform and numerics records have been extracted from raw data dumps collected from the bedside monitors using a facility provided by the monitor manufacturer. The raw data dumps contain files of data collected from a single patient monitor during a single monitoring session (which may last days or weeks). Usually the monitoring session ends when the patient is discharged, so that the data in a single file come from a single patient. Occasionally, however, the monitor is not reset when the patient is discharged, and the session continues after a new patient has been admitted; in this case the raw data file contains data from two (or more) patients, with a gap (an interval during which no waveforms or numerics are recorded) that is typically an hour or more in duration. Such gaps may also appear if the monitor is temporarily disconnected (for example, for a laboratory test) and then reconnected to the same patient. Since the raw data files do not usually contain patient identifiers, it is not trivial to determine with certainty if the data before and after a gap were collected from the same patient.
Ideally, each MIMIC-III Waveform Database record should contain data from only one patient. All raw data files containing gaps of an hour or more have been split into separate records in order to decrease the likelihood that any record contains data from multiple patients. An ongoing project is to examine the sets of records created this way, matching them with MIMIC-III Clinical Database records when possible, to determine if and how they should be reassembled.
Inter-waveform alignment problems:
The method used for MIMIC waveform data extraction was not designed for inter-waveform analysis. The waveform data contain unspecified or unknown filtering delays and inter-channel delays, which may not be constant within a given record. Therefore, although the ECGs are time-aligned with each other, there may be a (changing) delay of up to 500 ms between any of the other waveforms in the data. For example, the pulse transit time measured between different waveforms may be unreliable (either in absolute or relative terms).
ECG limitations:
The ECG signals in the waveform records were originally sampled with 12-bit precision at a high sampling rate, and were then scaled and decimated to 500 samples per second (per signal). The scaling reduced the effective amplitude resolution from 12 bits to 9 or 10 bits in typical cases, and as little as 7 bits in some cases. From each set of 4 consecutive decimated samples of the same ECG signal, one was recorded (chosen using a turning-point compressor, a technique sometimes called “peak-picking”). The result is an ECG signal sampled 125 times per second, but at intervals that vary between 2 and 14 ms (averaging 8 ms). Since the interval between any given pair of samples was not available to us, the reconstructions of the ECG signals assume uniform 8 ms intervals. These signals with reduced time and amplitude resolution, and sampling jitter introduced by the “peak-picking”, were the only ECG signals that were possible to capture from the ICU monitors. Although ECGs reconstructed in this way can be readily interpreted visually, they are unsuitable as input for certain algorithms for ECG analysis, particularly those that are sensitive to frequency-domain features of the signal. Note that these limitations apply only to the ECG signals, not to the other signals, which were originally sampled at uniform 8 ms intervals (125 samples per second) and were not scaled prior to capture.
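The 2 ms and 14 ms bounds quoted for the ECG sampling intervals follow from the arithmetic of keeping one sample out of every four at 500 samples per second. The sketch below enumerates every combination of kept positions in two consecutive groups. It does not implement the turning-point compressor itself, and the constant names are chosen for illustration.

```python
from itertools import product

DECIMATED_INTERVAL_MS = 2  # spacing at 500 samples per second
GROUP_SIZE = 4             # one sample is kept from each group of 4

# prev and nxt are the positions (0 to 3) of the kept sample in two
# consecutive groups; the spacing between them is counted in 2 ms steps.
intervals = [
    (GROUP_SIZE + nxt - prev) * DECIMATED_INTERVAL_MS
    for prev, nxt in product(range(GROUP_SIZE), repeat=2)
]

shortest = min(intervals)                  # last of one group, first of the next
longest = max(intervals)                   # first of one group, last of the next
mean = sum(intervals) / len(intervals)     # mean over all position pairs
nominal_rate_hz = 1000 / (GROUP_SIZE * DECIMATED_INTERVAL_MS)

print(shortest, longest, mean, nominal_rate_hz)
```

Because the reconstructed signals assume a uniform 8 ms spacing, any single sample may be displaced from its true time by several milliseconds, which is the sampling jitter referred to above.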

Usage Notes

The following example illustrates the organization of the database:

  • Intermediate directory 31 contains all records with names that begin with 31.
  • Record directory 3141595 is contained within intermediate directory 31.
  • All files associated with physiologic waveform record 3141595 and its companion numerics record 3141595n are contained within record directory 31/3141595.
    • The first line of the master header file for waveform record 3141595 (31/3141595/3141595.hea) indicates that the record is 242353557 sample intervals (about 22 days at 125 samples per second) in duration, and that it contains 427 segments and gaps. (See header(5) in the WFDB Applications Guide for details on the format of this text file.) The first segment is named 3141595_0001, and it is 2888500 sample intervals (6 hours, 25 minutes, and 8 seconds, at 125 samples per second) in duration. At the end of the master header file, a comment (# Location: nicu) specifies the ICU in which the recording was made (the neonatal ICU, in this case).
    • The layout header file for this record (31/3141595/3141595_layout.hea) indicates that five ECG signals (I, II, III, AVR, and “V”), a respiration signal, and a PPG signal are available during portions of the record. (The five ECG signals are not all available simultaneously.)
    • The header file for the first segment of this record (31/3141595/3141595_0001.hea) shows that a PPG signal (“PLETH”), a respiration signal, and ECG leads II and AVR are available throughout this initial segment.
  • The matching numerics record is named 3141595n, and its header file (31/3141595/3141595n.hea) shows that it is 1938730 sample intervals (about 22 days at 1 sample per second) in duration, and that it contains heart rate (“HR”, which is measured from the ECG, as well as “PULSE”, measured from one or more pulsatile signals), noninvasive blood pressure (raw as well as systolic, diastolic, and mean), respiration rate, and SpO2.
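The durations quoted in this example can be checked by dividing each count of sample intervals by the corresponding sampling frequency. The helper name `duration` below is chosen for illustration.

```python
from datetime import timedelta


def duration(sample_intervals, fs):
    """Convert a count of sample intervals at fs samples per second to a duration."""
    return timedelta(seconds=sample_intervals / fs)


waveform = duration(242353557, 125)     # master header of record 3141595
first_segment = duration(2888500, 125)  # segment 3141595_0001
numerics = duration(1938730, 1)         # numerics record 3141595n

for label, d in [
    ("waveform", waveform),
    ("first segment", first_segment),
    ("numerics", numerics),
]:
    print(f"{label}: {d} ({d.total_seconds() / 86400:.2f} days)")
```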

Any WFDB application can read any waveform record from this database directly from the PhysioNet web server (i.e., without downloading the record first) using a record name of the form mimic3wdb/3x/3xyyyyy/. Numerics records can be read using the longer form mimic3wdb/3x/3xyyyyy/3xyyyyyn (note that the final 3xyyyyy must be repeated and followed by n to specify the numerics record).

For example, if you have installed the WFDB Software Package, you can read the first 10 seconds of waveform record 3141595 using this rdsamp command:

rdsamp -r mimic3wdb/31/3141595/ -p -v -t 10

To read the first 10 seconds of the matching numerics record 3141595n, use this command instead:

rdsamp -r mimic3wdb/31/3141595/3141595n -p -v -t 10

Notice that the first command produces 1250 samples of each waveform (125 samples per second, for 10 seconds), but the second command produces only 10 samples of each vital sign (1 sample per second, for 10 seconds).
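The two record-name forms and the expected sample counts can also be generated programmatically. The sketch below only builds the argument lists for the rdsamp commands shown above. The function names are hypothetical, and running the commands (for example with `subprocess.run`) assumes that the WFDB Software Package is installed and that the PhysioNet server is reachable.

```python
import shlex


def wfdb_record_name(record, numerics=False):
    """Build the record name used to read directly from the PhysioNet server."""
    base = f"mimic3wdb/{record[:2]}/{record}/"
    # The numerics record repeats the record name and appends "n".
    return base + (f"{record}n" if numerics else "")


def rdsamp_command(record, seconds, numerics=False):
    """Return the argument list for the rdsamp invocation shown in the text."""
    name = wfdb_record_name(record, numerics)
    return ["rdsamp", "-r", name, "-p", "-v", "-t", str(seconds)]


def expected_samples(fs, seconds):
    """Number of samples per signal at fs samples per second."""
    return int(fs * seconds)


print(shlex.join(rdsamp_command("3141595", 10)))
print(shlex.join(rdsamp_command("3141595", 10, numerics=True)))
print(expected_samples(125, 10), expected_samples(1, 10))
```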


Release Notes

Version 1.0 of the MIMIC-III Waveform Database supersedes previously released versions of the MIMIC-II Waveform Database. The numbered records (3000003 to 3999988) are identical to those in version 3.2 of the MIMIC-II Waveform Database. The Matched Subset, however, uses different subject IDs and surrogate dates, corresponding to version 1.4 of the MIMIC-III Clinical Database.


Acknowledgements

We wish to thank Philips Healthcare, as well as the Beth Israel Deaconess Medical Center, for their invaluable support in making this project possible.

Many people have contributed to this project over the past 18 years, and it would be impossible to list them all. In particular, we would like to acknowledge Michael Craig, Tin Kyaw, and Mohammed Saeed, for their efforts in collecting and organizing the original MIMIC-II Waveform Database, upon which this database is based.


Conflicts of Interest

The authors have no conflicts of interest to declare.


References

  1. Johnson, A. E. W., Pollard, T. J., Shen, L., Lehman, L. H., Feng, M., Ghassemi, M., Moody, B., Szolovits, P., Celi, L. A., & Mark, R. G. (2016). MIMIC-III, a freely accessible critical care database. Scientific Data, 3, 160035. https://dx.doi.org/10.1038/sdata.2016.35

Access

Access Policy:
Anyone can access the files, as long as they conform to the terms of the specified license.

License (for files):
Open Data Commons Open Database License v1.0

Discovery

DOI (version 1.0):
https://doi.org/10.13026/c2607m

DOI (latest version):
https://doi.org/10.13026/gs83-bd50

Files

Total uncompressed size: 6.7 TB.

Folder Navigation: <base>/matched/p06
p060034
p060038
p060048
p060049
p060053
p060057
p060072
p060074
p060079
p060083
p060104
p060115
p060118
p060122
p060130
p060146
p060164
p060169
p060170
p060180
p060181
p060198
p060207
p060219
p060226
p060228
p060235
p060254
p060262
p060272
p060274
p060294
p060295
p060303
p060309
p060325
p060341
p060343
p060353
p060355
p060380
p060389
p060393
p060403
p060409
p060419
p060422
p060424
p060432
p060436
p060441
p060518
p060523
p060531
p060534
p060550
p060593
p060598
p060603
p060614
p060641
p060659
p060674
p060675
p060680
p060685
p060716
p060719
p060726
p060727
p060735
p060737
p060739
p060747
p060749
p060753
p060767
p060773
p060775
p060778
p060782
p060783
p060798
p060809
p060825
p060829
p060838
p060850
p060852
p060868
p060892
p060894
p060919
p060929
p060949
p060952
p060958
p060961
p060968
p060969
p060975
p060977
p060983
p060985
p060987
p060994
p060998
p061003
p061012
p061024
p061030
p061041
p061053
p061073
p061095
p061106
p061118
p061119
p061121
p061132
p061144
p061149
p061150
p061156
p061157
p061170
p061182
p061187
p061195
p061196
p061201
p061207
p061213
p061223
p061248
p061282
p061295
p061296
p061301
p061316
p061378
p061390
p061406
p061434
p061441
p061458
p061472
p061492
p061496
p061516
p061519
p061522
p061538
p061551
p061568
p061569
p061574
p061588
p061590
p061591
p061597
p061619
p061620
p061622
p061624
p061630
p061642
p061648
p061656
p061658
p061663
p061667
p061685
p061729
p061738
p061742
p061748
p061751
p061764
p061771
p061787
p061802
p061809
p061823
p061828
p061833
p061835
p061846
p061856
p061876
p061877
p061898
p061913
p061924
p061928
p061932
p061943
p061949
p061956
p061959
p061980
p061984
p061991
p062004
p062032
p062033
p062035
p062061
p062087
p062114
p062126
p062160
p062183
p062186
p062194
p062212
p062215
p062227
p062232
p062239
p062248
p062254
p062259
p062261
p062268
p062284
p062295
p062298
p062305
p062323
p062346
p062380
p062389
p062393
p062395
p062415
p062416
p062430
p062438
p062447
p062457
p062466
p062478
p062497
p062512
p062514
p062515
p062522
p062528
p062530
p062538
p062539
p062543
p062561
p062562
p062564
p062571
p062586
p062603
p062606
p062608
p062613
p062622
p062637
p062641
p062646
p062647
p062650
p062664
p062674
p062681
p062684
p062689
p062693
p062699
p062715
p062717
p062721
p062722
p062731
p062734
p062735
p062750
p062751
p062765
p062782
p062791
p062795
p062798
p062824
p062829
p062833
p062835
p062841
p062854
p062884
p062913
p062917
p062919
p062926
p062937
p062950
p062965
p062974
p062980
p063003
p063017
p063024
p063028
p063039
p063053
p063059
p063063
p063073
p063074
p063076
p063107
p063113
p063116
p063130
p063131
p063133
p063135
p063138
p063145
p063173
p063187
p063192
p063201
p063219
p063220
p063226
p063236
p063237
p063238
p063245
p063249
p063272
p063290
p063320
p063327
p063359
p063364
p063368
p063372
p063383
p063402
p063403
p063404
p063431
p063437
p063444
p063447
p063453
p063456
p063461
p063467
p063471
p063482
p063486
p063489
p063492
p063496
p063499
p063507
p063512
p063525
p063531
p063541
p063544
p063552
p063559
p063563
p063571
p063579
p063582
p063598
p063599
p063601
p063621
p063628
p063639
p063646
p063659
p063660
p063669
p063683
p063700
p063701
p063706
p063710
p063721
p063733
p063741
p063750
p063755
p063756
p063757
p063762
p063764
p063771
p063773
p063785
p063792
p063834
p063865
p063878
p063890
p063898
p063899
p063912
p063922
p063925
p063934
p063936
p063938
p063944
p063952
p063958
p063961
p063987
p063992
p063999
p064004
p064008
p064024
p064025
p064026
p064027
p064047
p064067
p064082
p064083
p064089
p064099
p064100
p064101
p064112
p064120
p064136
p064137
p064145
p064150
p064160
p064188
p064191
p064192
p064194
p064195
p064206
p064216
p064230
p064238
p064253
p064260
p064277
p064280
p064282
p064287
p064296
p064297
p064298
p064332
p064334
p064336
p064361
p064368
p064374
p064377
p064384
p064411
p064421
p064450
p064459
p064465
p064472
p064485
p064491
p064502
p064519
p064523
p064524
p064538
p064550
p064557
p064558
p064601
p064621
p064652
p064663
p064666
p064673
p064687
p064700
p064701
p064717
p064719
p064721
p064741
p064752
p064771
p064772
p064785
p064798
p064830
p064833
p064839
p064845
p064846
p064873
p064897
p064906
p064916
p064925
p064927
p064935
p064944
p064952
p064965
p064967
p064969
p064970
p064988
p064992
p064994
p064996
p065003
p065006
p065013
p065033
p065049
p065050
p065052
p065055
p065057
p065072
p065097
p065107
p065112
p065144
p065147
p065161
p065175
p065176
p065180
p065187
p065192
p065217
p065240
p065247
p065256
p065263
p065267
p065268
p065310
p065342
p065358
p065370
p065393
p065401
p065404
p065411
p065421
p065425
p065431
p065442
p065444
p065448
p065476
p065481
p065484
p065490
p065513
p065515
p065516
p065527
p065535
p065537
p065560
p065565
p065570
p065575
p065594
p065599
p065604
p065611
p065634
p065636
p065645
p065652
p065656
p065659
p065665
p065675
p065689
p065703
p065710
p065719
p065728
p065732
p065743
p065753
p065759
p065760
p065779
p065787
p065793
p065810
p065824
p065833
p065835
p065837
p065843
p065849
p065854
p065861
p065871
p065879
p065890
p065893
p065895
p065900
p065915
p065925
p065946
p065951
p065956
p065970
p065979
p065999
p066031
p066033
p066037
p066043
p066046
p066055
p066058
p066061
p066063
p066067
p066068
p066072
p066083
p066093
p066105
p066110
p066152
p066157
p066170
p066172
p066184
p066189
p066200
p066206
p066208
p066213
p066217
p066228
p066232
p066244
p066264
p066280
p066288
p066298
p066310
p066311
p066320
p066322
p066326
p066338
p066346
p066365
p066383
p066384
p066386
p066405
p066412
p066419
p066424
p066437
p066463
p066473
p066474
p066479
p066481
p066483
p066496
p066504
p066505
p066507
p066530
p066534
p066542
p066556
p066559
p066560
p066566
p066571
p066572
p066580
p066595
p066598
p066603
p066614
p066643
p066654
p066677
p066682
p066686
p066692
p066696
p066706
p066710
p066717
p066720
p066727
p066753
p066756
p066765
p066770
p066772
p066786
p066796
p066804
p066807
p066811
p066818
p066822
p066823
p066825
p066831
p066851
p066859
p066871
p066878
p066880
p066892
p066894
p066898
p066899
p066903
p066910
p066919
p066936
p066955
p066957
p066965
p066974
p066986
p067005
p067017
p067042
p067050
p067051
p067054
p067058
p067072
p067087
p067101
p067111
p067112
p067140
p067149
p067154
p067158
p067172
p067213
p067222
p067223
p067227
p067239
p067245
p067248
p067256
p067283
p067284
p067301
p067316
p067321
p067323
p067341
p067343
p067344
p067347
p067348
p067358
p067367
p067377
p067380
p067384
p067386
p067388
p067413
p067415
p067418
p067429
p067442
p067446
p067449
p067451
p067461
p067468
p067475
p067477
p067511
p067533
p067546
p067583
p067589
p067617
p067619
p067620
p067625
p067633
p067636
p067639
p067651
p067653
p067659
p067675
p067680
p067684
p067687
p067689
p067696
p067710
p067711
p067715
p067717
p067718
p067722
p067735
p067740
p067744
p067747
p067758
p067761
p067763
p067771
p067772
p067774
p067788
p067796
p067798
p067803
p067813
p067821
p067831
p067853
p067856
p067858
p067877
p067903
p067912
p067924
p067931
p067939
p067956
p067976
p067996
p068001
p068003
p068007
p068024
p068036
p068059
p068065
p068068
p068075
p068094
p068095
p068099
p068116
p068123
p068126
p068127
p068128
p068172
p068174
p068228
p068231
p068242
p068244
p068251
p068268
p068285
p068296
p068297
p068299
p068300
p068307
p068344
p068356
p068375
p068385
p068391
p068396
p068401
p068422
p068425
p068426
p068435
p068439
p068450
p068453
p068457
p068464
p068475
p068505
p068515
p068526
p068527
p068531
p068533
p068534
p068542
p068543
p068564
p068589
p068605
p068607
p068611
p068621
p068623
p068624
p068629
p068645
p068651
p068655
p068663
p068673
p068676
p068685
p068690
p068703
p068710
p068720
p068724
p068728
p068736
p068738
p068744
p068780
p068785
p068789
p068797
p068806
p068807
p068808
p068812
p068824
p068836
p068857
p068860
p068870
p068875
p068890
p068892
p068902
p068905
p068907
p068909
p068911
p068915
p068916
p068919
p068922
p068946
p068947
p068949
p068956
p068962
p068964
p068965
p068980
p068989
p068991
p069000
p069006
p069011
p069020
p069027
p069039
p069047
p069052
p069074
p069081
p069082
p069088
p069093
p069108
p069118
p069138
p069144
p069146
p069157
p069162
p069169
p069170
p069182
p069194
p069201
p069215
p069219
p069221
p069233
p069250
p069251
p069265
p069268
p069270
p069272
p069274
p069293
p069296
p069339
p069343
p069344
p069351
p069352
p069354
p069367
p069370
p069371
p069388
p069395
p069396
p069407
p069411
p069434
p069435
p069442
p069447
p069450
p069464
p069465
p069483
p069484
p069498
p069501
p069512
p069518
p069522
p069531
p069537
p069559
p069563
p069577
p069578
p069579
p069586
p069591
p069596
p069598
p069615
p069620
p069624
p069626
p069639
p069650
p069651
p069654
p069675
p069676
p069679
p069681
p069684
p069693
p069694
p069696
p069709
p069746
p069763
p069764
p069776
p069778
p069786
p069799
p069857
p069871
p069890
p069891
p069903
p069905
p069925
p069943
p069956