
MIMIC-III Waveform Database

Benjamin Moody, George Moody, Mauricio Villarroel, Gari D. Clifford, Ikaro Silva

Published: April 7, 2020. Version: 1.0


When using this resource, please cite:
Moody, B., Moody, G., Villarroel, M., Clifford, G. D., & Silva, I. (2020). MIMIC-III Waveform Database (version 1.0). PhysioNet. https://doi.org/10.13026/c2607m.

Additionally, please cite the original publication:

Johnson, A. E. W., Pollard, T. J., Shen, L., Lehman, L. H., Feng, M., Ghassemi, M., Moody, B., Szolovits, P., Celi, L. A., & Mark, R. G. (2016). MIMIC-III, a freely accessible critical care database. Scientific Data, 3, 160035.

Please include the standard citation for PhysioNet:
Goldberger, A., Amaral, L., Glass, L., Hausdorff, J., Ivanov, P. C., Mark, R., ... & Stanley, H. E. (2000). PhysioBank, PhysioToolkit, and PhysioNet: Components of a new research resource for complex physiologic signals. Circulation [Online]. 101 (23), pp. e215–e220.

Abstract

The MIMIC-III Waveform Database contains 67,830 record sets for approximately 30,000 ICU patients. Almost all record sets include a waveform record containing digitized signals (typically including ECG, ABP, respiration, and PPG, and frequently other signals) and a “numerics” record containing time series of periodic measurements, each presenting a quasi-continuous recording of vital signs of a single patient throughout an ICU stay (typically a few days, but many are several weeks in duration). A subset of this database contains waveform and numerics records that have been matched and time-aligned with MIMIC-III Clinical Database records.


Background

The MIMIC-III Waveform Database contains thousands of recordings of multiple physiologic signals (“waveforms”) and time series of vital signs (“numerics”) collected from bedside patient monitors in adult and neonatal intensive care units (ICUs).

The MIMIC-III Waveform Database is a companion to the MIMIC-III Clinical Database, which contains detailed clinical information about most of the patients represented in the Waveform Database [1]. Since the contents of each database were collected independently, in partially deidentified form, matching the clinical data with the waveform data is a non-trivial task, and only a subset of Waveform Database records has been matched with Clinical Database records. See the MIMIC-III Waveform Database Matched Subset for more information.


Methods

Unlike the original MIMIC Database, waveforms were collected in a largely automated fashion, from all of the bedside monitors in certain adult and neonatal ICUs. Not all of the ICUs in the hospital were included, and the data archiving process did not run continuously, but while it was running, all waveforms from those ICUs were captured and archived. As a result, the records were not individually selected: they cover whichever patients were monitored in those specific ICUs while archiving was active, which is not necessarily a statistically random sample.

Recorded waveforms and numerics vary depending on choices made by the ICU staff. Waveforms almost always include one or more ECG signals, and often include continuous arterial blood pressure (ABP) waveforms, fingertip photoplethysmogram (PPG) signals, and respiration, with additional waveforms (up to 8 simultaneously) as available. Numerics typically include heart and respiration rates, SpO2, and systolic, mean, and diastolic blood pressure, together with others as available. Recording lengths also vary; most are a few days in duration, but some are shorter and others are several weeks long.

The project was approved by the Institutional Review Boards of Beth Israel Deaconess Medical Center (Boston, MA) and the Massachusetts Institute of Technology (Cambridge, MA). Requirement for individual patient consent was waived because the project did not impact clinical care and all protected health information was deidentified.


Data Description

Each recording comprises two records (a waveform record and a matching numerics record) in a single record directory (“folder”) with the name of the record. To reduce access time, the record directories have been distributed among ten intermediate-level directories. The names of these intermediate directories (30, 31, ..., 39) match the first two digits of the record directories they contain.

In almost all cases, the waveform records comprise multiple segments, each of which can be read as a separate record. Each segment contains an uninterrupted recording of a set of simultaneously observed signals, and the signal gains do not change at any time during the segment. Whenever the ICU staff changed the signals being monitored or adjusted the amplitude of a signal being monitored, this event was recorded in the raw data dump, and a new segment begins at that time.

Each composite waveform record includes a list of the segments that comprise it in its master header file. The list begins on the second line of the master header with a layout header file that specifies all of the signals that are observed in any segment belonging to the record. Each segment has its own header file and (except for the layout header) a matching (binary) signal (.dat) file. Occasionally, the monitor may be disconnected entirely for a short time; these intervals are recorded as gaps in the master header file, but there are no header or signal files corresponding to gaps.
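The structure of a master header can be illustrated with a small parser. This is a minimal sketch, not the WFDB library's own reader. It assumes the multi-segment layout described in header(5): a first line of the form `name/segments signals frequency samples`, followed by one `segment-name length` line per segment, with a layout segment of length 0 and gaps written with the name `~`. The record name 3000000 and all numbers in `EXAMPLE` are invented for illustration.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    name: str    # segment record name, or "~" for a gap
    length: int  # duration in sample intervals
    kind: str    # "layout", "gap", or "data"


def parse_master_header(text):
    """Return (record info, segment list) for a multi-segment master header."""
    lines = [ln.strip() for ln in text.splitlines()]
    # Ignore blank lines and '#' comment lines such as "# Location: nicu".
    lines = [ln for ln in lines if ln and not ln.startswith("#")]

    record_field, n_signals, fs, n_samples = lines[0].split()[:4]
    record_name, n_segments = record_field.split("/")
    info = {
        "name": record_name,
        "n_segments": int(n_segments),
        "n_signals": int(n_signals),
        "fs": float(fs),
        "n_samples": int(n_samples),
    }

    segments = []
    for ln in lines[1:]:
        name, length = ln.split()[:2]
        if name == "~":
            kind = "gap"      # no header or signal file exists for a gap
        elif name.endswith("_layout"):
            kind = "layout"   # header only, no .dat file
        else:
            kind = "data"     # has its own header and .dat file
        segments.append(Segment(name, int(length), kind))
    return info, segments


# Invented example header (hypothetical record, not taken from the database).
EXAMPLE = """\
3000000/4 2 125 1000
3000000_layout 0
3000000_0001 600
~ 150
3000000_0002 250
# Location: nicu
"""

info, segments = parse_master_header(EXAMPLE)
for seg in segments:
    print(seg.kind, seg.name, seg.length)
```

In this toy header the segment lengths, including the gap, add up to the record length given on the first line.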

The numerics records (designated by the letter n appended to the record name) are not divided into segments, since doing so would save relatively little storage.

Physiologic waveform records in this database contain up to eight simultaneously recorded signals digitized at 125 Hz with 8-, 10-, or (occasionally) 12-bit resolution. Numerics records typically contain 10 or more time series of vital signs sampled once per second or once per minute.

Technical Limitations

Waveforms or numerics missing:
Occasionally, technical limitations of the data acquisition system make it possible to create a physiologic waveform record but not a numerics record, or vice versa.
A given signal may not be available throughout an entire record:
Records in the MIMIC-III Waveform Database vary in length; some are several weeks in duration. It is common for the physiologic signals to be interrupted or changed occasionally during recordings of such long duration. When using a viewer such as LightWAVE, all signals available at any time during a record are listed, although in most cases only a subset is visible at any given time.
Gaps and patient identification:
The waveform and numerics records have been extracted from raw data dumps collected from the bedside monitors using a facility provided by the monitor manufacturer. The raw data dumps contain files of data collected from a single patient monitor during a single monitoring session (which may last days or weeks). Usually the monitoring session ends when the patient is discharged, so that the data in a single file come from a single patient. Occasionally, however, the monitor is not reset when the patient is discharged, and the session continues after a new patient has been admitted; in this case the raw data file contains data from two (or more) patients, with a gap (an interval during which no waveforms or numerics are recorded) that is typically an hour or more in duration. Such gaps may also appear if the monitor is temporarily disconnected (for example, for a laboratory test) and then reconnected to the same patient. Since the raw data files do not usually contain patient identifiers, it is not trivial to determine with certainty if the data before and after a gap were collected from the same patient.
Ideally, each MIMIC-III Waveform Database record should contain data from only one patient. All raw data files containing gaps of an hour or more have been split into separate records in order to decrease the likelihood that any record contains data from multiple patients. An ongoing project is to examine the sets of records created this way, matching them with MIMIC-III Clinical Database records when possible, to determine if and how they should be reassembled.
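The splitting rule described above can be sketched as follows. This is only an illustration of the "gap of an hour or more" criterion, not the extraction code used to build the database. The function name `split_at_gaps` and the representation of a raw file as a list of `(start, end)` times in seconds are assumptions made for the example.

```python
def split_at_gaps(intervals, min_gap_s=3600):
    """Group (start_s, end_s) recording intervals into separate records.

    A new record is started whenever the gap since the end of the previous
    interval is min_gap_s seconds or longer.
    """
    records = []
    for start, end in sorted(intervals):
        if records and start - records[-1][-1][1] < min_gap_s:
            # Short interruption: keep the data in the current record.
            records[-1].append((start, end))
        else:
            # First interval, or a gap of an hour or more: start a new record.
            records.append([(start, end)])
    return records


# Hypothetical monitoring session: a 5-minute gap, then a 1-hour gap.
session = [(0, 7200), (7500, 9000), (12600, 20000)]
for number, record in enumerate(split_at_gaps(session), start=1):
    print(number, record)
```

With these made-up times, the 5-minute interruption stays inside the first record and the one-hour gap starts a second record.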
Inter-waveform alignment problems:
The method used for MIMIC waveform data extraction was not designed for inter-waveform analysis. The waveform data contain unspecified/unknown filtering delays and/or unknown inter-channel delays, which may not be constant in a given record. Therefore, although the ECGs are time-aligned with each other, there may be a (changing) delay of up to 500 ms between any of the other waveforms in the data. For example, the pulse transit time measured between different waveforms may be unreliable (either in absolute or relative terms).
ECG limitations:
The ECG signals in the waveform records were originally sampled with 12-bit precision at a high sampling rate, and were then scaled and decimated to 500 samples per second (per signal). The scaling reduced the effective amplitude resolution from 12 bits to 9 or 10 bits in typical cases, and as little as 7 bits in some cases. From each set of 4 consecutive decimated samples of the same ECG signal, one was recorded (chosen using a turning-point compressor, a technique sometimes called “peak-picking”). The result is an ECG signal sampled 125 times per second, but at intervals that vary between 2 and 14 ms (averaging 8 ms). Since the interval between any given pair of samples was not available to us, the reconstructions of the ECG signals assume uniform 8 ms intervals. These signals with reduced time and amplitude resolution, and sampling jitter introduced by the “peak-picking”, were the only ECG signals that were possible to capture from the ICU monitors. Although ECGs reconstructed in this way can be readily interpreted visually, they are unsuitable as input for certain algorithms for ECG analysis, particularly those that are sensitive to frequency-domain features of the signal. Note that these limitations apply only to the ECG signals, not to the other signals, which were originally sampled at uniform 8 ms intervals (125 samples per second) and were not scaled prior to capture.
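The 2 to 14 ms range quoted above follows directly from keeping one sample out of each block of four at 500 samples per second. The short calculation below enumerates every combination of kept positions in two adjacent blocks. It illustrates only the timing arithmetic, not the monitor's turning-point compressor itself, and the mean it prints is the unweighted mean over all combinations.

```python
from itertools import product

DECIMATED_RATE_HZ = 500               # ECG rate before peak-picking
BLOCK = 4                             # one sample is kept from each block of 4
STEP_MS = 1000 / DECIMATED_RATE_HZ    # spacing of the 500 Hz samples: 2 ms


def kept_sample_intervals_ms():
    """Possible spacings between samples kept from two adjacent blocks.

    i is the position (0-3) kept in the first block and j the position kept
    in the next block, so the two kept samples are (BLOCK + j - i) steps apart.
    """
    return sorted(
        (BLOCK + j - i) * STEP_MS
        for i, j in product(range(BLOCK), repeat=2)
    )


intervals = kept_sample_intervals_ms()
print(min(intervals), max(intervals), sum(intervals) / len(intervals))
# prints: 2.0 14.0 8.0
```

Because the true spacing of each pair is unknown, the reconstructed signal places every sample on the uniform 8 ms grid, which is the source of the sampling jitter.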

Usage Notes

The following example illustrates the organization of the database:

  • Intermediate directory 31 contains all records with names that begin with 31.
  • Record directory 3141595 is contained within intermediate directory 31.
  • All files associated with physiologic waveform record 3141595 and its companion numerics record 3141595n are contained within record directory 31/3141595.
    • The first line of the master header file for waveform record 3141595 (31/3141595/3141595.hea) indicates that the record is 242353557 sample intervals (about 22 days at 125 samples per second) in duration, and that it contains 427 segments and gaps. (See header(5) in the WFDB Applications Guide for details on the format of this text file.) The first segment is named 3141595_0001, and it is 2888500 sample intervals (6 hours, 25 minutes, and 8 seconds, at 125 samples per second) in duration. At the end of the master header file, a comment (# Location: nicu) specifies the ICU in which the recording was made (the neonatal ICU, in this case).
    • The layout header file for this record (31/3141595/3141595_layout.hea) indicates that five ECG signals (I, II, III, AVR, and “V”), a respiration signal, and a PPG signal are available during portions of the record. (The five ECG signals are not all available simultaneously.)
    • The header file for the first segment of this record (31/3141595/3141595_0001.hea) shows that a PPG signal (“PLETH”), a respiration signal, and ECG leads II and AVR are available throughout this initial segment.
  • The matching numerics record is named 3141595n, and its header file (31/3141595/3141595n.hea) shows that it is 1938730 sample intervals (about 22 days at 1 sample per second) in duration, and that it contains heart rate (“HR”, which is measured from the ECG, as well as “PULSE”, measured from one or more pulsatile signals), noninvasive blood pressure (raw as well as systolic, diastolic, and mean), respiration rate, and SpO2.
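The durations in this example are obtained by dividing a length in sample intervals by the sampling frequency. A minimal helper for that conversion is sketched below; the function name `to_dhms` is an assumption made for the example, and fractional seconds are discarded.

```python
def to_dhms(n_samples, fs_hz):
    """Convert a length in sample intervals to (days, hours, minutes, seconds).

    fs_hz is the sampling frequency in samples per second; any fraction of
    a second is discarded.
    """
    total_s = n_samples // fs_hz
    days, rest = divmod(total_s, 86400)
    hours, rest = divmod(rest, 3600)
    minutes, seconds = divmod(rest, 60)
    return days, hours, minutes, seconds


# Lengths quoted in the example above.
print(to_dhms(242353557, 125))  # whole waveform record, 125 samples per second
print(to_dhms(2888500, 125))    # first waveform segment
print(to_dhms(1938730, 1))      # numerics record, 1 sample per second
```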

Any WFDB application can read any waveform record from this database directly from the PhysioNet web server (i.e., without downloading the record first) using a record name of the form mimic3wdb/3x/3xyyyyy/. Numerics records can be read using the longer form mimic3wdb/3x/3xyyyyy/3xyyyyyn (note that the final 3xyyyyy must be repeated and followed by n to specify the numerics record).

For example, if you have installed the WFDB Software Package, you can read the first 10 seconds of waveform record 3141595 using this rdsamp command:

rdsamp -r mimic3wdb/31/3141595/ -p -v -t 10

To read the first 10 seconds of the matching numerics record 3141595n, use this command instead:

rdsamp -r mimic3wdb/31/3141595/3141595n -p -v -t 10

Notice that the first command produces 1250 samples of each waveform (125 samples per second, for 10 seconds), but the second command produces only 10 samples of each vital sign (1 sample per second, for 10 seconds).
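The record-name conventions used in these commands can be captured in a small helper. This sketch only builds the name strings and the matching rdsamp command lines shown above; it does not contact PhysioNet or call any WFDB code. The function names are assumptions made for the example, and the check for seven digits beginning with 3 is inferred from the 3xyyyyy naming pattern.

```python
def wfdb_record_names(record):
    """Return (waveform_name, numerics_name) for a record such as '3141595'."""
    record = str(record)
    if len(record) != 7 or not record.isdigit() or record[0] != "3":
        raise ValueError("expected a 7-digit record name of the form 3xyyyyy")
    # The intermediate directory is named after the first two digits.
    base = f"mimic3wdb/{record[:2]}/{record}/"
    # The numerics record repeats the record name and appends 'n'.
    return base, f"{base}{record}n"


def rdsamp_command(record_name, seconds=10):
    """Build the rdsamp command line used in the examples above."""
    return f"rdsamp -r {record_name} -p -v -t {seconds}"


waveform, numerics = wfdb_record_names("3141595")
print(rdsamp_command(waveform))
print(rdsamp_command(numerics))
```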


Release Notes

Version 1.0 of the MIMIC-III Waveform Database supersedes previously released versions of the MIMIC-II Waveform Database. The numbered records (3000003 to 3999988) are identical to those in version 3.2 of the MIMIC-II Waveform Database. The Matched Subset, however, uses different subject IDs and surrogate dates, corresponding to version 1.4 of the MIMIC-III Clinical Database.


Acknowledgements

We wish to thank Philips Healthcare, as well as the Beth Israel Deaconess Medical Center, for their invaluable support in making this project possible.

Many people have contributed to this project over the past 18 years, and it would be impossible to list them all. In particular, we would like to acknowledge Michael Craig, Tin Kyaw, and Mohammed Saeed, for their efforts in collecting and organizing the original MIMIC-II Waveform Database, upon which this database is based.


Conflicts of Interest

The authors have no conflicts of interest to declare.


References

  1. Johnson, A. E. W., Pollard, T. J., Shen, L., Lehman, L. H., Feng, M., Ghassemi, M., Moody, B., Szolovits, P., Celi, L. A., & Mark, R. G. (2016). MIMIC-III, a freely accessible critical care database. Scientific Data, 3, 160035. https://dx.doi.org/10.1038/sdata.2016.35

Access

Access Policy:
Anyone can access the files, as long as they conform to the terms of the specified license.

License (for files):
Open Data Commons Open Database License v1.0

Discovery

DOI (version 1.0):
https://doi.org/10.13026/c2607m

DOI (latest version):
https://doi.org/10.13026/gs83-bd50

Files

Total uncompressed size: 6.7 TB.
