Database Open Access

MIMIC-III Waveform Database

Benjamin Moody George Moody Mauricio Villarroel Gari D. Clifford Ikaro Silva

Published: April 7, 2020. Version: 1.0


When using this resource, please cite: (show more options)
Moody, B., Moody, G., Villarroel, M., Clifford, G. D., & Silva, I. (2020). MIMIC-III Waveform Database (version 1.0). PhysioNet. https://doi.org/10.13026/c2607m.

Additionally, please cite the original publication:

Johnson, A. E. W., Pollard, T. J., Shen, L., Lehman, L. H., Feng, M., Ghassemi, M., Moody, B., Szolovits, P., Celi, L. A., & Mark, R. G. (2016). MIMIC-III, a freely accessible critical care database. Scientific Data, 3, 160035.

Please include the standard citation for PhysioNet: (show more options)
Goldberger, A., Amaral, L., Glass, L., Hausdorff, J., Ivanov, P. C., Mark, R., ... & Stanley, H. E. (2000). PhysioBank, PhysioToolkit, and PhysioNet: Components of a new research resource for complex physiologic signals. Circulation [Online]. 101 (23), pp. e215–e220.

Abstract

The MIMIC-III Waveform Database contains 67,830 record sets for approximately 30,000 ICU patients. Almost all record sets include a waveform record containing digitized signals (typically including ECG, ABP, respiration, and PPG, and frequently other signals) and a “numerics” record containing time series of periodic measurements, each presenting a quasi-continuous recording of vital signs of a single patient throughout an ICU stay (typically a few days, but many are several weeks in duration). A subset of this database contains waveform and numerics records that have been matched and time-aligned with MIMIC-III Clinical Database records.


Background

The MIMIC-III Waveform Database contains thousands of recordings of multiple physiologic signals (“waveforms”) and time series of vital signs (“numerics”) collected from bedside patient monitors in adult and neonatal intensive care units (ICUs).

The MIMIC-III Waveform Database is a companion to the MIMIC-III Clinical Database, which contains detailed clinical information about most of the patients represented in the Waveform Database [1]. Since the contents of each database were collected independently, in partially deidentified form, matching the clinical data with the waveform data is a non-trivial task, and only a subset of Waveform Database records has been matched with Clinical Database records. See the MIMIC-III Waveform Database Matched Subset for more information.


Methods

Unlike the original MIMIC Database, waveforms were collected in a largely automated fashion, from all of the bedside monitors in certain adult and neonatal ICUs. Not all of the ICUs in the hospital were included, and the data archiving process did not run continuously, but while it was running, all waveforms from those ICUs were captured and archived. As a result, these records represent a random sample of patients in those specific ICUs.

Recorded waveforms and numerics vary depending on choices made by the ICU staff. Waveforms almost always include one or more ECG signals, and often include continuous arterial blood pressure (ABP) waveforms, fingertip photoplethysmogram (PPG) signals, and respiration, with additional waveforms (up to 8 simultaneously) as available. Numerics typically include heart and respiration rates, SpO2, and systolic, mean, and diastolic blood pressure, together with others as available. Recording lengths also vary; most are a few days in duration, but some are shorter and others are several weeks long.

The project was approved by the Institutional Review Boards of Beth Israel Deaconess Medical Center (Boston, MA) and the Massachusetts Institute of Technology (Cambridge, MA). Requirement for individual patient consent was waived because the project did not impact clinical care and all protected health information was deidentified.


Data Description

Each recording comprises two records (a waveform record and a matching numerics record) in a single record directory (“folder”) with the name of the record. To reduce access time, the record directories have been distributed among ten intermediate-level directories (listed below). The names of these intermediate directories (30, 31, ..., 39) match the first two digits of the record directories they contain.

In almost all cases, the waveform records comprise multiple segments, each of which can be read as a separate record. Each segment contains an uninterrupted recording of a set of simultaneously observed signals, and the signal gains do not change at any time during the segment. Whenever the ICU staff changed the signals being monitored or adjusted the amplitude of a signal being monitored, this event was recorded in the raw data dump, and a new segment begins at that time.

Each composite waveform record includes a list of the segments that comprise it in its master header file. The list begins on the second line of the master header with a layout header file that specifies all of the signals that are observed in any segment belonging to the record. Each segment has its own header file and (except for the layout header) a matching (binary) signal (.dat) file. Occasionally, the monitor may be disconnected entirely for a short time; these intervals are recorded as gaps in the master header file, but there are no header or signal files corresponding to gaps.

The numerics records (designated by the letter n appended to the record name) are not divided into segments, since the storage savings that would be achieved by doing so would be relatively little.

Physiologic waveform records in this database contain up to eight simultaneously recorded signals digitized at 125 Hz with 8-, 10-, or (occasionally) 12-bit resolution. Numerics records typically contain 10 or more time series of vital signs sampled once per second or once per minute.

Technical Limitations

Waveforms or numerics missing:
Occasionally, technical limitations of the data acquisition system make it possible to create a physiologic waveform record but not a numerics record, or vice versa.
A given signal may not be available throughout an entire record:
Records in the MIMIC-III Waveform Database vary in length; some are several weeks in duration. It is common for the physiologic signals to be interrupted or changed occasionally during recordings of such long duration. When using a viewer such as LightWAVE, all signals available at any time during a record are listed, although in most cases only a subset is visible at any given time.
Gaps and patient identification:
The waveform and numerics records have been extracted from raw data dumps collected from the bedside monitors using a facility provided by the monitor manufacturer. The raw data dumps contain files of data collected from a single patient monitor during a single monitoring session (which may last days or weeks). Usually the monitoring session ends when the patient is discharged, so that the data in a single file come from a single patient. Occasionally, however, the monitor is not reset when the patient is discharged, and the session continues after a new patient has been admitted; in this case the raw data file contains data from two (or more) patients, with a gap (an interval during which no waveforms or numerics are recorded) that is typically an hour or more in duration. Such gaps may also appear if the monitor is temporarily disconnected (for example, for a laboratory test) and then reconnected to the same patient. Since the raw data files do not usually contain patient identifiers, it is not trivial to determine with certainty if the data before and after a gap were collected from the same patient.
Ideally, each MIMIC-III Waveform Database record should contain data from only one patient. All raw data files containing gaps of an hour or more have been split into separate records in order to decrease the likelihood that any record contains data from multiple patients. An ongoing project is to examine the sets of records created this way, matching them with MIMIC-III Clinical Database records when possible, to determine if and how they should be reassembled.
Inter-waveform alignment problems:
The method used for MIMIC waveform data extraction was not designed for inter-waveform analysis. The waveform data contain unspecified/unknown filtering delays and/or unknown inter-channel delays, which may not be constant in a given record. Therefore, although the ECGs are time-aligned with each other, there may be a (changing) delay of up to 500ms between any of the other waveforms in the data. For example, the pulse transit time measured between different waveforms may be unreliable (either in absolute or relative terms).
ECG limitations:
The ECG signals in the waveform records were originally sampled with 12-bit precision at a high sampling rate, and were then scaled and decimated to 500 samples per second (per signal). The scaling reduced the effective amplitude resolution from 12 bits to 9 or 10 bits in typical cases, and as little as 7 bits in some cases. From each set of 4 consecutive decimated samples of the same ECG signal, one was recorded (chosen using a turning-point compressor, a technique sometimes called “peak-picking”). The result is an ECG signal sampled 125 times per second, but at intervals that vary between 2 and 14 ms (averaging 8 ms). Since the interval between any given pair of samples was not available to us, the reconstructions of the ECG signals assume uniform 8 ms intervals. These signals with reduced time and amplitude resolution, and sampling jitter introduced by the “peak-picking”, were the only ECG signals that were possible to capture from the ICU monitors. Although ECGs reconstructed in this way can be readily interpreted visually, they are unsuitable as input for certain algorithms for ECG analysis, particularly those that are sensitive to frequency-domain features of the signal. Note that these limitations apply only to the ECG signals, not to the other signals, which were originally sampled at uniform 8 ms intervals (125 samples per second) and were not scaled prior to capture.

Usage Notes

The following example illustrates the organization of the database:

  • Intermediate directory 31 contains all records with names that begin with 31.
  • Record directory 3141595 is contained within intermediate directory 31.
  • All files associated with physiologic waveform record 3141595 and its companion numerics record 3141595n are contained within record directory 31/3141595.
    • The first line of the master header file for waveform record 314595 (31/3141595/3141595.hea) indicates that the record is 242353557 sample intervals (about 22 days at 125 samples per second) in duration, and that it contains 427 segments and gaps. (See header(5) in the WFDB Applications Guide for details on the format of this text file.) The first segment is named 3141595_0001, and it is 2888500 sample intervals (6 hours, 15 minutes, and 8 seconds, at 125 samples per second) in duration. At the end of the master header file, a comment (# Location: nicu) specifies the ICU in which the recording was made (the neonatal ICU, in this case).
    • The layout header file for this record (31/3141595/3141595_layout.hea) indicates that five ECG signals (I, II, III, AVR, and “V”), a respiration signal, and a PPG signal are available during portions of the record. (The five ECG signals are not all available simultaneously.)
    • The header file for the first segment of this record (31/3141595/3141595_0001.hea) shows that a PPG signal (“PLETH”), a respiration signal, and ECG leads II and AVR are available throughout this initial segment.
  • The matching numerics record is named 3141595n, and its header file (31/3141595/3141595n.hea) shows that it is 1938730 sample intervals (about 22 days at 1 sample per second) in duration, and that it contains heart rate (“HR”, which is measured from the ECG, as well as “PULSE”, measured from one or more pulsatile signals), noninvasive blood pressure (raw as well as systolic, diastolic, and mean), respiration rate, and SpO2.

Any WFDB application can read any waveform record from this database directly from the PhysioNet web server (i.e., without downloading the record first) using a record name of the form mimic3wdb/3x/3xyyyyy/. Numerics records can be read using the longer form mimic3wdb/3x/3xyyyyy/3xyyyyyn (note that the final 3xyyyyy must be repeated and followed by n to specify the numerics record).

For example, if you have installed the WFDB Software Package, you can read the first 10 seconds of waveform record 3141595 using this rdsamp command:

rdsamp -r mimic3wdb/31/3141595/ -p -v -t 10

To read the first 10 seconds of the matching numerics record 3141595n, use this command instead:

rdsamp -r mimic3wdb/31/3141595/3141595n -p -v -t 10

Notice that the first command produces 1250 samples of each waveform (125 samples per second, for 10 seconds), but the second command produces only 10 samples of each vital sign (1 sample per second, for 10 seconds).


Release Notes

Version 1.0 of the MIMIC-III Waveform Database supersedes previously-released versions of the MIMIC-II Waveform Database. The numbered records (3000003 to 3999988) are identical to those in version 3.2 of the MIMIC-II Waveform Database. The Matched Subset, however, uses different subject IDs and surrogate dates, corresponding to version 1.4 of the MIMIC-III Clinical Database.


Acknowledgements

We wish to thank Philips Healthcare, as well as the Beth Israel Deaconess Medical Center, for their invaluable support in making this project possible.

Many people have contributed to this project over the past 18 years, and it would be impossible to list them all. In particular, we would like to acknowledge Michael Craig, Tin Kyaw, and Mohammed Saeed, for their efforts in collecting and organizing the original MIMIC-II Waveform Database, upon which this database is based.


Conflicts of Interest

The authors have no conflicts of interests to declare.


References

  1. Johnson, A. E. W., Pollard, T. J., Shen, L., Lehman, L. H., Feng, M., Ghassemi, M., Moody, B., Szolovits, P., Celi, L. A., & Mark, R. G. (2016). MIMIC-III, a freely accessible critical care database. Scientific Data, 3, 160035. https://dx.doi.org/10.1038/sdata.2016.35

Parent Projects
MIMIC-III Waveform Database was derived from: Please cite them when using this project.
Share
Access

Access Policy:
Anyone can access the files, as long as they conform to the terms of the specified license.

License (for files):
Open Data Commons Open Database License v1.0

Discovery

DOI (version 1.0):
https://doi.org/10.13026/c2607m

DOI (latest version):
https://doi.org/10.13026/gs83-bd50

Corresponding Author
You must be logged in to view the contact information.

Files

Total uncompressed size: 6.7 TB.

Access the files

Visualize waveforms

Folder Navigation: <base>/matched/p01
Name Size Modified
Parent Directory
p010013
p010023
p010030
p010042
p010045
p010049
p010061
p010069
p010075
p010077
p010083
p010086
p010096
p010124
p010134
p010152
p010186
p010188
p010205
p010209
p010241
p010246
p010247
p010257
p010277
p010289
p010305
p010315
p010317
p010320
p010337
p010342
p010384
p010391
p010419
p010423
p010425
p010428
p010432
p010433
p010434
p010455
p010464
p010475
p010485
p010487
p010510
p010513
p010525
p010532
p010534
p010552
p010564
p010581
p010595
p010600
p010604
p010611
p010618
p010629
p010635
p010638
p010651
p010652
p010653
p010655
p010656
p010667
p010674
p010679
p010686
p010689
p010694
p010710
p010721
p010725
p010738
p010748
p010766
p010769
p010774
p010782
p010785
p010799
p010814
p010842
p010847
p010852
p010856
p010859
p010872
p010906
p010917
p010924
p010925
p010926
p010928
p010939
p010957
p010973
p010985
p010995
p011003
p011018
p011032
p011043
p011055
p011061
p011066
p011086
p011096
p011099
p011137
p011138
p011143
p011147
p011154
p011161
p011162
p011171
p011172
p011187
p011191
p011200
p011205
p011232
p011235
p011236
p011242
p011244
p011247
p011261
p011268
p011279
p011280
p011291
p011292
p011318
p011320
p011323
p011328
p011330
p011341
p011342
p011347
p011370
p011372
p011380
p011388
p011403
p011421
p011431
p011442
p011446
p011464
p011467
p011473
p011486
p011509
p011512
p011529
p011546
p011558
p011561
p011563
p011590
p011591
p011596
p011597
p011600
p011604
p011609
p011622
p011638
p011641
p011658
p011667
p011679
p011681
p011684
p011688
p011694
p011698
p011700
p011703
p011710
p011723
p011727
p011728
p011744
p011752
p011756
p011757
p011762
p011763
p011764
p011785
p011787
p011801
p011815
p011827
p011829
p011840
p011850
p011852
p011855
p011861
p011870
p011876
p011877
p011901
p011907
p011912
p011931
p011945
p011948
p011949
p011951
p011957
p011978
p011988
p011992
p011995
p011998
p012000
p012008
p012013
p012014
p012070
p012077
p012078
p012090
p012094
p012104
p012110
p012112
p012113
p012115
p012116
p012122
p012124
p012140
p012141
p012167
p012169
p012171
p012174
p012175
p012181
p012182
p012184
p012187
p012203
p012212
p012215
p012217
p012251
p012267
p012274
p012277
p012284
p012294
p012306
p012319
p012331
p012344
p012351
p012365
p012367
p012371
p012372
p012375
p012383
p012388
p012400
p012403
p012410
p012411
p012435
p012445
p012461
p012473
p012481
p012482
p012508
p012531
p012536
p012564
p012565
p012567
p012573
p012575
p012577
p012581
p012586
p012589
p012599
p012619
p012631
p012632
p012663
p012673
p012679
p012693
p012704
p012708
p012709
p012712
p012724
p012727
p012733
p012739
p012748
p012752
p012753
p012772
p012788
p012795
p012797
p012799
p012806
p012807
p012812
p012821
p012823
p012829
p012831
p012834
p012849
p012856
p012869
p012878
p012883
p012903
p012914
p012915
p012920
p012941
p012942
p012947
p012968
p012974
p012982
p012987
p013002
p013013
p013033
p013049
p013052
p013071
p013072
p013096
p013099
p013101
p013110
p013121
p013123
p013136
p013144
p013146
p013149
p013150
p013171
p013181
p013183
p013191
p013195
p013212
p013214
p013218
p013226
p013253
p013259
p013265
p013266
p013273
p013274
p013286
p013295
p013308
p013314
p013316
p013329
p013333
p013353
p013354
p013355
p013361
p013364
p013373
p013378
p013422
p013432
p013435
p013436
p013437
p013438
p013439
p013463
p013476
p013480
p013485
p013489
p013494
p013500
p013501
p013508
p013532
p013536
p013561
p013564
p013569
p013570
p013582
p013593
p013599
p013600
p013615
p013618
p013628
p013629
p013640
p013646
p013649
p013667
p013668
p013689
p013694
p013705
p013710
p013715
p013716
p013719
p013720
p013728
p013731
p013739
p013752
p013754
p013759
p013772
p013782
p013793
p013818
p013830
p013837
p013839
p013840
p013844
p013850
p013852
p013854
p013868
p013877
p013902
p013927
p013948
p013960
p013970
p013973
p013993
p014005
p014014
p014019
p014034
p014036
p014037
p014048
p014050
p014054
p014057
p014058
p014059
p014070
p014079
p014094
p014096
p014098
p014104
p014106
p014123
p014131
p014161
p014167
p014168
p014174
p014179
p014186
p014189
p014190
p014197
p014204
p014205
p014233
p014240
p014245
p014251
p014256
p014263
p014266
p014279
p014286
p014297
p014298
p014299
p014321
p014322
p014325
p014328
p014330
p014334
p014346
p014370
p014386
p014391
p014448
p014458
p014469
p014478
p014486
p014495
p014497
p014506
p014520
p014524
p014529
p014532
p014533
p014539
p014542
p014544
p014551
p014561
p014573
p014579
p014584
p014592
p014603
p014611
p014622
p014626
p014651
p014664
p014669
p014679
p014692
p014702
p014703
p014714
p014724
p014728
p014749
p014755
p014761
p014763
p014766
p014772
p014777
p014780
p014784
p014793
p014822
p014824
p014828
p014836
p014855
p014857
p014862
p014863
p014865
p014873
p014884
p014897
p014898
p014899
p014900
p014909
p014914
p014918
p014919
p014922
p014928
p014929
p014932
p014935
p014936
p014938
p014946
p014947
p014953
p014975
p014982
p014995
p015013
p015021
p015026
p015046
p015052
p015055
p015079
p015082
p015093
p015110
p015119
p015124
p015128
p015141
p015144
p015150
p015168
p015181
p015185
p015198
p015208
p015218
p015226
p015230
p015243
p015254
p015266
p015268
p015270
p015279
p015295
p015298
p015301
p015302
p015303
p015308
p015315
p015329
p015333
p015361
p015369
p015382
p015385
p015426
p015427
p015435
p015458
p015461
p015464
p015465
p015466
p015470
p015474
p015480
p015485
p015488
p015490
p015509
p015514
p015524
p015531
p015538
p015545
p015557
p015558
p015563
p015567
p015569
p015583
p015595
p015610
p015619
p015624
p015625
p015631
p015632
p015637
p015640
p015644
p015645
p015652
p015654
p015669
p015679
p015683
p015684
p015687
p015701
p015703
p015716
p015725
p015727
p015733
p015749
p015769
p015770
p015775
p015809
p015810
p015817
p015821
p015831
p015838
p015841
p015852
p015877
p015883
p015885
p015900
p015902
p015903
p015904
p015911
p015917
p015922
p015924
p015929
p015963
p015964
p015965
p015974
p015977
p015982
p015997
p016002
p016013
p016019
p016024
p016032
p016046
p016063
p016071
p016076
p016088
p016105
p016112
p016115
p016117
p016121
p016122
p016127
p016129
p016139
p016156
p016161
p016164
p016172
p016192
p016196
p016199
p016210
p016216
p016236
p016256
p016258
p016261
p016265
p016275
p016279
p016280
p016286
p016296
p016322
p016336
p016337
p016343
p016352
p016353
p016360
p016365
p016373
p016391
p016392
p016399
p016423
p016436
p016447
p016455
p016463
p016490
p016492
p016499
p016504
p016511
p016516
p016533
p016550
p016552
p016554
p016560
p016561
p016565
p016568
p016581
p016590
p016592
p016607
p016608
p016619
p016637
p016639
p016640
p016642
p016666
p016677
p016684
p016685
p016691
p016709
p016715
p016723
p016727
p016740
p016748
p016775
p016776
p016798
p016804
p016806
p016810
p016821
p016827
p016839
p016849
p016853
p016856
p016860
p016864
p016873
p016874
p016876
p016881
p016888
p016909
p016916
p016924
p016927
p016932
p016949
p016961
p016967
p016992
p017002
p017018
p017021
p017026
p017028
p017038
p017041
p017054
p017058
p017069
p017072
p017075
p017083
p017092
p017097
p017105
p017112
p017122
p017128
p017133
p017145
p017149
p017152
p017182
p017196
p017216
p017218
p017231
p017236
p017246
p017260
p017262
p017280
p017285
p017286
p017290
p017293
p017299
p017300
p017312
p017313
p017318
p017337
p017344
p017366
p017372
p017394
p017401
p017412
p017419
p017421
p017423
p017440
p017443
p017451
p017456
p017457
p017472
p017483
p017488
p017497
p017516
p017522
p017539
p017557
p017582
p017589
p017616
p017617
p017629
p017646
p017663
p017666
p017667
p017671
p017674
p017690
p017691
p017692
p017696
p017697
p017702
p017712
p017717
p017727
p017735
p017736
p017743
p017748
p017753
p017757
p017761
p017764
p017765
p017774
p017775
p017785
p017791
p017795
p017798
p017803
p017807
p017808
p017810
p017812
p017814
p017822
p017826
p017828
p017847
p017848
p017865
p017875
p017882
p017886
p017901
p017902
p017913
p017920
p017929
p017944
p017948
p017954
p017959
p017976
p017997
p018000
p018035
p018088
p018108
p018123
p018126
p018139
p018166
p018167
p018169
p018200
p018205
p018219
p018225
p018229
p018233
p018239
p018248
p018254
p018258
p018264
p018269
p018280
p018300
p018322
p018333
p018356
p018357
p018358
p018365
p018376
p018377
p018393
p018401
p018402
p018403
p018413
p018418
p018429
p018469
p018487
p018489
p018498
p018516
p018524
p018535
p018546
p018568
p018572
p018584
p018595
p018597
p018614
p018615
p018621
p018624
p018626
p018633
p018642
p018643
p018676
p018681
p018685
p018687
p018688
p018689
p018695
p018696
p018711
p018727
p018733
p018737
p018738
p018739
p018740
p018753
p018786
p018812
p018815
p018818
p018837
p018846
p018852
p018875
p018892
p018896
p018897
p018910
p018921
p018928
p018942
p018952
p018959
p018962
p018970
p018971
p018975
p018982
p018988
p018995
p018996
p018998
p019005
p019012
p019016
p019018
p019029
p019031
p019038
p019040
p019053
p019055
p019059
p019087
p019093
p019099
p019102
p019125
p019145
p019155
p019169
p019208
p019213
p019218
p019220
p019233
p019246
p019248
p019265
p019296
p019297
p019308
p019309
p019311
p019330
p019333
p019338
p019342
p019344
p019346
p019361
p019364
p019371
p019372
p019375
p019405
p019411
p019412
p019418
p019430
p019442
p019445
p019453
p019465
p019470
p019484
p019493
p019513
p019523
p019538
p019553
p019560
p019563
p019578
p019598
p019603
p019604
p019608
p019618
p019620
p019624
p019627
p019634
p019640
p019644
p019649
p019655
p019666
p019675
p019685
p019700
p019718
p019726
p019733
p019734
p019757
p019765
p019769
p019771
p019811
p019815
p019817
p019827
p019834
p019848
p019866
p019872
p019891
p019898
p019918
p019931
p019936
p019938
p019947
p019965
p019975
p019977
p019980
p019981
p019999