Main Page

From SICDB Doc
Jump to navigation Jump to search

SICdb dataset, the documentation and our Software Package are, as of 11/22, in active development. We try to improve our project as fast as we can! Please contact us for every problem with data or software you find! We'd love to find some motivated researchers attain as much as possible from our dataset. Contact us if you face any issues!

Welcome to SICdb Documentation

The SICdb dataset provides insight to over 27 thousand intensive care admissions, including therapy and data of their preceding surgery. Data was collected between 2013 and 2021 from 4 of the intensive care units at the University Hospital Salzburg, having more than 3 thousand intensive care admissions per year on 37 beds. The dataset is deidentified and contains, amongst others, case information, laboratory, medication, monitor and respirator signal data.SICdb provides aggregated once-per-hour and highly granular once-per-minute data.


Reference Table

Nominal data is encoded, the reference table d_references provides information associated with a reference. Referenced fields of all data tables relate to the primary key of the reference table, d_references.ReferenceGlobalID. The value field d_references.ReferenceValue gives the variables value, the unit field d_references.ReferenceUnit holds the unit or measurement if applicable. Refer to chapter d_references for further information.

Data Description

The SICdb dataset contains billions of data entries, provided in 6 data tables. The base table is cases, which contains one entry per intensive care admission. The table contains, amongst others, information about the patient, like age, weight and sex, and case information like diagnosis, scores and ICD10 codes. All other data table relate to this table, where x.CaseID is associated with cases.CaseID. Most data has timing information, where the field x.offset is the amount of seconds from admission to event time.

Table laboratory contains laboratory values, table medication provides data about applied drugs. There are several generic data tables, which contain data sorted by their type. In table data_ref additional nominal/categorical data, one per admission, is found. In data_range items with a begin and end time are documented, this includes for example data about central lines or drainages. Table data_float_h contains float data, aggregated once per hour. To decrease table size minute data is serialized as a stream of IEEE 754 floats in data_float_h.rawdata

The database contains pseudoanonymized data of intensive care patients and has to be handled with appropriate care and respect.

Follow the instructions on Get Started documentation to begin!


The Dataset


The Dataset contains, amongst others, following data:


All nominal fields are referenced in table d_references.

The Database

While the raw Dataset in CSV files provides all possibilities to analyse and process data, it may be more convenient to browse in a fully configured and optimized database.

The SICdb Team provides a software environment to quickly select, view and export data from the database in various formats.

RooDataServer is a specific software created to access medical datasets like the SICDB Dataset.

-- > Quick Start <--