Get Started

From SICDB Doc
Revision as of 21:29, 17 September 2022 by Salkin (talk | contribs)
Jump to navigation Jump to search

Introduction

The SICdb dataset is provided in compressed .csv files, the minute values are even more consolidated. Refer to the Main Page for detailed description of files.

The SICdb dataset contains billions of entries, therefore building up a database may present a challenge. Therefore a 'as simple as possible' solution is provided. Our solution, we called it SICdb Environment, provides a fully preconfigured and fast environment to access, explore and export SICdb data. Refer to the Quick Start chapter if you know how the commandline and docker is working, skip to the Detailed Instructions for a more detailed reference.

Quick Start

Just like other ICU datasets SICdb is huge. While our compressed database, which originated from a 2 terabyte database, has only about 13gb, uncompressed you have to expect at least 60gb.

The database can be built up using Docker. After install navigate into the folder containing all the data and run "docker compose up". When the environment is running open http://localhost:5000 to install the dataset. The provided environment ist fully preconfigured, just press start and wait. Due to the vast size this may take 4-16 hours*. When install is finished the server has to be restarted, you may do this by reloading the page and then press the shiny restart button.

  • ) We work on a solution to provide a fully indexed database. Until now we have not found a legally safe way of distribution (repository).

Detailed Instructions

I'll write as fast as I can, so much to do...