Difference between revisions of "Get Started"

From SICDB Doc
Jump to navigation Jump to search
 
(19 intermediate revisions by the same user not shown)
Line 1: Line 1:
== Introduction ==  
== Introduction ==  


The SICdb dataset is provided in compressed .csv files, the minute values are even more consolidated. Refer to the [[Main Page]] for detailed description of files.
{{Beta Version}}


The SICdb dataset contains billions of entries, therefore building up a database may present a challenge. Therefore a 'as simple as possible' solution is provided. Our solution, we called it [[SICdb Environment]], provides a fully preconfigured and fast environment to access, explore and export SICdb data. Refer to the Quick Start chapter if you know how the commandline and docker is working, skip to the Detailed Instructions for a more detailed reference.
The SICdb dataset is provided in compressed .csv files, the minute values are even more consolidated. Refer to the [[File List]] for detailed description of files.  


== Quick Start ==
The SICdb dataset contains billions of entries, therefore building up a database may present a challenge.


Just like other ICU datasets SICdb is huge. While our compressed database, which originated from a 2 terabyte database, has only about 13gb, uncompressed you have to expect at least 60gb.
== Usage Examples ==


The database can be built up using [http://www.docker.com Docker]. After install navigate into the folder containing all the data and run "docker compose up". When the environment is running open http://localhost:5000 to install the dataset. The provided environment ist fully preconfigured, just press start and wait. Due to the vast size this may take 4-16 hours*. When install is finished the server has to be restarted, you may do this by reloading the page and then press the shiny restart button.
Scripting or Database Query examples for using SICdb are found at https://github.com/nrodemund/sicdb/tree/main/Examples


*) We work on a solution to provide a fully indexed database. Until now we have not found a legally safe way of distribution (repository).
== Relational Database Import ==


== Detailed Instructions ==
Refer to https://github.com/nrodemund/sicdb/tree/main/Import for some scripts for importing the dataset into a relational database system.
 
I'll write as fast as I can, so much to do...

Latest revision as of 08:51, 5 April 2024

Introduction

SICdb dataset and the documentation, as of 04/24, in active development. We try to improve our project as fast as we can! Please contact us for every problem with data or software you find! We'd love to find some motivated researchers attain as much as possible from our dataset. Contact us if you face any issues!

The SICdb dataset is provided in compressed .csv files, the minute values are even more consolidated. Refer to the File List for detailed description of files.

The SICdb dataset contains billions of entries, therefore building up a database may present a challenge.

Usage Examples

Scripting or Database Query examples for using SICdb are found at https://github.com/nrodemund/sicdb/tree/main/Examples

Relational Database Import

Refer to https://github.com/nrodemund/sicdb/tree/main/Import for some scripts for importing the dataset into a relational database system.