The amount and complexity of globally available and newly generated data is dramatically increasing. In many cases, the problem is not whether data is available but rather if one can easily access them in a structured manner. In addition to raw data, there are huge challenges in having ongoing access to and to be able to verify analyzes performed on data by many independent actors. For some data sets, taking infectious diseases as an example, parts of the data may be associated with high level of confidentiality, but at the same time it has to be fully searchable, trustable and available for integration to be useful. In addition the data must be available to a defined subset of researchers within a very short time horizon (e.g. hours).