Data to Knowledge Package (D2KP)
The end goal of the AquaINFRA project isn’t just to do good science; it’s to create science that can be verified and reused by anyone.
If a published paper is the menu, the Data-to-Knowledge Package (D2KP) is the recipe and the ingredients. It is the structured package that bundles your raw aquatic data, your analysis scripts, and your exact software environment together.
📽️ Video Tutorial: The D2KP Concept
Explore how D2KPs organize these resources and why they are essential for moving away from “black-box” environmental research.
🏗️ Inside a D2KP
A D2KP is not just a zip file of PDFs. It is a fully functional toolkit designed to offer different interaction levels depending on what the next researcher needs:
- The Data & Code: Every D2KP contains direct links to the raw data sources and the foundational source code (like Python or R scripts), guaranteeing full transparency.
- The Application (Galaxy): For those who want to use your methodology without writing code, the package includes a pre-configured workflow that can be executed directly in the Galaxy interface.
- The Raw Engine (API): Developers can use the underlying Web API (PyGEOAPI) to directly hook your analytical processes into their own custom systems or Digital Twins.
- The Developer Sandbox: For those who want to deeply interrogate or edit your code, they can launch a containerized Virtual Lab (R Studio via MyBinder) linked directly to your package.
🔍 Finding and Using D2KPs
[!NOTE] Interconnectivity When you find a dataset on the AIP, look for the “Related Content” section on the right side. This will indicate if an associated D2KP exists, showing you exactly how that data was used in previous research.