![Data Light@8x](https://8277274.fs1.hubspotusercontent-na1.net/hub/8277274/hubfs/Webpage%20assets/Icons/Product/Data%20Light@8x.png?width=80&height=80&name=Data%20Light@8x.png)
Data
A single place to manage all data assets in the cloud and from any other source. Use in computational work while tracking lineage, ensuring reproducibility, and reducing duplication.
Key capabilities
Manage data from multiple sources
Use data from multiple sources in your computational work. Native support for AWS S3, Athena, and Databricks. Choose whether data is brought into Code Ocean for enhanced reproducibility and performance, or remains external for convenience.
![manage-data-multiple-sources-2-1](https://8277274.fs1.hubspotusercontent-na1.net/hub/8277274/hubfs/Webpage%20assets/Product/Data/manage-data-multiple-sources-2-1.png?width=650&height=450&name=manage-data-multiple-sources-2-1.png)
Attach Data instantly to any analysis or pipeline
Data are managed separately from Capsules and Pipelines. This means a Data asset can be used in multiple Capsules, Pipelines, and by multiple users simultaneously and without duplication. All recently used internal Data are cached to EFS for fast access.
![instantly-attach-data](https://8277274.fs1.hubspotusercontent-na1.net/hub/8277274/hubfs/Webpage%20assets/Product/Data/instantly-attach-data.png?width=650&height=450&name=instantly-attach-data.png)
Track and share data with others
All Data can be shared with other users and/or groups in your AWS deployment with built-in permission and access control.
![track-share-data-with-others](https://8277274.fs1.hubspotusercontent-na1.net/hub/8277274/hubfs/Webpage%20assets/Product/Data/track-share-data-with-others.png?width=650&height=450&name=track-share-data-with-others.png)
Get instant provenance for all result data
Track Result provenance with the Lineage Graph. Click into any Result Data to see where it came from and how it was generated.
![instant-provenance-result-data](https://8277274.fs1.hubspotusercontent-na1.net/hub/8277274/hubfs/Webpage%20assets/Product/Data/instant-provenance-result-data.png?width=650&height=450&name=instant-provenance-result-data.png)
How Data works with the rest of the platform
![Collections@8x](https://8277274.fs1.hubspotusercontent-na1.net/hub/8277274/hubfs/Webpage%20assets/Icons/Product/Collections@8x.png?length=720&name=Collections@8x.png)
![Collections](https://8277274.fs1.hubspotusercontent-na1.net/hub/8277274/hubfs/Webpage%20assets/Homepage/Key%20product%20features/Collections.png?length=720&name=Collections.png)
![Lineage Colour@8x](https://8277274.fs1.hubspotusercontent-na1.net/hub/8277274/hubfs/Webpage%20assets/Icons/Product/Lineage%20Colour@8x.png?length=720&name=Lineage%20Colour@8x.png)
![Lineage Graph](https://8277274.fs1.hubspotusercontent-na1.net/hub/8277274/hubfs/Webpage%20assets/Homepage/Key%20product%20features/Lineage%20Graph.png?length=720&name=Lineage%20Graph.png)
![API@8x](https://8277274.fs1.hubspotusercontent-na1.net/hub/8277274/hubfs/Webpage%20assets/Icons/Product/API@8x.png?length=720&name=API@8x.png)
![API](https://8277274.fs1.hubspotusercontent-na1.net/hub/8277274/hubfs/Webpage%20assets/Homepage/Key%20product%20features/API.png?length=720&name=API.png)
Built for bioinformatics
-
Data analysis
Data analysis
Use ready-made template Compute Capsules to analyze your data, develop your data analysis workflow in your preferred language and IDE using any open-source software, and take advantage of built-in containerization to guarantee reproducibility.
-
Data management
Data management
Manage your organization's data and control who has access to it. Built specifically to meet all FAIR principles, data management in Code Ocean uses custom metadata and controlled vocabularies to ensure consistency and improve searchability.
-
Bioinformatics pipelines
Bioinformatics pipelines
Build, configure and monitor bioinformatics pipelines from scratch using a visual builder for easy set-up. Or, import from nf-core in one click for instant access to a curated set of best practice analysis pipelines. Runs on AWS Batch out-of-the-box, so your pipelines scale automatically. No setup needed.
-
AI & Machine Learning
AI & Machine Learning
Code Ocean is uniquely suited for Artificial Intelligence, Machine Learning, Deep Learning, and Generative AI. Install GPU-ready environments and provision GPU resources in a few clicks. Integration with MLFlow allows you to develop models, track parameters, manage models from development to production, while enjoying out-of-the-box reproducibility and lineage.
-
Multiomics
Multiomics
Analyze and work with large multimodal datasets efficiently using scalable compute and storage resources, cached packages for R and Python, preloaded multiomics analysis software that works out of the box and full lineage and reproducibility.
-
Image processing
Image processing
Process images using a variety of tools: from dedicated desktop applications to custom-written deep learning pipelines, from a few individual files to petabyte-sized datasets. No DevOps required, always with lineage.
-
Cloud management
Cloud management
Code Ocean makes it easy to manage data and provision compute: CPUs, GPUs, and RAM. Assign flex machines and dedicated machines to manage what is available to your users. Spot instances, idleness detection, and automated shutdown help reduce cloud costs.
-
Result provenance
Result provenance
Keep track of all data and results with automated result provenance and lineage graph generation. Assess reproducibility with a visual representation of every Capsule, Pipeline, and Data asset involved in a computation.
![data-analysis](https://8277274.fs1.hubspotusercontent-na1.net/hub/8277274/hubfs/Webpage%20assets/Homepage/Use%20cases/data-analysis.png?width=1320&height=920&name=data-analysis.png)
Data analysis
Use ready-made template Compute Capsules to analyze your data, develop your data analysis workflow in your preferred language and IDE using any open-source software, and take advantage of built-in containerization to guarantee reproducibility.
![data-management](https://8277274.fs1.hubspotusercontent-na1.net/hub/8277274/hubfs/Webpage%20assets/Homepage/Use%20cases/data-management.png?width=1320&height=920&name=data-management.png)
Data management
Manage your organization's data and control who has access to it. Built specifically to meet all FAIR principles, data management in Code Ocean uses custom metadata and controlled vocabularies to ensure consistency and improve searchability.
![bioinformatic-pipelines](https://8277274.fs1.hubspotusercontent-na1.net/hub/8277274/hubfs/Webpage%20assets/Homepage/Use%20cases/bioinformatic-pipelines.png?width=1320&height=920&name=bioinformatic-pipelines.png)
Bioinformatics pipelines
Build, configure and monitor bioinformatics pipelines from scratch using a visual builder for easy set-up. Or, import from nf-core in one click for instant access to a curated set of best practice analysis pipelines. Runs on AWS Batch out-of-the-box, so your pipelines scale automatically. No setup needed.
![AI-MLflow](https://8277274.fs1.hubspotusercontent-na1.net/hub/8277274/hubfs/Webpage%20assets/Homepage/Use%20cases/AI-MLflow.png?width=1320&height=920&name=AI-MLflow.png)
AI & Machine Learning
Code Ocean is uniquely suited for Artificial Intelligence, Machine Learning, Deep Learning, and Generative AI. Install GPU-ready environments and provision GPU resources in a few clicks. Integration with MLFlow allows you to develop models, track parameters, manage models from development to production, while enjoying out-of-the-box reproducibility and lineage.
![multiomics](https://8277274.fs1.hubspotusercontent-na1.net/hub/8277274/hubfs/Webpage%20assets/Product/Product%20overview/Use%20cases/multiomics.png?width=1320&height=920&name=multiomics.png)
Multiomics
Analyze and work with large multimodal datasets efficiently using scalable compute and storage resources, cached packages for R and Python, preloaded multiomics analysis software that works out of the box and full lineage and reproducibility.
![image-processing](https://8277274.fs1.hubspotusercontent-na1.net/hub/8277274/hubfs/Webpage%20assets/Product/Product%20overview/Use%20cases/image-processing.png?width=1320&height=920&name=image-processing.png)
Image processing
Process images using a variety of tools: from dedicated desktop applications to custom-written deep learning pipelines, from a few individual files to petabyte-sized datasets. No DevOps required, always with lineage.
![cloud-management](https://8277274.fs1.hubspotusercontent-na1.net/hub/8277274/hubfs/Webpage%20assets/Product/Product%20overview/Use%20cases/cloud-management.png?width=1320&height=920&name=cloud-management.png)
Cloud management
Code Ocean makes it easy to manage data and provision compute: CPUs, GPUs, and RAM. Assign flex machines and dedicated machines to manage what is available to your users. Spot instances, idleness detection, and automated shutdown help reduce cloud costs.
![result-provenance](https://8277274.fs1.hubspotusercontent-na1.net/hub/8277274/hubfs/Webpage%20assets/Product/Product%20overview/Use%20cases/result-provenance.png?width=1320&height=920&name=result-provenance.png)
Result provenance
Keep track of all data and results with automated result provenance and lineage graph generation. Assess reproducibility with a visual representation of every Capsule, Pipeline, and Data asset involved in a computation.