Cloud-based computational science platform
Collaborative, reproducible bioinformatics for R&D
Code Ocean is a reproducible and traceable computational science platform for collaborative research in the cloud. Set up environments, provision compute, develop analysis with others, and scale bioinformatic pipelines.
Trusted by thousands of computational scientists in biotech and pharma
It takes time to set up environments and provision cloud computing
It can be difficult to work with others on the same computational projects
It’s hard to reproduce research in different environments, if at all
Most bioinformatics relies on manual, DIY workflows
Set up environments and provision compute
Build what you need with built-in tools and automated Dockerfile generation. Develop in your preferred cloud workstation and attach data of any size. Deploy to the cloud in one click, and shut down automatically when idle.
Collaborate with others on powerful workflows
Develop computational research with others, from straightforward analysis to parallelized bioinformatics pipelines. Share and co-develop any Compute Capsule, Pipeline, or Data asset within your deployment.
Get automated result provenance and reproducibility
See the provenance of all results with the Lineage Graph and preserve everything needed for full reproducibility. All development is tracked with Git, and there is no lock-in: export anything at any time and run it anywhere else.
What our customers say
“Code Ocean's self-service capabilities make it easy for our scientists to do their work reproducibly. New users to the platform can get far with just a little support, giving our engineers time to focus on domain-specific challenges.”
Dr. David Feng
Director of Scientific Computing, Allen Institute
“Code Ocean totally solves tracking and reproducing analysis for researchers and increases trust in the results.”
Benjamin Haibe-Kains, Ph.D.
Senior Scientist, Princess Margaret Cancer Centre
“Code Ocean sped up our internal image pre-processing computational workflow by at least 10x, improving collaboration and productivity between our global teams ... removing the need for painful cloud computing infrastructure set up.”
Dimitris Polychronopoulos
Director of In Silico Biology, Ochre Bio
“Code Ocean opens the door to automating and scaling major aspects of the research process. With the Code Ocean platform, our scientists can focus on their core competencies, do more, think bigger, and achieve higher goals.”
Yuval Kalugny
VP Engineering, CytoReason
“The biggest win at Lantern is the pipeline we developed, and the value of Code Ocean was that it allowed the whole team to work together ... we had a shared environment for collaboration.”
Peter Carr
Principal Platform Architect, Lantern Pharma
Built for bioinformatics
Data analysis
Use ready-made template Compute Capsules to analyze your data, develop your data analysis workflow in your preferred language and IDE using any open-source software, and take advantage of built-in containerization to guarantee reproducibility.
Data management
Manage your organization's data and control who has access to it. Built with FAIR principles in mind, data management in Code Ocean uses custom metadata and controlled vocabularies to ensure consistency and improve searchability.
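To make the idea of controlled vocabularies concrete, here is a minimal Python sketch of checking a metadata record against a fixed set of allowed terms. The field names, the terms, and the `validate_metadata` helper are all hypothetical illustrations; they are not Code Ocean's API or data model.

```python
# Hypothetical controlled vocabularies: each metadata field lists its allowed terms.
# These fields and terms are illustrative assumptions, not taken from Code Ocean.
VOCABULARIES = {
    "organism": {"Homo sapiens", "Mus musculus"},
    "assay": {"RNA-seq", "ATAC-seq", "WGS"},
}


def validate_metadata(record, vocabularies):
    """Return a list of problems; an empty list means the record conforms."""
    problems = []
    for field, allowed in vocabularies.items():
        if field not in record:
            problems.append(f"missing field: {field}")
        elif record[field] not in allowed:
            problems.append(f"{field}: {record[field]!r} is not an allowed term")
    return problems


if __name__ == "__main__":
    # A free-text value such as "human" is flagged, which is what keeps search consistent.
    for problem in validate_metadata({"organism": "human"}, VOCABULARIES):
        print(problem)
```

Restricting each field to a known set of terms is what lets a search for one term find every matching dataset, rather than missing records that spell the same thing differently.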
Bioinformatics pipelines
Build, configure, and monitor bioinformatics pipelines from scratch using a visual builder for easy setup. Or import from nf-core in one click for instant access to a curated set of best-practice analysis pipelines. Pipelines run on AWS Batch out of the box, so they scale automatically with no setup needed.
AI and Machine Learning
Code Ocean is uniquely suited for Artificial Intelligence, Machine Learning, Deep Learning, and Generative AI. Install GPU-ready environments and provision GPU resources in a few clicks. Integration with MLflow allows you to develop models, track parameters, and manage models from development to production, while enjoying out-of-the-box reproducibility and lineage.
Multiomics
Analyze and work with large multimodal datasets efficiently using scalable compute and storage resources, cached packages for R and Python, preloaded multiomics analysis software that works out of the box, and full lineage and reproducibility.
Image processing
Process images using a variety of tools: from dedicated desktop applications to custom-written deep learning pipelines, from a few individual files to petabyte-sized datasets. No DevOps required, always with lineage.
Cloud management
Code Ocean makes it easy to manage data and provision compute: CPUs, GPUs, and RAM. Assign flex machines and dedicated machines to manage what is available to your users. Spot instances, idleness detection, and automated shutdown help reduce cloud costs.
Result provenance
Keep track of all data and results with automated result provenance and lineage graph generation. Assess reproducibility with a visual representation of every Capsule, Pipeline, and Data asset involved in a computation.
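One way to picture result provenance is as a directed acyclic graph in which every result points back to the data, Capsules, and Pipelines it was derived from. The Python sketch below walks such a graph to list everything a result depends on. The asset names and the `upstream_assets` helper are hypothetical, and this is an illustration of the general idea rather than how the Lineage Graph is implemented.

```python
from collections import deque

# Hypothetical provenance records: each asset maps to the assets it was derived from.
# The names are illustrative assumptions and do not come from Code Ocean.
PROVENANCE = {
    "raw_reads": [],
    "reference_genome": [],
    "align_capsule": [],
    "aligned_bam": ["raw_reads", "reference_genome", "align_capsule"],
    "variant_pipeline": [],
    "variants_vcf": ["aligned_bam", "variant_pipeline"],
}


def upstream_assets(asset, provenance):
    """Return every asset that `asset` depends on, directly or indirectly."""
    seen = set()
    queue = deque(provenance[asset])
    while queue:
        current = queue.popleft()
        if current in seen:
            continue  # already visited via another path
        seen.add(current)
        queue.extend(provenance.get(current, []))
    return seen


if __name__ == "__main__":
    # Everything that would need to be preserved to reproduce the final result.
    print(sorted(upstream_assets("variants_vcf", PROVENANCE)))
```

Reproducing a result then amounts to preserving every asset returned by the traversal, which is why tracking these links automatically matters.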
Product features
Capsules
Pipelines
Data
Lineage Graph
Collections
Apps
Admin
API
Who is Code Ocean for?
IT & Engineering
Computational Scientists
R&D Leadership