Frequently asked questions

Answers to the most frequently asked questions about our platform and its features. Have more questions? Book a demo through the link below.

Book a demo Get product overview

General

Who is Code Ocean for?

Anyone working with data, but especially anyone working in regulated environments (like the life sciences). This includes computational scientists and bioinformaticians working directly with data and the IT and engineering teams supporting them.

Where is Code Ocean installed?

Code Ocean installs directly into your AWS account in about 40 minutes. If your data is already on AWS, you can leave it exactly where it is: no need to move or upload your data anywhere else, or to any locked-in platforms.

If we use Code Ocean, are we locked into using it?

No. Unlike other platforms, we have a firm No Lock-In philosophy: anything you do in Code Ocean (e.g. in Compute Capsules or Pipelines) can be exported—with a single click—at any time.

What are your security and compliance standards?

Code Ocean is HIPAA, GxP, and ISO27001 compliant. We are currently working towards SOC2 compliance with an external auditor.

Capsules

How do Compute Capsules work?

Compute Capsules work by creating an immutable record of the code, environment, and data used in computational work, as well as the results produced. They are releasable, versionable, and reproducible. Whatever runs today will always run in the future.

What can you do with a Compute Capsule?

Virtually anything. Simple exploratory analysis, training machine learning models, creating API connections, building visualization apps, and more. However, you can do all these things at the same time as attaching data of any size, provisioning compute, and co-developing with others, all while creating an automatically traceable and reproducible record of work.

What cloud workstations are available for developing code?

VSCode, JupyterLab, RStudio, CLI, MATLAB, Shiny, Streamlit, Ubuntu Desktop (Beta) and IGV (Beta).

How are Compute Capsules different from just using Docker?

Docker is just one piece of what makes up a Capsule. Docker is natively integrated into Code Ocean so users don’t need to be familiar with Docker to take advantage of its containerization technology. One example is the automated Dockerfile generation.

Do Capsules integrate Git? Can you connect Capsules to your Git provider?

Both. All Capsules (and Pipelines) on Code Ocean integrate Git to track changes as development progresses, regardless of whether or not you’ve connected Code Ocean to your Git provider. We have native integrations for Github, Gitlab, Bitbucket and Azure DevOps.

Is there a file size limit for data you can add to a Capsule?

No. You can attach data of any size to any Capsule. Data and Capsules are managed separately, meaning you can attach the same data asset to multiple Capsules at the same time, reducing data duplication, and potentially decreasing the total cost of ownership over time.

Pipelines

What language are Pipelines based on?

Nextflow. Pipelines have native integration of nextflow, with nextflow scripts automatically created and written as uesrs work in the visual pipeline builder.

Which nextflow DSL is supported?

Both. Building Pipelines with Code Ocean’s Pipeline Builder UI uses DSL1. Custom pipelines support both DSL1 and DSL2.

Do we always have to use the visual pipeline builder tool?

No. Pipelines can be unlocked for users to write their own custom script in Nextflow. Users can fully leverage all Nextflow capabilities within a custom pipeline.

Can you assign different computing resources for different parts of the pipeline?

Yes. Each Capsule can have different computing resources applied (when part of a pipeline in the visual UI), depending on requirements.

I already have a pipeline I want to use. Can I bring that into Code Ocean?

Yes. Users can import an existing pipeline from a git repository, import from nf-core, or upload files to make their custom pipeline. This allows them to quickly create pipelines when first starting with Code Ocean.

Data

Where is my data stored?

Internal Data that you’ve brought into Code Ocean for enhanced reproducibility are stored in an organized S3 bucket inside your AWS account. External data remains external and is accessed by linking to S3 bucket locations.

Where can I bring in new data from?

Virtually anywhere. You can bring in data by local upload, and native integrations include AWS S3, AWS Athena, and Databricks. Code Ocean Apps offers several custom Data Connector Capsules, or you can make your own by connecting through API.

Are there any limits to the size of data I can work with?

No. There are no limits to the size of Data that Code Ocean supports, and any data asset can be connected to any Compute Capsule or Pipeline.

Is my data safe?

Yes. Code Ocean supports user Secrets for any credentials that are required to access private data and assumable IAM roles.

Can I share data?

Yes. Data can be easily shared with users and groups. Secrets and Roles ensure data access is a seamless experience for users.

Do you support custom metadata?

Yes. Admins can create custom keys and specify value and input types for users. Custom metadata allows for improved search, management, and organization of data.

Lineage Graph

Can you always guarantee reproducibility?

Yes, if all source data has been imported and made immutable. If you’ve used an external data source that is subject to change, reproducibility cannot be guaranteed.

Can you click into each node to see details?

Yes. You can interact with every node on the Lineage Graph. Click on any asset that you have access to and explore the full details of which version was run, when it was run, and who by.

Can we use this to connect to an external lineage tool?

Yes, using our API, you could connect to your own instance of an open-source tool, e.g. OpenLineage.

Collections

What can be added to a Collection?

You can add all four of the main computational assets in Code Ocean: Compute Capsules, Pipelines, no-code Apps, and Data.

Who can add assets to a Collection?

The owner(s) of an asset can add or remove it from a Collection.

Are all assets in Collections visible to all users?

No. Every asset has its own access permissions. Users will only see what they have permission to see in a Collection.

How do we set up the Collections folder structure?

Admins determine the Collections folder structure for the organization.

Is there anything we can’t add to a Collection?

Capsules, Pipelines, and Apps that are not yet released/still in development cannot be added to a Collection. This makes sure that anything added to a collection has passed important checks.

Apps

What options are available for making apps?

There are three options. Streamlit and RShiny are both available as cloud workstations. Any Capsule or Pipeline can also be converted into an app with the app panel.

Can you make apps out of both Capsules and Pipelines?

Yes. Any Capsule or Pipeline can be converted into a parameterized no-code App that can then be released for use by non-coders.

Can non-coding users bring their own data to use in an App?

Yes. Users can easily swap out App data for their own in an App Panel. This is also possible with Streamlit and RShiny.

Are there readymade apps already available in the platform?

Yes. You can find more information about this and a full list of apps in our documentation.

Do you ever add new apps to this list?

We often add new apps, usually at the request of existing customers. To find out if we can help you with this, please get in touch.

Admin Panel

What methods of authentication do you support?

We support built-in authentication, SAML SSO, and OpenID Connect (OIDC).

How can I track user behavior, resources, storage, etc?

An analytics dashboard allows administrators to monitor usage, costs, storage, and activity across their entire deployment. Admins are also able to get direct access to users' assets (there are hyperlinks that make is very easy for admins to identify and navigate to them).

Can I put limits on user compute?

Yes. Dedicated machines can be inactivated to prevent users from selecting these machines for use in their workflows.

Is there support for custom environments?

Yes. Custom environments can be deployed or even created from a Compute Capsule. Admin-managed Starter Environments are a great way to standardize workflows across the organization.

Compute Capsules