Jupyter Notebooks

TileDB Cloud enables the user to launch Jupyter notebooks within the UI console. It spins up Jupyter notebook instances in the Kubernetes cluster in us-east-1. The user can install any extra packages in the notebook. The notebook server environment is destroyed on shutdown. Any extra packages installed will not persist across server instances.

Every user gets a 2GB persistent storage in an EBS volume (also in us-east-1). This is mounted as the home directory in the notebook server. All contents in the home directory will persist across server restarts. The user does not get charged for storage!

Currently, TileDB offers two notebook server sizes:

Size

CPUs

Memory

Small

2

2GB

Large

16

60GB

As explained in the Pricing and Billing section, notebooks are charged based on the size of the notebook server and duration it is run for.

Currently notebook usage is charged either to an organization a user belongs to or, if the user is not part of an organization, to the user themselves. We plan a future improvement to allow selecting who to charge for the notebook usage.

TileDB Cloud offers three notebook images, with the following installed packages:

  • Basic Data Science:tiledb, libtiledb-sql-py, plotly, ipywidgets, graphviz, pandas, pydot, trimesh, numpy, chardet, numba, tiledb-r, voila, opencv, tiledb-cloud, pybabylonjs, envbash, tiledb-ml

  • Genomics: Everything in the Basic Data Science notebook plus:snakemake, tiledb-vcf, htslib, bcftools, pybedtools

  • Geospatial: Everything in the Basic Data Science notebook plus:cartopy, datashader, descartes, folium, geos, geotiff, holoviews, imagemagick, laszip, libnetcdf, proj, shapely, scikit-build, proj, gdal, rasterio, mb-system, pdal, fiona, geopandas, scikit-mobility, xarray, tiledb-segy, capella-tools

Last updated