Comment on page
Learn to launch a JupyterLab session on the DNAnexus Platform, via the DXJupyterLab app.
DXJupyterLab is accessible to all users of the UK Biobank Research Analysis Platform and the Our Future Health Trusted Research Environment.
If you have used DXJupyterLab before, the page will display a list of your previous sessions run across different projects.
This will open a window from which you can start a new JupyterLab environment. In this window, you can configure your session, e.g. specify its name, select an instance type, and choose the project in which JupyterLab should be started.
You can adjust the
durationof the session, after which the environment will automatically shut down. Based on this duration and the instance type, the estimation of the price will be shown in the bottom-left corner (if you have access to the billing information for the selected project).
If you select
Enable Spark Cluster, a JupyterLab environment with a standalone Spark cluster will be started. With this option, you can also set the number of nodes in the cluster. This number includes the master (one node) and the worker nodes.
featureoptions available are
ML, IMAGE_PROCESSING, and
STATA. Selecting the
PYTHON_Rfeature (default option) loads the environment with Python3 and R kernel and interpreter. Selecting the
MLfeature loads the environment with Python3 and Machine Learning packages such as TensorFlow, PyTorch, CNTK as well as Image Processing package Nipype but it does not contain R. Selecting the
IMAGE_PROCESSINGfeature loads the environment with Python3 and Image Processing packages such as Nipype, FreeSurfer and FSL but it does not contain R. The FreeSurfer package requires a license to run. Details about License creation and usage can be found here. The
STATAfeature requires a license to run. For a detailed list of libraries included in each of these
featureoptions, see the in-product documentation.
First, the JupyterLab will be in an "Initializing" state, where it waits for the worker to spin up and for the JupyterLab server to be up and running. Clicking on the row corresponding to your session and the
iicon in the top right corner will display more information corresponding to the JupyterLab job.
Once the JupyterLab server is running, the session state will change to
Readyand the name of the session will turn into a link. By clicking this link, you can open a JupyterLab environment page in your browser. You can access your job via the URL
job-xxxxis the ID of the DXJupyterLab's job.
You can start the JupyterLab environment directly from the command line by running the app:
$ dx run app-dxjupyterlab
Once the app starts, you may check if the JupyterLab server is ready to server connections, which will be indicated by the job's property
running. Once it is running, you can open your browser and go to:
job-xxxxis the ID of the job running the app.
In order to run the Spark version of the app, use the command:
$ dx run app-dxjupyterlab_spark_cluster
You can check the optional input parameters for the apps on the DNAnexus platform (platform login required to access the links):
From the CLI, you can learn more about
dx runwith the following command:
$ dx run -h APP_NAME
where APP_NAME is either