Bash Apps

Learn to write a basic Bash app .

If you have not already, you should install the DNAnexus SDK and walk through the Intro to Building Apps tutorial.

Bash apps are the simplest apps you can create on DNAnexus. They include a shell script which runs on a virtual machine on the cloud. The script, written by the app developer, is responsible for downloading inputs, processing, and uploading outputs.

Downloading and Using File Inputs

There are two ways to download inputs: one by one, or all at once.

To download all inputs at once (recommended), use the dx-download-all-inputs utility. Add "--parallel" to allow multiple downloads in parallel:dx-download-all-inputs --parallel.

Inputs are downloaded to a folder called "in" under the home folder. Each input is placed under its own subfolder "~/in/name_of_input_field/", named after the input field. Files retain their original filenames as supplied by the user who launched the applet.

For example, if your applet defines a file input called "mappings" and a user runs it with a file called "SRR001.bam", it will be downloaded into ~/in/mappings/SRR001.bam.

The system defines the following helper bash variables, which you can use in your applet:

Bash variable

Content

$mappings_path

~/in/mappings/SRR001.bam

$mappings_name

SRR001.bam

$mappings_prefix

SRR001

$mappings

{"$dnanexus_link": "file-F77Bp7002302Zb343BF1FpG0"}

The variable $mappings_prefix is automatically computed by the system by starting from the filename and removing any suffixes that match the patterns specified in dxapp.json for this input field.

To download inputs one by one, use the following syntax:

dx download "$name_of_input_field"

This will download the file to the current working directory. It will retain the original file as supplied by the user who launched the applet. You can use the "$xxxxxx_name" variable (as shown in the table above) to refer to that filename.

To name the local file using a different name than the original filename, use the following syntax:

dx download "$name_of_input_field" -o local_filename

To stream the local file and pipe it to another command, use the following syntax:

dx cat "$name_of_input_field" | command

You can combine these strategies. You can download or stream some inputs one by one, and the rest all at once, using the following syntax:

dx-download-all-inputs --except name_of_input_field1 --except name_of_input_field2

dx download "$name_of_input_field1"

dx cat "$name_of_input_field2" | command

Downloading Inputs Using dx-mount-all-inputs

An alternative to dx-download-all-inputs is the dx-mount-all-inputs command-line utility. You can use it by adding this line to your script.sh:

$ dx-mount-all-inputs

dxfuse is required for the dx-mount-all-inputs to work. You can download the dxfuse binary from https://github.com/dnanexus/dxfuse/releases and add it to ./resources/usr/bin/dxfuse.

dx-mount-all-inputsuses the same directory structure as illustrated above for dx-download-all-inputs, except that the files are mounted to the respective location rather than downloaded. When using dx-mount-all-inputs, input files do not take up local storage because they are mounted using Linux FUSE technology and streamed behind the scenes transparently when accessed.

Uploading Outputs

There are two ways to upload outputs: one by one, or all at once.

To upload outputs one by one, use the following syntax:

id=$(dx upload /path/to/local/file --brief)

dx-jobutil-add-output name_of_output_field "$id"

If you would like the uploaded file to have a different name than the local file, add the following to the dx upload command:

--path remote_filename

If you would like the uploaded file to appear under a subfolder in the applet outputs, add the following to the dx upload command:

--path /subfolder/remote_filename --parents

If you would like the uploaded file to contain metadata, for example set a property (key/value pair), add the following to the dx upload command:

--property key=value

To upload all outputs at once, create a folder "out" under the home folder, and create a subfolder named after each output field. Place a file under each subfolder, and call the "dx-upload-all-outputs" utility:

mkdir -p ~/out/name_of_output_field1/ ~/out/name_of_output_field2/

mv file1 ~/out/name_of_output_field1/

mv file2 ~/out/name_of_output_field2/

dx-upload-all-outputs

If you would like any uploaded file to have a different name than the local file, rename them as you move them:

mv file1 ~/out/name_of_output_field1/renamed_file1

If you would like any uploaded file to appear under a subfolder in the applet outputs, create the subfolder like this:

mkdir -p ~/out/name_of_output_field1/subfolder/ mv file1 ~/out/name_of_output_field1/subfolder/

If you would like the uploaded files to contain metadata, for example set a property (key/value pair), make the following changes:

  1. Install the "attr" Linux executable by specifying the "attr" Ubuntu package in your dxapp.json.

  2. For each file, and for each key/value pair you would like to attach, set the respective Linux extended file system attribute like this:

    attr -s key -V value ~/out/name_of_output_field1/file1

  3. Add "--xattr-properties" to the dx-upload-all-outputs invocation:

    dx-upload-all-outputs --xattr-properties

Last updated

Copyright 2024 DNAnexus