Describing Data Objects
You can describe objects (files, app(let)s, and workflows) on the DNAnexus platform using the dx describe command.
Describing an Object by Name
Objects can be described by name from the command line interface (CLI) by giving a path to the object on the DNAnexus platform.
Describe an Object With a Relative Path
Objects can be described relative to the user's current directory on the DNAnexus platform. In the following example, we describe the indexed reference genome file human_g1k_v37.bwa-index.tar.gz.
NOTE: The entire path is enclosed in quotes because the folder name Original files contains a space. Instead of quoting, you can escape special characters with a backslash (\). For example, dx describe Original\ files/human_g1k_v37.bwa-index.tar.gz describes the same file.
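As a minimal sketch of the quoting note above, the snippet below stores the path in both spellings and describes the file. The variable names are illustrative only, and the dx call is guarded so that it runs only where the dx CLI is installed; it also assumes you are logged in and that the current platform directory contains the Original files folder.

```shell
# The quoted and backslash-escaped spellings are the same string once the
# shell has processed them, so dx receives one identical argument either way.
quoted="Original files/human_g1k_v37.bwa-index.tar.gz"
escaped=Original\ files/human_g1k_v37.bwa-index.tar.gz

# Assumption: dx is installed and you are logged in; otherwise skip the call.
if command -v dx >/dev/null 2>&1; then
    dx describe "$quoted"
fi
```

Without the quotes or the backslash, the shell would split the path at the space and pass two separate arguments to dx describe.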
Describe an Object in a Different Project Using an Absolute Path
Objects can be described using an absolute path. This allows us to describe objects outside the current project context. In the following example, we use dx select to switch to the project "My Research Project" and then dx describe the file human_g1k_v37.fa.gz in the "Reference Genome Files" project.
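The absolute-path form can be sketched as a project name and a path joined by a colon. The helper name describe_in_project is hypothetical (it is not part of dx), and the folder path in the usage comment is a placeholder, because the file's actual location inside "Reference Genome Files" is not given here.

```shell
# Hypothetical helper (not part of dx): join a project name and a path with
# a colon to form a project-qualified path, then describe that object.
describe_in_project() {
    dx describe "$1:$2"
}

# Usage sketch. The folder path is a placeholder, not the file's real location:
#   dx select "My Research Project"
#   describe_in_project "Reference Genome Files" "/path/to/human_g1k_v37.fa.gz"
```

Quoting "$1:$2" keeps the whole project-qualified path as a single argument even when the project name contains spaces.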
Describe an Object Using Object ID
Objects can be described using a unique object ID.
In this example, we describe workflow object "Exome Analysis Workflow" using its ID. This workflow is publicly available in the "Exome Analysis Demo" project.
Due to the amount of information contained in a workflow (including multiple app(let)s, inputs/outputs, and default parameters), the dx describe output can seem overwhelming.
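One way to keep a long description manageable is to look at only the first lines of it. The sketch below assumes a hypothetical helper named describe_head, and workflow-xxxx in the usage comment is a placeholder for the real workflow ID, which is not reproduced here.

```shell
# Hypothetical helper: describe an object by ID and show only the first
# N lines of the output (N defaults to 20 when no second argument is given).
describe_head() {
    dx describe "$1" | head -n "${2:-20}"
}

# Usage sketch; workflow-xxxx is a placeholder for the real workflow ID:
#   describe_head workflow-xxxx 40
```

Piping the output through a pager such as less is an equally reasonable choice when you want to scroll through the full description.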
Manipulating Outputs
The output from a dx describe command can be used for various purposes. The optional argument --json converts the output from dx describe into JSON format for advanced scripting and command-line use.
In this example, we will describe the publicly available workflow object "Exome Analysis Workflow" and return the output in JSON format.
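A minimal sketch of capturing the JSON form is shown here. The helper name save_description and the output file name are assumptions made for illustration, and workflow-xxxx stands in for the real workflow ID.

```shell
# Hypothetical helper: write the JSON description of an object to a file so
# it can be inspected or parsed later without querying the platform again.
save_description() {
    dx describe "$1" --json > "$2"
}

# Usage sketch; workflow-xxxx is a placeholder for the real workflow ID:
#   save_description workflow-xxxx workflow.json
```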
We can parse, process, and query the JSON output using jq. Below, we process the dx describe --json output to generate a list of all stages in the exome analysis pipeline.
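The stage listing can be sketched as a jq filter that selects the "stages" value from the JSON read on standard input. The helper name list_stages is hypothetical, jq must be installed separately, and workflow-xxxx is a placeholder ID.

```shell
# Hypothetical helper: read describe JSON on stdin and print its "stages" value.
list_stages() {
    jq '.stages'
}

# Usage sketch; workflow-xxxx is a placeholder for the real workflow ID:
#   dx describe workflow-xxxx --json | list_stages
```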
We can output the "executable" value of each stage present in the "stages" value of the dx describe output above using the command below.
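A sketch of that extraction, assuming "stages" is a JSON array whose entries each carry an "executable" key as described above. The helper name stage_executables is hypothetical, and workflow-xxxx is again a placeholder ID.

```shell
# Hypothetical helper: read describe JSON on stdin and print the "executable"
# value of every stage, one per line (-r prints raw strings without quotes).
stage_executables() {
    jq -r '.stages[].executable'
}

# Usage sketch; workflow-xxxx is a placeholder for the real workflow ID:
#   dx describe workflow-xxxx --json | stage_executables
```

Each printed value is itself an object ID or name, so it can be passed back to dx describe to inspect an individual stage's app(let).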
General Response Fields Overview