Comment on page
Learn to build and use scatter plots in the Cohort Browser.
The Cohort Browser is accessible to all users of the UK Biobank Research Analysis Platform and the Our Future Health Trusted Research Environment.
Scatter plots can be used to compare the distribution of values in a field containing numerical data, across different groups in a cohort. In a scatter plot, each such group is defined by its members sharing the same value in another field that also contains numerical data.
Primary field values are plotted on the x axis. Secondary field values are plotted on the y axis.
In the scatter plot below, each dot represents a particular combination of values, found in one or more records in a cohort, in fields Insurance Billed and Cost. The lighter the dot at a particular point, the fewer the records that share that combination. Darker dots, meanwhile, indicate that relatively more records that share a particular combination.
Scatter Plot: Insurance Billed x Cost
In some cases, a field containing numeric data may also contain some non-numeric values. These values cannot be represented in a scatter plot. The message "This field contains non-numeric values" will appear below the scatter plot, as in this sample chart:
Scatter Plot Based on Field or Fields Containing Non-Numeric Values
Clicking the "non-numeric values" link will display detail on those values, and the number of record in which each appears.
Detail on Non-Numeric Values
In the Cohort Browser, scatter plots can show up to 30,000 distinct data points. If you create a scatter plot that would require that more data points be shown, you'll see this message above the chart:
Scatter Plot with Warning Message about Data Point Limit
Scatter plots are not supported in Cohort Compare.
- Integer Sparse
- Float Sparse