OpenSAFELY CLI
The main tool for using the OpenSAFELY platform locally is the opensafely
Python module, which is run via the command-line interface (CLI).
Its main function is to run data extraction and analysis scripts that are specified in the project pipeline, in a way that mimics the production environment where real data is accessed.
It also contains other functions relating to the OpenSAFELY workflow, such as updating codelists from OpenCodelists.
Installing opensafely
🔗
This is a command-line program.
To install, go to the Anaconda prompt and run the following command (or use another method to install the module if you know how):
pip install opensafely
To check this has installed successfully, run opensafely --version
.
Updating opensafely
🔗
You should keep the tool up to date as much as possible. You can upgrade to a new version of opensafely
by running:
opensafely upgrade
The above command only works with opensafely version 1.6.0 or newer. If you are using an older version, you will first need to upgrade it with:
pip install --upgrade opensafely
Using opensafely
at the command line🔗
To view the in-built documentation for each command, run opensafely --help
at the terminal, which will list all the ways in which you can use it.
You can also use opensafely run --help
to learn more about the run
command, for example.
To run any of these commands for a specific OpenSAFELY project, you need to change the directory of your prompt to be the repository of the project.
For example, cd C:/Users/me/my-git-repos/my-repo
.
More information on how to use the opensafely
module is available in specific sections elsewhere, but some key functions are described briefly below.
run
🔗
The most common command you'll run.
This runs actions defined in the project.yaml
file and is the main way of testing your code.
For example,
opensafely run make_graph
will run the make_graph
action.
To run or to force run?
The run
command takes --force-run-dependencies
or -f
arguments,
where the latter is the short form of the former.
However, what do these arguments do?
When an action is a dependency of another action,
the run
command uses the dependency action's outputs
-- and one of these arguments, if one is present --
to determine whether the dependency action should also run.
If you specify the action to run but don't pass one of these arguments, then:
- The action is run, whether or not its outputs exist.
- Its dependencies are also run, if their outputs do not exist. Conversely, its dependencies are not run, if their outputs exist.
If you specify the action to run and pass one of these arguments, then:
- The action is run, whether or not its outputs exist.
- Its dependencies are also run, whether or not their outputs exist.
What about the run_all
action?
Think of all actions as dependencies of the run_all
action.
If you specify the run_all
action but don't pass one of these arguments,
then for each action:
- If the action's outputs exist, then it is not run.
- If the action's outputs do not exist, then it is run.
If you specify the run_all
action and pass one of these arguments, then:
- All actions are run, whether or not their outputs exist.
codelists
🔗
This command is for working with codelists.
Use
opensafely codelists update
to retrieve each codelist listed in /codelists/codelists.txt
from OpenCodelists.
It will add (or update) the codelist .csv
files to the codelists/
folder.
Use
opensafely codelists check
to check if the codelist files are up-to-date with thse listed in ./codelists/codelists.txt
.
See the Codelist section for more information on codelists.
Updating Docker images🔗
To run your code on your machine, the opensafely
tool uses the same Docker
images that run in the secure server environments. These are updated
periodically, for example when new libraries are installed. If you have error
messages about missing libraries, your Docker images may need upgrading.
To pull the most recent Docker images to your machine, run:
opensafely pull