Getting started#

This page should contain a short guide on what the plugin does and a short example on how to use the plugin.

Installation#

Use the following commands to install the plugin:

git clone https://github.com/aiidateam/aiida-diff .
cd aiida-diff
pip install -e .  # also installs aiida, if missing (but not postgres)
#pip install -e .[pre-commit,testing] # install extras for more features
verdi quicksetup  # better to set up a new profile
verdi plugin list aiida.calculations  # should now show your calculation plugins

Then use verdi code setup with the diff input plugin to set up an AiiDA code for aiida-diff.

Usage#

A quick demo of how to submit a calculation:

verdi daemon start         # make sure the daemon is running
cd examples
verdi run test_submit.py        # submit test calculation
verdi calculation list -a  # check status of calculation

If you have already set up your own aiida_diff code using verdi code setup, you may want to try the following command:

diff-submit  # uses aiida_diff.cli

Available calculations#

calcjobaiida_diff.calculations.DiffCalculation

AiiDA calculation plugin wrapping the diff executable. Simple AiiDA plugin wrapper for ‘diffing’ two files.

Inputs:

code, (AbstractCode, NoneType), optional – The Code to use for this job. This input is required, unless the remote_folder input is specified, which means an existing job is being imported and no code will actually be run.
file1, SinglefileData, required – First file to be compared.
file2, SinglefileData, required – Second file to be compared.
metadata, Namespace
- call_link_label, str, optional, is_metadata – The label to use for the CALL link if the process is called by another process.
- computer, (Computer, NoneType), optional, is_metadata – When using a “local” code, set the computer on which the calculation should be run.
- description, (str, NoneType), optional, is_metadata – Description to set on the process node.
- dry_run, bool, optional, is_metadata – When set to True will prepare the calculation job for submission but not actually launch it.
- label, (str, NoneType), optional, is_metadata – Label to set on the process node.
- options, Namespace
  - account, (str, NoneType), optional, is_metadata – Set the account to use in for the queue on the remote computer
  - additional_retrieve_list, (list, tuple, NoneType), optional, is_metadata – List of relative file paths that should be retrieved in addition to what the plugin specifies.
  - append_text, str, optional, is_metadata – Set the calculation-specific append text, which is going to be appended in the scheduler-job script, just after the code execution
  - custom_scheduler_commands, str, optional, is_metadata – Set a (possibly multiline) string with the commands that the user wants to manually set for the scheduler. The difference of this option with respect to the prepend_text is the position in the scheduler submission file where such text is inserted: with this option, the string is inserted before any non-scheduler command
  - environment_variables, dict, optional, is_metadata – Set a dictionary of custom environment variables for this calculation
  - environment_variables_double_quotes, bool, optional, is_metadata – If set to True, use double quotes instead of single quotes to escape the environment variables specified in environment_variables.
  - import_sys_environment, bool, optional, is_metadata – If set to true, the submission script will load the system environment variables
  - input_filename, (str, NoneType), optional, is_metadata – Filename to which the input for the code that is to be run is written.
  - max_memory_kb, (int, NoneType), optional, is_metadata – Set the maximum memory (in KiloBytes) to be asked to the scheduler
  - max_wallclock_seconds, (int, NoneType), optional, is_metadata – Set the wallclock in seconds asked to the scheduler
  - mpirun_extra_params, (list, tuple), optional, is_metadata – Set the extra params to pass to the mpirun (or equivalent) command after the one provided in computer.mpirun_command. Example: mpirun -np 8 extra_params[0] extra_params[1] … exec.x
  - output_filename, str, optional, is_metadata
  - parser_name, (str, NoneType), optional, is_metadata – Set a string for the output parser. Can be None if no output plugin is available or needed
  - prepend_text, str, optional, is_metadata – Set the calculation-specific prepend text, which is going to be prepended in the scheduler-job script, just before the code execution
  - priority, (str, NoneType), optional, is_metadata – Set the priority of the job to be queued
  - qos, (str, NoneType), optional, is_metadata – Set the quality of service to use in for the queue on the remote computer
  - queue_name, (str, NoneType), optional, is_metadata – Set the name of the queue on the remote computer
  - rerunnable, (bool, NoneType), optional, is_metadata – Determines if the calculation can be requeued / rerun.
  - resources, dict, required, is_metadata – Set the dictionary of resources to be used by the scheduler plugin, like the number of nodes, cpus etc. This dictionary is scheduler-plugin dependent. Look at the documentation of the scheduler for more details.
  - scheduler_stderr, str, optional, is_metadata – Filename to which the content of stderr of the scheduler is written.
  - scheduler_stdout, str, optional, is_metadata – Filename to which the content of stdout of the scheduler is written.
  - stash, Namespace – Optional directives to stash files after the calculation job has completed.
    - source_list, (tuple, list, NoneType), optional, is_metadata – Sequence of relative filepaths representing files in the remote directory that should be stashed.
    - stash_mode, (str, NoneType), optional, is_metadata – Mode with which to perform the stashing, should be value of aiida.common.datastructures.StashMode.
    - target_base, (str, NoneType), optional, is_metadata – The base location to where the files should be stashd. For example, for the copy stash mode, this should be an absolute filepath on the remote computer.
  - submit_script_filename, str, optional, is_metadata – Filename to which the job submission script is written.
  - withmpi, (bool, NoneType), optional, is_metadata – Set the calculation to use mpi
- store_provenance, bool, optional, is_metadata – If set to False provenance will not be stored in the database.
monitors, Namespace – Add monitoring functions that can inspect output files while the job is running and decide to prematurely terminate the job.
parameters, DiffParameters, required – Command line parameters for diff
remote_folder, (RemoteData, NoneType), optional – Remote directory containing the results of an already completed calculation job without AiiDA. The inputs should be passed to the CalcJob as normal but instead of launching the actual job, the engine will recreate the input files and then proceed straight to the retrieve step where the files of this RemoteData will be retrieved as if it had been actually launched through AiiDA. If a parser is defined in the inputs, the results are parsed and attached as output nodes as usual.

Outputs:

diff, SinglefileData, required – diff between file1 and file2.
remote_folder, RemoteData, required – Input files necessary to run the process will be stored in this folder node.
remote_stash, RemoteStashData, optional – Contents of the stash.source_list option are stored in this remote folder after job completion.
retrieved, FolderData, required – Files that are retrieved by the daemon will be stored in this node. By default the stdout and stderr of the scheduler will be added, but one can add more by specifying them in CalcInfo.retrieve_list.