Installation#

AmpSeeker does not require installation, however, users must have snakemake and conda installed in order to run the workflow.

Conda

Conda is an open-source package management system, which allows users to install a variety of packages and programs whilst managing software environments and dependencies. In general, we recommend using the miniconda installation.

https://conda.io/projects/conda/en/latest/user-guide/install/index.html

Snakemake

Snakemake is a workflow management system to create reproducible and scalable computational analyses. AmpSeeker is written in snakemake, which itself is python-based. The Snakemake authors encourage installation through conda.

https://snakemake.readthedocs.io/en/stable/getting_started/installation.html

You may also need pandas, which you can install with pip install pandas.

bcl-convert

Unfortunately, bcl-convert is not available via conda, so if you wish to demultiplex from bcl files, we must either install singularity and specify --use-singularity at the command line when running snakemake (recommended), or install bcl-convert directly from Illumina manually. We recommend installing with singularity, as not all versions of bcl-convert can generate index fastq reads, which are required.

To install singularity, please use the command below:

conda install -c conda-forge singularity

To install bcl-convert, please follow the instructions here: https://emea.support.illumina.com/sequencing/sequencing_software/bcl-convert.html


AmpSeeker#

The next step is to clone the AmpSeeker github repository to your system. You may use the below command, replacing my_repo_name with something of your choice. This may be useful in case you ever use AmpSeeker on multiple datasets.

git clone --recursive https://github.com/sanjaynagi/AmpSeeker.git my_repo_name

Alternatively, if you think you may want to contribute to AmpSeeker, you may wish to fork the repository to your github account prior to cloning. For more information, please the docs on contributing.

As the workflow contains a submodule (compkaryo), we should use git clone --recursive to clone the repository.

Installing dependencies#

When we run AmpSeeker, it is important to use the --use-conda directive - this will automatically install all necessary packages into isolated software environments, so we do not have to install any other dependencies.

Testing the setup#

To test whether AmpSeeker is configured correctly, from the root directory you can perform a dry-run on the test dataset, with the following command:

snakemake --cores 1 --directory .test/ --use-conda -n