-
Notifications
You must be signed in to change notification settings - Fork 4
Description
We previously discussed adding support for sorting FASTQ files by sequence. This would both improve compression (leading to gains in performance and also size on disk) and also potentially improve the preview behaviour particularly for barcoded libraries.
Sorting should only be supported for merged reads (since sorting a set of R1/R2/R3 FASTQ files and leaving them all in the right order seems like too much of a side project).
We will want to make sure that if you re-run a pipeline that already has generated a sorted read file, that it doesn't have to re-sort again since that will just waste a bunch of time.
The plugin should also have an option to sort in reverse order, which mostly just gives the user another try if the "top" of the file is not informative (because it's all the wild type barcodes, for example).