The loadSeq command loads a sequence file or files for assembly. See our website for a list of supported file types.

Parameter Description Allowed values
blockContig Used in the reference-guided workflow. [text string]
blockContigID Used in the reference-guided workflow. [number]
blockName Used in the reference-guided workflow. [text string]
blockPos Used in the reference-guided workflow. [number]
DiscardLinkerless Specifies whether reads that do not have a linker sequence should be discarded from the assembly. [ true / false ]
file (required) Specifies the directory and file name of the sequence file(s) to be loaded. A folder may also be specified, in which case all of the sequence files within that folder will be loaded. [directory/filename enclosed in quotes]
groupName Used to identify the multi-sample group name for a read file. [text string]
isPair Specifies whether the query files contain paired end data. [ true / false ]
linker The directory and file name of the .fas, fna, or .sff file containing the 454 linker sequences. If not specified, SeqMan NGen will use its default 454 linker sequence: GTTGGAACCGAAAGGGTTTGAATTCAAACCCTTTCGGTTCCAAC. [directory/filename enclosed in quotes]
max The maximum distance for the paired end constraint. [number]

Default = 10000
maxSeqs Specifies the maximum number of reads to load from a file. [number]
mergePairs Specifies whether the reads are paired end data that overlap and should therefore be merged. [ true / false ]
min The minimum distance for the paired end constraint. [number]

Default = 0
minSeqLen Minimum length of a sequence required to include it in the assembly. [number]
multiplex Specifies whether reads are from a multi-sample run. [ true / false ]
seqTech Specifies the offset to be used when converting compressed quality scores into numerical values. These are the offsets used for the technology specified:

Data Type / Value / Offset
IonTorrent / IonTorrent / 33
Applied Biosystems SOLiD / SOLiD / 33
Illumina / Illumina / 64
Roche 454 / 454 / 33
Other types / normalScore / 33

Note 1: For 454,quality scores for homopolymeric runs of ≥ 2 are oriented from 5’ to 3’ on the top strand.

Note 2: If possible, the data type of unknown data is determined automatically based on the first data file.
[IonTorrent|SOLiD|Illumina|454|normalScore|Other]
templateFragment Used in reference-guided assemblies with gap closure. [number]

Example:

loadSeq
   file: “/Library/ABC_project/ABC_sequences.fas”

Need more help with this?
Contact DNASTAR

Thanks for your feedback.