This example can be adapted to extract relevant annotated features (e.g., specific feature types or all features with specified annotations) for uses such as building BLAST databases or consensus matrices or performing alignments. Script A generates an overlapping CDS (specifically, the yeaC fragment). Using Scripts B or C solves this problem. Scripts B and C produce identical results, but only B also outputs the nucleotide sequence file.
|Goal||To extract a set of annotated CDS features from a genome as protein sequences|
|Output A|| LOCUS U00096:yeaA 414 bp DNA 13-JAN-2012
Source complement (1..414)
/note=”***Needs review***Cut segment head by 1860039 and tail by 2778768 units.”
/note=”***Needs review***Cut segment tail by 314 units.”
1 atggctaata aaccttcggc agaagaactg aaaaaaaatt tgtccgagat gcagttttac
61 gtgacgcaga atcatgggac agaaccgcca tttacgggtc gtttactgca taacaagcgt
121 gacggcgtat atcactgttt gatctgcgat gccccgctgt ttcattccca aaccaagtat
181 gattccggct gtggctggcc cagtttctac gaaccggtaa gtgaagaatc cattcgttat
241 atcaaagact tgtcacatgg aatgcagcgc atagaaattc gttgcggtaa ctgtgatgcc…
Need more help with this?