Goal To extract the C-regions (exons), J-segments and V-segments from this T-receptor locus sequence (AF159056), producing output in FASTA format.
Script 4Matt.fas=extract(AF159056.gbk,'V_segment,J_segment,C_region')
Output (Matt.fas) >Matt:1
atgcggtgggccctagcggtgcttctagctttcctgtctcctggtgagtgcgctgcctacagagaggatcacgggttttgttttgttttgttattttcttcttttgcaaggagcgacatactaagaaatgcctcattatattttgtgttgttcccattgcagccagtcagatatcttccaacttggaagggagaacgaagtcagtcaccaggctgactgggtcatctgctgaaatcacctgtgatcttcctggagcaagtaccttatacatccactggtacctgcaccaggaggggaaggccccacagtgtcttctgtactatgaaccctactactccagggttgtgctggaatcaggaatcactccaggaaagtatgacactggaagcacaaggagcaattggaatttgagactgcaaaatctaattaaaaatgattctgggttctattactgtgccacctgggacagg
>Matt:2
atgcagtgggccctagcggtgcttctagctttcctgtctcctggtgagtgcgctgcctacagagaggatcacgggttttgttttattttcttcttttgcaaggagtaccatactaaggaattcctcattatattttgtgttgttcccactgcagccagtcagaaatcttccaacttggaagggagaacgaagtcagtcatcaggcagactgggtcatctgctgaaatcacttgtgatcttgctgaaggaagtaacggctacatccactggtacctacaccaggaggggaaggccccacagcgtcttcagtactatgactcctacaactccaaggttgtgttggaatcaggagtcagtccagggaagtattatacttacgcaagcacaaggaacaacttgagattgatactgcgaaatctaattgaaaatgactctggggtctattactgtgccacctgggacggg
>Matt:3-25(similar; omitted here to conserve space)

>Matt:26
atacactactgctgcagctcacaaacacctctgcatattacatgtacctcctcctgctcctcaagagtgtggtctattttgccatcatcacctgctgtctgcttggaagaacggctttctgctgcaatggagagaaatca

Need more help with this?
Contact DNASTAR

Thanks for your feedback.