Volume: 1, 2007DNASTAR Quarterly NewsletterIssue: 3
DNASTAR Quarterly Newsletter
Coming Events
10/25 Workshop ASHG, Next Generation Sequencing Analysis Software Q4 2007 Enhanced Next Generation Sequencing Assembler (pre-release)
See Us At
9/27–9/28 NIH Research Show, Washington, DC 10/18 Univ of Washington, Seattle, WA
10/9–10/11 Biotechnica, Hannover, Germany 10/23–10/27 American Society of Human Genetics, Washington, DC
10/17 Fred Hutchinson Cancer Ctr, Seattle, WA 12/1–12/5 American Society of Cell Biology
Letter from the General ManagerTom SchweiVice President and General Manager

Dear DNASTAR® Customers and Friends:

We live in an exciting time, where personalized medicine and other techniques that seemed a long way off just a few years ago are fast becoming routine medical procedures. We have recently begun speaking with individuals and organizations that use next-generation DNA sequencing techniques and DNASTAR’s software for human and other diagnostic purposes. At DNASTAR, we are truly excited to be part of this medical revolution and we are proud to be able to support global efforts in these and other areas

This past summer has been very exciting for us at DNASTAR.
  • As mentioned in our last quarterly newsletter, we launched ArrayStar 2.0 in June. ArrayStar is our first commercial product in the microarray gene expression software area.
  • In late July, we launched the Lasergene 7.2 software upgrade, which enhances Lasergene’s ability to read and analyze large data files from 454 next-generation sequencing machines on a standalone basis and/or with Sanger sequences.
  • In July and August, we enhanced our SeqMan Genome Assembler development-stage software to read Solexa sequencing instrument data in addition to Sanger and 454 data, which it already handled well. While still development-stage software, we have placed this software with several key customers and we welcome additional opportunities to put it to the test if you have a need for it.
As always, please let me or anyone else at DNASTAR know if we can improve the way in which we are serving you.

Sincerely,


Vice President and General Manager

ApplicationsUsing Custom Gateway® Vectors in SeqBuilder

SeqBuilder supports four commonly used cloning methods: Restriction Enzyme Cloning, Directional TOPO® Cloning, TA Cloning, and Gateway® Cloning. In conjunction with these methods, a Cloning Vector catalog is installed with SeqBuilder, providing vectors for you to use for each cloning method. However, you may have custom vectors you wish to use. To use a custom Gateway® destination vector within SeqBuilder, the following process should be used:

Click to enlarge

1. Open your custom vector sequence in SeqBuilder.

2. Add two features labeled attR1 and attR2 on the appropriate sub-sequences where recombination occurs. Note that the attR sites should be added to the top strand, with attR1 appearing upstream of attR2.

3. Save your changes and close the sequence file.

4. Open the Cloning Vector catalog, CloningVectors.sbp, located in the following directory:

  • Windows® XP/2000: C:\Documents and Settings\All Users\Shared Documents\DNASTAR\Lasergene 7 Data
  • Windows VistaTM: C:\Users\Public\Public Documents\DNASTAR\Lasergene 7 Data
  • Macintosh®: Hard Drive:Applications:DNASTAR:Lasergene

5. Within the Cloning Vector catalog, open the Gateway Vectors folder and then click on the Destination Vectors folder to select it.

6. Go to File»Import and navigate to the custom vector sequence file that you just edited. Selecting this sequence file will add it to the Destination Vectors folder, and make it available to you in the Gateway Cloning Advisor the next time you use Gateway® Cloning.

We are interested in updating our catalog to include new vectors. If you have a vector you would like to see added, please let us know at info@dnastar.com.

Product PreviewLASERGENE v7.2 — New Tools to Help with Today's Data Analysis Challenges

Flowgram — Click to enlarge

DNASTAR recently launched its latest version of Lasergene version 7.2. Continuing its policy of frequently upgrading its software to provide its users with the ability to keep up with the rapidly changing sequence analysis field, the company has expanded the sequence assembly capability and visualization capability of Lasergene while also providing a version that is fully compatible with Microsoft VistaTM.

Lasergene v7.1 started DNASTAR’s introduction into expanding our assembly capabilities to handle the growing Next Generation sequencing needs of our customers. Users were able to assemble using both Sanger methods and those of 454 Life SciencesTM. These assemblies could be performed either separately or together on a common backbone. No other commercially available software provided that Feature!

Lasergene 7.2 builds on that Feature by providing users the capability to assemble larger projects faster than before and to obtain visualizations that are pertinent to the pyrosequencing technology of 454. Now it is possible to assemble entire bacterial genomes on a modest desktop computer in a short period of time using Lasergene 7.2!

The new visualizations are designed to provide users with additional analytical information. The availability to generate Flowgrams when working with the 454 data provides valuable information on the sequence itself. As seen in the illustration, Quality Scores for each base call are included in the histogram.

A second feature focuses on the depth of coverage of their assembly that users want to see. A visualization permits easy recognition of assembled regions where the coverage exceeds the target.

DNASTAR continues to strive to provide our customers with the sequence analysis tools they will need to provide their research with the latest methods and technologies available to them. Click here for additional information on Lasergene 7.2.

Tips and TricksAnne StoverTechnical Support Specialist

  • Adding large number of sequences in SeqMan Pro
  • Viewing partial translations in SeqBuilder
  • Viewing chromatograms in SeqMan Pro

If you have particular Tricks for using Lasergene that you believe would benefit your colleagues, let us know so we can share them.

Adding large number of sequences in SeqManPro

Is it taking you a long time to enter large numbers of sequences into SeqMan Pro?

Many factors influence the speed of entering sequences to be assembled, including your processor speed, and the amount of memory that is available on your computer.

However, one trick to help speed things up is to drag and drop your sequences onto the Unassembled Sequences window, rather than clicking on the button, or going through the File menu.

Simply select your sequences, or file containing the sequences, you want to assemble from wherever it is saved on your computer. Then, drag and drop the file(s) onto an empty Unassembled Sequences window in SeqMan Pro. Depending on the size and type of your file(s), doing this can exponentially increase the speed with which your sequences are added!

Viewing partial translations in SeqBuilder
Click to enlarge

Do you want to view the translation of a feature without displaying the translation for the entire sequence?

SeqBuilder allows you to do a partial translation for any portion of your sequence.

To do this:

1. Select your feature, or any portion of the sequence you want to translate.

2. Expand the Partial Translation folder found in the curtain on the left of your view.

3. Select New Translation. You’ll be prompted to name your new translation before it’s displayed underneath your sequence.

Viewing chromatograms in SeqMan Pro

After completing your assembly in SeqMan Pro, do you want to view your chromatograms?

Rather than opening each chromatogram individually by clicking on the triangle next to the sequence name, use one of the following shortcuts to open all of the traces in your contig at one time:

  • On Windows, hold down the Alt key and click the triangle next to any of the sequence names in your assembly.
  • On Macintosh, hold down the Option key and click the triangle next to any of the sequence names in your assembly.
Lasergene Module PreviewMegAlign

MegAlign offers researchers a choice of four pairwise and four multiple sequence alignment methods for aligning nucleic acid or polypeptide sefquences. Users can enter their own sequences or load public data directly from NCBI. To find more related sequences for alignment, a BLAST query can be run or the Entrez text query interface can be utilized, followed by dropping in the sequences wanted from the list of matches.

Click to enlarge

Alignments views can be customized to highlight the similarities or differences of the sequences. Differences in chemical, structural or functional characteristics between sequences, researchers groupings or consensi can be displayed. researchers can construct phylogenetic trees, generate detailed numerical reports or export data of sequence comparisons. MegAlign provides valuable tools permitting comparisons of gene families or sequence pairs that can be customized for presentations and publications.

MegAlign Features

  • Sequence Entry
    • Import data from many popular file formats
    • Read sequences and features from other Lasergene project files
    • Edit template sequence and share newly created features through integration with SeqBuilder and other Lasergene modules
    • Download sequences from NCBI* using accession number, BLAST or text search

  • Alignments and Analyses
    • Align DNA, protein and DNA + protein sequences
    • Perform multiple sequence alignments using:
      • Multiple Sequence Alignments:
        Method Advantages Disadvantages Applications
        ClustalV
        1. very fast
        2. may handle end-gaps better than ClustalW
        1. alignment not as accurate as ClustalW – slow, accurate
        1. aligning many DNA and protein sequences irrespective of relationships
        2. creating preliminary alignments to optimize subalignments for ClustalW refinement
        ClustalW –
        fast approximate
        1. fast
        1. might not be as robust as ClustalW –
          slow, accurate
        1. same as ClustalV
        ClustalW –
        slow,
        accurate
        1. most accurate method for typical alignments
        1. might not handle end-gaps optimally
        2. slow
        1. same as ClustalV
        2. popular method for refining alignments for phylogenetic analysis
        Jotun-Hein
        1. alternate to Clustal
        2. facilitates tests of robustness of predicted phylogenies
        3. fast
        1. not recommended when sequence relationships are uncertain
        2. lower capacity than Clustal
        3. not intended for DNA alignments
        1. aligning modest numbers of closely related, short to moderate length proteins
      • Pairwise Alignments:
        Method Advantages Disadvantages Applications
        Wilbur–Lipman
        1. fast
        1. not as accurate as Martinez–Needleman–Wunsch
        2. may be better if there are many long sequences
        1. DNA alignments
        Martinez–Needleman–Wunsch
        1. more accurate than Wilbur–Lipman
        1. slow
        1. DNA alignments
        Lipman–Pearson    
        1. protein alignments
        DotPlot method
        1. apply to DNA and protein alignments
         
        1. DNA alignments
        2. protein alignments
    • Create sub-alignments from selected ranges of longer alignments
    • Reconstruct phylogeny
    • Calculate sequence similarity and distance
    • Edit an adjust final alignment manually if desired
    • Export data and alignment into popular formats

  • Graphical Displays and Tools
    • Customize alignment displays
    • Highlight matches or mismatches to the consensus or other sequence with distinct colors or shading
    • Define the consensus by residue or by chemical, structural or charge characteristics
    • Create a custom consensus based on researcher’s own residue classification scheme
    • Display consensus strength as color-coded histograms
    • Copy selected sections of alignment report and paste into other applications

    Click to enlarge
  • Bootstrapping
    • Bootstrapping is a process that compares phylogenetic trees generated from similar alignments and counts the number of times a specific branching pattern occurs. Those counts appear on the branches expressed in percentage terms. The branch with higher values are statistically more alike.
    • Bootstrapping is used to test for consistency and robustness of a tree created using a Clustal method.
    • Bootstrapping can provide a measure of the robustness of a feature of the data dependent upon the chosen criterion for comparison.
Featured ProductLasergene

Click to enlarge
Lasergene is a leading software tool used by molecular biologists in a wide range of sequence assembly and analysis applications. Through its cross-application design, users can easily share much of their data and synchronize updates to quickly be able to observe the results of their experimentation.

Lasergene is a suite of 7 different modules, each of which is provides users with unique sequence analysis capabilities. Researchers are able to search and retrieve sequences using integrated BLAST and Entrez tools.



SeqMan Pro, is the module that permits assembly of sequences in contigs. The module has undergone several expansions with the introduction of Lasergene 7.2. Several improvements have been made to this module that expand the types of projects researchers can efficiently perform. Key features include:
  • Assemble E. coli size and larger genomes — Enhancements to the SeqMan Pro assembler now permit users to assemble genomes equivalent or larger than E. coliwith a modest desktop computer.
  • Assemble large Expressed Sequence Tag (EST) and Single Nucleotide Polymorphism (SNP) data sets and other projects requiring extensive depth of coverage analysis. SeqMan Pro has been improved to more accurately perform large depth of field coverage EST and SNP projects.
  • Read 454 Life SciencesTM output files (*.sff, *.fna, .fas and *.qual types) Pyrosequencing techniques offered by 454 Life Sciences offer the most established applications in this area. Researchers can import file formats directly from 454®, use along with Sanger sequencing methods and take advantage of the analysis tools of Lasergene.
  • Maximum Expected Coverage — The Maximum Expected Coverage is a user defined depth of coverage expected in the assembly. Once sequences are assembled, areas exceeding the value are designated by a thick red line in the Strategy view.
  • View Flowgram diagrams for 454® assembly visualizations — Flowgram diagrams using *.sff output can be generated (below). Base calls and Quality scores for each call are shown within the histogram.
  • Increase in speed of the module in performing small and large assemblies. Lasergene 7.2 allows users to assemble large bacterial genomes using 454 Life Sciences pyrosequencing methods. Flowgrams are available to view the assembly and Quality Scores can also be shown in the histogram. The newest module, is the tool that permits assembly of sequences into contigs from a wide range of project sizes. Editing can be performed to eliminate poor quality data and contaminating data to give better final results. SeqMan Pro uses DNASTAR’s unique Trace Quality Evaluation system for generating consensus sequence calls. Lasergene version 7.1 is the first that allows users to work with both traditional Sanger data and data generated by next-generation technologies such as 454®. Assemblies exceeding 600,000 in size have been successfully performed with SeqMan Pro using 454® data.

MegAlign, is used primarily to generate pairwise and multiple sequence alignments of DNA and/or protein. A wide range of analytical algorithms are available for these alignments, giving you the flexibility needed in your work. Additional information can be obtained by going to the DNASTAR website and looking at the Lasergene Information Sheet.

SeqBuilder is the module that enables users to edit nucleic acid and amino acid sequences and to view the sequences in seven different ways, including linear and circular maps. Because SeqBuilder is integrated into most other Lasergene modules, edits are shared automatically with other applications.

GeneQuest helps users in identifying and annotating coding regions and other features of interest in your DNA sequence. It aids in the location of genes and regulatory elements. Once methods have been applied results can be displayed graphically.

PrimerSelect is the module used to design primers and probes for PCR, sequencing, transcriptions and hybridization experiments. You can analyze oligos from either those resulting from your input parameters or have a sorted list of possible primers created for DNA, RNA or back-translated protein templates. Users can manipulate parameters, edit primers preview the primer edit’s effects on sites and coding note conditions for ideal annealing conditions.

Protean is the protein structure prediction module of Lasergene. It can be used to predict and display patterns, secondary structural characteristics and physiochemical properties of proteins using a comprehensive library of analytical methods.

EditSeq is provided with every Lasergene system to enable you to work on nucleic acid and protein sequences of all sizes from a variety of formats, including GenBank, FASTA, MacVector, GCG®, Text, ABI®, and data from the clipboard. You can even access NCBI's databases by accession number or utilizing the integrated internet interface to search BLAST and Entrez text databases.

Customer ProgramFree Educational Software

DNASTAR has been a pioneer in working with large and small academic institutions alike to bring state-of-the-art sequence analysis to the classroom. Over the last decade, hundreds of instructors from around the world have made Lasergene part of their standard curriculum by taking advantage of DNASTAR's free educational software offer. This program is designed to assist small programs that focus primarily on the training of students verses research to utilize advanced sequence analysis tools.

Depending on the institution’s needs, DNASTAR's educational system offer is available for one semester at a time. It provides the laboratory with a full suite of Lasergene that can be used with either Windows or Mac platforms. It is a fully functional, full Lasergene suite including all seven modules of the latest version of the software.

If you believe that your institution may qualify, click Free Educational Software, to be directed to the form on our website that will gather your contact information, course information, and the software products you plan to install to assist you in teaching the basics of DNA and protein sequence assembly and analysis to students in your class.

A DNASTAR representative will contact you to provide specific details and determine your eligibility. In addition, before DNASTAR ships the software, the department chair will need to sign a license agreement for the product you plan to use in your class — Lasergene. Please contact DNASTAR if you have additional questions.

Copyright © 2006–2007, DNASTAR Inc.
3801 Regent Street, Madison, WI, 53705, U. S. A.
Phone: +1-608-258-7420     Toll Free: +1-866-511-5090     Fax: +1-608-258-7439
http://www.dnastar.com
To unsubscribe simply click HERE and your name will be removed from future DNASTAR Newsletter mailings.