How to Create the Best Phylogenetic Tree for Your Data Using MegAlign Pro

December 11, 2020 Best Practices, Molecular Biology, Workflows

Brian Walsh started his career as an instructor at the University of Wisconsin. Since joining DNASTAR in 2016, Brian has been active in customer support, technical support, and software development.  In 2019 Brian became the Scientific Lead for the MegAlign Pro application and has been working closely with the software developers on that team.

Introduction

In my early career as a phylogeneticist, I used whichever software was available in our lab. For the first few years, this was fine. But as my projects grew to include hundreds of samples, the software became increasingly cumbersome and unresponsive. In addition, I needed eight or more applications, many of them preposterously complex, to get from sequence data to a publication-ready phylogenetic tree.

Brian Walsh Ph.D.
DNASTAR Sales Manager &
Scientific Lead for MegAlign Pro

These experiences fueled my vision for MegAlign Pro as an easy-to-use application for any researcher interested in constructing phylogenetic trees and exploring evolutionary relationships between organisms. MegAlign Pro’s graphic-rich interface allows you to load a variety of taxa (sequences), perform a multiple sequence alignment, then build a publication-quality phylogenetic tree, often in just a few minutes from start to finish and all within a single application.

In Part A of this post, I’ll describe how to use MegAlign Pro to create a phylogenetic tree. Though MegAlign Pro is very easy to use, complex computational algorithms and parameters are involved behind the scenes in calculating the most accurate phylogenetic tree. These calculations determine things like which taxa are placed in a particular clade and the lengths and positions of tree branches.

In Part B, I’ll show you how to compare different versions of the tree using different algorithms and settings. In Part C, I’ll describe symptoms that indicate that the sequence data has some issues, and how to fix those issues.

Part A: Constructing phylogenetic trees in MegAlign Pro

There are only four steps needed to create and view a phylogenetic tree in MegAlign Pro.

Step 1: Add sequences

Launch MegAlign Pro and use the Add sequences to project tool (green plus sign with “ACG”) to add two or more related taxa (sequences). The sequences must all be of the same type: DNA, RNA or protein.

Step 2: Choose a multiple alignment method and perform the alignment

Specify the multiple alignment method and begin the alignment by pressing the Align tool (green button with white triangle) and choosing the desired method, or by using the Align > (Re)Align Using (Method Name) menu command. Method choices include Clustal Omega, ClustalW, MAFFT and MUSCLE. If nucleotide sequences are loaded, Mauve is also provided as an option.

To get the most accurate tree for your data set, please refer to our recent blog post, Two ways to find the best MegAlign Pro multiple sequence alignment method for your data.

Step 3: Build the tree

Once the alignment has finished, click on the Tree tab. Press the button for the desired tree calculation algorithm: Neighbor joining: BIONJ or Maximum likelihood (RAxML).
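For the curious, here is a minimal Python sketch of what a distance-based tree builder does behind that button. It is plain neighbor joining (Saitou and Nei), not the BIONJ variant, and it is not MegAlign Pro's code. The function name `neighbor_joining`, the `nodeN` labels for internal nodes, and the toy distance matrix are all made up for illustration.

```python
from itertools import combinations


def neighbor_joining(names, dist):
    """Build an unrooted tree from pairwise distances.

    names: list of taxon labels.
    dist:  dict of dicts, dist[a][b] = distance between a and b.
    Returns a list of (node, neighbor, branch_length) edges.
    """
    if len(names) < 2:
        raise ValueError("need at least two taxa")
    d = {a: dict(dist[a]) for a in names}  # working copy
    nodes = list(names)
    edges = []
    next_id = 0
    while len(nodes) > 2:
        n = len(nodes)
        # Sum of distances from each node to all the others.
        total = {a: sum(d[a][b] for b in nodes if b != a) for a in nodes}
        # Pick the pair that minimizes the neighbor-joining Q criterion.
        i, j = min(
            combinations(nodes, 2),
            key=lambda p: (n - 2) * d[p[0]][p[1]] - total[p[0]] - total[p[1]],
        )
        new = f"node{next_id}"
        next_id += 1
        # Branch lengths from the joined pair to the new internal node.
        len_i = d[i][j] / 2 + (total[i] - total[j]) / (2 * (n - 2))
        len_j = d[i][j] - len_i
        edges.append((i, new, len_i))
        edges.append((j, new, len_j))
        # Distances from the new node to every remaining node.
        d[new] = {}
        for k in nodes:
            if k not in (i, j):
                d[new][k] = d[k][new] = (d[i][k] + d[j][k] - d[i][j]) / 2
        nodes = [k for k in nodes if k not in (i, j)] + [new]
    a, b = nodes
    edges.append((a, b, d[a][b]))  # connect the last two nodes
    return edges


# Toy five-taxon distance matrix (illustrative values only).
names = ["a", "b", "c", "d", "e"]
matrix = [
    [0, 5, 9, 9, 8],
    [5, 0, 10, 10, 9],
    [9, 10, 0, 8, 7],
    [9, 10, 8, 0, 3],
    [8, 9, 7, 3, 0],
]
dist = {a: {b: matrix[i][j] for j, b in enumerate(names)}
        for i, a in enumerate(names)}

for node, neighbor, length in neighbor_joining(names, dist):
    print(f"{node} - {neighbor}: {length}")
```

On this toy matrix, taxa a and b are joined first, with branch lengths of 2.0 and 3.0 to their shared internal node. The point is only to show where clade membership and branch lengths come from. Real tools add refinements that this sketch leaves out.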

Which method should you choose when constructing phylogenetic trees?

  • Choose Maximum likelihood (RAxML) if your data set is divergent, if it contains many taxa and/or very long sequences, or if you need support (bootstrap) values. To see support values, check the bootstrapping option in the parameters dialog. A bootstrap value of 50% or below is no better than a coin flip; most researchers use a 70% cutoff before trusting a result.
  • Choose Neighbor joining: BIONJ if your project includes fewer than four sequences, since Maximum likelihood (RAxML) requires at least four.
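The bootstrap rule of thumb above can be written down as a tiny Python helper. This is only a sketch of the 50% and 70% thresholds mentioned in this post. The function name, the `trust_cutoff` parameter, and the wording of the three labels are my own assumptions, not anything MegAlign Pro reports.

```python
def interpret_bootstrap(support_percent, trust_cutoff=70.0):
    """Classify a bootstrap support value (0-100) using the rule of thumb above."""
    if not 0 <= support_percent <= 100:
        raise ValueError("support must be a percentage between 0 and 100")
    if support_percent <= 50:
        return "coin flip"
    if support_percent < trust_cutoff:
        return "below the usual cutoff"
    return "generally trusted"


# Hypothetical support values for three clades.
for clade, support in {"clade 1": 98, "clade 2": 64, "clade 3": 41}.items():
    print(clade, support, interpret_bootstrap(support))
```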

Each of these algorithms has parameters that can be customized prior to creating the tree.

Step 4: View the tree and the distance table

In the Tree view, notice the numbers on each branch: these are “distance” values. (If you don’t see numbers on the branches, use the Branch label menu in the Tree section of the Style panel to choose Distance.) “Distance” is the expected number of substitutions per site (base) and indicates how far two samples have diverged: the smaller the distance, the more closely related they are.

To view a customizable Distance table where you can examine the maximum likelihood phylogeny, among other values, click on the Distance tab. Use the Distance section of the Style panel to specify which two types of data (e.g., distance, percent identity, percent similar, percent gaps, sequence lengths, etc.) you’d like to see on the Upper right and Lower left of the Distance table.

Part B: Recalculating the tree using different algorithms or settings

Most of the time, I find that the “default” MegAlign Pro tree reflects the expected relationships between the taxa in my project. However, that doesn’t mean I don’t want to try out different settings to see how they affect that tree. Luckily for those of us with curious minds, MegAlign Pro makes it super easy to change settings and recalculate the tree as many times as we want.

Before I describe ways to recalculate a tree, I want to mention a very easy way to save and compare all the versions of your tree side-by-side: the Take snapshot tool, located in the upper right of the Tree view.

Each time you change a parameter used to calculate the tree, MegAlign Pro will update the tree automatically and include a list of current settings in the tree header. To preserve a copy of each tree, along with the settings used to calculate it, use the Take snapshot tool.

And now for the customization options:

Option 1: Trim the “ragged ends” from your sequences

Do you have one sequence that sticks way out on the 3’ and/or 5’ end compared to other sequences? If so, you can trim off its ragged end(s) to get a more accurate alignment and tree.
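To make the idea concrete, here is a small Python sketch that trims ragged ends from sequences that are already aligned (gaps written as "-"). MegAlign Pro does its trimming inside the application; this only illustrates the principle. The function name and the `min_coverage` parameter are hypothetical: a column is kept once at least that fraction of sequences has a residue in it.

```python
def trim_ragged_ends(alignment, min_coverage=0.5):
    """Trim leading and trailing columns covered by too few sequences.

    alignment: dict mapping name -> aligned sequence (equal lengths, '-' = gap).
    """
    seqs = list(alignment.values())
    length = len(seqs[0])
    if any(len(s) != length for s in seqs):
        raise ValueError("all aligned sequences must have the same length")
    # For each column, is it covered by enough sequences?
    covered = [
        sum(s[i] != "-" for s in seqs) / len(seqs) >= min_coverage
        for i in range(length)
    ]
    if True not in covered:
        raise ValueError("no column meets the coverage threshold")
    start = covered.index(True)                # first well-covered column
    end = length - covered[::-1].index(True)   # one past the last one
    return {name: s[start:end] for name, s in alignment.items()}


aligned = {
    "long":   "TTTTACGTACGTGGG",
    "short1": "----ACGTACGT---",
    "short2": "----ACGAACGT---",
}
for name, seq in trim_ragged_ends(aligned).items():
    print(name, seq)
```

Here the four extra bases at the start of "long" and the three at its end are removed, so every sequence covers the same eight columns.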

Option 2: Choose a different multiple alignment method

To choose a different alignment method, use the Align tool (green button with white triangle) or the corresponding Align > (Re)Align Using (Method Name) menu command. This will cause a new alignment to take place from scratch.

Option 3: Change the distance metric and/or gap treatment method

During the multiple alignment, distances are calculated using the Metric and Gap treatment specified in the Choose Distance Parameters dialog. You can open this dialog and select new options at any time using Distance > Parameters.

  • Choices for Metric are Uncorrected Pairwise Distance or Tamura-Nei (1993). In most cases, the two metrics yield essentially the same results. However, it is still worth changing to see how it affects your tree.
  • Choices for Gap treatment are Global gap removal or Pairwise gap removal. Choosing Global gap removal can ruin the accuracy of the alignment and leave behind very little data with which to build an accurate tree. Unless you have a particular reason for choosing it, we strongly recommend choosing Pairwise gap removal.
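To see why gap treatment matters, here is a minimal Python sketch of an uncorrected pairwise distance computed under both treatments. It assumes the usual textbook definitions (pairwise: skip columns where either of the two sequences has a gap; global: first drop every column where any sequence has a gap). MegAlign Pro's implementation may differ in detail, and all function names here are made up for illustration.

```python
from itertools import combinations

GAP = "-"


def p_distance(a, b):
    """Uncorrected distance: mismatches / compared sites, skipping gapped sites."""
    pairs = [(x, y) for x, y in zip(a, b) if x != GAP and y != GAP]
    if not pairs:
        raise ValueError("no ungapped columns to compare")
    return sum(x != y for x, y in pairs) / len(pairs)


def drop_gapped_columns(alignment):
    """Global gap removal: keep only columns where no sequence has a gap."""
    keep = [col for col in zip(*alignment.values()) if GAP not in col]
    return {name: "".join(col[i] for col in keep)
            for i, name in enumerate(alignment)}


def distance_table(alignment, gap_treatment="pairwise"):
    """Return {(name1, name2): distance} for every pair of sequences."""
    data = drop_gapped_columns(alignment) if gap_treatment == "global" else alignment
    return {(p, q): p_distance(data[p], data[q]) for p, q in combinations(data, 2)}


aligned = {
    "A": "ACGTACGTAC",
    "B": "ACGTACGAAC",
    "C": "--GTACGTA-",
}
print(distance_table(aligned, "pairwise"))
print(distance_table(aligned, "global"))
```

In this toy alignment, sequence C has gaps in three columns. With pairwise gap removal, A and B are still compared over all ten columns. With global gap removal, those three columns are discarded for every pair, so A and B are compared over only seven. With many sequences and scattered gaps, the surviving columns can dwindle quickly.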

Option 4: Change the tree building algorithm and/or parameters

As discussed in the first part of this post, there are two tree-building algorithms available in MegAlign Pro. To rebuild the tree using a different algorithm, or the same algorithm with different settings, choose Tree > Compute Phylogeny Using > (Method Name).

Part C: Spotting problems with the sequence data

Does one of your trees look… “strange”? In the vast majority of cases, an oddball tree is the result of issues with sequence data rather than with the algorithms or parameters used to calculate the tree.

The lists below show, in no particular order, some of the symptoms, causes and solutions for issues related to sequence data.

Symptoms

  • Clades in the Tree view appear incorrect (e.g., a mammal appearing in a fish clade).
  • A branch in the Tree view is much longer (e.g., 20 times longer) than any other branch.
  • %Identity values in the Distance table are below 50%, indicating that the evolutionary relationship between the sequences is essentially random.

Likely causes

  • A sequence has been mislabeled, or is unrelated to or highly divergent from the other sequences.
  • The alignment contains sequences in opposing orientations.

Possible solutions

  • Remove unrelated or highly divergent sequences and realign the remaining sequences.
  • Reverse-complement the sequences that need it and perform a new multiple sequence alignment.
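If you want to check orientation outside the application, the Python sketch below reverse-complements a DNA sequence and uses a rough k-mer heuristic to guess whether a sequence is in the opposite orientation to a reference. The heuristic, the function names, and the default `k` are my own assumptions for illustration, not a MegAlign Pro feature. The sketch also handles only the plain bases A, C, G and T.

```python
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.translate(COMPLEMENT)[::-1]


def kmers(seq, k):
    """Set of all substrings of length k."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def needs_reverse_complement(seq, reference, k=4):
    """Guess orientation: does the reverse complement share more k-mers
    with the reference than the sequence as given?"""
    ref = kmers(reference.upper(), k)
    forward = len(kmers(seq.upper(), k) & ref)
    reverse = len(kmers(reverse_complement(seq).upper(), k) & ref)
    return reverse > forward


reference = "ATGGCCATTGTAATGGGCCGCTGAAAGGGTGCCCGATAG"
flipped = reverse_complement(reference)
print(needs_reverse_complement(flipped, reference))    # flagged for flipping
print(needs_reverse_complement(reference, reference))  # left as is
```

A sequence flagged this way is a candidate for reverse-complementing before you realign. For short or very divergent sequences the k-mer counts can be too close to call, so treat the result as a hint rather than a verdict.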

For further information on phylogenetic analysis in MegAlign Pro, please be sure to visit our Phylogenetic Analysis workflow page. This page has links to all the relevant training videos, written tutorials, the MegAlign Pro User Guide, and much more. From this page, you can also request a 14-day free trial of MegAlign Pro.
