<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>DNASTAR &#187; Clinical Research</title>
	<atom:link href="http://www.dnastar.com/blog/category/clinical-research/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.dnastar.com/blog</link>
	<description>Blog for Life Scientists</description>
	<lastBuildDate>Wed, 05 Jul 2017 11:37:18 +0000</lastBuildDate>
	<language>en-US</language>
		<sy:updatePeriod>hourly</sy:updatePeriod>
		<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=4.0.5</generator>
	<item>
		<title>Rapid, Large-Scale Prioritizing of Human Variants with Lasergene Genomics Suite</title>
		<link>http://www.dnastar.com/blog/clinical-research/rapid-large-scale-prioritizing-of-human-variants-with-lasergene-genomics-suite/</link>
		<comments>http://www.dnastar.com/blog/clinical-research/rapid-large-scale-prioritizing-of-human-variants-with-lasergene-genomics-suite/#comments</comments>
		<pubDate>Wed, 07 Sep 2016 16:24:22 +0000</pubDate>
		<dc:creator><![CDATA[Katie Maxfield]]></dc:creator>
				<category><![CDATA[Clinical Research]]></category>
		<category><![CDATA[Next-Gen Sequencing]]></category>

		<guid isPermaLink="false">http://www.dnastar.com/blog/?p=1832</guid>
		<description><![CDATA[Lasergene Genomics Suite now includes access to the Variant Annotation Database (VAD) for human sequencing data. I recently spoke with DNASTAR Scientist, Dr. Tim Durfee about the VAD to get a better understanding of how the tool works and how &#8230; <a href="http://www.dnastar.com/blog/clinical-research/rapid-large-scale-prioritizing-of-human-variants-with-lasergene-genomics-suite/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p><img class="alignright wp-image-1824 size-full" src="http://www.dnastar.com/blog/wp-content/uploads/2016/08/human-molecules.png" alt="human-molecules" width="219" height="400" /></p>
<p><em>Lasergene Genomics Suite now includes access to the Variant Annotation Database (VAD) for human sequencing data. I recently spoke with DNASTAR Scientist, Dr. Tim Durfee about the VAD to get a better understanding of how the tool works and how it can help genomics and clinical researchers with their variant analysis.</em></p>
<p>&nbsp;</p>
<p><strong>Can you describe what the Variant Annotation Database is?</strong></p>
<p>The VAD is a database resource that contains information on individual positions and alleles across the human genome. It is currently human genome specific. The major purpose of the VAD is to allow rapid prioritizing and ranking of the large number of variants found in any given sample relative to the reference genome. This can be on the order of thousands of variants for gene panels; tens of thousands for exomes; and millions for whole genomes. This kind of large-scale analysis is critical for the clinical sequencing market.</p>
<p>&nbsp;</p>
<p><strong>How can users access the information in the VAD?</strong></p>
<p>Annotation information for each called variant in a specific sample is automatically retrieved from the VAD during project setup in ArrayStar. With the upcoming Lasergene 14.0 release, it will be added to the project directly following assembly and variant calling. The data is accessible in the ArrayStar SNP table and can be used to filter and create gene and SNP sets. For examples on how this can be done, take a look at our <a title="VAD Tutorial" href="http://www.dnastar.com/arraystar_tutorials/#!Documents/tutorial4variantcomparisonusingthevariantannotationdatabase.htm" target="_blank">tutorial</a>.</p>
<p>&nbsp;</p>
<p><strong>What is the source of the annotations in the VAD?</strong></p>
<p>The data is from two major sources: the 1000 Genomes Project and dbNSFP (Database of Human Nonsynonymous SNPs and their Functional Predictions). As the name suggests, the dbNSFP data is on protein encoding positions in the genome. The data are organized into five broad categories:</p>
<p>&nbsp;</p>
<ol>
<li><strong>Allele and genotype frequencies</strong> from the 1000 Genomes phase 3 data as well as from NHLBI’s Exome Sequencing Project. The 1000 Genomes data is available as global frequencies as well as frequencies for 26 populations grouped into 5 super populations. This data is extremely useful for filtering. For example, if you’re studying a rare disease that only occurs in a small number of individuals, you wouldn’t expect a relevant SNP to occur at high frequency in the population – typically, you filter for variants that occur less than 5% or even less than 1% in the population.</li>
</ol>
<p>&nbsp;</p>
<ol start="2">
<li><strong>Functional impact prediction</strong> <strong>methods</strong>: LRT, MutationTaster, PolyPhen-2 (two models) and SIFT. The four methods use different strategies to predict whether a given non-synonymous change is deleterious to the function of the encoded protein.</li>
</ol>
<p>&nbsp;</p>
<ol start="3">
<li><strong>Evolutionary conservation scoring systems</strong>: GERP++, SiPhy, PhyloP and PhastCons. These methods use sequence alignments of the human genome with the corresponding regions of other organisms to produce scores of how conserved each particular base is across evolution. In coding regions, the more evolutionarily conserved the particular base is, the more likely having that base in that position is important for the function of the encoded protein. Some methods (e.g. GERP++) can also be used to assess the importance of bases outside the coding regions.</li>
</ol>
<p>&nbsp;</p>
<ol start="4">
<li><strong><strong>Pathogenicity information from ClinVar: </strong></strong>ClinVar is a central repository hosted by NCBI that catalogs and reviews human variation and its connection to disease.  The VAD uses the clinical significance field to allow filtering on different classifications including Benign and Pathogenic.</li>
</ol>
<p>&nbsp;</p>
<ol start="5">
<li><strong>Miscellaneous information</strong>: The VAD also contains other types of information such as links to dbSNP Uniprot and Interpro that allow the user to easily retrieve additional data from those resources.</li>
</ol>
<p>&nbsp;</p>
<p><strong>What are the advantages to using the VAD over a user’s own database or VCF file?</strong></p>
<p>If a user has huge VCF files with the annotations, they would have to manually go through each position and retrieve the relevant information for that allele. With the VAD, all the annotations are automatically retrieved and readily available for filtering. The VCF is more useful as a record file of all the variants and their annotations that can be shared between applications.  For example, a VCF of alleles of interest produced by ArrayStar can be used by SeqMan NGen in subsequent assemblies to report on those positions.</p>
<p>&nbsp;</p>
<p><strong>How does this compare to other tools on the market today?</strong></p>
<p>The major advantage of DNASTAR’s Variant Annotation Database is the seamless connection with the assembly and variant caller. With open source software, you have to first run the assembly, do the variant calling with a separate program, and then use yet another tool to add the annotation information. There is often a steep <a title="open source tools" href="http://www.dnastar.com/blog/next-gen-sequencing/is-open-source-ngs-software-for-you/">learning curve</a> with each of these tools, which can make the overall process laborious. The DNASTAR pipeline integrates all these steps into one suite and allows for multiple sample comparison and filtering. Additionally, we provide <a title="Assembly Accuracy" href="http://www.dnastar.com/t-ngs-assembly-accuracy.aspx">the most accurate assembly and variant calling</a>.</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
<p><em>Want to learn more? Check out our <a href="http://www.dnastar.com/t-sub-solutions-genome-solutions-variant-analysis.aspx">variant analysis workflow</a> page to see videos and benchmarks on NGS assembly and variant analysis in Lasergene Genomics Suite.</em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dnastar.com/blog/clinical-research/rapid-large-scale-prioritizing-of-human-variants-with-lasergene-genomics-suite/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>DNASTAR LabViews: Blaire Bacher of Orion Genomics</title>
		<link>http://www.dnastar.com/blog/clinical-research/dnastar-labviews-blaire-bacher-of-orion-genomics/</link>
		<comments>http://www.dnastar.com/blog/clinical-research/dnastar-labviews-blaire-bacher-of-orion-genomics/#comments</comments>
		<pubDate>Tue, 21 Jun 2016 18:48:46 +0000</pubDate>
		<dc:creator><![CDATA[Katie Maxfield]]></dc:creator>
				<category><![CDATA[Clinical Research]]></category>
		<category><![CDATA[DNASTAR LabViews]]></category>
		<category><![CDATA[Molecular Biology]]></category>

		<guid isPermaLink="false">http://www.dnastar.com/blog/?p=1692</guid>
		<description><![CDATA[In this installment of DNASTAR LabViews, we talk with Blaire Bacher of Orion Genomics about her work sequencing the palm genome and developing various clinical tests. Hear more about this exciting work and how DNASTAR software is helping advance this research in our &#8230; <a href="http://www.dnastar.com/blog/clinical-research/dnastar-labviews-blaire-bacher-of-orion-genomics/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p>In this installment of <strong>DNASTAR LabViews</strong>, we talk with Blaire Bacher of Orion Genomics about her work sequencing the palm genome and developing various clinical tests. Hear more about this exciting work and how DNASTAR software is helping advance this research in our interview below, then check out our <a title="Customer Interview" href="http://www.dnastar.com/blog/category/customer-interviews/">full collection of customer interviews here</a>!</p>
<p>&nbsp;</p>
<p><iframe src="https://www.youtube.com/embed/B1GrflpBSl4" width="560" height="315" frameborder="0" allowfullscreen="allowfullscreen"></iframe></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dnastar.com/blog/clinical-research/dnastar-labviews-blaire-bacher-of-orion-genomics/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Dr. Michael Pauly of Mapp Biopharmaceutical on Ebola and ZMapp</title>
		<link>http://www.dnastar.com/blog/clinical-research/qa-with-dr-michael-pauly-of-mapp-biopharmaceutical-on-ebola/</link>
		<comments>http://www.dnastar.com/blog/clinical-research/qa-with-dr-michael-pauly-of-mapp-biopharmaceutical-on-ebola/#comments</comments>
		<pubDate>Wed, 07 Oct 2015 13:49:02 +0000</pubDate>
		<dc:creator><![CDATA[Ellie Thomas]]></dc:creator>
				<category><![CDATA[Clinical Research]]></category>
		<category><![CDATA[DNASTAR LabViews]]></category>

		<guid isPermaLink="false">http://www.dnastar.com/blog/?p=1003</guid>
		<description><![CDATA[The Ebola epidemic in West Africa has proven to be a devastating reminder that humans as a race are not immortal- we are not exempt from catastrophic biological diseases capable of wiping out entire villages. &#160; In 2014 Kent Brantly &#8230; <a href="http://www.dnastar.com/blog/clinical-research/qa-with-dr-michael-pauly-of-mapp-biopharmaceutical-on-ebola/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<div id="attachment_1006" style="width: 266px" class="wp-caption alignright"><a href="http://www.dnastar.com/blog/wp-content/uploads/2015/10/Ebola_2.jpg"><img class="wp-image-1006" src="http://www.dnastar.com/blog/wp-content/uploads/2015/10/Ebola_2.jpg" alt="Ebola_2" width="256" height="255" /></a><p class="wp-caption-text">Microscopic view of the Ebola virus</p></div>
<p><em>The Ebola epidemic in West Africa has proven to be a devastating reminder that humans as a race are not immortal- we are not exempt from catastrophic biological diseases capable of wiping out entire villages.</em></p>
<p>&nbsp;</p>
<p><em>In 2014 Kent Brantly and Nancy Writebol were helping patients in Liberia when the two Americans were infected with Ebola. That is where Dr. Michael Pauly and Mapp Biopharmaceutical, Inc. stepped in with ZMapp™. ZMapp™ is an experimental drug for treating Ebola, which at the time, had only been tested on animals. With no better alternative, Brantly and Writebol were given the drug cocktail, comprised of three antibodies. Both patients recovered fully after the drug was given. Today, ZMapp™ is in clinical trials in West Africa.</em></p>
<p>&nbsp;</p>
<p><em>I sat down with Dr. Michael Pauly to learn more about this experimental drug, his company’s response to the Ebola epidemic, and his experience using DNASTAR software.</em></p>
<p>&nbsp;</p>
<p><strong>Tell us a bit about your recent work with ZMapp™</strong><strong>:</strong></p>
<p>A series of events came together, in terms of our development and the current outbreak. We joined forces with a group in Canada screening combinations of antibodies and generated some good animal data.  One of the antibody combinations appeared to be unusually efficacious in animal trialing. It was quite an unusual event that all happened about a year ago, coincident with the increase of this outbreak.</p>
<p>&nbsp;</p>
<p><strong>How exactly does ZMapp™</strong><strong> work to fight the virus?</strong></p>
<p>A lot of information is still being found out. The three antibodies in ZMapp™ interact directly with the virus. The degree to which they interact with different epitopes or proteins on the surface of that virus, and exactly how they interact, whether they are directly neutralizing, or whether they enhance the secondary immune response that people have, is still unclear. Something we’ve seen in the primate data, and something so interesting about cocktails, is that by themselves, the individual antibodies don’t show anywhere near the kind of effectiveness that they do in combination. The whole idea of antibody cocktails has really been bolstered as a part of this result.</p>
<p>&nbsp;</p>
<p><img class="alignleft wp-image-1022" src="http://www.dnastar.com/blog/wp-content/uploads/2015/10/Screen-Shot-2015-10-06-at-3.17.24-PM.png" alt="Screen Shot 2015-10-06 at 3.17.24 PM" width="234" height="248" /></p>
<p><strong>What was it like to be involved with the Ebola crisis? It’s been very dramatic, as you said, and pretty devastating.</strong></p>
<p>Ultimately for most everybody, it’s just tremendously motivating. Nobody has any trouble working all day and all night when faced with those sorts of issues. You’re just trying to make an effective product and that part makes your work life and the rest of your life pretty simple. Your work becomes very personal and in that way, quite meaningful. It gives everybody an extra added boost to work hard and make something that works.</p>
<p>&nbsp;</p>
<p style="text-align: left"><strong>What are some of the unique challenges you face in your work? </strong></p>
<p>We are always trying to relate antibody structure to function, and we spend a lot of time trying to figure out what makes an antibody express well or perhaps not so well. Another set of challenges is related to speed and throughput. We’re screening a lot of antibodies and reagents, and anything we can do to increase that throughput and, in the case of DNASTAR software, anything we can do to more effectively model and screen a sequence in order to understand structure function is helpful.</p>
<p>&nbsp;</p>
<p><strong>How does DNASTAR software help you with those challenges?</strong></p>
<p>We use the DNA side of your software all the time. We use it daily for DNA analysis. We have our own particular database of all the sequences we’ve made, and almost all the protein sequence that we currently use is in DNASTAR- based format.</p>
<p>&nbsp;</p>
<p>For a long time scientists just made antibodies in cell lines without knowing what the sequences were; they just purified the antibodies and used them as reagents. There are still cell lines going back ten years or more that are making good antibodies, but people don’t know the sequences of them. We sequence cell lines that express antibodies we don’t know the sequence of, so we use degenerate primer pools to try to retrieve the antibodies that are being expressed by the cell line. We use SeqMan Pro to assemble traces, make contigs, and try to find what look to be antibody sequences that are being expressed by various cell lines.</p>
<p>&nbsp;</p>
<p>Another thing we have always appreciated is DNASTAR has good support. You can call somebody and get help easily.</p>
<p>&nbsp;</p>
<hr />
<p>&nbsp;</p>
<p><em>Would you like to see your organization featured on the DNASTAR Blog? Contact </em><a href="mailto:thomase@dnastar.com"><em>thomase@dnastar.com</em></a><em> to schedule an interview. We love learning about the important work of our customers!</em></p>
]]></content:encoded>
			<wfw:commentRss>http://www.dnastar.com/blog/clinical-research/qa-with-dr-michael-pauly-of-mapp-biopharmaceutical-on-ebola/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>How Accurate Is Your Variant Caller?</title>
		<link>http://www.dnastar.com/blog/clinical-research/how-accurate-is-your-variant-caller/</link>
		<comments>http://www.dnastar.com/blog/clinical-research/how-accurate-is-your-variant-caller/#comments</comments>
		<pubDate>Wed, 23 Sep 2015 15:00:19 +0000</pubDate>
		<dc:creator><![CDATA[Katie Maxfield]]></dc:creator>
				<category><![CDATA[Clinical Research]]></category>
		<category><![CDATA[Next-Gen Sequencing]]></category>

		<guid isPermaLink="false">http://www.dnastar.com/blog/?p=938</guid>
		<description><![CDATA[If you are using Next Generation Sequencing (NGS) for clinical research, cancer genomics, genome-wide association studies, or other genomic research, the ability to identify variants with confidence is of utmost importance. For these projects, you need software that can correctly &#8230; <a href="http://www.dnastar.com/blog/clinical-research/how-accurate-is-your-variant-caller/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p><img class="alignright wp-image-960" src="http://www.dnastar.com/blog/wp-content/uploads/2015/09/accuracy-target-300x288.png" alt="" width="260" height="250" />If you are using Next Generation Sequencing (NGS) for clinical research, cancer genomics, genome-wide association studies, or other genomic research, the ability to identify variants with confidence is of utmost importance. For these projects, you need software that can correctly identify the true variants, while minimizing false positives that could lead to wasted research effort.</p>
<p>&nbsp;</p>
<p>Most variant callers provide probability scores for each detected variant. However, these values alone do not tell you how reliable the overall results are. Just because a difference from the reference sequence is statistically significant doesn’t necessarily mean that it is an accurate call.</p>
<p>&nbsp;</p>
<h1><strong>How can you know if your results are accurate?</strong></h1>
<p>&nbsp;</p>
<p>In most studies, especially when looking for rare mutations, having a reliable reference set with known variations isn’t feasible. To test the accuracy of NGS alignment and variant calling in Lasergene Genomics Suite, we used SeqMan NGen to align whole human exome data from the Genome in a Bottle Consortium (GIAB) to the human genome. Because this is a well curated data set, we were able to compare the variant calls to the “answer” provided by GIAB. We also performed alignment and variant calling in several other software packages using the same data and comparable settings. We then looked at three metrics:</p>
<p>&nbsp;</p>
<ol>
<li><strong>Sensitivity</strong> – This is also known as the true positive rate, and is the ratio of correctly identified variants to the total known variants in the reference set. The higher the sensitivity, the greater the likelihood that a variant in the sample will be identified by the software.</li>
<li><strong>Specificity</strong> – Also known as the true negative rate, this is the ratio of non-variant calls to the total number of positions in the reference set that are known to be homozygous with the reference sequence. Specificity is inversely related to the number of false positives.</li>
<li><strong>False Discovery Rate (FDR)</strong> – This is the ratio of false positives to all variant calls made by the software. The FDR value for a variant caller allows you to understand how many variants in your project are likely to be false positives.</li>
</ol>
<p>Because an accurate alignment is a necessary precursor to accurate variant detection, these metrics also help you understand the alignment accuracy from various software pipelines.</p>
<p>&nbsp;</p>
<p>Here’s a peek at the results we obtained:</p>
<p>&nbsp;</p>
<p><img class="aligncenter wp-image-969 size-full" src="http://www.dnastar.com/blog/wp-content/uploads/2015/09/Table1.png" alt="" width="734" height="197" /></p>
<p>&nbsp;</p>
<p><strong>Check out our new <a href="http://www.dnastar.com/t-ngs-assembly-accuracy.aspx">Accuracy</a> page to see the full results of these comparisons across numerous exomes and other sample data sets and learn how Lasergene Genomics Suite stacks up against the competition!</strong></p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dnastar.com/blog/clinical-research/how-accurate-is-your-variant-caller/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Discovery of Gene Candidates from NGS Data Using A Researcher Friendly Pipeline and Filters</title>
		<link>http://www.dnastar.com/blog/clinical-research/discovery-of-gene-candidates-from-ngs-data-using-a-researcher-friendly-pipeline-and-filters/</link>
		<comments>http://www.dnastar.com/blog/clinical-research/discovery-of-gene-candidates-from-ngs-data-using-a-researcher-friendly-pipeline-and-filters/#comments</comments>
		<pubDate>Thu, 20 Nov 2014 22:25:31 +0000</pubDate>
		<dc:creator><![CDATA[Kerri Phillips]]></dc:creator>
				<category><![CDATA[Clinical Research]]></category>
		<category><![CDATA[Next-Gen Sequencing]]></category>

		<guid isPermaLink="false">http://www.dnastar.com/blog/?p=370</guid>
		<description><![CDATA[In the past decade, the ability to determine complex mechanisms underlying disease has been made easier by a variety of factors, especially the availability of large amounts of data. In fact, with the continuously decreasing costs of obtaining whole exome &#8230; <a href="http://www.dnastar.com/blog/clinical-research/discovery-of-gene-candidates-from-ngs-data-using-a-researcher-friendly-pipeline-and-filters/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p>In the past decade, the ability to determine complex mechanisms underlying disease has been made easier by a variety of factors, especially the availability of large amounts of data. In fact, with the continuously decreasing costs of obtaining whole exome DNA sequence data through next-generation sequencing (NGS) technologies, the challenge becomes less about the available data and more about the ability of researchers to tease out meaningful correlations.</p>
<p>&nbsp;</p>
<p>Today, researchers often find they must wait for the limited availability of biostatisticians and bioinformatics teams. However, with the right tools, it is hoped that the solutions to Mendelian and complex diseases will be discovered by investigators at their own desktop computers.</p>
<p>&nbsp;</p>
<p>DNASTAR combines the computational power required to prioritize relevant factors and visualize correlations in an easy-to-use, integrated software pipeline that puts the power of association studies into the hands of clinical researchers.</p>
<p>&nbsp;</p>
<p><strong>A Powerful Integrated Software Pipeline</strong></p>
<p>&nbsp;</p>
<p><a href="http://www.dnastar.com/blog/wp-content/uploads/2014/11/Figure1_Steps_twitter.png"><img class="alignright wp-image-373 size-medium" src="http://www.dnastar.com/blog/wp-content/uploads/2014/11/Figure1_Steps_twitter-300x300.png" alt="Figure1_Steps_twitter" width="300" height="300" /></a>The number and size of NGS data sets that are needed to conduct association studies can pose some challenges. First, the massive amount of raw data (typically 10-30 GB for a single exome and 300-400 GB for a whole genome) requires substantial computer resources for processing as well as for storage and management. Second, there is a series of computational tools required:</p>
<p>&nbsp;</p>
<ol>
<li>A large capacity NGS reference-guided assembler</li>
<li>A variation detection module</li>
<li>A variant annotation module</li>
<li>A visualization package for inspecting alignments and variant calls</li>
<li>An analytics module for comparing variants across samples including statistical analyses and discrete filtering</li>
</ol>
<p>&nbsp;</p>
<p>Stringing together and running the software tools needed to accomplish these tasks can be quite a hurdle. But, with the integrated <a href="http://www.dnastar.com/t-products-dnastar-lasergene-genomics.aspx">Lasergene Genomics Suite</a>, the flow of data is facilitated with an easy-to-use, intuitive, graphical interface. The suite consists of three programs: <a href="http://www.dnastar.com/t-nextgen-seqman-ngen.aspx">SeqMan NGen</a>, <a href="http://www.dnastar.com/t-seqmanpro.aspx">SeqMan Pro</a>, and <a href="http://www.dnastar.com/t-sub-products-genomics-arraystar.aspx">ArrayStar</a>.</p>
<p>&nbsp;</p>
<p><strong><a href="http://www.dnastar.com/blog/wp-content/uploads/2014/11/Kabuki-Filtering-Diagram.png"><img class="alignright wp-image-374 size-medium" src="http://www.dnastar.com/blog/wp-content/uploads/2014/11/Kabuki-Filtering-Diagram-291x300.png" alt="Kabuki Filtering Diagram" width="291" height="300" /></a>Case Study:  Kabuki Syndrome</strong></p>
<p>&nbsp;</p>
<p>As a demonstration of DNASTAR’s pipeline, a rare Mendelian disorder known as Kabuki syndrome was used. Exome data sets were obtained through dbGaP. These data sets were from the published Kabuki syndrome study (Ng <em>et. al.</em> Exome sequencing identifies MLL2 mutations as a cause of Kabuki syndrome. Nat. Genet. 42, 30-35 (2010)). The syndrome, which is caused by autosomal dominant mutations, is rare with approximately 400 cases reported worldwide.</p>
<p>&nbsp;</p>
<p>Ten case and eight control exome data sets were independently aligned to the human genome reference sequence using <a href="http://www.dnastar.com/t-nextgen-seqman-ngen.aspx">SeqMan NGen</a> which also identified and annotated variants. Variants from each assembly were then loaded together into <a href="http://www.dnastar.com/t-sub-products-genomics-arraystar.aspx">ArrayStar </a>resulting in over 5.7 million independent positions located in about 32,000 genes across all samples after coalescing. The samples were then organized into two groups, Kabuki and Control, to facilitate subsequent filtering.</p>
<p>&nbsp;</p>
<p>We first filtered at the variant level making three assumptions based on knowledge of the disease: 1) causal mutations would be non-synonymous changes, 2) causal mutations arose de novo so variants would occur in only one case sample and 3) no control sample would have any of the mutations. Stringent quality metric thresholds were also imposed to reduce noise. Over 11,000 variants in 6,352 genes met the criteria and were saved as a “SNP set.”</p>
<p>&nbsp;</p>
<p>This SNP set was then used as the variant pool in a second filtering step. This time to identify genes with mutations that met the following criteria: 1) mutations were inactivating (nonsense or frameshift), 2) they were rare and therefore not in dbSNP and 3) they were dominant and therefore occurred as heterozygotes. 845 genes met those criteria in at least one case sample. However, by increasing the level of detectance to 7 of 10 case samples the number of candidates was reduced to one, MLL2, consistent with the results of Ng <em>et. al</em>.</p>
<p>&nbsp;</p>
<p>The DNASTAR software makes this type of filtering easy for researchers with an intuitive filtering interface:</p>
<p><a href="http://www.dnastar.com/blog/wp-content/uploads/2014/11/KabukiFilterResults.png"><img class="aligncenter size-large wp-image-375" src="http://www.dnastar.com/blog/wp-content/uploads/2014/11/KabukiFilterResults-1024x568.png" alt="KabukiFilterResults" width="720" height="399" /></a></p>
<p><strong>Conclusions</strong></p>
<p>&nbsp;</p>
<p><strong></strong>Exome sequencing has revolutionized our ability to detect common, rare and private variants in the coding genes of an individual. By sequencing case and control cohorts and then comparing across the spectrum of variants, the genetic causes of Mendelian and complex diseases are being uncovered. NGS technologies and facile software pipelines that integrate assembly, variant calling/annotation and association analyses are essential partners in this endeavor.</p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dnastar.com/blog/clinical-research/discovery-of-gene-candidates-from-ngs-data-using-a-researcher-friendly-pipeline-and-filters/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Q&amp;A With Dr. Sheldon Garrison of Promentis Pharmaceuticals</title>
		<link>http://www.dnastar.com/blog/clinical-research/qa-with-dr-sheldon-garrison-of-promentis-pharmaceuticals/</link>
		<comments>http://www.dnastar.com/blog/clinical-research/qa-with-dr-sheldon-garrison-of-promentis-pharmaceuticals/#comments</comments>
		<pubDate>Fri, 14 Nov 2014 15:00:38 +0000</pubDate>
		<dc:creator><![CDATA[Jackie Carville]]></dc:creator>
				<category><![CDATA[Clinical Research]]></category>
		<category><![CDATA[DNASTAR LabViews]]></category>

		<guid isPermaLink="false">http://www.dnastar.com/blog/?p=359</guid>
		<description><![CDATA[We chatted with Dr. Sheldon Garrison, Director of Pediatric and Rare Diseases at Promentis Pharmaceuticals about his experience with DNASTAR software. &#160; Tell us about your work! My research has primarily focused on the molecular and cellular biology mechanisms of &#8230; <a href="http://www.dnastar.com/blog/clinical-research/qa-with-dr-sheldon-garrison-of-promentis-pharmaceuticals/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.dnastar.com/blog/wp-content/uploads/2014/11/Sheldon-Garrison.png"><img class="alignright size-full wp-image-360" src="http://www.dnastar.com/blog/wp-content/uploads/2014/11/Sheldon-Garrison.png" alt="Sheldon Garrison" width="180" height="180" /></a>We chatted with Dr. Sheldon Garrison, Director of Pediatric and Rare Diseases at <a href="http://www.promentispharma.com/">Promentis Pharmaceuticals</a> about his experience with DNASTAR software.</p>
<p>&nbsp;</p>
<p><strong>Tell us about your work!</strong></p>
<p>My research has primarily focused on the molecular and cellular biology mechanisms of rare pediatric diseases, neuroinflammatory pain and neuromuscular development. Quantifying mRNA expression levels across a broad range of genes has been a critical component within my projects. I use the Lasergene software to optimize everything from primer selection to analysis of our sequence data. The biggest challenges are linking the species-specific genetic and protein information to various pathways.</p>
<p>&nbsp;</p>
<p><strong>How has DNASTAR software helped you?</strong></p>
<p>The Lasergene software has really helped us with our bioinformatic-driven projects. We are constantly exploring both novel drug targets and working to gain a better understanding of current targets of interest. Particularly when working on physiological and pharmacology experiments, we need to have a high degree of confidence in our molecular work. The Lasergene software gives that to us, especially with gene sequence and protein analysis.</p>
<p>&nbsp;</p>
<p><strong>What does DNASTAR software do best, in your opinion?</strong></p>
<p>Data analysis, especially with sequence alignment and working with results from the sequencer. I have also come to appreciate the flexibility of the software with primer design. I have also recently started working with Protean 3D and find it to be extremely helpful with understanding poorly published proteins of interest.</p>
<p>&nbsp;</p>
<p><strong>Can you speak to DNASTAR&#8217;s support for you and your work?</strong></p>
<p>While at the Medical College of Wisconsin I managed a large departmental network because of my familiarity with the program. While things typically ran smoothly, any questions I had were answered same-day, which made my work incredibly easy.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dnastar.com/blog/clinical-research/qa-with-dr-sheldon-garrison-of-promentis-pharmaceuticals/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>GIAB Use Case:  Bringing NA12878 Call Sets to Kidney Disease</title>
		<link>http://www.dnastar.com/blog/clinical-research/giab-use-case-bringing-na12878-call-sets-to-kidney-disease/</link>
		<comments>http://www.dnastar.com/blog/clinical-research/giab-use-case-bringing-na12878-call-sets-to-kidney-disease/#comments</comments>
		<pubDate>Tue, 07 Oct 2014 14:52:01 +0000</pubDate>
		<dc:creator><![CDATA[Kerri Phillips]]></dc:creator>
				<category><![CDATA[Clinical Research]]></category>
		<category><![CDATA[Next-Gen Sequencing]]></category>

		<guid isPermaLink="false">http://www.dnastar.com/blog/?p=297</guid>
		<description><![CDATA[Nephropath™ incorporates DNASTAR pipeline for validating processes against NIST “gold standard.” &#160; The resources provided by the National Institute of Standards and Technology (NIST) Genome in a Bottle (GIAB) consortium promise to greatly improve the reliability of genetic assays. With &#8230; <a href="http://www.dnastar.com/blog/clinical-research/giab-use-case-bringing-na12878-call-sets-to-kidney-disease/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p><em><strong>Nephropath™ incorporates DNASTAR pipeline for validating processes against NIST “gold standard.”</strong></em></p>
<p>&nbsp;</p>
<p>The resources provided by the National Institute of Standards and Technology (NIST) <a href="http://genomeinabottle.org/">Genome in a Bottle (GIAB) consortium</a> promise to greatly improve the reliability of genetic assays. With these tools, laboratories can integrate performance measures directly within the workflow of their testing operations.</p>
<p>&nbsp;</p>
<p><a href="https://www.nephropath.com/">Nephropathology Associates, Inc. (Nephropath™)</a>, a leading U.S. laboratory in the interpretation of kidney biopsies, was motivated to use the NIST materials by the need to demonstrate proficiency in their NGS platform for purposes of CAP/CLIA certification. They were encouraged to look into GIAB by a representative from Illumina and, after a discussion with Justin Zook at the 2013 ASHG conference, decided that using the NIST data was the best option for them. The approach was also appealing because it would provide a measure of the lab’s accuracy as they would be able to compare their data with that of others who use the same controls.</p>
<p>&nbsp;</p>
<p>As part of a collaborative project between Nephropath and DNASTAR, a new workflow has been added to DNASTAR’s assembly and variant calling software that supports use of the GIAB call sets.</p>
<p>&nbsp;</p>
<p>The workflow is designed to work with a “gold standard” control of the user’s choice, such as the set of reference materials for the HapMap/1000 Genomes CEU female NA12878 developed by the GIAB consortium, as shown in Figure 1.</p>
<div id="attachment_298" style="width: 730px" class="wp-caption alignnone"><a href="http://www.dnastar.com/blog/wp-content/uploads/2014/10/Figure-1.png"><img class="size-large wp-image-298" src="http://www.dnastar.com/blog/wp-content/uploads/2014/10/Figure-1-1024x523.png" alt="Figure 1. DNASTAR’s integrated Validated SNP Caller workflow used with NIST GIAB gold standard reference materials." width="720" height="367" /></a><p class="wp-caption-text">Figure 1. DNASTAR’s integrated Validated SNP Caller workflow used with NIST GIAB gold standard reference materials.</p></div>
<p>The purpose is to validate the efficacy of a procedure from sample prep through sequence analysis. At the end of the workflow, the lab obtains an automatically generated statistical report detailing the assembly sensitivity, specificity, and accuracy calculated according the ratios described in Table 1.</p>
<p>&nbsp;</p>
<div id="attachment_299" style="width: 475px" class="wp-caption alignright"><a href="http://www.dnastar.com/blog/wp-content/uploads/2014/10/Figure-2.png"><img class="size-full wp-image-299" src="http://www.dnastar.com/blog/wp-content/uploads/2014/10/Figure-2.png" alt="Table 1. The Validation Report calculations. " width="465" height="182" /></a><p class="wp-caption-text">Table 1. The Validation Report calculations.</p></div>
<p>Nephropath is currently using the Illumina MiSeq and Agilent SureSelectQXT with custom probes for 301 genes involved in kidney disease. They use DNA from NA12878 purchased from Coriell Institute as a sequencing control on every run. Each run is a pool of 9 samples plus the control sequenced with the paired-end MiSeq® Reagent Kit v3 (150 cycle). The NA12878 control FASTQ files generated after the run are loaded into DNASTAR’s SeqMan NGen® software for mapping/alignment against the human genome reference sequence and variant calling using the “Templated assemblies with control” option. To delimit the regions of the genome used for validation, Nephropath uses a BED file of either their entire targeted region or one containing an intersection between the GIAB high quality regions and the targeted regions. The latter is preferred when the most accurate statistics are required as suggested in <a href="//ftp-trace.ncbi.nih.gov/giab/ftp/data/NA12878/variant_calls/GIAB_integration/">this README file</a>. In this way, the NA12878 variant call set VCF file gets subsetted down to just the targeted regions using whichever BED file is selected. After the assembly is complete, every position specified by the BED file, including both variant and reference calls, is checked against the subsetted control VCF to determine true/false positives/negatives. Based on these annotated variant and reference call sets a validation report is generated by the DNASTAR ArrayStar® application, providing various statistics achieved at different sequencing depths and probability thresholds. An excerpt of such a report is given in Figure 2.</p>
<div id="attachment_300" style="width: 730px" class="wp-caption alignright"><a href="http://www.dnastar.com/blog/wp-content/uploads/2014/10/Figure-3.png"><img class="size-large wp-image-300" src="http://www.dnastar.com/blog/wp-content/uploads/2014/10/Figure-3-1024x473.png" alt="Figure 2. Nephropathology Associate’s Kidney Disease Gene Panel: Excerpts from a NA12878 Validation Report. Data provided by Marjorie Beggs (Nephropathology Associates) includes 301 genes from 13 renal disease categories. " width="720" height="332" /></a><p class="wp-caption-text">Figure 2. Nephropathology Associate’s Kidney Disease Gene Panel: Excerpts from a NA12878 Validation Report. Data provided by Marjorie Beggs (Nephropathology Associates) includes 301 genes from 13 renal disease categories.</p></div>
<p>&nbsp;</p>
<p>The pipeline, along with early results, were presented at the recent GIAB consortium workshop in a roundup of emblematic case studies on using the GIAB materials.</p>
<p>&nbsp;</p>
<p>Nephropath, in collaboration with DNASTAR, was recently awarded an SBIR phase I grant to further develop this workflow and software for clinical use. The long-term goal is to implement a fast, accurate and integrated workflow for clinical NGS.</p>
<p>&nbsp;</p>
<p>For more information on the new validation workflow, download , “<a href="http://www.dnastar.com/skins/skin_1/pdf/AccuracyOverview.pdf"><em>SeqMan NGen is a High Accuracy NGS Assembler: Assessment with NA12878 Reference Materials</em></a>,” from the DNASTAR website.</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dnastar.com/blog/clinical-research/giab-use-case-bringing-na12878-call-sets-to-kidney-disease/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Q&amp;A with Dr. Luke Daum of Longhorn Vaccines and Diagnostics</title>
		<link>http://www.dnastar.com/blog/clinical-research/qa-with-dr-luke-daum-of-longhorn-vaccines-and-diagnostics/</link>
		<comments>http://www.dnastar.com/blog/clinical-research/qa-with-dr-luke-daum-of-longhorn-vaccines-and-diagnostics/#comments</comments>
		<pubDate>Thu, 25 Sep 2014 13:34:58 +0000</pubDate>
		<dc:creator><![CDATA[Jackie Carville]]></dc:creator>
				<category><![CDATA[Clinical Research]]></category>
		<category><![CDATA[DNASTAR LabViews]]></category>

		<guid isPermaLink="false">http://www.dnastar.com/blog/?p=280</guid>
		<description><![CDATA[We chatted with Dr. Luke Daum, Chief Scientific Officer at Longhorn Vaccines and Diagnostics about his work with Mycobacterium tuberculosis.  &#160; Tell us about your work! Currently, we do a lot work in Africa with specific focus on Mycobacterium tuberculosis &#8230; <a href="http://www.dnastar.com/blog/clinical-research/qa-with-dr-luke-daum-of-longhorn-vaccines-and-diagnostics/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p><a href="http://www.dnastar.com/blog/wp-content/uploads/2014/09/Luke-Daum.png"><img class="alignright size-medium wp-image-281" src="http://www.dnastar.com/blog/wp-content/uploads/2014/09/Luke-Daum-300x227.png" alt="Luke Daum" width="300" height="227" /></a>We chatted with Dr. Luke Daum, Chief Scientific Officer at <a href="http://www.lhnvd.com/">Longhorn Vaccines and Diagnostics</a> about his work with <em>Mycobacterium tuberculosis. </em></p>
<p>&nbsp;</p>
<p><strong>Tell us about your work!</strong></p>
<p>Currently, we do a lot work in Africa with specific focus on <em>Mycobacterium tuberculosis</em> (MTB) detection and next-generation sequencing (NGS). We develop molecular reagents and products that simplify and enhance molecular detection from the point of sample collection, detection, and DNA sequencing. We also do work with influenza viruses and have recently done some whole-genome MRSA sequencing as well.</p>
<p>&nbsp;</p>
<p><strong>How has DNASTAR software helped you with your research goals?</strong></p>
<p>In 1999, I established the U.S. Air Force’s molecular influenza strain surveillance program and was first introduced to DNASTAR by the CDC. Since then, I’ve never let go. I prefer DNASTAR over other bioinformatics software because it’s so simple to use. I have observed and participated in the evolution and maturation of DNASTAR software from simple gene alignments of DNA off automated slab gel sequencers to today’s cutting edge, multimillion read NGS assemblies! DNASTAR continues to mature alongside rapid advances in sequencing technologies.</p>
<p>&nbsp;</p>
<p><strong>What does DNASTAR software do best, in your opinion?</strong></p>
<p>Since we do a lot of barcoding/indexing of patient isolate samples, I like the ease with which I can perform a multi-patient analysis within a single SeqMan NGen assembly-the software enables you to quickly switch within barcoded/indexed patients. I also like that with SeqMan NGen I can quickly input a reference gene or gene panel of interest against a full genomic library file to quickly assess for mutations in genes of interest. Sometimes you just want to look at specific genes and you don’t always need/want the hassle of dealing with an entire 4.2 million bp MTB genome!</p>
<p>&nbsp;</p>
<p><strong>Can you speak to DNASTAR&#8217;s support for you and your work?</strong></p>
<p>The DNASTAR team works closely with<a href="http://www.dnastar.com/t-sub-nextgen-sequencing-technologies-ion-torrent.aspx"> Life Technologies</a> and <a href="http://www.dnastar.com/t-sub-nextgen-sequencing-technologies-illumina.aspx">Illumina </a>to simplify the analysis of NGS data for the everyday scientist. Most scientists are good at empirical experimentation and molecular biology testing but a bit intimidated by the size of raw data output and the bioinformatics aspects of NGS. DNASTAR bridges the gap between 2-3 gigabytes of raw NGS sequence data and a thorough genetic analysis containing graphical alignments, mutational reports, and phylogenies.   For example, from incredibly large raw MiSeq data files containing 24 indexed MTB genomes I can easily use the tools in DNASTAR to assemble contigs and analyze TB isolates for mutations in antibiotic resistance genes. From here it’s simple to generate SNP reports, multiple sequence alignments, and phylogenetic trees.</p>
<p>&nbsp;</p>
<p><strong>Is there anything else you&#8217;d like to share with us?</strong></p>
<p>When I get into trouble I find it easy to contact DNASTAR technical support. Matthew Keyser and his team are always available for troubleshooting and resolving user issues.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.dnastar.com/blog/clinical-research/qa-with-dr-luke-daum-of-longhorn-vaccines-and-diagnostics/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
