Skip to main content
  • American Heart Association
  • Science Volunteer
  • Warning Signs
  • Advanced Search
  • Donate

  • Home
  • About this Journal
    • Editorial Board
    • General Statistics
    • Circulation Doodle
      • Doodle Gallery
      • Circulation Cover Doodle
    • → Blip the Doodle
    • Information for Advertisers
    • Author Reprints
    • Commercial Reprints
    • Customer Service and Ordering Information
  • All Issues
  • Subjects
    • All Subjects
    • Arrhythmia and Electrophysiology
    • Basic, Translational, and Clinical Research
    • Critical Care and Resuscitation
    • Epidemiology, Lifestyle, and Prevention
    • Genetics
    • Heart Failure and Cardiac Disease
    • Hypertension
    • Imaging and Diagnostic Testing
    • Intervention, Surgery, Transplantation
    • Quality and Outcomes
    • Stroke
    • Vascular Disease
  • Browse Features
    • AHA Guidelines and Statements
    • Bridging Disciplines
    • → Articles Bridging Discplines
    • Cardiovascular Case Series
    • Circulation Supplements
    • ECG Challenge
    • Hospitals of History
      • Hospital Santa Maria del Popolo, Naples, Italy
      • Minneapolis City Hospital
      • Pitié-Salpêtrière Hospital
      • Tufts Medical Center
      • Uppsala University Hospital
      • Vassar Brothers Medical Center (Poughkeepsie, NY)
      • Wroclaw Medical University
    • On My Mind
    • Podcast Archive
      • → Circulation on the Run, FIT Edition
    • → Subscribe to Circulation on the Run
  • Resources
    • Instructions for Authors
      • Accepted Manuscripts
      • Revised Manuscripts
    • → Article Types
    • → General Preparation Instructions
    • → Research Guidelines
    • → How to Submit a Manuscript
    • Journal Policies
    • Permissions and Rights Q&A
    • Submission Sites
    • Circulation CME
    • AHA Journals RSS Feeds
    • International Users
    • AHA Newsroom
    • Scientific Sessions 2017
  • AHA Journals
    • AHA Journals Home
    • Arteriosclerosis, Thrombosis, and Vascular Biology (ATVB)
    • Circulation
    • → Circ: Arrhythmia and Electrophysiology
    • → Circ: Cardiovascular Genetics
    • → Circ: Cardiovascular Imaging
    • → Circ: Cardiovascular Interventions
    • → Circ: Cardiovascular Quality & Outcomes
    • → Circ: Heart Failure
    • Circulation Research
    • Hypertension
    • Stroke
    • Journal of the American Heart Association
  • Facebook
  • Twitter

  • My alerts
  • Sign In
  • Join

  • Advanced search

Header Publisher Menu

  • American Heart Association
  • Science Volunteer
  • Warning Signs
  • Advanced Search
  • Donate

Circulation

  • My alerts
  • Sign In
  • Join

  • Facebook
  • Twitter
  • Home
  • About this Journal
    • Editorial Board
    • General Statistics
    • Circulation Doodle
    • → Blip the Doodle
    • Information for Advertisers
    • Author Reprints
    • Commercial Reprints
    • Customer Service and Ordering Information
  • All Issues
  • Subjects
    • All Subjects
    • Arrhythmia and Electrophysiology
    • Basic, Translational, and Clinical Research
    • Critical Care and Resuscitation
    • Epidemiology, Lifestyle, and Prevention
    • Genetics
    • Heart Failure and Cardiac Disease
    • Hypertension
    • Imaging and Diagnostic Testing
    • Intervention, Surgery, Transplantation
    • Quality and Outcomes
    • Stroke
    • Vascular Disease
  • Browse Features
    • AHA Guidelines and Statements
    • Bridging Disciplines
    • → Articles Bridging Discplines
    • Cardiovascular Case Series
    • Circulation Supplements
    • ECG Challenge
    • Hospitals of History
    • On My Mind
    • Podcast Archive
    • → Subscribe to Circulation on the Run
  • Resources
    • Instructions for Authors
    • → Article Types
    • → General Preparation Instructions
    • → Research Guidelines
    • → How to Submit a Manuscript
    • Journal Policies
    • Permissions and Rights Q&A
    • Submission Sites
    • Circulation CME
    • AHA Journals RSS Feeds
    • International Users
    • AHA Newsroom
    • Scientific Sessions 2017
  • AHA Journals
    • AHA Journals Home
    • Arteriosclerosis, Thrombosis, and Vascular Biology (ATVB)
    • Circulation
    • → Circ: Arrhythmia and Electrophysiology
    • → Circ: Cardiovascular Genetics
    • → Circ: Cardiovascular Imaging
    • → Circ: Cardiovascular Interventions
    • → Circ: Cardiovascular Quality & Outcomes
    • → Circ: Heart Failure
    • Circulation Research
    • Hypertension
    • Stroke
    • Journal of the American Heart Association
Basic Science for Clinicians

DNA Sequencing

Clinical Applications of New DNA Sequencing Technologies

Frederick E. Dewey, Stephen Pan, Matthew T. Wheeler, Stephen R. Quake, Euan A. Ashley
Download PDF
https://doi.org/10.1161/CIRCULATIONAHA.110.972828
Circulation. 2012;125:931-944
Originally published February 21, 2012
Frederick E. Dewey
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Stephen Pan
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Matthew T. Wheeler
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Stephen R. Quake
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Euan A. Ashley
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
  • Article
  • Figures & Tables
  • Info & Metrics
  • eLetters

Jump to

  • Article
    • INTRODUCTION
    • Historical Perspective
    • Next-Generation Sequencing Technologies
    • Third-Generation Sequencing Technologies
    • Processing High-Throughput Sequence Data
    • The Human Reference Genome and Its Limitations
    • Aligning Sequence Reads to the Human Reference Genome
    • Identifying Single-Nucleotide Variants and Small Insertions/Deletions
    • Identifying Large Structural Variants
    • Variant Quality Control and Genotype Validation
    • Haplotype Phasing With Use of High-Throughput Sequence Data
    • High-Throughput Sequencing and Mendelian Disease Genetics
    • High-Throughput Sequencing and Complex Disease Genetics
    • Genome Sequencing and the Clinic
    • Other Applications of High-Throughput Sequencing
    • Conclusions
    • Sources of Funding
    • Disclosures
    • References
  • Figures & Tables
  • Info & Metrics
  • eLetters
Loading
  • DNA polymorphism
  • genetic variation
  • genome
  • sequencing

INTRODUCTION

We are in a time of great change in genetics that may dramatically impact human biology and medicine. The completion of the human genome project,1,2 the development of low-cost, high-throughput parallel sequencing technology, and large-scale studies of genetic variation3 have provided a rich set of techniques and data for the study of genetic disease risk, treatment response, population diversity, and human evolution. Newly developed sequencing instruments now generate hundreds of millions to billions of short sequences per run, allowing for rapid complete sequencing of human genomes. These technological advances have facilitated a precipitous drop (Figure 1) in the cost per base pair of DNA sequenced. To capitalize on the potential of these technologies for research and clinical applications, translational scientists and clinicians must become familiar with a continuously evolving field. In this review, we will provide a historical perspective on human genome sequencing, summarize current and future sequencing technologies, highlight issues related to data management and interpretation, and finally consider research and clinical applications of high-throughput sequencing, with specific emphasis on cardiovascular disease.

Figure 1.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 1.

Sequencing milestones, costs, and output since completion of the human genome project. Note logarithmic scale for sequencing costs and bases produced per sequence run.

Historical Perspective

Genome sequencing has become synonymous with high-throughput sequencing, but it is instructive to revisit historical milestones. Although James Watson and Francis Crick published the first description of the crystallographic double-helix DNA structure in 1953,4 it was not until 2 decades later, with the nearly simultaneous development of Maxam-Gilbert and Sanger sequencing,5,6 that DNA sequencing became widely available to the research community. The Sanger method, which is based on DNA chain termination with a small concentration of radio- or fluorescently labeled dideoxy nucleotide triphosphate (dNTP) molecules followed by size separation by gel electrophoresis, became the research and commercial standard because of technical ease and reliability of results. This was the standard sequencing technology for >3 decades and remains the method of choice for sequencing short segments of DNA and confirming genotypes from other technologies. Sanger sequencing, in conjunction with several methods for identifying the approximate genetic locations (loci) harboring variations in DNA associated with disease, was the method used to define the basis of many Mendelian, or single-gene disorders.

More recently, a modified Sanger approach was the main sequencing engine for the first draft human genome sequence, which was produced by sequencing 500- to 600-bp segments of DNA in parallel (shotgun sequencing) and assembly of these sequence fragments into contiguous stretches of DNA (contigs) based on sequence overlap.1,2,7 Two sequences were released nearly simultaneously; the first was a product of the decade-long publicly funded Human Genome Project,1 and the second was released by the Celera Corporation, led by Craig Venter and colleagues.7 The accuracy and read lengths generated by this technology were advantageous to sequencing projects in which no template or reference sequence was available, but the sequence time (years) and cost (estimated at between 300 million and 3 billion dollars) of these early efforts precluded the use of this technology for large-scale human genome sequencing. However, the completion of a human genome reference sequence allowed for the development of a next generation of sequencing instruments that substantially reduced DNA sequencing time and cost.

Next-Generation Sequencing Technologies

The development of a draft human genome sequence, which has subsequently been revised to constitute a reference human genome sequence, facilitated the development of next-generation sequencing (NGS). NGS is a broad term that refers to a set of methods for (1) genomic template preparation, or the methodology for processing genomic DNA for downstream sequencing; (2) near-simultaneous, or massively parallel, generation of millions to billions of short sequence reads; (3) alignment of sequence reads to a reference sequence; (4) sequence assembly from aligned sequence reads and genetic variant discovery (Figure 2). Most investigators use the output from this final step, a list of genotypes for positions with at least 1 allele that differs from a reference sequence (variants) in all downstream analysis. Thus, whole genome sequence data generally refers not to ≈3 billion diploid genotypes that cover the known chromosomal positions, but the 3 to 4 million genotypes in each genome that differ from the reference sequence. Several NGS technologies exist that differ primarily in methods for clonal amplification of short fragments of DNA and sequencing the resulting short DNA fragments. Each has specific advantages in terms of read length, accuracy, and throughput (Table 1). All currently forego the time-consuming bacterial cloning step that was used for library preparation in the Human Genome Project. For full details of the technical aspects of each sequencing technology, we refer the reader to recent technological reviews.8,9 We will briefly review each technology here with a focus on advantages, disadvantages, and specific sequencing applications for each platform. One issue that deserves specific mention is that of read length. Shorter sequence reads (100 bp or shorter) are well suited to the biochemical reactions used by most of the sequencing technologies. However, the generation of short reads complicates sequence assembly, particularly in repetitive regions of the genome. The generation of longer sequence reads (1000 bp or longer) simplifies this task. Furthermore, the use of longer sequence reads spanning several variants aids in resolution of haplotype phase, which is the assignment of each allele in a heterozygous genotype to 1 chromosome of each homologous pair, eg, the assignment of an A allele in a A/G genotype to a paternally derived segment of chromosome 13.

Figure 2.
  • Download figure
  • Open in new tab
  • Download powerpoint
Figure 2.

Three generations of human genome sequencing technology. Three groups of sequencing technology are depicted: sequencing in the human genome project; second-generation sequencing as exemplified by the Illumina HiSeq 2000; and third-generation sequencing as exemplified by the Helicos Heliscope single-molecule sequencer.

View this table:
  • View inline
  • View popup
Table 1.

Sequencing Platform Comparison

Of the NGS platforms that are currently commercially available, the 454 (454 Life Sciences/Roche) instrument was developed first. This platform is based on pyrosequencing, which detects light emitted by secondary reactions initiated by the release of pyrophosphate during nucleotide incorporation.10 Advantages include long reads and facile mate-pair sequencing, a method that sequences both ends of a previously circularized DNA molecule. Pairing reads that span tens of kilobases of genomic template sequence further facilitates haplotype phasing and the identification of structural genetic variation such as deletions and insertions of large segments of DNA. Disadvantages include systematic errors in reading frame (“frameshift errors”) in certain circumstances and lower throughput and higher sequencing costs than other commercial technologies.

SOLiD (Applied Biosystems by Life Technologies) sequencing utilizes sequencing-by-ligation in which the sequence of a DNA template is read by competitive ligation of 2-base probes to the nascent DNA strand.11 Advantages include throughput (≈20–30 Gbp per run), and base-level error information encoded in the 2-base sequences, both of which make the platform suitable for human whole genome and exome variant discovery. The main disadvantage is the necessity to work with unconventional data formats for sequence reads and the reference genome.

The Illumina/Solexa (Illumina, Inc) platform is widely used for a variety of applications, including human whole genome and exome variant discovery and transcriptome sequencing (“RNAseq”), by virtue of easily prepared paired-end sequencing libraries, high throughput, and ease of analysis of its short-read information. After genomic DNA isolation, fragmentation, and several enzymatic modification steps, sequencing libraries are amplified from single DNA strands on glass surfaces. The resultant templates are sequenced by the use of an approach in which fluorescently labeled end-blocked nucleotides, which do not allow further DNA polymerization, are incorporated by DNA polymerase, the base-specific fluorescent color is detected via fluorescence imaging, the end block and fluorescent tag is enzymatically cleaved, and the process is repeated following image storage, yielding image-encoded nucleotide sequences.12 Drawbacks include comparatively short sequence reads (≤100 bp) and practical limits to insert sizes for paired-end sequencing.

Complete Genomics, Inc provides a sequencing service, in contrast to other companies that have primarily focused on providing sequencing instruments, that is, targeted solely toward human whole genomes. The instrument uses sequencing-by-ligation of hundreds of DNA nano-balls, or chained-replicates of 70-bp sequences of sheared genomic DNA modified by adaptor inserts.13 Theoretical throughput exceeds that of any of the NGS technologies described thus far.

Third-Generation Sequencing Technologies

A third generation of sequencing instruments has been developed that is defined by the lack of DNA or RNA amplification in template library preparation (single-molecule sequencing, Figure 2). By foregoing this step, these technologies require less genomic DNA, avoid polymerase chain reaction–introduced error and amplification bias, and may be superior for high-throughput sequencing applications, such as transcriptome sequencing (RNAseq), that depend on accurate quantification of relative DNA or RNA fragment abundance.

The first of these single-molecule sequencing technologies is the Helicos Heliscope (Helicos BioSciences). The specific Helicos chemistry is based on single-molecule sequencing by cyclic reversible terminator nucleotide incorporation.14 A single dye molecule is used to label the dNTPs and fluorescence microscopy is used to image the dye in sequencing reactions performed on single-molecule templates on solid support. The order in which each fluorescently labeled dNTP is added to the sequencing reactor determines the base sequence at that position. Notably, the instrument is also suitable for direct RNA sequencing without conversion to cDNA, thus avoiding error and copy number bias associated with reverse transcription.15

Pacific Biosciences have recently developed a method for imaging individual DNA polymerase molecules as they synthesize a nascent DNA molecule covalently attached to solid support.16 Advantages include read information that is theoretically as long as 1 kb or longer and real-time sequencing kinetics that reflect nucleotide methylation state and DNA secondary structure.17

Life Technology's Ion Torrent device is targeted toward individual laboratories interested in a small-footprint, medium-throughput sequencing platform. This sequencing engine is based on detection of hydrogen ions released from nucleotides incorporated into the growing DNA strand.18 This signal is detected in a solid-state semiconductor akin to a miniaturized pH meter, and the technology is theoretically suitable to single-molecule sequencing. Throughput is currently low (<1 Gbp per run), but the release of higher-density chips has made sequencing of transcriptomes and exomes feasible.

Nanopore sequencing technologies detect base-specific changes in ionic flux as DNA traverses small pores in solid surfaces that are placed in an electric field.19 Advantages to this method include theoretically unparalleled sequencing speed and minimal template preparation. At this point, however, detection speed and accuracy remain significant technological hurdles, because the transit speed of nucleic acids through nanopores in even minimal electric fields is several orders or magnitude higher than the highest detection frequency. Several enzymatic methods have been developed to slow transit time and facilitate detection of changes in ionic flux.20,21

Processing High-Throughput Sequence Data

Data generation from high-throughput sequencing is becoming less expensive and time consuming. Generating sequence data, however, is only the first step in extracting usable information from high-throughput sequencing. For output from most currently available sequencing platforms, several tasks must be performed before downstream analysis: (1) short-read mapping, or alignment of each sequence read to a reference genome to identify the genomic sequence represented by the short read; (2) base calling at every genomic position covered by aligned short reads; and (3) identification of sequence variation from the reference genome. The percentage of base positions that are read by properly aligned short reads is described by coverage. The number of times that a single base position is read by short-read sequences is termed “depth of coverage” and most investigators currently consider 30-fold (30×) average depth of coverage as a benchmark for high-quality genome sequence data. Before discussing these data management issues, it is worth highlighting some of the limitations of the current approach that utilizes a haploid reference sequence, ie, a sequence that has only 1 base for every genomic position.

The Human Reference Genome and Its Limitations

The human reference genome currently used for short-read alignment and variant calling (National Center for Biotechnology Information [NCBI] reference genome22) is derived from a collection of DNA samples from a small number of anonymous donors. It is currently the only finished-grade human genome in that it was assembled de novo from long sequence reads and covers ≈99% of known chromosomal positions with high fidelity. However, it represents a very small sampling of human genetic variation. Analysis in our laboratory using the 1000 genomes population variation data demonstrated that at ≈1.6 million genomic positions, the NCBI reference sequence differed from the major, or most frequent, allele in each of the 3 HapMap populations, including ≈800 000 positions at which all 3 population groups have major alleles that differ from the NCBI reference allele.23 In addition, the reference sequence contains thousands of common and rare disease risk alleles, including >20 rare disease susceptibility alleles such as the Factor V Leiden allele associated with hereditary thrombophilia.23,24 Various approaches to addressing these issues have been suggested, including the use of a “major allele” reference sequence. We have recently used this approach to identify the putative genetic basis for familial thrombophilia in a family quartet by use of whole genome sequencing.23 Notably, the multigenic risk for this trait we identified included the Factor V allele conferring activated protein C resistance, which would not have been identified in homozygous state by using the NCBI reference genome for variant identification.

Aligning Sequence Reads to the Human Reference Genome

There are several programs for mapping short reads to a reference genome; for an in-depth comparison of alignment programs, we direct the reader to a recent work by Li and Homer.25 Historically, mapping alignment with quality was the most widely used alignment algorithm,26 but this algorithm has been supplanted by other open-source solutions that are superior for longer (>35 bp) sequence reads. Although several alignment algorithms can be run on high-memory multiple core desktops and even laptops, parallel computing architecture, which utilizes multiple processors to perform alignment tasks simultaneously, reduces the time required for alignment severalfold. Unfortunately, few individual laboratories currently are able to provide this computing power. One solution is on-demand distributed or parallel computing architecture, ie, cloud computing. This approach is economical in the sense that elastic parallel computing environments allow users to select and utilize only processing and storage capacity necessary for current tasks.

Identifying Single-Nucleotide Variants and Small Insertions/Deletions

Following alignment to the reference genome, sequence reads are compared at every genomic position, producing a base call for each chromosomal position. For in-depth discussion of genotype calling from next-generation sequence data, including the use of linkage disequilibrium for genotype determination and probabilistic genotypes for low- and intermediate-coverage sequencing, such as that used in the 1000 genomes project, we direct the reader to a recent work by Nielsen et al.27 A variety of different algorithms incorporate base quality, which specifies the confidence of each base call within the individual short reads, mapping quality, or confidence of accurate mapping of each short read to the specified genomic locus, and the number of bases contributing to each of the possible 16 genotypes at a position, into a probabilistic score for genotypes at every chromosomal location. The most likely genotype is compared with the reference sequence, and, typically, only positions containing at least 1 base differing from the reference sequence are retained for downstream analysis. This fact has several important implications. First, the reference base is crucial to the identification of genetic variation: if the haploid reference base harbors the same allele predisposing to disease as the subject being sequenced, it will not appear in the variant list, potentially leading to underestimation of the burden of certain disease-associated alleles. Second, comparison between individuals, eg, in cosegregation and linkage studies, can be complicated by the degree of overlap between genetic variant sets such that the assumption of homozygous reference allele calls can bias exploratory studies for causative variants. Several variant calling solutions, notably, SAMtools28 and the Genome Analysis Toolkit29 have base calling algorithms that facilitate cohort-wide variant identification, which addresses this problem. Third, the reference sequence represents a small sampling of human genetic variation, and, as large-scale sequencing efforts are undertaken, ethnicity-specific major allele differences may impact alignment of short reads against the current reference genome and subsequent variant identification.

Identifying Large Structural Variants

Large structural rearrangements >1 kb, termed structural variants (SVs), encompass large deletions, duplications, insertions, inversions, and transposons. Largely ignored in many early sequencing efforts, emerging evidence suggests that these SVs are strongly associated with several Mendelian and complex diseases, including familial dilated cardiomyopathy, autism spectrum disorders, idiopathic mental retardation, schizophrenia, and Crohn's disease.30–35 In some cases, these large genetic variants underlie >15% of disease diagnoses.36 Several methods have been developed for identification of SVs, but 3 main methods have generally been accepted and are used for identification of specific types of SVs. A complementary, hybridization-based method for identifying SVs, comparative genomic hybridization, will not be discussed further here. Notably, however, because of high false-positive rates for SV detection with the use of high-throughput sequencing, this and other polymerase chain reaction–based methods are often used to confirm candidate SVs.

The first method for identification of SVs is mate-pair sequencing,37 which is based on sequencing 2 ends of a DNA molecule following circularization, providing paired short-read sequence information separated by hundreds to thousands of base pairs. A related technique, paired-end sequencing, is used routinely in most commercial sequencing technologies to provide paired short-sequence reads from each end of an amplified linear DNA molecule. Comparison of median insert size and orientation from paired-end reads to homologous chromosomal segments in the reference genome is used to identify structural rearrangements.38 Although sensitive for inversions and other copy neutral SVs, or SVs that do not change the copy number of the affected chromosomal region, and somewhat well suited to identifying start and end points of SVs (breakpoints), detection scope is limited by the size of the insert, in that only structural rearrangements spanned by the insert can be detected.

A second method for identification of structural rearrangements is based on regional variation in read depth, which in turn depends on copy number of the genomic region interrogated. Several methods have been developed for identification of significant differences in read depth in genomic regions relative to median read depth.39–43 This method for identification of SVs is ideally suited for identification of large insertions and deletions, but has limited capability to resolve breakpoints, and cannot distinguish copy neutral SVs from normal sequence.

The third method for identification of large SVs is split-read mapping, which is based on mapping elements with inserts in the reference genome or the sample genome to contiguous short-read sequences by using 1 end of the read as an anchor and the other end to search for possible breakpoints, yielding single-nucleotide level breakpoint resolution and novel sequence discovery in some cases.44 Finally, candidate SVs are often compared with known SVs identified by using population-scale sequencing or genotyping to provide probabilities of false discovery and improved breakpoint resolution.45

Variant Quality Control and Genotype Validation

Validation of sequence data has become a particularly difficult problem in interpretation of genetic variants discovered via high-throughput sequencing. Per genotype error rates for commercially available high-throughput sequencing technologies achieving an average depth of coverage of >30 times are currently between 1 in every 1000 to 1 in every 100 000 bases. By comparison, per genotype error rates for Sanger sequencing, the current standard for clinical applications, is between 1 in 100 000 and 1 in 1 000 000 bp. Filtering variants via a combination of quality score metrics for individual short reads and final genotypes can minimize errors. Roach et al46 and our group have demonstrated that leveraging family genotype information can also be useful for error identification, in that pedigree-based allele inheritance analysis can be used to identify not only inconsistencies with Mendel's laws of inheritance, but also regions in which short reads have been incorrectly mapped or genotyped. We have recently demonstrated a >90% reduction in the error rate by sequestering variants identified in these regions.23

Despite these and other advances in error reduction, however, high-throughput sequencing platforms do not yet provide the level of confidence about individual variants that would be required for routine incorporation into clinical care. To date, clinically important variants have mostly been resequenced by using Sanger-based chemistry or confirmed with oligonucleotide genotyping arrays. Both approaches are time and resource intensive. Alternative capture-based approaches, in which either a standard commercial or custom oligonucleotide set is used to select genomic regions of interest for high-coverage, high-throughput resequencing, are also costly and time consuming. Validation of small SVs such as insertions and deletions is even more difficult, often requiring cloning of single chromosomal segments before resequencing. Until the accuracy of high-throughput sequencing improves such that primary data do not require orthogonal confirmation, data validation will continue to be a major barrier to widespread incorporation of high-throughput sequence data into clinical applications.

Haplotype Phasing With Use of High-Throughput Sequence Data

Resolution of haplotype phase is important to understanding shared disease-associate chromosomal segments containing variants that tend to be inherited en bloc, compound heterozygous (2 or more risk alleles in 1 gene) and oligogenic (2 or more risk alleles in multiple genes) genotype-phenotype associations, regulatory effects of genetic variation, and differential parent of origin effects in disease association studies.47 Furthermore, large databases of phased-sequence data will be important resources for genome-wide association studies that utilize imputation, or estimation of genotypes not assayed by other technologies such as chip-based genotyping. This practice has become commonplace as investigators combine datasets to improve power to detect disease associations of small magnitude, and will be important for investigating rare variant effects. Short-read, high-throughput sequence data alone do not provide information about haplotype phase. However, several statistical algorithms based on pedigree information, common population haplotypes, and paired short reads have been developed that are applicable to high-throughput sequence data.47–50 Moreover, several investigators have developed experimental methods for haplotype phasing based on sorting individual metaphase chromosomes and subsequent sequencing,51 or from a combination of long-insert cloning and next-generation sequencing.52 Further development of these methods will be critical to the use of this tool for investigating disease biology.

High-Throughput Sequencing and Mendelian Disease Genetics

The utility of high-throughput sequencing for investigation of disease genetics is great. The application of NGS for the identification of cardiovascular disease–associated loci has resulted in several notable successes, including the identification of BAG3 mutations as a cause of dilated cardiomyopathy, mutations in SMAD3 associated with familial aortic aneurysms, and AARS2 and ACAD9 in familial mitochondrial cardiomyopathy35,53–55 (Table 2). These studies have provided intriguing hypotheses for follow-up work characterizing novel pathways in human cardiovascular disease. The genetic basis for several noncardiovascular diseases has similarly been explored by using exome and whole genome sequencing (Table 3). It is noteworthy that 2 studies have demonstrated the promise of NGS in aiding clinical diagnosis and management. Choi et al61 used exome sequencing to identify a mutation in SLC26A3 in a patient with the suspected renal salt-wasting Bartter syndrome; this finding allowed them to make the unanticipated diagnosis of congenital chloride diarrhea and modify clinical care accordingly. Worthey et al66 used exome sequencing to identify a missense mutation in the gene XIAP in a patient with intractable Crohn's-like inflammatory bowel disease, establishing a diagnosis of X-linked inhibitor of apoptosis deficiency. Subsequent allogeneic stem cell transplant resulted in dramatic improvement in the patient's gastrointestinal disease.

View this table:
  • View inline
  • View popup
Table 2.

Exome and Whole Genome Sequencing for Cardiovascular Disease Gene Identification

View this table:
  • View inline
  • View popup
Table 3.

Selected Studies Using Exome and Whole Genome Sequencing for Non-Cardiovascular Disease Gene Identification

Thus far, these studies have focused on well-characterized diseases with extreme phenotypic manifestations and cosegregation analysis of single-gene loci. However, filtering variants by cosegregation with the disease phenotype, and by comparison with population controls, eg, the dbSNP and 1000 genomes genetic variation databases has not always yielded a definitive answer. This difficulty is further compounded by the inclusion of nonvalidated single-nucleotide polymorphisms in recent iterations of these databases. Consequently, these repositories now contain a small but definite subset of putative variants that are actually sequencing errors. Filtering of variants in cosegregation studies by their presence in these databases may thus lead to the misidentification of damaging mutations as benign polymorphisms. Further annotation of variants by identity-by-descent status, which seeks to identify common ancestral disease-associated haplotypes, represents an evolution of the cosegregation approach.80,81

High-Throughput Sequencing and Complex Disease Genetics

More recently, there has been increasing interest in the use of high-throughput sequencing for association analysis with complex disease. Much of the focus has been on discovering the source of “missing heritability” of common complex diseases. Six years has elapsed since the first publication of a genome-wide association study of common genetic variants and common disease.82 Since then, hundreds of highly statistically significant, replicated associations with common disease have been found. However, most alleles identified via this technique confer modest risk, and the heritability of common disease explained by these alleles in isolation or aggregate is low.83 One of the hypotheses for this relative paucity of high-effect associations is that rare variants of large effect contribute in aggregate to common disease. By virtue of their rarity, these variants have not been included on current genotyping arrays, and, therefore, previous genome-wide association studies have been unable to assess their association with disease. Furthermore, several investigators have hypothesized that some of the modest associations between common variants and common disease are mediated via weak-linkage disequilibrium between common marker variants and rare, causative variants of large effect.84 Recently, Stefansson et al58 used a combination of large-scale chip-based genotyping and intermediate-depth (10×) whole genome sequencing of a smaller cohort of cases and controls to identify a rare variant in a novel locus strongly associated with sick sinus syndrome. Importantly, this is the first demonstration of the use of whole genome sequencing to identify an association between a rare variant and complex disease. Although it is not yet cost-effective to perform deep whole genome sequencing of large cohorts of individuals with common disease, as sequencing costs drop, genotype-phenotype association studies using whole genome sequencing may become feasible. Meanwhile, several efforts are currently underway to identify coding variants associated with complex phenotypes using exome sequencing.

An important advantage of whole genome over whole exome association studies is the ability to interrogate the noncoding genome. Despite systematic overrepresentation of protein coding regions on many genotyping arrays, 88% of significant genome-wide associations are located in intronic or intergenic regions.85 Thus, exome-targeted sequencing approaches are likely to miss the majority of significant genome-wide associations with common disease. For Mendelian disorders, the majority of underlying allelic variants identified thus far disrupt coding regions, and thus many early sequencing efforts have focused on the exome. However, it is likely that more comprehensive variant discovery will be required for discovery of many genotype-phenotype associations.

Genome Sequencing and the Clinic

Applying the bulk of genetic predictive information to whole genome sequence data from individuals is one of the most difficult tasks in NGS data interpretation. We previously developed and applied a methodology for interpretation of genetic and environmental risk in a single subject by use of a combination of traditional clinical assessment, whole genome sequencing, and integration of genetic and environmental risk factors,86 and have recently done so for a family quartet.23 A similar approach has been applied to carrier testing for severe recessive childhood disease risk by use of NGS and for detection of fetal aneuploidy via sequencing of maternal blood samples.87,88 One of the main challenges to the widespread application of these analytic schemes is incomplete and inconsistent status of publicly available genome annotation databases. Several annotation sources exist for gene regions, including the consensus coding sequence database,89 RefSeq,22 the UCSC KnownGenes database,90 and the GENCODE91 and ENSEMBL92 databases. Each has advantages in terms of coverage and accuracy; however, the inconsistent use of these data in the literature is an issue for replicating research findings. Similarly, several variant databases exist for associations with Mendelian disorders, including the Human Gene Mutation Database,93,94 the Online Mendelian Inheritance in Man (http://www.ncbi.nlm.nih.gov/omim), and many disease-specific databases. None are well suited to variant-level annotation of whole genome sequence data, and many contain annotation errors and common polymorphisms, by some estimates comprising ≈ >25% of the entries. Furthermore, these databases are contaminated by descriptions of susceptibility loci of questionable impact,87 and mutation annotations are often based on differing builds of the reference genome or outdated gene and protein sequences. Several prediction algorithms exist for predicting variant pathogenicity that are based on different combinations of evolutionary conservation, structural prediction, and physical properties of amino acid substitutions.95–99 However, they are limited in specificity and sensitivity, and concordance between predictions from the various algorithms is low.100 Databases for common variant–common disease associations85,86 and pharmacogenomic associations101 are more complete, but there is a great need for comprehensive, easily searchable, and accurate variant-level association databases as whole genome sequence data becomes more widely available.

Other Applications of High-Throughput Sequencing

Although high-throughput sequencing has become synonymous with whole genome and exome sequencing, there are many other emerging applications for the technology. The first of these is whole transcriptome sequencing, which uses massively parallel sequencing to sequence RNA transcripts in various physiological conditions. This unique application allows for determination of allele-specific expression, information about alternative splicing, RNA-editing events, and, via read depth, accurate quantification of mRNA copy number, and, therefore, gene expression. In comparison with oligonucleotide expression arrays, RNAseq is able to quantify transcript abundance with a greater dynamic range and accuracy at extremes of transcript abundance, allowing for more accurate quantification of gene expression and rich functional genomics information. Matkovich et al102 recently used a unique combination of RNAseq and a new technology, RNA-induced silencing complexes sequencing, to characterize cardiac mRNA regulation by micro-RNAs, small noncoding RNAs that regulate diverse cellular functions by facilitating mRNA degradation or inhibiting translation. Technologies that do not require generation and amplification of a cDNA library, such as the Helicos platform, are particularly well suited to this application because they require no previous knowledge of the transcriptome and avoid biases in gene expression measurements and sequencing errors that are related to reverse transcription.

Second, subsets of exomes can be queried in a high-throughput manner in the next generation of candidate gene studies by use of custom oligonucleotide-based capture techniques coupled with high-throughput sequencing.103–106 Combined with a pooled case-control approach, these study designs may prove to be valuable to gene finding or comprehensive sequence interrogation (“fine mapping”) of genomic regions that have been linked with inherited disease by other technologies such as array-based genotyping or repetitive element mapping.

Third, there is increasing focus on the use of high-throughput sequencing in clinical diagnosis via the rapid identification of cell-free DNA. Specific to cardiovascular medicine is the recent demonstration of the use of cell-free sequencing of blood samples for the identification of an organ-specific transplant DNA signature correlating with acute cellular rejection in a pilot study of heart transplant recipients.107 With confirmation in larger cohorts, technologies such as these may be combined with other functional assays of the genome such as gene expression arrays108 to obviate the need for endomyocardial biopsy surveillance in select patients.

Last, although we have focused in much of this review on inherited genetic information, there is an entirely separate dimension of heritable information that researchers are just beginning to explore on a genome-wide scale. Epigenetic traits, or heritable traits that do not involve DNA sequence changes, are often due to chemical modifications of the DNA molecule such as cytosine methylation in CpG regions.109,110 To date, bisulfite sequencing, in which 5-methyl-cytosine bases are converted to uracil by bisulfite and subsequently sequenced, identifying CpG regions with high uracil content that correspond to methyl-cytosine bases, has been the standard technique. However, single-molecule sequencers yield polymerase kinetics information that correlates with methylation status and other structural information such as DNA polymerase footprint and RNA and DNA secondary structure.

Conclusions

The whole genome sequencing era is here. Challenges remain to widespread sequencing of individuals. However, advances in high-throughput sequencing technologies have made possible initial strides in understanding the fundamental genetic basis for inherited disease, and sequencing personal genomes may someday allow for individualization of health care to genetics. Ten years out from the completion of the human genome sequence, we are about to enter an era in which a vast amount of sequencing information will be available to medical researchers and, ultimately, healthcare professionals. It is incumbent on physicians and scientists as stewards of this technology to ensure that high-quality sequence data are incorporated appropriately into research and clinical endeavors.

Sources of Funding

Dr Dewey was supported by NIH/NHLBI training grant T32 HL094274-01A2 and the Stanford Dean's Postdoctoral Research Fellowship. Dr Wheeler was supported by NIH National Research Service Award fellowship F32 HL097462. Dr Ashley was supported by NIH/NHLBI KO8 HL083914, NIH New Investigator DP2 Award OD004613, and a grant from the Breetwor Family Foundation.

Disclosures

Dr Quake is a founder, consultant, and equity holder in Helicos BioSciences and a founder, consultant, and shareholder of Fluidigm Corp. Dr Ashley is a founder and stockholder in Personalis. Dr Dewey is a stockholder and consultant for Personalis.

  • © 2012 American Heart Association, Inc.

References

  1. 1.↵
    International Human Genome Sequencing Consortium. Initial sequencing and analysis of the human genome. Nature. 2001; 409: 860– 921.
    OpenUrlCrossRefPubMed
  2. 2.↵
    International Human Genome Sequencing Consortium. Finishing the euchromatic sequence of the human genome. Nature. 2004; 431: 931– 945.
    OpenUrlCrossRefPubMed
  3. 3.↵
    1000 Genomes Project Consortium. A map of human genome variation from population-scale sequencing. Nature. 2010; 467: 1061– 1073.
    OpenUrlCrossRefPubMed
  4. 4.↵
    1. Watson JD,
    2. Crick FH
    . Molecular structure of nucleic acids; a structure for deoxyribose nucleic acid. Nature. 1953; 171: 737– 738.
    OpenUrlCrossRefPubMed
  5. 5.↵
    1. Maxam AM,
    2. Gilbert W
    . A new method for sequencing DNA. Proc Natl Acad Sci U S A. 1977; 74: 560– 564.
    OpenUrlAbstract/FREE Full Text
  6. 6.↵
    1. Sanger F,
    2. Nicklen S,
    3. Coulson AR
    . DNA sequencing with chain-terminating inhibitors. Proc Natl Acad Sci U S A. 1977; 74: 5463– 5467.
    OpenUrlAbstract/FREE Full Text
  7. 7.↵
    1. Venter JC,
    2. Adams MD,
    3. Myers EW,
    4. Li PW,
    5. Mural RJ,
    6. Sutton GG,
    7. Smith HO,
    8. Yandell M,
    9. Evans CA,
    10. Holt RA,
    11. Gocayne JD,
    12. Amanatides P,
    13. Ballew RM,
    14. Huson DH,
    15. Wortman JR,
    16. Zhang Q,
    17. Kodira CD,
    18. Zheng XH,
    19. Chen L,
    20. Skupski M,
    21. Subramanian G,
    22. Thomas PD,
    23. Zhang J,
    24. Gabor Miklos GL,
    25. Nelson C,
    26. Broder S,
    27. Clark AG,
    28. Nadeau J,
    29. McKusick VA,
    30. Zinder N,
    31. Levine AJ,
    32. Roberts RJ,
    33. Simon M,
    34. Slayman C,
    35. Hunkapiller M,
    36. Bolanos R,
    37. Delcher A,
    38. Dew I,
    39. Fasulo D,
    40. Flanigan M,
    41. Florea L,
    42. Halpern A,
    43. Hannenhalli S,
    44. Kravitz S,
    45. Levy S,
    46. Mobarry C,
    47. Reinert K,
    48. Remington K,
    49. Abu-Threideh J,
    50. Beasley E,
    51. Biddick K,
    52. Bonazzi V,
    53. Brandon R,
    54. Cargill M,
    55. Chandramouliswaran I,
    56. Charlab R,
    57. Chaturvedi K,
    58. Deng Z,
    59. Di Francesco V,
    60. Dunn P,
    61. Eilbeck K,
    62. Evangelista C,
    63. Gabrielian AE,
    64. Gan W,
    65. Ge W,
    66. Gong F,
    67. Gu Z,
    68. Guan P,
    69. Heiman TJ,
    70. Higgins ME,
    71. Ji RR,
    72. Ke Z,
    73. Ketchum KA,
    74. Lai Z,
    75. Lei Y,
    76. Li Z,
    77. Li J,
    78. Liang Y,
    79. Lin X,
    80. Lu F,
    81. Merkulov GV,
    82. Milshina N,
    83. Moore HM,
    84. Naik AK,
    85. Narayan VA,
    86. Neelam B,
    87. Nusskern D,
    88. Rusch DB,
    89. Salzberg S,
    90. Shao W,
    91. Shue B,
    92. Sun J,
    93. Wang Z,
    94. Wang A,
    95. Wang X,
    96. Wang J,
    97. Wei M,
    98. Wides R,
    99. Xiao C,
    100. Yan C,
    101. Yao A,
    102. Ye J,
    103. Zhan M,
    104. Zhang W,
    105. Zhang H,
    106. Zhao Q,
    107. Zheng L,
    108. Zhong F,
    109. Zhong W,
    110. Zhu S,
    111. Zhao S,
    112. Gilbert D,
    113. Baumhueter S,
    114. Spier G,
    115. Carter C,
    116. Cravchik A,
    117. Woodage T,
    118. Ali F,
    119. An H,
    120. Awe A,
    121. Baldwin D,
    122. Baden H,
    123. Barnstead M,
    124. Barrow I,
    125. Beeson K,
    126. Busam D,
    127. Carver A,
    128. Center A,
    129. Cheng ML,
    130. Curry L,
    131. Danaher S,
    132. Davenport L,
    133. Desilets R,
    134. Dietz S,
    135. Dodson K,
    136. Doup L,
    137. Ferriera S,
    138. Garg N,
    139. Gluecksmann A,
    140. Hart B,
    141. Haynes J,
    142. Haynes C,
    143. Heiner C,
    144. Hladun S,
    145. Hostin D,
    146. Houck J,
    147. Howland T,
    148. Ibegwam C,
    149. Johnson J,
    150. Kalush F,
    151. Kline L,
    152. Koduru S,
    153. Love A,
    154. Mann F,
    155. May D,
    156. McCawley S,
    157. McIntosh T,
    158. McMullen I,
    159. Moy M,
    160. Moy L,
    161. Murphy B,
    162. Nelson K,
    163. Pfannkoch C,
    164. Pratts E,
    165. Puri V,
    166. Qureshi H,
    167. Reardon M,
    168. Rodriguez R,
    169. Rogers YH,
    170. Romblad D,
    171. Ruhfel B,
    172. Scott R,
    173. Sitter C,
    174. Smallwood M,
    175. Stewart E,
    176. Strong R,
    177. Suh E,
    178. Thomas R,
    179. Tint NN,
    180. Tse S,
    181. Vech C,
    182. Wang G,
    183. Wetter J,
    184. Williams S,
    185. Williams M,
    186. Windsor S,
    187. Winn-Deen E,
    188. Wolfe K,
    189. Zaveri J,
    190. Zaveri K,
    191. Abril JF,
    192. Guigo R,
    193. Campbell MJ,
    194. Sjolander KV,
    195. Karlak B,
    196. Kejariwal A,
    197. Mi H,
    198. Lazareva B,
    199. Hatton T,
    200. Narechania A,
    201. Diemer K,
    202. Muruganujan A,
    203. Guo N,
    204. Sato S,
    205. Bafna V,
    206. Istrail S,
    207. Lippert R,
    208. Schwartz R,
    209. Walenz B,
    210. Yooseph S,
    211. Allen D,
    212. Basu A,
    213. Baxendale J,
    214. Blick L,
    215. Caminha M,
    216. Carnes-Stine J,
    217. Caulk P,
    218. Chiang YH,
    219. Coyne M,
    220. Dahlke C,
    221. Mays A,
    222. Dombroski M,
    223. Donnelly M,
    224. Ely D,
    225. Esparham S,
    226. Fosler C,
    227. Gire H,
    228. Glanowski S,
    229. Glasser K,
    230. Glodek A,
    231. Gorokhov M,
    232. Graham K,
    233. Gropman B,
    234. Harris M,
    235. Heil J,
    236. Henderson S,
    237. Hoover J,
    238. Jennings D,
    239. Jordan C,
    240. Jordan J,
    241. Kasha J,
    242. Kagan L,
    243. Kraft C,
    244. Levitsky A,
    245. Lewis M,
    246. Liu X,
    247. Lopez J,
    248. Ma D,
    249. Majoros W,
    250. McDaniel J,
    251. Murphy S,
    252. Newman M,
    253. Nguyen T,
    254. Nguyen N,
    255. Nodell M,
    256. Pan S,
    257. Peck J,
    258. Peterson M,
    259. Rowe W,
    260. Sanders R,
    261. Scott J,
    262. Simpson M,
    263. Smith T,
    264. Sprague A,
    265. Stockwell T,
    266. Turner R,
    267. Venter E,
    268. Wang M,
    269. Wen M,
    270. Wu D,
    271. Wu M,
    272. Xia A,
    273. Zandieh A,
    274. Zhu X
    . The sequence of the human genome. Science. 2001; 291: 1304– 1351.
    OpenUrlAbstract/FREE Full Text
  8. 8.↵
    1. Mardis ER
    . A decade's perspective on DNA sequencing technology. Nature. 2011; 470: 198– 203.
    OpenUrlCrossRefPubMed
  9. 9.↵
    1. Metzker ML
    . Sequencing technologies – the next generation. Nat Rev Genet. 2010; 11: 31– 46.
    OpenUrlCrossRefPubMed
  10. 10.↵
    1. Wheeler DA,
    2. Srinivasan M,
    3. Egholm M,
    4. Shen Y,
    5. Chen L,
    6. McGuire A,
    7. He W,
    8. Chen YJ,
    9. Makhijani V,
    10. Roth GT,
    11. Gomes X,
    12. Tartaro K,
    13. Niazi F,
    14. Turcotte CL,
    15. Irzyk GP,
    16. Lupski JR,
    17. Chinault C,
    18. Song XZ,
    19. Liu Y,
    20. Yuan Y,
    21. Nazareth L,
    22. Qin X,
    23. Muzny DM,
    24. Margulies M,
    25. Weinstock GM,
    26. Gibbs RA,
    27. Rothberg JM
    . The complete genome of an individual by massively parallel DNA sequencing. Nature. 2008; 452: 872– 876.
    OpenUrlCrossRefPubMed
  11. 11.↵
    1. Valouev A,
    2. Ichikawa J,
    3. Tonthat T,
    4. Stuart J,
    5. Ranade S,
    6. Peckham H,
    7. Zeng K,
    8. Malek JA,
    9. Costa G,
    10. McKernan K,
    11. Sidow A,
    12. Fire A,
    13. Johnson SM
    . A high-resolution, nucleosome position map of C. elegans reveals a lack of universal sequence-dictated positioning. Genome Res. 2008; 18: 1051– 1063.
    OpenUrlAbstract/FREE Full Text
  12. 12.↵
    1. Bentley DR,
    2. Balasubramanian S,
    3. Swerdlow HP,
    4. Smith GP,
    5. Milton J,
    6. Brown CG,
    7. Hall KP,
    8. Evers DJ,
    9. Barnes CL,
    10. Bignell HR,
    11. Boutell JM,
    12. Bryant J,
    13. Carter RJ,
    14. Keira Cheetham R,
    15. Cox AJ,
    16. Ellis DJ,
    17. Flatbush MR,
    18. Gormley NA,
    19. Humphray SJ,
    20. Irving LJ,
    21. Karbelashvili MS,
    22. Kirk SM,
    23. Li H,
    24. Liu X,
    25. Maisinger KS,
    26. Murray LJ,
    27. Obradovic B,
    28. Ost T,
    29. Parkinson ML,
    30. Pratt MR,
    31. Rasolonjatovo IM,
    32. Reed MT,
    33. Rigatti R,
    34. Rodighiero C,
    35. Ross MT,
    36. Sabot A,
    37. Sankar SV,
    38. Scally A,
    39. Schroth GP,
    40. Smith ME,
    41. Smith VP,
    42. Spiridou A,
    43. Torrance PE,
    44. Tzonev SS,
    45. Vermaas EH,
    46. Walter K,
    47. Wu X,
    48. Zhang L,
    49. Alam MD,
    50. Anastasi C,
    51. Aniebo IC,
    52. Bailey DM,
    53. Bancarz IR,
    54. Banerjee S,
    55. Barbour SG,
    56. Baybayan PA,
    57. Benoit VA,
    58. Benson KF,
    59. Bevis C,
    60. Black PJ,
    61. Boodhun A,
    62. Brennan JS,
    63. Bridgham JA,
    64. Brown RC,
    65. Brown AA,
    66. Buermann DH,
    67. Bundu AA,
    68. Burrows JC,
    69. Carter NP,
    70. Castillo N,
    71. Chiara ECM,
    72. Chang S,
    73. Neil Cooley R,
    74. Crake NR,
    75. Dada OO,
    76. Diakoumakos KD,
    77. Dominguez-Fernandez B,
    78. Earnshaw DJ,
    79. Egbujor UC,
    80. Elmore DW,
    81. Etchin SS,
    82. Ewan MR,
    83. Fedurco M,
    84. Fraser LJ,
    85. Fuentes Fajardo KV,
    86. Scott Furey W,
    87. George D,
    88. Gietzen KJ,
    89. Goddard CP,
    90. Golda GS,
    91. Granieri PA,
    92. Green DE,
    93. Gustafson DL,
    94. Hansen NF,
    95. Harnish K,
    96. Haudenschild CD,
    97. Heyer NI,
    98. Hims MM,
    99. Ho JT,
    100. Horgan AM,
    101. Hoschler K,
    102. Hurwitz S,
    103. Ivanov DV,
    104. Johnson MQ,
    105. James T,
    106. Huw Jones TA,
    107. Kang GD,
    108. Kerelska TH,
    109. Kersey AD,
    110. Khrebtukova I,
    111. Kindwall AP,
    112. Kingsbury Z,
    113. Kokko-Gonzales PI,
    114. Kumar A,
    115. Laurent MA,
    116. Lawley CT,
    117. Lee SE,
    118. Lee X,
    119. Liao AK,
    120. Loch JA,
    121. Lok M,
    122. Luo S,
    123. Mammen RM,
    124. Martin JW,
    125. McCauley PG,
    126. McNitt P,
    127. Mehta P,
    128. Moon KW,
    129. Mullens JW,
    130. Newington T,
    131. Ning Z,
    132. Ling Ng B,
    133. Novo SM,
    134. O'Neill MJ,
    135. Osborne MA,
    136. Osnowski A,
    137. Ostadan O,
    138. Paraschos LL,
    139. Pickering L,
    140. Pike AC,
    141. Chris Pinkard D,
    142. Pliskin DP,
    143. Podhasky J,
    144. Quijano VJ,
    145. Raczy C,
    146. Rae VH,
    147. Rawlings SR,
    148. Chiva Rodriguez A,
    149. Roe PM,
    150. Rogers J,
    151. Rogert Bacigalupo MC,
    152. Romanov N,
    153. Romieu A,
    154. Roth RK,
    155. Rourke NJ,
    156. Ruediger ST,
    157. Rusman E,
    158. Sanches-Kuiper RM,
    159. Schenker MR,
    160. Seoane JM,
    161. Shaw RJ,
    162. Shiver MK,
    163. Short SW,
    164. Sizto NL,
    165. Sluis JP,
    166. Smith MA,
    167. Ernest Sohna Sohna J,
    168. Spence EJ,
    169. Stevens K,
    170. Sutton N,
    171. Szajkowski L,
    172. Tregidgo CL,
    173. Turcatti G,
    174. Vandevondele S,
    175. Verhovsky Y,
    176. Virk SM,
    177. Wakelin S,
    178. Walcott GC,
    179. Wang J,
    180. Worsley GJ,
    181. Yan J,
    182. Yau L,
    183. Zuerlein M,
    184. Mullikin JC,
    185. Hurles ME,
    186. McCooke NJ,
    187. West JS,
    188. Oaks FL,
    189. Lundberg PL,
    190. Klenerman D,
    191. Durbin R,
    192. Smith AJ
    . Accurate whole human genome sequencing using reversible terminator chemistry. Nature. 2008; 456: 53– 59.
    OpenUrlCrossRefPubMed
  13. 13.↵
    1. Drmanac R,
    2. Sparks AB,
    3. Callow MJ,
    4. Halpern AL,
    5. Burns NL,
    6. Kermani BG,
    7. Carnevali P,
    8. Nazarenko I,
    9. Nilsen GB,
    10. Yeung G,
    11. Dahl F,
    12. Fernandez A,
    13. Staker B,
    14. Pant KP,
    15. Baccash J,
    16. Borcherding AP,
    17. Brownley A,
    18. Cedeno R,
    19. Chen L,
    20. Chernikoff D,
    21. Cheung A,
    22. Chirita R,
    23. Curson B,
    24. Ebert JC,
    25. Hacker CR,
    26. Hartlage R,
    27. Hauser B,
    28. Huang S,
    29. Jiang Y,
    30. Karpinchyk V,
    31. Koenig M,
    32. Kong C,
    33. Landers T,
    34. Le C,
    35. Liu J,
    36. McBride CE,
    37. Morenzoni M,
    38. Morey RE,
    39. Mutch K,
    40. Perazich H,
    41. Perry K,
    42. Peters BA,
    43. Peterson J,
    44. Pethiyagoda CL,
    45. Pothuraju K,
    46. Richter C,
    47. Rosenbaum AM,
    48. Roy S,
    49. Shafto J,
    50. Sharanhovich U,
    51. Shannon KW,
    52. Sheppy CG,
    53. Sun M,
    54. Thakuria JV,
    55. Tran A,
    56. Vu D,
    57. Zaranek AW,
    58. Wu X,
    59. Drmanac S,
    60. Oliphant AR,
    61. Banyai WC,
    62. Martin B,
    63. Ballinger DG,
    64. Church GM,
    65. Reid CA
    . Human genome sequencing using unchained base reads on self-assembling DNA nanoarrays. Science. 2010; 327: 78– 81.
    OpenUrlAbstract/FREE Full Text
  14. 14.↵
    1. Pushkarev D,
    2. Neff NF,
    3. Quake SR
    . Single-molecule sequencing of an individual human genome. Nat Biotechnol. 2009; 27: 847– 850.
    OpenUrlCrossRefPubMed
  15. 15.↵
    1. Ozsolak F,
    2. Platt AR,
    3. Jones DR,
    4. Reifenberger JG,
    5. Sass LE,
    6. McInerney P,
    7. Thompson JF,
    8. Bowers J,
    9. Jarosz M,
    10. Milos PM
    . Direct RNA sequencing. Nature. 2009; 461: 814– 818.
    OpenUrlCrossRefPubMed
  16. 16.↵
    1. Eid J,
    2. Fehr A,
    3. Gray J,
    4. Luong K,
    5. Lyle J,
    6. Otto G,
    7. Peluso P,
    8. Rank D,
    9. Baybayan P,
    10. Bettman B,
    11. Bibillo A,
    12. Bjornson K,
    13. Chaudhuri B,
    14. Christians F,
    15. Cicero R,
    16. Clark S,
    17. Dalal R,
    18. Dewinter A,
    19. Dixon J,
    20. Foquet M,
    21. Gaertner A,
    22. Hardenbol P,
    23. Heiner C,
    24. Hester K,
    25. Holden D,
    26. Kearns G,
    27. Kong X,
    28. Kuse R,
    29. Lacroix Y,
    30. Lin S,
    31. Lundquist P,
    32. Ma C,
    33. Marks P,
    34. Maxham M,
    35. Murphy D,
    36. Park I,
    37. Pham T,
    38. Phillips M,
    39. Roy J,
    40. Sebra R,
    41. Shen G,
    42. Sorenson J,
    43. Tomaney A,
    44. Travers K,
    45. Trulson M,
    46. Vieceli J,
    47. Wegener J,
    48. Wu D,
    49. Yang A,
    50. Zaccarin D,
    51. Zhao P,
    52. Zhong F,
    53. Korlach J,
    54. Turner S
    . Real-time DNA sequencing from single polymerase molecules. Science. 2009; 323: 133– 138.
    OpenUrlAbstract/FREE Full Text
  17. 17.↵
    1. Flusberg BA,
    2. Webster DR,
    3. Lee JH,
    4. Travers KJ,
    5. Olivares EC,
    6. Clark TA,
    7. Korlach J,
    8. Turner SW
    . Direct detection of DNA methylation during single-molecule, real-time sequencing. Nat Methods. 2010; 7: 461– 465.
    OpenUrlCrossRefPubMed
  18. 18.↵
    1. Rothberg JM,
    2. Hinz W,
    3. Rearick TM,
    4. Schultz J,
    5. Mileski W,
    6. Davey M,
    7. Leamon JH,
    8. Johnson K,
    9. Milgrew MJ,
    10. Edwards M,
    11. Hoon J,
    12. Simons JF,
    13. Marran D,
    14. Myers JW,
    15. Davidson JF,
    16. Branting A,
    17. Nobile JR,
    18. Puc BP,
    19. Light D,
    20. Clark TA,
    21. Huber M,
    22. Branciforte JT,
    23. Stoner IB,
    24. Cawley SE,
    25. Lyons M,
    26. Fu Y,
    27. Homer N,
    28. Sedova M,
    29. Miao X,
    30. Reed B,
    31. Sabina J,
    32. Feierstein E,
    33. Schorn M,
    34. Alanjary M,
    35. Dimalanta E,
    36. Dressman D,
    37. Kasinskas R,
    38. Sokolsky T,
    39. Fidanza JA,
    40. Namsaraev E,
    41. McKernan KJ,
    42. Williams A,
    43. Roth GT,
    44. Bustillo J
    . An integrated semiconductor device enabling non-optical genome sequencing. Nature. 2011; 475: 348– 352.
    OpenUrlCrossRefPubMed
  19. 19.↵
    1. Clarke J,
    2. Wu HC,
    3. Jayasinghe L,
    4. Patel A,
    5. Reid S,
    6. Bayley H
    . Continuous base identification for single-molecule nanopore DNA sequencing. Nat Nanotechnol. 2009; 4: 265– 270.
    OpenUrlCrossRefPubMed
  20. 20.↵
    1. Lieberman KR,
    2. Cherf GM,
    3. Doody MJ,
    4. Olasagasti F,
    5. Kolodji Y,
    6. Akeson M
    . Processive replication of single DNA molecules in a nanopore catalyzed by phi29 DNA polymerase. J Am Chem Soc. 2010; 132: 17961– 17972.
    OpenUrlCrossRefPubMed
  21. 21.↵
    1. Wendell D,
    2. Jing P,
    3. Geng J,
    4. Subramaniam V,
    5. Lee TJ,
    6. Montemagno C,
    7. Guo P
    . Translocation of double-stranded DNA through membrane-adapted phi29 motor protein nanopores. Nat Nanotechnol. 2009; 4: 765– 772.
    OpenUrlCrossRefPubMed
  22. 22.↵
    1. Pruitt KD,
    2. Tatusova T,
    3. Maglott DR
    . NCBI reference sequences (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res. 2007; 35: D61– D65.
    OpenUrlAbstract/FREE Full Text
  23. 23.↵
    1. Dewey FE,
    2. Chen R,
    3. Cordero SP,
    4. Ormond KE,
    5. Caleshu C,
    6. Karczewski KJ,
    7. Carrillo MW,
    8. Wheeler MT,
    9. Dudley JT,
    10. Byrnes JK,
    11. Corenejo OE,
    12. Knowles JW,
    13. Woon M,
    14. Sangkuhl K,
    15. Gong L,
    16. Thorn CF,
    17. Hebert JM,
    18. Capriotti E,
    19. David SP,
    20. Pavlovic A,
    21. West A,
    22. Thakuria J,
    23. Ball MP,
    24. Zaranek AW,
    25. Rehm HL,
    26. Church GM,
    27. West JS,
    28. Bustamante CD,
    29. Snyder M,
    30. Altman RB,
    31. Klein RJ,
    32. Butte AJ,
    33. Ashley EA
    . Phased whole genome genetic risk in a family quartet using a major allele reference sequence. PLoS Genet. 2011; 7: e1002280.
    OpenUrlCrossRefPubMed
  24. 24.↵
    1. Chen R,
    2. Butte AJ
    . The reference human genome demonstrates high risk of type 1 diabetes and other disorders. Pac Symp Biocomput. 2011; 231– 242.
  25. 25.↵
    1. Li H,
    2. Homer N
    . A survey of sequence alignment algorithms for next-generation sequencing. Brief Bioinform. 2010; 11: 473– 483.
    OpenUrlAbstract/FREE Full Text
  26. 26.↵
    1. Li H,
    2. Ruan J,
    3. Durbin R
    . Mapping short DNA sequencing reads and calling variants using mapping quality scores. Genome Res. 2008; 18: 1851– 1858.
    OpenUrlAbstract/FREE Full Text
  27. 27.↵
    1. Nielsen R,
    2. Paul JS,
    3. Albrechtsen A,
    4. Song YS
    . Genotype and SNP calling from next-generation sequencing data. Nature reviews. Genetics. 2011; 12: 443– 451.
    OpenUrlPubMed
  28. 28.↵
    1. Li H,
    2. Handsaker B,
    3. Wysoker A,
    4. Fennell T,
    5. Ruan J,
    6. Homer N,
    7. Marth G,
    8. Abecasis G,
    9. Durbin R
    . The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009; 25: 2078– 2079.
    OpenUrlAbstract/FREE Full Text
  29. 29.↵
    1. McKenna A,
    2. Hanna M,
    3. Banks E,
    4. Sivachenko A,
    5. Cibulskis K,
    6. Kernytsky A,
    7. Garimella K,
    8. Altshuler D,
    9. Gabriel S,
    10. Daly M,
    11. DePristo MA
    . The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010; 20: 1297– 1303.
    OpenUrlAbstract/FREE Full Text
  30. 30.↵
    1. Weiss LA,
    2. Shen Y,
    3. Korn JM,
    4. Arking DE,
    5. Miller DT,
    6. Fossdal R,
    7. Saemundsen E,
    8. Stefansson H,
    9. Ferreira MA,
    10. Green T,
    11. Platt OS,
    12. Ruderfer DM,
    13. Walsh CA,
    14. Altshuler D,
    15. Chakravarti A,
    16. Tanzi RE,
    17. Stefansson K,
    18. Santangelo SL,
    19. Gusella JF,
    20. Sklar P,
    21. Wu BL,
    22. Daly MJ
    . Association between microdeletion and microduplication at 16p11.2 and autism. N Engl J Med. 2008; 358: 667– 675.
    OpenUrlCrossRefPubMed
  31. 31.↵
    1. Stefansson H,
    2. Rujescu D,
    3. Cichon S,
    4. Pietilainen OP,
    5. Ingason A,
    6. Steinberg S,
    7. Fossdal R,
    8. Sigurdsson E,
    9. Sigmundsson T,
    10. Buizer-Voskamp JE,
    11. Hansen T,
    12. Jakobsen KD,
    13. Muglia P,
    14. Francks C,
    15. Matthews PM,
    16. Gylfason A,
    17. Halldorsson BV,
    18. Gudbjartsson D,
    19. Thorgeirsson TE,
    20. Sigurdsson A,
    21. Jonasdottir A,
    22. Bjornsson A,
    23. Mattiasdottir S,
    24. Blondal T,
    25. Haraldsson M,
    26. Magnusdottir BB,
    27. Giegling I,
    28. Moller HJ,
    29. Hartmann A,
    30. Shianna KV,
    31. Ge D,
    32. Need AC,
    33. Crombie C,
    34. Fraser G,
    35. Walker N,
    36. Lonnqvist J,
    37. Suvisaari J,
    38. Tuulio-Henriksson A,
    39. Paunio T,
    40. Toulopoulou T,
    41. Bramon E,
    42. Di Forti M,
    43. Murray R,
    44. Ruggeri M,
    45. Vassos E,
    46. Tosato S,
    47. Walshe M,
    48. Li T,
    49. Vasilescu C,
    50. Muhleisen TW,
    51. Wang AG,
    52. Ullum H,
    53. Djurovic S,
    54. Melle I,
    55. Olesen J,
    56. Kiemeney LA,
    57. Franke B,
    58. Sabatti C,
    59. Freimer NB,
    60. Gulcher JR,
    61. Thorsteinsdottir U,
    62. Kong A,
    63. Andreassen OA,
    64. Ophoff RA,
    65. Georgi A,
    66. Rietschel M,
    67. Werge T,
    68. Petursson H,
    69. Goldstein DB,
    70. Nothen MM,
    71. Peltonen L,
    72. Collier DA,
    73. St Clair D,
    74. Stefansson K
    . Large recurrent microdeletions associated with schizophrenia. Nature. 2008; 455: 232– 236.
    OpenUrlCrossRefPubMed
  32. 32.↵
    1. Mefford HC,
    2. Sharp AJ,
    3. Baker C,
    4. Itsara A,
    5. Jiang Z,
    6. Buysse K,
    7. Huang S,
    8. Maloney VK,
    9. Crolla JA,
    10. Baralle D,
    11. Collins A,
    12. Mercer C,
    13. Norga K,
    14. de Ravel T,
    15. Devriendt K,
    16. Bongers EM,
    17. de Leeuw N,
    18. Reardon W,
    19. Gimelli S,
    20. Bena F,
    21. Hennekam RC,
    22. Male A,
    23. Gaunt L,
    24. Clayton-Smith J,
    25. Simonic I,
    26. Park SM,
    27. Mehta SG,
    28. Nik-Zainal S,
    29. Woods CG,
    30. Firth HV,
    31. Parkin G,
    32. Fichera M,
    33. Reitano S,
    34. Lo Giudice M,
    35. Li KE,
    36. Casuga I,
    37. Broomer A,
    38. Conrad B,
    39. Schwerzmann M,
    40. Raber L,
    41. Gallati S,
    42. Striano P,
    43. Coppola A,
    44. Tolmie JL,
    45. Tobias ES,
    46. Lilley C,
    47. Armengol L,
    48. Spysschaert Y,
    49. Verloo P,
    50. De Coene A,
    51. Goossens L,
    52. Mortier G,
    53. Speleman F,
    54. van Binsbergen E,
    55. Nelen MR,
    56. Hochstenbach R,
    57. Poot M,
    58. Gallagher L,
    59. Gill M,
    60. McClellan J,
    61. King MC,
    62. Regan R,
    63. Skinner C,
    64. Stevenson RE,
    65. Antonarakis SE,
    66. Chen C,
    67. Estivill X,
    68. Menten B,
    69. Gimelli G,
    70. Gribble S,
    71. Schwartz S,
    72. Sutcliffe JS,
    73. Walsh T,
    74. Knight SJ,
    75. Sebat J,
    76. Romano C,
    77. Schwartz CE,
    78. Veltman JA,
    79. de Vries BB,
    80. Vermeesch JR,
    81. Barber JC,
    82. Willatt L,
    83. Tassabehji M,
    84. Eichler EE
    . Recurrent rearrangements of chromosome 1q21.1 and variable pediatric phenotypes. N Engl J Med. 2008; 359: 1685– 1699.
    OpenUrlCrossRefPubMed
  33. 33.↵
    1. McCarroll SA,
    2. Huett A,
    3. Kuballa P,
    4. Chilewski SD,
    5. Landry A,
    6. Goyette P,
    7. Zody MC,
    8. Hall JL,
    9. Brant SR,
    10. Cho JH,
    11. Duerr RH,
    12. Silverberg MS,
    13. Taylor KD,
    14. Rioux JD,
    15. Altshuler D,
    16. Daly MJ,
    17. Xavier RJ
    . Deletion polymorphism upstream of IRGM associated with altered IRGM expression and Crohn's disease. Nat Genet. 2008; 40: 1107– 1112.
    OpenUrlCrossRefPubMed
  34. 34.↵
    1. Moreno-De-Luca D,
    2. Mulle JG,
    3. Kaminsky EB,
    4. Sanders SJ,
    5. Myers SM,
    6. Adam MP,
    7. Pakula AT,
    8. Eisenhauer NJ,
    9. Uhas K,
    10. Weik L,
    11. Guy L,
    12. Care ME,
    13. Morel CF,
    14. Boni C,
    15. Salbert BA,
    16. Chandrareddy A,
    17. Demmer LA,
    18. Chow EW,
    19. Surti U,
    20. Aradhya S,
    21. Pickering DL,
    22. Golden DM,
    23. Sanger WG,
    24. Aston E,
    25. Brothman AR,
    26. Gliem TJ,
    27. Thorland EC,
    28. Ackley T,
    29. Iyer R,
    30. Huang S,
    31. Barber JC,
    32. Crolla JA,
    33. Warren ST,
    34. Martin CL,
    35. Ledbetter DH
    . Deletion 17q12 is a recurrent copy number variant that confers high risk of autism and schizophrenia. Am J Hum Genet. 2010; 87: 618– 630.
    OpenUrlCrossRefPubMed
  35. 35.↵
    1. Norton N,
    2. Li D,
    3. Rieder MJ,
    4. Siegfried JD,
    5. Rampersaud E,
    6. Zuchner S,
    7. Mangos S,
    8. Gonzalez-Quintana J,
    9. Wang L,
    10. McGee S,
    11. Reiser J,
    12. Martin E,
    13. Nickerson DA,
    14. Hershberger RE
    . Genome-wide studies of copy number variation and exome sequencing identify rare variants in BAG3 as a cause of dilated cardiomyopathy. Am J Hum Genet. 2011; 88: 273– 282.
    OpenUrlCrossRefPubMed
  36. 36.↵
    1. Mefford HC,
    2. Eichler EE
    . Duplication hotspots, rare genomic disorders, and common disease. Curr Opin Genet Dev. 2009; 19: 196– 204.
    OpenUrlCrossRefPubMed
  37. 37.↵
    1. Korbel JO,
    2. Urban AE,
    3. Affourtit JP,
    4. Godwin B,
    5. Grubert F,
    6. Simons JF,
    7. Kim PM,
    8. Palejev D,
    9. Carriero NJ,
    10. Du L,
    11. Taillon BE,
    12. Chen Z,
    13. Tanzer A,
    14. Saunders AC,
    15. Chi J,
    16. Yang F,
    17. Carter NP,
    18. Hurles ME,
    19. Weissman SM,
    20. Harkins TT,
    21. Gerstein MB,
    22. Egholm M,
    23. Snyder M
    . Paired-end mapping reveals extensive structural variation in the human genome. Science. 2007; 318: 420– 426.
    OpenUrlAbstract/FREE Full Text
  38. 38.↵
    1. Chen K,
    2. Wallis JW,
    3. McLellan MD,
    4. Larson DE,
    5. Kalicki JM,
    6. Pohl CS,
    7. McGrath SD,
    8. Wendl MC,
    9. Zhang Q,
    10. Locke DP,
    11. Shi X,
    12. Fulton RS,
    13. Ley TJ,
    14. Wilson RK,
    15. Ding L,
    16. Mardis ER
    . BreakDancer: an algorithm for high-resolution mapping of genomic structural variation. Nat Methods. 2009; 6: 677– 681.
    OpenUrlCrossRefPubMed
  39. 39.↵
    1. Wang LY,
    2. Abyzov A,
    3. Korbel JO,
    4. Snyder M,
    5. Gerstein M
    . MSB: a mean-shift-based approach for the analysis of structural variation in the genome. Genome Res. 2009; 19: 106– 117.
    OpenUrlAbstract/FREE Full Text
  40. 40.↵
    1. Abyzov A,
    2. Urban AE,
    3. Snyder M,
    4. Gerstein M
    . CNVnator: an approach to discover, genotype and characterize typical and atypical CNVs from family and population genome sequencing. Genome Res. 2011; 21: 974– 984.
    OpenUrlAbstract/FREE Full Text
  41. 41.↵
    1. Yoon S,
    2. Xuan Z,
    3. Makarov V,
    4. Ye K,
    5. Sebat J
    . Sensitive and accurate detection of copy number variants using read depth of coverage. Genome Res. 2009; 19: 1586– 1592.
    OpenUrlAbstract/FREE Full Text
  42. 42.↵
    1. Sudmant PH,
    2. Kitzman JO,
    3. Antonacci F,
    4. Alkan C,
    5. Malig M,
    6. Tsalenko A,
    7. Sampas N,
    8. Bruhn L,
    9. Shendure J,
    10. Eichler EE
    . Diversity of human copy number variation and multicopy genes. Science. 2010; 330: 641– 646.
    OpenUrlAbstract/FREE Full Text
  43. 43.↵
    1. Zhang ZD,
    2. Gerstein MB
    . Detection of copy number variation from array intensity and sequencing read depth using a stepwise Bayesian model. BMC Bioinformatics. 2010; 11: 539.
    OpenUrlCrossRefPubMed
  44. 44.↵
    1. Abyzov A,
    2. Gerstein M
    . AGE: defining breakpoints of genomic structural variants at single-nucleotide resolution, through optimal alignments with gap excision. Bioinformatics. 2011; 27: 595– 603.
    OpenUrlAbstract/FREE Full Text
  45. 45.↵
    1. Lam HY,
    2. Mu XJ,
    3. Stutz AM,
    4. Tanzer A,
    5. Cayting PD,
    6. Snyder M,
    7. Kim PM,
    8. Korbel JO,
    9. Gerstein MB
    . Nucleotide-resolution analysis of structural variants using BreakSeq and a breakpoint library. Nat Biotechnol. 2010; 28: 47– 55.
    OpenUrlCrossRefPubMed
  46. 46.↵
    1. Roach JC,
    2. Glusman G,
    3. Smit AF,
    4. Huff CD,
    5. Hubley R,
    6. Shannon PT,
    7. Rowen L,
    8. Pant KP,
    9. Goodman N,
    10. Bamshad M,
    11. Shendure J,
    12. Drmanac R,
    13. Jorde LB,
    14. Hood L,
    15. Galas DJ
    . Analysis of genetic inheritance in a family quartet by whole-genome sequencing. Science. 2010; 328: 636– 639.
    OpenUrlAbstract/FREE Full Text
  47. 47.↵
    1. Browning SR,
    2. Browning BL
    . Haplotype phasing: existing methods and new developments. Nat Rev Genet. 2011; 12: 703– 714.
    OpenUrlCrossRefPubMed
  48. 48.↵
    1. Browning SR,
    2. Browning BL
    . Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering. Am J Hum Genet. 2007; 81: 1084– 1097.
    OpenUrlCrossRefPubMed
  49. 49.↵
    1. Abecasis GR,
    2. Cherny SS,
    3. Cookson WO,
    4. Cardon LR
    . Merlin–rapid analysis of dense genetic maps using sparse gene flow trees. Nat Genet. 2002; 30: 97– 101.
    OpenUrlCrossRefPubMed
  50. 50.↵
    1. Williams AL,
    2. Housman DE,
    3. Rinard MC,
    4. Gifford DK
    . Rapid haplotype inference for nuclear families. Genome Biol. 2010; 11: R108.
    OpenUrlCrossRefPubMed
  51. 51.↵
    1. Fan HC,
    2. Wang J,
    3. Potanina A,
    4. Quake SR
    . Whole-genome molecular haplotyping of single cells. Nat Biotechnol. 2011; 29: 51– 57.
    OpenUrlCrossRefPubMed
  52. 52.↵
    1. Kitzman JO,
    2. Mackenzie AP,
    3. Adey A,
    4. Hiatt JB,
    5. Patwardhan RP,
    6. Sudmant PH,
    7. Ng SB,
    8. Alkan C,
    9. Qiu R,
    10. Eichler EE,
    11. Shendure J
    . Haplotype-resolved genome sequencing of a Gujarati Indian individual. Nat Biotechnol. 2011; 29: 59– 63.
    OpenUrlCrossRefPubMed
  53. 53.↵
    1. Regalado ES,
    2. Guo DC,
    3. Villamizar C,
    4. Avidan N,
    5. Gilchrist D,
    6. McGillivray B,
    7. Clarke L,
    8. Bernier F,
    9. Santos-Cortez RL,
    10. Leal SM,
    11. Bertoli-Avella AM,
    12. Shendure J,
    13. Rieder MJ,
    14. Nickerson DA,
    15. Milewicz DM
    . Exome sequencing identifies SMAD3 mutations as a cause of familial thoracic aortic aneurysm and dissection with intracranial and other arterial aneurysms. Circ Res. 2011; 109: 680– 686.
    OpenUrlAbstract/FREE Full Text
  54. 54.↵
    1. Haack TB,
    2. Danhauser K,
    3. Haberberger B,
    4. Hoser J,
    5. Strecker V,
    6. Boehm D,
    7. Uziel G,
    8. Lamantea E,
    9. Invernizzi F,
    10. Poulton J,
    11. Rolinski B,
    12. Iuso A,
    13. Biskup S,
    14. Schmidt T,
    15. Mewes HW,
    16. Wittig I,
    17. Meitinger T,
    18. Zeviani M,
    19. Prokisch H
    . Exome sequencing identifies ACAD9 mutations as a cause of complex I deficiency. Nat Genet. 2010; 42: 1131– 1134.
    OpenUrlCrossRefPubMed
  55. 55.↵
    1. Gotz A,
    2. Tyynismaa H,
    3. Euro L,
    4. Ellonen P,
    5. Hyotylainen T,
    6. Ojala T,
    7. Hamalainen RH,
    8. Tommiska J,
    9. Raivio T,
    10. Oresic M,
    11. Karikoski R,
    12. Tammela O,
    13. Simola KO,
    14. Paetau A,
    15. Tyni T,
    16. Suomalainen A
    . Exome sequencing identifies mitochondrial alanyl-tRNA synthetase mutations in infantile mitochondrial cardiomyopathy. Am J Hum Genet. 2011; 88: 635– 642.
    OpenUrlCrossRefPubMed
  56. 56.
    1. Ng SB,
    2. Bigham AW,
    3. Buckingham KJ,
    4. Hannibal MC,
    5. McMillin MJ,
    6. Gildersleeve HI,
    7. Beck AE,
    8. Tabor HK,
    9. Cooper GM,
    10. Mefford HC,
    11. Lee C,
    12. Turner EH,
    13. Smith JD,
    14. Rieder MJ,
    15. Yoshiura K,
    16. Matsumoto N,
    17. Ohta T,
    18. Niikawa N,
    19. Nickerson DA,
    20. Bamshad MJ,
    21. Shendure J
    . Exome sequencing identifies MLL2 mutations as a cause of Kabuki syndrome. Nat Genet. 2010; 42: 790– 793.
    OpenUrlCrossRefPubMed
  57. 57.
    1. Liu W,
    2. Morito D,
    3. Takashima S,
    4. Mineharu Y,
    5. Kobayashi H,
    6. Hitomi T,
    7. Hashikata H,
    8. Matsuura N,
    9. Yamazaki S,
    10. Toyoda A,
    11. Kikuta K,
    12. Takagi Y,
    13. Harada KH,
    14. Fujiyama A,
    15. Herzig R,
    16. Krischek B,
    17. Zou L,
    18. Kim JE,
    19. Kitakaze M,
    20. Miyamoto S,
    21. Nagata K,
    22. Hashimoto N,
    23. Koizumi A
    . Identification of RNF213 as a susceptibility gene for moyamoya disease and its possible role in vascular development. PLoS One. 2011; 6: e22542.
    OpenUrlCrossRefPubMed
  58. 58.↵
    1. Holm H,
    2. Gudbjartsson DF,
    3. Sulem P,
    4. Masson G,
    5. Helgadottir HT,
    6. Zanon C,
    7. Magnusson OT,
    8. Helgason A,
    9. Saemundsdottir J,
    10. Gylfason A,
    11. Stefansdottir H,
    12. Gretarsdottir S,
    13. Matthiasson SE,
    14. Thorgeirsson GM,
    15. Jonasdottir A,
    16. Sigurdsson A,
    17. Stefansson H,
    18. Werge T,
    19. Rafnar T,
    20. Kiemeney LA,
    21. Parvez B,
    22. Muhammad R,
    23. Roden DM,
    24. Darbar D,
    25. Thorleifsson G,
    26. Walters GB,
    27. Kong A,
    28. Thorsteinsdottir U,
    29. Arnar DO,
    30. Stefansson K
    . A rare variant in MYH6 is associated with high risk of sick sinus syndrome. Nat Genet. 2011; 43: 316– 320.
    OpenUrlCrossRefPubMed
  59. 59.
    1. Sirmaci A,
    2. Walsh T,
    3. Akay H,
    4. Spiliopoulos M,
    5. Sakalar YB,
    6. Hasanefendioglu-Bayrak A,
    7. Duman D,
    8. Farooq A,
    9. King MC,
    10. Tekin M
    . MASP1 mutations in patients with facial, umbilical, coccygeal, and auditory findings of Carnevale, Malpuech, OSA, and Michels syndromes. Am J Hum Genet. 2010; 87: 679– 686.
    OpenUrlCrossRefPubMed
  60. 60.
    1. Montenegro G,
    2. Powell E,
    3. Huang J,
    4. Speziani F,
    5. Edwards YJ,
    6. Beecham G,
    7. Hulme W,
    8. Siskind C,
    9. Vance J,
    10. Shy M,
    11. Zuchner S
    . Exome sequencing allows for rapid gene identification in a Charcot-Marie-Tooth family. Ann Neurol. 2011; 69: 464– 470.
    OpenUrlCrossRefPubMed
  61. 61.↵
    1. Choi M,
    2. Scholl UI,
    3. Ji W,
    4. Liu T,
    5. Tikhonova IR,
    6. Zumbo P,
    7. Nayir A,
    8. Bakkaloglu A,
    9. Ozen S,
    10. Sanjad S,
    11. Nelson-Williams C,
    12. Farhi A,
    13. Mane S,
    14. Lifton RP
    . Genetic diagnosis by whole exome capture and massively parallel DNA sequencing. Proc Natl Acad Sci U S A. 2009; 106: 19096– 19101.
    OpenUrlAbstract/FREE Full Text
  62. 62.
    1. Bolze A,
    2. Byun M,
    3. McDonald D,
    4. Morgan NV,
    5. Abhyankar A,
    6. Premkumar L,
    7. Puel A,
    8. Bacon CM,
    9. Rieux-Laucat F,
    10. Pang K,
    11. Britland A,
    12. Abel L,
    13. Cant A,
    14. Maher ER,
    15. Riedl SJ,
    16. Hambleton S,
    17. Casanova JL
    . Whole-exome-sequencing-based discovery of human FADD deficiency. Am J Hum Genet. 2010; 87: 873– 881.
    OpenUrlCrossRefPubMed
  63. 63.
    1. Johnson JO,
    2. Mandrioli J,
    3. Benatar M,
    4. Abramzon Y,
    5. Van Deerlin VM,
    6. Trojanowski JQ,
    7. Gibbs JR,
    8. Brunetti M,
    9. Gronka S,
    10. Wuu J,
    11. Ding J,
    12. McCluskey L,
    13. Martinez-Lage M,
    14. Falcone D,
    15. Hernandez DG,
    16. Arepalli S,
    17. Chong S,
    18. Schymick JC,
    19. Rothstein J,
    20. Landi F,
    21. Wang YD,
    22. Calvo A,
    23. Mora G,
    24. Sabatelli M,
    25. Monsurro MR,
    26. Battistini S,
    27. Salvi F,
    28. Spataro R,
    29. Sola P,
    30. Borghero G,
    31. Galassi G,
    32. Scholz SW,
    33. Taylor JP,
    34. Restagno G,
    35. Chio A,
    36. Traynor BJ
    . Exome sequencing reveals VCP mutations as a cause of familial ALS. Neuron. 2010; 68: 857– 864.
    OpenUrlCrossRefPubMed
  64. 64.
    1. Musunuru K,
    2. Pirruccello JP,
    3. Do R,
    4. Peloso GM,
    5. Guiducci C,
    6. Sougnez C,
    7. Garimella KV,
    8. Fisher S,
    9. Abreu J,
    10. Barry AJ,
    11. Fennell T,
    12. Banks E,
    13. Ambrogio L,
    14. Cibulskis K,
    15. Kernytsky A,
    16. Gonzalez E,
    17. Rudzicz N,
    18. Engert JC,
    19. DePristo MA,
    20. Daly MJ,
    21. Cohen JC,
    22. Hobbs HH,
    23. Altshuler D,
    24. Schonfeld G,
    25. Gabriel SB,
    26. Yue P,
    27. Kathiresan S
    . Exome sequencing, ANGPTL3 mutations, and familial combined hypolipidemia. N Engl J Med. 2010; 363: 2220– 2227.
    OpenUrlCrossRefPubMed
  65. 65.
    1. Lalonde E,
    2. Albrecht S,
    3. Ha KC,
    4. Jacob K,
    5. Bolduc N,
    6. Polychronakos C,
    7. Dechelotte P,
    8. Majewski J,
    9. Jabado N
    . Unexpected allelic heterogeneity and spectrum of mutations in Fowler syndrome revealed by next-generation exome sequencing. Hum Mutat. 2010; 31: 918– 923.
    OpenUrlCrossRefPubMed
  66. 66.↵
    1. Worthey EA,
    2. Mayer AN,
    3. Syverson GD,
    4. Helbling D,
    5. Bonacci BB,
    6. Decker B,
    7. Serpe JM,
    8. Dasu T,
    9. Tschannen MR,
    10. Veith RL,
    11. Basehore MJ,
    12. Broeckel U,
    13. Tomita-Mitchell A,
    14. Arca MJ,
    15. Casper JT,
    16. Margolis DA,
    17. Bick DP,
    18. Hessner MJ,
    19. Routes JM,
    20. Verbsky JW,
    21. Jacob HJ,
    22. Dimmock DP
    . Making a definitive diagnosis: successful clinical application of whole exome sequencing in a child with intractable inflammatory bowel disease. Genet Med. 2011; 13: 255– 262.
    OpenUrlCrossRefPubMed
  67. 67.
    1. Edvardson S,
    2. Shaag A,
    3. Zenvirt S,
    4. Erlich Y,
    5. Hannon GJ,
    6. Shanske AL,
    7. Gomori JM,
    8. Ekstein J,
    9. Elpeleg O
    . Joubert syndrome 2 (JBTS2) in Ashkenazi Jews is associated with a TMEM216 mutation. Am J Hum Genet. 2010; 86: 93– 97.
    OpenUrlCrossRefPubMed
  68. 68.
    1. Caliskan M,
    2. Chong JX,
    3. Uricchio L,
    4. Anderson R,
    5. Chen P,
    6. Sougnez C,
    7. Garimella K,
    8. Gabriel SB,
    9. Depristo MA,
    10. Shakir K,
    11. Matern D,
    12. Das S,
    13. Waggoner D,
    14. Nicolae DL,
    15. Ober C
    . Exome sequencing reveals a novel mutation for autosomal recessive non-syndromic mental retardation in the TECR gene on chromosome 19p13. Hum Mol Genet. 2011; 20: 1285– 1289.
    OpenUrlAbstract/FREE Full Text
  69. 69.
    1. Vissers LE,
    2. de Ligt J,
    3. Gilissen C,
    4. Janssen I,
    5. Steehouwer M,
    6. de Vries P,
    7. van Lier B,
    8. Arts P,
    9. Wieskamp N,
    10. del Rosario M,
    11. van Bon BW,
    12. Hoischen A,
    13. de Vries BB,
    14. Brunner HG,
    15. Veltman JA
    . A de novo paradigm for mental retardation. Nat Genet. 2010; 42: 1109– 1112.
    OpenUrlCrossRefPubMed
  70. 70.
    1. Ng SB,
    2. Buckingham KJ,
    3. Lee C,
    4. Bigham AW,
    5. Tabor HK,
    6. Dent KM,
    7. Huff CD,
    8. Shannon PT,
    9. Jabs EW,
    10. Nickerson DA,
    11. Shendure J,
    12. Bamshad MJ
    . Exome sequencing identifies the cause of a mendelian disorder. Nat Genet. 2010; 42: 30– 35.
    OpenUrlCrossRefPubMed
  71. 71.
    1. Bonnefond A,
    2. Durand E,
    3. Sand O,
    4. De Graeve F,
    5. Gallina S,
    6. Busiah K,
    7. Lobbens S,
    8. Simon A,
    9. Bellanne-Chantelot C,
    10. Letourneau L,
    11. Scharfmann R,
    12. Delplanque J,
    13. Sladek R,
    14. Polak M,
    15. Vaxillaire M,
    16. Froguel P
    . Molecular diagnosis of neonatal diabetes mellitus using next-generation sequencing of the whole exome. PLoS One. 2010; 5: e13630.
    OpenUrlCrossRefPubMed
  72. 72.
    1. Walsh T,
    2. Shahin H,
    3. Elkan-Miller T,
    4. Lee MK,
    5. Thornton AM,
    6. Roeb W,
    7. Abu Rayyan A,
    8. Loulus S,
    9. Avraham KB,
    10. King MC,
    11. Kanaan M
    . Whole exome sequencing and homozygosity mapping identify mutation in the cell polarity protein GPSM2 as the cause of nonsyndromic hearing loss DFNB82. Am J Hum Genet. 2010; 87: 90– 94.
    OpenUrlCrossRefPubMed
  73. 73.
    1. Ostergaard P,
    2. Simpson MA,
    3. Brice G,
    4. Mansour S,
    5. Connell FC,
    6. Onoufriadis A,
    7. Child AH,
    8. Hwang J,
    9. Kalidas K,
    10. Mortimer PS,
    11. Trembath R,
    12. Jeffery S
    . Rapid identification of mutations in GJC2 in primary lymphoedema using whole exome sequencing combined with linkage analysis with delineation of the phenotype. J Med Genet. 2011; 48: 251– 255.
    OpenUrlAbstract/FREE Full Text
  74. 74.
    1. Hoischen A,
    2. van Bon BW,
    3. Gilissen C,
    4. Arts P,
    5. van Lier B,
    6. Steehouwer M,
    7. de Vries P,
    8. de Reuver R,
    9. Wieskamp N,
    10. Mortier G,
    11. Devriendt K,
    12. Amorim MZ,
    13. Revencu N,
    14. Kidd A,
    15. Barbosa M,
    16. Turner A,
    17. Smith J,
    18. Oley C,
    19. Henderson A,
    20. Hayes IM,
    21. Thompson EM,
    22. Brunner HG,
    23. de Vries BB,
    24. Veltman JA
    . De novo mutations of SETBP1 cause Schinzel-Giedion syndrome. Nat Genet. 2010; 42: 483– 485.
    OpenUrlCrossRefPubMed
  75. 75.
    1. Kalay E,
    2. Yigit G,
    3. Aslan Y,
    4. Brown KE,
    5. Pohl E,
    6. Bicknell LS,
    7. Kayserili H,
    8. Li Y,
    9. Tuysuz B,
    10. Nurnberg G,
    11. Kiess W,
    12. Koegl M,
    13. Baessmann I,
    14. Buruk K,
    15. Toraman B,
    16. Kayipmaz S,
    17. Kul S,
    18. Ikbal M,
    19. Turner DJ,
    20. Taylor MS,
    21. Aerts J,
    22. Scott C,
    23. Milstein K,
    24. Dollfus H,
    25. Wieczorek D,
    26. Brunner HG,
    27. Hurles M,
    28. Jackson AP,
    29. Rauch A,
    30. Nurnberg P,
    31. Karaguzel A,
    32. Wollnik B
    . CEP152 is a genome maintenance protein disrupted in Seckel syndrome. Nat Genet. 2011; 43: 23– 26.
    OpenUrlCrossRefPubMed
  76. 76.
    1. Wang JL,
    2. Yang X,
    3. Xia K,
    4. Hu ZM,
    5. Weng L,
    6. Jin X,
    7. Jiang H,
    8. Zhang P,
    9. Shen L,
    10. Guo JF,
    11. Li N,
    12. Li YR,
    13. Lei LF,
    14. Zhou J,
    15. Du J,
    16. Zhou YF,
    17. Pan Q,
    18. Wang J,
    19. Li RQ,
    20. Tang BS
    . TGM6 identified as a novel causative gene of spinocerebellar ataxias using exome sequencing. Brain. 2010; 133: 3510– 3518.
    OpenUrlAbstract/FREE Full Text
  77. 77.
    1. Lupski JR,
    2. Reid JG,
    3. Gonzaga-Jauregui C,
    4. Rio Deiros D,
    5. Chen DC,
    6. Nazareth L,
    7. Bainbridge M,
    8. Dinh H,
    9. Jing C,
    10. Wheeler DA,
    11. McGuire AL,
    12. Zhang F,
    13. Stankiewicz P,
    14. Halperin JJ,
    15. Yang C,
    16. Gehman C,
    17. Guo D,
    18. Irikat RK,
    19. Tom W,
    20. Fantin NJ,
    21. Muzny DM,
    22. Gibbs RA
    . Whole-genome sequencing in a patient with Charcot-Marie-Tooth neuropathy. N Engl J Med. 2010; 362: 1181– 1191.
    OpenUrlCrossRefPubMed
  78. 78.
    1. Sobreira NL,
    2. Cirulli ET,
    3. Avramopoulos D,
    4. Wohler E,
    5. Oswald GL,
    6. Stevens EL,
    7. Ge D,
    8. Shianna KV,
    9. Smith JP,
    10. Maia JM,
    11. Gumbs CE,
    12. Pevsner J,
    13. Thomas G,
    14. Valle D,
    15. Hoover-Fong JE,
    16. Goldstein DB
    . Whole-genome sequencing of a single proband together with linkage analysis identifies a Mendelian disease gene. PLoS Genet. 2010; 6: e1000991.
    OpenUrlCrossRefPubMed
  79. 79.
    1. Rios J,
    2. Stein E,
    3. Shendure J,
    4. Hobbs HH,
    5. Cohen JC
    . Identification by whole-genome resequencing of gene defect responsible for severe hypercholesterolemia. Hum Mol Genet. 2010; 19: 4313– 4318.
    OpenUrlAbstract/FREE Full Text
  80. 80.↵
    1. Li Y,
    2. Vinckenbosch N,
    3. Tian G,
    4. Huerta-Sanchez E,
    5. Jiang T,
    6. Jiang H,
    7. Albrechtsen A,
    8. Andersen G,
    9. Cao H,
    10. Korneliussen T,
    11. Grarup N,
    12. Guo Y,
    13. Hellman I,
    14. Jin X,
    15. Li Q,
    16. Liu J,
    17. Liu X,
    18. Sparso T,
    19. Tang M,
    20. Wu H,
    21. Wu R,
    22. Yu C,
    23. Zheng H,
    24. Astrup A,
    25. Bolund L,
    26. Holmkvist J,
    27. Jorgensen T,
    28. Kristiansen K,
    29. Schmitz O,
    30. Schwartz TW,
    31. Zhang X,
    32. Li R,
    33. Yang H,
    34. Wang J,
    35. Hansen T,
    36. Pedersen O,
    37. Nielsen R
    . Resequencing of 200 human exomes identifies an excess of low-frequency non-synonymous coding variants. Nat Genet. 2010; 42: 969– 972.
    OpenUrlCrossRefPubMed
  81. 81.↵
    1. Cirulli ET,
    2. Goldstein DB
    . Uncovering the roles of rare variants in common disease through whole-genome sequencing. Nat Rev Genet. 2010; 11: 415– 425.
    OpenUrlCrossRefPubMed
  82. 82.↵
    1. Zhu Q,
    2. Ge D,
    3. Maia JM,
    4. Zhu M,
    5. Petrovski S,
    6. Dickson SP,
    7. Heinzen EL,
    8. Shianna KV,
    9. Goldstein DB
    . A genome-wide comparison of the functional properties of rare and common genetic variants in humans. Am J Hum Genet. 2011; 88: 458– 468.
    OpenUrlCrossRefPubMed
  83. 83.↵
    1. Wang K,
    2. Dickson SP,
    3. Stolle CA,
    4. Krantz ID,
    5. Goldstein DB,
    6. Hakonarson H
    . Interpretation of association signals and identification of causal variants from genome-wide association studies. Am J Hum Genet. 2010; 86: 730– 742.
    OpenUrlCrossRefPubMed
  84. 84.↵
    1. Dickson SP,
    2. Wang K,
    3. Krantz I,
    4. Hakonarson H,
    5. Goldstein DB
    . Rare variants create synthetic genome-wide associations. PLoS Biol. 2010; 8: e1000294.
    OpenUrlCrossRefPubMed
  85. 85.↵
    1. Hindorff LA,
    2. Sethupathy P,
    3. Junkins HA,
    4. Ramos EM,
    5. Mehta JP,
    6. Collins FS,
    7. Manolio TA
    . Potential etiologic and functional implications of genome-wide association loci for human diseases and traits. Proc Natl Acad Sci U S A. 2009; 106: 9362– 9367.
    OpenUrlAbstract/FREE Full Text
  86. 86.↵
    1. Ashley EA,
    2. Butte AJ,
    3. Wheeler MT,
    4. Chen R,
    5. Klein TE,
    6. Dewey FE,
    7. Dudley JT,
    8. Ormond KE,
    9. Pavlovic A,
    10. Morgan AA,
    11. Pushkarev D,
    12. Neff NF,
    13. Hudgins L,
    14. Gong L,
    15. Hodges LM,
    16. Berlin DS,
    17. Thorn CF,
    18. Sangkuhl K,
    19. Hebert JM,
    20. Woon M,
    21. Sagreiya H,
    22. Whaley R,
    23. Knowles JW,
    24. Chou MF,
    25. Thakuria JV,
    26. Rosenbaum AM,
    27. Zaranek AW,
    28. Church GM,
    29. Greely HT,
    30. Quake SR,
    31. Altman RB
    . Clinical assessment incorporating a personal genome. Lancet. 2010; 375: 1525– 1535.
    OpenUrlCrossRefPubMed
  87. 87.↵
    1. Bell CJ,
    2. Dinwiddie DL,
    3. Miller NA,
    4. Hateley SL,
    5. Ganusova EE,
    6. Mudge J,
    7. Langley RJ,
    8. Zhang L,
    9. Lee CC,
    10. Schilkey FD,
    11. Sheth V,
    12. Woodward JE,
    13. Peckham HE,
    14. Schroth GP,
    15. Kim RW,
    16. Kingsmore SF
    . Carrier testing for severe childhood recessive diseases by next-generation sequencing. Sci Transl Med. 2011; 3: 65ra64.
    OpenUrl
  88. 88.↵
    1. Fan HC,
    2. Blumenfeld YJ,
    3. Chitkara U,
    4. Hudgins L,
    5. Quake SR
    . Noninvasive diagnosis of fetal aneuploidy by shotgun sequencing DNA from maternal blood. Proc Natl Acad Sci U S A. 2008; 105: 16266– 16271.
    OpenUrlAbstract/FREE Full Text
  89. 89.↵
    1. Pruitt KD,
    2. Harrow J,
    3. Harte RA,
    4. Wallin C,
    5. Diekhans M,
    6. Maglott DR,
    7. Searle S,
    8. Farrell CM,
    9. Loveland JE,
    10. Ruef BJ,
    11. Hart E,
    12. Suner MM,
    13. Landrum MJ,
    14. Aken B,
    15. Ayling S,
    16. Baertsch R,
    17. Fernandez-Banet J,
    18. Cherry JL,
    19. Curwen V,
    20. Dicuccio M,
    21. Kellis M,
    22. Lee J,
    23. Lin MF,
    24. Schuster M,
    25. Shkeda A,
    26. Amid C,
    27. Brown G,
    28. Dukhanina O,
    29. Frankish A,
    30. Hart J,
    31. Maidak BL,
    32. Mudge J,
    33. Murphy MR,
    34. Murphy T,
    35. Rajan J,
    36. Rajput B,
    37. Riddick LD,
    38. Snow C,
    39. Steward C,
    40. Webb D,
    41. Weber JA,
    42. Wilming L,
    43. Wu W,
    44. Birney E,
    45. Haussler D,
    46. Hubbard T,
    47. Ostell J,
    48. Durbin R,
    49. Lipman D
    . The consensus coding sequence (CCDS) project: identifying a common protein-coding gene set for the human and mouse genomes. Genome Res. 2009; 19: 1316– 1323.
    OpenUrlAbstract/FREE Full Text
  90. 90.↵
    1. Rhead B,
    2. Karolchik D,
    3. Kuhn RM,
    4. Hinrichs AS,
    5. Zweig AS,
    6. Fujita PA,
    7. Diekhans M,
    8. Smith KE,
    9. Rosenbloom KR,
    10. Raney BJ,
    11. Pohl A,
    12. Pheasant M,
    13. Meyer LR,
    14. Learned K,
    15. Hsu F,
    16. Hillman-Jackson J,
    17. Harte RA,
    18. Giardine B,
    19. Dreszer TR,
    20. Clawson H,
    21. Barber GP,
    22. Haussler D,
    23. Kent WJ
    . The UCSC Genome Browser database: update 2010. Nucleic Acids Res. 2010; 38: D613– D619.
    OpenUrlAbstract/FREE Full Text
  91. 91.↵
    1. Harrow J,
    2. Denoeud F,
    3. Frankish A,
    4. Reymond A,
    5. Chen CK,
    6. Chrast J,
    7. Lagarde J,
    8. Gilbert JG,
    9. Storey R,
    10. Swarbreck D,
    11. Rossier C,
    12. Ucla C,
    13. Hubbard T,
    14. Antonarakis SE,
    15. Guigo R
    . GENCODE: producing a reference annotation for ENCODE. Genome Biol. 2006; 7 (suppl 1): S4 1– 9.
    OpenUrlCrossRefPubMed
  92. 92.↵
    1. Hubbard TJ,
    2. Aken BL,
    3. Ayling S,
    4. Ballester B,
    5. Beal K,
    6. Bragin E,
    7. Brent S,
    8. Chen Y,
    9. Clapham P,
    10. Clarke L,
    11. Coates G,
    12. Fairley S,
    13. Fitzgerald S,
    14. Fernandez-Banet J,
    15. Gordon L,
    16. Graf S,
    17. Haider S,
    18. Hammond M,
    19. Holland R,
    20. Howe K,
    21. Jenkinson A,
    22. Johnson N,
    23. Kahari A,
    24. Keefe D,
    25. Keenan S,
    26. Kinsella R,
    27. Kokocinski F,
    28. Kulesha E,
    29. Lawson D,
    30. Longden I,
    31. Megy K,
    32. Meidl P,
    33. Overduin B,
    34. Parker A,
    35. Pritchard B,
    36. Rios D,
    37. Schuster M,
    38. Slater G,
    39. Smedley D,
    40. Spooner W,
    41. Spudich G,
    42. Trevanion S,
    43. Vilella A,
    44. Vogel J,
    45. White S,
    46. Wilder S,
    47. Zadissa A,
    48. Birney E,
    49. Cunningham F,
    50. Curwen V,
    51. Durbin R,
    52. Fernandez-Suarez XM,
    53. Herrero J,
    54. Kasprzyk A,
    55. Proctor G,
    56. Smith J,
    57. Searle S,
    58. Flicek P
    . Ensembl 2009. Nucleic Acids Res. 2009; 37: D690– D697.
    OpenUrlAbstract/FREE Full Text
  93. 93.↵
    1. Stenson PD,
    2. Mort M,
    3. Ball EV,
    4. Howells K,
    5. Phillips AD,
    6. Thomas NS,
    7. Cooper DN
    . The Human Gene Mutation Database: 2008 update. Genome Med. 2009; 1: 13.
    OpenUrlCrossRefPubMed
  94. 94.↵
    1. Cooper DN,
    2. Ball EV,
    3. Krawczak M
    . The human gene mutation database. Nucleic Acids Res. 1998; 26: 285– 287.
    OpenUrlAbstract/FREE Full Text
  95. 95.↵
    1. Cooper GM,
    2. Stone EA,
    3. Asimenos G,
    4. Green ED,
    5. Batzoglou S,
    6. Sidow A
    . Distribution and intensity of constraint in mammalian genomic sequence. Genome Res. 2005; 15: 901– 913.
    OpenUrlAbstract/FREE Full Text
  96. 96.↵
    1. Cooper GM,
    2. Goode DL,
    3. Ng SB,
    4. Sidow A,
    5. Bamshad MJ,
    6. Shendure J,
    7. Nickerson DA
    . Single-nucleotide evolutionary constraint scores highlight disease-causing mutations. Nat Methods. 2010; 7: 250– 251.
    OpenUrlCrossRefPubMed
  97. 97.↵
    1. Kumar P,
    2. Henikoff S,
    3. Ng PC
    . Predicting the effects of coding non-synonymous variants on protein function using the SIFT algorithm. Nat Protoc. 2009; 4: 1073– 1081.
    OpenUrlCrossRefPubMed
  98. 98.↵
    1. Ng PC,
    2. Henikoff S
    . SIFT: predicting amino acid changes that affect protein function. Nucleic Acids Res. 2003; 31: 3812– 3814.
    OpenUrlAbstract/FREE Full Text
  99. 99.↵
    1. Adzhubei IA,
    2. Schmidt S,
    3. Peshkin L,
    4. Ramensky VE,
    5. Gerasimova A,
    6. Bork P,
    7. Kondrashov AS,
    8. Sunyaev SR
    . A method and server for predicting damaging missense mutations. Nat Methods. 2010; 7: 248– 249.
    OpenUrlCrossRefPubMed
  100. 100.↵
    1. Chun S,
    2. Fay JC
    . Identification of deleterious mutations within three human genomes. Genome Res. 2009; 19: 1553– 1561.
    OpenUrlAbstract/FREE Full Text
  101. 101.↵
    1. Thorn CF,
    2. Klein TE,
    3. Altman RB
    . PharmGKB: the pharmacogenetics and pharmacogenomics knowledge base. Methods Mol Biol. 2005; 311: 179– 191.
    OpenUrlPubMed
  102. 102.↵
    1. Romeo S,
    2. Yin W,
    3. Kozlitina J,
    4. Pennacchio LA,
    5. Boerwinkle E,
    6. Hobbs HH,
    7. Cohen JC
    . Rare loss-of-function mutations in ANGPTL family members contribute to plasma triglyceride levels in humans. J Clin Invest. 2009; 119: 70– 79.
    OpenUrlPubMed
  103. 103.↵
    1. Rehman AU,
    2. Morell RJ,
    3. Belyantseva IA,
    4. Khan SY,
    5. Boger ET,
    6. Shahzad M,
    7. Ahmed ZM,
    8. Riazuddin S,
    9. Khan SN,
    10. Friedman TB
    . Targeted capture and next-generation sequencing identifies C9orf75, encoding taperin, as the mutated gene in nonsyndromic deafness DFNB79. Am J Hum Genet. 2010; 86: 378– 388.
    OpenUrlCrossRefPubMed
  104. 104.↵
    1. Berg JS,
    2. Evans JP,
    3. Leigh MW,
    4. Omran H,
    5. Bizon C,
    6. Mane K,
    7. Knowles MR,
    8. Weck KE,
    9. Zariwala MA
    . Next generation massively parallel sequencing of targeted exomes to identify genetic mutations in primary ciliary dyskinesia: implications for application to clinical testing. Genet Med. 2011; 13: 218– 229.
    OpenUrlCrossRefPubMed
  105. 105.↵
    1. Nikopoulos K,
    2. Gilissen C,
    3. Hoischen A,
    4. van Nouhuys CE,
    5. Boonstra FN,
    6. Blokland EA,
    7. Arts P,
    8. Wieskamp N,
    9. Strom TM,
    10. Ayuso C,
    11. Tilanus MA,
    12. Bouwhuis S,
    13. Mukhopadhyay A,
    14. Scheffer H,
    15. Hoefsloot LH,
    16. Veltman JA,
    17. Cremers FP,
    18. Collin RW
    . Next-generation sequencing of a 40 Mb linkage interval reveals TSPAN12 mutations in patients with familial exudative vitreoretinopathy. Am J Hum Genet. 2010; 86: 240– 247.
    OpenUrlCrossRefPubMed
  106. 106.↵
    1. Vermeer S,
    2. Hoischen A,
    3. Meijer RP,
    4. Gilissen C,
    5. Neveling K,
    6. Wieskamp N,
    7. de Brouwer A,
    8. Koenig M,
    9. Anheim M,
    10. Assoum M,
    11. Drouot N,
    12. Todorovic S,
    13. Milic-Rasic V,
    14. Lochmuller H,
    15. Stevanin G,
    16. Goizet C,
    17. David A,
    18. Durr A,
    19. Brice A,
    20. Kremer B,
    21. van de Warrenburg BP,
    22. Schijvenaars MM,
    23. Heister A,
    24. Kwint M,
    25. Arts P,
    26. van der Wijst J,
    27. Veltman J,
    28. Kamsteeg EJ,
    29. Scheffer H,
    30. Knoers N
    . Targeted next-generation sequencing of a 12.5 Mb homozygous region reveals ANO10 mutations in patients with autosomal-recessive cerebellar ataxia. Am J Hum Genet. 2010; 87: 813– 819.
    OpenUrlCrossRefPubMed
  107. 107.↵
    1. Snyder TM,
    2. Khush KK,
    3. Valantine HA,
    4. Quake SR
    . Universal noninvasive detection of solid organ transplant rejection. Proc Natl Acad Sci U S A. 2011; 108: 6229– 6234.
    OpenUrlAbstract/FREE Full Text
  108. 108.↵
    1. Pham MX,
    2. Teuteberg JJ,
    3. Kfoury AG,
    4. Starling RC,
    5. Deng MC,
    6. Cappola TP,
    7. Kao A,
    8. Anderson AS,
    9. Cotts WG,
    10. Ewald GA,
    11. Baran DA,
    12. Bogaev RC,
    13. Elashoff B,
    14. Baron H,
    15. Yee J,
    16. Valantinef HA
    ; IMAGE Study Group. Gene-expression profiling for rejection surveillance after cardiac transplantation. N Engl J Med. 2010; 362: 1890– 1900.
    OpenUrlCrossRefPubMed
  109. 109.↵
    1. Lister R,
    2. Pelizzola M,
    3. Dowen RH,
    4. Hawkins RD,
    5. Hon G,
    6. Tonti-Filippini J,
    7. Nery JR,
    8. Lee L,
    9. Ye Z,
    10. Ngo QM,
    11. Edsall L,
    12. Antosiewicz-Bourget J,
    13. Stewart R,
    14. Ruotti V,
    15. Millar AH,
    16. Thomson JA,
    17. Ren B,
    18. Ecker JR
    . Human DNA methylomes at base resolution show widespread epigenomic differences. Nature. 2009; 462: 315– 322.
    OpenUrlCrossRefPubMed
  110. 110.↵
    1. Pai AA,
    2. Bell JT,
    3. Marioni JC,
    4. Pritchard JK,
    5. Gilad Y
    . A genome-wide study of DNA methylation patterns and gene expression levels in multiple human and chimpanzee tissues. PLoS Genet. 2011; 7: e1001316.
    OpenUrlCrossRefPubMed
View Abstract
Back to top
Previous ArticleNext Article

This Issue

Circulation
February 21, 2012, Volume 125, Issue 7
  • Table of Contents
Previous ArticleNext Article

Jump to

  • Article
    • INTRODUCTION
    • Historical Perspective
    • Next-Generation Sequencing Technologies
    • Third-Generation Sequencing Technologies
    • Processing High-Throughput Sequence Data
    • The Human Reference Genome and Its Limitations
    • Aligning Sequence Reads to the Human Reference Genome
    • Identifying Single-Nucleotide Variants and Small Insertions/Deletions
    • Identifying Large Structural Variants
    • Variant Quality Control and Genotype Validation
    • Haplotype Phasing With Use of High-Throughput Sequence Data
    • High-Throughput Sequencing and Mendelian Disease Genetics
    • High-Throughput Sequencing and Complex Disease Genetics
    • Genome Sequencing and the Clinic
    • Other Applications of High-Throughput Sequencing
    • Conclusions
    • Sources of Funding
    • Disclosures
    • References
  • Figures & Tables
  • Info & Metrics
  • eLetters

Article Tools

  • Print
  • Citation Tools
    DNA Sequencing
    Frederick E. Dewey, Stephen Pan, Matthew T. Wheeler, Stephen R. Quake and Euan A. Ashley
    Circulation. 2012;125:931-944, originally published February 21, 2012
    https://doi.org/10.1161/CIRCULATIONAHA.110.972828

    Citation Manager Formats

    • BibTeX
    • Bookends
    • EasyBib
    • EndNote (tagged)
    • EndNote 8 (xml)
    • Medlars
    • Mendeley
    • Papers
    • RefWorks Tagged
    • Ref Manager
    • RIS
    • Zotero
  •  Download Powerpoint
  • Article Alerts
    Log in to Email Alerts with your email address.
  • Save to my folders

Share this Article

  • Email

    Thank you for your interest in spreading the word on Circulation.

    NOTE: We only request your email address so that the person you are recommending the page to knows that you wanted them to see it, and that it is not junk mail. We do not capture any email address.

    Enter multiple addresses on separate lines or separate them with commas.
    DNA Sequencing
    (Your Name) has sent you a message from Circulation
    (Your Name) thought you would like to see the Circulation web site.
  • Share on Social Media
    DNA Sequencing
    Frederick E. Dewey, Stephen Pan, Matthew T. Wheeler, Stephen R. Quake and Euan A. Ashley
    Circulation. 2012;125:931-944, originally published February 21, 2012
    https://doi.org/10.1161/CIRCULATIONAHA.110.972828
    Permalink:
    del.icio.us logo Digg logo Reddit logo Twitter logo CiteULike logo Facebook logo Google logo Mendeley logo

Related Articles

Cited By...

Subjects

  • Genetics
    • Functional Genomics
    • Genetics

Circulation

  • About Circulation
  • Instructions for Authors
  • Circulation CME
  • Statements and Guidelines
  • Meeting Abstracts
  • Permissions
  • Journal Policies
  • Email Alerts
  • Open Access Information
  • AHA Journals RSS
  • AHA Newsroom

Editorial Office Address:
200 Fifth Avenue, Suite 1020
Waltham, MA 02451
email: circ@circulationjournal.org
 

Information for:
  • Advertisers
  • Subscribers
  • Subscriber Help
  • Institutions / Librarians
  • Institutional Subscriptions FAQ
  • International Users
American Heart Association Learn and Live
National Center
7272 Greenville Ave.
Dallas, TX 75231

Customer Service

  • 1-800-AHA-USA-1
  • 1-800-242-8721
  • Local Info
  • Contact Us

About Us

Our mission is to build healthier lives, free of cardiovascular diseases and stroke. That single purpose drives all we do. The need for our work is beyond question. Find Out More about the American Heart Association

  • Careers
  • SHOP
  • Latest Heart and Stroke News
  • AHA/ASA Media Newsroom

Our Sites

  • American Heart Association
  • American Stroke Association
  • For Professionals
  • More Sites

Take Action

  • Advocate
  • Donate
  • Planned Giving
  • Volunteer

Online Communities

  • AFib Support
  • Garden Community
  • Patient Support Network
  • Professional Online Network

Follow Us:

  • Follow Circulation on Twitter
  • Visit Circulation on Facebook
  • Follow Circulation on Google Plus
  • Follow Circulation on Instagram
  • Follow Circulation on Pinterest
  • Follow Circulation on YouTube
  • Rss Feeds
  • Privacy Policy
  • Copyright
  • Ethics Policy
  • Conflict of Interest Policy
  • Linking Policy
  • Diversity
  • Careers

©2017 American Heart Association, Inc. All rights reserved. Unauthorized use prohibited. The American Heart Association is a qualified 501(c)(3) tax-exempt organization.
*Red Dress™ DHHS, Go Red™ AHA; National Wear Red Day ® is a registered trademark.

  • PUTTING PATIENTS FIRST National Health Council Standards of Excellence Certification Program
  • BBB Accredited Charity
  • Comodo Secured