Date  Cost per Mb   Cost per Genome 
Sep-01 $5,292.39 $95,263,072
Mar-02 $3,898.64 $70,175,437
Sep-02 $3,413.80 $61,448,422
Mar-03 $2,986.20 $53,751,684
Oct-03 $2,230.98 $40,157,554
Jan-04 $1,598.91 $28,780,376
Apr-04 $1,135.70 $20,442,576
Jul-04 $1,107.46 $19,934,346
Oct-04 $1,028.85 $18,519,312
Jan-05 $974.16 $17,534,970
Apr-05 $897.76 $16,159,699
Jul-05 $898.90 $16,180,224
Oct-05 $766.73 $13,801,124
Jan-06 $699.20 $12,585,659
Apr-06 $651.81 $11,732,535
Jul-06 $636.41 $11,455,315
Oct-06 $581.92 $10,474,556
Jan-07 $522.71 $9,408,739
Apr-07 $502.61 $9,047,003
Jul-07 $495.96 $8,927,342
Oct-07 $397.09 $7,147,571
Jan-08 $102.13 $3,063,820
Apr-08 $15.03 $1,352,982
Jul-08 $8.36 $752,080
Oct-08 $3.81 $342,502
Jan-09 $2.59 $232,735
Apr-09 $1.72 $154,714
Jul-09 $1.20 $108,065
Oct-09 $0.78 $70,333
Jan-10 $0.52 $46,774
Apr-10 $0.35 $31,512
Jul-10 $0.35 $31,125
Oct-10 $0.32 $29,092
Jan-11 $0.23 $20,963
Apr-11 $0.19 $16,712
Jul-11 $0.12 $10,497
Oct-11 $0.09 $7,743
Jan-12 $0.09 $7,666
Apr-12 $0.07 $5,901
Jul-12 $0.07 $5,985
Oct-12 $0.07 $6,618
Jan-13 $0.06 $5,671
Apr-13 $0.06 $5,826
Jul-13 $0.06 $5,550
Oct-13 $0.06 $5,096
Jan-14 $0.04 $4,008
Apr-14 $0.05 $4,920
Jul-14 $0.05 $4,905
Oct-14 $0.06 $5,731
Jan-15 $0.04 $3,970
Apr-15 $0.05 $4,211
Jul-15 $0.015 $1,363

Based on data collected by NHGRI from the Institute’s funded genome-sequencing groups, the cost to generate a high-quality ‘draft’ human genome sequence had dropped to ~$14 million by 2006. Hypothetically, it would have likely cost upwards of $20-25 million to generate a ‘finished’ human genome sequence – expensive, but still considerably less so than for generating the first reference human genome sequence.

A primer about genome sequencing

A genome consists of all of the DNA contained in a cell’s nucleus. DNA is composed of four chemical building blocks or “bases” (for simplicity, abbreviated G, A, T, and C), with the biological information encoded within DNA determined by the order of those bases. Diploid organisms, like humans and all other mammals, contain duplicate copies of almost all of their DNA (i.e., pairs of chromosomes; with one chromosome of each pair inherited from each parent). The size of an organism’s genome is generally considered to be the total number of bases in one representative copy of its nuclear DNA. In the case of diploid organisms (like humans), that corresponds to the sum of the sizes of one copy of each chromosome pair.

Organisms generally differ in their genome sizes. For example, the genome of E. coli (a bacterium that lives in your gut) is ~5 million bases (also called megabases), that of a fruit fly is ~123 million bases, and that of a human is ~3,000 million bases (or ~3 billion bases). There are also some surprising extremes, such as with the loblolly pine tree – its genome is ~23 billion bases in size, over seven times larger than ours. Obviously, the cost to sequence a genome depends on its size. The discussion below is focused on the human genome; keep in mind that a single ‘representative’ copy of the human genome is ~3 billion bases in size, whereas a given person’s actual (diploid) genome is ~6 billion bases in size.

genone-seq

Genomes are large and, at least with today’s methods, their bases cannot be ‘read out’ in order (i.e., sequenced) end-to-end in a single step. Rather, to sequence a genome, its DNA must first be broken down into smaller pieces, with each resulting piece then subjected to chemical reactions that allow the identity and order of its bases to be deduced. The established base order derived from each piece of DNA is often called a ‘sequence read,’ and the collection of the resulting set of sequence reads (often numbering in the billions) is then computationally assembled back together to deduce the sequence of the starting genome. Sequencing human genomes are nowadays aided by the availability of available ‘reference’ sequences of the human genome, which play an important role in the computational assembly process. Historically, the process of breaking down genomes, sequencing the individual pieces of DNA, and then reassembling the individual sequence reads to generate a sequence of the starting genome was called ‘shotgun sequencing’ (although this terminology is used less frequently today). When an entire genome is being sequenced, the process is called ‘whole-genome sequencing.’See Figure 2 for a comparison of human genome sequencing methods during the time of the Human Genome Project and circa ~ 2016.

Note that such cost-accounting does not typically include activities such as quality assurance/quality control (QA/QC), alignment of generated sequence to a reference human genome, sequence assembly, genomic variant calling, or annotation.

Based on the data collected from NHGRI-funded genome-sequencing groups, the cost to generate a high-quality ‘draft’ whole human genome sequence in mid-2015 was just above $4,000; by late in 2015, that figure had fallen below $1,500.


Join 25,000 people in helping redefine health with health concierge and precision medicine. For $1500, you get your complete DNA sequence and free health concierge from Motherhealth.

https://clubalthea.com/2016/10/14/your-complete-dna-sequence-will-help-shape-the-future-of-medicine/