Scientists release a new human “pangenome” reference

May 11, 2023
More complete and sophisticated collection of genome sequences captures significantly more human diversity.

Researchers have released a new high-quality collection of reference human genome sequences that captures substantially more diversity from different human populations than what was previously available. The work was led by the international Human Pangenome Reference Consortium, a group funded by the National Human Genome Research Institute (NHGRI), part of the National Institutes of Health.  

The new “pangenome” reference includes genome sequences of 47 people, with the researchers pursuing the goal of increasing that number to 350 by mid-2024. With each person carrying a paired set of chromosomes, the current reference actually includes 94 distinct genome sequences, with a goal of reaching 700 distinct genome sequences by the completion of the project. 

The work, appearing in the journal Nature, is one of several papers published by consortium members. 

Using advanced computational techniques to align the various genome sequences, the researchers constructed a new human pangenome reference with each assembly in the pangenome covering more than 99% of the expected sequence with more than 99% accuracy. It also builds upon the previous reference genome sequence, adding over 100 million new bases, or “letters” in DNA. While the previous reference genome sequence was single and linear, the new pangenome represents many different versions of the human genome sequence at the same time. This gives researchers a wider range of options for using the pangenome in analyzing other human genome sequences.   

NIH release