A recent report in Nature by the Human Pangenome Reference Consortium (HPRC) makes the first draft of the human pangenome publicly available. The pangenome is assembled from a cohort of 47 genetically diverse individuals to better represent the breadth of human genetic diversity and address the ethical, legal, and social ramifications of the human genome project.
This publication also highlights the use of new bioinformatics methods that replaced the traditional linear reference genome system with a pangenomic system based on graph data structures that represent the sequences of many people simultaneously. Genomes reconstructed based on the conventional consensus reference model can often appear to be more similar to the reference than they are because of inherent bias in the consensus reference. The new pangenomic reference system can reduce inherent bias in the traditional approach by constructing a new genome related to all the diverse genomes represented in the pangenome.
The Human Pangenome Reference Consortium reports that the pangenome will be expanded to include a diverse cohort of 350 individuals, to include telomere-2-telomere genomes, and to refine the pangenomic alignment systems to achieve the goal of building a more inclusive and global human reference genome. The availability of a more complete and diverse human pangenome combined with artificial intelligence to analyze the new data will certainly stimulate rapid development of genomic medicine.