A research group at Tohoku Medical Megabank Organization (ToMMo) has successfully constructed a Japanese population reference panel (1KJPN), from the genome information of 1,070 individuals who had participated in the cohort studies*1 of the Tohoku Medical Megabank Project.
ToMMo identified through this high-coverage sequencing (32.4 × on average), 21.2 million, including 12 million novel, single-nucleotide variants (SNVs) *2 at an estimated false discovery rate of <1.0%. This detailed analysis detected signatures for purifying selection on regulatory elements as well as coding regions.
ToMMo also catalogued structural variants, including 3.4 million insertions and deletions, and 25,923 genic copy-number variants. The 1KJPN was effective for imputing genotypes of the Japanese population genome wide.
The data demonstrates the value of high-coverage sequencing for constructing population-specific variant panels, which covers 99.0% SNVs of minor allele frequency ≥0.1%, and its value for identifying causal rare variants of complex human disease phenotypes in genetic association studies.
This research has produced a basic analysis tool for the large-scale identification and study of the genes related to the physical constitution and diseases that are peculiar to the Japanese. It has so far yielded important results that can accelerate the research of personalized healthcare and medicine.
The results of this research were released in the online version of Nature Communications on August 21, 2015.
Tohoku University Tohoku Medical Megabank Organization (known as ToMMo) has been working in cooperation with Iwate Medical University on the Tohoku Medical Megabank Project. It is part of the universities' contribution towards reconstruction of the region, following the Great East Japan Earthquake in 2011.
Since 2013, the two universities have been conducting cohort studies, which include some150,000 community residents in Miyagi and Iwate prefectures.
Description of terms
*1 Cohort study: A study in which a specific group of people is tracked for a given period of time to reveal the relationship between diseases and environmental factors such as lifestyle habits, genetics and other factors.
*2 Single nucleotide variants (SNVs): A variation of a single base of the genome sequence among individuals. Usually, a variation that exists at a constant frequency or higher is called a SNP.
- Publication details:
- Publication: Nature Communications
Title: Rare variant discovery by deep whole-genome sequencing of 1,070 Japanese individuals
Date:August 21, 2015
Authors: Masao Nagasaki*, Jun Yasuda*, Fumiki Katsuoka*, Naoki Nariai, Kaname Kojima, Yosuke Kawai, Yumi Yamaguchi-Kabata, Junji Yokozawa, Inaho Danjoh, Sakae Saito, Yukuto Sato, Takahiro Mimori, Kaoru Tsuda, Rumiko Saito, Xiaoqing Pan, Satoshi Nishikawa, Shin Ito, Yoko Kuroki, Osamu Tanabe, Nobuo Fuse, Shinichi Kuriyama, Hideyasu Kiyomoto, Atsushi Hozawa, Naoko Minegishi, James Douglas Engel, Kengo Kinoshita, Shigeo Kure, Nobuo Yaegashi, ToMMo Japanese Reference Panel Project & Masayuki Yamamoto
* These authors contributed equally to this work
Tohoku Medical Megabank Organization