A map of rice genome variation reveals the origin of cultivated rice

Huang, Xuehui; Kurata, Nori; Wei, Xinghua; Wang, Zi-Xuan; Wang, Ahong; Zhao, Qiang; Zhao, Yan; Liu, Kunyan; Lu, Hengyun; Li, Wenjun; Guo, Yunli; Lu, Yiqi; Zhou, Congcong; Fan, Danlin; Weng, Qijun; Zhu, Chuanrang; Huang, Tao; Zhang, Lei; Wang, Yongchun; Feng, Lei; Furuumi, Hiroyasu; Kubo, Takahiko; Miyabayashi, Toshie; Yuan, Xiaoping; Xu, Qun; Dong, Guojun; Zhan, Qilin; Li, Canyang; Fujiyama, Asao; Toyoda, Atsushi; Lu, Tingting; Feng, Qi; Qian, Qian; Li, Jiayang; Han, Bin

doi:10.1038/nature11532

Download PDF

Article
Open access
Published: 03 October 2012

A map of rice genome variation reveals the origin of cultivated rice

Xuehui Huang¹^na1,
Nori Kurata²^na1,
Xinghua Wei³^na1,
Zi-Xuan Wang^1,2^na1,
Ahong Wang¹,
Qiang Zhao¹,
Yan Zhao¹,
Kunyan Liu¹,
Hengyun Lu¹,
Wenjun Li¹,
Yunli Guo¹,
Yiqi Lu¹,
Congcong Zhou¹,
Danlin Fan¹,
Qijun Weng¹,
Chuanrang Zhu¹,
Tao Huang¹,
Lei Zhang¹,
Yongchun Wang¹,
Lei Feng¹,
Hiroyasu Furuumi²,
Takahiko Kubo²,
Toshie Miyabayashi²,
Xiaoping Yuan³,
Qun Xu³,
Guojun Dong³,
Qilin Zhan¹,
Canyang Li¹,
Asao Fujiyama²,
Atsushi Toyoda²,
Tingting Lu¹,
Qi Feng¹,
Qian Qian³,
Jiayang Li⁴ &
…
Bin Han^1,5

Nature volume 490, pages 497–501 (2012)Cite this article

75k Accesses
1163 Citations
Metrics details

Subjects

Abstract

Crop domestications are long-term selection experiments that have greatly advanced human civilization. The domestication of cultivated rice (Oryza sativa L.) ranks as one of the most important developments in history. However, its origins and domestication processes are controversial and have long been debated. Here we generate genome sequences from 446 geographically diverse accessions of the wild rice species Oryza rufipogon, the immediate ancestral progenitor of cultivated rice, and from 1,083 cultivated indica and japonica varieties to construct a comprehensive map of rice genome variation. In the search for signatures of selection, we identify 55 selective sweeps that have occurred during domestication. In-depth analyses of the domestication sweeps and genome-wide patterns reveal that Oryza sativa japonica rice was first domesticated from a specific population of O. rufipogon around the middle area of the Pearl River in southern China, and that Oryza sativa indica rice was subsequently developed from crosses between japonica rice and local wild rice as the initial cultivars spread into South East and South Asia. The domestication-associated traits are analysed through high-resolution genetic mapping. This study provides an important resource for rice breeding and an effective genomics approach for crop domestication research.

Multiple domestications of Asian rice

Article 07 August 2023

Genomic revolution of US weedy rice in response to 21st century agricultural technologies

Article Open access 08 September 2022

Genomic history and ecology of the geographic spread of rice

Article 15 May 2020

Main

Cultivated rice (Oryza sativa L.), which is grown worldwide and is one of the most important cereals for human nutrition, is considered to have been domesticated from wild rice (Oryza rufipogon) thousands of years ago^1,2,3,4. The differences between O. sativa and O. rufipogon are reflected in a wide range of morphological and physiological traits^5,6,7,8,9. Despite the fact that rice is a major cereal and a model system for plant biology, the evolutionary origins and domestication processes of cultivated rice have long been debated. The puzzles about rice domestication include: (1) where the geographic origin of cultivated rice was, (2) which types of O. rufipogon served as its direct wild progenitor, and (3) whether the two subspecies of cultivated rice, indica and japonica, are derived from a single or multiple domestications.

A wide range of genetic and archaeological studies have been carried out to examine the phylogenetic relationships of rice, and investigate the demographic history of rice domestication^{10,11,12,13,14,15,16,17,18,19}. Molecular phylogenetic analyses indicated that indica and japonica originated independently^3,10,20. However, the well-characterized domestication genes in rice were found to be fixed in both subspecies with the same alleles, thus supporting a single domestication origin^6,7,8,9,16. Recently, a demographic analysis of single-nucleotide polymorphisms (SNPs) detected from 630 gene fragments suggested a single domestication origin of rice¹⁷. Meanwhile, population genetics analyses of genome-wide data of cultivated and wild rice have tended to suggest that indica and japonica genomes generally appear to be of independent origin^18,19, but many genomic segments bearing domestication alleles may have originated only once¹⁸. Despite these advances, wider sampling with population-scale whole-genome sequencing is needed to shed greater light on the evolutionary history of rice domestication. An in-depth investigation of the haplotype structure near the domestication sites will be critical for evaluating the direction of introgression. The specific ancestral population and the subsequent demographic event are yet to be identified.

Moreover, a comprehensive map of rice genome variation will facilitate genetic mapping of complex traits in rice. We recently collected diverse rice cultivars for sequencing, and carried out genome-wide association studies (GWAS) for many agronomic traits in cultivated rice^21,22. Here we sequenced and analysed the genomes of 446 O. rufipogon accessions to investigate the phylogenetic relationships between cultivated and wild rice and identify the signatures of selection in rice domestication. This research also provides a robust foundation that will enable rice breeders to effectively exploit diverse genetic resources for rice improvement.

Analysis of wild rice populations

The strategy of this study is briefly described in Supplementary Fig. 1. The genus Oryza consists of 23 species^23,24, and the wild rice O. rufipogon is believed to be the immediate progenitor of the cultivated rice O. sativa (Supplementary Fig. 2 and Supplementary Table 1). From large collections of wild rice germplasm maintained in China and Japan, we selected 446 diverse O. rufipogon accessions including both perennial and annual (also called as Oryza nivara) forms from Asia and Oceania, spanning the native geographic range of the species (Supplementary Tables 2 and 3). We sequenced these accessions with twofold genome coverage. After aligning the reads against the rice reference genome sequence, we identified a total of 5,037,497 non-singleton SNPs. Based on the SNP data, the sequence diversity (π) of O. rufipogon was estimated at ∼0.003, which is higher than that in O. sativa—the sequence diversity is 0.0024 for O. sativa, and 0.0016 and 0.0006 for indica and japonica, respectively²¹. Approximately 82% of SNPs (minor allele frequency > 0.05) segregating in O. rufipogon also segregate in O. sativa (Supplementary Table 4). This observation is consistent with previous suggestions that part of the genetic diversity in the progenitor would be lost because only a limited number of individuals were used during domestication^11,25.

We investigated the population structure of the O. rufipogon accessions. On the basis of the neighbour-joining tree, as well as principal-component analysis (PCA), we classified the O. rufipogon species into three types, simply designated as Or-I, Or-II and Or-III in this study (Fig. 1a and Supplementary Fig. 3). We found that the rice population structure strongly correlated with geographic distribution (r² = 0.2 between the first principal component and the longitude, and r² = 0.3 between the second principal component and the latitude)²⁶. Interestingly, the O. rufipogon accessions sampled from southern China mostly belong to the Or-III type (Fig. 1b and Supplementary Fig. 4). The level of population differentiation, F_ST, was estimated at 0.18 among the groups of O. rufipogon, which is much lower than that of O. sativa (∼0.55 on average²¹). The differentiation was not evenly distributed across the rice genome (Supplementary Fig. 5). We scanned the whole genome for highly differentiated loci and found 68 loci with F_ST > 0.3, which covered ∼3% of the complete rice genome (Supplementary Table 5). Of these loci, we found several known genes or quantitative trait loci (QTLs) that included DPL2 (hybrid incompatibility²⁷) (Fig. 1c), OsSOC1 and Ghd7 (flowering time^28,29,30) and qCTS12 (cold tolerance³¹), all of which have been reported to be closely related to indica–japonica differentiation in rice. Hence, the highly differentiated loci can provide important clues for searching the genes involved in local adaptation.

Figure 1: **Genetic structure and association analysis in the wild rice population.**

Because O. rufipogon is an out-crossing species, it was expected to have a relatively high decay rate of linkage disequilibrium. We found that the decay rate of linkage disequilibrium in O. rufipogon, expressed as r², dropped to half of its maximum value at ∼20 kilobases (kb) on average (Supplementary Fig. 6), which is much more rapid than that in O. sativa (∼123 kb and ∼167 kb in indica and japonica, respectively)²¹. To perform GWAS in O. rufipogon, we used the k-nearest neighbour algorithm for data imputation (Supplementary Table 6), and phenotyped the O. rufipogon population for two traits, leaf sheath colour and tiller angle. The strongest associations for sheath colour and tiller angle were found to be just around the known loci OsC1, for colouration³², and PROG1, for prostrate growth^7,8 (Fig. 1d and Supplementary Figs 7–9). Through computational simulations we predicted that the mapping resolution of GWAS in O. rufipogon was approximately three times greater than that in O. sativa on average (Supplementary Figs 10–12). Hence, the wild rice population and accompanying comprehensive sequence resource should be of great utility for directly dissecting agronomic traits in rice.

Phylogenetic relationships of rice

We used whole-genome sequencing data for a large panel of accessions containing 446 O. rufipogon accessions and 1,083 O. sativa varieties to explore their phylogenetic relationships. The 1,083 diverse O. sativa varieties were collected throughout the world and sequenced with one-fold genome coverage (Supplementary Table 7). Of these, 950 genomes had previously been reported^21,22, and the remaining 133 genomes, including many representative varieties (for example, aromatic rice), were sequenced and first reported in this study. A total of 7,970,359 non-singleton SNPs were identified from the 1,529 genome sequences. To determine the ancestral states of the SNPs, the close relatives of O. rufipogon and O. sativa were also sequenced with a total of approximately 50-fold genome coverage (Supplementary Table 8). Using genome sequences of these outgroups, we were able to identify the ancestral states of 6,119,311 SNPs out of the approximately 8 million SNPs (77%).

We used the genotype data set of the ∼8 million SNP sites from 1,529 genomes to infer genome-wide relationships. Both the phylogenetic tree and the PCA plots indicate that O. sativa indica and japonica are descended from Or-I and Or-III, respectively (Fig. 2a and Supplementary Fig. 13). We then investigated the detailed relationship between indica and Or-I and that between japonica and Or-III separately (Supplementary Figs 14–16). The level of genetic differentiation between indica and Or-I was modest (F_ST = 0.17), and indica contains approximately 75% of the genetic diversity in Or-I (Fig. 2b). A small number of indica and Or-I accessions seemed to be intermediate between cultivated and wild rice (Fig. 2a). In contrast, the japonica groups (temperate japonica, tropical japonica and aromatic) were all clustered together, and had descended from wild rice in southern China (sub-clade Or-IIIa). There was an obvious genetic distinction between japonica and Or-IIIa (Fig. 2a), which had a relatively high level of population differentiation (F_ST = 0.36). We found that only approximately 33% of the genetic diversity of Or-III persisted in japonica (Fig. 2b). The strong genetic bottleneck indicates that a small effective population from Or-III was used for domesticating japonica cultivars.

Figure 2: **Genome-wide relationship between cultivated rice and its wild progenitor.**

On the basis of our SNP data, most sequence variants observed between indica and japonica, which were estimated to have diverged hundreds of thousands of years ago²⁰, already existed in the progenitor gene pools. However, the differentiation was enhanced during domestication, with F_ST expanding from 0.18 in O. rufipogon to 0.55 in O. sativa (Fig. 2b). The high level of genetic differentiation between indica and japonica has resulted from a combination of the modest differentiation present within their ancestor population and the recent domestication bottleneck (Fig. 2c). For example, at the causal SNP site for GS3 (a major QTL for grain shape^33,34), the null allele mainly existed in the wild rice populations in the Guangxi and Guangdong provinces of southern China. This allele rapidly extended to become the major allele (∼98%) in the japonica population, thus generating a distinct phenotypic difference between indica and japonica. We screened all SNPs that were highly differentiated in frequency between indica and japonica, and found a total of 213,188 indica–japonica-differentiated SNPs. With regard to the differentiated SNPs, the differences present within the O. rufipogon, the domestication bottleneck in japonica and in indica were estimated to contribute ∼50%, ∼40% and ∼10% of total alterations in allele frequency, respectively (Supplementary Fig. 17).

We examined the ancestral states of indica–japonica-fixed SNPs, and found that 55% of SNPs of the ancestral alleles are fixed in indica and the other 45% are fixed in japonica. The dN/dS ratio (dN, number of non-synonymous substitutions per non-synonymous site; dS, number of synonymous substitutions per synonymous site) of the SNPs was calculated to be 0.34, which was almost equal to the average level of the total SNPs. In contrast, for the 9,595 SNPs that were fixed between O. rufipogon and O. sativa, the ancestral alleles of 93.3% SNPs are identical to O. rufipogon, indicating that O. rufipogon has retained more ancestral states than O. sativa, which also further supported our conclusion that O. rufipogon is likely to be the ancestral progenitor of O. sativa. The dN/dS ratio of the fixed SNPs between O. rufipogon and O. sativa is calculated to be 1.04, indicating positive selection during domestication (P < 0.001, chi-square test).

Screens and annotation of domestication loci

Selective signatures from domestication include a reduction in nucleotide diversity and altered allele frequency in the domestication loci^19,25. We measured the ratio of the genetic diversity in wild rice to that in cultivated rice (π_w/π_c) across the rice genome, and determined the cutoff on the basis of permutation tests (Supplementary Information section 2). We performed whole-genome screening in indica, japonica and the full population using the diversity ratios (Fig. 3 and Supplementary Figs 18–20). In total we identified 60 loci in indica, 62 in japonica and 55 in the full population (Supplementary Tables 9–11). We noticed that many loci with strong signals of selection were nearly identical in both indica and japonica where F_ST between indica and japonica was extremely low, indicating that introduction of traits during domestication has in many cases involved introgression events. We noted that most well-characterized domestication genes, including Bh4 (hull colour⁹), PROG1 (tiller angle^7,8), sh4 (seed shattering^5,6), qSW5 (grain width³⁵) and OsC1 (leaf sheath colour and apiculus colour³²), were among the 55 loci detected in the full population (Fig. 3). Another three well-characterized domestication genes, qSH1 for seed shattering³⁶, Waxy for grain quality³⁷ and Rc for pericarp colour^38,39, which showed strong selection signals in the japonica panel, were not fully shared in the indica population.

Figure 3: **Whole-genome screening and functional annotations of domestication sweeps.**

We investigated the genetic patterns around the well-characterized domestication loci, which were quite different from those at the whole-genome scale. According to phylogenetic trees calculated from SNPs on these loci, most cultivars were clustered together, and the Or-III population from southern China tended to be the closest ancestral progenitor of all the cultivars (Supplementary Fig. 21). These patterns were much clearer when expanded to the total regions of the 55 domestication-related loci (Fig. 4a). The indica cultivars were generally close to Or-I across the genome, but were closer to Or-IIIa than to Or-I at the 55 selected loci (Supplementary Fig. 22). We further calculated the genetic distances between cultivated rice and the wild rice populations from each geographic sampling region through analysis of the genomic regions around the 55 domestication-related loci (Supplementary Fig. 23). Our genetic approach showed that the middle area of the Pearl River in Guangxi province, southern China, was probably the place of the first development of cultivated rice (Fig. 4b–d), although an archaeological finding had identified the Lower Yangtze region in eastern China as one of the centres of rice cultivation^14,15. These results suggest a model in which japonica was first domesticated from Or-III in southern China, and was subsequently crossed to local wild rice in South East Asia and South Asia, thus generating indica after many cross-differentiation-selection cycles (Fig. 4e). Furthermore, we performed computational simulations to generate in silico data sets under various demographic scenarios (Supplementary Fig. 24), using the forward simulator SFS_CODE⁴⁰ (Supplementary Information section 5). We calculated the genetic distances of indica and japonica with different clades of O. rufipogon for both the real and simulated data (Supplementary Fig. 25), which showed that there were significant differences between the real data and the simulated ones in all the scenarios except for the model proposed here (P < 0.01, rank sum test), providing further support for the model that we propose.

Figure 4: **Genetic and geographic origins of rice domestication.**

To investigate why these chromosomal loci were selected during rice domestication, QTL mapping of domestication-related traits was performed using a population that was developed from a cross between O. sativa indica Guangluai-4 and O. rufipogon (Or-IIIa accession W1943). Using a sequencing-based genotyping method⁴¹, we sequenced 271 lines in the population with 0.5× coverage for each, and constructed an ultra-dense genotype map. A total of 15 domestication-related traits were phenotyped, with 58 QTLs detected using a composite interval mapping method (Supplementary Fig. 26 and Supplementary Tables 12, 13). For ten QTLs for which causal genes have been reported, almost all the peak signals were within 200 kb from the known genes, indicating that a high mapping resolution was achieved (Supplementary Fig. 27).

Of the 58 QTLs detected, 32 QTLs were located within domestication sweeps (Fig. 3b–d and Supplementary Figs 28–30). The overlap of the QTLs for domestication traits with domestication sweeps is very significant (P < 0.001, chi-square test). Among them, the QTLs for traits like exserted stigma (controlling mating system) and grain size (controlling output yield) show stronger signals than those for shattering (sh4) and plant architecture (PROG1). Mating systems have been proposed to have a fundamental part in crop domestication^42,43. We identified three major loci and two minor loci responsible for exserted stigma, and found that all the five QTLs were located within domestication sweeps.

Identification of domestication-associated variants

Characterization of domestication-associated genes and their genetic variation relies on high-quality genome sequences of both the cultivated species and its immediate progenitor^44,45. Hence, we further sequenced the O. rufipogon accession W1943 with 100-fold genome coverage and carried out de novo genome assembly. The total length of the assembly is 406 megabases (Mb) and the N50 length of the initial contigs was 16 kb. After aligning the assembly against the reference genome of cultivated rice, we identified 2,621,077 SNPs, 619,132 small indels and 140,075 structural variants of large size (Supplementary Table 14). We examined the sequence variants for their potential effects on protein coding, and identified a total of 128,010 non-synonymous SNPs and 49,236 sequence variants with large effect. Moreover, we surveyed the allele information at all the polymorphic sites across all rice accessions, and constructed a comprehensive sequence-variant frequency spectrum map.

We investigated the patterns of sequence polymorphism to detect variants that affect gene coding and have differential frequency in different populations. For the domestication loci with well-characterized genes, this approach allowed us to narrow down the causative polymorphism to a limited number of sequence variants (Supplementary Figs 31–34). For the domestication loci for which known genes have not yet been identified, we found a total of 273 novel functional variants within 204 genes that showed high differentiation in allele frequencies between cultivated and wild rice (Supplementary Table 15). Moreover, we identified 305 sequence variants in the promoter regions with a differential frequency between cultivated and wild rice (Supplementary Table 16), and 1,120 functional variants that were fixed in either japonica or indica panel (Supplementary Tables 17 and 18).

Discussion

Our study has provided new insights into how and where rice was likely to be domesticated, and we have identified a set of domestication sweeps and putative causal genes. Such endeavours will be enhanced by continuing improvements in assembly and annotation of wild rice genome sequences, generation of functional genomics data sets of wild rice⁴⁶, advances in mapping of rice domestication traits⁴⁷, and biological follow-up of the putative causal genes. The understanding of past domestication, including the selections on critical traits and the recent rapid speciation, will further guide future breeding efforts⁴⁸. Moreover, the great diversity in the wild rice populations, which have much more natural allelic variation than domesticated rice, will further facilitate breeding to modify crops in the post-domestication era.

Methods Summary

The cultivated and wild rice accessions were all from large collections of rice accessions preserved at the China National Rice Research Institute in Hangzhou, China, and the National Institute of Genetics in Mishima, Japan. The DNA samples were sequenced on the Illumina Genome Analyzer IIx or HiSeq2000.

Online Methods

Sampling and sequencing

The cultivated and wild rice accessions were all from large collections of rice accessions preserved at the China National Rice Research Institute in Hangzhou, China, and the National Institute of Genetics in Mishima, Japan. The accessions were selected on the basis of the germplasm database records of phenotypic data and sampling localities to maximize genetic and geographic diversity. The collection was maintained by selfing in the laboratories. For each accession, genomic DNA from a single plant was used for sequencing, and seeds derived from the same plant were used for following field trials. In total, the genomes of 1,083 O. sativa accessions, 446 O. rufipogon accessions and 15 accessions of outgroup species were sequenced on the Illumina Genome Analyzer IIx generating 73-bp (or 117-bp) paired-end reads, each to approximately onefold (for O. sativa accessions), twofold (for O. rufipogon accessions) or threefold (for outgroup species) coverage. The detailed information, including geographic origin and sequencing coverage of the rice accessions, was listed in Supplementary Tables 2, 7 and 8. Library construction and sequencing of these accessions were performed as described²¹. One representative accession of O. rufipogon, W1943, was sequenced on the Illumina HiSeq2000, generating 100-bp paired-end reads with 100-fold genome coverage. An amplification-free method of library preparation⁴⁹ was used in deep sequencing of the rice accession, which reduced the incidence of duplicate sequences, thus facilitating genome assembly and variation analysis.

Read alignment and SNP calling

The paired-end reads of all the rice accessions were aligned against the rice reference genome (IRGSP 4.0) using the software Smalt (version 0.4) with the parameters of ‘-pair 50, 700’ and ‘-mthresh 50’. SNPs were called using the Ssaha Pileup package (version 0.5) with detailed procedure described previously²¹. Genotypes of the rice accessions, including 1,083 O. sativa accessions, 446 O. rufipogon accessions and 15 accessions of outgroup species, were further called at the SNP sites from the Ssaha Pileup outputs. The genotype calls in 15 accessions of outgroup species were used to determine the ancestral states of SNPs in O. sativa and O. rufipogon. SNPs in coding regions, which were defined based on the gene models in the RAP-DB (release 2), were then annotated to be synonymous or non-synonymous for calculating the non-synonymous/synonymous ratio and dN/dS ratio. The genotype data set of the 1,529 rice accessions (1,083 O. sativa accessions and 446 O. rufipogon accessions) was generated on the basis of the calls in each rice accession. Seven sets of genome sequences, which included bacterial-artificial-chromosome-based Sanger sequences and high-coverage resequencing data, were used to assess the accuracy of the genotype data sets (Supplementary Table 6). The wild rice accessions with sequencing coverage >9 were selected to investigate the heterozygosity based on the overlapped reads that were aligned onto the reference sequence. For each accession, the proportion of heterozygosity genotypes was calculated at the polymorphic sites (Supplementary Table 3).

Population genetics analysis

The software Haploview was used to calculate linkage disequilibrium with default settings, using SNPs with information in 446 O. rufipogon accessions⁵⁰. Pairwise r² was calculated for all the SNPs and then averaged across the whole genome. The matrix of pairwise genetic distance derived from simple SNP-matching coefficients was used to construct phylogenetic trees using the software PHYLIP⁵¹ (version 3.66). The software TreeView and MEGA5 were used for visualizing the phylogenetic trees. Principal component analysis of the SNPs was performed using the software EIGENSOFT⁵². The sequence diversity statistics (π) and the population-differentiation statistics (F_ST) were computed using a 100-kb window. The value of π was calculated for each group in O. rufipogon and O. sativa, respectively, and the ratio of π in the full population (or each clade) of O. rufipogon to that in the full population (or corresponding subspecies) of O. sativa was used to detect selective sweeps. The genomic regions where both O. rufipogon and O. sativa show a low level of genetic diversity were excluded for further analysis. To adopt appropriate thresholds to reduce the false-positive rate but also retain true selection signals, thresholds were chosen on the basis of both whole-genome permutation tests and signals at known loci. Permutation tests were performed to estimate the genome-wide type I error rate and determine the threshold to call selective sweeps (see Supplementary Information section 2 for details)⁵³. The method cross-population extended haplotype homozygosity (XP-EHH) was also tested for detecting selective sweeps using the software xpehh⁵⁴ (http://hgdp.uchicago.edu/Software/) (Supplementary Fig. 20). The genetic distance between two clades was computed based on the matrix of pairwise genetic distance, where the distance of all pairs of accessions from the two clades were retrieved and averaged. A custom Perl script was developed to plot all O. rufipogon accessions, using the public geographic information of world borders from the ‘Thematic Mapping’ data set (version 0.3). The computational simulations under different demographic scenarios were performed using the program SFS_CODE⁴⁰.

Planting, crossing and phenotyping

For the O. rufipogon population, approximately five seeds for each accession from the collection of wild rice were germinated and planted in the experimental field (in Sanya, China at N 18.65°, E 109.80°) from March 2011. The leaf sheath colour was observed and scored directly and the tiller angle was measured for each plant. The mapping population of 210 backcross inbred lines (BILs) and 61 chromosome segment substitution lines (CSSLs) was derived from a cross between O. sativa ssp. indica cv. Guangluai-4 and O. rufipogon accession W1943. The BILs were developed by one generation of backcross to Guangluai-4 followed with six generations of self-fertilization. The CSSLs were developed by five generations of backcross to Guangluai-4 followed with three generations of self-fertilization. Phenotyping was conducted in the experimental field (in Shanghai, China at N 31.13°, E 121.28°) from May to October, 2011. The fifteen traits that we phenotyped for this study include germination rate, tiller angle, heading date, stigma colour, the degree of stigma exsertion, plant height, panicle length, the degree of shattering, awn length, grain number per panicle, grain length, grain width, grain weight per 1,000 grains, hull colour and pericarp colour. The degree of stigma exsertion was scored based on the observation of ∼20 randomly sampled spikelets of each line, on a scale of 1–3 (no, incomplete or complete exertion). Seed germination rate was measured by using mature seeds which were placed in a plastic Petri dishes kept at 30 °C in the dark for 48 h⁵. Other traits were phenotyped and scored as described previously^21,22,29.

Imputation and association analysis

For the genotype data set in O. rufipogon, genotypes of 446 O. rufipogon accessions were called specifically at the ∼5 million SNP sites that were polymorphic in the O. rufipogon population. In the panel for GWAS, only the SNPs that have a minor allele frequency (MAF) of more than 5% and contain genotype calls of more than 100 accessions were left for subsequent imputation. The k-nearest neighbour algorithm-based imputation method was used for inferring missing calls²¹. The specificity of the genotype data set before and after imputation was assessed using three sets of genome sequences (Supplementary Table 6). Association analysis was conducted using the compressed mixed linear model⁵⁵. The top five principle components were used as fixed effects and the matrix of genetic distance was used to model the variance–covariance matrix of the random effect. Permutation tests were used to define the threshold of association signals of the GWAS in the wild rice population. A total of 20 permutation analyses were performed (10 independent permutation tests for each of the two traits, sheath colour and tiller angle), which resulted in two ‘association signals’ with the thresholds we set⁵³. Hence, there were an average of 0.1 false positives (that is, totally two false positives in 20 permutation tests) in a single whole-genome scanning analysis. Simulation tests were used to compare the performance of GWAS between the populations of cultivated and wild rice.

Genotyping and linkage analysis

Genomic DNA of each line in the mapping population was sequenced on the Illumina Genome Analyzer IIx, each to approximately 0.5× coverage. Both parents of the population, Guangluai-4 and W1943, were sequenced with at least 20× genome coverage, in a previous work²¹ and in this study, respectively. SNP identification between parents was conducted as described previously⁴¹. Genotype calling, recombination breakpoint determination and bin map construction was performed using the software SEG-Map (http://www.ncgr.ac.cn/software/SEG/). QTL analysis of the fifteen traits was conducted with the composite interval mapping (CIM) method implemented in the software Windows QTL Cartographer⁵⁶ (version 2.5) with a window size of 10 cM and a step size of 2 cM. QTL with LOD value higher than 3.5 were called, of which the location was described according to its LOD peak location. The phenotypic effect (r²) of each QTL was computed using Windows QTL Cartographer. QTLs located within selective sweep regions were further used to associate the selected regions with their biological functions. It needs to be noted that we adopted a stringent threshold in the QTL calling (LOD > 3.5), and the genomic regions with LOD ranging from 2.5 to 3.5 may include many minor QTLs (the threshold was set to 2.5 in most studies).

Genome assembly and contig anchoring

The genome of W1943 was assembled by using a custom pipeline integrating Phusion2 (clustering the raw reads into different groups)⁵⁷ and Phrap (then assembling all the reads in each group to generate contigs)⁵⁸. The N50 length of the entire assembly was calculated for the initial contigs with small contigs of <200 bp excluded. All the full-length complementary DNA sequences⁴⁶ of W1943 were aligned with the final assembly of W1943 genome sequence using the software GMAP⁵⁹ (version 6) with the parameters ‘-K 15000’ and ‘-k 0.97’. The resulting contigs from whole-genome de novo assembly were anchored to the rice reference genome sequence (IRGSP4.0) using the software MUMmer⁶⁰ (version 3).

Genome annotation and variant detection

Gene models of the genome of the wild rice W1943 were predicted using the software Fgenesh that was set for a monocot model⁶¹ (version 2.0). The resulting proteome of W1943 was compared with protein sequences in Rice Genome Annotation Project (version 7.0) using BLASTP with a cutoff of a minimum of 95% identity. Sequence variants, including SNPs, indels and imbalanced substitutions, were called using the diffseq program in the EMBOSS package⁶². Indels of large size were called from the alignment results of MUMmer. Effects of the sequence variants were predicted according to the gene models of Nipponbare in the RAP-DB (release 2) across the rice genome. For indels in genic regions and SNPs with large effect around the domestication loci, the effects were mainly based on the reference gene models.

Population-scale sequence comparison

The sequence reads of 1,083 O. sativa accessions, 446 O. rufipogon accessions and 15 outgroup accessions were then aligned against assembled genome sequences of W1943 using the same parameters with those against the reference Nipponbare genome sequences. Genotypes of each accession were called at all sequence variant sites (including SNPs, indels and imbalanced substitutions that were detected from assembled sequences), based on the alignment outputs against the two genome sequences. The allele frequencies at the sequence variant sites were calculated for each clade of O. sativa and O. rufipogon. In each clade, variant sites with information of less than 10 accessions (less than 2 for the outgroups) were then excluded for computing allele frequencies, namely no data available.

Accession codes

Primary accessions

European Nucleotide Archive

Data deposits

DNA sequencing data are deposited in the European Nucleotide Archive (http://www.ebi.ac.uk/ena/) under accession numbers ERP001143, ERP000729 and ERP000106. De novo assembly and genome annotation of wild rice W1943, the genotype dataset of 1,529 rice accessions and the imputed dataset of 446 O. rufipogon accessions for GWAS are available at the Rice Haplotype Map Project database (http://www.ncgr.ac.cn/RiceHap3).

References

Oka, H. I. Origin of cultivated rice. (Japan Scientific Societies Press, 1988)
Google Scholar
Khush, G. S. Origin, dispersal, cultivation and variation of rice. Plant Mol. Biol. 35, 25–34 (1997)
CAS PubMed Google Scholar
Cheng, C. et al. Polyphyletic origin of cultivated rice: based on the interspersion pattern of SINEs. Mol. Biol. Evol. 20, 67–75 (2003)
CAS PubMed Google Scholar
Fuller, D. Q. et al. Consilience of genetics and archaeobotany in the entangled history of rice. Archaeol. Anthropol. Sci. 2, 115–131 (2010)
Google Scholar
Li, C., Zhou, A. & Sang, T. Genetic analysis of rice domestication syndrome with the wild annual species, Oryza nivara. New Phytol. 170, 185–194 (2006)
ADS CAS PubMed Google Scholar
Li, C., Zhou, A. & Sang, T. Rice domestication by reducing shattering. Science 311, 1936–1939 (2006)
ADS CAS PubMed Google Scholar
Jin, J. et al. Genetic control of rice plant architecture under domestication. Nature Genet. 40, 1365–1369 (2008)
CAS PubMed Google Scholar
Tan, L. et al. Control of a key transition from prostrate to erect growth in rice domestication. Nature Genet. 40, 1360–1364 (2008)
CAS PubMed Google Scholar
Zhu, B. F. et al. Genetic control of a transition from black to straw-white seed hull in rice domestication. Plant Physiol. 155, 1301–1311 (2011)
CAS PubMed PubMed Central Google Scholar
Londo, J. P., Chiang, Y. C., Hung, K. H., Chiang, T. Y. & Schaal, B. A. Phylogeography of Asian wild rice, Oryza rufipogon, reveals multiple independent domestications of cultivated rice, Oryza sativa. Proc. Natl Acad. Sci. USA 103, 9578–9583 (2006)
ADS CAS PubMed Google Scholar
Caicedo, A. L. et al. Genome-wide patterns of nucleotide polymorphism in domesticated rice. PLoS Genet. 3, e163 (2007)
PubMed Central Google Scholar
Kovach, M. J., Sweeney, M. T. & McCouch, S. R. New insights into the history of rice domestication. Trends Genet. 23, 578–587 (2007)
CAS PubMed Google Scholar
Sang, T. & Ge, S. The puzzle of rice domestication. J. Integr. Plant Biol. 49, 760–768 (2007)
CAS Google Scholar
Zong, Y. et al. Fire and flood management of coastal swamp enabled first rice paddy cultivation in east China. Nature 449, 459–462 (2007)
ADS CAS PubMed Google Scholar
Fuller, D. Q. et al. The domestication process and domestication rate in rice: spikelet bases from the Lower Yangtze. Science 323, 1607–1610 (2009)
ADS CAS PubMed Google Scholar
Zhang, L. B. et al. Selection on grain shattering genes and rates of rice domestication. New Phytol. 184, 708–720 (2009)
CAS PubMed Google Scholar
Molina, J. et al. Molecular evidence for a single evolutionary origin of domesticated rice. Proc. Natl Acad. Sci. USA 108, 8351–8356 (2011)
ADS CAS PubMed Google Scholar
He, Z. et al. Two evolutionary histories in the genome of rice: the roles of domestication genes. PLoS Genet. 7, e1002100 (2011)
CAS PubMed PubMed Central Google Scholar
Xu, X. et al. Resequencing 50 accessions of cultivated and wild rice yields markers for identifying agronomically important genes. Nature Biotechnol. 30, 105–111 (2012)
CAS Google Scholar
Ma, J. & Bennetzen, J. L. Rapid recent growth and divergence of rice nuclear genomes. Proc. Natl Acad. Sci. USA 101, 12404–12410 (2004)
ADS CAS PubMed Google Scholar
Huang, X. et al. Genome-wide association studies of 14 agronomic traits in rice landraces. Nature Genet. 42, 961–967 (2010)
CAS PubMed Google Scholar
Huang, X. et al. Genome-wide association study of flowering time and grain yield traits in a worldwide collection of rice germplasm. Nature Genet. 44, 32–39 (2012)
Google Scholar
Ge, S., Sang, T., Lu, B. R. & Hong, D. Y. Phylogeny of rice genomes with emphasis on origins of allotetraploid species. Proc. Natl Acad. Sci. USA 96, 14400–14405 (1999)
ADS CAS PubMed Google Scholar
Vaughan, D. A., Morishima, H. & Kadowaki, K. Diversity in the Oryza genus. Curr. Opin. Plant Biol. 6, 139–146 (2003)
CAS PubMed Google Scholar
Doebley, J. F., Gaut, B. S. & Smith, B. D. The molecular genetics of crop domestication. Cell 127, 1309–1321 (2006)
CAS PubMed Google Scholar
Novembre, J. & Stephens, M. Interpreting principal component analyses of spatial population genetic variation. Nature Genet. 40, 646–649 (2008)
CAS PubMed Google Scholar
Mizuta, Y., Harushima, Y. & Kurata, N. Rice pollen hybrid incompatibility caused by reciprocal gene loss of duplicated genes. Proc. Natl Acad. Sci. USA 107, 20417–20422 (2010)
ADS CAS PubMed Google Scholar
Tadege, M. et al. Reciprocal control of flowering time by OsSOC1 in transgenic Arabidopsis and by FLC in transgenic rice. Plant Biotechnol. J. 1, 361–369 (2003)
CAS PubMed Google Scholar
Wang, L. et al. Mapping 49 quantitative trait loci at high resolution through sequencing-based genotyping of rice recombinant inbred lines. Theor. Appl. Genet. 122, 327–340 (2011)
PubMed Google Scholar
Xue, W. et al. Natural variation in Ghd7 is an important regulator of heading date and yield potential in rice. Nature Genet. 40, 761–767 (2008)
CAS PubMed Google Scholar
Andaya, V. C. & Tai, T. H. Fine mapping of the qCTS12 locus, a major QTL for seedling cold tolerance in rice. Theor. Appl. Genet. 113, 467–475 (2006)
CAS PubMed Google Scholar
Saitoh, K., Onishi, K., Mikami, I., Thidar, K. & Sano, Y. Allelic diversification at the C (OsC1) locus of wild and cultivated rice: nucleotide changes associated with phenotypes. Genetics 168, 997–1007 (2004)
CAS PubMed PubMed Central Google Scholar
Fan, C. et al. GS3, a major QTL for grain length and weight and minor QTL for grain width and thickness in rice, encodes a putative transmembrane protein. Theor. Appl. Genet. 112, 1164–1171 (2006)
CAS PubMed Google Scholar
Takano-Kai, N. et al. Evolutionary history of GS3, a gene conferring grain length in rice. Genetics 182, 1323–1334 (2009)
CAS PubMed PubMed Central Google Scholar
Shomura, A. et al. Deletion in a gene associated with grain size increased yields during rice domestication. Nature Genet. 40, 1023–1028 (2008)
CAS PubMed Google Scholar
Konishi, S. et al. An SNP caused loss of seed shattering during rice domestication. Science 312, 1392–1396 (2006)
ADS CAS PubMed Google Scholar
Wang, Z. Y. et al. The amylose content in rice endosperm is related to the post-transcriptional regulation of the waxy gene. Plant J. 7, 613–622 (1995)
CAS PubMed Google Scholar
Sweeney, M. T., Thomson, M. J., Pfeil, B. E. & McCouch, S. Caught red-handed: Rc encodes a basic helix-loop-helix protein conditioning red pericarp in rice. Plant Cell 18, 283–294 (2006)
CAS PubMed PubMed Central Google Scholar
Sweeney, M. T. et al. Global dissemination of a single mutation conferring white pericarp in rice. PLoS Genet. 3, e133 (2007)
PubMed PubMed Central Google Scholar
Hernandez, R. D. A flexible forward simulator for populations subjects to selection and demography. Bioinformatics 24, 2786–2787 (2008)
CAS PubMed PubMed Central Google Scholar
Huang, X. et al. High-throughput genotyping by whole-genome resequencing. Genome Res. 19, 1068–1076 (2009)
CAS PubMed PubMed Central Google Scholar
Chen, K. Y., Cong, B., Wing, R., Vrebalov, J. & Tanksley, S. D. Changes in regulation of a transcription factor lead to autogamy in cultivated tomatoes. Science 318, 643–645 (2007)
ADS CAS PubMed Google Scholar
Rieseberg, L. H. & Blackman, B. K. Speciation genes in plants. Ann. Bot. 106, 439–455 (2010)
CAS PubMed PubMed Central Google Scholar
Gan, X. et al. Multiple reference genomes and transcriptomes for Arabidopsis thaliana. Nature 477, 419–423 (2011)
ADS CAS PubMed PubMed Central Google Scholar
Schneeberger, K. et al. Reference-guided assembly of four diverse Arabidopsis thaliana genomes. Proc. Natl Acad. Sci. USA 108, 10249–10254 (2011)
ADS CAS PubMed Google Scholar
Lu, T. et al. Collection and comparative analysis of 1888 full-length cDNAs from wild rice Oryza rufipogon Griff. W1943. DNA Res. 15, 285–295 (2008)
CAS PubMed PubMed Central Google Scholar
Tang, H., Sezen, U. & Paterson, A. H. Domestication and plant genomes. Curr. Opin. Plant Biol. 13, 160–166 (2010)
CAS PubMed Google Scholar
Morrell, P. L., Buckler, E. S. & Ross-Ibarra, J. Crop genomics: advances and applications. Nature Rev. Genet. 13, 85–96 (2012)
CAS Google Scholar
Kozarewa, I. et al. Amplification-free Illumina sequencing-library preparation facilitates improved mapping and assembly of (G+C)-biased genomes. Nature Methods 6, 291–295 (2009)
CAS PubMed PubMed Central Google Scholar
Barrett, J. C., Fry, B., Maller, J. & Daly, M. J. Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics 21, 263–265 (2005)
CAS PubMed Google Scholar
Felsenstein, J. PHYLIP: phylogeny inference package (version 3.2). Cladistics 5, 164–166 (1989)
Google Scholar
Price, A. L. et al. Principal components analysis corrects for stratification in genome-wide association studies. Nature Genet. 38, 904–909 (2006)
CAS PubMed Google Scholar
Churchill, G. A. & Doerge, R. W. Empirical threshold values for quantitative trait mapping. Genetics 138, 963–971 (1994)
CAS PubMed PubMed Central Google Scholar
Sabeti, P. C. et al. Genome-wide detection and characterization of positive selection in human populations. Nature 449, 913–918 (2007)
ADS CAS PubMed PubMed Central Google Scholar
Zhang, Z. et al. Mixed linear model approach adapted for genome-wide association studies. Nature Genet. 42, 355–360 (2010)
CAS PubMed Google Scholar
Wang, S., Basten, C. J. & Zeng, Z. B. Windows QTL Cartographer 2.5. (Department of Statistics, North Carolina State Univ., 2007)
Mullikin, J. C. & Ning, Z. The phusion assembler. Genome Res. 13, 81–90 (2003)
CAS PubMed PubMed Central Google Scholar
de la Bastide, M. & McCombie, W. R. Assembling genomic DNA sequences with PHRAP. Curr. Protoc. Bioinformatics 17, 11.4.1–11.4.15 (2007)
Google Scholar
Wu, T. D. & Watanabe, C. K. GMAP: a genomic mapping and alignment program for mRNA and EST sequences. Bioinformatics 21, 1859–1875 (2005)
CAS PubMed Google Scholar
Kurtz, S. et al. Versatile and open software for comparing large genomes. Genome Biol. 5, R12 (2004)
PubMed PubMed Central Google Scholar
Salamov, A. A. & Solovyev, V. V. Ab initio gene finding in Drosophila genomic DNA. Genome Res. 10, 516–522 (2000)
CAS PubMed PubMed Central Google Scholar
Rice, P., Longden, I. & Bleasby, A. EMBOSS: the European Molecular Biology Open Software Suite. Trends Genet. 16, 276–277 (2000)
CAS PubMed Google Scholar

Download references

Acknowledgements

We thank the China National Rice Research Institute for providing all cultivated rice germplasm and Chinese wild rice accessions. The rest of the wild rice accessions were distributed from the Rice Collection of National Institute of Genetics jointly supported by the National Bioresource Project, MEXT, and Systems Functional Genetics Project of the Transdisciplinary Research Integration Center, ROIS, Japan. We thank Z. Ning, Y. Minobe and A. Osbourn for their advice and assistance in this work. This work was supported by the Ministry of Science and Technology of China (2011CB100205, 2012AA10A302 and 2012AA10A304), the Ministry of Agriculture of China (2011ZX08009-002 and 2011ZX08001-004), the National Natural Science Foundation of China (31121063) and the Chinese Academy of Sciences (to B.H.).

Author information

Xuehui Huang, Nori Kurata, Xinghua Wei and Zi-Xuan Wang: These authors contributed equally to this work.

Authors and Affiliations

National Center for Gene Research, Institute of Plant Physiology and Ecology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200233, China,
Xuehui Huang, Zi-Xuan Wang, Ahong Wang, Qiang Zhao, Yan Zhao, Kunyan Liu, Hengyun Lu, Wenjun Li, Yunli Guo, Yiqi Lu, Congcong Zhou, Danlin Fan, Qijun Weng, Chuanrang Zhu, Tao Huang, Lei Zhang, Yongchun Wang, Lei Feng, Qilin Zhan, Canyang Li, Tingting Lu, Qi Feng & Bin Han
Plant Genetics Laboratory and Comparative Genomics Laboratory, National Institute of Genetics, Mishima, 411-8540, Shizuoka, Japan
Nori Kurata, Zi-Xuan Wang, Hiroyasu Furuumi, Takahiko Kubo, Toshie Miyabayashi, Asao Fujiyama & Atsushi Toyoda
State Key Laboratory of Rice Biology, China National Rice Research Institute, Chinese Academy of Agricultural Sciences, Hangzhou 310006, China,
Xinghua Wei, Xiaoping Yuan, Qun Xu, Guojun Dong & Qian Qian
National Center for Plant Gene Research, State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences, Beijing 100101, China,
Jiayang Li
Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100029, China,
Bin Han

Authors

Xuehui Huang
View author publications
You can also search for this author in PubMed Google Scholar
Nori Kurata
View author publications
You can also search for this author in PubMed Google Scholar
Xinghua Wei
View author publications
You can also search for this author in PubMed Google Scholar
Zi-Xuan Wang
View author publications
You can also search for this author in PubMed Google Scholar
Ahong Wang
View author publications
You can also search for this author in PubMed Google Scholar
Qiang Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Yan Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Kunyan Liu
View author publications
You can also search for this author in PubMed Google Scholar
Hengyun Lu
View author publications
You can also search for this author in PubMed Google Scholar
Wenjun Li
View author publications
You can also search for this author in PubMed Google Scholar
Yunli Guo
View author publications
You can also search for this author in PubMed Google Scholar
Yiqi Lu
View author publications
You can also search for this author in PubMed Google Scholar
Congcong Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Danlin Fan
View author publications
You can also search for this author in PubMed Google Scholar
Qijun Weng
View author publications
You can also search for this author in PubMed Google Scholar
Chuanrang Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Tao Huang
View author publications
You can also search for this author in PubMed Google Scholar
Lei Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Yongchun Wang
View author publications
You can also search for this author in PubMed Google Scholar
Lei Feng
View author publications
You can also search for this author in PubMed Google Scholar
Hiroyasu Furuumi
View author publications
You can also search for this author in PubMed Google Scholar
Takahiko Kubo
View author publications
You can also search for this author in PubMed Google Scholar
Toshie Miyabayashi
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoping Yuan
View author publications
You can also search for this author in PubMed Google Scholar
Qun Xu
View author publications
You can also search for this author in PubMed Google Scholar
Guojun Dong
View author publications
You can also search for this author in PubMed Google Scholar
Qilin Zhan
View author publications
You can also search for this author in PubMed Google Scholar
Canyang Li
View author publications
You can also search for this author in PubMed Google Scholar
Asao Fujiyama
View author publications
You can also search for this author in PubMed Google Scholar
Atsushi Toyoda
View author publications
You can also search for this author in PubMed Google Scholar
Tingting Lu
View author publications
You can also search for this author in PubMed Google Scholar
Qi Feng
View author publications
You can also search for this author in PubMed Google Scholar
Qian Qian
View author publications
You can also search for this author in PubMed Google Scholar
Jiayang Li
View author publications
You can also search for this author in PubMed Google Scholar
Bin Han
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

B.H. conceived the project and its components. X.H. and B.H. designed studies and contributed to the original concept of the project. X.W., X.Y. and Q.X. contributed the collection of rice cultivars and Chinese wild rice accessions. N.K., H.F., T.K., T.M., A.F. and A.T. contributed the collection of other wild rice accessions and analysed geographical distributions of wild rice. Z.-X.W., A.W., Y.W., L.F., Qilin Z., C.L., G.D. and Q.Q. contributed in phenotyping of rice. Z.-X.W., A.W. and L.F. contributed in phenotyping of the backcross inbred line population and genetic mapping of domestication-related traits. W.L., Y.G., Y.L., C.Z., D.F., Q.W. and Q.F. performed the genome sequencing. X.H., Y.Z., K.L., C.Z., T.H., L.Z. and T.L. performed genome data analysis. Qiang Z. and H.L. performed de novo genome assembly. X.H. performed evolutionary study and genome annotation. X.H., Y.Z. and K.L performed GWAS, population genetics, and statistical analyses. J.L. contributed to functional analyses. X.H. and B.H. analysed whole data and wrote the paper.

Corresponding author

Correspondence to Bin Han.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Supplementary information

Supplementary Information

This file contains Supplementary Text 1-8, Supplementary References and Supplementary Tables 1, 3-6, and 8114 (see separate zipped file for Supplementary Tables 2, 7, 15, 16, 17 and 18. (PDF 458 kb)

Supplementary Figures

This file contains Supplementary Figures 1-34. (PDF 7906 kb)

Supplementary Tables

This file contains Supplementary Tables 2, 7 and 15-18. (ZIP 268 kb)

PowerPoint slides

PowerPoint slide for Fig. 1

PowerPoint slide for Fig. 2

PowerPoint slide for Fig. 3

PowerPoint slide for Fig. 4

Rights and permissions

This article is distributed under the terms of the Creative Commons Attribution-Non-Commercial-Share Alike licence (http://creativecommons.org/licenses/by-nc-sa/3.0/), which permits distribution, and reproduction in any medium, provided the original author and source are credited. This licence does not permit commercial exploitation, and derivative works must be licensed under the same or similar licence.

Reprints and permissions

About this article

Cite this article

Huang, X., Kurata, N., Wei, X. et al. A map of rice genome variation reveals the origin of cultivated rice. Nature 490, 497–501 (2012). https://doi.org/10.1038/nature11532

Download citation

Received: 18 January 2012
Accepted: 20 August 2012
Published: 03 October 2012
Issue Date: 25 October 2012
DOI: https://doi.org/10.1038/nature11532

This article is cited by

Genomic insight into the origin, domestication, dispersal, diversification and human selection of Tartary buckwheat
- Yuqi He
- Kaixuan Zhang
- Meiliang Zhou
Genome Biology (2024)
OsVPE2, a Member of Vacuolar Processing Enzyme Family, Decreases Chilling Tolerance of Rice
- Huabing Deng
- Sai Cao
- Xuedan Lu
Rice (2024)
Genomic variation in weedy and cultivated broomcorn millet accessions uncovers the genetic architecture of agronomic traits
- Qiong Lu
- Hainan Zhao
- Weibin Song
Nature Genetics (2024)
A mediator of OsbZIP46 deactivation and degradation negatively regulates seed dormancy in rice
- Naihui Guo
- Shengjia Tang
- Peisong Hu
Nature Communications (2024)
Dysfunction of duplicated pair rice histone acetyltransferases causes segregation distortion and an interspecific reproductive barrier
- Ben Liao
- You-Huang Xiang
- Hong-Xuan Lin
Nature Communications (2024)