*
aState Key Laboratory of Plant Cell and Chromosome Engineering,Institute of Genetics and Developmental Biology,Chinese Academy of Sciences,Beijing 100101,China
bCollege of Life Sciences,University of Chinese Academy of Sciences,Beijing 100049,China
Common wheat(Triticum aestivum L.),one of the important staple food crops in the world,feeds>30%of the human population[1].The world annual production of wheat is>620 million metric tons(http://www.fao.org/3/a-i4691e.pdf).China is the largest wheat producer and consumer.The Chinese annual wheat production is about 100 million tons[2].Wheat production needs to be constantly increased in order to satisfy the food demands of the increasing world.Accurate sequencing and assembly of wheat genomes are helpful for basic research and genetic improvement of wheat cultivars[3].
The different ecotypes of common wheat(such as winter wheat and spring wheat)adapt well to a wide range of climates.Wheat is an allohexaploid(AABBDD)that arose from two wide hybridization events.The first occurred 0.5–-3.0 million years ago between two diploid ancestral species carrying the A(T.urartu)and B(an unknown species)genomes and after chromosome doubling formed wild tetraploid wheat(Triticum turgidum ssp.dicoccoides,AABB).This species was domesticated to cultivated emmer(T.turgidum ssp.dicoccum,AABB)[4].The second hybridization tookplaceabout 9000 years ago between cultivated emmer and diploid goat grass(Aegilops tauschii,DD)to form allohexaploid common wheat[5].Compared with other major crops the common wheat genome is very large in size(~17 gigabases,Gb)with each subgenome being approximately 5.5 Gb[6]and complex in composition,of which>80%is made up of repetitive sequences.Therefore,sequencing and assembly of the entire wheat genome was very challenging.During the past several years many effort was made in this area,and a series of draft and near completed genome/chromosome assemblies for common wheat and its diploid and tetraploid progenitors have been generated in succession[7–16].In this review,we mainly focus on contributions made by Chinese scientists in the area of wheat genome sequencing following a brief summary of international achievements.
In order to decode the mystery of the wheat genome and to expedite molecular breeding in wheat,a group of scientists and breeders initiated the International Wheat Genome Sequencing Consortium(IWGSC)in 2005.To overcome the difficulties caused by genome size and complexities,the 21 chromosomes of common wheat landrace Chinese Spring were separated by flow cytometric sorting.Bacterial artificial chromosome(BAC)libraries and physical maps were then constructed for each chromosome or chromosome arm.Chromosome sorting,DNA isolation and BAC library construction for each chromosome arm were performed in the laboratory of Prof.Jaroslav Dolezel at the Institute of Experimental Botany in the Czech Republic.Subsequent physical map construction and BAC sequencing were assigned to different laboratories of the International Wheat Genome Sequencing Consortium (IWGSC). Numerous projects by different groups were undertaken to produce reference sequences of single chromosome or chromosome arms.Chromosome 3B was the first chromosome to be sorted successfully due to its large size.A 3B physical map was generated using BAC clones originating from the purified 3B chromosome in 2008[17].BAC clones were selected by a minimal tiling path(MTP)approach and sequenced.The final pseudomolecule of 3B was 774 megabases(Mb)in length and carried 5326 protein-coding genes[13].Currently,all chromosomes/chromosome arms of Chinese Spring have been sorted and their physical maps have been constructed(http://www.wheatgenome.org/Projects/IWGSC-Bread-Wheat-Projects).
Sequences of many chromosomes,or parts thereof,are publicly available,including 1AS,1BS,3DS,5DS,7DS,1AL,1BL,4A,5A,6A,6B,and 7B[16,18–29].
In addition to the chromosome-based BAC-by-BAC sequencing strategy of IWGSC,Hall and colleagues in the UK applied a whole genome shotgun sequencing strategy with 454 pyro-sequencing technology to sequence Chinese Spring,and produced a five-fold coverage genome sequence of Chinese Spring in 2012.Based on the assemblies of 5.42 Gb,they predicted 94,000 to 96,000 genes,and assigned two-thirds of them to the three subgenomes(A,B,and D).The authors indicated that gene families were pronouncedly reduced in common wheat compared to the diploid progenitors,[7].Two years after the first wheat genome release,IWGSC published a chromosome-based draft sequence of Chinese Spring[1].Compared to the whole genome shotgun sequencing strategy,this approach differentiated the highly conserved gene copies in each chromosome.
The 21 chromosomes of Chinese Spring were isolated by flow cytometric sorting and sequenced by a chromosome-based shotgun sequencing strategy using Illumina technology to yield 10.2 Gb of genome sequence.Abundant gene losses and duplications were observed by intra-and inter-specific comparisons,indicating that the wheat genome was somewhat dynamic in evolution[1].In 2017,Clarketal.[11]published an improved genome sequence of Chinese Spring.They used precisely sized mate-pair libraries and an optimized algorithm to generate a new assembly representing>78%of the genome,much higher than the scaffold proportion(~49%)produced previously by IWGSC.Genome-wide sequence rearrangements were revealed based on comparative analysis of the data.Zimin et al.[12]reported a more complete wheat genome assembly.The final sequences were generated by combining next generation(short Illumina reads)and third generation sequencing data(long Pacific Biosciences reads).>15 Gb of final assembly representing>90%of the Chinese Spring genome was created by merging two sets of sequences assembled using the MaSuRCA[30]and FALCON assemblers[31].This is the most complete wheat genome sequence published by far.Recently,IWGSC announced that they have completed a high quality sequence of Chinese Spring(IWGSC v1.0)and released the genomic data for public access (http://www.wheatgenome.org/News/Latest-news/RefSeq-v1.0-URGI).
In addition to sequencing Chinese Spring at the genome and single chromosome level,sequencing of wild emmer,the tetraploid ancestor of common wheat,was reported in July 2017[10].A software package Denovo MAGIC2(NRGene,NesZiona,Israel)was applied to perform thescaffold assembly from short Illumina sequencing reads.The software took advantage of improvements in throughput and read length of Illumina sequencing technology capable of completing challenging assemblies within days.About 10 Gb sequence of wild emmer genome was obtained by using whole genome shotgun sequencing of various insert-size libraries.The quality of the assembly was further validated by genetic data and three-dimensional chromosome conformation capture sequencing(Hi-C)data[32].Decoding the genome of the tetraploid ancestor will help in understanding the evolution of common wheat.
Chinese scientists have made considerable contributions in wheat genome sequencing.They were first achieved to generate drafts of the A and D genomes in the diploid progenitors in 2013[8,9].These provided good reference information for generating the polyploid wheat genome assembly and analysis.
As described above,common wheat evolved from three diploid progenitors by two wide hybridization events.The diploid progenitors provide important foundations for understanding the evolution and domestication of hexaploid wheat.The progenitor species of the wheat A genome is T.urartu(wild einkorn)with an estimated genome size of 4.94 Gb.Ling et al.applied a whole genome shotgun sequencing strategy using an Illumina HiSequation(2000)sequencing platform,and successfully generated the draft genome of T.urartu with 4.66 Gb,accounting for 94%of the estimated genome size[9].The sequencing and assembly process was as follows:first,paired-end sequencing was performed to obtain genomic DNA sequence data from 57 DNA libraries with different insertion sizes.Then,low-quality,redundant and contaminated reads were removed to generate~91-fold coverage(448.49 Gb) high quality sequence data. Finally,the SOAPdenovo(version 1.05;http://soap.genomics.org.cn/)assembler was used to obtain 3.92 Gb of contigs with N50 size of 3.42 kilobases(kb)and the 4.66 Gb genome assembly with a scaffold N50 size of 63.69 kb.Assembly quality and coverage were validated using previously published BAC and expressed sequence tag(EST)sequences by PCR amplification.
About 67%of the assembly was annotated as repetitive elements,of which the most abundant were long terminal repeat retrotransposons (49.07%). In total, 34,879 protein-coding gene models were predicted by analyzing>100 Mb of transcriptomes generated with the HiSequation and Roche 454 sequencing platform,together with publicly available ESTs from hexaploid wheat and related grass genomes[33–37].Average gene size was 3207 bp and exon number per gene was 4.7.Compared with 28,000 genes predicated for the A subgenome of hexaploid wheat[7]6800 more genes were identified in T.urartu.Although partly caused by different annotation methods the difference indicated extensive gene loss in the A subgenome of hexaploid wheat.Comparative analysis also revealed that there was a specific expansion of resistant genes in the T.urartu genome[9].Furthermore,the authors found that the large genome size of T.urartu compared to Brachypodium distachyon was caused by greatly increased intergenic spaces enriched with Gypsy and Copia retrotransposons,and firstly provided genome-scale evidence for the role of repeat expansion in genome size enlargement during the evolution of the tribe Triticeae[9].The T.urartu draft genome sequence will enable discovery of agronomic important genes and the development of genetic markers for molecular breeding.For example,TuGASR7 was shown to be a homologue of a grain length control gene in rice.Two haplotypes of TuGASR7(H1 and H2)were identified among 92 T.urartu accessions and H1 was significantly associated with long grain[9]and potentially higher yield if transferred to common wheat.
Genome sequencing of Ae.tauschii,the diploid progenitor of wheat D subgenome,was also reported by Chinese scientists[8].A whole genome shotgun sequencing strategy combined with paired-end Illumina sequencing was used to produce 557.55 Gb of raw data.Then low quality,adaptor-contaminated and PCR-duplicated reads were filtered to obtain a 378.86 Gb of high quality dataset.The Illumina reads were assembled by SOAPdenovo(version 1.05;http://soap.genomics.org.cn/)to yield 3.53 Gb of contigs with a N50 size of 4.5 kb.The assembled contigs were linked to a scaffold based on paired-end reads and an additional 18.4 Gb of 454 pyro-sequencing long reads.In total,4.23 Gb of scaffolds were achieved with a N50 size of 57.59 kb,representing 97%of the 4.36 Gb estimated genome size[8].The quality of the draft genome was evaluated by comparison with ESTs from two full-length cDNA libraries from leaf and root tissue of Ae.tauschii.About 91%of EST sequences were mapped to the scaffolds with>90%coverage.
>65.9%of the Ae.tauschii genome was annotated as repetitive DNA with the assemblies,[8].A large scale Ae.tauschii genome extension caused by a burst of retrotransposons was dated to have occurred about 3–-4 million years ago based on the insertion date of the assembled LTR retrotransposons.Jia et al.generated 117 Mb transcriptomes involving eight tissues to facilitate gene annotation[8].Using both evidence-based and de novo gene prediction methods,34,498 high confidence and 8652 low confidence protein-coding genes were identified.The latter have incomplete gene structure or limited expression data support.The average gene size was 1203 bp and exon number per gene was 4.9.About 1.72 Gb of scaffold sequences,comprising 30,697(71.1%)protein-coding genes,were aligned to chromosomes based on a genetic map constructed from an F2population of 490 individuals from a cross between Ae.tauschii accessions Y2280 and Al8/78.Genome-wide analysis revealed expansion of several agronomical relevant gene families,such as NBS-LRR and cytochrome P450 genes.
Researchers from China Agricultural University collaborating with several research groups in USA reported a high quality sequence of the short arm of Ae.tauschii chromosome 3D in 2017[15].To overcome the limitations of the whole genome shotgun sequencing strategy the authors used a BAC-by-BAC strategy to sequence 3176 BAC clones selected by MTP based on a previously reported Ae.tauschii physical map[38].These MTP clones were genetically anchored to the short arm of Ae.tauschii chromosome 3D,or co-assembled with the 3DS physical map of the common wheat.BAC DNA was used to construct 3 kb paired-end libraries and sequenced using the Roche GS FLX Titanium XL Chemistry protocol.Sequenced reads were assembled using the Roche 454 gs Assembler V2.6 package[39]and scaffolded by adding 3–5 kb mate-pair reads using the Consed package[40].The assembled sequences were merged according to MTP BAC order.A final assembly contained 689 scaffolds with a N50 of 766 kb and total length of 293 Mb.Two genetic maps[8,38]and a Radiation-Hybrid(RH)map[41]on3DS were used to build an At3DS pseudo molecule.In total,612 scaffolds(247 Mb)were anchored on to the pseudo molecule,covering 90%of the At3DS arm sequence.
>81%of the At3DS pseudo molecule was annotated as transposable elements;this was higher than previously observated on the Ae. tauschii genome [8].Pericentromeric-centromeric region were localized from 170 to 247 Mb by plotting the density of the Cereba and Quinta repeat families.About 88%(1873)of the annotated high confidence protein-coding genes(2124)and 52%(51)of low confidence protein-coding genes(98)were expressed in Ae.tauschii.Average gene length was 3821 bp,which was longer than previous reports for the Ae.tauschii genome[8],and average exon number per gene was 4.3.At3DS was further compared with homologous segments in related grasses to reveal rapid evolution of Triticeae genomes.
Past studies on wheat genome sequencing in China were mainly focused on whole genome or single chromosome arm of diploid ancestors.The most frequently used methods were second generation sequencing combined with whole genome shotgun or BAC-by-BAC strategies.The data produced have short read-lengths(<300 bp)and biased genome coverage,resulting in fragmented and incomplete genome assemblies,especially in genomic regions with complex repeat structures.A new sequencing technology,single molecule real-time(SMRT)sequencing technology(Pacific Biosciences,USA),produces significantly longer sequence reads of up to 40–50 kb.This platform enables production of genome sequences with fewer gaps and longer contigs even for large and complicated genomes[42].Two additional next-generation mapping technologies,BioNano genome mapping and 10×Genomics linked reads,are being increasingly applied in genome sequencing to develop high quality assemblies[43–46].Moreover,a method described as Hi-C can be used to determine the three-dimensional architecture of chromosomes,providing insights into chromatin structure.Hi-C technology is able to validate genome assembly and scaffold order on chromosomes[32].
Combining these new technologies,our laboratory successfully generated high quality genome assemblies of T.urartu(in preparation for publication).We completed the genome sequencing using a BAC-by-BAC strategy combined with the SMRT sequencing technology and the new BioNano genome map and 10x Genomics linked reads mapping technologies.The assembly pipeline was briefly described as follows:Firstly,the Illumina clean reads in each BAC pool were separately assembled.Then,the sequence contigs of each BAC were connected using the best aligned PacBio reads.Thirdly,the BAC sequences were iteratively connected into FPC contigs based on the MTP physical map.Finally,the FPC contigs were merged into scaffolds by referring to BioNano consensus map,10x Genomics linked reads and mate-pair sequences.The assembly pipeline significantly increased the scaffold length and accuracy compared to the first version of the A genome sequence,where short reads sequencing data combined with whole genome shotgun sequencing were applied.Given the previously estimated genome size of 4.94 Gb,our new assembly accounts for 98.4%of the T.urartu genome.
As described above,the company NRGene(NesZiona,Isreal)developed the Denovo MAGIC2 assembler for large and complicated genomes such as wheat.It enables assembles of Illumina short reads into large scaffolds with N50 up to several Mb[10].Using this software,we completed the genome sequencing and assembly of Chinese bread wheat cultivar Kenong 9204.At the same time Prof.Jizeng Jia and colleagues at the Chinese Academy of Agricultural Sciences generated high quality Ae. tauschii genome assemblies,comprising of large scaffolds with a N50 size of 14.1 Mb[47].
In general,the advent of new sequencing,mapping and assembly technologies will greatly facilitate the productions of high-quality genome sequences of Triticeae,and the genomes of various wheat species and cultivars will be sequenced and released for public access.These will strongly enhance systematic study of the genetics,comparative genomics and evolution of wheat,and will expedite isolation and characterization of genes controlling agronomical important traits,such as yield and resistance to biotic and abiotic stress.The complete genome sequence of common wheat and its progenitors will greatly assist molecular breeding of wheat and thereby contribute in meeting the challenges of food security and sustainable agriculture.
This work was supported by the Chinese Academy of Sciences(QYZDJ-SSW-SMC001)and the National Key Research and Development Program of China(2016YFD0101004).
R E F E R E N C E S
[1]International Wheat Genome Sequencing Consortium,A chromosome-based draft sequence of the hexaploid bread wheat(Triticum aestivum)genome,Science 345(2014)1251788.
[2]H.Q.Ling,Progress and perspectives of the genome sequencing in wheat and its relatives,J.Triticeae Crops 36(2016)397–403(in Chinese with English abstract).
[3]M.W.Bevan,C.Uauy,B.B.Wulff,J.Zhou,K.Krasileva,M.D.Clark,Genomic innovation for crop improvement,Nature 543(2017)346–354.
[4]J.Dubcovsky,J.Dvorak,Genome plasticity a key factor in the success of polyploid wheat under domestication,Science 316(2007)1862–1866.
[5]J.H.Peng,D.F.Sun,E.Nevo,Domestication evolution,genetics and genomics in wheat,Mol.Breed.8(2011)281–301.
[6]T.Marcussen,S.R.Sandve,L.Heier,M.Spannagl,M.Pfeifer,International wheat genome sequencing consortium,in:K.S.Jakobsen,B.B.Wulff,B.Steuernagel,K.F.Mayer,O.A.Olsen(Eds.),Ancient Hybridizations Among the Ancestral Genomes of Bread Wheat,Science,345,2014,p.1250092.
[7]R.Brenchley,M.Spannagl,M.Pfeifer,G.L.Barker,R.D'Amore,A.M.Allen,N.McKenzie,M.Kramer,A.Kerhornou,D.Bolser,S.Kay,D.Waite,M.Trick,I.Bancroft,Y.Gu,N.Huo,M.C.Luo,S.Sehgal,B.Gill,S.Kianian,O.Anderson,P.Kersey,J.Dvorak,W.R.McCombie,A.Hall,K.F.Mayer,K.J.Edwards,M.W.Bevan,N.Hall,Analysis of the bread wheat genome using whole-genome shotgun sequencing,Nature 491(2012)705–710.
[8]J.Jia,S.Zhao,X.Kong,Y.Li,G.Zhao,W.He,R.Appels,M.Pfeifer,Y.Tao,X.Zhang,R.Jing,C.Zhang,Y.Ma,L.Gao,C.Gao,M.Spannagl,K.F.Mayer,D.Li,S.Pan,F.Zheng,Q.Hu,X.Xia,J.Li,Q.Liang,J.Chen,T.Wicker,C.Gou,H.Kuang,G.He,Y.Luo,B.Keller,Q.Xia,P.Lu,J.Wang,H.Zou,R.Zhang,J.Xu,J.Gao,C.Middleton,Z.Quan,G.Liu,J.Wang,International Wheat Genome Sequencing Consortium,H.Yang,X.Liu,Z.He,L.Mao,J.Wang,Aegilops tauschii draft genome sequence reveals a gene repertoire for wheat adaptation,Nature 496(2013)91–95.
[9]H.Q.Ling,S.Zhao,D.Liu,J.Wang,H.Sun,C.Zhang,H.Fan,D.Li,L.Dong,Y.Tao,C.Gao,H.Wu,Y.Li,Y.Cui,X.Guo,S.Zheng,B.Wang,K.Yu,Q.Liang,W.Yang,X.Lou,J.Chen,M.Feng,J.Jian,X.Zhang,G.Luo,Y.Jiang,J.Liu,Z.Wang,Y.Sha,B.Zhang,H.Wu,D.Tang,Q.Shen,P.Xue,S.Zou,X.Wang,X.Liu,F.Wang,Y.Yang,X.An,Z.Dong,K.Zhang,X.Zhang,M.C.Luo,J.Dvorak,Y.Tong,J.Wang,H.Yang,Z.Li,D.Wang,A.Zhang,J.Wang,Draft genome of the wheat A-genome progenitor Triticum urartu,Nature 496(2013)87–90.
[10]R.Avni,M.Nave,O.Barad,K.Baruch,S.O.Twardziok,H.Gundlach,I.Hale,M.Mascher,M.Spannagl,K.Wiebe,K.W.Jordan,G.Golan,J.Deek,B.Ben-Zvi,G.Ben-Zvi,A.Himmelbach,R.P.MacLachlan,A.G.Sharpe,A.Fritz,R.Ben-David,H.Budak,T.Fahima,A.Korol,J.D.Faris,A.Hernandez,M.A.Mikel,A.A.Levy,B.Steffenson,M.Maccaferri,R.Tuberosa,L.Cattivelli,P.Faccioli,A.Ceriotti,K.Kashkush,M.Pourkheirandish,T.Komatsuda,T.Eilam,H.Sela,A.Sharon,N.Ohad,D.A.Chamovitz,K.F.X.Mayer,N.Stein,G.Ronen,Z.Peleg,C.J.Pozniak,E.D.Akhunov,A.Distelfeld,Wild emmer genome architecture and diversity elucidate wheat evolution and domestication,Science 357(2017)93–97.
[11]B.J.Clavijo,L.Venturini,C.Schudoma,G.G.Accinelli,G.Kaithakottil,J.Wright,P.Borrill,G.Kettleborough,D.Heavens,H.Chapman,J.Lipscombe,T.Barker,F.H.Lu,N.McKenzie,D.Raats,R.H.Ramirez-Gonzalez,A.Coince,N.Peel,L.Percival-Alwyn,O.Duncan,J.Trosch,G.Yu,D.M.Bolser,G.Namaati,A.Kerhornou,M.Spannagl,H.Gundlach,G.Haberer,R.P.Davey,C.Fosker,F.D.Palma,A.L.Phillips,A.H.Millar,P.J.Kersey,C.Uauy,K.V.Krasileva,D.Swarbreck,M.W.Bevan,M.D.Clark,An improved assembly and annotation of the allohexaploid wheat genome identifies complete families of agronomic genes and provides genomic evidence for chromosomal translocations,Genome Res.27(2017)885–896.
[12]A.V.Zimin,D.Puiu,R.Hall,S.Kingan,S.L.Salzberg,The first near-complete assembly of the hexaploid bread wheat genome,Triticum aestivum,bioRxiv(2017)https://doi.org/10.1101/159111.
[13]F.Choulet,A.Alberti,S.Theil,N.Glover,V.Barbe,J.Daron,L.Pingault,P.Sourdille,A.Couloux,E.Paux,P.Leroy,S.Mangenot,N.Guilhot,J.Le Gouis,F.Balfourier,M.Alaux,V.Jamilloux,J.Poulain,C.Durand,A.Bellec,C.Gaspin,J.Safar,J.Dolezel,J.Rogers,K.Vandepoele,J.M.Aury,K.Mayer,H.Berges,H.Quesneville,P.Wincker,C.Feuillet,Structural and functional partitioning of bread wheat chromosome 3B,Science 345(2014)1249721.
[14]M.Liu,J.Stiller,K.Holusova,J.Vrana,D.Liu,J.Dolezel,C.Liu,Chromosome-specific sequencing reveals an extensive dispensable genome component in wheat,Sci.Rep.6(2016)36398.
[15]J.Xie,N.Huo,S.Zhou,Y.Wang,G.Guo,K.R.Deal,S.Ouyang,Y.Liang,Z.Wang,L.Xiao,T.Zhu,T.Hu,V.Tiwari,J.Zhang,H.Li,Z.Ni,Y.Yao,H.Peng,S.Zhang,O.D.Anderson,P.E.McGuire,J.Dvorak,M.C.Luo,Z.Liu,Y.Q.Gu,Q.Sun,Sequencing and comparative analyses of Aegilops tauschii chromosome arm 3DS reveal rapid evolution of Triticeae genomes,J.Genet.Genomics 44(2017)51–61.
[16]K.Holusova,J.Vrana,J.Safar,H.Simkova,B.Balcarkova,Z.Frenkel,B.Darrier,E.Paux,F.Cattonaro,H.Berges,T.Letellier,M.Alaux,J.Dolezel,J.Bartos,Physical map of the short arm of bread wheat chromosome 3D,Plant Genome 10(2017)1.
[17]E.Paux,P.Sourdille,J.Salse,C.Saintenac,F.Choulet,P.Leroy,A.Korol,M.Michalak,S.Kianian,W.Spielmeyer,E.Lagudah,D.Somers,A.Kilian,M.Alaux,S.Vautrin,H.Berges,K.Eversole,R.Appels,J.Safar,H.Simkova,J.Dolezel,M.Bernard,C.Feuillet,A physical map of the 1-gigabase bread wheat chromosome 3B,Science 322(2008)101–104.
[18]P.Hernandez,M.Martis,G.Dorado,M.Pfeifer,S.Galvez,S.Schaaf,N.Jouve,H.Simkova,M.Valarik,J.Dolezel,K.F.Mayer,Next-generation sequencing and syntenic integration of flow-sorted arms of wheat chromosome 4A exposes the chromosome structure and gene content,Plant J.69(2012)377–386.
[19]J.Breen,T.Wicker,M.Shatalina,Z.Frenkel,I.Bertin,R.Philippe,W.Spielmeyer,H.Simkova,J.Safar,F.Cattonaro,S.Scalabrin,F.Magni,S.Vautrin,H.Berges,International wheat genome sequencing consortium,in:E.Paux,T.Fahima,J.Dolezel,A.Korol,C.Feuillet,B.Keller(Eds.),A Physical Map of the Short Arm of Wheat Chromosome 1A,PLoS One,8,2013,p.e80272.
[20]S.J.Lucas,B.A.Akpinar,M.Kantar,Z.Weinstein,F.Aydinoglu,J.Safar,H.Simkova,Z.Frenkel,A.Korol,F.Magni,F.Cattonaro,S.Vautrin,A.Bellec,H.Berges,J.Dolezel,H.Budak,Physical mapping integrated with syntenic analysis to characterize the gene space of the long arm of wheat chromosome 1A,PLoS One 8(2013),e59542..
[21]T.Belova,L.Gronvold,A.Kumar,S.Kianian,X.He,M.Lillemo,N.M.Springer,S.Lien,O.A.Olsen,S.R.Sandve,Utilization of deletion bins to anchor and order sequences along the wheat 7B chromosome,Theor.Appl.Genet.127(2014)2029–2040.
[22]N.Poursarebani,T.Nussbaumer,H.Simkova,J.Safar,H.Witsenboer,J.van Oeveren,J.Dolezel,K.F.Mayer,N.Stein,T.Schnurbusch,Whole-genome profiling and shotgun sequencing delivers an anchored,gene-decorated,physical map assembly of bread wheat chromosome 6A,Plant J.79(2014)334–347.
[23]T.Tanaka,F.Kobayashi,G.P.Joshi,R.Onuki,H.Sakai,H.Kanamori,J.Wu,H.Simkova,S.Nasuda,T.R.Endo,K.Hayakawa,J.Dolezel,Y.Ogihara,T.Itoh,T.Matsumoto,H.Handa,Next-generation survey sequencing and the molecular organization of wheat chromosome 6B,DNA Res.21(2014)103–114.
[24]F.Kobayashi,J.Wu,H.Kanamori,T.Tanaka,S.Katagiri,W.Karasawa,S.Kaneko,S.Watanabe,T.Sakaguchi,Y.Hanawa,H.Fujisawa,K.Kurita,C.Abe,J.C.Iehisa,R.Ohno,J.Safar,H.Simkova,Y.Mukai,M.Hamada,M.Saito,G.Ishikawa,Y.Katayose,T.R.Endo,S.Takumi,T.Nakamura,K.Sato,Y.Ogihara,K.Hayakawa,J.Dolezel,S.Nasuda,T.Matsumoto,H.Handa,A high-resolution physical map integrating an anchored chromosome with the BAC physical maps of wheat chromosome 6B,BMC Genomics 16(2015)595.
[25]B.A.Akpinar,F.Magni,M.Yuce,S.J.Lucas,H.Simkova,J.Safar,S.Vautrin,H.Berges,F.Cattonaro,J.Dolezel,H.Budak,The physical map of wheat chromosome 5DS revealed gene duplications and small rearrangements,BMC Genomics 16(2015)453.
[26]H.Stankova,A.R.Hastie,S.Chan,J.Vrana,Z.Tulpova,M.Kubalakova,P.Visendi,S.Hayashi,M.Luo,J.Batley,D.Edwards,J.Dolezel,H.Simkova,BioNano genome mapping of individual chromosomes supports physical mapping and sequence assembly in complex plant genomes,Plant Biotechnol.J.14(2016)1523–1531.
[27]R.Philippe,E.Paux,I.Bertin,P.Sourdille,F.Choulet,C.Laugier,H.Simkova,J.Safar,A.Bellec,S.Vautrin,Z.Frenkel,F.Cattonaro,F.Magni,S.Scalabrin,M.M.Martis,K.F.Mayer,A.Korol,H.Berges,J.Dolezel,C.Feuillet,A high density physical map of chromosome 1BL supports evolutionary studies,map-based cloning and sequencing in wheat,Genome Biol.14(2013)R64.
[28]D.Raats,Z.Frenkel,T.Krugman,I.Dodek,H.Sela,H.Simkova,F.Magni,F.Cattonaro,S.Vautrin,H.Berges,T.Wicker,B.Keller,P.Leroy,R.Philippe,E.Paux,J.Dolezel,C.Feuillet,A.Korol,T.Fahima,The physical map of wheat chromosome 1BS provides insights into its gene space organization and evolution,Genome Biol.14(2013)R138.
[29]D.Barabaschi,F.Magni,A.Volante,A.Gadaleta,H.?imková,S.Scalabrin,M.L.Prazzoli,P.Bagnaresi,K.Lacrima,V.Michelotti,F.Desiderio,L.Orru,V.Mazzamurro,A.Fricano,A.M.Mastrangelo,P.Tononi,N.Vitulo,I.Jurman,Z.Frenkel,F.Cattonaro,M.Morgante,A.Blanco,J.Dole?el,M.Delledonne,A.M.Stanca,L.Cattivelli,G.Vale,Physical mapping of bread wheat chromosome 5A:an integrated approach,Plant Genome 8(2015)24.
[30]A.V.Zimin,G.Marcais,D.Puiu,M.Roberts,S.L.Salzberg,J.A.Yorke,The MaSuRCA genome assembler,Bioinformatics 29(2013)2669–2677.
[31]C.S.Chin,P.Peluso,F.J.Sedlazeck,M.Nattestad,G.T.Concepcion,A.Clum,C.Dunn,R.O'Malley,R.Figueroa-Balderas,A.Morales-Cruz,G.R.Cramer,M.Delledonne,C.Luo,J.R.Ecker,D.Cantu,D.R.Rank,M.C.Schatz,Phased diploid genome assembly with single-molecule real-time sequencing,Nat.Methods 13(2016)1050–1054.
[32]E.Lieberman-Aiden,N.L.van Berkum,L.Williams,M.Imakaev,T.Ragoczy,A.Telling,I.Amit,B.R.Lajoie,P.J.Sabo,M.O.Dorschner,R.Sandstrom,B.Bernstein,M.A.Bender,M.Groudine,A.Gnirke,J.Stamatoyannopoulos,L.A.Mirny,E.S.Lander,J.Dekker,Comprehensive mapping of long-range interactions reveals folding principles of the human genome,Science 326(2009)289–293.
[33]International Rice Genome Sequencing Project,The mapbased sequence of the rice genome,Nature 436(2005)793–800.
[34]P.S.Schnable,D.Ware,R.S.Fulton,J.C.Stein,F.Wei,S.Pasternak,C.Liang,J.Zhang,L.Fulton,T.A.Graves,P.Minx,A.D.Reily,L.Courtney,S.S.Kruchowski,C.Tomlinson,C.Strong,K.Delehaunty,C.Fronick,B.Courtney,S.M.Rock,E.Belter,F.Du,K.Kim,R.M.Abbott,M.Cotton,A.Levy,P.Marchetto,K.Ochoa,S.M.Jackson,B.Gillam,W.Chen,L.Yan,J.Higginbotham,M.Cardenas,J.Waligorski,E.Applebaum,L.Phelps,J.Falcone,K.Kanchi,T.Thane,A.Scimone,N.Thane,J.Henke,T.Wang,J.Ruppert,N.Shah,K.Rotter,J.Hodges,E.Ingenthron,M.Cordes,S.Kohlberg,J.Sgro,B.Delgado,K.Mead,A.Chinwalla,S.Leonard,K.Crouse,K.Collura,D.Kudrna,J.Currie,R.He,A.Angelova,S.Rajasekar,T.Mueller,R.Lomeli,G.Scara,A.Ko,K.Delaney,M.Wissotski,G.Lopez,D.Campos,M.Braidotti,E.Ashley,W.Golser,H.Kim,S.Lee,J.Lin,Z.Dujmic,W.Kim,J.Talag,A.Zuccolo,C.Fan,A.Sebastian,M.Kramer,L.Spiegel,L.Nascimento,T.Zutavern,B.Miller,C.Ambroise,S.Muller,W.Spooner,A.Narechania,L.Ren,S.Wei,S.Kumari,B.Faga,M.J.Levy,L.McMahan,P.Van Buren,M.W.Vaughn,K.Ying,C.T.Yeh,S.J.Emrich,Y.Jia,A.Kalyanaraman,A.P.Hsia,W.B.Barbazuk,R.S.Baucom,T.P.Brutnell,N.C.Carpita,C.Chaparro,J.M.Chia,J.M.Deragon,J.C.Estill,Y.Fu,J.A.Jeddeloh,Y.Han,H.Lee,P.Li,D.R.Lisch,S.Liu,Z.Liu,D.H.Nagel,M.C.McCann,P.SanMiguel,A.M.Myers,D.Nettleton,J.Nguyen,B.W.Penning,L.Ponnala,K.L.Schneider,D.C.Schwartz,A.Sharma,C.Soderlund,N.M.Springer,Q.Sun,H.Wang,M.Waterman,R.Westerman,T.K.Wolfgruber,L.Yang,Y.Yu,L.Zhang,S.Zhou,Q.Zhu,J.L.Bennetzen,R.K.Dawe,J.Jiang,N.Jiang,G.G.Presting,S.R.Wessler,S.Aluru,R.A.Martienssen,S.W.Clifton,W.R.McCombie,R.A.Wing,R.K.Wilson,The B73 maize genome:complexity,diversity,and dynamics,Science 326(2009)1112–1115.
[35]A.H.Paterson,J.E.Bowers,R.Bruggmann,I.Dubchak,J.Grimwood,H.Gundlach,G.Haberer,U.Hellsten,T.Mitros,A.Poliakov,J.Schmutz,M.Spannagl,H.Tang,X.Wang,T.Wicker,A.K.Bharti,J.Chapman,F.A.Feltus,U.Gowik,I.V.Grigoriev,E.Lyons,C.A.Maher,M.Martis,A.Narechania,R.P.Otillar,B.W.Penning,A.A.Salamov,Y.Wang,L.Zhang,N.C.Carpita,M.Freeling,A.R.Gingle,C.T.Hash,B.Keller,P.Klein,S.Kresovich,M.C.McCann,R.Ming,D.G.Peterson,D.Ware Mehboob-ur-Rahman,P.Westhoff,K.F.Mayer,J.Messing,D.S.Rokhsar,The Sorghum bicolor genome and the diversification of grasses,Nature 457(2009)551–556.
[36]International Brachypodium Initiative,Genome sequencing and analysis of the model grass Brachypodium distachyon,Nature 463(2010)763–768.
[37]International Barley Genome Sequencing Consortium,K.F.Mayer,R.Waugh,J.W.Brown,A.Schulman,P.Langridge,M.Platzer,G.B.Fincher,G.J.Muehlbauer,K.Sato,T.J.Close,R.P.Wise,N.Stein,A physical,genetic and functional sequence assembly of the barley genome,Nature 491(2012)711–716.
[38]M.C.Luo,Y.Q.Gu,F.M.You,K.R.Deal,Y.Ma,Y.Hu,N.Huo,Y.Wang,J.Wang,S.Chen,C.M.Jorgensen,Y.Zhang,P.E.McGuire,S.Pasternak,J.C.Stein,D.Ware,M.Kramer,W.R.McCombie,S.F.Kianian,M.M.Martis,K.F.Mayer,S.K.Sehgal,W.Li,B.S.Gill,M.W.Bevan,H.Simkova,J.Dolezel,S.Weining,G.R.Lazo,O.D.Anderson,J.Dvorak,A 4-gigabase physical map unlocks the structure and evolution of the complex genome of Aegilops tauschii,the wheat D-genome progenitor,Proc.Natl.Acad.Sci.U.S.A.110(2013)7940–7945.
[39]M.Margulies,M.Egholm,W.E.Altman,S.Attiya,J.S.Bader,L.A.Bemben,J.Berka,M.S.Braverman,Y.J.Chen,Z.Chen,S.B.Dewell,L.Du,J.M.Fierro,X.V.Gomes,B.C.Godwin,W.He,S.Helgesen,C.H.Ho,G.P.Irzyk,S.C.Jando,M.L.Alenquer,T.P.Jarvie,K.B.Jirage,J.B.Kim,J.R.Knight,J.R.Lanza,J.H.Leamon,S.M.Lefkowitz,M.Lei,J.Li,K.L.Lohman,H.Lu,V.B.Makhijani,K.E.McDade,M.P.McKenna,E.W.Myers,E.Nickerson,J.R.Nobile,R.Plant,B.P.Puc,M.T.Ronan,G.T.Roth,G.J.Sarkis,J.F.Simons,J.W.Simpson,M.Srinivasan,K.R.Tartaro,A.Tomasz,K.A.Vogt,G.A.Volkmer,S.H.Wang,Y.Wang,M.P.Weiner,P.Yu,R.F.Begley,J.M.Rothberg,Genome sequencing in microfabricated high-density picolitre reactors,Nature 437(2005)376–380.
[40]D.Gordon,P.Green,Consed:a graphical editor for nextgeneration sequencing,Bioinformatics 29(2013)2936–2937.
[41]A.Kumar,R.Seetan,M.Mergoum,V.K.Tiwari,M.J.Iqbal,Y.Wang,O.Al-Azzam,H.Simkova,M.C.Luo,J.Dvorak,Y.Q.Gu,A.Denton,A.Kilian,G.R.Lazo,S.F.Kianian,Radiation hybrid maps of the D-genome of Aegilops tauschii and their application in sequence assembly of large and complex plant genomes,BMC Genomics 16(2015)800.
[42]M.Ferrarini,M.Moretto,J.A.Ward,N.Surbanovski,V.Stevanovic,L.Giongo,R.Viola,D.Cavalieri,R.Velasco,A.Cestaro,D.J.Sargent,An evaluation of the PacBio RS platform for sequencing and de novo assembly of a chloroplast genome,BMC Genomics 14(2013)670.
[43]K.M.Moll,P.Zhou,T.Ramaraj,D.Fajardo,N.P.Devitt,M.J.Sadowsky,R.M.Stupar,P.Tiffin,J.R.Miller,N.D.Young,K.A.T.Silverstein,J.Mudge,Strategies for optimizing BioNano and Dovetail explored through a second reference quality assembly for the legume model,Medicago truncatula,BMC Genomics 18(2017)578.
[44]P.Chen,X.Jing,B.Liao,Y.Zhu,J.Xu,R.Liu,Y.Zhao,X.Li,BioNano genome map resource for Oryza sativa ssp.japonica and indica and its application in rice genome sequence correction and gap filling,Mol.Plant 10(2017)895–898.
[45]L.Coombe,R.L.Warren,S.D.Jackman,C.Yang,B.P.Vandervalk,R.A.Moore,S.Pleasance,R.J.Coope,J.Bohlmann,R.A.Holt,S.J.Jones,I.Birol,Assembly of the complete sitka spruce chloroplast genome using 10X Genomics'GemCode sequencing data,PLoS One 11(2016),e0163059..
[46]M.Eslami Rasekh,G.Chiatante,M.Miroballo,J.Tang,M.Ventura,C.T.Amemiya,E.E.Eichler,F.Antonacci,C.Alkan,Discovery of large genomic inversions using long range information,BMC Genomics 18(2017)65.
[47]G.Zhao,C.Zou,K.Li,K.Wang,T.Li,L.Gao,X.Zhang,Z.Yang,W.Jiang,L.Mao,X.Kong,Y.Jiao,J.Jia,Generation and analysis of the high quality genome sequence of goat grass(Aegilops tauschii),the donor of wheat D genome,Proceedings of the 8th Wheat Genomics and Molecular Breeding Conference of China,August 7–9,2017,2017(Shijiazhuang,Hebei,China,in Chinese).