Progress 09/01/09 to 08/31/12
Outputs OUTPUTS: As proposed, we sequenced the two parental genomes, F153 (A. comosus, 2n=2x=50) and HANA64 (A. bracteatus, 2n=2x=50), to develop microsatellite markers and to obtain the genomic sequence for pineapple genome structure study and comparative genomic analysis. The sequence reads were assembled into contigs and the quality of genome assemblies were assessed for the completeness by estimating the coverage of ultra-conserved eukaryotic. From the assembled contigs, we designed 8,542 pairs of SSR primers. After excluding redundant primers and duplicated primers we designed from EST sequences, we obtained 7,967 pairs of unique primers, about 3,000 more primers than we proposed. Novel pineapple specific repeat were identified and a customized library of repeat elements compiled. Comprehensive repeat analysis identified 34.8% and 26.63% of assembled genome covered by repeats for F153 and HANA 64, respectively. The most abundant repeats in both genomes were unclassified indicating these novel repeats to be specific to the pineapple genome. Retrotransposon) elements were the major elements with LTR elements being the most abundant. Of these LTRs, gypsy type LTRs occupied 4.78% and 2.62% of the genomes and copia type LTRs occupied 2.32% and 1.69% of F153 and Hana64 respectively. We also identified tandem repeats in F153 and HANA 64 genomes, respectively We used the new technology, RAD-Seq, to construct a high-density genetic map of pineapple. Two genetic maps were constructed, one for each of the parental genomes. A linkage map of F153 was constructed using 973 RAD-Seq markers. This map consisted of 29 linkage groups and spanned a total length of approximately 1630 cM, with an average interval of 1.68 cM. Another linkage map, composed of 2048 RAD-Seq markers in 28 linkage groups and covered a total length of 1373.9 cM, was constructed for HANA 64 genome. Larger number of markers were detected in HANA 64 than F153, suggesting a higher heterozygosity in HANA 64 genome. To map the trait of leaf margin spine, we created a F2 mapping population with 492 individuals. The F2 mapping population was used to map the loci of major genes controlling the leaf margin spine. PARTICIPANTS: Not relevant to this project. TARGET AUDIENCES: Not relevant to this project. PROJECT MODIFICATIONS: Not relevant to this project.
Impacts Pineapple is the No. 1 fruit crop in Hawaii and the third most important commercial tropical fruit crop in world production after banana and citrus. Besides the commercial value of its fruits, its leaves are the most promising source for nano-cellulose materials that may be used as a plastic alternative in the future. As a crassulacean acid metabolism (CAM) plant species, pineapple is the best representative of this under-explored node of angiosperms. However, very little molecular genetics and genomics research has been carried out on this crop. The sequence data we have generated is the largest data set for pineapple. And the genetic maps we have constructed are the most saturated genetic maps of pineapple so far. The genome sequence generated by this proposed project will significantly advance our understanding of the genome organization of pineapple, will simplify the process of isolation of homologous genes of interest, and will develop a better understanding of plant evolution. The high density genetic map constructed by the proposed project will have profound impact on pineapple improvement through better understanding of relevant biology and direct application of genetic and genomic tools in breeding programs. Results of the proposed research will significantly advance the development of genomic tools and knowledge for pineapple improvement.
Publications
- No publications reported this period
|
Progress 09/01/10 to 08/31/11
Outputs OUTPUTS: As we proposed, we sequenced the two parent genomes, F153 (A. comosus, 2n=2x=50) and HANA64 (A. bracteatus, 2n=2x=50), using Roche 454 1 kb beta test kit to develop microsatellite markers and to obtain the genomic sequence for pineapple genome structure study and comparative genomic analysis. The average read length was 453 for F153 and 497 for HANA64 with 800.2Mbp and 721.7 Mbp read, respectively. From the assembled contigs, we designed 8,542 pairs of SSR primers. After excluding redundant primers and duplicated primers we designed from EST sequences, we obtained 7,967 pairs of unique primers, which are about 3,000 more primers than we proposed. Besides the genomic sequence, we also generated EST sequences for pineapple using Illumina RNA-seq. The average read length was 115 and we read 10.9 Mbp for F153 and 25Mbp for HANA 63. We are annotating the genomic sequences. Comparative genomic analysis and heterozygosity study of pineapple genome will be carried out soon. Using next generation DNA sequencing, a recent method called restriction site associated DNA sequencing (RAD-seq) allows the detection of thousands of sequence-based markers for a reference as well as a non-reference genome at reasonable costs. We are using the new technology (RAD-seq) to construct a high-density genetic map of pineapple with 1000-2000 sequence-based markers (SNPs, InDels, and SSRs). The RAD-seq libraries of two parents and 54 F1 individuals are currently under construction. To map the trait of leaf margin spine, we created a F2 mapping population with 492 individuals. The F2 mapping population will be genotyped using RAD-seq to map the loci of major genes controlling the leaf margin spine. PARTICIPANTS: Not relevant to this project. TARGET AUDIENCES: Not relevant to this project. PROJECT MODIFICATIONS: Not relevant to this project.
Impacts Pineapple is the No. 1 fruit crop in Hawaii and the third most important commercial tropical fruit crop in world production after banana and citrus. Besides the commercial value of its fruits, its leaves are the most promising source for nano-cellulose materials that may be used as a plastic alternative in the future. As a crassulacean acid metabolism (CAM) plant species, pineapple is the best representative of this under-explored node of angiosperms. However, very little molecular genetics and genomics research has been carried out on this crop. The sequence data we have generated is the largest data set for pineapple. The genome sequence generated by this proposed project will significantly advance our understanding of the genome organization of pineapple, will simplify the process of isolation of homologous genes of interest, and will develop a better understanding of plant evolution. The high density genetic map constructed by the proposed project will have profound impact on pineapple improvement through better understanding of relevant biology and direct application of genetic and genomic tools in breeding programs. Results of the proposed research will significantly advance the development of genomic tools and knowledge for pineapple improvement.
Publications
- No publications reported this period
|
Progress 09/01/09 to 08/31/10
Outputs OUTPUTS: We have proposed 3 objectives for this project and planned to complete objective 1 and part of objective 2 in Year 1. We have generated about 800 Mb sequence of A. comosus var. F153 genome using Roche 454 1 kb beta test kit. The new 454 1 kb beta test kit produce longer reads than the Titanium kit, but the cost is the same as the Titanium kit. The reads were assembled using CLC Genomics Workbench. The total length of assembled contigs is 147,685,460 bp and the average length of each contig is 860 bp. A total of 197 contigs is above 10 kb and the longest contig is 74 kb. Based on the genome sizes of A. comosus var. comosus and A. comosus var. bracteatus estimated at 526 and 444 Mbp, respectively by Arumuganathan and Earle (1991), the coverage of assembled contigs would be 28% ~ 33% of the pineapple genome. The total length of singletons is 93,398,241. Including singletons, the coverage would be 46% ~ 54% of the pineapple genome. From the assembled contigs, we designed 8,542 pairs of SSR primers. After excluding redundant primers and duplicated primers we designed from EST sequences, we obtained 7,967 pairs of unique primers, which is about 3,000 more primers than we proposed. Besides the genomic sequence, we generated EST sequences for pineapple using Illumina Solexa 115 bp paired ends. We have assembled the F153 ESTs using Velvet, Edena, and SOAPdenovo. A total of 3776 contigs were obtained with an average length of 883 bp and a total bases of 3,295,921 bp. The longest contig is 5,119 bp and smallest contig is 500 bp. From the assembled EST contigs, we designed 83 pairs of SSR primers. We have finished the genomic DNA isolation for 34 individuals of mapping population. Two parents and 6 F1 individuals were selected for SSR polymorphism survey. We have finished survey for 83 primer pairs. Out of the 83 primer pairs, 10 of them showed polymorphism. PARTICIPANTS: Not relevant to this project. TARGET AUDIENCES: Not relevant to this project. PROJECT MODIFICATIONS: Not relevant to this project.
Impacts Pineapple is one of the most important tropical fruit crops. Pineapple is the No. 1 fruit crop in Hawaii and the third most important commercial tropical fruit crop in world production after banana and citrus. However, very little molecular genetics and genomics research has been conducted on this crop. The sequence data we have generated is the largest data set for pineapple. The genome sequence generated by this proposed project will significantly advance our understanding of the genome organization of pineapple, will simplify the process of isolation of homologous genes of interest, and will develop a better understanding of plant evolution. The high density genetic map constructed by the proposed project will have profound impact on pineapple improvement through better understanding of relevant biology and direct application of genetic and genomic tools in breeding programs. Results of the proposed research will significantly advance the development of genomic tools and knowledge for pineapple improvement.
Publications
- No publications reported this period
|
|