Assembled unigenes used by members of the Rieseberg and Knapp labs, in collaboration with other researchers at the Compositae Genome Project (http://compgenomics.ucdavis.edu/) and HeliaGene (http://www.heliagene.org/), to develop Helianthus and Helianthus annuus microarrays.
Assemblies were constructed from Helianthus EST sequences available in GenBank on 9 Sep 2007, plus additional ESTs from Steve Knapp's lab. The assembly pipeline started with Seqclean, which removed poor-quality sequences, clipped poor-quality ends, and trimmed vector sequence. Prior to contig assembly, RBR, a library-less repeat masker, was run with default settings to dynamically identify and mask repeats. Contigs were then constructed using TGICL with default settings, and the contigs and singletons were combined into a final file named *.unigenes.
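The last step, combining contigs and singletons into one unigene set, can be illustrated with a minimal Python sketch. This is not the script used for these assemblies. The function names (parse_fasta, build_unigenes, to_fasta) and the example sequence IDs are hypothetical, and the sketch assumes the contigs are available as FASTA text and the singletons as a list of EST IDs to pull from the cleaned EST FASTA.

```python
from typing import Iterable, List, Tuple

Record = Tuple[str, str]  # (header without ">", sequence)


def parse_fasta(text: str) -> List[Record]:
    """Parse FASTA-formatted text into (header, sequence) pairs."""
    records: List[Record] = []
    header = None
    chunks: List[str] = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                records.append((header, "".join(chunks)))
            header = line[1:]
            chunks = []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


def seq_id(header: str) -> str:
    """Return the first whitespace-delimited token of a FASTA header."""
    parts = header.split()
    return parts[0] if parts else ""


def build_unigenes(contigs_fasta: str, ests_fasta: str,
                   singleton_ids: Iterable[str]) -> List[Record]:
    """Return all contigs followed by the ESTs that were left as singletons."""
    wanted = set(singleton_ids)
    contigs = parse_fasta(contigs_fasta)
    singletons = [(h, s) for h, s in parse_fasta(ests_fasta)
                  if seq_id(h) in wanted]
    return contigs + singletons


def to_fasta(records: Iterable[Record], width: int = 60) -> str:
    """Serialize records as FASTA text, wrapping sequences at `width` bases."""
    lines: List[str] = []
    for header, seq in records:
        lines.append(">" + header)
        for i in range(0, len(seq), width):
            lines.append(seq[i:i + width])
    return "\n".join(lines) + "\n" if lines else ""


if __name__ == "__main__":
    # Toy input; the IDs below are made up for illustration.
    contigs = ">CL1Contig1\nACGT\nAC\n"
    ests = ">EST1 clone a\nAAAA\n>EST2\nCCCC\n>EST3\nGGGG\n"
    print(to_fasta(build_unigenes(contigs, ests, ["EST3"])), end="")
```

Writing the returned text to a file with a .unigenes extension would give a file with the same general layout as the one described above: contigs first, then the unassembled singleton ESTs.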
A total of 33,376 unigenes were assembled from a starting pool of 94,706 ESTs for Helianthus annuus.
A total of 87,237 unigenes were assembled from a starting pool of 284,340 ESTs for all species of Helianthus.
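As a rough way to compare the two assemblies, the counts above can be turned into a unigene-to-EST ratio and an average number of ESTs per unigene. The short Python sketch below does only that arithmetic. The counts are taken from the text, while the names ASSEMBLIES and unigene_summary are illustrative and not part of the assembly pipeline.

```python
# (starting ESTs, assembled unigenes), as reported for each assembly
ASSEMBLIES = {
    "Helianthus annuus": (94_706, 33_376),
    "all Helianthus species": (284_340, 87_237),
}


def unigene_summary(n_ests: int, n_unigenes: int) -> dict:
    """Return simple redundancy statistics for one assembly."""
    return {
        "unigenes_per_est": n_unigenes / n_ests,
        "ests_per_unigene": n_ests / n_unigenes,
    }


if __name__ == "__main__":
    for name, (n_ests, n_unigenes) in ASSEMBLIES.items():
        stats = unigene_summary(n_ests, n_unigenes)
        print(f"{name}: {stats['unigenes_per_est']:.1%} unigenes per EST, "
              f"{stats['ests_per_unigene']:.2f} ESTs per unigene")
```

The ESTs-per-unigene figure is only an average over the whole set: singletons contribute exactly one EST each, so contigs hold more ESTs on average than this number suggests.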
All files produced during each assembly are included in the corresponding assembly archive.
Manuscript forthcoming; contact Mike Barker (msbarker@biodiversity.ubc.ca) for more information.