- Data Note
- Open Access
A 44K microarray dataset of the changing transcriptome in developing Atlantic salmon (Salmo salar L.)
- Stuart G Jantzen†1,
- Dan S Sanderson†1,
- Kris R von Schalburg1,
- Motoshige Yasuike1,
- Francesco Marass1 and
- Ben F Koop1
© Koop et al; licensee BioMed Central Ltd. 2011
- Received: 15 December 2010
- Accepted: 29 March 2011
- Published: 29 March 2011
Atlantic salmon (Salmo salar L.) is an environmentally and economically important organism and its gene content is reasonably well characterized. From a transcriptional standpoint, it is important to characterize the changes in gene expression over the course of unperturbed early development, from fertilization through to the parr stage.
S. salar samples were taken at 17 time points from 2 to 89 days post fertilization. Total RNA was extracted and cRNA was synthesized and hybridized to a newly developed 44K oligo salmonid microarray platform. Quantified results were subjected to preliminary data analysis and submitted to NCBI's Gene Expression Omnibus (GEO). Data can be found under the GEO accession number GSE25938 (http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE25938).
Throughout the entire period of development, several thousand genes were found to be differentially regulated. This work represents the transcriptional characterization of a very large gene set that will be extremely valuable in further examination of the transcriptional changes in Atlantic salmon during the first few months of development. The expression profiles can help to annotate salmon genes, and can also serve as references against any number of experimental variables to which developing salmonids might be subjected.
- Atlantic Salmon
- Gene Expression Omnibus
- Salmo Salar
- Post Fertilization
- Reference Pool
Atlantic salmon (Salmo salar L.) is an environmentally and economically important organism. The genome has been well studied and is currently being fully sequenced [1–4]. In addition, a number of microarrays have been developed for transcription studies of S. salar [2, 5–7]. As a benefit of the extensive characterization of the transcriptome of S. salar, large-scale studies of gene expression changes can be undertaken using these microarray platforms.
This study is the first to utilize a newly developed 44K oligo salmonid microarray design, one of the first salmonid oligo microarrays. This array comprises approximately 22,000 60-mer oligos that were conserved (95% similar) between rainbow trout (Oncorhynchus mykiss) and Atlantic salmon, plus 14,866 additional Atlantic salmon and 5,661 additional rainbow trout contig sequences. The result is a microarray that has a large transcript representation with very low redundancy. The array is composed of oligos based on roughly 80% S. salar and 20% O. mykiss contigs. Of all features, 84% are well annotated with fairly stringent hits (e-value cutoff: 1e-10) to public databases (December 17, 2009). The annotation files may be found at the cGRASP microarray page. Efforts to annotate unknown contigs will continue.
Library construction, sequence analysis and contig assembly have been described previously. The 14,866 additional S. salar oligos were all derived from selected contigs compiled in the local database of the authors (August 11, 2009). These were chosen first from the approximately 10,000 full length cDNAs in the database. The remainder were selected from well annotated sequences and then from poorly annotated sequences with an open reading frame longer than 300 bp represented by two or more clones. Of the additional O. mykiss contig-derived oligos, 5,206 were selected from the set of well annotated sequences in the local database that did not have a clear homologous representative in S. salar. The remaining 455 were selected from annotated NCBI Nucleotide resources (July 21, 2009), with priority given to immune system related sequences. Representative oligos from sequences identified by Gene Ontology (GO) were included. After sequence selection, oligos were derived from the selected contigs by Agilent Technologies (Santa Clara, CA) and were 60 bp in length. The oligo selection process was biased in favor of 3' sequences. While the majority of oligos are unique to contigs (i.e. only one spot on the array can be mapped back to a given contig), approximately 27% of oligos, including the original 22,000, were derived from the same contig as at least one other oligo. Finally, in situ oligo synthesis and microarray manufacturing were performed by Agilent. Microarray slides are available through Agilent's eArray platform, with each slide containing four arrays.
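The redundancy figure quoted above (the share of oligos that come from the same contig as at least one other oligo) can be computed from an oligo-to-contig mapping. The following is a minimal sketch only: the function name, the mapping format, and the example identifiers are hypothetical and are not taken from the cGRASP annotation files.

```python
from collections import Counter

def shared_contig_fraction(oligo_to_contig):
    """Fraction of oligos whose source contig also yields at least one other oligo.

    oligo_to_contig: dict mapping an oligo identifier to its source contig identifier.
    """
    # Count how many oligos were derived from each contig.
    oligos_per_contig = Counter(oligo_to_contig.values())
    # An oligo is "shared" if its contig produced more than one oligo.
    shared = sum(1 for contig in oligo_to_contig.values()
                 if oligos_per_contig[contig] > 1)
    return shared / len(oligo_to_contig)

# Invented example: two of four oligos share contig_A.
example_mapping = {
    "oligo_1": "contig_A",
    "oligo_2": "contig_A",
    "oligo_3": "contig_B",
    "oligo_4": "contig_C",
}
fraction = shared_contig_fraction(example_mapping)
print(f"{fraction:.0%} of oligos share a contig with another oligo")
```

Applied to the real annotation table, the same calculation would be expected to give a value near the 27% reported, provided the mapping covers all features on the array.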
Due to the very large number of unique features on this platform, a genome-wide exploration of expression levels in salmonids is expected to produce significant and detailed information on many molecular systems. For example, the genetic factors involved in the very early developmental stages of Atlantic salmon are not completely understood. It is therefore of interest to do a thorough examination of S. salar developmental stages from fertilization through to the parr stage, using a transcriptomics approach. Recently, another group has used a microarray platform to profile the changes in gene expression during smoltification in Atlantic salmon, when freshwater parr make the transition to saltwater smolt. The dataset presented here complements this earlier study.
The objective was to comprehensively monitor the salmonid transcriptome during controlled and unperturbed development. This work complements and facilitates recent efforts to sequence and annotate the Atlantic salmon genome. It further provides a resource that identifies expression levels of tens of thousands of genes during the course of development. These baseline expression patterns can be used as references in future experiments to examine physiological, reproductive, mutational, and environmental variables.
Animals and sampling
Treatment of the fish used in this study was in compliance with the regulations of the University of Victoria Animal Care Committee. Eggs from Atlantic salmon (McConnell (Mowi)) were obtained in November, 2009 from Marine Harvest United Hatchery (Fanny Bay, BC, Canada). The eggs were fertilized by gently mixing the eggs and milt by hand and then washed with partial exchanges of water. Approximately 2,000 fertilized eggs were then transferred and placed in Heath trays (Marisource, Fife, WA) at the University of Victoria. The embryos and larvae were raised in fresh water at a temperature of 12°C and a flow rate of 200 liters/h.
Total RNAs were extracted in TRIzol reagent (Invitrogen, Carlsbad, CA) by mixer-mill homogenization (Retsch, Newtown, PA) and spin-column purified using RNeasy Mini kits (Qiagen, Hilden, Germany). The RNA was extracted from three whole individual embryos, larvae, or alevins at 2 dpf and then every fifth day (5 to 35 dpf) or sixth day (35 to 89 dpf). Quantity and quality of RNA samples were then measured using UV absorbance (NanoDrop Technologies, Wilmington, DE) and quality was also checked by agarose gel electrophoresis.
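The sampling schedule can be reproduced with a short sketch. It assumes the intervals are exactly as stated (2 dpf, then every fifth day from 5 to 35 dpf, then every sixth day up to 89 dpf) and that degree days are the product of the constant 12°C rearing temperature and dpf. The function and variable names are illustrative only.

```python
REARING_TEMP_C = 12  # constant rearing temperature reported in the text

def sampling_days():
    """Return the sampling days in dpf: 2, then 5-35 in steps of 5, then 41-89 in steps of 6."""
    first = [2]
    five_day = list(range(5, 36, 5))   # 5, 10, ..., 35
    six_day = list(range(41, 90, 6))   # 41, 47, ..., 89
    return first + five_day + six_day

def degree_days(dpf, temp_c=REARING_TEMP_C):
    """Accumulated thermal units, assuming a constant water temperature."""
    return temp_c * dpf

days = sampling_days()
schedule = [(dpf, degree_days(dpf)) for dpf in days]

print(f"{len(days)} time points")
for dpf, dd in schedule:
    print(f"{dpf:>3} dpf = {dd:>5} degree days")
```

Under these assumptions the schedule yields 17 time points, which agrees with the number reported for the dataset.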
cRNA synthesis, labeling, and purification
Cy5 labeled experimental cRNA samples were generated using an Agilent Low Input Quick Amp (LIQA) kit, following the manufacturer's instructions. For each time point, 40 ng of total RNA from each of three individuals was used to generate first-strand cDNA. Agilent Spike-In B control RNA was included in each reaction. After the denaturation step (10 min at 65°C) and cDNA synthesis step (2 hr at 40°C), the reactions were incubated at 70°C for 15 minutes to inactivate the AffinityScript enzyme and subsequently stored at -80°C until further use. For the labeling reactions, thawed cDNA samples were each mixed with 16 μL of Transcription Master Mix cocktail containing Cy5 dye, and incubated at 40°C for two hours. Purification was performed using Qiagen RNeasy mini spin columns, eluting in 30 μL of RNase-free water. For the generation of the reference pool, equimolar amounts from the three individuals in each time point were pooled to give 120 ng of total RNA used in each first-strand reaction. Spike-In A control RNA was included in each reaction. After labeling with Cy3 and column purification as above, a common reference pool was created by including 2.8 μg of cRNA from each time point, except for 2 dpf, for which only 1.3 μg of material was ultimately available. Due to the small size and early developmental stage of samples from 2, 5, and 10 dpf, limited RNA quantities necessitated additional extractions and subsequent synthesis and labeling reactions; however, the repeated procedures produced Cy5 labeled cRNA of the required quantity and quality.
Microarray hybridization and scanning
Experimental samples of Cy5 labeled cRNA were quantified on a Nanodrop ND-1000. All samples were found to be of sufficient specific activity with a mean (± SD) of 18.22 ± 2.03 pmol Cy5/μg cRNA as per manufacturer's recommendation (Agilent) and an appropriate RNA absorbance ratio with a mean of 2.29 ± 0.06. Next, cRNA fragmentation mixtures were created following the LIQA kit instructions, using 825 ng of experimental sample and 825 ng of reference pool. These mixtures were incubated at 60°C for 30 minutes. After cooling on ice for one minute, hybridization mixtures were prepared by adding 2x GEx Hybridization Buffer HI-RPM and mixing well by pipetting. These reactions were loaded in random arrangements with respect to time point onto 44K oligo salmonid microarrays (Agilent-025055) using Agilent SureHyb Hybridization Chambers. Each of the 4 × 44K arrays on the microarray slides had 100 μL of hybridization reaction added. The hybridization reactions were allowed to occur for 17 hours at 65°C. Slide washes were performed as per the manufacturer's instructions, including an ozone-protection step using the Agilent Stabilization and Drying Solution. Slides were scanned as soon as possible on a ScanArray Express (PerkinElmer, Waltham, MA) scanner at 5 μm resolution using a PMT setting of 80 in both channels, a black threshold of 1800, and a full color threshold of 26.8. Slides were stored in a low ozone chamber (typically < 5 ppb) until scanned.
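As a rough consistency check on the quantities given in these methods, the size of the Cy3 reference pool can be compared with the amount consumed across all hybridizations. This sketch assumes one array per individual per time point (17 × 3 = 51 arrays), which is inferred from the description of the dataset rather than stated as an array count. Masses are kept in nanograms to avoid floating-point rounding.

```python
import math

TIME_POINTS = 17
INDIVIDUALS_PER_TIME_POINT = 3      # assumption: one array per individual
ARRAYS_PER_SLIDE = 4                # 4 x 44K slide format
REFERENCE_PER_ARRAY_NG = 825        # reference pool cRNA per hybridization
POOL_PER_TIME_POINT_NG = 2800       # 2.8 ug contributed by most time points
POOL_FROM_2DPF_NG = 1300            # 1.3 ug available from 2 dpf

# Total Cy3 reference pool: 16 time points at 2.8 ug plus 2 dpf at 1.3 ug.
pool_total_ng = (TIME_POINTS - 1) * POOL_PER_TIME_POINT_NG + POOL_FROM_2DPF_NG

# Reference cRNA needed if every individual is hybridized on its own array.
arrays = TIME_POINTS * INDIVIDUALS_PER_TIME_POINT
required_ng = arrays * REFERENCE_PER_ARRAY_NG

# Number of four-array slides needed to hold all arrays.
slides = math.ceil(arrays / ARRAYS_PER_SLIDE)

print(f"reference pool: {pool_total_ng / 1000:.1f} ug")
print(f"required for {arrays} arrays: {required_ng / 1000:.3f} ug")
print(f"surplus: {(pool_total_ng - required_ng) / 1000:.3f} ug on {slides} slides")
```

Under these assumptions the pool exceeds the amount required, so the reduced contribution from 2 dpf would not by itself have limited the number of hybridizations.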
Table: S. salar developmental stages sampled. Columns: Days post fertilization (dpf); Degree days (12°C * dpf); Relative age in τs. Recoverable row labels: Vascularization of yolksac; Formation of caudal rays; Alevins have left gravel/Beginning of parr markings; Fry/Appearance of caudal parr marks; Fry/Yolk-sac completely absorbed.
Table: Numbers of differentially regulated entities across timeline. Column: Number of differentially regulated entities. Comparisons (dpf): 2 vs. 5; 5 vs. 10; 10 vs. 15; 15 vs. 20; 20 vs. 25; 25 vs. 30; 30 vs. 35; 35 vs. 41; 41 vs. 47; 47 vs. 53; 53 vs. 59; 59 vs. 65; 65 vs. 71; 71 vs. 77; 77 vs. 83; 83 vs. 89.
Variation among replicate samples across timeline
In terms of ontogenetically relevant probes on the 44K microarray, over 900 entities are currently annotated with the GO term "development". More specifically, approximately 620 and 180 entities are annotated with the terms "system development" and "embryo development", respectively. In this experiment, the majority of entities in each of these categories was expressed above our threshold of 500 in at least one condition. Some of the GO terms that are significantly enriched in the various comparisons include "blastocyst development" between 2 and 5 dpf, "brain development" between 5 and 10 dpf, "organ development" and "induction of apoptosis" between 10 and 15 dpf, and "erythrocyte development" between 15 and 20 dpf, to name just a few. Other researchers may perform fuller and more detailed analyses in accordance with their own questions and hypotheses.
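The expression filter mentioned above (signal above the threshold of 500 in at least one condition) could be applied along the following lines. This is a minimal sketch: the entity identifiers and intensity values are invented for illustration, and the text does not say here whether the threshold was applied to raw or normalized signal, or to individual arrays or per-condition summaries.

```python
EXPRESSION_THRESHOLD = 500  # threshold quoted in the text

def expressed_entities(intensities, threshold=EXPRESSION_THRESHOLD):
    """Return entities whose signal exceeds the threshold in at least one condition.

    intensities: dict mapping entity id -> dict mapping condition -> signal.
    """
    return [
        entity
        for entity, by_condition in intensities.items()
        if any(signal > threshold for signal in by_condition.values())
    ]

# Invented example values for three entities at two time points.
example = {
    "entity_A": {"2 dpf": 120.0, "5 dpf": 830.5},
    "entity_B": {"2 dpf": 95.0, "5 dpf": 410.2},
    "entity_C": {"2 dpf": 1500.0, "5 dpf": 60.0},
}
kept = expressed_entities(example)
print(kept)
```

Filtering on "at least one condition" rather than "all conditions" keeps genes that are switched on or off during development, which are presumably the ones of most interest in a time course.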
Beyond this preliminary analysis, there is a wealth of information to be gained from these data and we have submitted all normalized and raw data to NCBI's Gene Expression Omnibus (GEO) for others to examine. The data are accessible through GEO Series accession number GSE25938 (http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE25938). This dataset encompasses variation among three individuals per condition and differences across 17 timepoints. It is evident that this microarray provides the ability to determine a detailed transcriptional basis of ontogeny and this experiment in particular contains a great deal of developmental information. In addition, these data could be used as a reference for perturbed or abnormal development in other studies, or for researchers to refer to when transcriptional patterns of specific genes are discovered in other young salmonids.
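For readers who want to retrieve the series programmatically, the accession can be turned into the relevant GEO addresses. The sketch below only builds URL strings and performs no download. The directory layout (series grouped by replacing the last three digits of the accession with "nnn") is assumed from GEO's usual convention and should be verified against the live site. The function name is illustrative.

```python
def geo_series_urls(accession):
    """Build the GEO record URL and the assumed series directory URL for a GSE accession."""
    digits = accession[3:]
    if not accession.startswith("GSE") or not digits.isdigit():
        raise ValueError(f"not a GEO Series accession: {accession!r}")
    # Assumed convention: series are grouped into directories named by
    # replacing the last three digits of the accession number with "nnn".
    stub = "GSE" + digits[:-3] + "nnn"
    return {
        "record": f"https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc={accession}",
        "series_dir": f"https://ftp.ncbi.nlm.nih.gov/geo/series/{stub}/{accession}/",
    }

urls = geo_series_urls("GSE25938")
for name, url in urls.items():
    print(f"{name}: {url}")
```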
Here we present a large and novel dataset that represents an invaluable source of information on the transcriptional changes present in developing salmon. We believe these data will be of interest to many researchers in several fields, including aquaculture, genomics, and developmental and evolutionary biology. Both as an examination of healthy development on its own and as a reference for future studies, this set of expression profiles will prove to be valuable to the scientific community.
This project was funded by the Natural Sciences and Engineering Research Council of Canada. Thanks to Marine Harvest for the generous donation of the Atlantic salmon eggs for this study. Thanks also to Brendon Campbell, Amy Hoare, and Brian Ringwood for egg transport and care of developing fish (Animal Care Services, University of Victoria).
- Ng S, Artieri C, Bosdet I, Chiu R, Danzmann R, Davidson W, Ferguson M, Fjell C, Hoyheim B, Jones S, de Jong P, Koop B, Krzywinski M, Lubieniecki K, Marra M, Mitchell L, Mathewson C, Osoegawa K, Parisotto S, Phillips R, Rise M, von Schalburg K, Schein J, Shin H, Siddiqui A, Thorsen J, Wye N, Yang G, Zhu B: A physical map of the genome of Atlantic salmon, Salmo salar. Genomics. 2005, 86 (4): 396-404. 10.1016/j.ygeno.2005.06.001.
- Koop BF, von Schalburg KR, Leong J, Walker N, Lieph R, Cooper GA, Robb A, Beetz-Sargent M, Holt RA, Moore R, Brahmbhatt S, Rosner J, Rexroad CE, McGowan CR, Davidson WS: A salmonid EST genomic study: genes, duplications, phylogeny and microarrays. BMC Genomics. 2008, 9: 545. 10.1186/1471-2164-9-545.
- Phillips RB, Keatley KA, Morasch MR, Ventura AB, Lubieniecki KP, Koop BF, Danzmann RG, Davidson WS: Assignment of Atlantic salmon (Salmo salar) linkage groups to specific chromosomes: Conservation of large syntenic blocks corresponding to whole chromosome arms in rainbow trout (Oncorhynchus mykiss). BMC Genet. 2009, 10: 46. 10.1186/1471-2156-10-46.
- Davidson WS, Koop BF, Jones SJ, Iturra P, Vidal R, Maass A, Jonassen I, Lien S, Omholt SW: Sequencing the genome of the Atlantic salmon (Salmo salar). Genome Biol. 2010, 11: 403.
- von Schalburg K, Rise M, Cooper G, Brown G, Gibbs A, Nelson C, Davidson W, Koop B: Fish and chips: Various methodologies demonstrate utility of a 16,006-gene salmonid microarray. BMC Genomics. 2005, 6: 126. 10.1186/1471-2164-6-126.
- von Schalburg KR, Cooper GA, Leong J, Robb A, Lieph R, Rise ML, Davidson WS, Koop BF: Expansion of the genomics research on Atlantic salmon Salmo salar L. project (GRASP) microarray tools. J Fish Biol. 2008, 72: 2051-2070. 10.1111/j.1095-8649.2008.01910.x.
- Taggart JB, Bron JE, Martin SAM, Seear PJ, Hoyheim B, Talbot R, Carmichael SN, Villeneuve LAN, Sweeney GE, Houlihan DF, Secombes CJ, Tocher DR, Teale AJ: A description of the origins, design and performance of the TRAITS-SGP Atlantic salmon Salmo salar L. cDNA microarray. J Fish Biol. 2008, 72 (9): 2071-2094. 10.1111/j.1095-8649.2008.01876.x.
- Miller KM, Maclean N: Teleost microarrays: development in a broad phylogenetic range reflecting diverse applications. J Fish Biol. 2008, 72 (9): 2039-2050. 10.1111/j.1095-8649.2008.01913.x.
- cGRASP Microarray Page. [http://web.uvic.ca/grasp/microarray/]
- cGRASP. [http://web.uvic.ca/grasp]
- Leong J, Jantzen S, von Schalburg K, Cooper G, Messmer A, Liao N, Munro S, Moore R, Holt R, Jones S, Davidson W, Koop B: Salmo salar and Esox lucius full-length cDNA sequences reveal changes in evolutionary pressures on a post-tetraploidization genome. BMC Genomics. 2010, 11 (1): 279. 10.1186/1471-2164-11-279.
- NCBI. [http://www.ncbi.nlm.nih.gov]
- Ashburner M, Ball C, Blake J, Botstein D, Butler H, Cherry J, Davis A, Dolinski K, Dwight S, Eppig J, Harris M, Hill D, Issel-Tarver L, Kasarskis A, Lewis S, Matese J, Richardson J, Ringwald M, Rubin G, Sherlock G, GO Consortium: Gene Ontology: tool for the unification of biology. Nat Genet. 2000, 25 (1): 25-29. 10.1038/75556.
- Agilent Technologies eArray. [https://earray.chem.agilent.com/earray]
- Seear PJ, Carmichael SN, Talbot R, Taggart JB, Bron JE, Sweeney GE: Differential Gene Expression During Smoltification of Atlantic Salmon (Salmo salar L.): a First Large-Scale Microarray Study. Mar Biotechnol. 2010, 12 (2): 126-140. 10.1007/s10126-009-9218-x.
- Hamor T, Garside E: Developmental rates of embryos of Atlantic salmon, Salmo salar L., in response to various levels of temperature, dissolved oxygen, and water exchange. Can J Zool. 1976, 54 (11): 1912-1917. 10.1139/z76-221.
- Gorodilov Y: Description of the early ontogeny of the Atlantic salmon, Salmo salar, with a novel system of interval (state) identification. Environ Biol Fish. 1996, 47 (2): 109-127. 10.1007/BF00005034.
- Brazma A, Hingamp P, Quackenbush J, Sherlock G, Spellman P, Stoeckert C, Aach J, Ansorge W, Ball C, Causton H, Gaasterland T, Glenisson P, Holstege F, Kim I, Markowitz V, Matese J, Parkinson H, Robinson A, Sarkans U, Schulze-Kremer S, Stewart J, Taylor R, Vilo J, Vingron M: Minimum information about a microarray experiment (MIAME) - toward standards for microarray data. Nat Genet. 2001, 29 (4): 365-371. 10.1038/ng1201-365.
- Edgar R, Domrachev M, Lash A: Gene Expression Omnibus: NCBI gene expression and hybridization array data repository. Nucleic Acids Res. 2002, 30 (1): 207-210. 10.1093/nar/30.1.207.
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.