Assessment of genome integrity with array CGH in cattle transgenic cell lines produced by homologous recombination and somatic cell cloning

Background Transgenic cattle carrying multiple genomic modifications have been produced by serial rounds of somatic cell chromatin transfer (cloning) of sequentially genetically targeted somatic cells. However, cloning efficiency tends to decline with the increase of rounds of cloning. It is possible that multiple rounds of cloning compromise the genome integrity or/and introduce epigenetic errors in the resulting cell lines, rendering a decline in cloning. To test these possibilities, we performed 9 high density array Comparative Genomic Hybridization (CGH) experiments to test the genome integrity in 3 independent bovine transgenic cell lineages generated from genetic modification and cloning. Our plan included the control hybridizations (self to self) of the 3 founder cell lines and 6 comparative hybridizations between these founders and their derived cell lines with either high or low cloning efficiencies. Results We detected similar amounts of differences between the control hybridizations (8, 13 and 39 differences) and the comparative analyses of both "high" and "low" cell lines (ranging from 7 to 57 with a mean of ~20). Almost 75% of the large differences (>10 kb) and about 45% of all differences shared the same type (loss or gain) and were located in nearby genomic regions across hybridizations. Therefore, it is likely that they were not true differences but caused by systematic factors associated with local genomic features (e.g. GC contents). Conclusions Our findings reveal that large copy number variations are less likely to arise during genetic targeting and serial rounds of cloning, fortifying the notion that epigenetic errors introduced from serial cloning may be responsible for the cloning efficiency decline.


Findings
As embryonic stem cells are not available in the bovine species, somatic cells have been used for genetic modifications, and transgenic cattle have been produced from such genetically modified somatic cells by animal cloning. However, because primary somatic cells have limited life span and inevitably become senescent following DNA transfection and selection in cell culture, it is impossible to perform any further genetic modifications in these cells. Because of such, transgenic cattle with a desired genotype that requires more than one genetic targeting event, such as homozygous deletion of the two alleles of a gene, cannot be produced. To overcome such limitations, a novel sequential genetic modification strategy in bovine somatic cells, for producing extensively genetically modified cattle, has been developed [1]. This process involves a serial round of genetic targeting events, each followed by cloning to rejuvenate the genetically modified somatic cells (to rescue them from senescence) for the next round of genetic targeting. Such genetically modified somatic cells are then subjected to a final round of cloning for producing transgenic animals with the desired genotypes. While multiple genomic loci have been modified by this strategy, cloning efficiency tends to decline with the increased rounds of cloning, and in some severe cases, such manipulated cells can become unclonable (no live calf can be cloned from them) [2]. It is yet unknown whether the cloning efficiency declines in such derived cells are due to genetic abnormalities caused by the multiple genetic targeting or/and serial cloning process or due to the accumulation of epigenetic errors introduced during these processes. Such questions are fundamental in farm animal transgenesis, as somatic cells and cloning are currently the only choices for genetic modifications and for transgenic animal production in the domestic animal species.
To investigate whether the declines of cloning efficiency in the cloned bovine transgenic cell lines are due to large genomic deletions or insertions, 9 high density array Comparative Genomic Hybridization (CGH) experiments were performed to test the genome integrity in 3 independent bovine transgenic cell lineages. Array CGH allows the entire genome to be assayed for the gain or loss of material in a single experiment by measuring the relative hybridization intensity between fluorescently labeled test and reference DNA samples. It has been widely used in the detection of copy number variations (CNVs). One objective of this study is to develop array CGH into a systematic test for the genomic integrity of donor cells after each round of genetic modification before they are used as donors for producing transgenic animals.
We selected 3 independent cell lineages from our transgenic bovine cell line collection. Each lineage includes the founder and two derived cell lines, which demonstrated dramatic differences in cloning efficiency ( Figure 1). The cloning efficiencies are represented by the live calf counts at birth divided by recipient numbers used for embryo transfer as shown in parentheses. Test lines were classified into "high (H)" and "low (L)" based on their cloning efficiencies, with 7%-42% live calving rates designated as high and 0% as low. The procedures for genetic modifications, animal cloning and transgenic cell line establishment were described previously [1]. Genomic DNA samples were purified from the cell lines using Qiagen Miniprep Kit as recommended by the manufacturer. All DNA samples were analyzed by Nanodrop spectrophotometer and agarose gel electrophoresis. Nine array CGH experiments were carried out using each cell line as the test sample and the corresponding founder line as the reference sample (Table 1). Therefore, our plan included the control hybridizations of the 3 founder cell lines (self to self) and 6 comparative hybridizations between these founders and their  Three cell lineages (founders and test cell lines) and their success rates for animal cloning. Live calving rates for the cell lines were calculated by the live calf counts at birth divided by recipient numbers used for embryo transfer as shown in parentheses. Cell lines with 7% or more living rates are indicated as High (H; high calving rate) and those with 0% live calving rate as Low (L; low calving rate). The 3 founder cell lines (F1, F2 and F3) were established from 3 different fetuses (day 40) respectively that were produced by artificial insemination. The 6 test cell lines, except for cell line L3, were derived from 2 rounds genetic modification and somatic cell cloning. L3 line was derived from 3 rounds of genetic modification and somatic cell cloning.
derived cell lines of extreme phenotypes ("high" versus "low" cloning efficiencies). Another self to self control hybridization was performed using the sequenced Hereford cow L1 Dominette 01449 (Dt, American Hereford Association registration number 42190680). Each CGH array contains~2.1 million oligonucleotide probes that provide a genome-wide coverage with an average interval of~1.2 kb (kilo basepairs) on the UMD3 genome assemblies [3]. DNA labeling, hybridizations, array scanning, data normalization, and segmentation were performed as described before [4,5]. The genomic variations were represented by gains and losses of normalized fluorescence intensities relative to the reference. The calls are filtered according to the similar criteria as described previously [6]. Briefly, we tested a series of log2 ratio shift and affected neighboring probe counts and their impact on the false discovery rate in the selfself control hybridizations. We then selected a calling criterion, requiring that alternations of 0.5 log2 ratios over 5 neighboring probes, under which minimal false positives were found for self-self control hybridizations. Thus, the arrays have a resolution of approximately 4.8 kb. Nine array CGH data have been submitted to the gene expression omnibus (http://www.ncbi.nlm.nih.gov/ geo/) under the accession number GSE26132.
We detected 8, 13 and 39 differences in 3 control hybridizations. Similar amounts of differences (ranging from 7 to 57 with a mean of~20) were detected in comparative analyses of both "high" and "low" derived cell lines (Table 1 and Table 2). We also made event calls on Btau_4.0 and obtained a comparable number of events (data not shown). Almost 75% of the large differences (>10 kb, 42/58 events in Table 2) and about 45% of all differences (82/186 events) shared the same type (loss or gain) and were located in nearby genomic regions across hybridizations. Therefore, it is likely that they were not true differences but instead caused by systematic factors like dye bias (Cy3 versus Cy5) or genomic waves associated with local genomic features, such as GC contents [7]. For example, a variable region of   Figure 2). Although each probe has a GC% range from 42-48%, the average GC% of this region (53.5%) is significantly higher than the cattle genome average of 41.7% and multiple GC% peaks exist in the close proximity of 3 out of the 6 probes. Out of 186 events, 129 events are unique after merging the overlapped events (data not shown). Out of these 129 unique events, 71 events can be successfully migrated from UMD3 to Btau_4.0 and all of them showed various degrees of higher GC contents as compared to the genome average. In this project, we employed array CGH to study genomic integrity in cattle transgenic cell lines. This highresolution genome-wide survey fills the knowledge gaps left out in the existing literature. Our results generate a valuable tool for genomic integrity evaluation and largely exclude the occurrences of large genomic structural variations (≥ 10 kb) during animal cloning, supporting our recent findings that epigenetic errors introduced by multiple rounds of cloning and/or genetic targeting are the possible underlying causes for the cloning efficiency decline [8,9]. However, this initial genomic integrity survey reported here is probably not complete as the CGH arrays were designed by using only one reference genome. As a result, sequences absent in Dominette and present in other animals cannot be ascertained. Also, array CGH cannot detect small event (<5 kb) and balanced events like inversions and translocations. Therefore, we cannot totally exclude the possibility that both genetic and epigenetic influences may be at work and genetic differences may have played a role in the low efficiencies. With the costs of genome sequencing dropping dramatically by using next-generation sequencing, emerging high-quality cattle genomic sequence will soon facilitate the application of the direct sequence comparison strategy. Furthermore, additional studies like epigenomics are warranted and may unravel the epigenetic basis for the successful and efficient animal cloning.