| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
Department of Pediatric Oncology and Hematology and Center of Molecular Medicine Cologne, University Children's Hospital, Cologne, Germany
Requests for reprints: Matthias Fischer, Department of Pediatric Oncology and Hematology, University Children's Hospital of Cologne, Kerpener Strasse 62, 50924 Cologne, Germany. Phone: 49-221-478-6816; Fax: 49-221-478-4689. E-mail: matthias.fischer{at}uk-koeln.de
| Abstract |
|---|
|
|
|---|
| Introduction |
|---|
|
|
|---|
8% of all childhood cancers (1). Remarkably, the tumor may follow contrasting patterns of biological behavior, which are most strikingly reflected by the clinically recognizable syndromes defined as stage IVS and stage IV, which both present with disseminated disease although having completely different courses: whereas stage IVS tumors are characterized by spontaneous regression of the primary tumor and metastases with little or no therapy, patients with stage IV disease present with rapidly progressing metastatic disease that is difficult to treat and has a poor prognosis. The molecular mechanisms leading to the diverse clinical courses of disseminated neuroblastoma are largely unknown and remain one of the most intriguing questions in neuroblastoma research. Whereas delayed onset of programmed cell death of neuroblasts has been proposed to account for the phenomenon of spontaneous regression of stage IVS disease and tumors in children diagnosed before the age of 1 year (2), the reasons for dissemination of disease before regression in stage IVS patients remain elusive. In recent years, gene expression profiling using microarrays has been used to identify the molecular characteristics of neuroblastoma (3-8); however, whereas several gene expression patterns associated with favorable patients' outcome were described, none of these did contribute to the elucidation of the peculiar biology of stage IVS neuroblastoma or reported on a specific molecular marker defining this subtype of the disease. Serial Analysis of Gene Expression (SAGE) has been shown to be a powerful tool to identify differentially expressed genes in cancer and to effectively discover novel cancer targets (9, 10). Using this technique, we have generated gene expression profiles of five favorable stage IVS and three unfavorable stage IV neuroblastomas to identify transcripts that represent specific markers of these two subtypes and that may account for the characteristic biology of either phenotype (11). In this study, we describe a novel gene designated as DEIN, the expression of which is strongly associated with stage IVS disease and which may contribute to the molecular definition of this particular subtype of neuroblastoma.
| Results |
|---|
|
|
|---|
To convert the tag GCAACTTAAC back into its corresponding full-length transcript, we carried out Generation of Longer cDNA fragments from SAGE tags for gene identification (GLGI; ref. 12), 5'-rapid amplification of cDNA ends (RACE)-PCR, and reverse transcription-PCR (RT-PCR). Using GLGI, two 3'-cDNA fragments measuring 57 and 158 bp were detected, the latter of which comprised the 57-bp fragment and an additional 101 bp at the 3' end. Both fragments contain a polyadenylation signal (AATAAA) within the last 50 bp 5' of the polyadenylate tail. Considering this additional sequence information, the cDNA was mapped to chromosome 4q33-34. Consistent with our results, several 3' reads of transcribed sequences assigned to cluster Hs.61435 match the experimentally generated 3' ends (e.g., CR623023; Fig. 1A ).
|
To determine whether the variants identified by RACE and RT-PCR represent full-length transcripts, Northern blot analysis of primary neuroblastoma samples was done with five specific probes (Fig. 1B). In stage IVS tumors, two intense signals of about 4 and 5 kb were detected with probes I to IV, whereas probe V only revealed a 5-kb transcript (Fig. 1A and B, results with probes I, III, and V are shown). The two signals at 4 and 5 kb correspond well to variants A and B, respectively. Because no additional signals were observed in the Northern blot analysis with probe III, which was designed to detect the four variants A, B, D, and E, one can suggest that the latter two variants represent minor isoforms of this gene. The finding of a single signal at 5 kb using probe V is in line with the proposed structure of variant B (Fig. 1A) because this region is not covered by any other exon of this gene. Transcript variant C was not detected in an additional Northern blot analysis using a probe specific to exons 1, 2, and 4 (data not shown). Because of the strong and differential expression of both variants A and B in primary neuroblastoma, the gene was designated as DEIN (differentially expressed in neuroblastoma). A search in the human UCSC BLAT database4 disclosed numerous transcribed sequences with the splicing sites of variant A (e.g., BC101437 and BC101981) as well as sequences that match a fragment specific for variant B (BC009184, AU118035, and AL522836). In contrast, only one expressed sequence tag (EST) showed a splicing site identical to variant C (EST R24426) and no transcribed sequence with the splice sites of variant E or D was found. These findings support our experimental results suggesting that the isoforms A and B represent the major transcript variants of DEIN.
None of the cDNAs encoding the variants A to E of DEIN contain repetitive sequence elements. Transcript variant A harbors an open reading frame encoding a putative protein of 134 amino acids, which is identical to the putative protein XP_496724 (Fig. 1A). The transcript variants B to E comprise open reading frames ranging from 38 to 143 amino acids in length. Strong or adequate contexts of start codons according to Kozak's rule, together with conservation of start codons and the following amino acid sequences in other species, have been regarded as minimal prerequisites for attributing the status of a coding transcript to the sequence (13, 14). The only putative start codon of DEIN in a strong context is located in exon 4 and corresponds to the coding sequence of XM_496724. However, because this start codon is not conserved in other species, and the predicted proteins do not show significant homology to known proteins or conserved domains as determined by a protein-protein-BLAST-alignment,5 it remains to be determined whether these sequences represent coding transcripts or noncoding RNAs.
The nucleotide sequences of DEIN variants A and B are conserved among various mammals such as dog, mouse, and rat, with similarities ranging from 62% to 71% (Fig. 2 ). A 250-bp region located between two CpG islands in the last exon of these variants shows the highest degree of conservation, particularly in a 43-bp segment exhibiting 88.4% identity between human and zebrafish (Figs. 1A and 2). Various conserved transcription factor binding sites were detected in this sequence by in silico prediction. The highest homology was found for transcription factor binding sites of Nkx-2.5, Pbx-1, GATA-1, and CCAAT/enhancer binding protein-β, which were conserved in all species with only 1-bp variation in zebrafish and/or chicken (data not shown). In other species, numerous transcribed sequences that match the genomic region of DEIN were found in a BLAT search. In mouse, most of these ESTs correspond to exons 2 and 4 and originate almost exclusively from heart (numerous ESTs) as well as fetal and embryonic tissues, including trophoblast (BQ032256 and BU755815), hematopoietic stem cells (DT915657), and a sympathetic ganglion of a 2-day neonate (CA460908). Only one EST (BB692529) that was also derived from the latter tissue matches the genomic region of exon 5. In rat, dog, and chicken, a limited number of ESTs corresponding to DEIN were found, most of which originated from cDNA libraries of heart, brain, or mixed embryonic and adult tissues.
|
|
|
|
In contrast to these findings, examination of variant C transcript levels by quantitative real-time RT-PCR in a cohort of 76 primary neuroblastoma specimens (stage IV, n = 49; stage IVS, n = 27) with primers specific for this isoform did not reveal a significant association of this splice variant with stage or age, whereas an inverse correlation with MYCN amplification was observed (P = 0.001). In addition, relative expression levels of the major transcript variants and variant C were not correlated (Pearson's correlation coefficient, 0.18).
| Discussion |
|---|
|
|
|---|
The genomic sequence of DEIN was found to be conserved among various species with the highest degree of conservation in a 250-bp region in the last exon of variants A and B. This region may represent a cis-regulatory element for DEIN transcription (Fig. 2), which is supported by the finding of several conserved putative transcription factor binding sites within this sequence. Interestingly, the transcription factor binding site CCAAT/enhancer binding protein-β has been shown to have a regulatory function in differentiation and apoptosis of neuroblastoma cells (16), the latter of which is suggested to account for spontaneous regression in stage IVS neuroblastoma. Alternatively, this DNA segment could also represent a currently unknown cis-regulatory element for the control of expression of HAND2, a gene that is located closely upstream and in antisense orientation to DEIN and functions as a key regulator of neurogenesis in the developing sympathetic nervous system. This suggestion is supported by the high degree of homology of HAND2, its enhancer element for expression in branchial arches (located between exon 3 and 4 of DEIN; ref. 17), and the present putative regulatory region between human and zebrafish (Fig. 2), whereas the remaining sequences of DEIN variants A and B are not conserved in zebrafish.
Only one of the putative open reading frames of the DEIN transcripts has a start codon in a strong context according to Kozak's rule, but this codon is not conserved in other species and the resulting proteins do not have known functional domains or match known proteins. Because comparative genome analysis (i.e., detection of significant conservation of open reading frames in related species) is suggested to be a reliable method to determine whether a transcript is protein coding or not (18), one may suggest that DEIN comprises noncoding RNAs. In line with this, SAGE tags not matching a known gene have been reported to frequently correspond to noncoding RNAs that are expressed in antisense orientation to known genes or from loci outside of currently annotated exons (i.e., intergenic and intronic regions; ref. 19). The high degree of conservation of certain nucleotide segments of DEIN in different species could be due to specific consensus secondary structures that many noncoding RNAs evolutionary conserve and that are related to their function (18).
RT-PCR and Northern blot analyses revealed strong expression of transcripts A and B in primary neuroblastoma and neuroblastoma cell lines, whereas lower expression levels were observed in heart, uterus, placenta, adrenal gland, cerebellum, testis, and fetal liver (Fig. 3). These findings are consistent with the rare occurrence of the corresponding SAGE tag in expression profiles of human tissues currently published in the SAGEmap database and with tissue sources of ESTs of UniGene cluster Hs.61435. Interestingly, the tissue expression pattern of DEIN strongly parallels that of HAND2. According to the UniGene database, cluster Hs.61435 shares common tissue sources for ESTs with HAND2, such as neuroblastoma, heart, uterus, placenta, and brain, and a substantial number of ESTs assigned to these clusters originate from embryonic tissues or those of children <3 years of age. Together, these findings might suggest that HAND2 and DEIN are regulated by common transcriptional mechanisms in a restricted range of tissues during early developmental stages. This hypothesis is further supported by the observation of high DEIN transcript abundances in neuroblastoma of patients <1 year old, whereas older patients showed lower levels of expression. In addition, the description of two ESTs in mouse with sequence homology to variant A that originated from a sympathetic ganglion of a 2-day mouse neonate, together with the observation of high DEIN expression in neuroblastoma of infants, might suggest that DEIN may particularly play a role in the development of the sympathetic nervous system of mammals.
Within primary neuroblastoma subtypes, highest expression levels of the main variants A and B of DEIN were found in stage IVS tumors. Whereas numerous genes such as TrkA (2), CD44 (20), ECEL1 (21), and ICE (22) have been shown to be strongly expressed in favorable neuroblastoma comprising both locoregional and stage IVS disease, this is the first report of a gene that exhibits high transcript abundances selectively in stage IVS neuroblastoma. One could therefore speculate that DEIN might be involved in biological processes related to the unique stage IVS phenotype, which is characterized by dissemination to specific tissues followed by subsequent spontaneous regression. In addition, whereas multivariate analyses suggested that the clinical value of DEIN expression levels for patients' outcome prediction seems to be limited as compared with established prognostic markers, the characteristic expression pattern could contribute to the definition of this stage as a separate biological subtype of neuroblastoma. Moreover, high expression levels of DEIN were found to be associated with age at diagnosis <1 year even after exclusion of stage IVS tumors. In light of the beneficial outcome of infants in comparison with patients >12 to 18 months (23, 24), expression of DEIN may represent a molecular correlate of the distinct biology of these tumors in comparison with neuroblastoma developing in older children. However, the biological role of DEIN in neuroblastoma and normal development has yet to be determined by functional analyses.
| Materials and Methods |
|---|
|
|
|---|
Synthesis of cDNA, Cloning, and Sequencing of PCR Products
Generation of cDNA used for GLGI, 5'-RACE-PCR, and RT-PCR was done with 0.4 to 2 µg of polyadenylate RNA of a primary stage IVS tumor that had a high count for tag GCAACTTAAC (GEO accession no. GSM112808). Synthesis of cDNA for semiquantitative RT-PCR and real-time RT-PCR was carried out using 2 µg of total RNA of normal human tissues (Clontech), neuroblastoma cell lines, and primary neuroblastoma samples. For RT-PCR, cDNA synthesis was done with the Qiagen LongRange 2Step RTPCR kit according to the manufacturer's protocol. For real-time RT-PCR, first-strand cDNA synthesis was done with oligo(dT) primers (Sigma-Aldrich) in a total volume of 20 µL according to the SUPERSCRIPT II protocol (Invitrogen). Cloning of PCR products was carried out using the TOPO TA-Vector (Invitrogen). Sequencing was done with the BigDye terminator sequencing kit, version 3.1 (Applied Biosystems).
Generation of Longer cDNA Fragments from SAGE Tags for Gene Identification (GLGI) 5'- and 3'-RACE-PCR
To identify tags that did not match a known gene, GLGI was done as described previously (12). GLGI is a modified 3'-RACE-PCR and converts SAGE tags back into their corresponding 3'-cDNA fragments. 5'- and 3'-RACE-PCR was done with the SMART RACE cDNA Amplification Kit (Clontech) according to the manufacturer's protocol. Primers used for amplification are available on request.
Northern Blot Analysis
Ten micrograms of total RNA from primary neuroblastoma samples were size fractionated on a 1% denaturating formaldehyde agarose gel and transferred onto a nylon membrane (Roche Diagnostics) using Northern Max One-Hour Transfer Buffer (Ambion). Expression in normal adult tissues was assessed using the FirstChoice Human Blot II (Ambion) containing 2 µg of polyadenylate RNA from 10 adult human tissues. Five cDNA probes for DEIN and one probe for β-actin were generated by RT-PCR. Primers used for amplification of probes I to V are shown in Table 1
(F1 + R1 for probe I, F2 + R4 for probe II, F3 + R5 for probe III, F5 + R6 for probe IV, and F6 + R7 for probe V). Blots were hybridized overnight at 42°C in UltraHyb Hybridization Buffer (Ambion) with high-sensitivity strippable DNA probes labeled with [
32P]dATP (Strip-EZ DNA kit, Ambion). After hybridization, membranes were washed, air-dried, and exposed to Kodak BioMax MR-1 films (Amersham).
RT-PCR
PCR was carried out in a total volume of 50 µL containing 1-µL first-strand cDNA, 2 units HiFi Platinum-Taq DNA polymerase (Invitrogen), 125 nmol/L sense and antisense primers each, 20 mmol/L Tris-HCl (pH 8.4), 50 mmol/L KCl, 200 nmol/L deoxynucleotide triphosphates each, and 1.5 mmol/L MgSO4 (Invitrogen). Cycling conditions consisted of a single denaturation step at 95°C for 3 min, followed by 35 cycles of 95°C for 1 min, 58°C for 1 min, 72°C for 2 to 11.5 min, and a final extension step at 72°C for 15 min. For amplification of overlapping subfragments of variants A and B, the primer combinations F7 + R6, F5 + R5, and F3 + R1 were used (Table 1). For amplification of variant E, the primers F7 to F9 together with R4 were used. PCR products were visualized on a 2% agarose gel. To exclude amplification of contaminating DNA, RT-PCR was done with negative controls without reverse transcriptase in first-strand cDNA reactions.
Semiquantitative RT-PCR
PCR was carried out in a total volume of 30 µL containing 1-µL first-strand cDNA, 1.2 units Platinum-Taq DNA polymerase (Invitrogen), 75 nmol/L sense and antisense primers (F1 + R3) each, 12 mmol/L Tris-HCl (pH 8.4), 30 mmol/L KCl, 120 nmol/L deoxynucleotide triphosphates each (Invitrogen), and 1 mmol/L MgCl2. Cycling conditions consisted of a single denaturation step at 95°C for 3 min, followed by various cycles of 95°C for 30 s, 60°C for 30 s, and 72°C for 30 s. To exclude amplification of contaminating DNA, RT-PCR was done with negative controls without reverse transcriptase in first-strand cDNA reactions.
Quantitative Real-time RT-PCR
Quantitative real-time RT-PCR was done using the SYBR Green I reagent on the ABI PRISM 7700 Sequence Detection System (Applied Biosystems). PCR reactions were carried out in a total volume of 30 µL containing 26.8 µL of SYBR Green I PCR master mix (Applied Biosystems), 0.4 µL of 1:10 diluted first-strand cDNA, and 1.4 µL of 2.5 µmol/L forward and reverse primers (F1 + R3 for variants A, B, and E, and F7 + R2 for variant C; see Table 1) each. For calculation of relative expression levels, serial dilutions of cDNA of the cell line IMR-32 were used to generate standard curves. All PCR reactions were done in 96-well optical reaction plates with optical caps (Applied Biosystems) as duplicates (triplicates for the determination of standard curves). Cycling conditions consisted of a single step at 50°C for 2 min, subsequent 10-min polymerase activation at 95°C, followed by 40 cycles of 95°C for 15 s and 60°C for 1 min. Data normalization of real-time RT-PCR experiments was done as previously described (25).
Sequence Analysis
The DEIN sequences were screened for potential protein products with the National Center for Biotechnology Information Open Reading Frame Finder8 and for repetitive elements with the program cross_match.9 Homology analysis was done by comparing sequences of different species with the DEIN reference sequence using the ALIGN webtool, which calculates a global alignment between two sequences.10 Potential transcription factor binding sites within the putative promotor region were detected using in silico prediction based on the TRANSFAC algorithm11 (26). CpG islands within the nucleotide sequence of DEIN were detected with CpGplot.12
Statistical Analysis
Variables of interest (stage, age at diagnosis, and cytogenetic aberrations) and relative expression levels of DEIN variants A/B/E or C were compared by
2 test, Kruskal-Wallis test, or Mann-Whitney U test where appropriate. Expression levels of variants A/B/E and C were correlated with Pearson's correlation coefficient. For DEIN expression, Kaplan-Meier estimates for 3-year event-free survival and overall survival were calculated and compared by log-rank test. Relapse, progression, and death were regarded as events. Death resulting from therapy complications was censored for event-free survival and overall survival analyses. Multivariate Cox regression was applied to analyze the prognostic value of DEIN expression in comparison with other markers for event-free survival. Models were built using a stepwise variable selection procedure recommended by Collett (15). In the first step, all variables were tested in a univariate Cox regression calculation one at a time. In the second step, all variables that seemed to be important in step 1 were fitted together in a multivariate Cox regression model. In the third step, all variables that were not important in step 1 were added to the variables that were important in step 2, one at a time. The likelihood-ratio test P value for inclusion was <0.05; for exclusion, the P value was >0.10.
| Acknowledgements |
|---|
|
|
|---|
| Notes |
|---|
|
|
|---|
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
1 http://www.ncbi.nlm.nih.gov/geo/ ![]()
2 http://www.ncbi.nlm.nih.gov/SAGE ![]()
3 http://cgap.ncbi.nih.govSAGE/AnatomicViewer ![]()
4 http://genome.ucsc.edu/cgi-bin/hgBlat ![]()
5 http://www.ncbi.nlm.nih.gov/_BLAST ![]()
6 GNF Expression Atlas 2 Data from U133A and GNF1H Chips: http://genome.ucsc.edu/cgi-bin/hgGene. ![]()
7 http://www.ncbi.nlm.nih.gov/SAGE ![]()
8 http://www.ncbi.nlm.nih.gov/projects/gorf/ ![]()
9 http://www.repeatmasker.org/cgi-bin/WEBRepeatMasker ![]()
10 http://www.ch.embnet.org/software/LALIGN_form.html ![]()
11 http://motif.genome.jp/motif-bin/Srch_Motif_Lib ![]()
12 http://www.ebi.ac.uk/emboss/cpgplot ![]()
Received 8/16/06; revised 8/ 6/07; accepted 8/23/07.
| References |
|---|
|
|
|---|
and β, is associated with favorable prognosis in human neuroblastomas. Int J Oncol 2003;22:815–22.[Medline]
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Cancer Research | Clinical Cancer Research |
| Cancer Epidemiology Biomarkers & Prevention | Molecular Cancer Therapeutics |
| Molecular Cancer Research | Cancer Prevention Research |
| Cancer Prevention Journals Portal | Cancer Reviews Online |
| Annual Meeting Education Book | Meeting Abstracts Online |