LOCUS       ADRCG                   2418 bp    DNA     linear   VRL 14-MAR-1996
DEFINITION  Adenovirus type 2, complete genome.
ACCESSION   J01917 REGION: 24108..26525
VERSION     J01917.1  GI:209811
KEYWORDS    DNA polymerase; DNA-binding protein; RNA polymerase III;
            alternative splicing; coat protein; complete genome; genome-linked
            protein; glycoprotein; overlapping genes; polymerase; terminal
            repeat; unidentified reading frame; virus-associated RNA.
SOURCE      Human adenovirus 2
  ORGANISM  Human adenovirus 2
            Viruses; dsDNA viruses, no RNA stage; Adenoviridae; Mastadenovirus;
            Human adenovirus C.
REFERENCE   1  (sites)
  AUTHORS   Ohe,K. and Weissman,S.M.
  TITLE     The nucleotide sequence of a low molecular weight ribonucleic acid
            from cells infected with adenovirus 2
  JOURNAL   J. Biol. Chem. 246 (22), 6991-7009 (1971)
   PUBMED   4331202
REFERENCE   2  (sites)
  AUTHORS   Jornvall,H., Ohlsson,H. and Philipson,L.
  TITLE     An acetylated N-terminus of adenovirus type 2 hexon protein
  JOURNAL   Biochem. Biophys. Res. Commun. 56 (2), 304-310 (1974)
   PUBMED   4823869
REFERENCE   3  (sites)
  AUTHORS   Celma,M.L., Pan,J. and Weissman,S.M.
  TITLE     Studies of low molecular weight RNA from cells infected with
            adenovirus 2. I. The sequences at the 3' end of VA-RNA I
  JOURNAL   J. Biol. Chem. 252 (24), 9032-9042 (1977)
   PUBMED   318160
REFERENCE   4  (sites)
  AUTHORS   Celma,M.L., Pan,J. and Weissman,S.M.
  TITLE     Studies of low molecular weight RNA from cells infected with
            adenovirus 2. II. Heterogeneity at the 5' end of VA-RNA I
  JOURNAL   J. Biol. Chem. 252 (24), 9043-9046 (1977)
   PUBMED   925035
REFERENCE   5  (sites)
  AUTHORS   Akusjarvi,G. and Pettersson,U.
  TITLE     Nucleotide sequence at the junction between the coding region of
            the adenovirus 2 hexon messenger RNA and its leader sequence
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 75 (12), 5822-5826 (1978)
   PUBMED   83649
REFERENCE   6  (bases 1 to 2418)
  AUTHORS   Zain,S., Sambrook,J., Roberts,R.J., Keller,W., Fried,M. and
  TITLE     Nucleotide sequence analysis of the leader segments in a cloned
            copy of adenovirus 2 fiber mRNA
  JOURNAL   Cell 16 (4), 851-861 (1979)
   PUBMED   455453
REFERENCE   7  (sites)
  AUTHORS   Shinagawa,M. and Padmanabhan,R.
  TITLE     Nucleotide sequence at the inverted terminal repetition of
            adenovirus type 2 DNA
  JOURNAL   Biochem. Biophys. Res. Commun. 87 (3), 671-678 (1979)
   PUBMED   454416
REFERENCE   8  (bases 1527 to 2418)
  AUTHORS   Galibert,F., Herisse,J. and Courtois,G.
  TITLE     Nucleotide sequence of the EcoRI-F fragment of adenovirus 2 genome
  JOURNAL   Gene 6 (1), 1-22 (1979)
   PUBMED   478297
REFERENCE   9  (sites)
  AUTHORS   Zain,B.S. and Roberts,R.J.
  TITLE     Sequences from the beginning of the fiber messenger RNA of
  JOURNAL   J. Mol. Biol. 131 (2), 341-352 (1979)
   PUBMED   318165
REFERENCE   10 (sites)
  AUTHORS   Baker,C.C., Herisse,J., Courtois,G., Galibert,F. and Ziff,E.
  TITLE     Messenger RNA for the Ad2 DNA binding protein: DNA sequences
            encoding the first leader and heterogenity at the mRNA 5' end
  JOURNAL   Cell 18 (2), 569-580 (1979)
   PUBMED   227610
REFERENCE   11 (sites)
  AUTHORS   Perricaudet,M., Akusjarvi,G., Virtanen,A. and Pettersson,U.
  TITLE     Structure of two spliced mRNAs from the transforming region of
            human subgroup C adenoviruses
  JOURNAL   Nature 281 (5733), 694-696 (1979)
   PUBMED   551290
REFERENCE   12 (bases 1 to 2418)
  AUTHORS   Zain,S., Gingeras,T.R., Bullock,P., Wong,G. and Gelinas,R.E.
  TITLE     Determination and analysis of adenovirus-2 DNA sequences which may
            include signals for late messenger RNA processing
  JOURNAL   J. Mol. Biol. 135 (2), 413-433 (1979)
   PUBMED   318167
REFERENCE   13 (sites)
  AUTHORS   Baker,C.C. and Ziff,E.B.
  TITLE     Biogenesis, structures, and sites of encoding of the 5' termini of
            adenovirus-2 mRNAs
  JOURNAL   Cold Spring Harb. Symp. Quant. Biol. 44 Pt 1,, 415-428 (1980)
   PUBMED   6933038
REFERENCE   14 (sites)
  AUTHORS   Akusjarvi,G., Mathews,M.B., Andersson,P., Vennstrom,B. and
  TITLE     Structure of genes for virus-associated RNAI and RNAII of
            adenovirus type 2
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 77 (5), 2424-2428 (1980)
   PUBMED   6930642
REFERENCE   15 (sites)
  AUTHORS   Perricaudet,M., Le Moullec,J.M. and Pettersson,U.
  TITLE     Predicted structure of two adenovirus tumor antigens
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 77 (7), 3778-3782 (1980)
   PUBMED   6253988
REFERENCE   16 (sites)
  AUTHORS   Anderson,C.W. and Lewis,J.B.
  TITLE     Amino-terminal sequence of adenovirus type 2 proteins: hexon,
            fiber, component IX, and early protein 1B-15K
  JOURNAL   Virology 104 (1), 27-41 (1980)
   PUBMED   6985474
REFERENCE   17 (sites)
  AUTHORS   Persson,H., Jornvall,H. and Zabielski,J.
  TITLE     Multiple mRNA species for the precursor to an adenovirus-encoded
            glycoprotein: identification and structure of the signal sequence
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 77 (11), 6349-6353 (1980)
   PUBMED   6985478
REFERENCE   18 (sites)
  AUTHORS   Van Etten,R.A., Walberg,M.W. and Clayton,D.A.
  TITLE     Precise localization and nucleotide sequence of the two mouse
            mitochondrial rRNA genes and three immediately adjacent novel tRNA
  JOURNAL   Cell 22 (1 Pt 1), 157-170 (1980)
   PUBMED   7428037
REFERENCE   19 (sites)
  AUTHORS   Buttner,W. and Veres-Molnar,Z.
  TITLE     Localization of the 3'-terminal end of the EcoRI B
            fragment-specific early mRNA of adenovirus type 2
  JOURNAL   FEBS Lett. 122 (2), 317-321 (1980)
   PUBMED   6258984
REFERENCE   20 (sites)
  AUTHORS   Akusjarvi,G., Zabielski,J., Perricaudet,M. and Pettersson,U.
  TITLE     The sequence of the 3' non-coding region of the hexon mRNA
            discloses a novel adenovirus gene
  JOURNAL   Nucleic Acids Res. 9 (1), 1-17 (1981)
   PUBMED   6259616
REFERENCE   21 (sites)
  AUTHORS   Hashimoto,S., Pursley,M.H. and Green,M.
  TITLE     Nucleotide sequences and mapping of novel heterogenous 5'-termini
            of adenovirus 2 early region 4 mRNA
  JOURNAL   Nucleic Acids Res. 9 (7), 1675-1689 (1981)
   PUBMED   6164992
REFERENCE   22 (sites)
  AUTHORS   Akusjarvi,G. and Persson,H.
  TITLE     Gene and mRNA for precursor polypeptide VI from adenovirus type 2
  JOURNAL   J. Virol. 38 (2), 469-482 (1981)
   PUBMED   6985479
REFERENCE   23 (sites)
  AUTHORS   Baker,C.C. and Ziff,E.B.
  TITLE     Promoters and heterogeneous 5' termini of the messenger RNAs of
            adenovirus serotype 2
  JOURNAL   J. Mol. Biol. 149 (2), 189-221 (1981)
   PUBMED   6927849
REFERENCE   24 (sites)
  AUTHORS   Akusjarvi,G. and Persson,H.
  TITLE     Controls of RNA splicing and termination in the major late
            adenovirus transcription unit
  JOURNAL   Nature 292 (5822), 420-426 (1981)
   PUBMED   7254339
REFERENCE   25 (sites)
  AUTHORS   Kruijer,W., van Schaik,F.M. and Sussenbach,J.S.
  TITLE     Structure and organization of the gene coding for the DNA binding
            protein of adenovirus type 5
  JOURNAL   Nucleic Acids Res. 9 (18), 4439-4457 (1981)
   PUBMED   6117824
REFERENCE   26 (sites)
  AUTHORS   van Beveren,C.P., Maat,J., Dekker,B.M. and van Ormondt,H.
  TITLE     The nucleotide sequence of the gene for protein IVa2 and of the 5'
            leader segment of the major late mRNAs of adenovirus type 5
  JOURNAL   Gene 16 (1-3), 179-189 (1981)
   PUBMED   7343420
REFERENCE   27 (sites)
  AUTHORS   Uhlen,M., Svensson,C., Josephson,S., Alestrom,P.,
            Chattapadhyaya,J.B., Pettersson,U. and Philipson,L.
  TITLE     Leader arrangement in the adenovirus fiber mRNA
  JOURNAL   EMBO J. 1 (2), 249-254 (1982)
   PUBMED   6897915
REFERENCE   28 (bases 1 to 2418)
  AUTHORS   Fraser,N.W., Baker,C.C., Moore,M.A. and Ziff,E.B.
  TITLE     Poly(A) sites of adenovirus serotype 2 transcription units
  JOURNAL   J. Mol. Biol. 155 (3), 207-233 (1982)
   PUBMED   6176714
REFERENCE   29 (sites)
  AUTHORS   Osborne,T.F., Gaynor,R.B. and Berk,A.J.
  TITLE     The TATA homology and the mRNA 5' untranslated sequence are not
            required for expression of essential adenovirus E1A functions
  JOURNAL   Cell 29 (1), 139-148 (1982)
   PUBMED   7105179
REFERENCE   30 (bases 1 to 18)
  AUTHORS   Kruijer,W., Van Schaik,F.M. and Sussenbach,J.S.
  TITLE     Nucleotide sequence of the gene encoding adenovirus type 2 DNA
            binding protein
  JOURNAL   Nucleic Acids Res. 10 (15), 4493-4500 (1982)
   PUBMED   6985485
REFERENCE   31 (sites)
  AUTHORS   Gingeras,T.R., Sciaky,D., Gelinas,R.E., Bing-Dong,J., Yen,C.E.,
            Kelly,M.M., Bullock,P.A., Parsons,B.L., O'Neill,K.E. and
  TITLE     Nucleotide sequences from the adenovirus-2 genome
  JOURNAL   J. Biol. Chem. 257 (22), 13475-13491 (1982)
   PUBMED   7142161
REFERENCE   32 (sites)
  AUTHORS   Alestrom,P., Akusjarvi,G., Pettersson,M. and Pettersson,U.
  TITLE     DNA sequence analysis of the region encoding the terminal protein
            and the hypothetical N-gene product of adenovirus type 2
  JOURNAL   J. Biol. Chem. 257 (22), 13492-13498 (1982)
   PUBMED   7142162
REFERENCE   33 (sites)
  AUTHORS   Ahmed,C.M., Chanda,R.S., Stow,N.D. and Zain,B.S.
  TITLE     The nucleotide sequence of mRNA for the Mr 19 000 glycoprotein from
            early gene block III of adenovirus 2
  JOURNAL   Gene 20 (3), 339-346 (1982)
   PUBMED   6985487
REFERENCE   34 (sites)
  AUTHORS   Oosterom-Dragon,E.A. and Anderson,C.W.
  TITLE     Polypeptide structure and encoding location of the adenovirus
            serotype 2 late, nonstructural 33K protein
  JOURNAL   J. Virol. 45 (1), 251-263 (1983)
   PUBMED   6296434
REFERENCE   35 (sites)
  AUTHORS   Downey,J.F., Rowe,D.T., Bacchetti,S., Graham,F.L. and Bayley,S.T.
  TITLE     Mapping of a 14,000-dalton antigen to early region 4 of the human
            adenovirus 5 genome
  JOURNAL   J. Virol. 45 (2), 514-523 (1983)
   PUBMED   6834467
REFERENCE   36 (sites)
  AUTHORS   Goldenberg,C.J. and Hauser,S.D.
  TITLE     Accurate and efficient in vitro splicing of purified precursor RNAs
            specified by early region 2 of the adenovirus 2 genome
  JOURNAL   Nucleic Acids Res. 11 (5), 1337-1348 (1983)
   PUBMED   6186991
REFERENCE   37 (sites)
  AUTHORS   Virtanen,A. and Pettersson,U.
  TITLE     The molecular structure of the 9S mRNA from early region 1A of
            adenovirus serotype 2
  JOURNAL   J. Mol. Biol. 165 (3), 496-499 (1983)
   PUBMED   6842608
REFERENCE   38 (sites)
  AUTHORS   Stalhandske,P., Persson,H., Perricaudet,M., Philipson,L. and
  TITLE     Structure of three spliced mRNAs from region E3 of adenovirus type
  JOURNAL   Gene 22 (2-3), 157-165 (1983)
   PUBMED   6307816
REFERENCE   39 (sites)
  AUTHORS   Sung,M.T., Cao,T.M., Coleman,R.T. and Budelier,K.A.
  TITLE     Gene and protein sequences of adenovirus protein VII, a hybrid
            basic chromosomal protein
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 80 (10), 2902-2906 (1983)
   PUBMED   6574459
REFERENCE   40 (sites)
  AUTHORS   Lewis,J.B. and Anderson,C.W.
  TITLE     Proteins encoded near the adenovirus late messenger RNA leader
  JOURNAL   Virology 127 (1), 112-123 (1983)
   PUBMED   6857999
REFERENCE   41 (sites)
  AUTHORS   Yeh-Kai,L., Akusjarvi,G., Alestrom,P., Pettersson,U., Tremblay,M.
            and Weber,J.
  TITLE     Genetic identification of an endoproteinase encoded by the
            adenovirus genome
  JOURNAL   J. Mol. Biol. 167 (1), 217-222 (1983)
   PUBMED   6306252
REFERENCE   42 (sites)
  AUTHORS   Sung,M.T., Cao,T.M., Lischwe,M.A. and Coleman,R.T.
  TITLE     Molecular processing of adenovirus proteins
  JOURNAL   J. Biol. Chem. 258 (13), 8266-8272 (1983)
   PUBMED   6336325
REFERENCE   43 (sites)
  AUTHORS   Kruijer,W., van Schaik,F.M., Speijer,J.G. and Sussenbach,J.S.
  TITLE     Structure and function of adenovirus DNA binding protein:
            comparison of the amino acid sequences of the Ad5 and Ad12 proteins
            derived from the nucleotide sequence of the corresponding genes
  JOURNAL   Virology 128 (1), 140-153 (1983)
   PUBMED   6308889
REFERENCE   44 (sites)
  AUTHORS   Le Moullec,J.M., Akusjarvi,G., Stalhandske,P., Pettersson,U.,
            Chambraud,B., Gilardi,P., Nasri,M. and Perricaudet,M.
  TITLE     Polyadenylic acid addition sites in the adenovirus type 2 major
            late transcription unit
  JOURNAL   J. Virol. 48 (1), 127-134 (1983)
   PUBMED   6136617
REFERENCE   45 (sites)
  AUTHORS   Tigges,M.A. and Raskas,H.J.
  TITLE     Splice junctions in adenovirus 2 early region 4 mRNAs: multiple
            splice sites produce 18 to 24 RNAs
  JOURNAL   J. Virol. 50 (1), 106-117 (1984)
   PUBMED   6336328
REFERENCE   46 (sites)
  AUTHORS   Downey,J.F., Evelegh,C.M., Branton,P.E. and Bayley,S.T.
  TITLE     Peptide maps and N-terminal sequences of polypeptides from early
            region 1A of human adenovirus 5
  JOURNAL   J. Virol. 50 (1), 30-37 (1984)
   PUBMED   6699947
REFERENCE   47 (sites)
  AUTHORS   Freyer,G.A., Katoh,Y. and Roberts,R.J.
  TITLE     Characterization of the major mRNAs from adenovirus 2 early region
            4 by cDNA cloning and sequencing
  JOURNAL   Nucleic Acids Res. 12 (8), 3503-3519 (1984)
   PUBMED   6145146
REFERENCE   48 (sites)
  AUTHORS   Leff,T., Elkaim,R., Goding,C.R., Jalinot,P., Sassone-Corsi,P.,
            Perricaudet,M., Kedinger,C. and Chambon,P.
  TITLE     Individual products of the adenovirus 12S and 13S EIa mRNAs
            stimulate viral EIIa and EIII expression at the transcriptional
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 81 (14), 4381-4385 (1984)
   PUBMED   6336330
REFERENCE   49 (sites)
  AUTHORS   Padgett,R.A., Konarska,M.M., Grabowski,P.J., Hardy,S.F. and
  TITLE     Lariat RNA's as intermediates and products in the splicing of
            messenger RNA precursors
  JOURNAL   Science 225 (4665), 898-903 (1984)
   PUBMED   6206566
REFERENCE   50 (sites)
  AUTHORS   Natarajan,V., Madden,M.J. and Salzman,N.P.
  TITLE     Proximal and distal domains that control in vitro transcription of
            the adenovirus IVa2 gene
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 81 (20), 6290-6294 (1984)
   PUBMED   6593702
REFERENCE   51 (sites)
  AUTHORS   Wold,W.S., Cladaras,C., Magie,S.C. and Yacoub,N.
  TITLE     Mapping a new gene that encodes an 11,600-molecular-weight protein
            in the E3 transcription unit of adenovirus 2
  JOURNAL   J. Virol. 52 (2), 307-313 (1984)
   PUBMED   6492252
REFERENCE   52 (sites)
  AUTHORS   Mautner,V. and Mackay,N.
  TITLE     Recombination in adenovirus: analysis of crossover sites in
            intertypic overlap recombinants
  JOURNAL   Virology 139 (1), 43-52 (1984)
   PUBMED   6495658
REFERENCE   53 (sites)
  AUTHORS   Alestrom,P., Akusjarvi,G., Lager,M., Yeh-kai,L. and Pettersson,U.
  TITLE     Genes encoding the core proteins of adenovirus type 2
  JOURNAL   J. Biol. Chem. 259 (22), 13980-13985 (1984)
   PUBMED   6094534
REFERENCE   54 (bases 1 to 1531)
  AUTHORS   Roberts,R.J., O'Neill,K.E. and Yen,C.T.
  TITLE     DNA sequences from the adenovirus 2 genome
  JOURNAL   J. Biol. Chem. 259 (22), 13968-13975 (1984)
   PUBMED   6334081
REFERENCE   55 (sites)
  AUTHORS   Moore,C.L. and Sharp,P.A.
  TITLE     Accurate cleavage and polyadenylation of exogenous RNA substrate
  JOURNAL   Cell 41 (3), 845-855 (1985)
   PUBMED   2408761
REFERENCE   56 (sites)
  AUTHORS   Roberts,R.J., Akusjaervi,G., Alestroem,P., Gelinas,R.E.,
            Gingeras,T.R., Sciaky,D. and Pettersson,U.
  TITLE     A consensus sequence for the adenovirus-2 genome
  JOURNAL   (in) Doerfler,W. (Ed.);
            ADENOVIRUS DNA: 1-51;
            Martinus Nijhoff Publishing, Boston (1986)
REFERENCE   57 (sites)
  AUTHORS   Anderson,C.W., Schmitt,R.C., Smart,J.E. and Lewis,J.B.
  TITLE     Early region 1B of adenovirus serotype 2 encodes two co-terminal
            proteins of 495 and 155 amino acid residues
  JOURNAL   Unpublished (1984)
REFERENCE   58 (sites)
  AUTHORS   Virtanen,A. and Pettersson,U.
  JOURNAL   Unpublished (1984)
COMMENT     [1]  RNA.
            [2]  sites; cds start for the hexon protein.
            [4]  sites; 5' terminus of VA I RNA.
            [6]  mRNA and DNA.
            [11]  cDNA to hexon mRNA.
            [12]  several fragments over this span;.
            [13]  mRNA and DNA.
            [19]  cDNA.
            [16]  sites; acceptor splice site for fiber mRNA.
            [18]  several leader fragments over this span.
            [27]  sites; splice sites for E1b mRNAs.
            [28]  sites; cds start for E3 19K glycoprotein.
            [29]  sites; cds start for 15K, IX and fiber polypeptides. [37]
            sites; cap site for E4 mrnas.
            [39]  sites; splice sites in E2a mRNA.
            [31]  fragments over this span.
            [34]  sites; splice site in 52,55K-pept mRNA.
            [30]  sites; splice sites in IVa2 mRNA, Ad5.
            [43]  cDNA and DNA.
            [42]  sites; splice site for 'i' leader.
            [46]  mRNA and DNA.
            [41]  sites; E1a mutational analysis.
            [55]  sites; splice sites for 33K mRNA.
            [56]  sites; cds start for E4 11K-pept, ad5.
            [60]  sites; cds start for the 13.6K-pept.
            [61]  sites; splice sites for 72K and 100K mRNAs.
            [50]  sites; splice sites for leaders; poly-A sites. [52]  sites;
            splice sites for E1a mRNAs.
            [58]  sites; splice sites in E2 mRNA.
            [53]  sites; H2ts1 mutation between 57.0% and 69.0%. [49]  H2ts125
            [68]  sites; cds start for E1a proteins.
            [69]  sites; splice sites in E4 region.
            [71]  sites; splice sites in E4 region; poly-A site for E4 mRNAs.
            [63]  sites; cds start for 57K-pept.
            [63]  sites; splice sites in E4 region; poly-A site for E4 mRNAs.
            [63]  sites; splice sites in E1b region.
            [(in) Doerfler,W. (Ed.);Adenovirus DNA: 1-51;Martinus Nijhoff
            Publishing, Boston ]  review; bases 1 to 35937.
            [75]  sites; recombination analysis of ad2 and ad5. [74]  sites;
            splice sites in major late mRNA.
            [73]  sites; IVa2 transcription start.
            [72]  sites; transcription start for EIa mRNAs.
            [70]  sites; E3 11.6 -K protein.
            [76]  sites; L3 mRNA polyadenylation site.
            [78]  sites; L3 mRNA polyadenylation site.
            Communicated on tape by R. Roberts.  That tape and [(in) Doerfler,
            W. (Ed.);Adenovirus DNA: 1-51;Martinus Nijhoff Publishing, Boston ]
            are the
            immediate sources of the annotation herein.
            A consensus sequence for the l-strand of the genome is shown.
            Population heterogeneity as distinct from strain variation is known
            (35937 +/- 9 bp) [67]; both are annotated as 'variation' below. For
            site differences with adenovirus type 5, see loci beginning <ad5>
            which are arranged in the library according to the map coordinates
            of <ad2> where one map unit corresponds to 360 bases throughout
            (see [44],[67]).  For mutational changes in the ad2 sequence, see
            the appropriate references above.
            The origin of replication is located in the first fifty bases from
            each end.
            Transcription is leftward off the l-strand and rightward off the
            r-strand; in the former case, the annotation shows '(c)' for
            complementary strand.  Complex splicing events give rise to perhaps
            fifty or more distinct mRNA transcripts at early, intermediate and
            late times after infection, many of which are still being
            characterized; in particular, some transcripts are known from
            electron microscopy which are not yet characterized at the sequence
            level.  To date nine mRNA start sites (cap sites) have been
            identified, and these define the general units of mRNAs under which
            all known transcripts are classified.
            From the r-strand, the early transcripts are E1a, E1b and E3.  The
            28 kb late transcript called herein 'major late mRNA' comprises
            five families, L1 through L5, of 3' co-terminal mRNAs.  L1, and to
            a lesser extent L2, can be expressed at early and intermediate
            times [34].  Transcripts from this region contain a common
            tripartite leader sequence at their 5' ends: the three segments of
            this leader are encoded at bases 6039-6079, 7101-7172 and
            9634-9723.  At early and intermediate times, an extra leader
            segment, the 'i' leader, is frequently present (bases 7942-8381).
            The IX message, the only unspliced message in ad2, is intermediate,
            and its termination overlaps that for E1b on the same strand and
            that for IVa2, and most likely E2b, on the opposite strand. From
            the l-strand, or the 'comp strand', early expression derives from
            the E2a, E2b and E4 families of mRNAs, although there can be late
            transcription from E2a.  The E2b cap sites, splice sites and
            termination sites have not been determined at the sequence level.
            From electron microscopy there is evidence that the E2b mRNAs may
            originate at the E2a early cap site at 27092 (c) and terminate at
            the poly-A addition site found for the IVa2 mRNA at 4050 (c) [44].
            IVa2 is an intermediate message.  The promoters for these nine
            classes of mRNAs can be localized and characterized to the
            following extent [32]:
                mRNA          cap site      possible promoter region
               ------        ----------    -----------------------------
                E1a             498         tatttata at 468-474
                E1b            1699         tatataat at 1669-1676
                IX             3576         tatataa at 3545-3551
                major late     6039         tataaaa at 6008-6014
                E3            27609         tataa at 27580-27584
                E4            35609 (c)     tatatata at 35641-35633 (c)
                E2a early     27092 (c)     no obvious sequence for 100 bases
                E2a late      25956 (c)     tacaaattt at 25985-25977 (c)
                IVa2           5826 (c)     no obvious sequence for 100 bases
            The mRNA responsible for the 13.6K protein encoded at 7968 has not
            been identified.  The VA I and VA II transcripts are unique in that
            they are generated by RNA polymerase III; for a discussion of these
            low molecular weight RNAs-- the modulation of their start points,
            their promoters, their heterogeneity and their similarity to tRNA--
            see [3],[4],[5],[26] and <ad5a2>.
            The proteins known to be encoded from these mRNAs are given in the
            Features table below, though the details of translation and
            processing have not been fully determined.  In cases such as the
            IIIa peptide or the 11K peptide, the exact span of the coding
            awaits elucidation of the mRNA splicing.  Some of these products
            share reading frames and therefore manifest partial homologies. The
            following table summarizes the unidentified reading frames ('URF')
            of 100 or more amino acids:
                initiator    terminator      frame    protein encoded
                -----------  ----------     -------   -----------------
                 6280           6600           1      11.6K URF
                17284          17763           1      17.4K URF
                23782          24138           1      12.9K URF
                24481          24867           1      14.2K URF
                26044          26826           1      28.6K URF(contains the
                                                      N-terminus of 33K cds)
                30973          32778           1      63.9K URF(contains the
                                                      fiber cds)
                10421          10834           2      14.4K URF
                20504          20935           2      15.7K URF
                27899          28222           2      12.4K URF
                30059          30451           2      14.5k URF
                33956          34456           2      18.8K URF
                 9294           9800           3      17.7K URF
                23526          26525           3      110.2K URF(contains the
                                                      100K-pept cds)
                30444          30830           3      14.7K URF
                34470          34808           3      12.7K URF
                complementary strand
                35532          35146           1      14.3K URF
                34077          33193           1      34.1K URF
                11109          10744           1      12.8K URF
                 9030           8383           1      22.8K URF
                 6780           6442           1      12.8K URF
                31604          31290           2      10.7K URF
                31211          30852           2      13.5K URF
                18707          18159           2      18.9K URF
                14861          14424           2      16.4K URF
                14114          13728           2      13.5K URF
                11618          11250           2      13.6K URF
                 1712 1194 2 18.1K URF
                35113          34703           3      15.3K URF
                34342          33998           3      13.3K URF
                 5674           5327           3      12.2K URF Additionally
            there are numerous unidentified reading frames of less than 100
            amino acid residues; and further small modifications of a few of
            the coding sequences are possible.
            [7] missing data project.
FEATURES             Location/Qualifiers
     source          1..2418
                     /organism="Human adenovirus 2"
                     /mol_type="genomic DNA"
     gene            complement(<1..>2418)
     mRNA            complement(<1..>2418)
     prim_transcript <1..>2418
     gene            <1..>2418
     prim_transcript <1..>2418
     prim_transcript complement(<1..>2418)
     prim_transcript complement(<1..>2418)
     prim_transcript complement(<1..1849)
     prim_transcript complement(<1..1847)
     gene            complement(<1..1778)
     intron          complement(<1..607)
     CDS             1..2418
                     /note="100K protein"
                     /product="hexon assembly-associated protein"
     intron          complement(685..>2418)
     intron          complement(865..1778)
                     /note="does not fit consensus"
                     /cons_splice=(5'site:no, 3'site:no)
     gene            2132..>2418
     CDS             2132..>2418
                     /product="virion morphogenesis-associated protein"
        1 atggagtcag tcgagaagga ggacagccta accgccccct ttgagttcgc caccaccgcc
       61 tccaccgatg ccgccaacgc gcctaccacc ttccccgtcg aggcaccccc gcttgaggag
      121 gaggaagtga ttatcgagca ggacccaggt tttgtaagcg aagacgacga ggatcgctca
      181 gtaccaacag aggataaaaa gcaagaccag gacgacgcag aggcaaacga ggaacaagtc
      241 gggcgggggg accaaaggca tggcgactac ctagatgtgg gagacgacgt gctgttgaag
      301 catctgcagc gccagtgcgc cattatctgc gacgcgttgc aagagcgcag cgatgtgccc
      361 ctcgccatag cggatgtcag ccttgcctac gaacgccacc tgttctcacc gcgcgtaccc
      421 cccaaacgcc aagaaaacgg cacatgcgag cccaacccgc gcctcaactt ctaccccgta
      481 tttgccgtgc cagaggtgct tgccacctat cacatctttt tccaaaactg caagataccc
      541 ctatcctgcc gtgccaaccg cagccgagcg gacaagcagc tggccttgcg gcagggcgct
      601 gtcatacctg atatcgcctc gctcgacgaa gtgccaaaaa tctttgaggg tcttggacgc
      661 gacgagaaac gcgcggcaaa cgctctgcaa caagaaaaca gcgaaaatga aagtcactgt
      721 ggagtgctgg tggaacttga gggtgacaac gcgcgcctag ccgtgctgaa acgcagcatc
      781 gaggtcaccc actttgccta cccggcactt aacctacccc ccaaggttat gagcacagtc
      841 atgagcgagc tgatcgtgcg ccgtgcacga cccctggaga gggatgcaaa cttgcaagaa
      901 caaaccgagg agggcctacc cgcagttggc gatgagcagc tggcgcgctg gcttgagacg
      961 cgcgagcctg ccgacttgga ggagcgacgc aagctaatga tggccgcagt gcttgttacc
     1021 gtggagcttg agtgcatgca gcggttcttt gctgacccgg agatgcagcg caagctagag
     1081 gaaacgttgc actacacctt tcgccagggc tacgtgcgcc aggcctgcaa aatttccaac
     1141 gtggagctct gcaacctggt ctcctacctt ggaattttgc acgaaaaccg cctcgggcaa
     1201 aacgtgcttc attccacgct caagggcgag gcgcgccgcg actacgtccg cgactgcgtt
     1261 tacttatttc tgtgctacac ctggcaaacg gccatgggcg tgtggcagca atgcctggag
     1321 gagcgcaacc taaaggagct gcagaagctg ctaaagcaaa acttgaagga cctatggacg
     1381 gccttcaacg agcgctccgt ggccgcgcac ctggcggaca ttatcttccc cgaacgcctg
     1441 cttaaaaccc tgcaacaggg tctgccagac ttcaccagtc aaagcatgtt gcaaaacttt
     1501 aggaacttta tcctagagcg ttcaggaatt ctgcccgcca cctgctgtgc gcttcctagc
     1561 gactttgtgc ccattaagta ccgtgaatgc cctccgccgc tttggggtca ctgctacctt
     1621 ctgcagctag ccaactacct tgcctaccac tccgacatca tggaagacgt gagcggtgac
     1681 ggcctactgg agtgtcactg tcgctgcaac ctatgcaccc cgcaccgctc cctggtctgc
     1741 aattcgcaac tgcttagcga aagtcaaatt atcggtacct ttgagctgca gggtccctcg
     1801 cctgacgaaa agtccgcggc tccggggttg aaactcactc cggggctgtg gacgtcggct
     1861 taccttcgca aatttgtacc tgaggactac cacgcccacg agattaggtt ctacgaagac
     1921 caatcccgcc cgccaaatgc ggagcttacc gcctgcgtca ttacccaggg ccacatcctt
     1981 ggccaattgc aagccatcaa caaagcccgc caagagtttc tgctacgaaa gggacggggg
     2041 gtttacctgg acccccagtc cggcgaggag ctcaacccaa tccccccgcc gccgcagccc
     2101 tatcagcagc cgcgggccct tgcttcccag gatggcaccc aaaaagaagc tgcagctgcc
     2161 gccgccgcca cccacggacg aggaggaata ctgggacagt caggcagagg aggttttgga
     2221 cgaggaggag gagatgatgg aagactggga cagcctagac gaagcttccg aggccgaaga
     2281 ggtgtcagac gaaacaccgt caccctcggt cgcattcccc tcgccggcgc cccagaaatt
     2341 ggcaaccgtt cccagcatcg ctacaacctc cgctcctcag gcgccgccgg cactgcctgt
     2401 tcgccgaccc aaccgtag