LOCUS       ADRCG                   2907 bp    DNA     linear   VRL 14-MAR-1996
DEFINITION  Adenovirus type 2, complete genome.
ACCESSION   J01917 REGION: 18838..21744
VERSION     J01917.1  GI:209811
KEYWORDS    DNA polymerase; DNA-binding protein; RNA polymerase III;
            alternative splicing; coat protein; complete genome; genome-linked
            protein; glycoprotein; overlapping genes; polymerase; terminal
            repeat; unidentified reading frame; virus-associated RNA.
SOURCE      Human adenovirus 2
  ORGANISM  Human adenovirus 2
            Viruses; dsDNA viruses, no RNA stage; Adenoviridae; Mastadenovirus;
            Human adenovirus C.
REFERENCE   1  (sites)
  AUTHORS   Ohe,K. and Weissman,S.M.
  TITLE     The nucleotide sequence of a low molecular weight ribonucleic acid
            from cells infected with adenovirus 2
  JOURNAL   J. Biol. Chem. 246 (22), 6991-7009 (1971)
   PUBMED   4331202
REFERENCE   2  (sites)
  AUTHORS   Jornvall,H., Ohlsson,H. and Philipson,L.
  TITLE     An acetylated N-terminus of adenovirus type 2 hexon protein
  JOURNAL   Biochem. Biophys. Res. Commun. 56 (2), 304-310 (1974)
   PUBMED   4823869
REFERENCE   3  (sites)
  AUTHORS   Celma,M.L., Pan,J. and Weissman,S.M.
  TITLE     Studies of low molecular weight RNA from cells infected with
            adenovirus 2. I. The sequences at the 3' end of VA-RNA I
  JOURNAL   J. Biol. Chem. 252 (24), 9032-9042 (1977)
   PUBMED   318160
REFERENCE   4  (sites)
  AUTHORS   Celma,M.L., Pan,J. and Weissman,S.M.
  TITLE     Studies of low molecular weight RNA from cells infected with
            adenovirus 2. II. Heterogeneity at the 5' end of VA-RNA I
  JOURNAL   J. Biol. Chem. 252 (24), 9043-9046 (1977)
   PUBMED   925035
REFERENCE   5  (bases 1 to 81)
  AUTHORS   Akusjarvi,G. and Pettersson,U.
  TITLE     Nucleotide sequence at the junction between the coding region of
            the adenovirus 2 hexon messenger RNA and its leader sequence
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 75 (12), 5822-5826 (1978)
   PUBMED   83649
REFERENCE   6  (bases 2770 to 2907)
  AUTHORS   Akusjarvi,G. and Pettersson,U.
  TITLE     Sequence analysis of adenovirus DNA. I. Nucleotide sequence at the
            carboxy-terminal end of the gene for adenovirus type 2 hexon
  JOURNAL   Virology 91 (2), 477-480 (1978)
   PUBMED   741658
REFERENCE   7  (bases 1 to 24)
  AUTHORS   Akusjarvi,G. and Pettersson,U.
  TITLE     Sequence analysis of adenovirus DNA: complete nucleotide sequence
            of the spliced 5' noncoding region of adenovirus 2 hexon messenger
  JOURNAL   Cell 16 (4), 841-850 (1979)
   PUBMED   455452
REFERENCE   8  (bases 1 to 2907)
  AUTHORS   Zain,S., Sambrook,J., Roberts,R.J., Keller,W., Fried,M. and
  TITLE     Nucleotide sequence analysis of the leader segments in a cloned
            copy of adenovirus 2 fiber mRNA
  JOURNAL   Cell 16 (4), 851-861 (1979)
   PUBMED   455453
REFERENCE   9  (sites)
  AUTHORS   Shinagawa,M. and Padmanabhan,R.
  TITLE     Nucleotide sequence at the inverted terminal repetition of
            adenovirus type 2 DNA
  JOURNAL   Biochem. Biophys. Res. Commun. 87 (3), 671-678 (1979)
   PUBMED   454416
REFERENCE   10 (sites)
  AUTHORS   Zain,B.S. and Roberts,R.J.
  TITLE     Sequences from the beginning of the fiber messenger RNA of
  JOURNAL   J. Mol. Biol. 131 (2), 341-352 (1979)
   PUBMED   318165
REFERENCE   11 (sites)
  AUTHORS   Baker,C.C., Herisse,J., Courtois,G., Galibert,F. and Ziff,E.
  TITLE     Messenger RNA for the Ad2 DNA binding protein: DNA sequences
            encoding the first leader and heterogenity at the mRNA 5' end
  JOURNAL   Cell 18 (2), 569-580 (1979)
   PUBMED   227610
REFERENCE   12 (sites)
  AUTHORS   Perricaudet,M., Akusjarvi,G., Virtanen,A. and Pettersson,U.
  TITLE     Structure of two spliced mRNAs from the transforming region of
            human subgroup C adenoviruses
  JOURNAL   Nature 281 (5733), 694-696 (1979)
   PUBMED   551290
REFERENCE   13 (bases 1 to 2907)
  AUTHORS   Zain,S., Gingeras,T.R., Bullock,P., Wong,G. and Gelinas,R.E.
  TITLE     Determination and analysis of adenovirus-2 DNA sequences which may
            include signals for late messenger RNA processing
  JOURNAL   J. Mol. Biol. 135 (2), 413-433 (1979)
   PUBMED   318167
REFERENCE   14 (sites)
  AUTHORS   Baker,C.C. and Ziff,E.B.
  TITLE     Biogenesis, structures, and sites of encoding of the 5' termini of
            adenovirus-2 mRNAs
  JOURNAL   Cold Spring Harb. Symp. Quant. Biol. 44 Pt 1,, 415-428 (1980)
   PUBMED   6933038
REFERENCE   15 (sites)
  AUTHORS   Akusjarvi,G., Mathews,M.B., Andersson,P., Vennstrom,B. and
  TITLE     Structure of genes for virus-associated RNAI and RNAII of
            adenovirus type 2
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 77 (5), 2424-2428 (1980)
   PUBMED   6930642
REFERENCE   16 (sites)
  AUTHORS   Perricaudet,M., Le Moullec,J.M. and Pettersson,U.
  TITLE     Predicted structure of two adenovirus tumor antigens
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 77 (7), 3778-3782 (1980)
   PUBMED   6253988
REFERENCE   17 (sites)
  AUTHORS   Anderson,C.W. and Lewis,J.B.
  TITLE     Amino-terminal sequence of adenovirus type 2 proteins: hexon,
            fiber, component IX, and early protein 1B-15K
  JOURNAL   Virology 104 (1), 27-41 (1980)
   PUBMED   6985474
REFERENCE   18 (sites)
  AUTHORS   Persson,H., Jornvall,H. and Zabielski,J.
  TITLE     Multiple mRNA species for the precursor to an adenovirus-encoded
            glycoprotein: identification and structure of the signal sequence
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 77 (11), 6349-6353 (1980)
   PUBMED   6985478
REFERENCE   19 (sites)
  AUTHORS   Van Etten,R.A., Walberg,M.W. and Clayton,D.A.
  TITLE     Precise localization and nucleotide sequence of the two mouse
            mitochondrial rRNA genes and three immediately adjacent novel tRNA
  JOURNAL   Cell 22 (1 Pt 1), 157-170 (1980)
   PUBMED   7428037
REFERENCE   20 (sites)
  AUTHORS   Buttner,W. and Veres-Molnar,Z.
  TITLE     Localization of the 3'-terminal end of the EcoRI B
            fragment-specific early mRNA of adenovirus type 2
  JOURNAL   FEBS Lett. 122 (2), 317-321 (1980)
   PUBMED   6258984
REFERENCE   21 (bases 2770 to 2907)
  AUTHORS   Akusjarvi,G., Zabielski,J., Perricaudet,M. and Pettersson,U.
  TITLE     The sequence of the 3' non-coding region of the hexon mRNA
            discloses a novel adenovirus gene
  JOURNAL   Nucleic Acids Res. 9 (1), 1-17 (1981)
   PUBMED   6259616
REFERENCE   22 (sites)
  AUTHORS   Hashimoto,S., Pursley,M.H. and Green,M.
  TITLE     Nucleotide sequences and mapping of novel heterogenous 5'-termini
            of adenovirus 2 early region 4 mRNA
  JOURNAL   Nucleic Acids Res. 9 (7), 1675-1689 (1981)
   PUBMED   6164992
REFERENCE   23 (bases 1 to 81)
  AUTHORS   Akusjarvi,G. and Persson,H.
  TITLE     Gene and mRNA for precursor polypeptide VI from adenovirus type 2
  JOURNAL   J. Virol. 38 (2), 469-482 (1981)
   PUBMED   6985479
REFERENCE   24 (sites)
  AUTHORS   Baker,C.C. and Ziff,E.B.
  TITLE     Promoters and heterogeneous 5' termini of the messenger RNAs of
            adenovirus serotype 2
  JOURNAL   J. Mol. Biol. 149 (2), 189-221 (1981)
   PUBMED   6927849
REFERENCE   25 (bases 1 to 2907)
  AUTHORS   Jornvall,H., Alestrom,P., Akusjarvi,G., von Bahr-Lindstrom,H.,
            Philipson,L. and Pettersson,U.
  TITLE     Order of the CNBr fragments in the adenovirus hexon protein
  JOURNAL   J. Biol. Chem. 256 (12), 6204-6212 (1981)
   PUBMED   7240199
REFERENCE   26 (sites)
  AUTHORS   Akusjarvi,G. and Persson,H.
  TITLE     Controls of RNA splicing and termination in the major late
            adenovirus transcription unit
  JOURNAL   Nature 292 (5822), 420-426 (1981)
   PUBMED   7254339
REFERENCE   27 (sites)
  AUTHORS   Kruijer,W., van Schaik,F.M. and Sussenbach,J.S.
  TITLE     Structure and organization of the gene coding for the DNA binding
            protein of adenovirus type 5
  JOURNAL   Nucleic Acids Res. 9 (18), 4439-4457 (1981)
   PUBMED   6117824
REFERENCE   28 (sites)
  AUTHORS   van Beveren,C.P., Maat,J., Dekker,B.M. and van Ormondt,H.
  TITLE     The nucleotide sequence of the gene for protein IVa2 and of the 5'
            leader segment of the major late mRNAs of adenovirus type 5
  JOURNAL   Gene 16 (1-3), 179-189 (1981)
   PUBMED   7343420
REFERENCE   29 (sites)
  AUTHORS   Uhlen,M., Svensson,C., Josephson,S., Alestrom,P.,
            Chattapadhyaya,J.B., Pettersson,U. and Philipson,L.
  TITLE     Leader arrangement in the adenovirus fiber mRNA
  JOURNAL   EMBO J. 1 (2), 249-254 (1982)
   PUBMED   6897915
REFERENCE   30 (bases 2305 to 2907)
  AUTHORS   Fraser,N.W., Baker,C.C., Moore,M.A. and Ziff,E.B.
  TITLE     Poly(A) sites of adenovirus serotype 2 transcription units
  JOURNAL   J. Mol. Biol. 155 (3), 207-233 (1982)
   PUBMED   6176714
REFERENCE   31 (sites)
  AUTHORS   Osborne,T.F., Gaynor,R.B. and Berk,A.J.
  TITLE     The TATA homology and the mRNA 5' untranslated sequence are not
            required for expression of essential adenovirus E1A functions
  JOURNAL   Cell 29 (1), 139-148 (1982)
   PUBMED   7105179
REFERENCE   32 (sites)
  AUTHORS   Gingeras,T.R., Sciaky,D., Gelinas,R.E., Bing-Dong,J., Yen,C.E.,
            Kelly,M.M., Bullock,P.A., Parsons,B.L., O'Neill,K.E. and
  TITLE     Nucleotide sequences from the adenovirus-2 genome
  JOURNAL   J. Biol. Chem. 257 (22), 13475-13491 (1982)
   PUBMED   7142161
REFERENCE   33 (sites)
  AUTHORS   Alestrom,P., Akusjarvi,G., Pettersson,M. and Pettersson,U.
  TITLE     DNA sequence analysis of the region encoding the terminal protein
            and the hypothetical N-gene product of adenovirus type 2
  JOURNAL   J. Biol. Chem. 257 (22), 13492-13498 (1982)
   PUBMED   7142162
REFERENCE   34 (sites)
  AUTHORS   Ahmed,C.M., Chanda,R.S., Stow,N.D. and Zain,B.S.
  TITLE     The nucleotide sequence of mRNA for the Mr 19 000 glycoprotein from
            early gene block III of adenovirus 2
  JOURNAL   Gene 20 (3), 339-346 (1982)
   PUBMED   6985487
REFERENCE   35 (sites)
  AUTHORS   Oosterom-Dragon,E.A. and Anderson,C.W.
  TITLE     Polypeptide structure and encoding location of the adenovirus
            serotype 2 late, nonstructural 33K protein
  JOURNAL   J. Virol. 45 (1), 251-263 (1983)
   PUBMED   6296434
REFERENCE   36 (sites)
  AUTHORS   Downey,J.F., Rowe,D.T., Bacchetti,S., Graham,F.L. and Bayley,S.T.
  TITLE     Mapping of a 14,000-dalton antigen to early region 4 of the human
            adenovirus 5 genome
  JOURNAL   J. Virol. 45 (2), 514-523 (1983)
   PUBMED   6834467
REFERENCE   37 (sites)
  AUTHORS   Goldenberg,C.J. and Hauser,S.D.
  TITLE     Accurate and efficient in vitro splicing of purified precursor RNAs
            specified by early region 2 of the adenovirus 2 genome
  JOURNAL   Nucleic Acids Res. 11 (5), 1337-1348 (1983)
   PUBMED   6186991
REFERENCE   38 (sites)
  AUTHORS   Virtanen,A. and Pettersson,U.
  TITLE     The molecular structure of the 9S mRNA from early region 1A of
            adenovirus serotype 2
  JOURNAL   J. Mol. Biol. 165 (3), 496-499 (1983)
   PUBMED   6842608
REFERENCE   39 (sites)
  AUTHORS   Stalhandske,P., Persson,H., Perricaudet,M., Philipson,L. and
  TITLE     Structure of three spliced mRNAs from region E3 of adenovirus type
  JOURNAL   Gene 22 (2-3), 157-165 (1983)
   PUBMED   6307816
REFERENCE   40 (sites)
  AUTHORS   Sung,M.T., Cao,T.M., Coleman,R.T. and Budelier,K.A.
  TITLE     Gene and protein sequences of adenovirus protein VII, a hybrid
            basic chromosomal protein
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 80 (10), 2902-2906 (1983)
   PUBMED   6574459
REFERENCE   41 (sites)
  AUTHORS   Lewis,J.B. and Anderson,C.W.
  TITLE     Proteins encoded near the adenovirus late messenger RNA leader
  JOURNAL   Virology 127 (1), 112-123 (1983)
   PUBMED   6857999
REFERENCE   42 (sites)
  AUTHORS   Yeh-Kai,L., Akusjarvi,G., Alestrom,P., Pettersson,U., Tremblay,M.
            and Weber,J.
  TITLE     Genetic identification of an endoproteinase encoded by the
            adenovirus genome
  JOURNAL   J. Mol. Biol. 167 (1), 217-222 (1983)
   PUBMED   6306252
REFERENCE   43 (sites)
  AUTHORS   Sung,M.T., Cao,T.M., Lischwe,M.A. and Coleman,R.T.
  TITLE     Molecular processing of adenovirus proteins
  JOURNAL   J. Biol. Chem. 258 (13), 8266-8272 (1983)
   PUBMED   6336325
REFERENCE   44 (sites)
  AUTHORS   Kruijer,W., van Schaik,F.M., Speijer,J.G. and Sussenbach,J.S.
  TITLE     Structure and function of adenovirus DNA binding protein:
            comparison of the amino acid sequences of the Ad5 and Ad12 proteins
            derived from the nucleotide sequence of the corresponding genes
  JOURNAL   Virology 128 (1), 140-153 (1983)
   PUBMED   6308889
REFERENCE   45 (sites)
  AUTHORS   Le Moullec,J.M., Akusjarvi,G., Stalhandske,P., Pettersson,U.,
            Chambraud,B., Gilardi,P., Nasri,M. and Perricaudet,M.
  TITLE     Polyadenylic acid addition sites in the adenovirus type 2 major
            late transcription unit
  JOURNAL   J. Virol. 48 (1), 127-134 (1983)
   PUBMED   6136617
REFERENCE   46 (bases 1 to 396)
  AUTHORS   Mautner,V. and Boursnell,M.E.
  TITLE     Recombination in adenovirus: DNA sequence analysis of crossover
            sites in intertypic recombinants
  JOURNAL   Virology 131 (1), 1-10 (1983)
   PUBMED   6316645
REFERENCE   47 (sites)
  AUTHORS   Tigges,M.A. and Raskas,H.J.
  TITLE     Splice junctions in adenovirus 2 early region 4 mRNAs: multiple
            splice sites produce 18 to 24 RNAs
  JOURNAL   J. Virol. 50 (1), 106-117 (1984)
   PUBMED   6336328
REFERENCE   48 (sites)
  AUTHORS   Downey,J.F., Evelegh,C.M., Branton,P.E. and Bayley,S.T.
  TITLE     Peptide maps and N-terminal sequences of polypeptides from early
            region 1A of human adenovirus 5
  JOURNAL   J. Virol. 50 (1), 30-37 (1984)
   PUBMED   6699947
REFERENCE   49 (sites)
  AUTHORS   Freyer,G.A., Katoh,Y. and Roberts,R.J.
  TITLE     Characterization of the major mRNAs from adenovirus 2 early region
            4 by cDNA cloning and sequencing
  JOURNAL   Nucleic Acids Res. 12 (8), 3503-3519 (1984)
   PUBMED   6145146
REFERENCE   50 (sites)
  AUTHORS   Leff,T., Elkaim,R., Goding,C.R., Jalinot,P., Sassone-Corsi,P.,
            Perricaudet,M., Kedinger,C. and Chambon,P.
  TITLE     Individual products of the adenovirus 12S and 13S EIa mRNAs
            stimulate viral EIIa and EIII expression at the transcriptional
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 81 (14), 4381-4385 (1984)
   PUBMED   6336330
REFERENCE   51 (sites)
  AUTHORS   Padgett,R.A., Konarska,M.M., Grabowski,P.J., Hardy,S.F. and
  TITLE     Lariat RNA's as intermediates and products in the splicing of
            messenger RNA precursors
  JOURNAL   Science 225 (4665), 898-903 (1984)
   PUBMED   6206566
REFERENCE   52 (sites)
  AUTHORS   Natarajan,V., Madden,M.J. and Salzman,N.P.
  TITLE     Proximal and distal domains that control in vitro transcription of
            the adenovirus IVa2 gene
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 81 (20), 6290-6294 (1984)
   PUBMED   6593702
REFERENCE   53 (sites)
  AUTHORS   Wold,W.S., Cladaras,C., Magie,S.C. and Yacoub,N.
  TITLE     Mapping a new gene that encodes an 11,600-molecular-weight protein
            in the E3 transcription unit of adenovirus 2
  JOURNAL   J. Virol. 52 (2), 307-313 (1984)
   PUBMED   6492252
REFERENCE   54 (sites)
  AUTHORS   Mautner,V. and Mackay,N.
  TITLE     Recombination in adenovirus: analysis of crossover sites in
            intertypic overlap recombinants
  JOURNAL   Virology 139 (1), 43-52 (1984)
   PUBMED   6495658
REFERENCE   55 (sites)
  AUTHORS   Alestrom,P., Akusjarvi,G., Lager,M., Yeh-kai,L. and Pettersson,U.
  TITLE     Genes encoding the core proteins of adenovirus type 2
  JOURNAL   J. Biol. Chem. 259 (22), 13980-13985 (1984)
   PUBMED   6094534
REFERENCE   56 (sites)
  AUTHORS   Roberts,R.J., O'Neill,K.E. and Yen,C.T.
  TITLE     DNA sequences from the adenovirus 2 genome
  JOURNAL   J. Biol. Chem. 259 (22), 13968-13975 (1984)
   PUBMED   6334081
REFERENCE   57 (bases 1 to 2907)
  AUTHORS   Akusjarvi,G., Alestrom,P., Pettersson,M., Lager,M., Jornvall,H. and
  TITLE     The gene for the adenovirus 2 hexon polypeptide
  JOURNAL   J. Biol. Chem. 259 (22), 13976-13979 (1984)
   PUBMED   6501284
REFERENCE   58 (sites)
  AUTHORS   Moore,C.L. and Sharp,P.A.
  TITLE     Accurate cleavage and polyadenylation of exogenous RNA substrate
  JOURNAL   Cell 41 (3), 845-855 (1985)
   PUBMED   2408761
REFERENCE   59 (sites)
  AUTHORS   Roberts,R.J., Akusjaervi,G., Alestroem,P., Gelinas,R.E.,
            Gingeras,T.R., Sciaky,D. and Pettersson,U.
  TITLE     A consensus sequence for the adenovirus-2 genome
  JOURNAL   (in) Doerfler,W. (Ed.);
            ADENOVIRUS DNA: 1-51;
            Martinus Nijhoff Publishing, Boston (1986)
REFERENCE   60 (sites)
  AUTHORS   Anderson,C.W., Schmitt,R.C., Smart,J.E. and Lewis,J.B.
  TITLE     Early region 1B of adenovirus serotype 2 encodes two co-terminal
            proteins of 495 and 155 amino acid residues
  JOURNAL   Unpublished (1984)
REFERENCE   61 (sites)
  AUTHORS   Virtanen,A. and Pettersson,U.
  JOURNAL   Unpublished (1984)
COMMENT     [1]  RNA.
            [2]  sites; cds start for the hexon protein.
            [4]  sites; 5' terminus of VA I RNA.
            [6]  mRNA and DNA.
            [11]  cDNA to hexon mRNA.
            [12]  several fragments over this span;.
            [13]  mRNA and DNA.
            [19]  cDNA.
            [16]  sites; acceptor splice site for fiber mRNA.
            [18]  several leader fragments over this span.
            [27]  sites; splice sites for E1b mRNAs.
            [28]  sites; cds start for E3 19K glycoprotein.
            [29]  sites; cds start for 15K, IX and fiber polypeptides. [37]
            sites; cap site for E4 mrnas.
            [39]  sites; splice sites in E2a mRNA.
            [31]  fragments over this span.
            [34]  sites; splice site in 52,55K-pept mRNA.
            [30]  sites; splice sites in IVa2 mRNA, Ad5.
            [43]  cDNA and DNA.
            [42]  sites; splice site for 'i' leader.
            [46]  mRNA and DNA.
            [41]  sites; E1a mutational analysis.
            [55]  sites; splice sites for 33K mRNA.
            [56]  sites; cds start for E4 11K-pept, ad5.
            [60]  sites; cds start for the 13.6K-pept.
            [61]  sites; splice sites for 72K and 100K mRNAs.
            [50]  sites; splice sites for leaders; poly-A sites. [52]  sites;
            splice sites for E1a mRNAs.
            [58]  sites; splice sites in E2 mRNA.
            [53]  sites; H2ts1 mutation between 57.0% and 69.0%. [49]  H2ts125
            [68]  sites; cds start for E1a proteins.
            [69]  sites; splice sites in E4 region.
            [71]  sites; splice sites in E4 region; poly-A site for E4 mRNAs.
            [63]  sites; cds start for 57K-pept.
            [63]  sites; splice sites in E4 region; poly-A site for E4 mRNAs.
            [63]  sites; splice sites in E1b region.
            [(in) Doerfler,W. (Ed.);Adenovirus DNA: 1-51;Martinus Nijhoff
            Publishing, Boston ]  review; bases 1 to 35937.
            [75]  sites; recombination analysis of ad2 and ad5. [74]  sites;
            splice sites in major late mRNA.
            [73]  sites; IVa2 transcription start.
            [72]  sites; transcription start for EIa mRNAs.
            [70]  sites; E3 11.6 -K protein.
            [76]  sites; L3 mRNA polyadenylation site.
            [78]  sites; L3 mRNA polyadenylation site.
            Communicated on tape by R. Roberts.  That tape and [(in) Doerfler,
            W. (Ed.);Adenovirus DNA: 1-51;Martinus Nijhoff Publishing, Boston ]
            are the
            immediate sources of the annotation herein.
            A consensus sequence for the l-strand of the genome is shown.
            Population heterogeneity as distinct from strain variation is known
            (35937 +/- 9 bp) [67]; both are annotated as 'variation' below. For
            site differences with adenovirus type 5, see loci beginning <ad5>
            which are arranged in the library according to the map coordinates
            of <ad2> where one map unit corresponds to 360 bases throughout
            (see [44],[67]).  For mutational changes in the ad2 sequence, see
            the appropriate references above.
            The origin of replication is located in the first fifty bases from
            each end.
            Transcription is leftward off the l-strand and rightward off the
            r-strand; in the former case, the annotation shows '(c)' for
            complementary strand.  Complex splicing events give rise to perhaps
            fifty or more distinct mRNA transcripts at early, intermediate and
            late times after infection, many of which are still being
            characterized; in particular, some transcripts are known from
            electron microscopy which are not yet characterized at the sequence
            level.  To date nine mRNA start sites (cap sites) have been
            identified, and these define the general units of mRNAs under which
            all known transcripts are classified.
            From the r-strand, the early transcripts are E1a, E1b and E3.  The
            28 kb late transcript called herein 'major late mRNA' comprises
            five families, L1 through L5, of 3' co-terminal mRNAs.  L1, and to
            a lesser extent L2, can be expressed at early and intermediate
            times [34].  Transcripts from this region contain a common
            tripartite leader sequence at their 5' ends: the three segments of
            this leader are encoded at bases 6039-6079, 7101-7172 and
            9634-9723.  At early and intermediate times, an extra leader
            segment, the 'i' leader, is frequently present (bases 7942-8381).
            The IX message, the only unspliced message in ad2, is intermediate,
            and its termination overlaps that for E1b on the same strand and
            that for IVa2, and most likely E2b, on the opposite strand. From
            the l-strand, or the 'comp strand', early expression derives from
            the E2a, E2b and E4 families of mRNAs, although there can be late
            transcription from E2a.  The E2b cap sites, splice sites and
            termination sites have not been determined at the sequence level.
            From electron microscopy there is evidence that the E2b mRNAs may
            originate at the E2a early cap site at 27092 (c) and terminate at
            the poly-A addition site found for the IVa2 mRNA at 4050 (c) [44].
            IVa2 is an intermediate message.  The promoters for these nine
            classes of mRNAs can be localized and characterized to the
            following extent [32]:
                mRNA          cap site      possible promoter region
               ------        ----------    -----------------------------
                E1a             498         tatttata at 468-474
                E1b            1699         tatataat at 1669-1676
                IX             3576         tatataa at 3545-3551
                major late     6039         tataaaa at 6008-6014
                E3            27609         tataa at 27580-27584
                E4            35609 (c)     tatatata at 35641-35633 (c)
                E2a early     27092 (c)     no obvious sequence for 100 bases
                E2a late      25956 (c)     tacaaattt at 25985-25977 (c)
                IVa2           5826 (c)     no obvious sequence for 100 bases
            The mRNA responsible for the 13.6K protein encoded at 7968 has not
            been identified.  The VA I and VA II transcripts are unique in that
            they are generated by RNA polymerase III; for a discussion of these
            low molecular weight RNAs-- the modulation of their start points,
            their promoters, their heterogeneity and their similarity to tRNA--
            see [3],[4],[5],[26] and <ad5a2>.
            The proteins known to be encoded from these mRNAs are given in the
            Features table below, though the details of translation and
            processing have not been fully determined.  In cases such as the
            IIIa peptide or the 11K peptide, the exact span of the coding
            awaits elucidation of the mRNA splicing.  Some of these products
            share reading frames and therefore manifest partial homologies. The
            following table summarizes the unidentified reading frames ('URF')
            of 100 or more amino acids:
                initiator    terminator      frame    protein encoded
                -----------  ----------     -------   -----------------
                 6280           6600           1      11.6K URF
                17284          17763           1      17.4K URF
                23782          24138           1      12.9K URF
                24481          24867           1      14.2K URF
                26044          26826           1      28.6K URF(contains the
                                                      N-terminus of 33K cds)
                30973          32778           1      63.9K URF(contains the
                                                      fiber cds)
                10421          10834           2      14.4K URF
                20504          20935           2      15.7K URF
                27899          28222           2      12.4K URF
                30059          30451           2      14.5k URF
                33956          34456           2      18.8K URF
                 9294           9800           3      17.7K URF
                23526          26525           3      110.2K URF(contains the
                                                      100K-pept cds)
                30444          30830           3      14.7K URF
                34470          34808           3      12.7K URF
                complementary strand
                35532          35146           1      14.3K URF
                34077          33193           1      34.1K URF
                11109          10744           1      12.8K URF
                 9030           8383           1      22.8K URF
                 6780           6442           1      12.8K URF
                31604          31290           2      10.7K URF
                31211          30852           2      13.5K URF
                18707          18159           2      18.9K URF
                14861          14424           2      16.4K URF
                14114          13728           2      13.5K URF
                11618          11250           2      13.6K URF
                 1712 1194 2 18.1K URF
                35113          34703           3      15.3K URF
                34342          33998           3      13.3K URF
                 5674           5327           3      12.2K URF Additionally
            there are numerous unidentified reading frames of less than 100
            amino acid residues; and further small modifications of a few of
            the coding sequences are possible.
            [7] missing data project.
FEATURES             Location/Qualifiers
     source          1..2907
                     /organism="Human adenovirus 2"
                     /mol_type="genomic DNA"
     gene            complement(<1..>2907)
     mRNA            complement(<1..>2907)
     prim_transcript <1..>2907
     gene            <1..>2907
     prim_transcript <1..>2907
     gene            <1..>2907
     prim_transcript <1..>2907
     gene            <1..1650
     intron          <1..>2907
                     /note="precedes 100K mRNA"
     intron          <1..2812
                     /note="precedes 23K mRNA"
     CDS             1..2907
                     /note="virion component II"
                     /product="hexon protein"
     old_sequence    77..78
     old_sequence    82
     old_sequence    780
     old_sequence    829
     old_sequence    986
     old_sequence    1590
     old_sequence    1650
        1 atggctaccc cttcgatgat gccgcagtgg tcttacatgc acatctcggg ccaggacgcc
       61 tcggagtacc tgagccccgg gctggtgcag tttgcccgcg ccaccgagac gtacttcagc
      121 ctgaataaca agtttagaaa ccccacggtg gcacctacgc acgacgtaac cacagaccgg
      181 tcccagcgtt tgacgctgcg gttcatccct gtggaccgcg aggataccgc gtactcgtac
      241 aaagcgcggt tcaccctggc tgtgggtgac aaccgtgtgc ttgatatggc ttccacgtac
      301 tttgacatcc gcggcgtgct ggacaggggg cctactttta agccctactc cggcactgcc
      361 tacaacgctc tagctcccaa gggcgctcct aactcctgtg agtgggaaca aaccgaagat
      421 agcggccggg cagttgccga ggatgaagaa gaggaagatg aagatgaaga agaggaagaa
      481 gaagagcaaa acgctcgaga tcaggctact aagaaaacac atgtctatgc ccaggctcct
      541 ttgtctggag aaacaattac aaaaagcggg ctacaaatag gatcagacaa tgcagaaaca
      601 caagctaaac ctgtatacgc agatccttcc tatcaaccag aacctcaaat tggcgaatct
      661 cagtggaacg aagctgatgc taatgcggca ggagggagag tgcttaaaaa aacaactccc
      721 atgaaaccat gctatggatc ttatgccagg cctacaaatc cttttggtgg tcaatccgtt
      781 ctggttccgg atgaaaaagg ggtgcctctt ccaaaggttg acttgcaatt cttctcaaat
      841 actacctctt tgaacgaccg gcaaggcaat gctactaaac caaaagtggt tttgtacagt
      901 gaagatgtaa atatggaaac cccagacaca catctgtctt acaaacctgg aaaaggtgat
      961 gaaaattcta aagctatgtt gggtcaacaa tctatgccaa acagacccaa ttacattgct
     1021 ttcagggaca attttattgg cctaatgtat tataacagca ctggcaacat gggtgttctt
     1081 gctggtcagg catcgcagct aaatgccgtg gtagatttgc aagacagaaa cacagagctg
     1141 tcctatcaac tcttgcttga ttccataggt gatagaacca gatatttttc tatgtggaat
     1201 caggctgtag acagctatga tccagatgtt agaatcattg aaaaccatgg aactgaggat
     1261 gaattgccaa attattgttt tcctcttggg ggtattgggg taactgacac ctatcaagct
     1321 attaaggcta atggcaatgg ctcaggcgat aatggagata ctacatggac aaaagatgaa
     1381 acttttgcaa cacgtaatga aataggagtg ggtaacaact ttgccatgga aattaaccta
     1441 aatgccaacc tatggagaaa tttcctttac tccaatattg cgctgtacct gccagacaag
     1501 ctaaaataca accccaccaa tgtggaaata tctgacaacc ccaacaccta cgactacatg
     1561 aacaagcgag tggtggctcc cgggcttgta gactgctaca ttaaccttgg ggcgcgctgg
     1621 tctctggact acatggacaa cgttaatccc tttaaccacc accgcaatgc gggcctccgt
     1681 tatcgctcca tgttgttggg aaacggccgc tacgtgccct ttcacattca ggtgccccaa
     1741 aagttttttg ccattaaaaa cctcctcctc ctgccaggct catatacata tgaatggaac
     1801 ttcaggaagg atgttaacat ggttctgcag agctctctgg gaaacgatct tagagttgac
     1861 ggggctagca ttaagtttga cagcatttgt ctttacgcca ccttcttccc catggcccac
     1921 aacacggcct ccacgctgga agccatgctc agaaatgaca ccaacgacca gtcctttaat
     1981 gactaccttt ccgccgccaa catgctatac cccatacccg ccaacgccac caacgtgccc
     2041 atctccatcc catcgcgcaa ctgggcagca tttcgcggtt gggccttcac acgcttgaag
     2101 acaaaggaaa ccccttccct gggatcaggc tacgaccctt actacaccta ctctggctcc
     2161 ataccatacc ttgacggaac cttctatctt aatcacacct ttaagaaggt ggccattacc
     2221 tttgactctt ctgttagctg gccgggcaac gaccgcctgc ttactcccaa tgagtttgag
     2281 attaaacgct cagttgacgg ggagggctac aacgtagctc agtgcaacat gaccaaggac
     2341 tggttcctgg tgcagatgtt ggccaactac aatattggct accagggctt ctacattcca
     2401 gaaagctaca aggaccgcat gtactcgttc ttcagaaact tccagcccat gagccggcaa
     2461 gtggttgacg atactaaata caaggagtat cagcaggttg gaattcttca ccagcataac
     2521 aactcaggat tcgtaggcta cctcgctccc accatgcgcg agggacaggc ttaccccgcc
     2581 aacgtgccct acccactaat aggcaaaacc gcggttgaca gtattaccca gaaaaagttt
     2641 ctttgcgatc gcaccctttg gcgcatccca ttctccagta actttatgtc catgggcgca
     2701 ctcacagacc tgggccaaaa ccttctctac gccaactccg cccacgcgct agacatgact
     2761 tttgaggtgg atcccatgga cgagcccacc cttctttatg ttttgtttga agtctttgac
     2821 gtggtccgtg tgcaccagcc gcaccgcggc gtcatcgaga ccgtgtacct gcgcacgccc
     2881 ttctcggccg gcaacgccac aacataa