LOCUS       ADRCG                   3171 bp    DNA     linear   VRL 14-MAR-1996
DEFINITION  Adenovirus type 2, complete genome.
ACCESSION   J01917 REGION: 5187..8357
VERSION     J01917.1  GI:209811
KEYWORDS    DNA polymerase; DNA-binding protein; RNA polymerase III;
            alternative splicing; coat protein; complete genome; genome-linked
            protein; glycoprotein; overlapping genes; polymerase; terminal
            repeat; unidentified reading frame; virus-associated RNA.
SOURCE      Human adenovirus 2
  ORGANISM  Human adenovirus 2
            Viruses; dsDNA viruses, no RNA stage; Adenoviridae; Mastadenovirus;
            Human adenovirus C.
REFERENCE   1  (sites)
  AUTHORS   Ohe,K. and Weissman,S.M.
  TITLE     The nucleotide sequence of a low molecular weight ribonucleic acid
            from cells infected with adenovirus 2
  JOURNAL   J. Biol. Chem. 246 (22), 6991-7009 (1971)
   PUBMED   4331202
REFERENCE   2  (sites)
  AUTHORS   Jornvall,H., Ohlsson,H. and Philipson,L.
  TITLE     An acetylated N-terminus of adenovirus type 2 hexon protein
  JOURNAL   Biochem. Biophys. Res. Commun. 56 (2), 304-310 (1974)
   PUBMED   4823869
REFERENCE   3  (sites)
  AUTHORS   Celma,M.L., Pan,J. and Weissman,S.M.
  TITLE     Studies of low molecular weight RNA from cells infected with
            adenovirus 2. I. The sequences at the 3' end of VA-RNA I
  JOURNAL   J. Biol. Chem. 252 (24), 9032-9042 (1977)
   PUBMED   318160
REFERENCE   4  (sites)
  AUTHORS   Celma,M.L., Pan,J. and Weissman,S.M.
  TITLE     Studies of low molecular weight RNA from cells infected with
            adenovirus 2. II. Heterogeneity at the 5' end of VA-RNA I
  JOURNAL   J. Biol. Chem. 252 (24), 9043-9046 (1977)
   PUBMED   925035
REFERENCE   5  (sites)
  AUTHORS   Akusjarvi,G. and Pettersson,U.
  TITLE     Nucleotide sequence at the junction between the coding region of
            the adenovirus 2 hexon messenger RNA and its leader sequence
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 75 (12), 5822-5826 (1978)
   PUBMED   83649
REFERENCE   6  (bases 800 to 1050)
  AUTHORS   Ziff,E.B. and Evans,R.M.
  TITLE     Coincidence of the promoter and capped 5' terminus of RNA from the
            adenovirus 2 major late transcription unit
  JOURNAL   Cell 15 (4), 1463-1475 (1978)
   PUBMED   729003
REFERENCE   7  (bases 853 to 893; 1915 to 1986)
  AUTHORS   Akusjarvi,G. and Pettersson,U.
  TITLE     Sequence analysis of adenovirus DNA: complete nucleotide sequence
            of the spliced 5' noncoding region of adenovirus 2 hexon messenger
            RNA
  JOURNAL   Cell 16 (4), 841-850 (1979)
   PUBMED   455452
REFERENCE   8  (bases 853 to 3171)
  AUTHORS   Zain,S., Sambrook,J., Roberts,R.J., Keller,W., Fried,M. and
            Dunn,A.R.
  TITLE     Nucleotide sequence analysis of the leader segments in a cloned
            copy of adenovirus 2 fiber mRNA
  JOURNAL   Cell 16 (4), 851-861 (1979)
   PUBMED   455453
REFERENCE   9  (sites)
  AUTHORS   Shinagawa,M. and Padmanabhan,R.
  TITLE     Nucleotide sequence at the inverted terminal repetition of
            adenovirus type 2 DNA
  JOURNAL   Biochem. Biophys. Res. Commun. 87 (3), 671-678 (1979)
   PUBMED   454416
REFERENCE   10 (sites)
  AUTHORS   Zain,B.S. and Roberts,R.J.
  TITLE     Sequences from the beginning of the fiber messenger RNA of
            adenovirus-2
  JOURNAL   J. Mol. Biol. 131 (2), 341-352 (1979)
   PUBMED   318165
REFERENCE   11 (sites)
  AUTHORS   Baker,C.C., Herisse,J., Courtois,G., Galibert,F. and Ziff,E.
  TITLE     Messenger RNA for the Ad2 DNA binding protein: DNA sequences
            encoding the first leader and heterogenity at the mRNA 5' end
  JOURNAL   Cell 18 (2), 569-580 (1979)
   PUBMED   227610
REFERENCE   12 (bases 723 to 992; 1837 to 2026)
  AUTHORS   Akusjarvi,G. and Pettersson,J.
  TITLE     Sequence analysis of adenovirus DNA. IV. The genomic sequences
            encoding the common tripartite leader of late adenovirus messenger
            RNA
  JOURNAL   J. Mol. Biol. 134 (1), 143-158 (1979)
   PUBMED   318166
REFERENCE   13 (sites)
  AUTHORS   Perricaudet,M., Akusjarvi,G., Virtanen,A. and Pettersson,U.
  TITLE     Structure of two spliced mRNAs from the transforming region of
            human subgroup C adenoviruses
  JOURNAL   Nature 281 (5733), 694-696 (1979)
   PUBMED   551290
REFERENCE   14 (bases 853 to 3171)
  AUTHORS   Zain,S., Gingeras,T.R., Bullock,P., Wong,G. and Gelinas,R.E.
  TITLE     Determination and analysis of adenovirus-2 DNA sequences which may
            include signals for late messenger RNA processing
  JOURNAL   J. Mol. Biol. 135 (2), 413-433 (1979)
   PUBMED   318167
REFERENCE   15 (bases 662 to 1392)
  AUTHORS   Baker,C.C. and Ziff,E.B.
  TITLE     Biogenesis, structures, and sites of encoding of the 5' termini of
            adenovirus-2 mRNAs
  JOURNAL   Cold Spring Harb. Symp. Quant. Biol. 44 Pt 1,, 415-428 (1980)
   PUBMED   6933038
REFERENCE   16 (sites)
  AUTHORS   Akusjarvi,G., Mathews,M.B., Andersson,P., Vennstrom,B. and
            Pettersson,U.
  TITLE     Structure of genes for virus-associated RNAI and RNAII of
            adenovirus type 2
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 77 (5), 2424-2428 (1980)
   PUBMED   6930642
REFERENCE   17 (sites)
  AUTHORS   Perricaudet,M., Le Moullec,J.M. and Pettersson,U.
  TITLE     Predicted structure of two adenovirus tumor antigens
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 77 (7), 3778-3782 (1980)
   PUBMED   6253988
REFERENCE   18 (sites)
  AUTHORS   Anderson,C.W. and Lewis,J.B.
  TITLE     Amino-terminal sequence of adenovirus type 2 proteins: hexon,
            fiber, component IX, and early protein 1B-15K
  JOURNAL   Virology 104 (1), 27-41 (1980)
   PUBMED   6985474
REFERENCE   19 (sites)
  AUTHORS   Persson,H., Jornvall,H. and Zabielski,J.
  TITLE     Multiple mRNA species for the precursor to an adenovirus-encoded
            glycoprotein: identification and structure of the signal sequence
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 77 (11), 6349-6353 (1980)
   PUBMED   6985478
REFERENCE   20 (sites)
  AUTHORS   Van Etten,R.A., Walberg,M.W. and Clayton,D.A.
  TITLE     Precise localization and nucleotide sequence of the two mouse
            mitochondrial rRNA genes and three immediately adjacent novel tRNA
            genes
  JOURNAL   Cell 22 (1 Pt 1), 157-170 (1980)
   PUBMED   7428037
REFERENCE   21 (sites)
  AUTHORS   Buttner,W. and Veres-Molnar,Z.
  TITLE     Localization of the 3'-terminal end of the EcoRI B
            fragment-specific early mRNA of adenovirus type 2
  JOURNAL   FEBS Lett. 122 (2), 317-321 (1980)
   PUBMED   6258984
REFERENCE   22 (sites)
  AUTHORS   Akusjarvi,G., Zabielski,J., Perricaudet,M. and Pettersson,U.
  TITLE     The sequence of the 3' non-coding region of the hexon mRNA
            discloses a novel adenovirus gene
  JOURNAL   Nucleic Acids Res. 9 (1), 1-17 (1981)
   PUBMED   6259616
REFERENCE   23 (sites)
  AUTHORS   Hashimoto,S., Pursley,M.H. and Green,M.
  TITLE     Nucleotide sequences and mapping of novel heterogenous 5'-termini
            of adenovirus 2 early region 4 mRNA
  JOURNAL   Nucleic Acids Res. 9 (7), 1675-1689 (1981)
   PUBMED   6164992
REFERENCE   24 (sites)
  AUTHORS   Akusjarvi,G. and Persson,H.
  TITLE     Gene and mRNA for precursor polypeptide VI from adenovirus type 2
  JOURNAL   J. Virol. 38 (2), 469-482 (1981)
   PUBMED   6985479
REFERENCE   25 (bases 631 to 865)
  AUTHORS   Baker,C.C. and Ziff,E.B.
  TITLE     Promoters and heterogeneous 5' termini of the messenger RNAs of
            adenovirus serotype 2
  JOURNAL   J. Mol. Biol. 149 (2), 189-221 (1981)
   PUBMED   6927849
REFERENCE   26 (sites)
  AUTHORS   Akusjarvi,G. and Persson,H.
  TITLE     Controls of RNA splicing and termination in the major late
            adenovirus transcription unit
  JOURNAL   Nature 292 (5822), 420-426 (1981)
   PUBMED   7254339
REFERENCE   27 (sites)
  AUTHORS   Kruijer,W., van Schaik,F.M. and Sussenbach,J.S.
  TITLE     Structure and organization of the gene coding for the DNA binding
            protein of adenovirus type 5
  JOURNAL   Nucleic Acids Res. 9 (18), 4439-4457 (1981)
   PUBMED   6117824
REFERENCE   28 (sites)
  AUTHORS   van Beveren,C.P., Maat,J., Dekker,B.M. and van Ormondt,H.
  TITLE     The nucleotide sequence of the gene for protein IVa2 and of the 5'
            leader segment of the major late mRNAs of adenovirus type 5
  JOURNAL   Gene 16 (1-3), 179-189 (1981)
   PUBMED   7343420
REFERENCE   29 (sites)
  AUTHORS   Uhlen,M., Svensson,C., Josephson,S., Alestrom,P.,
            Chattapadhyaya,J.B., Pettersson,U. and Philipson,L.
  TITLE     Leader arrangement in the adenovirus fiber mRNA
  JOURNAL   EMBO J. 1 (2), 249-254 (1982)
   PUBMED   6897915
REFERENCE   30 (bases 2683 to 3171)
  AUTHORS   Virtanen,A., Alestrom,P., Persson,H., Katze,M.G. and Pettersson,U.
  TITLE     An adenovirus agnogene
  JOURNAL   Nucleic Acids Res. 10 (8), 2539-2548 (1982)
   PUBMED   6281734
REFERENCE   31 (sites)
  AUTHORS   Osborne,T.F., Gaynor,R.B. and Berk,A.J.
  TITLE     The TATA homology and the mRNA 5' untranslated sequence are not
            required for expression of essential adenovirus E1A functions
  JOURNAL   Cell 29 (1), 139-148 (1982)
   PUBMED   7105179
REFERENCE   32 (bases 1 to 3171)
  AUTHORS   Gingeras,T.R., Sciaky,D., Gelinas,R.E., Bing-Dong,J., Yen,C.E.,
            Kelly,M.M., Bullock,P.A., Parsons,B.L., O'Neill,K.E. and
            Roberts,R.J.
  TITLE     Nucleotide sequences from the adenovirus-2 genome
  JOURNAL   J. Biol. Chem. 257 (22), 13475-13491 (1982)
   PUBMED   7142161
REFERENCE   33 (bases 592 to 3171)
  AUTHORS   Alestrom,P., Akusjarvi,G., Pettersson,M. and Pettersson,U.
  TITLE     DNA sequence analysis of the region encoding the terminal protein
            and the hypothetical N-gene product of adenovirus type 2
  JOURNAL   J. Biol. Chem. 257 (22), 13492-13498 (1982)
   PUBMED   7142162
REFERENCE   34 (sites)
  AUTHORS   Ahmed,C.M., Chanda,R.S., Stow,N.D. and Zain,B.S.
  TITLE     The nucleotide sequence of mRNA for the Mr 19 000 glycoprotein from
            early gene block III of adenovirus 2
  JOURNAL   Gene 20 (3), 339-346 (1982)
   PUBMED   6985487
REFERENCE   35 (sites)
  AUTHORS   Oosterom-Dragon,E.A. and Anderson,C.W.
  TITLE     Polypeptide structure and encoding location of the adenovirus
            serotype 2 late, nonstructural 33K protein
  JOURNAL   J. Virol. 45 (1), 251-263 (1983)
   PUBMED   6296434
REFERENCE   36 (bases 2743 to 3171)
  AUTHORS   Falvey,E. and Ziff,E.
  TITLE     Sequence arrangement and protein coding capacity of the adenovirus
            type 2 'i' leader
  JOURNAL   J. Virol. 45 (1), 185-191 (1983)
   PUBMED   6823011
REFERENCE   37 (sites)
  AUTHORS   Downey,J.F., Rowe,D.T., Bacchetti,S., Graham,F.L. and Bayley,S.T.
  TITLE     Mapping of a 14,000-dalton antigen to early region 4 of the human
            adenovirus 5 genome
  JOURNAL   J. Virol. 45 (2), 514-523 (1983)
   PUBMED   6834467
REFERENCE   38 (sites)
  AUTHORS   Goldenberg,C.J. and Hauser,S.D.
  TITLE     Accurate and efficient in vitro splicing of purified precursor RNAs
            specified by early region 2 of the adenovirus 2 genome
  JOURNAL   Nucleic Acids Res. 11 (5), 1337-1348 (1983)
   PUBMED   6186991
REFERENCE   39 (sites)
  AUTHORS   Virtanen,A. and Pettersson,U.
  TITLE     The molecular structure of the 9S mRNA from early region 1A of
            adenovirus serotype 2
  JOURNAL   J. Mol. Biol. 165 (3), 496-499 (1983)
   PUBMED   6842608
REFERENCE   40 (sites)
  AUTHORS   Stalhandske,P., Persson,H., Perricaudet,M., Philipson,L. and
            Pettersson,U.
  TITLE     Structure of three spliced mRNAs from region E3 of adenovirus type
            2
  JOURNAL   Gene 22 (2-3), 157-165 (1983)
   PUBMED   6307816
REFERENCE   41 (sites)
  AUTHORS   Sung,M.T., Cao,T.M., Coleman,R.T. and Budelier,K.A.
  TITLE     Gene and protein sequences of adenovirus protein VII, a hybrid
            basic chromosomal protein
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 80 (10), 2902-2906 (1983)
   PUBMED   6574459
REFERENCE   42 (sites)
  AUTHORS   Lewis,J.B. and Anderson,C.W.
  TITLE     Proteins encoded near the adenovirus late messenger RNA leader
            segments
  JOURNAL   Virology 127 (1), 112-123 (1983)
   PUBMED   6857999
REFERENCE   43 (sites)
  AUTHORS   Yeh-Kai,L., Akusjarvi,G., Alestrom,P., Pettersson,U., Tremblay,M.
            and Weber,J.
  TITLE     Genetic identification of an endoproteinase encoded by the
            adenovirus genome
  JOURNAL   J. Mol. Biol. 167 (1), 217-222 (1983)
   PUBMED   6306252
REFERENCE   44 (sites)
  AUTHORS   Sung,M.T., Cao,T.M., Lischwe,M.A. and Coleman,R.T.
  TITLE     Molecular processing of adenovirus proteins
  JOURNAL   J. Biol. Chem. 258 (13), 8266-8272 (1983)
   PUBMED   6336325
REFERENCE   45 (sites)
  AUTHORS   Kruijer,W., van Schaik,F.M., Speijer,J.G. and Sussenbach,J.S.
  TITLE     Structure and function of adenovirus DNA binding protein:
            comparison of the amino acid sequences of the Ad5 and Ad12 proteins
            derived from the nucleotide sequence of the corresponding genes
  JOURNAL   Virology 128 (1), 140-153 (1983)
   PUBMED   6308889
REFERENCE   46 (sites)
  AUTHORS   Le Moullec,J.M., Akusjarvi,G., Stalhandske,P., Pettersson,U.,
            Chambraud,B., Gilardi,P., Nasri,M. and Perricaudet,M.
  TITLE     Polyadenylic acid addition sites in the adenovirus type 2 major
            late transcription unit
  JOURNAL   J. Virol. 48 (1), 127-134 (1983)
   PUBMED   6136617
REFERENCE   47 (sites)
  AUTHORS   Tigges,M.A. and Raskas,H.J.
  TITLE     Splice junctions in adenovirus 2 early region 4 mRNAs: multiple
            splice sites produce 18 to 24 RNAs
  JOURNAL   J. Virol. 50 (1), 106-117 (1984)
   PUBMED   6336328
REFERENCE   48 (sites)
  AUTHORS   Downey,J.F., Evelegh,C.M., Branton,P.E. and Bayley,S.T.
  TITLE     Peptide maps and N-terminal sequences of polypeptides from early
            region 1A of human adenovirus 5
  JOURNAL   J. Virol. 50 (1), 30-37 (1984)
   PUBMED   6699947
REFERENCE   49 (sites)
  AUTHORS   Freyer,G.A., Katoh,Y. and Roberts,R.J.
  TITLE     Characterization of the major mRNAs from adenovirus 2 early region
            4 by cDNA cloning and sequencing
  JOURNAL   Nucleic Acids Res. 12 (8), 3503-3519 (1984)
   PUBMED   6145146
REFERENCE   50 (sites)
  AUTHORS   Leff,T., Elkaim,R., Goding,C.R., Jalinot,P., Sassone-Corsi,P.,
            Perricaudet,M., Kedinger,C. and Chambon,P.
  TITLE     Individual products of the adenovirus 12S and 13S EIa mRNAs
            stimulate viral EIIa and EIII expression at the transcriptional
            level
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 81 (14), 4381-4385 (1984)
   PUBMED   6336330
REFERENCE   51 (sites)
  AUTHORS   Padgett,R.A., Konarska,M.M., Grabowski,P.J., Hardy,S.F. and
            Sharp,P.A.
  TITLE     Lariat RNA's as intermediates and products in the splicing of
            messenger RNA precursors
  JOURNAL   Science 225 (4665), 898-903 (1984)
   PUBMED   6206566
REFERENCE   52 (sites)
  AUTHORS   Natarajan,V., Madden,M.J. and Salzman,N.P.
  TITLE     Proximal and distal domains that control in vitro transcription of
            the adenovirus IVa2 gene
  JOURNAL   Proc. Natl. Acad. Sci. U.S.A. 81 (20), 6290-6294 (1984)
   PUBMED   6593702
REFERENCE   53 (sites)
  AUTHORS   Wold,W.S., Cladaras,C., Magie,S.C. and Yacoub,N.
  TITLE     Mapping a new gene that encodes an 11,600-molecular-weight protein
            in the E3 transcription unit of adenovirus 2
  JOURNAL   J. Virol. 52 (2), 307-313 (1984)
   PUBMED   6492252
REFERENCE   54 (sites)
  AUTHORS   Mautner,V. and Mackay,N.
  TITLE     Recombination in adenovirus: analysis of crossover sites in
            intertypic overlap recombinants
  JOURNAL   Virology 139 (1), 43-52 (1984)
   PUBMED   6495658
REFERENCE   55 (sites)
  AUTHORS   Alestrom,P., Akusjarvi,G., Lager,M., Yeh-kai,L. and Pettersson,U.
  TITLE     Genes encoding the core proteins of adenovirus type 2
  JOURNAL   J. Biol. Chem. 259 (22), 13980-13985 (1984)
   PUBMED   6094534
REFERENCE   56 (sites)
  AUTHORS   Roberts,R.J., O'Neill,K.E. and Yen,C.T.
  TITLE     DNA sequences from the adenovirus 2 genome
  JOURNAL   J. Biol. Chem. 259 (22), 13968-13975 (1984)
   PUBMED   6334081
REFERENCE   57 (sites)
  AUTHORS   Moore,C.L. and Sharp,P.A.
  TITLE     Accurate cleavage and polyadenylation of exogenous RNA substrate
  JOURNAL   Cell 41 (3), 845-855 (1985)
   PUBMED   2408761
REFERENCE   58 (sites)
  AUTHORS   Roberts,R.J., Akusjaervi,G., Alestroem,P., Gelinas,R.E.,
            Gingeras,T.R., Sciaky,D. and Pettersson,U.
  TITLE     A consensus sequence for the adenovirus-2 genome
  JOURNAL   (in) Doerfler,W. (Ed.);
            ADENOVIRUS DNA: 1-51;
            Martinus Nijhoff Publishing, Boston (1986)
REFERENCE   59 (sites)
  AUTHORS   Anderson,C.W., Schmitt,R.C., Smart,J.E. and Lewis,J.B.
  TITLE     Early region 1B of adenovirus serotype 2 encodes two co-terminal
            proteins of 495 and 155 amino acid residues
  JOURNAL   Unpublished (1984)
REFERENCE   60 (sites)
  AUTHORS   Virtanen,A. and Pettersson,U.
  JOURNAL   Unpublished (1984)
COMMENT     [1]  RNA.
            [2]  sites; cds start for the hexon protein.
            [4]  sites; 5' terminus of VA I RNA.
            [6]  mRNA and DNA.
            [11]  cDNA to hexon mRNA.
            [12]  several fragments over this span;.
            [13]  mRNA and DNA.
            [19]  cDNA.
            [16]  sites; acceptor splice site for fiber mRNA.
            [18]  several leader fragments over this span.
            [27]  sites; splice sites for E1b mRNAs.
            [28]  sites; cds start for E3 19K glycoprotein.
            [29]  sites; cds start for 15K, IX and fiber polypeptides. [37]
            sites; cap site for E4 mrnas.
            [39]  sites; splice sites in E2a mRNA.
            [31]  fragments over this span.
            [34]  sites; splice site in 52,55K-pept mRNA.
            [30]  sites; splice sites in IVa2 mRNA, Ad5.
            [43]  cDNA and DNA.
            [42]  sites; splice site for 'i' leader.
            [46]  mRNA and DNA.
            [41]  sites; E1a mutational analysis.
            [55]  sites; splice sites for 33K mRNA.
            [56]  sites; cds start for E4 11K-pept, ad5.
            [60]  sites; cds start for the 13.6K-pept.
            [61]  sites; splice sites for 72K and 100K mRNAs.
            [50]  sites; splice sites for leaders; poly-A sites. [52]  sites;
            splice sites for E1a mRNAs.
            [58]  sites; splice sites in E2 mRNA.
            [53]  sites; H2ts1 mutation between 57.0% and 69.0%. [49]  H2ts125
            strain.
            [68]  sites; cds start for E1a proteins.
            [69]  sites; splice sites in E4 region.
            [71]  sites; splice sites in E4 region; poly-A site for E4 mRNAs.
            [63]  sites; cds start for 57K-pept.
            [63]  sites; splice sites in E4 region; poly-A site for E4 mRNAs.
            [63]  sites; splice sites in E1b region.
            [(in) Doerfler,W. (Ed.);Adenovirus DNA: 1-51;Martinus Nijhoff
            Publishing, Boston ]  review; bases 1 to 35937.
            [75]  sites; recombination analysis of ad2 and ad5. [74]  sites;
            splice sites in major late mRNA.
            [73]  sites; IVa2 transcription start.
            [72]  sites; transcription start for EIa mRNAs.
            [70]  sites; E3 11.6 -K protein.
            [76]  sites; L3 mRNA polyadenylation site.
            [78]  sites; L3 mRNA polyadenylation site.
            Communicated on tape by R. Roberts.  That tape and [(in) Doerfler,
            W. (Ed.);Adenovirus DNA: 1-51;Martinus Nijhoff Publishing, Boston ]
            are the
            immediate sources of the annotation herein.
            A consensus sequence for the l-strand of the genome is shown.
            Population heterogeneity as distinct from strain variation is known
            (35937 +/- 9 bp) [67]; both are annotated as 'variation' below. For
            site differences with adenovirus type 5, see loci beginning <ad5>
            which are arranged in the library according to the map coordinates
            of <ad2> where one map unit corresponds to 360 bases throughout
            (see [44],[67]).  For mutational changes in the ad2 sequence, see
            the appropriate references above.
            The origin of replication is located in the first fifty bases from
            each end.
            Transcription is leftward off the l-strand and rightward off the
            r-strand; in the former case, the annotation shows '(c)' for
            complementary strand.  Complex splicing events give rise to perhaps
            fifty or more distinct mRNA transcripts at early, intermediate and
            late times after infection, many of which are still being
            characterized; in particular, some transcripts are known from
            electron microscopy which are not yet characterized at the sequence
            level.  To date nine mRNA start sites (cap sites) have been
            identified, and these define the general units of mRNAs under which
            all known transcripts are classified.
            From the r-strand, the early transcripts are E1a, E1b and E3.  The
            28 kb late transcript called herein 'major late mRNA' comprises
            five families, L1 through L5, of 3' co-terminal mRNAs.  L1, and to
            a lesser extent L2, can be expressed at early and intermediate
            times [34].  Transcripts from this region contain a common
            tripartite leader sequence at their 5' ends: the three segments of
            this leader are encoded at bases 6039-6079, 7101-7172 and
            9634-9723.  At early and intermediate times, an extra leader
            segment, the 'i' leader, is frequently present (bases 7942-8381).
            The IX message, the only unspliced message in ad2, is intermediate,
            and its termination overlaps that for E1b on the same strand and
            that for IVa2, and most likely E2b, on the opposite strand. From
            the l-strand, or the 'comp strand', early expression derives from
            the E2a, E2b and E4 families of mRNAs, although there can be late
            transcription from E2a.  The E2b cap sites, splice sites and
            termination sites have not been determined at the sequence level.
            From electron microscopy there is evidence that the E2b mRNAs may
            originate at the E2a early cap site at 27092 (c) and terminate at
            the poly-A addition site found for the IVa2 mRNA at 4050 (c) [44].
            IVa2 is an intermediate message.  The promoters for these nine
            classes of mRNAs can be localized and characterized to the
            following extent [32]:
                mRNA          cap site      possible promoter region
               ------        ----------    -----------------------------
                E1a             498         tatttata at 468-474
                E1b            1699         tatataat at 1669-1676
                IX             3576         tatataa at 3545-3551
                major late     6039         tataaaa at 6008-6014
                E3            27609         tataa at 27580-27584
                E4            35609 (c)     tatatata at 35641-35633 (c)
                E2a early     27092 (c)     no obvious sequence for 100 bases
                                            upstream
                E2a late      25956 (c)     tacaaattt at 25985-25977 (c)
                IVa2           5826 (c)     no obvious sequence for 100 bases
                                            upstream
            The mRNA responsible for the 13.6K protein encoded at 7968 has not
            been identified.  The VA I and VA II transcripts are unique in that
            they are generated by RNA polymerase III; for a discussion of these
            low molecular weight RNAs-- the modulation of their start points,
            their promoters, their heterogeneity and their similarity to tRNA--
            see [3],[4],[5],[26] and <ad5a2>.
            The proteins known to be encoded from these mRNAs are given in the
            Features table below, though the details of translation and
            processing have not been fully determined.  In cases such as the
            IIIa peptide or the 11K peptide, the exact span of the coding
            awaits elucidation of the mRNA splicing.  Some of these products
            share reading frames and therefore manifest partial homologies. The
            following table summarizes the unidentified reading frames ('URF')
            of 100 or more amino acids:
                initiator    terminator      frame    protein encoded
                -----------  ----------     -------   -----------------
                 6280           6600           1      11.6K URF
                17284          17763           1      17.4K URF
                23782          24138           1      12.9K URF
                24481          24867           1      14.2K URF
                26044          26826           1      28.6K URF(contains the
                                                      N-terminus of 33K cds)
                30973          32778           1      63.9K URF(contains the
                                                      fiber cds)
                10421          10834           2      14.4K URF
                20504          20935           2      15.7K URF
                27899          28222           2      12.4K URF
                30059          30451           2      14.5k URF
                33956          34456           2      18.8K URF
                 9294           9800           3      17.7K URF
                23526          26525           3      110.2K URF(contains the
                                                      100K-pept cds)
                30444          30830           3      14.7K URF
                34470          34808           3      12.7K URF
                complementary strand
               ---------------------------
                35532          35146           1      14.3K URF
                34077          33193           1      34.1K URF
                11109          10744           1      12.8K URF
                 9030           8383           1      22.8K URF
                 6780           6442           1      12.8K URF
                31604          31290           2      10.7K URF
                31211          30852           2      13.5K URF
                18707          18159           2      18.9K URF
                14861          14424           2      16.4K URF
                14114          13728           2      13.5K URF
                11618          11250           2      13.6K URF
                 1712 1194 2 18.1K URF
                35113          34703           3      15.3K URF
                34342          33998           3      13.3K URF
                 5674           5327           3      12.2K URF Additionally
            there are numerous unidentified reading frames of less than 100
            amino acid residues; and further small modifications of a few of
            the coding sequences are possible.
            [7] missing data project.
FEATURES             Location/Qualifiers
     source          1..3171
                     /organism="Human adenovirus 2"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:10515"
     gene            complement(<1..>3171)
                     /gene="E2b"
     mRNA            complement(<1..>3171)
                     /gene="E2b"
                     /citation=[32]
     prim_transcript complement(<1..640)
                     /gene="IVa2"
                     /citation=[32]
     gene            complement(<1..509)
                     /gene="IVa2"
     CDS             complement(join(<1..231,510..522))
                     /gene="IVa2"
                     /codon_start=1
                     /product="virion morphogenesis-associated protein"
                     /protein_id="AAA92205.1"
                     /db_xref="GI:209817"
                     /translation="METRGRRPAALQHQQDQPQAHPGQRAARSAPLHRDPDYADEDPA
                     PVERHDPGPSGRAPTTAVQRKPPQPAKRGDMLDRDAVEHVTELWDRLELLGQTLKSMP
                     TADGLKPLKNFASLQELLSLGGERLLAHLVRENMQVRDMLNEVAPLLRDDGSCSSLNY
                     QLQPVIGVIYGPTGCGKSQLLRNLLSSQLISPTPETVFFIAPQVDMIPPSELKAWEMQ
                     ICEGNYAPGPDGTIIPQSGTLRPRFVKMAYDDLILEHNYDVSDPRNIFAQAAARGPIA
                     IIMDECMENLGGHKGVSKFFHAFPSKLHDKFPKCTGYTVLVVLHNMNPRRDMAGNIAN
                     LKIQSKMHLISPRMHPSQLNRFVNTYTKGLPLAISLLLKDIFRHHAQRSCYDWIIYNT
                     TPQHEALQWCYLHPRDGLMPMYLNIQSHLYHVLEKIHRTLNDRDRWSRAYRARKTPK"
     CDS             complement(1..3171)
                     /gene="DNA polymerase"
                     /codon_start=1
                     /product="DNA polymerase"
                     /protein_id="AAA92206.1"
                     /db_xref="GI:209822"
                     /translation="MDSSHLRDVVIKLRPPRADIWTLGSRGVVVRSTITPLEQPDGQG
                     QAAEVEDHQPNPPGEGLKFPLCFLVRGRQVNLVQDVQPVHRCQYCARFYKSQHECSAR
                     RRDFYFHHINSHSSNWWREIQFFPIGSHPRTERLFVTYDVETYTWMGAFGKQLVPFML
                     VMKFGGDEPLVTAARDLAVDLGWDRWEQDPLTFYCITPEKMAIGRQFRTFRDHLQMLM
                     ARDLWSSFVASNPHLADWALSEHGLSSPEELTYEELKKLPSIKGTPRFLELYIVGHNI
                     NGFDEIVLAAQVINNRSEVPGPFRITRNFMPRAGKILFNDVTFALPNPRSKKRTDFLL
                     WEQGGCDDTDFKYQYLKVMVRDTFALTHTSLRKAAQAYALPVEKGCCAYQAVNQFYML
                     GSYRSEADGFPIQEYWKDREEFVLNRELWKKKGQDKYDIIKETLDYCALDVQVTAELV
                     NKLRDSYASFVRDAVGLTDASFNVFQRPTISSNSHAIFRQIVFRAEQPARSNLGPDLL
                     APSHELYDYVRASIRGGRCYPTYLGILREPLYVYDICGMYASALTHPMPWGPPLNPYE
                     RALAARAWQQALDLQGCKIDYFDARLLPGVFTVDADPPDETQLDPLPPFCSRKGGRLC
                     WTNERLRGEVATSVDLVTLHNRGWRVHLVPDERTTVFPEWRCVAREYVQLNIAAKERA
                     DRDKNQTLRSIAKLLSNALYGSFATKLDNKKIVFSDQMDAATLKGITAGQVNIKSSSF
                     LETDNLSAEVMPAFEREYSPQQLALADSDAEESEDERAPTPFYSPPSGTPGHVAYTYK
                     PITFLDAEEGDMCLHTLERVDPLVDNDRYPSHLASFVLAWTRAFVSEWSEFLYEEDRG
                     TPLEDRPLKSVYGDTDSLFVTERGHRLMETRGKKRIKKHGGNLVFDPERPELTWLVEC
                     ETVCGACGADAYSPESVFLAPKLYALKSLHCPSCGASSKGKLRAKGHAAEGLDYDTMV
                     KCYLADAQGEDRQRFSTSRTSLKRTLASAQPGAHPFTVTQTTLTRTLRPWKDMTLARL
                     DEHRLLPYSESRPNPRNEEICWIEMP"
     gene            complement(1..2027)
                     /gene="DNA polymerase"
     intron          complement(232..509)
                     /gene="IVa2"
                     /citation=[17]
     prim_transcript 853..>3171
                     /citation=[25]
     gene            853..>3171
                     /gene="L5"
     prim_transcript 853..>3171
                     /gene="L4"
                     /citation=[25]
     gene            853..>3171
                     /gene="L4"
     prim_transcript 853..>3171
                     /gene="L4"
                     /citation=[22]
     gene            853..>3171
                     /gene="L3_1"
     prim_transcript 853..>3171
                     /gene="L3_1"
                     /citation=[24]
     gene            853..>3171
                     /gene="L2_1"
     prim_transcript 853..>3171
                     /gene="L1"
                     /citation=[25]
     gene            853..>3171
                     /gene="L1"
     intron          894..1914
                     /gene="L1"
                     /note="major late mRNA intron (precedes 2nd leader)"
                     /citation=[8]
     old_sequence    1257
                     /gene="L1"
                     /citation=[15]
     conflict        1388..1389
                     /gene="L1"
                     /citation=[15]
                     /replace=""
     intron          1987..2755
                     /gene="DNA polymerase"
                     /note="major late mRNA intron (precedes 'i' leader)"
                     /citation=[8]
     old_sequence    2026..2027
                     /gene="L1"
                     /citation=[14]
     CDS             2782..>3171
                     /gene="L1"
                     /codon_start=1
                     /product="13.6 kD protein"
                     /protein_id="AAA92207.1"
                     /db_xref="GI:209823"
                     /translation="MRADREELDLPPPVGGVAVDVVKVEVPATGRTLVLAFVKTCAVL
                     AAVHGLYILHEVDLTTAHKEAEWEFEPLAWRVWLVVFYFGCLSLTVWLLEGSYGGSDH
                     HAARAQSPDVRARRSELDDNIAQMGAVHGLELPRRQVRRELLQVYLA"
ORIGIN      
        1 ctacggcatc tcgatccagc atatctcctc gtttcgcggg ttggggcggc tttcgctgta
       61 cggcagtagt cggtgctcgt ccagacgggc cagggtcatg tctttccacg ggcgcagggt
      121 cctcgtcagc gtagtctggg tcacggtgaa ggggtgcgct ccgggctgcg cgctggccag
      181 ggtgcgcttg aggctggtcc tgctggtgct gaagcgctgc cggtcttcgc cctgcgcgtc
      241 ggccaggtag catttgacca tggtgtcata gtccagcccc tccgcggcgt ggcccttggc
      301 gcgcagcttg cccttggagg aggcgccgca cgaggggcag tgcagacttt taagggcgta
      361 gagcttgggc gcgagaaata ccgattccgg ggagtaggca tccgcgccgc aggccccgca
      421 gacggtctcg cattccacga gccaggtgag ctctggccgt tcggggtcaa aaaccaggtt
      481 tcccccatgc tttttgatgc gtttcttacc tctggtttcc atgagccggt gtccacgctc
      541 ggtgacgaaa aggctgtccg tgtccccgta tacagacttg agaggcctgt cctcgagcgg
      601 tgttccgcgg tcctcctcgt atagaaactc ggaccactct gagacgaagg ctcgcgtcca
      661 ggccagcacg aaggaggcta agtgggaggg gtagcggtcg ttgtccacta gggggtccac
      721 tcgctccagg gtgtgaagac acatgtcgcc ctcttcggca tcaaggaagg tgattggttt
      781 ataggtgtag gccacgtgac cgggtgttcc tgaagggggg ctataaaagg gggtgggggc
      841 gcgttcgtcc tcactctctt ccgcatcgct gtctgcgagg gccagctgtt ggggtgagta
      901 ctccctctca aaagcgggca tgacttctgc gctaagattg tcagtttcca aaaacgagga
      961 ggatttgata ttcacctggc ccgcggtgat gcctttgagg gtggccgcgt ccatctggtc
     1021 agaaaagaca atctttttgt tgtcaagctt ggtggcaaac gacccgtaga gggcgttgga
     1081 cagcaacttg gcgatggagc gcagggtttg gtttttgtcg cgatcggcgc gctccttggc
     1141 cgcgatgttt agctgcacgt attcgcgcgc aacgcaccgc cattcgggaa agacggtggt
     1201 gcgctcgtcg ggcaccaggt gcacgcgcca accgcggttg tgcagggtga caaggtcaac
     1261 gctggtggct acctctccgc gtaggcgctc gttggtccag cagaggcggc cgcccttgcg
     1321 cgaacagaat ggcggtagtg ggtctagctg cgtctcgtcc ggggggtctg cgtccacggt
     1381 aaagaccccg ggcagcaggc gcgcgtcgaa gtagtctatc ttgcatcctt gcaagtctag
     1441 cgcctgctgc catgcgcggg cggcaagcgc gcgctcgtat gggttgagtg ggggacccca
     1501 tggcatgggg tgggtgagcg cggaggcgta catgccgcaa atgtcgtaaa cgtagagggg
     1561 ctctctgagt attccaagat atgtagggta gcatcttcca ccgcggatgc tggcgcgcac
     1621 gtaatcgtat agttcgtgcg agggagcgag gaggtcggga ccgaggttgc tacgggcggg
     1681 ctgctctgct cggaagacta tctgcctgaa gatggcatgt gagttggatg atatggttgg
     1741 acgctggaag acgttgaagc tggcgtctgt gagacctacc gcgtcacgca cgaaggaggc
     1801 gtaggagtcg cgcagcttgt tgaccagctc ggcggtgacc tgcacgtcta gggcgcagta
     1861 gtccagggtt tccttgatga tgtcatactt atcctgtccc ttttttttcc acagctcgcg
     1921 gttgaggaca aactcttcgc ggtctttcca gtactcttgg atcggaaacc cgtcggcctc
     1981 cgaacggtaa gagcctagca tgtagaactg gttgacggcc tggtaggcgc agcatccctt
     2041 ttctacgggt agcgcgtatg cctgcgcggc cttccggagc gaggtgtggg tgagcgcaaa
     2101 ggtgtcccta accatgactt tgaggtactg gtatttgaag tcagtgtcgt cgcatccgcc
     2161 ctgctcccag agcaaaaagt ccgtgcgctt tttggaacgc gggtttggca gggcgaaggt
     2221 gacatcgttg aaaagtatct ttcccgcgcg aggcataaag ttgcgtgtga tgcggaaggg
     2281 tcccggcacc tcggaacggt tgttaattac ctgggcggcg agcacgatct cgtcgaagcc
     2341 gttgatgttg tggcccacga tgtaaagttc caagaagcgc ggggtgccct tgatggaggg
     2401 caatttttta agttcctcgt aggtgagctc ctcaggggag ctgagcccgt gttctgacag
     2461 ggcccagtct gcaagatgag ggttggaagc gacgaatgag ctccacaggt cacgggccat
     2521 tagcatttgc aggtggtcgc gaaaggtcct aaactggcga cctatggcca ttttttctgg
     2581 ggtgatgcag tagaaggtaa gcgggtcttg ttcccagcgg tcccatccaa ggtccacggc
     2641 taggtctcgc gcggcggtca ccagaggctc atctccgccg aacttcataa ccagcatgaa
     2701 gggcacgagc tgcttcccaa aggcccccat ccaagtatag gtctctacat cgtaggtgac
     2761 aaagagacgc tcggtgcgag gatgcgagcc gatcgggaag aactggatct cccgccacca
     2821 gttggaggag tggctgttga tgtggtgaaa gtagaagtcc ctgcgacggg ccgaacactc
     2881 gtgctggctt ttgtaaaaac gtgcgcagta ctggcagcgg tgcacgggct gtacatcctg
     2941 cacgaggttg acctgacgac cgcgcacaag gaagcagagt gggaatttga gcccctcgcc
     3001 tggcgggttt ggctggtggt cttctacttc ggctgcttgt ccttgaccgt ctggctgctc
     3061 gaggggagtt atggtggatc ggaccaccac gccgcgcgag cccaaagtcc agatgtccgc
     3121 gcgcggcggt cggagcttga tgacaacatc gcgcagatgg gagctgtcca t
//