LOCUS ADRCG 35937 bp DNA linear VRL 14-MAR-1996
DEFINITION Adenovirus type 2, complete genome.
ACCESSION J01917 J01918-J01957 K00086 K00394 K00395 K02367 M13004
V00007-V00020 V00023 V00024
VERSION J01917.1 GI:209811
KEYWORDS DNA polymerase; DNA-binding protein; RNA polymerase III;
alternative splicing; coat protein; complete genome; genome-linked
protein; glycoprotein; overlapping genes; polymerase; terminal
repeat; unidentified reading frame; virus-associated RNA.
SOURCE Human adenovirus 2
ORGANISM Human adenovirus 2
Viruses; dsDNA viruses, no RNA stage; Adenoviridae; Mastadenovirus;
Human adenovirus C.
REFERENCE 1 (bases 10610 to 10766)
AUTHORS Ohe,K. and Weissman,S.M.
TITLE The nucleotide sequence of a low molecular weight ribonucleic acid
from cells infected with adenovirus 2
JOURNAL J. Biol. Chem. 246 (22), 6991-7009 (1971)
PUBMED 4331202
REFERENCE 2 (sites)
AUTHORS Jornvall,H., Ohlsson,H. and Philipson,L.
TITLE An acetylated N-terminus of adenovirus type 2 hexon protein
JOURNAL Biochem. Biophys. Res. Commun. 56 (2), 304-310 (1974)
PUBMED 4823869
REFERENCE 3 (bases 10514 to 10680)
AUTHORS Pan,J., Celma,M.L. and Weissman,S.M.
TITLE Studies of low molecular weight RNA from cells infected with
adenovirus 2. III. The sequence of the promoter for VA-RNA I
JOURNAL J. Biol. Chem. 252 (24), 9047-9054 (1977)
PUBMED 200627
REFERENCE 4 (bases 10681 to 10813)
AUTHORS Celma,M.L., Pan,J. and Weissman,S.M.
TITLE Studies of low molecular weight RNA from cells infected with
adenovirus 2. I. The sequences at the 3' end of VA-RNA I
JOURNAL J. Biol. Chem. 252 (24), 9032-9042 (1977)
PUBMED 318160
REFERENCE 5 (sites)
AUTHORS Celma,M.L., Pan,J. and Weissman,S.M.
TITLE Studies of low molecular weight RNA from cells infected with
adenovirus 2. II. Heterogeneity at the 5' end of VA-RNA I
JOURNAL J. Biol. Chem. 252 (24), 9043-9046 (1977)
PUBMED 925035
REFERENCE 6 (bases 30812 to 30900)
AUTHORS Zain,B.S. and Roberts,R.J.
TITLE Characterization and sequence analysis of a recombination site in
the hybrid virus Ad2+ND
JOURNAL J. Mol. Biol. 120 (1), 13-31 (1978)
PUBMED 205651
REFERENCE 7 (bases 18778 to 18918)
AUTHORS Akusjarvi,G. and Pettersson,U.
TITLE Nucleotide sequence at the junction between the coding region of
the adenovirus 2 hexon messenger RNA and its leader sequence
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 75 (12), 5822-5826 (1978)
PUBMED 83649
REFERENCE 8 (bases 5986 to 6236)
AUTHORS Ziff,E.B. and Evans,R.M.
TITLE Coincidence of the promoter and capped 5' terminus of RNA from the
adenovirus 2 major late transcription unit
JOURNAL Cell 15 (4), 1463-1475 (1978)
PUBMED 729003
REFERENCE 9 (bases 21607 to 21816)
AUTHORS Akusjarvi,G. and Pettersson,U.
TITLE Sequence analysis of adenovirus DNA. I. Nucleotide sequence at the
carboxy-terminal end of the gene for adenovirus type 2 hexon
JOURNAL Virology 91 (2), 477-480 (1978)
PUBMED 741658
REFERENCE 10 (bases 1 to 156; 35804 to 35937)
AUTHORS Arrand,J.R. and Roberts,R.J.
TITLE The nucleotide sequences at the termini of adenovirus-2 DNA
JOURNAL J. Mol. Biol. 128 (4), 577-594 (1979)
PUBMED 318164
REFERENCE 11 (bases 6039 to 6079; 7101 to 7172; 9634 to 9723; 18802 to 18861)
AUTHORS Akusjarvi,G. and Pettersson,U.
TITLE Sequence analysis of adenovirus DNA: complete nucleotide sequence
of the spliced 5' noncoding region of adenovirus 2 hexon messenger
RNA
JOURNAL Cell 16 (4), 841-850 (1979)
PUBMED 455452
REFERENCE 12 (bases 6039 to 31095)
AUTHORS Zain,S., Sambrook,J., Roberts,R.J., Keller,W., Fried,M. and
Dunn,A.R.
TITLE Nucleotide sequence analysis of the leader segments in a cloned
copy of adenovirus 2 fiber mRNA
JOURNAL Cell 16 (4), 851-861 (1979)
PUBMED 455453
REFERENCE 13 (bases 1 to 110; 35835 to 35937)
AUTHORS Shinagawa,M. and Padmanabhan,R.
TITLE Nucleotide sequence at the inverted terminal repetition of
adenovirus type 2 DNA
JOURNAL Biochem. Biophys. Res. Commun. 87 (3), 671-678 (1979)
PUBMED 454416
REFERENCE 14 (bases 25634 to 27376)
AUTHORS Galibert,F., Herisse,J. and Courtois,G.
TITLE Nucleotide sequence of the EcoRI-F fragment of adenovirus 2 genome
JOURNAL Gene 6 (1), 1-22 (1979)
PUBMED 478297
REFERENCE 15 (sites)
AUTHORS Zain,B.S. and Roberts,R.J.
TITLE Sequences from the beginning of the fiber messenger RNA of
adenovirus-2
JOURNAL J. Mol. Biol. 131 (2), 341-352 (1979)
PUBMED 318165
REFERENCE 16 (bases 26977 to 27178)
AUTHORS Baker,C.C., Herisse,J., Courtois,G., Galibert,F. and Ziff,E.
TITLE Messenger RNA for the Ad2 DNA binding protein: DNA sequences
encoding the first leader and heterogenity at the mRNA 5' end
JOURNAL Cell 18 (2), 569-580 (1979)
PUBMED 227610
REFERENCE 17 (bases 5909 to 6178; 7023 to 7212; 9452 to 9836)
AUTHORS Akusjarvi,G. and Pettersson,J.
TITLE Sequence analysis of adenovirus DNA. IV. The genomic sequences
encoding the common tripartite leader of late adenovirus messenger
RNA
JOURNAL J. Mol. Biol. 134 (1), 143-158 (1979)
PUBMED 318166
REFERENCE 18 (bases 513 to 1111; 1226 to 1630)
AUTHORS Perricaudet,M., Akusjarvi,G., Virtanen,A. and Pettersson,U.
TITLE Structure of two spliced mRNAs from the transforming region of
human subgroup C adenoviruses
JOURNAL Nature 281 (5733), 694-696 (1979)
PUBMED 551290
REFERENCE 19 (bases 6039 to 31080)
AUTHORS Zain,S., Gingeras,T.R., Bullock,P., Wong,G. and Gelinas,R.E.
TITLE Determination and analysis of adenovirus-2 DNA sequences which may
include signals for late messenger RNA processing
JOURNAL J. Mol. Biol. 135 (2), 413-433 (1979)
PUBMED 318167
REFERENCE 20 (bases 5848 to 6578)
AUTHORS Baker,C.C. and Ziff,E.B.
TITLE Biogenesis, structures, and sites of encoding of the 5' termini of
adenovirus-2 mRNAs
JOURNAL Cold Spring Harb. Symp. Quant. Biol. 44 Pt 1,, 415-428 (1980)
PUBMED 6933038
REFERENCE 21 (bases 3504 to 4109)
AUTHORS Alestrom,P., Akusjarvi,G., Perricaudet,M., Mathews,M.B.,
Klessig,D.F. and Pettersson,U.
TITLE The gene for polypeptide IX of adenovirus type 2 and its unspliced
messenger RNA
JOURNAL Cell 19 (3), 671-681 (1980)
PUBMED 6985473
REFERENCE 22 (bases 35360 to 35937)
AUTHORS Shinagawa,M., Padmanabhan,R.V. and Padmanabhan,R.
TITLE The nucleotide sequence of the right-hand terminal SmaI-K fragment
of adenovirus type 2 DNA
JOURNAL Gene 9 (1-2), 99-114 (1980)
PUBMED 6247249
REFERENCE 23 (bases 10514 to 11065)
AUTHORS Akusjarvi,G., Mathews,M.B., Andersson,P., Vennstrom,B. and
Pettersson,U.
TITLE Structure of genes for virus-associated RNAI and RNAII of
adenovirus type 2
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 77 (5), 2424-2428 (1980)
PUBMED 6930642
REFERENCE 24 (bases 27373 to 30050)
AUTHORS Herisse,J., Courtois,G. and Galibert,F.
TITLE Nucleotide sequence of the EcoRI D fragment of adenovirus 2 genome
JOURNAL Nucleic Acids Res. 8 (10), 2173-2192 (1980)
PUBMED 6253880
REFERENCE 25 (sites)
AUTHORS Perricaudet,M., Le Moullec,J.M. and Pettersson,U.
TITLE Predicted structure of two adenovirus tumor antigens
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 77 (7), 3778-3782 (1980)
PUBMED 6253988
REFERENCE 26 (sites)
AUTHORS Anderson,C.W. and Lewis,J.B.
TITLE Amino-terminal sequence of adenovirus type 2 proteins: hexon,
fiber, component IX, and early protein 1B-15K
JOURNAL Virology 104 (1), 27-41 (1980)
PUBMED 6985474
REFERENCE 27 (sites)
AUTHORS Persson,H., Jornvall,H. and Zabielski,J.
TITLE Multiple mRNA species for the precursor to an adenovirus-encoded
glycoprotein: identification and structure of the signal sequence
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 77 (11), 6349-6353 (1980)
PUBMED 6985478
REFERENCE 28 (sites)
AUTHORS Van Etten,R.A., Walberg,M.W. and Clayton,D.A.
TITLE Precise localization and nucleotide sequence of the two mouse
mitochondrial rRNA genes and three immediately adjacent novel tRNA
genes
JOURNAL Cell 22 (1 Pt 1), 157-170 (1980)
PUBMED 7428037
REFERENCE 29 (bases 22305 to 22600)
AUTHORS Buttner,W. and Veres-Molnar,Z.
TITLE Localization of the 3'-terminal end of the EcoRI B
fragment-specific early mRNA of adenovirus type 2
JOURNAL FEBS Lett. 122 (2), 317-321 (1980)
PUBMED 6258984
REFERENCE 30 (bases 21607 to 22770)
AUTHORS Akusjarvi,G., Zabielski,J., Perricaudet,M. and Pettersson,U.
TITLE The sequence of the 3' non-coding region of the hexon mRNA
discloses a novel adenovirus gene
JOURNAL Nucleic Acids Res. 9 (1), 1-17 (1981)
PUBMED 6259616
REFERENCE 31 (bases 459 to 608)
AUTHORS Osborne,T.F., Schell,R.E., Burch-Jaffe,E., Berget,S.J. and
Berk,A.J.
TITLE Mapping a eukaryotic promoter: a DNA sequence required for in vivo
expression of adenovirus pre-early functions
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 78 (3), 1381-1385 (1981)
PUBMED 6262792
REFERENCE 32 (bases 30047 to 32268)
AUTHORS Herisse,J. and Galibert,F.
TITLE Nucleotide sequence of the EcoRI E fragment of adenovirus 2 genome
JOURNAL Nucleic Acids Res. 9 (5), 1229-1240 (1981)
PUBMED 6262722
REFERENCE 33 (sites)
AUTHORS Hashimoto,S., Pursley,M.H. and Green,M.
TITLE Nucleotide sequences and mapping of novel heterogenous 5'-termini
of adenovirus 2 early region 4 mRNA
JOURNAL Nucleic Acids Res. 9 (7), 1675-1689 (1981)
PUBMED 6164992
REFERENCE 34 (bases 17878 to 18918)
AUTHORS Akusjarvi,G. and Persson,H.
TITLE Gene and mRNA for precursor polypeptide VI from adenovirus type 2
JOURNAL J. Virol. 38 (2), 469-482 (1981)
PUBMED 6985479
REFERENCE 35 (bases 5817 to 6051; 35358 to 35707)
AUTHORS Baker,C.C. and Ziff,E.B.
TITLE Promoters and heterogeneous 5' termini of the messenger RNAs of
adenovirus serotype 2
JOURNAL J. Mol. Biol. 149 (2), 189-221 (1981)
PUBMED 6927849
REFERENCE 36 (bases 18838 to 21744)
AUTHORS Jornvall,H., Alestrom,P., Akusjarvi,G., von Bahr-Lindstrom,H.,
Philipson,L. and Pettersson,U.
TITLE Order of the CNBr fragments in the adenovirus hexon protein
JOURNAL J. Biol. Chem. 256 (12), 6204-6212 (1981)
PUBMED 7240199
REFERENCE 37 (sites)
AUTHORS Akusjarvi,G. and Persson,H.
TITLE Controls of RNA splicing and termination in the major late
adenovirus transcription unit
JOURNAL Nature 292 (5822), 420-426 (1981)
PUBMED 7254339
REFERENCE 38 (bases 32263 to 35937)
AUTHORS Herisse,J., Rigolet,M., de Dinechin,S.D. and Galibert,F.
TITLE Nucleotide sequence of adenovirus 2 DNA fragment encoding for the
carboxylic region of the fiber protein and the entire E4 region
JOURNAL Nucleic Acids Res. 9 (16), 4023-4042 (1981)
PUBMED 6985482
REFERENCE 39 (sites)
AUTHORS Kruijer,W., van Schaik,F.M. and Sussenbach,J.S.
TITLE Structure and organization of the gene coding for the DNA binding
protein of adenovirus type 5
JOURNAL Nucleic Acids Res. 9 (18), 4439-4457 (1981)
PUBMED 6117824
REFERENCE 40 (sites)
AUTHORS van Beveren,C.P., Maat,J., Dekker,B.M. and van Ormondt,H.
TITLE The nucleotide sequence of the gene for protein IVa2 and of the 5'
leader segment of the major late mRNAs of adenovirus type 5
JOURNAL Gene 16 (1-3), 179-189 (1981)
PUBMED 7343420
REFERENCE 41 (sites)
AUTHORS Uhlen,M., Svensson,C., Josephson,S., Alestrom,P.,
Chattapadhyaya,J.B., Pettersson,U. and Philipson,L.
TITLE Leader arrangement in the adenovirus fiber mRNA
JOURNAL EMBO J. 1 (2), 249-254 (1982)
PUBMED 6897915
REFERENCE 42 (bases 1517 to 1696; 3932 to 4112; 17880 to 17975; 21142 to
28259)
AUTHORS Fraser,N.W., Baker,C.C., Moore,M.A. and Ziff,E.B.
TITLE Poly(A) sites of adenovirus serotype 2 transcription units
JOURNAL J. Mol. Biol. 155 (3), 207-233 (1982)
PUBMED 6176714
REFERENCE 43 (bases 7869 to 8420)
AUTHORS Virtanen,A., Alestrom,P., Persson,H., Katze,M.G. and Pettersson,U.
TITLE An adenovirus agnogene
JOURNAL Nucleic Acids Res. 10 (8), 2539-2548 (1982)
PUBMED 6281734
REFERENCE 44 (sites)
AUTHORS Osborne,T.F., Gaynor,R.B. and Berk,A.J.
TITLE The TATA homology and the mRNA 5' untranslated sequence are not
required for expression of essential adenovirus E1A functions
JOURNAL Cell 29 (1), 139-148 (1982)
PUBMED 7105179
REFERENCE 45 (bases 22469 to 24125)
AUTHORS Kruijer,W., Van Schaik,F.M. and Sussenbach,J.S.
TITLE Nucleotide sequence of the gene encoding adenovirus type 2 DNA
binding protein
JOURNAL Nucleic Acids Res. 10 (15), 4493-4500 (1982)
PUBMED 6985485
REFERENCE 46 (bases 1 to 11600; 32092 to 35937)
AUTHORS Gingeras,T.R., Sciaky,D., Gelinas,R.E., Bing-Dong,J., Yen,C.E.,
Kelly,M.M., Bullock,P.A., Parsons,B.L., O'Neill,K.E. and
Roberts,R.J.
TITLE Nucleotide sequences from the adenovirus-2 genome
JOURNAL J. Biol. Chem. 257 (22), 13475-13491 (1982)
PUBMED 7142161
REFERENCE 47 (bases 5778 to 11560)
AUTHORS Alestrom,P., Akusjarvi,G., Pettersson,M. and Pettersson,U.
TITLE DNA sequence analysis of the region encoding the terminal protein
and the hypothetical N-gene product of adenovirus type 2
JOURNAL J. Biol. Chem. 257 (22), 13492-13498 (1982)
PUBMED 7142162
REFERENCE 48 (bases 27609 to 27980; 28376 to 29792)
AUTHORS Ahmed,C.M., Chanda,R.S., Stow,N.D. and Zain,B.S.
TITLE The nucleotide sequence of mRNA for the Mr 19 000 glycoprotein from
early gene block III of adenovirus 2
JOURNAL Gene 20 (3), 339-346 (1982)
PUBMED 6985487
REFERENCE 49 (bases 31030 to 32775)
AUTHORS Boudin,M.L., Rigolet,M., Lemay,P., Galibert,F. and Boulanger,P.
TITLE Biochemical and genetical characterization of a fiber-defective
temperature-sensitive mutant of type 2 adenovirus
JOURNAL EMBO J. 2 (11), 1921-1927 (1983)
PUBMED 6571701
REFERENCE 50 (sites)
AUTHORS Oosterom-Dragon,E.A. and Anderson,C.W.
TITLE Polypeptide structure and encoding location of the adenovirus
serotype 2 late, nonstructural 33K protein
JOURNAL J. Virol. 45 (1), 251-263 (1983)
PUBMED 6296434
REFERENCE 51 (bases 7929 to 8423)
AUTHORS Falvey,E. and Ziff,E.
TITLE Sequence arrangement and protein coding capacity of the adenovirus
type 2 'i' leader
JOURNAL J. Virol. 45 (1), 185-191 (1983)
PUBMED 6823011
REFERENCE 52 (sites)
AUTHORS Downey,J.F., Rowe,D.T., Bacchetti,S., Graham,F.L. and Bayley,S.T.
TITLE Mapping of a 14,000-dalton antigen to early region 4 of the human
adenovirus 5 genome
JOURNAL J. Virol. 45 (2), 514-523 (1983)
PUBMED 6834467
REFERENCE 53 (sites)
AUTHORS Goldenberg,C.J. and Hauser,S.D.
TITLE Accurate and efficient in vitro splicing of purified precursor RNAs
specified by early region 2 of the adenovirus 2 genome
JOURNAL Nucleic Acids Res. 11 (5), 1337-1348 (1983)
PUBMED 6186991
REFERENCE 54 (sites)
AUTHORS Virtanen,A. and Pettersson,U.
TITLE The molecular structure of the 9S mRNA from early region 1A of
adenovirus serotype 2
JOURNAL J. Mol. Biol. 165 (3), 496-499 (1983)
PUBMED 6842608
REFERENCE 55 (sites)
AUTHORS Stalhandske,P., Persson,H., Perricaudet,M., Philipson,L. and
Pettersson,U.
TITLE Structure of three spliced mRNAs from region E3 of adenovirus type
2
JOURNAL Gene 22 (2-3), 157-165 (1983)
PUBMED 6307816
REFERENCE 56 (bases 15821 to 16495)
AUTHORS Sung,M.T., Cao,T.M., Coleman,R.T. and Budelier,K.A.
TITLE Gene and protein sequences of adenovirus protein VII, a hybrid
basic chromosomal protein
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80 (10), 2902-2906 (1983)
PUBMED 6574459
REFERENCE 57 (sites)
AUTHORS Lewis,J.B. and Anderson,C.W.
TITLE Proteins encoded near the adenovirus late messenger RNA leader
segments
JOURNAL Virology 127 (1), 112-123 (1983)
PUBMED 6857999
REFERENCE 58 (sites)
AUTHORS Yeh-Kai,L., Akusjarvi,G., Alestrom,P., Pettersson,U., Tremblay,M.
and Weber,J.
TITLE Genetic identification of an endoproteinase encoded by the
adenovirus genome
JOURNAL J. Mol. Biol. 167 (1), 217-222 (1983)
PUBMED 6306252
REFERENCE 59 (bases 17539 to 18177)
AUTHORS Sung,M.T., Cao,T.M., Lischwe,M.A. and Coleman,R.T.
TITLE Molecular processing of adenovirus proteins
JOURNAL J. Biol. Chem. 258 (13), 8266-8272 (1983)
PUBMED 6336325
REFERENCE 60 (sites)
AUTHORS Kruijer,W., van Schaik,F.M., Speijer,J.G. and Sussenbach,J.S.
TITLE Structure and function of adenovirus DNA binding protein:
comparison of the amino acid sequences of the Ad5 and Ad12 proteins
derived from the nucleotide sequence of the corresponding genes
JOURNAL Virology 128 (1), 140-153 (1983)
PUBMED 6308889
REFERENCE 61 (bases 13898 to 14231)
AUTHORS Le Moullec,J.M., Akusjarvi,G., Stalhandske,P., Pettersson,U.,
Chambraud,B., Gilardi,P., Nasri,M. and Perricaudet,M.
TITLE Polyadenylic acid addition sites in the adenovirus type 2 major
late transcription unit
JOURNAL J. Virol. 48 (1), 127-134 (1983)
PUBMED 6136617
REFERENCE 62 (bases 18616 to 19233)
AUTHORS Mautner,V. and Boursnell,M.E.
TITLE Recombination in adenovirus: DNA sequence analysis of crossover
sites in intertypic recombinants
JOURNAL Virology 131 (1), 1-10 (1983)
PUBMED 6316645
REFERENCE 63 (sites)
AUTHORS Tigges,M.A. and Raskas,H.J.
TITLE Splice junctions in adenovirus 2 early region 4 mRNAs: multiple
splice sites produce 18 to 24 RNAs
JOURNAL J. Virol. 50 (1), 106-117 (1984)
PUBMED 6336328
REFERENCE 64 (sites)
AUTHORS Downey,J.F., Evelegh,C.M., Branton,P.E. and Bayley,S.T.
TITLE Peptide maps and N-terminal sequences of polypeptides from early
region 1A of human adenovirus 5
JOURNAL J. Virol. 50 (1), 30-37 (1984)
PUBMED 6699947
REFERENCE 65 (sites)
AUTHORS Freyer,G.A., Katoh,Y. and Roberts,R.J.
TITLE Characterization of the major mRNAs from adenovirus 2 early region
4 by cDNA cloning and sequencing
JOURNAL Nucleic Acids Res. 12 (8), 3503-3519 (1984)
PUBMED 6145146
REFERENCE 66 (sites)
AUTHORS Leff,T., Elkaim,R., Goding,C.R., Jalinot,P., Sassone-Corsi,P.,
Perricaudet,M., Kedinger,C. and Chambon,P.
TITLE Individual products of the adenovirus 12S and 13S EIa mRNAs
stimulate viral EIIa and EIII expression at the transcriptional
level
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81 (14), 4381-4385 (1984)
PUBMED 6336330
REFERENCE 67 (sites)
AUTHORS Padgett,R.A., Konarska,M.M., Grabowski,P.J., Hardy,S.F. and
Sharp,P.A.
TITLE Lariat RNA's as intermediates and products in the splicing of
messenger RNA precursors
JOURNAL Science 225 (4665), 898-903 (1984)
PUBMED 6206566
REFERENCE 68 (sites)
AUTHORS Natarajan,V., Madden,M.J. and Salzman,N.P.
TITLE Proximal and distal domains that control in vitro transcription of
the adenovirus IVa2 gene
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81 (20), 6290-6294 (1984)
PUBMED 6593702
REFERENCE 69 (sites)
AUTHORS Wold,W.S., Cladaras,C., Magie,S.C. and Yacoub,N.
TITLE Mapping a new gene that encodes an 11,600-molecular-weight protein
in the E3 transcription unit of adenovirus 2
JOURNAL J. Virol. 52 (2), 307-313 (1984)
PUBMED 6492252
REFERENCE 70 (sites)
AUTHORS Mautner,V. and Mackay,N.
TITLE Recombination in adenovirus: analysis of crossover sites in
intertypic overlap recombinants
JOURNAL Virology 139 (1), 43-52 (1984)
PUBMED 6495658
REFERENCE 71 (bases 15033 to 18316)
AUTHORS Alestrom,P., Akusjarvi,G., Lager,M., Yeh-kai,L. and Pettersson,U.
TITLE Genes encoding the core proteins of adenovirus type 2
JOURNAL J. Biol. Chem. 259 (22), 13980-13985 (1984)
PUBMED 6094534
REFERENCE 72 (bases 11601 to 15726; 23924 to 25638)
AUTHORS Roberts,R.J., O'Neill,K.E. and Yen,C.T.
TITLE DNA sequences from the adenovirus 2 genome
JOURNAL J. Biol. Chem. 259 (22), 13968-13975 (1984)
PUBMED 6334081
REFERENCE 73 (bases 18838 to 21744)
AUTHORS Akusjarvi,G., Alestrom,P., Pettersson,M., Lager,M., Jornvall,H. and
Pettersson,U.
TITLE The gene for the adenovirus 2 hexon polypeptide
JOURNAL J. Biol. Chem. 259 (22), 13976-13979 (1984)
PUBMED 6501284
REFERENCE 74 (sites)
AUTHORS Moore,C.L. and Sharp,P.A.
TITLE Accurate cleavage and polyadenylation of exogenous RNA substrate
JOURNAL Cell 41 (3), 845-855 (1985)
PUBMED 2408761
REFERENCE 75 (sites)
AUTHORS Roberts,R.J., Akusjaervi,G., Alestroem,P., Gelinas,R.E.,
Gingeras,T.R., Sciaky,D. and Pettersson,U.
TITLE A consensus sequence for the adenovirus-2 genome
JOURNAL (in) Doerfler,W. (Ed.);
ADENOVIRUS DNA: 1-51;
Martinus Nijhoff Publishing, Boston (1986)
REFERENCE 76 (bases 10610 to 10711)
AUTHORS Cannon,R.E., Wu,G.J. and Railey,J.F.
TITLE Functions of and interactions between the A and B blocks in
adenovirus type 2-specific VARNA1 gene
JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83 (5), 1285-1289 (1986)
PUBMED 3456587
REFERENCE 77 (sites)
AUTHORS Anderson,C.W., Schmitt,R.C., Smart,J.E. and Lewis,J.B.
TITLE Early region 1B of adenovirus serotype 2 encodes two co-terminal
proteins of 495 and 155 amino acid residues
JOURNAL Unpublished (1984)
REFERENCE 78 (sites)
AUTHORS Virtanen,A. and Pettersson,U.
JOURNAL Unpublished (1984)
COMMENT [1] RNA.
[2] sites; cds start for the hexon protein.
[4] sites; 5' terminus of VA I RNA.
[6] mRNA and DNA.
[11] cDNA to hexon mRNA.
[12] several fragments over this span;.
[13] mRNA and DNA.
[19] cDNA.
[16] sites; acceptor splice site for fiber mRNA.
[18] several leader fragments over this span.
[27] sites; splice sites for E1b mRNAs.
[28] sites; cds start for E3 19K glycoprotein.
[29] sites; cds start for 15K, IX and fiber polypeptides. [37]
sites; cap site for E4 mrnas.
[39] sites; splice sites in E2a mRNA.
[31] fragments over this span.
[34] sites; splice site in 52,55K-pept mRNA.
[30] sites; splice sites in IVa2 mRNA, Ad5.
[43] cDNA and DNA.
[42] sites; splice site for 'i' leader.
[46] mRNA and DNA.
[41] sites; E1a mutational analysis.
[55] sites; splice sites for 33K mRNA.
[56] sites; cds start for E4 11K-pept, ad5.
[60] sites; cds start for the 13.6K-pept.
[61] sites; splice sites for 72K and 100K mRNAs.
[50] sites; splice sites for leaders; poly-A sites. [52] sites;
splice sites for E1a mRNAs.
[58] sites; splice sites in E2 mRNA.
[53] sites; H2ts1 mutation between 57.0% and 69.0%. [49] H2ts125
strain.
[68] sites; cds start for E1a proteins.
[69] sites; splice sites in E4 region.
[71] sites; splice sites in E4 region; poly-A site for E4 mRNAs.
[63] sites; cds start for 57K-pept.
[63] sites; splice sites in E4 region; poly-A site for E4 mRNAs.
[63] sites; splice sites in E1b region.
[(in) Doerfler,W. (Ed.);Adenovirus DNA: 1-51;Martinus Nijhoff
Publishing, Boston ] review; bases 1 to 35937.
[75] sites; recombination analysis of ad2 and ad5. [74] sites;
splice sites in major late mRNA.
[73] sites; IVa2 transcription start.
[72] sites; transcription start for EIa mRNAs.
[70] sites; E3 11.6 -K protein.
[76] sites; L3 mRNA polyadenylation site.
[78] sites; L3 mRNA polyadenylation site.
Communicated on tape by R. Roberts. That tape and [(in) Doerfler,
W. (Ed.);Adenovirus DNA: 1-51;Martinus Nijhoff Publishing, Boston ]
are the
immediate sources of the annotation herein.
A consensus sequence for the l-strand of the genome is shown.
Population heterogeneity as distinct from strain variation is known
(35937 +/- 9 bp) [67]; both are annotated as 'variation' below. For
site differences with adenovirus type 5, see loci beginning <ad5>
which are arranged in the library according to the map coordinates
of <ad2> where one map unit corresponds to 360 bases throughout
(see [44],[67]). For mutational changes in the ad2 sequence, see
the appropriate references above.
The origin of replication is located in the first fifty bases from
each end.
Transcription is leftward off the l-strand and rightward off the
r-strand; in the former case, the annotation shows '(c)' for
complementary strand. Complex splicing events give rise to perhaps
fifty or more distinct mRNA transcripts at early, intermediate and
late times after infection, many of which are still being
characterized; in particular, some transcripts are known from
electron microscopy which are not yet characterized at the sequence
level. To date nine mRNA start sites (cap sites) have been
identified, and these define the general units of mRNAs under which
all known transcripts are classified.
From the r-strand, the early transcripts are E1a, E1b and E3. The
28 kb late transcript called herein 'major late mRNA' comprises
five families, L1 through L5, of 3' co-terminal mRNAs. L1, and to
a lesser extent L2, can be expressed at early and intermediate
times [34]. Transcripts from this region contain a common
tripartite leader sequence at their 5' ends: the three segments of
this leader are encoded at bases 6039-6079, 7101-7172 and
9634-9723. At early and intermediate times, an extra leader
segment, the 'i' leader, is frequently present (bases 7942-8381).
The IX message, the only unspliced message in ad2, is intermediate,
and its termination overlaps that for E1b on the same strand and
that for IVa2, and most likely E2b, on the opposite strand. From
the l-strand, or the 'comp strand', early expression derives from
the E2a, E2b and E4 families of mRNAs, although there can be late
transcription from E2a. The E2b cap sites, splice sites and
termination sites have not been determined at the sequence level.
From electron microscopy there is evidence that the E2b mRNAs may
originate at the E2a early cap site at 27092 (c) and terminate at
the poly-A addition site found for the IVa2 mRNA at 4050 (c) [44].
IVa2 is an intermediate message. The promoters for these nine
classes of mRNAs can be localized and characterized to the
following extent [32]:
mRNA cap site possible promoter region
------ ---------- -----------------------------
E1a 498 tatttata at 468-474
E1b 1699 tatataat at 1669-1676
IX 3576 tatataa at 3545-3551
major late 6039 tataaaa at 6008-6014
E3 27609 tataa at 27580-27584
E4 35609 (c) tatatata at 35641-35633 (c)
E2a early 27092 (c) no obvious sequence for 100 bases
upstream
E2a late 25956 (c) tacaaattt at 25985-25977 (c)
IVa2 5826 (c) no obvious sequence for 100 bases
upstream
The mRNA responsible for the 13.6K protein encoded at 7968 has not
been identified. The VA I and VA II transcripts are unique in that
they are generated by RNA polymerase III; for a discussion of these
low molecular weight RNAs-- the modulation of their start points,
their promoters, their heterogeneity and their similarity to tRNA--
see [3],[4],[5],[26] and <ad5a2>.
The proteins known to be encoded from these mRNAs are given in the
Features table below, though the details of translation and
processing have not been fully determined. In cases such as the
IIIa peptide or the 11K peptide, the exact span of the coding
awaits elucidation of the mRNA splicing. Some of these products
share reading frames and therefore manifest partial homologies. The
following table summarizes the unidentified reading frames ('URF')
of 100 or more amino acids:
initiator terminator frame protein encoded
----------- ---------- ------- -----------------
6280 6600 1 11.6K URF
17284 17763 1 17.4K URF
23782 24138 1 12.9K URF
24481 24867 1 14.2K URF
26044 26826 1 28.6K URF(contains the
N-terminus of 33K cds)
30973 32778 1 63.9K URF(contains the
fiber cds)
10421 10834 2 14.4K URF
20504 20935 2 15.7K URF
27899 28222 2 12.4K URF
30059 30451 2 14.5k URF
33956 34456 2 18.8K URF
9294 9800 3 17.7K URF
23526 26525 3 110.2K URF(contains the
100K-pept cds)
30444 30830 3 14.7K URF
34470 34808 3 12.7K URF
complementary strand
---------------------------
35532 35146 1 14.3K URF
34077 33193 1 34.1K URF
11109 10744 1 12.8K URF
9030 8383 1 22.8K URF
6780 6442 1 12.8K URF
31604 31290 2 10.7K URF
31211 30852 2 13.5K URF
18707 18159 2 18.9K URF
14861 14424 2 16.4K URF
14114 13728 2 13.5K URF
11618 11250 2 13.6K URF











1712 1194 2 18
.1K URF
35113 34703 3 15.3K URF
34342 33998 3 13.3K URF
5674 5327 3 12.2K URF Additionally
there are numerous unidentified reading frames of less than 100
amino acid residues; and further small modifications of a few of
the coding sequences are possible.
[7] missing data project.
FEATURES Location/Qualifiers
source 1..35937
/organism="Human adenovirus 2"
/mol_type="genomic DNA"
/db_xref="taxon:10515"
repeat_region 1..102
/standard_name="inverted terminal repeat"
/citation=[13]
variation 8
/note="a in [44],[(in) Doerfler,W. (Ed.);Adenovirus DNA:
1-51;Martinus Nijhoff Publishing, Boston ]; aa in other
strains,e.g.[13]"
/replace="aa"
variation 460
/note="c in [44],[63]; t in [31]"
/replace="t"
prim_transcript 498..1630
/citation=[18]
gene 498..1613
/gene="E1a"
CDS join(559..1111,1226..1542)
/gene="E1a"
/note="derived from the 13S mRNA"
/codon_start=1
/product="32 kD protein"
/protein_id="AAA92199.1"
/db_xref="GI:209814"
/translation="MRHIICHGGVITEEMAASLLDQLIEEVLADNLPPPSHFEPPTLH
ELYDLDVTAPEDPNEEAVSQIFPESVMLAVQEGIDLFTFPPAPGSPEPPHLSRQPEQP
EQRALGPVSMPNLVPEVIDLTCHEAGFPPSDDEDEEGEEFVLDYVEHPGHGCRSCHYH
RRNTGDPDIMCSLCYMRTCGMFVYSPVSEPEPEPEPEPEPARPTRRPKLVPAILRRPT
SPVSRECNSSTDSCDSGPSNTPPEIHPVVPLCPIKPVAVRVGGRRQAVECIEDLLNES
GQPLDLSCKRPRP"
CDS join(559..973,1226..1542)
/gene="E1a"
/note="derived from the 12S mRNA"
/codon_start=1
/product="26 kD protein"
/protein_id="AAA92197.1"
/db_xref="GI:209813"
/translation="MRHIICHGGVITEEMAASLLDQLIEEVLADNLPPPSHFEPPTLH
ELYDLDVTAPEDPNEEAVSQIFPESVMLAVQEGIDLFTFPPAPGSPEPPHLSRQPEQP
EQRALGPVSMPNLVPEVIDLTCHEAGFPPSDDEDEEGPVSEPEPEPEPEPEPARPTRR
PKLVPAILRRPTSPVSRECNSSTDSCDSGPSNTPPEIHPVVPLCPIKPVAVRVGGRRQ
AVECIEDLLNESGQPLDLSCKRPRP"
CDS join(559..636,1226..1315)
/gene="E1a"
/note="derived from the 9S mRNA"
/codon_start=1
/product="6 kD protein"
/protein_id="AAA92198.1"
/db_xref="GI:209812"
/translation="MRHIICHGGVITEEMAASLLDQLIEEVLCLNLSLSPSQNRSLQD
LPGVLNWCLLS"
intron 637..1225
/gene="E1a"
/note="9S-derived"
/citation=[18]
intron 974..1225
/gene="E1a"
/note="12S-derived"
/citation=[18]
intron 1112..1225
/gene="E1a"
/note="13S-derived"
/citation=[18]
polyA_signal 1608..1613
/gene="E1a"
/note="4.47%; putative"
prim_transcript 1699..4061
/citation=[35]
gene 1699..3588
/gene="E1b_1"
CDS 1711..2238
/gene="E1b_1"
/note="derived from 13S mRNA"
/codon_start=1
/product="20.5 kD protein"
/protein_id="AAA92200.1"
/db_xref="GI:209819"
/translation="MEAWECLEDFSAVRNLLEQSSNSTSWFWRFLWGSSQAKLVCRIK
EDYKWEFEELLKSCGELFDSLNLGHQALFQEKVIKTLDFSTPGRAAAAVAFLSFIKDK
WSEETHLSGGYLLDFLAMHLWRAVVRHKNRLLLLSSVRPAIIPTEEQQQEEARRRRRQ
EQSPWNPRAGLDPRE"
CDS 2016..3503
/gene="E1b_1"
/note="derived from the 22S mRNA"
/codon_start=1
/product="transformation-associated protein"
/protein_id="AAA92202.1"
/db_xref="GI:209820"
/translation="MERRNPSERGVPAGFSGHASVESGGETQESPATVVFRPPGNNTD
GGATAGGSQAAAAAGAEPMEPESRPGPSGMNVVQVAELFPELRRILTINEDGQGLKGV
KRERGASEATEEARNLTFSLMTRHRPECVTFQQIKDNCANELDLLAQKYSIEQLTTYW
LQPGDDFEEAIRVYAKVALRPDCKYKISKLVNIRNCCYISGNGAEVEIDTEDRVAFRC
SMINMWPGVLGMDGVVIMNVRFTGPNFSGTVFLANTNLILHGVSFYGFNNTCVEAWTD
VRVRGCAFYCCWKGVVCRPKSRASIKKCLFERCTLGILSEGNSRVRHNVASDCGCFML
VKSVAVIKHNMVCGNCEDRASQMLTCSDGNCHLLKTIHVASHSRKAWPVFEHNILTRC
SLHLGNRRGVFLPYQCNLSHTKILLEPESMSKVNLNGVFDMTMKIWKVLRYDETRTRC
RPCECGGKHIRNQPVMLDVTEELRPDHLVLACTRAEFGSSDEDTD"
CDS join(2016..2249,3270..3503)
/gene="E1b_1"
/note="derived from the 1.26 kb mRNA"
/codon_start=1
/protein_id="AAA92203.1"
/db_xref="GI:209816"
/translation="MERRNPSERGVPAGFSGHASVESGGETQESPATVVFRPPGNNTD
GGATAGGSQAAAAAGAEPMEPESRPGPSGMNVVQPESMSKVNLNGVFDMTMKIWKVLR
YDETRTRCRPCECGGKHIRNQPVMLDVTEELRPDHLVLACTRAEFGSSDEDTD"
CDS join(2016..2249,3212..3256)
/gene="E1b_1"
/note="derived from the 1.31 kb mRNA"
/codon_start=1
/protein_id="AAA92201.1"
/db_xref="GI:209815"
/translation="MERRNPSERGVPAGFSGHASVESGGETQESPATVVFRPPGNNTD
GGATAGGSQAAAAAGAEPMEPESRPGPSGMNVVQEGGVPTLPMQFESH"
intron 2250..3588
/gene="E1b_1"
/citation=[25]
/number=1
intron 2250..3269
/gene="E1b_1"
/number=1
intron 2250..3211
/gene="E1b_1"
/number=1
intron 3505..3588
/gene="E1b_1"
/citation=[25]
/number=2
mRNA 3576..4061
/citation=[46]
gene 3576..4034
/gene="IX"
CDS 3600..4022
/gene="IX"
/codon_start=1
/product="hexon-associated protein"
/protein_id="AAA92204.1"
/db_xref="GI:209821"
/translation="MSANSFDGSIVSSYLTTRMPPWAGVRQNVMGSSIDGRPVLPANS
TTLTYETVSGTPLETAASAAASAAAATARGIVTDFAFLSPLASSAASRSSARDDKLTA
LLAQLDSLTRELNVVSQQLLDLRQQVSALKASSPPNAV"
gene 4029..4034
/gene="E1b_2"
polyA_signal 4029..4034
/gene="E1b_2"
/note="putative"
polyA_signal 4029..4034
/gene="IX"
/note="putative"
gene complement(4050..27092)
/gene="E2b"
mRNA complement(4050..27092)
/gene="E2b"
/citation=[46]
prim_transcript complement(4050..5826)
/gene="IVa2"
/citation=[46]
gene complement(4050..5695)
/gene="IVa2"
CDS complement(join(4081..5417,5696..5708))
/gene="IVa2"
/codon_start=1
/product="virion morphogenesis-associated protein"
/protein_id="AAA92205.1"
/db_xref="GI:209817"
/translation="METRGRRPAALQHQQDQPQAHPGQRAARSAPLHRDPDYADEDPA
PVERHDPGPSGRAPTTAVQRKPPQPAKRGDMLDRDAVEHVTELWDRLELLGQTLKSMP
TADGLKPLKNFASLQELLSLGGERLLAHLVRENMQVRDMLNEVAPLLRDDGSCSSLNY
QLQPVIGVIYGPTGCGKSQLLRNLLSSQLISPTPETVFFIAPQVDMIPPSELKAWEMQ
ICEGNYAPGPDGTIIPQSGTLRPRFVKMAYDDLILEHNYDVSDPRNIFAQAAARGPIA
IIMDECMENLGGHKGVSKFFHAFPSKLHDKFPKCTGYTVLVVLHNMNPRRDMAGNIAN
LKIQSKMHLISPRMHPSQLNRFVNTYTKGLPLAISLLLKDIFRHHAQRSCYDWIIYNT
TPQHEALQWCYLHPRDGLMPMYLNIQSHLYHVLEKIHRTLNDRDRWSRAYRARKTPK"
polyA_signal complement(4085..4090)
/gene="IVa2"
/note="putative"
CDS complement(5187..8357)
/gene="DNA polymerase"
/codon_start=1
/product="DNA polymerase"
/protein_id="AAA92206.1"
/db_xref="GI:209822"
/translation="MDSSHLRDVVIKLRPPRADIWTLGSRGVVVRSTITPLEQPDGQG
QAAEVEDHQPNPPGEGLKFPLCFLVRGRQVNLVQDVQPVHRCQYCARFYKSQHECSAR
RRDFYFHHINSHSSNWWREIQFFPIGSHPRTERLFVTYDVETYTWMGAFGKQLVPFML
VMKFGGDEPLVTAARDLAVDLGWDRWEQDPLTFYCITPEKMAIGRQFRTFRDHLQMLM
ARDLWSSFVASNPHLADWALSEHGLSSPEELTYEELKKLPSIKGTPRFLELYIVGHNI
NGFDEIVLAAQVINNRSEVPGPFRITRNFMPRAGKILFNDVTFALPNPRSKKRTDFLL
WEQGGCDDTDFKYQYLKVMVRDTFALTHTSLRKAAQAYALPVEKGCCAYQAVNQFYML
GSYRSEADGFPIQEYWKDREEFVLNRELWKKKGQDKYDIIKETLDYCALDVQVTAELV
NKLRDSYASFVRDAVGLTDASFNVFQRPTISSNSHAIFRQIVFRAEQPARSNLGPDLL
APSHELYDYVRASIRGGRCYPTYLGILREPLYVYDICGMYASALTHPMPWGPPLNPYE
RALAARAWQQALDLQGCKIDYFDARLLPGVFTVDADPPDETQLDPLPPFCSRKGGRLC
WTNERLRGEVATSVDLVTLHNRGWRVHLVPDERTTVFPEWRCVAREYVQLNIAAKERA
DRDKNQTLRSIAKLLSNALYGSFATKLDNKKIVFSDQMDAATLKGITAGQVNIKSSSF
LETDNLSAEVMPAFEREYSPQQLALADSDAEESEDERAPTPFYSPPSGTPGHVAYTYK
PITFLDAEEGDMCLHTLERVDPLVDNDRYPSHLASFVLAWTRAFVSEWSEFLYEEDRG
TPLEDRPLKSVYGDTDSLFVTERGHRLMETRGKKRIKKHGGNLVFDPERPELTWLVEC
ETVCGACGADAYSPESVFLAPKLYALKSLHCPSCGASSKGKLRAKGHAAEGLDYDTMV
KCYLADAQGEDRQRFSTSRTSLKRTLASAQPGAHPFTVTQTTLTRTLRPWKDMTLARL
DEHRLLPYSESRPNPRNEEICWIEMP"
gene complement(5187..7213)
/gene="DNA polymerase"
intron complement(5418..5695)
/gene="IVa2"
/citation=[25]
prim_transcript 6039..32798
/citation=[35]
gene 6039..32779
/gene="L5"
prim_transcript 6039..28223
/gene="L4"
/citation=[35]
gene 6039..24094
/gene="L4"
prim_transcript 6039..22443
/gene="L4"
/citation=[30]
gene 6039..20487
/gene="L3_1"
prim_transcript 6039..17969
/gene="L3_1"
/citation=[34]
gene 6039..15856
/gene="L2_1"
prim_transcript 6039..14113
/gene="L1"
/citation=[35]
gene 6039..14097
/gene="L1"
intron 6080..7100
/gene="L1"
/note="major late mRNA intron (precedes 2nd leader)"
/citation=[12]
old_sequence 6443
/gene="L1"
/citation=[20]
conflict 6574..6575
/gene="L1"
/citation=[20]
/replace=""
intron 7173..7941
/gene="DNA polymerase"
/note="major late mRNA intron (precedes 'i' leader)"
/citation=[12]
old_sequence 7212..7213
/gene="L1"
/citation=[19]
CDS 7968..8417
/gene="L1"
/codon_start=1
/product="13.6 kD protein"
/protein_id="AAA92207.1"
/db_xref="GI:209823"
/translation="MRADREELDLPPPVGGVAVDVVKVEVPATGRTLVLAFVKTCAVL
AAVHGLYILHEVDLTTAHKEAEWEFEPLAWRVWLVVFYFGCLSLTVWLLEGSYGGSDH
HAARAQSPDVRARRSELDDNIAQMGAVHGLELPRRQVRRELLQVYLA"
intron 8382..9633
/gene="L1"
/note="major late mRNA intron (precedes 3rd leader)"
/citation=[12]
CDS complement(8573..10534)
/gene="E2b"
/note="also known as Bellet protein"
/codon_start=1
/product="terminal protein"
/protein_id="AAA92208.1"
/db_xref="GI:209824"
/translation="MEHFLPLRNIWNRVRDFPRASTTAAGITWMSRYIYGYHRLMLED
LAPGAPATLRWPLYRQPPPHFLVGYQYLVRTCNDYVFDSRAYSRLRYTELSQPGHQTV
NWSVMANCTYTINTGAYHRFVDMDDFQSTLTQVQQAILAERVVADLALLQPMRGFGVT
RMGGRGRHLRPNSAAAVAIDARDAGQEEGEEEVPVERLMQDYYKDLRRCQNEAWGMAD
RLRIQQAGPKDMVLLSTIRRLKTAYFNYIISSTSARNNPDRHPLPPATVLSLPCDCDW
LDAFLERFSDPVDADSLRSLGGGVPTQQLLRCIVSAVSLPHGSPPPTHNRDMTGGVFQ
LRPRENGRAVTETMRRRRGEMIERFVDRLPVRRRRRRVPPPPPPPEEEEEGEALMEEE
IEEEEAPVAFEREVRDTVAELIRLLEEELTVSARNSQFFNFAVDFYEAMERLEALGDI
NESTLRRWVMYFFVAEHTATTLNYLFQRLRNYAVFARHVELNLAQVVMRARDAEGGVV
YSRVWNEGGLNAFSQLMARISNDLAATVERAGRGDLQEEEIEQFMAEIAYQDNSGDVQ
EILRQAAVNDTEIDSVELSFRFKLTGPVVFTQRRQIQEINRRVVAFASNLRAQHQLLP
ARGADVPLPPLPAGPEPPLPPGARPRHRF"
conflict 9315..9316
/gene="L1"
/citation=[47]
/replace=""
variation 9382
/gene="L1"
/note="c is shown; can be cttc due to population
heterogeneity"
/citation=[72]
/replace="cttc"
conflict 9633..9634
/gene="L1"
/citation=[12]
/replace=""
intron 9724..24094
/gene="L4"
/note="precedes 100K mRNA"
/citation=[60]
intron 9724..21649
/gene="L4"
/note="precedes 23K mRNA"
/citation=[30]
intron 9724..18801
/gene="L3_1"
/note="precedes hexon mRNA"
/citation=[7]
intron 9724..17999
/gene="L3_1"
/note="precedes pVI mRNA"
/citation=[34]
intron 9724..16515
/gene="L3_1"
/note="precedes pV mRNA"
/citation=[71]
intron 9724..14149
/gene="L2_1"
/note="precedes penton mRNA"
/citation=[61]
intron 9724..11039
/gene="L1"
/note="precedes 52,55K mRNA"
/citation=[12]
misc_RNA 10607..10766
/gene="VA I"
/citation=[5]
gene 10607..10716
/gene="VA I"
misc_RNA 10610..10766
/gene="VA I"
/citation=[1]
conflict 10715..10716
/gene="VA I"
/citation=[4]
/replace=""
gene 10866..11023
/gene="VA II"
misc_RNA 10866..11023
/gene="VA II"
/citation=[23]
CDS 11040..12287
/gene="L1"
/codon_start=1
/product="52,55kD protein"
/protein_id="AAA92209.1"
/db_xref="GI:209825"
/translation="MHPVLRQMRPPPQQRQEQEQRQTCRAPSPSPTASGGATSAADAA
ADGDYEPPRRRARHYLDLEEGEGLARLGAPSPERHPRVQLKRDTREAYVPRQNLFRDR
EGEEPEEMRDRKFHAGRELRHGLNRERLLREEDFEPDARTGISPARAHVAAADLVTAY
EQTVNQEINFQKSFNNHVRTLVAREEVAIGLMHLWDFVSALEQNPNSKPLMAQLFLIV
QHSRDNEAFRDALLNIVEPEGRWLLDLINILQSIVVQERSLSLADKVAAINYSMLSLG
KFYARKIYHTPYVPIDKEVKIEGFYMRMALKVLTLSDDLGVYRNERIHKAVSVSRRRE
LSDRELMHSLQRALAGTGSGDREAESYFDAGADLRWAPSRRALEAAGAGPGLAVAPAR
AGNVGGVEEYDEDDEYEPEDGEY"
variation 11062
/gene="L1"
/note="t in [43],[63]"
/citation=[23]
/replace="c"
gene 12308..14065
/gene="IIIa"
CDS 12308..14065
/gene="IIIa"
/note="splice sites not sequenced"
/codon_start=1
/product="peripentonal hexon-associated protein"
/protein_id="AAA92210.1"
/db_xref="GI:209826"
/translation="MMQDATDPAVRAALQSQPSGLNSTDDWRQVMDRIMSLTARNPDA
FRQQPQANRLSAILEAVVPARANPTHEKVLAIVNALAENRAIRPDEAGLVYDALLQRV
ARYNSGNVQTNLDRLVGDVREAVAQRERAQQQGNLGSMVALNAFLSTQPANVPRGQED
YTNFVSALRLMVTETPQSEVYQSGPDYFFQTSRQGLQTVNLSQAFKNLQGLWGVRAPT
GDRATVSSLLTPNSRLLLLLIAPFTDSGSVSRDTYLGHLLTLYREAIGQAHVDEHTFQ
EITSVSRALGQEDTGSLEATLNYLLTNRRQKIPSLHSLNSEEERILRYVQQSVSLNLM
RDGVTPSVALDMTARNMEPGMYASNRPFINRLMDYLHRAAAVNPEYFTNAILNPHWLP
PPGFYTGGFEVPEGNDGFLWDDIDDSVFSPQPQTLLELQQREQAEAALRKESFRRPSS
LSDLGAAAPRSDASSPFPSLIGSFTSTRTTRPRLLGEEEYLNNSLLQPQREKNLPPAF
PNNGIESLVDKMSRWKTYAQEHRDVPGPRPPTRRQRHDRQRGLVWEDDDSADDSSVLD
LGGSGNPFAHLRPRLGRMF"
variation 14064..14080
/gene="L1"
/note="15 to 19 A residues have been observed in various
populations"
/citation=[72]
/replace="aaaaaaaaaaaaaaaaaaa"
polyA_signal 14092..14097
/gene="L1"
/note="39.1%; putative"
CDS 14151..15866
/gene="L3_1"
/note="virion component III"
/codon_start=1
/product="penton protein"
/protein_id="AAA92211.1"
/db_xref="GI:209827"
/translation="MQRAAMYEEGPPPSYESVVSAAPVAAALGSPFDAPLDPPFVPPR
YLRPTGGRNSIRYSELAPLFDTTRVYLVDNKSTDVASLNYQNDHSNFLTTVIQNNDYS
PGEASTQTINLDDRSHWGGDLKTILHTNMPNVNEFMFTNKFKARVMVSRSLTKDKQVE
LKYEWVEFTLPEGNYSETMTIDLMNNAIVEHYLKVGRQNGVLESDIGVKFDTRNFRLG
FDPVTGLVMPGVYTNEAFHPDIILLPGCGVDFTHSRLSNLLGIRKRQPFQEGFRITYD
DLEGGNIPALLDVDAYQASLKDDTEQGGDGAGGGNNSGSGAEENSNAAAAAMQPVEDM
NDHAIRGDTFATRAEEKRAEAEAAAEAAAPAAQPEVEKPQKKPVIKPLTEDSKKRSYN
LISNDSTFTQYRSWYLAYNYGDPQTGIRSWTLLCTPDVTCGSEQVYWSLPDMMQDPVT
FRSTSQISNFPVVGAELLPVHSKSFYNDQAVYSQLIRQFTSLTHVFNRFPENQILARP
PAPTITTVSENVPALTDHGTLPLRNSIGGVQRVTITDARRRTCPYVYKALGIVSPRVL
SSRTF"
variation 15856
/gene="L2_1"
/note="g in [67],[(in) Doerfler,W. (Ed.);Adenovirus DNA:
1-51;Martinus Nijhoff Publishing, Boston ]"
/citation=[56]
/replace="t"
CDS 15873..16469
/gene="Pro-VII"
/note="precursor"
/codon_start=1
/product="major core protein"
/protein_id="AAA92212.1"
/db_xref="GI:209828"
/translation="MSILISPSNNTGWGLRFPSKMFGGAKKRSDQHPVRVRGHYRAPW
GAHKRGRTGRTTVDDAIDAVVEEARNYTPTPPPVSTVDAAIQTVVRGARRYAKMKRRR
RRVARRHRRRPGTAAQRAAAALLNRARRTGRRAAMRAARRLAAGIVTVPPRSRRRAAA
AAAAAISAMTQGRRGNVYWVRDSVSGLRVPVRTRPPRN"
gene 15873..16437
/gene="Pro-VII"
variation 15914
/gene="Pro-VII"
/note="c in [69],[63]"
/citation=[56]
/replace="t"
variation 15998
/gene="Pro-VII"
/note="g in [66],[63]"
/citation=[56]
/replace="c"
conflict 16205..16208
/gene="Pro-VII"
/citation=[56]
/replace=""
variation 16437
/gene="Pro-VII"
/note="g in [66],[63]"
/citation=[56]
/replace="c"
gene 16539..17648
/gene="pV"
CDS 16539..17648
/gene="pV"
/codon_start=1
/product="minor core protein"
/protein_id="AAA92213.1"
/db_xref="GI:209829"
/translation="MSKRKIKEEMLQVIAPEIYGPPKKEEQDYKPRKLKRVKKKKKDD
DDDELDDEVELLHATAPRRRVQWKGRRVRRVLRPGTTVVFTPGERSTRTYKRVYDEVY
GDEDLLEQANERLGEFAYGKRHKDMLALPLDEGNPTPSLKPVTLQQVLPTLAPSEEKR
GLKRESGDLAPTVQLMVPKRQRLEDVLEKMTVEPGLEPEVRVRPIKQVAPGLGVQTVD
VQIPTTSSTSIATATEGMETQTSPVASAVADAAVQAAAAAASKTSTEVQTDPWMFRVS
APRRPRRSRKYGTASALLPEYALHPSIAPTPGYRGYTYRPRRRATTRRRTTTGTRRRR
RRRQPVLAPISVRRVAREGGRTLVLPTARYHPSIV"
gene 17949..17964
/gene="L2_2"
polyA_signal 17949..17954
/gene="L2_2"
/note="49.94%; putative"
variation 17964
/gene="L2_2"
/note="g in [67],[(in) Doerfler,W. (Ed.);Adenovirus DNA:
1-51;Martinus Nijhoff Publishing, Boston ]used"
/citation=[59]
/replace="c"
gene 18001..18753
/gene="pVI"
CDS 18001..18753
/gene="pVI"
/note="precursor"
/codon_start=1
/product="hexon-associated protein"
/protein_id="AAA92214.1"
/db_xref="GI:209830"
/translation="MEDINFASLAPRHGSRPFMGNWQDIGTSNMSGGAFSWGSLWSGI
KNFGSTIKNYGSKAWNSSTGQMLRDKLKEQNFQQKVVDGLASGISGVVDLANQAVQNK
INSKLDPRPPVEEPPPAVETVSPEGRGEKRPRPDREETLVTQIDEPPSYEEALKQGLP
TTRPIAPMATGVLGQHTPVTLDLPPPADTQQKPVLPGPSAVVVTRPSRASLRRAASGP
RSMRPVASGNWQSTLNSIVGLGVQSLKRRRCF"
CDS 18838..21744
/gene="L4"
/note="virion component II"
/codon_start=1
/product="hexon protein"
/protein_id="AAA92215.1"
/db_xref="GI:209831"
/translation="MATPSMMPQWSYMHISGQDASEYLSPGLVQFARATETYFSLNNK
FRNPTVAPTHDVTTDRSQRLTLRFIPVDREDTAYSYKARFTLAVGDNRVLDMASTYFD
IRGVLDRGPTFKPYSGTAYNALAPKGAPNSCEWEQTEDSGRAVAEDEEEEDEDEEEEE
EEQNARDQATKKTHVYAQAPLSGETITKSGLQIGSDNAETQAKPVYADPSYQPEPQIG
ESQWNEADANAAGGRVLKKTTPMKPCYGSYARPTNPFGGQSVLVPDEKGVPLPKVDLQ
FFSNTTSLNDRQGNATKPKVVLYSEDVNMETPDTHLSYKPGKGDENSKAMLGQQSMPN
RPNYIAFRDNFIGLMYYNSTGNMGVLAGQASQLNAVVDLQDRNTELSYQLLLDSIGDR
TRYFSMWNQAVDSYDPDVRIIENHGTEDELPNYCFPLGGIGVTDTYQAIKANGNGSGD
NGDTTWTKDETFATRNEIGVGNNFAMEINLNANLWRNFLYSNIALYLPDKLKYNPTNV
EISDNPNTYDYMNKRVVAPGLVDCYINLGARWSLDYMDNVNPFNHHRNAGLRYRSMLL
GNGRYVPFHIQVPQKFFAIKNLLLLPGSYTYEWNFRKDVNMVLQSSLGNDLRVDGASI
KFDSICLYATFFPMAHNTASTLEAMLRNDTNDQSFNDYLSAANMLYPIPANATNVPIS
IPSRNWAAFRGWAFTRLKTKETPSLGSGYDPYYTYSGSIPYLDGTFYLNHTFKKVAIT
FDSSVSWPGNDRLLTPNEFEIKRSVDGEGYNVAQCNMTKDWFLVQMLANYNIGYQGFY
IPESYKDRMYSFFRNFQPMSRQVVDDTKYKEYQQVGILHQHNNSGFVGYLAPTMREGQ
AYPANVPYPLIGKTAVDSITQKKFLCDRTLWRIPFSSNFMSMGALTDLGQNLLYANSA
HALDMTFEVDPMDEPTLLYVLFEVFDVVRVHQPHRGVIETVYLRTPFSAGNATT"
old_sequence 18914..18915
/gene="L3_1"
/citation=[34]
old_sequence 18919
/gene="L3_1"
/citation=[36]
old_sequence 19617
/gene="L3_1"
/citation=[36]
old_sequence 19666
/gene="L3_1"
/citation=[36]
old_sequence 19823
/gene="L3_1"
/citation=[36]
old_sequence 20427
/gene="L3_1"
/citation=[36]
old_sequence 20487
/gene="L3_1"
/citation=[36]
gene 21778..22392
/gene="23 kD"
CDS 21778..22392
/gene="23 kD"
/codon_start=1
/product="endopeptidase"
/protein_id="AAA92216.1"
/db_xref="GI:209832"
/translation="MGSSEQELKAIVKDLGCGPYFLGTYDKRFPGFVSPHKLACAIVN
TAGRETGGVHWMAFAWNPRSKTCYLFEPFGFSDQRLKQVYQFEYESLLRRSAIASSPD
RCITLEKSTQSVQGPNSAACGLFCCMFLHAFANWPQTPMDHNPTMNLITGVPNSMLNS
PQVQPTLRRNQEQLYSFLERHSPYFRSHSAQIRSATSFCHLKNM"
gene 22418..22423
/gene="L3_2"
polyA_signal 22418..22423
/gene="L3_2"
/note="putative"
prim_transcript complement(22420..27092)
/gene="E2b"
/citation=[35]
prim_transcript complement(22420..27091)
/gene="E2b"
/citation=[35]
prim_transcript complement(22420..25956)
/gene="E2b"
/citation=[65]
prim_transcript complement(22420..25954)
/gene="E2b"
/citation=[35]
gene complement(22420..22444)
/gene="E2a_1"
polyA_signal complement(22439..22444)
/gene="E2a_1"
/note="62.43%; putative"
CDS complement(22490..24079)
/gene="DBP"
/note="72kD protein"
/codon_start=1
/product="DNA-binding protein"
/protein_id="AAA92217.1"
/db_xref="GI:209833"
/translation="MASREEEQRETTPERGRGAARRPPTMEDVSSPSPSPPPPRAPPK
KRLRRRLESEDEEDSSQDALVPRTPSPRPSTSTADLAIASKKKKKRPSPKPERPPSPE
VIVDSEEEREDVALQMVGFSNPPVLIKHGKGGKRTVRRLNEDDPVARGMRTQEEKEES
SEAESESTVINPLSLPIVSAWEKGMEAARALMDKYHVDNDLKANFKLLPDQVEALAAV
CKTWLNEEHRGLQLTFTSNKTFVTMMGRFLQAYLQSFAEVTYKHHEPTGCALWLHRCA
EIEGELKCLHGSIMINKEHVIEMDVTSENGQRALKEQSSKAKIVKNRWGRNVVQISNT
DARCCVHDAACPANQFSGKSCGMFFSEGAKAQVAFKQIKAFMQALYPNAQTGHGHLLM
PLRCECNSKPGHAPFLGRQLPKLTPFALSNAEDLDADLISDKSVLASVHHPALIVFQC
CNPVYRNSRAQGGGPNCDFKISAPDLLNALVMVRSLWSENFTELPRMVVPEFKWSTKH
QYRNVSLPVAHSDARQNPFDF"
gene complement(22490..22524)
/gene="DBP"
variation 22524
/gene="L4"
/note="t in [48],[(in) Doerfler,W. (Ed.);Adenovirus DNA:
1-51;Martinus Nijhoff Publishing, Boston ]"
/citation=[29]
/replace="c"
gene complement(24089..25885)
/gene="E2a_2"
intron complement(24089..24714)
/gene="E2a_2"
/citation=[39]
CDS 24108..26525
/gene="L5"
/note="100K protein"
/codon_start=1
/product="hexon assembly-associated protein"
/protein_id="AAA92218.1"
/db_xref="GI:209834"
/translation="MESVEKEDSLTAPFEFATTASTDAANAPTTFPVEAPPLEEEEVI
IEQDPGFVSEDDEDRSVPTEDKKQDQDDAEANEEQVGRGDQRHGDYLDVGDDVLLKHL
QRQCAIICDALQERSDVPLAIADVSLAYERHLFSPRVPPKRQENGTCEPNPRLNFYPV
FAVPEVLATYHIFFQNCKIPLSCRANRSRADKQLALRQGAVIPDIASLDEVPKIFEGL
GRDEKRAANALQQENSENESHCGVLVELEGDNARLAVLKRSIEVTHFAYPALNLPPKV
MSTVMSELIVRRARPLERDANLQEQTEEGLPAVGDEQLARWLETREPADLEERRKLMM
AAVLVTVELECMQRFFADPEMQRKLEETLHYTFRQGYVRQACKISNVELCNLVSYLGI
LHENRLGQNVLHSTLKGEARRDYVRDCVYLFLCYTWQTAMGVWQQCLEERNLKELQKL
LKQNLKDLWTAFNERSVAAHLADIIFPERLLKTLQQGLPDFTSQSMLQNFRNFILERS
GILPATCCALPSDFVPIKYRECPPPLWGHCYLLQLANYLAYHSDIMEDVSGDGLLECH
CRCNLCTPHRSLVCNSQLLSESQIIGTFELQGPSPDEKSAAPGLKLTPGLWTSAYLRK
FVPEDYHAHEIRFYEDQSRPPNAELTACVITQGHILGQLQAINKARQEFLLRKGRGVY
LDPQSGEELNPIPPPPQPYQQPRALASQDGTQKEAAAAAAATHGRGGILGQSGRGGFG
RGGGDDGRLGQPRRSFRGRRGVRRNTVTLGRIPLAGAPEIGNRSQHRYNLRSSGAAGT
ACSPTQP"
intron complement(24792..27024)
/gene="E2b"
/citation=[16]
intron complement(24972..25885)
/gene="E2a_2"
/note="does not fit consensus"
/citation=[60]
/cons_splice=(5'site:no, 3'site:no)
gene 26239..27127
/gene="33K"
CDS join(26239..26551,26754..27127)
/gene="33K"
/codon_start=1
/product="virion morphogenesis-associated protein"
/protein_id="AAA92219.1"
/db_xref="GI:209818"
/translation="MAPKKKLQLPPPPPTDEEEYWDSQAEEVLDEEEEMMEDWDSLDE
ASEAEEVSDETPSPSVAFPSPAPQKLATVPSIATTSAPQAPPALPVRRPNRRWDTTGT
RAAPTAPAAAAAAATAAVTQKQRRPDSKTLTKPKKSTAAAAAGGGALRLAPNEPVSTR
ELRNRIFPTLYAIFQQSRGQEQELKIKNRSLRSLTRSCLYHKSEDQLRRTLEDAEALF
SKYCALTLKD"
gene 26552..26753
/gene="33kD protein"
intron 26552..26753
/gene="33kD protein"
/citation=[50]
gene 27215..27898
/gene="pVIII"
CDS 27215..27898
/gene="pVIII"
/note="precursor"
/codon_start=1
/product="hexon-associated protein"
/protein_id="AAA92220.1"
/db_xref="GI:209835"
/translation="MSKEIPTPYMWSYQPQMGLAAGAAQDYSTRINYMSAGPHMISRV
NGIRAHRNRILLEQAAITTTPRNNLNPRSWPAALVYQESPAPTTVVLPRDAQAEVQMT
NSGAQLAGGFRHRVRSPGQGITHLKIRGRGIQLNDESVSSSLGLRPDGTFQIGGAGRS
SFTPRQAILTLQTSSSEPRSGGIGTLQFIEEFVPSVYFNPFSGPPGHYPDQFIPNFDA
VKDSADGYD"
mRNA 27609..30864
/gene="E3-2"
/citation=[55]
gene 27609..30847
/gene="E3-2"
mRNA 27609..29804
/gene="E3-1"
/citation=[48]
mRNA 27609..29801
/gene="E3-1"
/citation=[48]
mRNA 27609..29799
/gene="E3-1"
/citation=[48]
mRNA 27609..29792
/gene="E3-1"
/citation=[35]
gene 27609..29774
/gene="E3-1"
intron 27981..28375
/gene="E3-1"
/note="major late mRNA intron ('x' leader)"
/citation=[71]
polyA_signal 28205..28210
/gene="E3-1"
/note="major late mRNA L4 78.48%; putative"
old_sequence 28339
/gene="E3-1"
/citation=[19]
old_sequence 28350
/gene="E3-1"
/citation=[19]
old_sequence 28359
/gene="E3-1"
/citation=[19]
old_sequence 28465..28466
/gene="E3-1"
/citation=[19]
old_sequence 28495..28497
/gene="E3-1"
/citation=[19]
intron 28560..30437
/gene="E3-2"
/note="major late mRNA ('y' leader)"
/citation=[34]
gene 28812..29773
/gene="E3"
CDS 28812..29291
/gene="E3"
/note="19kD protein"
/codon_start=1
/product="glycosylated membrane protein"
/protein_id="AAA92221.1"
/db_xref="GI:209836"
/translation="MRYMILGLLALAAVCSAAKKVEFKEPACNVTFKSEANECTTLIK
CTTEHEKLIIRHKDKIGKYAVYAIWQPGDTNDYNVTVFQGENRKTFMYKFPFYEMCDI
TMYMSKQYKLWPPQKCLENTGTFCSTALLITALALVCTLLYLKYKSRRSFIDEKKMP"
CDS 29468..29773
/gene="E3"
/codon_start=1
/product="11.6 kD protein"
/protein_id="AAA92222.1"
/db_xref="GI:209837"
/translation="MTGSTIAPTTDYRNTTATGLTSALNLPQVHAFVNDWASLDMWWF
SIALMFVCLIIMWLICCLKRRRARPPIYRPIIVLNPHNEKIHRLDGLKPCSLLLQYD"
polyA_signal 29769..29774
/gene="E3-1"
/note="82.69%; putative"
intron 30583..31029
/gene="L5"
/note="major late mRNA ('z' leader)"
/citation=[7]
polyA_signal 30842..30847
/gene="E3-2"
/note="85.82%; putative"
old_sequence 30980..30981
/gene="L5"
/citation=[19]
CDS 31030..32778
/gene="L5"
/note="virion component IV"
/codon_start=1
/product="fiber protein"
/protein_id="AAA92223.1"
/db_xref="GI:209838"
/translation="MKRARPSEDTFNPVYPYDTETGPPTVPFLTPPFVSPNGFQESPP
GVLSLRVSEPLDTSHGMLALKMGSGLTLDKAGNLTSQNVTTVTQPLKKTKSNISLDTS
APLTITSGALTVATTAPLIVTSGALSVQSQAPLTVQDSKLSIATKGPITVSDGKLALQ
TSAPLSGSDSDTLTVTASPPLTTATGSLGINMEDPIYVNNGKIGIKISGPLQVAQNSD
TLTVVTGPGVTVEQNSLRTKVAGAIGYDSSNNMEIKTGGGMRINNNLLILDVDYPFDA
QTKLRLKLGQGPLYINASHNLDINYNRGLYLFNASNNTKKLEVSIKKSSGLNFDNTAI
AINAGKGLEFDTNTSESPDINPIKTKIGSGIDYNENGAMITKLGAGLSFDNSGAITIG
NKNDDKLTLWTTPDPSPNCRIHSDNDCKFTLVLTKCGSQVLATVAALAVSGDLSSMTG
TVASVSIFLRFDQNGVLMENSSLKKHYWNFRNGNSTNANPYTNAVGFMPNLLAYPKTQ
SQTAKNNIVSQVYLHGDKTKPMILTITLNGTSESTETSEVSTYSMSFTWSWESGKYTT
ETFATNSYTFSYIAQE"
polyA_signal 32774..32779
/gene="L5"
/note="91.19%; putative"
prim_transcript complement(32802..35609)
/citation=[35]
gene complement(32802..35547)
/gene="E4"
polyA_signal complement(32821..32826)
/gene="E4"
/note="91.32%; putative"
intron complement(33193..33903)
/gene="E4"
/citation=[65]
intron complement(33284..33903)
/gene="E4"
/citation=[65]
intron complement(33377..33903)
/gene="E4"
/citation=[63]
intron complement(33404..33903)
/gene="E4"
/citation=[63]
intron complement(33452..33903)
/gene="E4"
/citation=[63]
intron complement(33610..33903)
/gene="E4"
/citation=[63]
intron complement(33679..33903)
/gene="E4"
/citation=[63]
intron complement(33875..33903)
/gene="E4"
/citation=[63]
intron complement(34083..34288)
/gene="E4"
/citation=[65]
intron complement(34242..34288)
/gene="E4"
/citation=[65]
intron complement(34330..34605)
/gene="E4"
/citation=[65]
variation 34344..34345
/gene="E4"
/note="tt in [44],[(in) Doerfler,W. (Ed.);Adenovirus DNA:
1-51;Martinus Nijhoff Publishing, Boston ]; t in [38]"
CDS complement(34356..34706)
/gene="E4"
/note="11kD protein; splice sites not sequenced"
/codon_start=1
/product="nuclear binding protein"
/protein_id="AAA92224.1"
/db_xref="GI:209839"
/translation="MIRCLRLKVEGALEQIFTMAGLNIRDLLRDILIRWRDENYLGMV
EGAGMFIEEIHPEGFSLYVHLDVRAVCLLEAIVQHLTNAIICSLAVEFDHATGGERVH
LIDLHFEVLDNLLE"
intron complement(34380..34605)
/gene="E4"
/citation=[63]
intron complement(34436..34605)
/gene="E4"
/citation=[65]
intron complement(34736..35547)
/gene="E4"
/citation=[65]
intron complement(35108..35547)
/gene="E4"
repeat_region 35836..35937
/standard_name="inverted terminal repeat"
/note="99.54%"
/citation=[13]
variation 35930
/note="t in [44],[(in) Doerfler,W. (Ed.);Adenovirus DNA:
1-51;Martinus Nijhoff Publishing, Boston ]; tt in other
strains"
/replace="tt"
ORIGIN
1 catcatcata atatacctta ttttggattg aagccaatat gataatgagg gggtggagtt
61 tgtgacgtgg cgcggggcgt gggaacgggg cgggtgacgt agtagtgtgg cggaagtgtg
121 atgttgcaag tgtggcggaa cacatgtaag cgccggatgt ggtaaaagtg acgtttttgg
181 tgtgcgccgg tgtatacggg aagtgacaat tttcgcgcgg ttttaggcgg atgttgtagt
241 aaatttgggc gtaaccaagt aatgtttggc cattttcgcg ggaaaactga ataagaggaa
301 gtgaaatctg aataattctg tgttactcat agcgcgtaat atttgtctag ggccgcgggg
361 actttgaccg tttacgtgga gactcgccca ggtgtttttc tcaggtgttt tccgcgttcc
421 gggtcaaagt tggcgtttta ttattatagt cagctgacgc gcagtgtatt tatacccggt
481 gagttcctca agaggccact cttgagtgcc agcgagtaga gttttctcct ccgagccgct
541 ccgacaccgg gactgaaaat gagacatatt atctgccacg gaggtgttat taccgaagaa
601 atggccgcca gtcttttgga ccagctgatc gaagaggtac tggctgataa tcttccacct
661 cctagccatt ttgaaccacc tacccttcac gaactgtatg atttagacgt gacggccccc
721 gaagatccca acgaggaggc ggtttcgcag atttttcccg agtctgtaat gttggcggtg
781 caggaaggga ttgacttatt cacttttccg ccggcgcccg gttctccgga gccgcctcac
841 ctttcccggc agcccgagca gccggagcag agagccttgg gtccggtttc tatgccaaac
901 cttgtgccgg aggtgatcga tcttacctgc cacgaggctg gctttccacc cagtgacgac
961 gaggatgaag agggtgagga gtttgtgtta gattatgtgg agcaccccgg gcacggttgc
1021 aggtcttgtc attatcaccg gaggaatacg ggggacccag atattatgtg ttcgctttgc
1081 tatatgagga cctgtggcat gtttgtctac agtaagtgaa aattatgggc agtcggtgat
1141 agagtggtgg gtttggtgtg gtaatttttt tttaattttt acagttttgt ggtttaaaga
1201 attttgtatt gtgatttttt aaaaggtcct gtgtctgaac ctgagcctga gcccgagcca
1261 gaaccggagc ctgcaagacc tacccggcgt cctaaattgg tgcctgctat cctgagacgc
1321 ccgacatcac ctgtgtctag agaatgcaat agtagtacgg atagctgtga ctccggtcct
1381 tctaacacac ctcctgagat acacccggtg gtcccgctgt gccccattaa accagttgcc
1441 gtgagagttg gtgggcgtcg ccaggctgtg gaatgtatcg aggacttgct taacgagtct
1501 gggcaacctt tggacttgag ctgtaaacgc cccaggccat aaggtgtaaa cctgtgattg
1561 cgtgtgtggt taacgccttt gtttgctgaa tgagttgatg taagtttaat aaagggtgag
1621 ataatgttta acttgcatgg cgtgttaaat ggggcggggc ttaaagggta tataatgcgc
1681 cgtgggctaa tcttggttac atctgacctc atggaggctt gggagtgttt ggaagatttt
1741 tctgctgtgc gtaacttgct ggaacagagc tctaacagta cctcttggtt ttggaggttt
1801 ctgtggggct cctcccaggc aaagttagtc tgcagaatta aggaggatta caagtgggaa
1861 tttgaagagc ttttgaaatc ctgtggtgag ctgtttgatt ctttgaatct gggtcaccag
1921 gcgcttttcc aagagaaggt catcaagact ttggattttt ccacaccggg gcgcgctgcg
1981 gctgctgttg cttttttgag ttttataaag gataaatgga gcgaagaaac ccatctgagc
2041 ggggggtacc tgctggattt tctggccatg catctgtgga gagcggtggt gagacacaag
2101 aatcgcctgc tactgttgtc ttccgtccgc ccggcaataa taccgacgga ggagcaacag
2161 caggaggaag ccaggcggcg gcggcggcag gagcagagcc catggaaccc gagagccggc
2221 ctggaccctc gggaatgaat gttgtacagg tggctgaact gtttccagaa ctgagacgca
2281 ttttaaccat taacgaggat gggcaggggc taaagggggt aaagagggag cggggggctt
2341 ctgaggctac agaggaggct aggaatctaa cttttagctt aatgaccaga caccgtcctg
2401 agtgtgttac ttttcagcag attaaggata attgcgctaa tgagcttgat ctgctggcgc
2461 agaagtattc catagagcag ctgaccactt actggctgca gccaggggat gattttgagg
2521 aggctattag ggtatatgca aaggtggcac ttaggccaga ttgcaagtac aagattagca
2581 aacttgtaaa tatcaggaat tgttgctaca tttctgggaa cggggccgag gtggagatag
2641 atacggagga tagggtggcc tttagatgta gcatgataaa tatgtggccg ggggtgcttg
2701 gcatggacgg ggtggttatt atgaatgtga ggtttactgg tcccaatttt agcggtacgg
2761 ttttcctggc caataccaat cttatcctac acggtgtaag cttctatggg tttaacaata
2821 cctgtgtgga agcctggacc gatgtaaggg ttcggggctg tgccttttac tgctgctgga
2881 agggggtggt gtgtcgcccc aaaagcaggg cttcaattaa gaaatgcctg tttgaaaggt
2941 gtaccttggg tatcctgtct gagggtaact ccagggtgcg ccacaatgtg gcctccgact
3001 gtggttgctt catgctagtg aaaagcgtgg ctgtgattaa gcataacatg gtgtgtggca
3061 actgcgagga cagggcctct cagatgctga cctgctcgga cggcaactgt cacttgctga
3121 agaccattca cgtagccagc cactctcgca aggcctggcc agtgtttgag cacaacatac
3181 tgacccgctg ttccttgcat ttgggtaaca ggaggggggt gttcctacct taccaatgca
3241 atttgagtca cactaagata ttgcttgagc ccgagagcat gtccaaggtg aacctgaacg
3301 gggtgtttga catgaccatg aagatctgga aggtgctgag gtacgatgag acccgcacca
3361 ggtgcagacc ctgcgagtgt ggcggtaaac atattaggaa ccagcctgtg atgctggatg
3421 tgaccgagga gctgaggccc gatcacttgg tgctggcctg cacccgcgct gagtttggct
3481 ctagcgatga agatacagat tgaggtactg aaatgtgtgg gcgtggctta agggtgggaa
3541 agaatatata aggtgggggt ctcatgtagt tttgtatctg ttttgcagca gccgccgcca
3601 tgagcgccaa ctcgtttgat ggaagcattg tgagctcata tttgacaacg cgcatgcccc
3661 catgggccgg ggtgcgtcag aatgtgatgg gctccagcat tgatggtcgc cccgtcctgc
3721 ccgcaaactc tactaccttg acctacgaga ccgtgtctgg aacgccgttg gagactgcag
3781 cctccgccgc cgcttcagcc gctgcagcca ccgcccgcgg gattgtgact gactttgctt
3841 tcctgagccc gcttgcaagc agtgcagctt cccgttcatc cgcccgcgat gacaagttga
3901 cggctctttt ggcacaattg gattctttga cccgggaact taatgtcgtt tctcagcagc
3961 tgttggatct gcgccagcag gtttctgccc tgaaggcttc ctcccctccc aatgcggttt
4021 aaaacataaa taaaaaccag actctgtttg gattttgatc aagcaagtgt cttgctgtct
4081 ttatttaggg gttttgcgcg cgcggtaggc ccgggaccag cggtctcggt cgttgagggt
4141 cctgtgtatt ttttccagga cgtggtaaag gtgactctgg atgttcagat acatgggcat
4201 aagcccgtct ctggggtgga ggtagcacca ctgcagagct tcatgctgcg gggtggtgtt
4261 gtagatgatc cagtcgtagc aggagcgctg ggcgtggtgc ctaaaaatgt ctttcagtag
4321 caagctgatt gccaggggca ggcccttggt gtaagtgttt acaaagcggt taagctggga
4381 tgggtgcata cgtggggata tgagatgcat cttggactgt atttttaggt tggctatgtt
4441 cccagccata tccctccggg gattcatgtt gtgcagaacc accagcacag tgtatccggt
4501 gcacttggga aatttgtcat gtagcttaga aggaaatgcg tggaagaact tggagacgcc
4561 cttgtgacct ccgagatttt ccatgcattc gtccataatg atggcaatgg gcccacgggc
4621 ggcggcctgg gcgaagatat ttctgggatc actaacgtca tagttgtgtt ccaggatgag
4681 atcgtcatag gccattttta caaagcgcgg gcggagggtg ccagactgcg gtataatggt
4741 tccatccggc ccaggggcgt agttaccctc acagatttgc atttcccacg ctttgagttc
4801 agatgggggg atcatgtcta cctgcggggc gatgaagaaa accgtttccg gggtagggga
4861 gatcagctgg gaagaaagca ggttcctgag cagctgcgac ttaccgcagc cggtgggccc
4921 gtaaatcaca cctattaccg gctgcaactg gtagttaaga gagctgcagc tgccgtcatc
4981 cctgagcagg ggggccactt cgttaagcat gtccctgact tgcatgtttt ccctgaccaa
5041 atgcgccaga aggcgctcgc cgcccagcga tagcagttct tgcaaggaag caaagttttt
5101 caacggtttg aggccgtccg ccgtaggcat gcttttgagc gtttgaccaa gcagttccag
5161 gcggtcccac agctcggtca cgtgctctac ggcatctcga tccagcatat ctcctcgttt
5221 cgcgggttgg ggcggctttc gctgtacggc agtagtcggt gctcgtccag acgggccagg
5281 gtcatgtctt tccacgggcg cagggtcctc gtcagcgtag tctgggtcac ggtgaagggg
5341 tgcgctccgg gctgcgcgct ggccagggtg cgcttgaggc tggtcctgct ggtgctgaag
5401 cgctgccggt cttcgccctg cgcgtcggcc aggtagcatt tgaccatggt gtcatagtcc
5461 agcccctccg cggcgtggcc cttggcgcgc agcttgccct tggaggaggc gccgcacgag
5521 gggcagtgca gacttttaag ggcgtagagc ttgggcgcga gaaataccga ttccggggag
5581 taggcatccg cgccgcaggc cccgcagacg gtctcgcatt ccacgagcca ggtgagctct
5641 ggccgttcgg ggtcaaaaac caggtttccc ccatgctttt tgatgcgttt cttacctctg
5701 gtttccatga gccggtgtcc acgctcggtg acgaaaaggc tgtccgtgtc cccgtataca
5761 gacttgagag gcctgtcctc gagcggtgtt ccgcggtcct cctcgtatag aaactcggac
5821 cactctgaga cgaaggctcg cgtccaggcc agcacgaagg aggctaagtg ggaggggtag
5881 cggtcgttgt ccactagggg gtccactcgc tccagggtgt gaagacacat gtcgccctct
5941 tcggcatcaa ggaaggtgat tggtttatag gtgtaggcca cgtgaccggg tgttcctgaa
6001 ggggggctat aaaagggggt gggggcgcgt tcgtcctcac tctcttccgc atcgctgtct
6061 gcgagggcca gctgttgggg tgagtactcc ctctcaaaag cgggcatgac ttctgcgcta
6121 agattgtcag tttccaaaaa cgaggaggat ttgatattca cctggcccgc ggtgatgcct
6181 ttgagggtgg ccgcgtccat ctggtcagaa aagacaatct ttttgttgtc aagcttggtg
6241 gcaaacgacc cgtagagggc gttggacagc aacttggcga tggagcgcag ggtttggttt
6301 ttgtcgcgat cggcgcgctc cttggccgcg atgtttagct gcacgtattc gcgcgcaacg
6361 caccgccatt cgggaaagac ggtggtgcgc tcgtcgggca ccaggtgcac gcgccaaccg
6421 cggttgtgca gggtgacaag gtcaacgctg gtggctacct ctccgcgtag gcgctcgttg
6481 gtccagcaga ggcggccgcc cttgcgcgaa cagaatggcg gtagtgggtc tagctgcgtc
6541 tcgtccgggg ggtctgcgtc cacggtaaag accccgggca gcaggcgcgc gtcgaagtag
6601 tctatcttgc atccttgcaa gtctagcgcc tgctgccatg cgcgggcggc aagcgcgcgc
6661 tcgtatgggt tgagtggggg accccatggc atggggtggg tgagcgcgga ggcgtacatg
6721 ccgcaaatgt cgtaaacgta gaggggctct ctgagtattc caagatatgt agggtagcat
6781 cttccaccgc ggatgctggc gcgcacgtaa tcgtatagtt cgtgcgaggg agcgaggagg
6841 tcgggaccga ggttgctacg ggcgggctgc tctgctcgga agactatctg cctgaagatg
6901 gcatgtgagt tggatgatat ggttggacgc tggaagacgt tgaagctggc gtctgtgaga
6961 cctaccgcgt cacgcacgaa ggaggcgtag gagtcgcgca gcttgttgac cagctcggcg
7021 gtgacctgca cgtctagggc gcagtagtcc agggtttcct tgatgatgtc atacttatcc
7081 tgtccctttt ttttccacag ctcgcggttg aggacaaact cttcgcggtc tttccagtac
7141 tcttggatcg gaaacccgtc ggcctccgaa cggtaagagc ctagcatgta gaactggttg
7201 acggcctggt aggcgcagca tcccttttct acgggtagcg cgtatgcctg cgcggccttc
7261 cggagcgagg tgtgggtgag cgcaaaggtg tccctaacca tgactttgag gtactggtat
7321 ttgaagtcag tgtcgtcgca tccgccctgc tcccagagca aaaagtccgt gcgctttttg
7381 gaacgcgggt ttggcagggc gaaggtgaca tcgttgaaaa gtatctttcc cgcgcgaggc
7441 ataaagttgc gtgtgatgcg gaagggtccc ggcacctcgg aacggttgtt aattacctgg
7501 gcggcgagca cgatctcgtc gaagccgttg atgttgtggc ccacgatgta aagttccaag
7561 aagcgcgggg tgcccttgat ggagggcaat tttttaagtt cctcgtaggt gagctcctca
7621 ggggagctga gcccgtgttc tgacagggcc cagtctgcaa gatgagggtt ggaagcgacg
7681 aatgagctcc acaggtcacg ggccattagc atttgcaggt ggtcgcgaaa ggtcctaaac
7741 tggcgaccta tggccatttt ttctggggtg atgcagtaga aggtaagcgg gtcttgttcc
7801 cagcggtccc atccaaggtc cacggctagg tctcgcgcgg cggtcaccag aggctcatct
7861 ccgccgaact tcataaccag catgaagggc acgagctgct tcccaaaggc ccccatccaa
7921 gtataggtct ctacatcgta ggtgacaaag agacgctcgg tgcgaggatg cgagccgatc
7981 gggaagaact ggatctcccg ccaccagttg gaggagtggc tgttgatgtg gtgaaagtag
8041 aagtccctgc gacgggccga acactcgtgc tggcttttgt aaaaacgtgc gcagtactgg
8101 cagcggtgca cgggctgtac atcctgcacg aggttgacct gacgaccgcg cacaaggaag
8161 cagagtggga atttgagccc ctcgcctggc gggtttggct ggtggtcttc tacttcggct
8221 gcttgtcctt gaccgtctgg ctgctcgagg ggagttatgg tggatcggac caccacgccg
8281 cgcgagccca aagtccagat gtccgcgcgc ggcggtcgga gcttgatgac aacatcgcgc
8341 agatgggagc tgtccatggt ctggagctcc cgcggcgaca ggtcaggcgg gagctcctgc
8401 aggtttacct cgcatagccg ggtcagggcg cgggctaggt ccaggtgata cctgatttcc
8461 aggggctggt tggtggcggc gtcgatgact tgcaagaggc cgcatccccg cggcgcgact
8521 acggtaccgc gcggcgggcg gtgggccgcg ggggtgtcct tggatgatgc atctaaaagc
8581 ggtgacgcgg gcgggccccc ggaggtaggg ggggctcggg acccgccggg agagggggca
8641 ggggcacgtc ggcgccgcgc gcgggcagga gctggtgctg cgcgcggagg ttgctggcga
8701 acgcgacgac gcggcggttg atctcctgaa tctggcgcct ctgcgtgaag acgacgggcc
8761 cggtgagctt gaacctgaaa gagagttcga cagaatcaat ttcggtgtcg ttgacggcgg
8821 cctggcgcaa aatctcctgc acgtctcctg agttgtcttg ataggcgatt tcggccatga
8881 actgctcgat ctcttcctcc tggagatctc cgcgtccggc tcgctccacg gtggcggcga
8941 ggtcgttgga gatgcgggcc atgagctgcg agaaggcgtt gaggcctccc tcgttccaga
9001 cgcggctgta gaccacgccc ccttcggcat cgcgggcgcg catgaccacc tgcgcgagat
9061 tgagctccac gtgccgggcg aagacggcgt agtttcgcag gcgctgaaag aggtagttga
9121 gggtggtggc ggtgtgttct gccacgaaga agtacataac ccagcgtcgc aacgtggatt
9181 cgttgatatc ccccaaggcc tcaaggcgct ccatggcctc gtagaagtcc acggcgaagt
9241 tgaaaaactg ggagttgcgc gccgacacgg ttaactcctc ctccagaaga cggatgagct
9301 cggcgacagt gtcgcgcacc tcgcgctcaa aggctacagg ggcctcttct tcttcaatct
9361 cctcttccat aagggcctcc ccttcttctt cttcttctgg cggcggtggg ggagggggga
9421 cacggcggcg acgacggcgc accgggaggc ggtcgacaaa gcgctcgatc atctccccgc
9481 ggcgacggcg catggtctcg gtgacggcgc ggccgttctc gcgggggcgc agttggaaga
9541 cgccgcccgt catgtcccgg ttatgggttg gcggggggct gccgtgcggc agggatacgg
9601 cgctaacgat gcatctcaac aattgttgtg taggtactcc gccaccgagg gacctgagcg
9661 agtccgcatc gaccggatcg gaaaacctct cgagaaaggc gtctaaccag tcacagtcgc
9721 aaggtaggct gagcaccgtg gcgggcggca gcgggtggcg gtcggggttg tttctggcgg
9781 aggtgctgct gatgatgtaa ttaaagtagg cggtcttgag acggcggatg gtcgacagaa
9841 gcaccatgtc cttgggtccg gcctgctgaa tgcgcaggcg gtcggccatg ccccaggctt
9901 cgttttgaca tcggcgcagg tctttgtagt agtcttgcat gagcctttct accggcactt
9961 cttcttctcc ttcctcttgt cctgcatctc ttgcatctat cgctacggcg gcggcggagt
10021 ttggccgtag gtggcgccct cttcctccca tgcgtgtgac cccgaagccc ctcatcggct
10081 gaagcagggc caggtcggcg acaacgcgct cggctaatat ggcctgctgc acctgcgtga
10141 gggtagactg gaagtcatcc atgtccacaa agcggtggta tgcgcccgtg ttgatggtgt
10201 aagtgcagtt ggccataacg gaccagttaa cggtctggtg acccggctgc gagagctcgg
10261 tgtacctgag acgcgagtaa gcccttgagt caaagacgta gtcgttgcaa gtccgcacca
10321 ggtactgata tcccaccaaa aagtgcggcg gcggctggcg gtagaggggc cagcgtaggg
10381 tggccggggc tccgggggcg aggtcttcca acataaggcg atgatatccg tagatgtacc
10441 tggacatcca ggtgatgccg gcggcggtgg tggaggcgcg cggaaagtcg cggacgcggt
10501 tccagatgtt gcgcagcggc aaaaagtgct ccatggtcgg gacgctctgg ccggtgaggc
10561 gtgcgcagtc gttgacgctc tagaccgtgc aaaaggagag cctgtaagcg ggcactcttc
10621 cgtggtctgg tggataaatt cgcaagggta tcatggcgga cgaccggggt tcgaaccccg
10681 gatccggccg tccgccgtga tccatgcggt taccgcccgc gtgtcgaacc caggtgtgcg
10741 acgtcagaca acgggggagc gctccttttg gcttccttcc aggcgcggcg gctgctgcgc
10801 tagctttttt ggccactggc cgcgcgcggc gtaagcggtt aggctggaaa gcgaaagcat
10861 taagtggctc gctccctgta gccggagggt tattttccaa gggttgagtc gcaggacccc
10921 cggttcgagt ctcgggccgg ccggactgcg gcgaacgggg gtttgcctcc ccgtcatgca
10981 agaccccgct tgcaaattcc tccggaaaca gggacgagcc ccttttttgc ttttcccaga
11041 tgcatccggt gctgcggcag atgcgccccc ctcctcagca gcggcaagag caagagcagc
11101 ggcagacatg cagggcaccc tccccttctc ctaccgcgtc aggaggggca acatccgcgg
11161 ctgacgcggc ggcagatggt gattacgaac ccccgcggcg ccgggcccgg cactacctgg
11221 acttggagga gggcgagggc ctggcgcggc taggagcgcc ctctcctgag cgacacccaa
11281 gggtgcagct gaagcgtgac acgcgcgagg cgtacgtgcc gcggcagaac ctgtttcgcg
11341 accgcgaggg agaggagccc gaggagatgc gggatcgaaa gttccacgca gggcgcgagt
11401 tgcggcatgg cctgaaccgc gagcggttgc tgcgcgagga ggactttgag cccgacgcgc
11461 ggaccgggat tagtcccgcg cgcgcacacg tggcggccgc cgacctggta accgcgtacg
11521 agcagacggt gaaccaggag attaactttc aaaaaagctt taacaaccac gtgcgcacgc
11581 ttgtggcgcg cgaggaggtg gctataggac tgatgcatct gtgggacttt gtaagcgcgc
11641 tggagcaaaa cccaaatagc aagccgctca tggcgcagct gttccttata gtgcagcaca
11701 gcagggacaa cgaggcattc agggatgcgc tgctaaacat agtagagccc gagggccgct
11761 ggctgctcga tttgataaac attctgcaga gcatagtggt gcaggagcgc agcttgagcc
11821 tggctgacaa ggtggccgcc attaactatt ccatgctcag tctgggcaag ttttacgccc
11881 gcaagatata ccatacccct tacgttccca tagacaagga ggtaaagatc gaggggttct
11941 acatgcgcat ggcgttgaag gtgcttacct tgagcgacga cctgggcgtt tatcgcaacg
12001 agcgcatcca caaggccgtg agcgtgagcc ggcggcgcga gctcagcgac cgcgagctga
12061 tgcacagcct gcaaagggcc ctggctggca cgggcagcgg cgatagagag gccgagtcct
12121 actttgacgc gggcgctgac ctgcgctggg ccccaagccg acgcgccctg gaggcagctg
12181 gggccggacc tgggctggcg gtggcacccg cgcgcgctgg caacgtcggc ggcgtggagg
12241 aatatgacga ggacgatgag tacgagccag aggacggcga gtactaagcg gtgatgtttc
12301 tgatcagatg atgcaagacg caacggaccc ggcggtgcgg gcggcgctgc agagccagcc
12361 gtccggcctt aactccacgg acgactggcg ccaggtcatg gaccgcatca tgtcgctgac
12421 tgcgcgtaac cctgacgcgt tccggcagca gccgcaggcc aaccggctct ccgcaattct
12481 ggaagcggtg gtcccggcgc gcgcaaaccc cacgcacgag aaggtgctgg cgatcgtaaa
12541 cgcgctggcc gaaaacaggg ccatccggcc cgatgaggcc ggcctggtct acgacgcgct
12601 gcttcagcgc gtggctcgtt acaacagcgg caacgtgcag accaacctgg accggctggt
12661 gggggatgtg cgcgaggccg tggcgcagcg tgagcgcgcg cagcagcagg gcaacctggg
12721 ctccatggtt gcactaaacg ccttcctgag tacacagccc gccaacgtgc cgcggggaca
12781 ggaggactac accaactttg tgagcgcact gcggctaatg gtgactgaga caccgcaaag
12841 tgaggtgtac cagtccgggc cagactattt tttccagacc agtagacaag gcctgcagac
12901 cgtaaacctg agccaggctt tcaagaactt gcaggggctg tggggggtgc gggctcccac
12961 aggcgaccgc gcgaccgtgt ctagcttgct gacgcccaac tcgcgcctgt tgctgctgct
13021 aatagcgccc ttcacggaca gtggcagcgt gtcccgggac acatacctag gtcacttgct
13081 gacactgtac cgcgaggcca taggtcaggc gcatgtggac gagcatactt tccaggagat
13141 tacaagtgtc agccgcgcgc tggggcagga ggacacgggc agcctggagg caaccctgaa
13201 ctacctgctg accaaccggc ggcagaagat cccctcgttg cacagtttaa acagcgagga
13261 ggagcgcatc ttgcgctatg tgcagcagag cgtgagcctt aacctgatgc gcgacggggt
13321 aacgcccagc gtggcgctgg acatgaccgc gcgcaacatg gaaccgggca tgtatgcctc
13381 aaaccggccg tttatcaatc gcctaatgga ctacttgcat cgcgcggccg ccgtgaaccc
13441 cgagtatttc accaatgcca tcttgaaccc gcactggcta ccgccccctg gtttctacac
13501 cgggggattt gaggtgcccg agggtaacga tggattcctc tgggacgaca tagacgacag
13561 cgtgttttcc ccgcaaccgc agaccctgct agagttgcaa cagcgcgagc aggcagaggc
13621 ggcgctgcga aaggaaagct tccgcaggcc aagcagcttg tccgatctag gcgctgcggc
13681 cccgcggtca gatgcgagta gcccatttcc aagcttgata gggtctttta ccagcactcg
13741 caccacccgc ccgcgcctgc tgggcgagga ggagtaccta aacaactcgc tgctgcagcc
13801 gcagcgcgaa aagaacctgc ctccggcatt tcccaacaac gggatagaga gcctagtgga
13861 caagatgagt agatggaaga cgtatgcgca ggagcacagg gatgtgcccg gcccgcgccc
13921 gcccacccgt cgtcaaaggc acgaccgtca gcggggtctg gtgtgggagg acgatgactc
13981 ggcagacgac agcagcgtcc tggatttggg agggagtggc aacccgtttg cgcaccttcg
14041 ccccaggctg gggagaatgt tttaaaaaaa aaaaaaaaaa gcatgatgca aaataaaaaa
14101 ctcaccaagg ccatggcacc gagcgttggt tttcttgtat tccccttagt atgcagcgcg
14161 cggcgatgta tgaggaaggt cctcctccct cctacgagag cgtggtgagc gcggcgccag
14221 tggcggcggc gctgggttcc cccttcgatg ctcccctgga cccgccgttt gtgcctccgc
14281 ggtacctgcg gcctaccggg gggagaaaca gcatccgtta ctctgagttg gcacccctat
14341 tcgacaccac ccgtgtgtac cttgtggaca acaagtcaac ggatgtggca tccctgaact
14401 accagaacga ccacagcaac tttctaacca cggtcattca aaacaatgac tacagcccgg
14461 gggaggcaag cacacagacc atcaatcttg acgaccgttc gcactggggc ggcgacctga
14521 aaaccatcct gcataccaac atgccaaatg tgaacgagtt catgtttacc aataagttta
14581 aggcgcgggt gatggtgtcg cgctcgctta ctaaggacaa acaggtggag ctgaaatatg
14641 agtgggtgga gttcacgctg cccgagggca actactccga gaccatgacc atagacctta
14701 tgaacaacgc gatcgtggag cactacttga aagtgggcag gcagaacggg gttctggaaa
14761 gcgacatcgg ggtaaagttt gacacccgca acttcagact ggggtttgac ccagtcactg
14821 gtcttgtcat gcctggggta tatacaaacg aagccttcca tccagacatc attttgctgc
14881 caggatgcgg ggtggacttc acccacagcc gcctgagcaa cttgttgggc atccgcaagc
14941 ggcaaccctt ccaggagggc tttaggatca cctacgatga cctggagggt ggtaacattc
15001 ccgcactgtt ggatgtggac gcctaccagg caagcttaaa agatgacacc gaacagggcg
15061 gggatggcgc aggcggcggc aacaacagtg gcagcggcgc ggaagagaac tccaacgcgg
15121 cagccgcggc aatgcagccg gtggaggaca tgaacgatca tgccattcgc ggcgacacct
15181 ttgccacacg ggcggaggag aagcgcgctg aggccgaggc agcggcagaa gctgccgccc
15241 ccgctgcgca acccgaggtc gagaagcctc agaagaaacc ggtgatcaaa cccctgacag
15301 aggacagcaa gaaacgcagt tacaacctaa taagcaatga cagcaccttc acccagtacc
15361 gcagctggta ccttgcatac aactacggcg accctcagac cgggatccgc tcatggaccc
15421 tcctttgcac tcctgacgta acctgcggct cggagcaggt ctactggtcg ttgccagaca
15481 tgatgcaaga ccccgtgacc ttccgctcca cgagccagat cagcaacttt ccggtggtgg
15541 gcgccgagct gttgcccgtg cactccaaga gcttctacaa cgaccaggcc gtctactccc
15601 agctcatccg ccagtttacc tctctgaccc acgtgttcaa tcgctttccc gagaaccaga
15661 ttttggcgcg cccgccagcc cccaccatca ccaccgtcag tgaaaacgtt cctgctctca
15721 cagatcacgg gacgctaccg ctgcgcaaca gcatcggagg agtccagcga gtgaccatta
15781 ctgacgccag acgccgcacc tgcccctacg tttacaaggc cctgggcata gtctcgccgc
15841 gcgtcctatc gagccgcact ttttgagcaa acatgtccat ccttatatcg cccagcaata
15901 acacaggctg gggcctgcgc ttcccaagca agatgtttgg cggggcaaag aagcgctccg
15961 accaacaccc agtgcgcgtg cgcgggcact accgcgcgcc ctggggcgcg cacaaacgcg
16021 gccgcactgg gcgcaccacc gtcgatgacg ccattgacgc ggtggtggag gaggcgcgca
16081 actacacgcc cacgccgcca ccagtgtcca cagtggacgc ggccattcag accgtggtgc
16141 gcggagcccg gcgttatgct aaaatgaaga gacggcggag gcgcgtagca cgtcgccacc
16201 gccgccgacc cggcactgcc gcccaacgcg cggcggcggc cctgcttaac cgcgcacgtc
16261 gcaccggccg acgggcggcc atgcgggccg ctcgaaggct ggccgcgggt attgtcactg
16321 tgccccccag gtccaggcga cgagcggccg ccgcagcagc cgcggccatt agtgctatga
16381 ctcagggtcg caggggcaac gtgtactggg tgcgcgactc ggttagcggc ctgcgcgtgc
16441 ccgtgcgcac ccgccccccg cgcaactaga ttgcaagaaa aaactactta gactcgtact
16501 gttgtatgta tccagcggcg gcggcgcgca acgaagctat gtccaagcgc aaaatcaaag
16561 aagagatgct ccaggtcatc gcgccggaga tctatggccc cccgaagaag gaagagcagg
16621 attacaagcc ccgaaagcta aagcgggtca aaaagaaaaa gaaagatgat gatgatgatg
16681 aacttgacga cgaggtggaa ctgctgcacg caaccgcgcc caggcggcgg gtacagtgga
16741 aaggtcgacg cgtaagacgt gttttgcgac ccggcaccac cgtagttttt acgcccggtg
16801 agcgctccac ccgcacctac aagcgcgtgt atgatgaggt gtacggcgac gaggacctgc
16861 ttgagcaggc caacgagcgc ctcggggagt ttgcctacgg aaagcggcat aaggacatgt
16921 tggcgttgcc gctggacgag ggcaacccaa cacctagcct aaagcccgtg acactgcagc
16981 aggtgctgcc cacgcttgca ccgtccgaag aaaagcgcgg cctaaagcgc gagtctggtg
17041 acttggcacc caccgtgcag ctgatggtac ccaagcgcca gcgactggaa gatgtcttgg
17101 aaaaaatgac cgtggagcct gggctggagc ccgaggtccg cgtgcggcca atcaagcagg
17161 tggcaccggg actgggcgtg cagaccgtgg acgttcagat acccaccacc agtagcacta
17221 gtattgccac tgccacagag ggcatggaga cacaaacgtc cccggttgcc tcggcggtgg
17281 cagatgccgc ggtgcaggcg gccgctgcgg ccgcgtccaa aacctctacg gaggtgcaaa
17341 cggacccgtg gatgtttcgc gtttcagccc cccggcgccc gcgccgttcc aggaagtacg
17401 gcaccgccag cgcactactg cccgaatatg ccctacatcc ttccatcgcg cctacccccg
17461 gctatcgtgg ctacacctac cgccccagaa gacgagcgac tacccgacgc cgaaccacca
17521 ctggaacccg ccgccgccgt cgccgtcgcc agcccgtgct ggccccgatt tccgtgcgca
17581 gggtggctcg cgaaggaggc aggaccctgg tgctgccaac agcgcgctac caccccagca
17641 tcgtttaaaa gccggtcttt gtggttcttg cagatatggc cctcacctgc cgcctccgtt
17701 tcccggtgcc gggattccga ggaagaatgc accgtaggag gggcatggcc ggccacggcc
17761 tgacgggcgg catgcgtcgt gcgcaccacc ggcggcggcg cgcgtcgcac cgtcgcatgc
17821 gcggcggtat cctgcccctc cttattccac tgatcgccgc ggcgattggc gccgtgcccg
17881 gaattgcatc cgtggccttg caggcgcaga gacactgatt aaaaacaagt tgcatgtgga
17941 aaaatcaaaa taaaaagtct ggagtctcac gctcgcttgg tcctgtaact attttgtaga
18001 atggaagaca tcaactttgc gtctctggcc ccgcgacacg gctcgcgccc gttcatggga
18061 aactggcaag atatcggcac cagcaatatg agcggtggcg ccttcagctg gggctcgctg
18121 tggagcggca ttaaaaattt cggttccacc attaagaact atggcagcaa ggcctggaac
18181 agcagcacag gccagatgct gagggacaag ttgaaagagc aaaatttcca acaaaaggtg
18241 gtagatggcc tggcctctgg cattagcggg gtggtggacc tggccaacca ggcagtgcaa
18301 aataagatta acagtaagct tgatccccgc cctcccgtag aggagcctcc accggccgtg
18361 gagacagtgt ctccagaggg gcgtggcgaa aagcgtccgc ggcccgacag ggaagaaact
18421 ctggtgacgc aaatagatga gcctccctcg tacgaggagg cactaaagca aggcctgccc
18481 accacccgtc ccatcgcgcc catggctacc ggagtgctgg gccagcacac acctgtaacg
18541 ctggacctgc ctccccccgc tgacacccag cagaaacctg tgctgccagg gccgtccgcc
18601 gttgttgtaa cccgccctag ccgcgcgtcc ctgcgccgtg ccgccagcgg tccgcgatcg
18661 atgcggcccg tagccagtgg caactggcaa agcacactga acagcatcgt gggtctgggg
18721 gtgcaatccc tgaagcgccg acgatgcttc taaatagcta acgtgtcgta tgtgtcatgt
18781 atgcgtccat gtcgccgcca gaggagctgc tgagccgccg tgcgcccgct ttccaagatg
18841 gctacccctt cgatgatgcc gcagtggtct tacatgcaca tctcgggcca ggacgcctcg
18901 gagtacctga gccccgggct ggtgcagttt gcccgcgcca ccgagacgta cttcagcctg
18961 aataacaagt ttagaaaccc cacggtggca cctacgcacg acgtaaccac agaccggtcc
19021 cagcgtttga cgctgcggtt catccctgtg gaccgcgagg ataccgcgta ctcgtacaaa
19081 gcgcggttca ccctggctgt gggtgacaac cgtgtgcttg atatggcttc cacgtacttt
19141 gacatccgcg gcgtgctgga cagggggcct acttttaagc cctactccgg cactgcctac
19201 aacgctctag ctcccaaggg cgctcctaac tcctgtgagt gggaacaaac cgaagatagc
19261 ggccgggcag ttgccgagga tgaagaagag gaagatgaag atgaagaaga ggaagaagaa
19321 gagcaaaacg ctcgagatca ggctactaag aaaacacatg tctatgccca ggctcctttg
19381 tctggagaaa caattacaaa aagcgggcta caaataggat cagacaatgc agaaacacaa
19441 gctaaacctg tatacgcaga tccttcctat caaccagaac ctcaaattgg cgaatctcag
19501 tggaacgaag ctgatgctaa tgcggcagga gggagagtgc ttaaaaaaac aactcccatg
19561 aaaccatgct atggatctta tgccaggcct acaaatcctt ttggtggtca atccgttctg
19621 gttccggatg aaaaaggggt gcctcttcca aaggttgact tgcaattctt ctcaaatact
19681 acctctttga acgaccggca aggcaatgct actaaaccaa aagtggtttt gtacagtgaa
19741 gatgtaaata tggaaacccc agacacacat ctgtcttaca aacctggaaa aggtgatgaa
19801 aattctaaag ctatgttggg tcaacaatct atgccaaaca gacccaatta cattgctttc
19861 agggacaatt ttattggcct aatgtattat aacagcactg gcaacatggg tgttcttgct
19921 ggtcaggcat cgcagctaaa tgccgtggta gatttgcaag acagaaacac agagctgtcc
19981 tatcaactct tgcttgattc cataggtgat agaaccagat atttttctat gtggaatcag
20041 gctgtagaca gctatgatcc agatgttaga atcattgaaa accatggaac tgaggatgaa
20101 ttgccaaatt attgttttcc tcttgggggt attggggtaa ctgacaccta tcaagctatt
20161 aaggctaatg gcaatggctc aggcgataat ggagatacta catggacaaa agatgaaact
20221 tttgcaacac gtaatgaaat aggagtgggt aacaactttg ccatggaaat taacctaaat
20281 gccaacctat ggagaaattt cctttactcc aatattgcgc tgtacctgcc agacaagcta
20341 aaatacaacc ccaccaatgt ggaaatatct gacaacccca acacctacga ctacatgaac
20401 aagcgagtgg tggctcccgg gcttgtagac tgctacatta accttggggc gcgctggtct
20461 ctggactaca tggacaacgt taatcccttt aaccaccacc gcaatgcggg cctccgttat
20521 cgctccatgt tgttgggaaa cggccgctac gtgccctttc acattcaggt gccccaaaag
20581 ttttttgcca ttaaaaacct cctcctcctg ccaggctcat atacatatga atggaacttc
20641 aggaaggatg ttaacatggt tctgcagagc tctctgggaa acgatcttag agttgacggg
20701 gctagcatta agtttgacag catttgtctt tacgccacct tcttccccat ggcccacaac
20761 acggcctcca cgctggaagc catgctcaga aatgacacca acgaccagtc ctttaatgac
20821 tacctttccg ccgccaacat gctatacccc atacccgcca acgccaccaa cgtgcccatc
20881 tccatcccat cgcgcaactg ggcagcattt cgcggttggg ccttcacacg cttgaagaca
20941 aaggaaaccc cttccctggg atcaggctac gacccttact acacctactc tggctccata
21001 ccataccttg acggaacctt ctatcttaat cacaccttta agaaggtggc cattaccttt
21061 gactcttctg ttagctggcc gggcaacgac cgcctgctta ctcccaatga gtttgagatt
21121 aaacgctcag ttgacgggga gggctacaac gtagctcagt gcaacatgac caaggactgg
21181 ttcctggtgc agatgttggc caactacaat attggctacc agggcttcta cattccagaa
21241 agctacaagg accgcatgta ctcgttcttc agaaacttcc agcccatgag ccggcaagtg
21301 gttgacgata ctaaatacaa ggagtatcag caggttggaa ttcttcacca gcataacaac
21361 tcaggattcg taggctacct cgctcccacc atgcgcgagg gacaggctta ccccgccaac
21421 gtgccctacc cactaatagg caaaaccgcg gttgacagta ttacccagaa aaagtttctt
21481 tgcgatcgca ccctttggcg catcccattc tccagtaact ttatgtccat gggcgcactc
21541 acagacctgg gccaaaacct tctctacgcc aactccgccc acgcgctaga catgactttt
21601 gaggtggatc ccatggacga gcccaccctt ctttatgttt tgtttgaagt ctttgacgtg
21661 gtccgtgtgc accagccgca ccgcggcgtc atcgagaccg tgtacctgcg cacgcccttc
21721 tcggccggca acgccacaac ataaaagaag caagcaacat caacaacagc tgccgccatg
21781 ggctccagtg agcaggaact gaaagccatt gtcaaagatc ttggttgtgg gccatatttt
21841 ttgggcacct atgacaagcg ctttccaggc tttgtttctc cacacaagct cgcctgcgcc
21901 atagtcaata cggccggtcg cgagactggg ggcgtacact ggatggcctt tgcctggaac
21961 ccgcgctcaa aaacatgcta cctctttgag ccctttggct tttctgacca acgactcaag
22021 caggtttacc agtttgagta cgagtcactc ctgcgccgta gcgccattgc ttcttccccc
22081 gaccgctgta taacgctgga aaagtccacc caaagcgtgc aggggcccaa ctcggccgcc
22141 tgtggactat tctgctgcat gtttctccac gcctttgcca actggcccca aactcccatg
22201 gatcacaacc ccaccatgaa ccttattacc ggggtaccca actccatgct taacagtccc
22261 caggtacagc ccaccctgcg tcgcaaccag gaacagctct acagcttcct ggagcgccac
22321 tcgccctact tccgcagcca cagtgcgcag attaggagcg ccacttcttt ttgtcacttg
22381 aaaaacatgt aaaaataatg tactaggaga cactttcaat aaaggcaaat gtttttattt
22441 gtacactctc gggtgattat ttacccccca cccttgccgt ctgcgccgtt taaaaatcaa
22501 aggggttctg ccgcgcatcg ctatgcgcca ctggcaggga cacgttgcga tactggtgtt
22561 tagtgctcca cttaaactca ggcacaacca tccgcggcag ctcggtgaag ttttcactcc
22621 acaggctgcg caccatcacc aacgcgttta gcaggtcggg cgccgatatc ttgaagtcgc
22681 agttggggcc tccgccctgc gcgcgcgagt tgcgatacac agggttgcag cactggaaca
22741 ctatcagcgc cgggtggtgc acgctggcca gcacgctctt gtcggagatc agatccgcgt
22801 ccaggtcctc cgcgttgctc agggcgaacg gagtcaactt tggtagctgc cttcccaaaa
22861 agggtgcatg cccaggcttt gagttgcact cgcaccgtag tggcatcaga aggtgaccgt
22921 gcccggtctg ggcgttagga tacagcgcct gcatgaaagc cttgatctgc ttaaaagcca
22981 cctgagcctt tgcgccttca gagaagaaca tgccgcaaga cttgccggaa aactgattgg
23041 ccggacaggc cgcgtcatgc acgcagcacc ttgcgtcggt gttggagatc tgcaccacat
23101 ttcggcccca ccggttcttc acgatcttgg ccttgctaga ctgctccttc agcgcgcgct
23161 gcccgttttc gctcgtcaca tccatttcaa tcacgtgctc cttatttatc ataatgctcc
23221 cgtgtagaca cttaagctcg ccttcgatct cagcgcagcg gtgcagccac aacgcgcagc
23281 ccgtgggctc gtggtgcttg taggttacct ctgcaaacga ctgcaggtac gcctgcagga
23341 atcgccccat catcgtcaca aaggtcttgt tgctggtgaa ggtcagctgc aacccgcggt
23401 gctcctcgtt tagccaggtc ttgcatacgg ccgccagagc ttccacttgg tcaggcagta
23461 gcttgaagtt tgcctttaga tcgttatcca cgtggtactt gtccatcaac gcgcgcgcag
23521 cctccatgcc cttctcccac gcagacacga tcggcaggct cagcgggttt atcaccgtgc
23581 tttcactttc cgcttcactg gactcttcct tttcctcttg cgtccgcata ccccgcgcca
23641 ctgggtcgtc ttcattcagc cgccgcaccg tgcgcttacc tcccttgccg tgcttgatta
23701 gcaccggtgg gttgctgaaa cccaccattt gtagcgccac atcttctctt tcttcctcgc
23761 tgtccacgat cacctctggg gatggcgggc gctcgggctt gggagagggg cgcttctttt
23821 tctttttgga cgcaatggcc aaatccgccg tcgaggtcga tggccgcggg ctgggtgtgc
23881 gcggcaccag cgcatcttgt gacgagtctt cttcgtcctc ggactcgaga cgccgcctca
23941 gccgcttttt tgggggcgcg cggggaggcg gcggcgacgg cgacggggac gacacgtcct
24001 ccatggttgg tggacgtcgc gccgcaccgc gtccgcgctc gggggtggtt tcgcgctgct
24061 cctcttcccg actggccatt tccttctcct ataggcagaa aaagatcatg gagtcagtcg
24121 agaaggagga cagcctaacc gccccctttg agttcgccac caccgcctcc accgatgccg
24181 ccaacgcgcc taccaccttc cccgtcgagg cacccccgct tgaggaggag gaagtgatta
24241 tcgagcagga cccaggtttt gtaagcgaag acgacgagga tcgctcagta ccaacagagg
24301 ataaaaagca agaccaggac gacgcagagg caaacgagga acaagtcggg cggggggacc
24361 aaaggcatgg cgactaccta gatgtgggag acgacgtgct gttgaagcat ctgcagcgcc
24421 agtgcgccat tatctgcgac gcgttgcaag agcgcagcga tgtgcccctc gccatagcgg
24481 atgtcagcct tgcctacgaa cgccacctgt tctcaccgcg cgtacccccc aaacgccaag
24541 aaaacggcac atgcgagccc aacccgcgcc tcaacttcta ccccgtattt gccgtgccag
24601 aggtgcttgc cacctatcac atctttttcc aaaactgcaa gataccccta tcctgccgtg
24661 ccaaccgcag ccgagcggac aagcagctgg ccttgcggca gggcgctgtc atacctgata
24721 tcgcctcgct cgacgaagtg ccaaaaatct ttgagggtct tggacgcgac gagaaacgcg
24781 cggcaaacgc tctgcaacaa gaaaacagcg aaaatgaaag tcactgtgga gtgctggtgg
24841 aacttgaggg tgacaacgcg cgcctagccg tgctgaaacg cagcatcgag gtcacccact
24901 ttgcctaccc ggcacttaac ctacccccca aggttatgag cacagtcatg agcgagctga
24961 tcgtgcgccg tgcacgaccc ctggagaggg atgcaaactt gcaagaacaa accgaggagg
25021 gcctacccgc agttggcgat gagcagctgg cgcgctggct tgagacgcgc gagcctgccg
25081 acttggagga gcgacgcaag ctaatgatgg ccgcagtgct tgttaccgtg gagcttgagt
25141 gcatgcagcg gttctttgct gacccggaga tgcagcgcaa gctagaggaa acgttgcact
25201 acacctttcg ccagggctac gtgcgccagg cctgcaaaat ttccaacgtg gagctctgca
25261 acctggtctc ctaccttgga attttgcacg aaaaccgcct cgggcaaaac gtgcttcatt
25321 ccacgctcaa gggcgaggcg cgccgcgact acgtccgcga ctgcgtttac ttatttctgt
25381 gctacacctg gcaaacggcc atgggcgtgt ggcagcaatg cctggaggag cgcaacctaa
25441 aggagctgca gaagctgcta aagcaaaact tgaaggacct atggacggcc ttcaacgagc
25501 gctccgtggc cgcgcacctg gcggacatta tcttccccga acgcctgctt aaaaccctgc
25561 aacagggtct gccagacttc accagtcaaa gcatgttgca aaactttagg aactttatcc
25621 tagagcgttc aggaattctg cccgccacct gctgtgcgct tcctagcgac tttgtgccca
25681 ttaagtaccg tgaatgccct ccgccgcttt ggggtcactg ctaccttctg cagctagcca
25741 actaccttgc ctaccactcc gacatcatgg aagacgtgag cggtgacggc ctactggagt
25801 gtcactgtcg ctgcaaccta tgcaccccgc accgctccct ggtctgcaat tcgcaactgc
25861 ttagcgaaag tcaaattatc ggtacctttg agctgcaggg tccctcgcct gacgaaaagt
25921 ccgcggctcc ggggttgaaa ctcactccgg ggctgtggac gtcggcttac cttcgcaaat
25981 ttgtacctga ggactaccac gcccacgaga ttaggttcta cgaagaccaa tcccgcccgc
26041 caaatgcgga gcttaccgcc tgcgtcatta cccagggcca catccttggc caattgcaag
26101 ccatcaacaa agcccgccaa gagtttctgc tacgaaaggg acggggggtt tacctggacc
26161 cccagtccgg cgaggagctc aacccaatcc ccccgccgcc gcagccctat cagcagccgc
26221 gggcccttgc ttcccaggat ggcacccaaa aagaagctgc agctgccgcc gccgccaccc
26281 acggacgagg aggaatactg ggacagtcag gcagaggagg ttttggacga ggaggaggag
26341 atgatggaag actgggacag cctagacgaa gcttccgagg ccgaagaggt gtcagacgaa
26401 acaccgtcac cctcggtcgc attcccctcg ccggcgcccc agaaattggc aaccgttccc
26461 agcatcgcta caacctccgc tcctcaggcg ccgccggcac tgcctgttcg ccgacccaac
26521 cgtagatggg acaccactgg aaccagggcc ggtaagtcta agcagccgcc gccgttagcc
26581 caagagcaac aacagcgcca aggctaccgc tcgtggcgcg ggcacaagaa cgccatagtt
26641 gcttgcttgc aagactgtgg gggcaacatc tccttcgccc gccgctttct tctctaccat
26701 cacggcgtgg ccttcccccg taacatcctg cattactacc gtcatctcta cagcccctac
26761 tgcaccggcg gcagcggcag cggcagcaac agcagcggtc acacagaagc aaaggcgacc
26821 ggatagcaag actctgacaa agcccaagaa atccacagcg gcggcagcag caggaggagg
26881 agcgctgcgt ctggcgccca acgaacccgt atcgacccgc gagcttagaa ataggatttt
26941 tcccactctg tatgctatat ttcaacaaag caggggccaa gaacaagagc tgaaaataaa
27001 aaacaggtct ctgcgctccc tcacccgcag ctgcctgtat cacaaaagcg aagatcagct
27061 tcggcgcacg ctggaagacg cggaggctct cttcagcaaa tactgcgcgc tgactcttaa
27121 ggactagttt cgcgcccttt ctcaaattta agcgcgaaaa ctacgtcatc tccagcggcc
27181 acacccggcg ccagcacctg tcgtcagcgc cattatgagc aaggaaattc ccacgcccta
27241 catgtggagt taccagccac aaatgggact tgcggctgga gctgcccaag actactcaac
27301 ccgaataaac tacatgagcg cgggacccca catgatatcc cgggtcaacg gaatccgcgc
27361 ccaccgaaac cgaattctcc tcgaacaggc ggctattacc accacacctc gtaataacct
27421 taatccccgt agttggcccg ctgccctggt gtaccaggaa agtcccgctc ccaccactgt
27481 ggtacttccc agagacgccc aggccgaagt tcagatgact aactcagggg cgcagcttgc
27541 gggcggcttt cgtcacaggg tgcggtcgcc cgggcagggt ataactcacc tgaaaatcag
27601 agggcgaggt attcagctca acgacgagtc ggtgagctcc tctcttggtc tccgtccgga
27661 cgggacattt cagatcggcg gcgctggccg ctcttcattt acgccccgtc aggcgatcct
27721 aactctgcag acctcgtcct cggagccgcg ctccggaggc attggaactc tacaatttat
27781 tgaggagttc gtgccttcgg tttacttcaa ccccttttct ggacctcccg gccactaccc
27841 ggaccagttt attcccaact ttgacgcggt gaaagactcg gcggacggct acgactgaat
27901 gaccagtgga gaggcagagc gactgcgcct gacacacctc gaccactgcc gccgccacaa
27961 gtgctttgcc cgcggctccg gtgagttttg ttactttgaa ttgcccgaag agcatatcga
28021 gggcccggcg cacggcgtcc ggctcaccac ccaggtagag cttacacgta gcctgattcg
28081 ggagtttacc aagcgccccc tgctagtgga gcgggagcgg ggtccctgtg ttctgaccgt
28141 ggtttgcaac tgtcctaacc ctggattaca tcaagatctt tgttgtcatc tctgtgctga
28201 gtataataaa tacagaaatt agaatctact ggggctcctg tcgccatcct gtgaacgcca
28261 ccgtttttac ccacccaaag cagaccaaag caaacctcac ctccggtttg cacaagcggg
28321 ccaataagta ccttacctgg tactttaacg gctcttcatt tgtaatttac aacagtttcc
28381 agcgagacga agtaagtttg ccacacaacc ttctcggctt caactacacc gtcaagaaaa
28441 acaccaccac caccaccctc ctcacctgcc gggaacgtac gagtgcgtca ccggttgctg
28501 cgcccacacc tacagcctga gcgtaaccag acattactcc catttttcca aaacaggagg
28561 tgagctcaac tcccggaact caggtcaaaa aagcattttg cggggtgctg ggatttttta
28621 attaagtata tgagcaattc aagtaactct acaagcttgt ctaatttttc tggaattggg
28681 gtcggggtta tccttactct tgtaattctg tttattctta tactagcact tctgtgcctt
28741 agggttgccg cctgctgcac gcacgtttgt acctattgtc agctttttaa acgctggggg
28801 caacatccaa gatgaggtac atgattttag gcttgctcgc ccttgcggca gtctgcagcg
28861 ctgccaaaaa ggttgagttt aaggaaccag cttgcaatgt tacatttaaa tcagaagcta
28921 atgaatgcac tactcttata aaatgcacca cagaacatga aaagcttatt attcgccaca
28981 aagacaaaat tggcaagtat gctgtatatg ctatttggca gccaggtgac actaacgact
29041 ataatgtcac agtcttccaa ggtgaaaatc gtaaaacttt tatgtataaa tttccatttt
29101 atgaaatgtg cgatattacc atgtacatga gcaaacagta caagttgtgg cccccacaaa
29161 agtgtttaga gaacactggc accttttgtt ccaccgctct gcttattaca gcgcttgctt
29221 tggtatgtac cttactttat ctcaaataca aaagcagacg cagttttatt gatgaaaaga
29281 aaatgccttg attttccgct tgcttgtatt cccctggaca atttactcta tgtgggatat
29341 gctccaggcg ggcaagatta tacccacaac cttcaaatca aactttcctg gacgttagcg
29401 cctgatttct gccagcgcct gcactgcaaa tttgatcaaa cccagcttca gcttgcctgc
29461 tccagagatg accggctcaa ccatcgcgcc cacaacggac tatcgcaaca ccactgctac
29521 cggactaaca tctgccctaa atttacccca agttcatgcc tttgtcaatg actgggcgag
29581 cttggacatg tggtggtttt ccatagcgct tatgtttgtt tgccttatta ttatgtggct
29641 tatttgttgc ctaaagcgca gacgcgccag accccccatc tataggccta tcattgtgct
29701 caacccacac aatgaaaaaa ttcatagatt ggacggtctg aaaccatgtt ctcttctttt
29761 acagtatgat taaatgagac atgattcctc gagttcttat attattgacc cttgttgcgc
29821 ttttctgtgc gtgctctaca ttggccgcgg tcgctcacat cgaagtagat tgcatcccac
29881 ctttcacagt ttacctgctt tacggatttg tcacccttat cctcatctgc agcctcgtca
29941 ctgtagtcat cgccttcatt cagttcattg actgggtttg tgtgcgcatt gcgtacctca
30001 ggcaccatcc gcaatacaga gacaggacta tagctgatct tctcagaatt ctttaattat
30061 gaaacggagt gtcatttttg ttttgctgat tttttgcgcc ctacctgtgc tttgctccca
30121 aacctcagcg cctcccaaaa gacatatttc ctgcagattc actcaaatat ggaacattcc
30181 cagctgctac aacaaacaga gcgatttgtc agaagcctgg ttatacgcca tcatctctgt
30241 catggttttt tgcagtacca tttttgccct agccatatat ccataccttg acattggctg
30301 gaatgccata gatgccatga accaccctac tttcccagtg cccgctgtca taccactgca
30361 acaggttatt gccccaatca atcagcctcg ccccccttct cccaccccca ctgagattag
30421 ctactttaat ttgacaggtg gagatgactg aatctctaga tctagaattg gatggaatta
30481 acaccgaaca gcgcctacta gaaaggcgca aggcggcgtc cgagcgagaa cgcctaaaac
30541 aagaagttga agacatggtt aacctacacc agtgtaaaag aggtatcttt tgtgtggtca
30601 agcaggccaa acttacctac gaaaaaacca ctaccggcaa ccgcctcagc tacaagctac
30661 ccacccagcg ccaaaaactg gtgcttatgg tgggagaaaa acctatcacc gtcacccagc
30721 actcggcaga aacagagggc tgcctgcact tcccctatca gggtccagag gacctctgca
30781 ctcttattaa aaccatgtgt ggtattagag atcttattcc attcaactaa cataaacaca
30841 caataaatta cttacttaaa atcagtcagc aaatctttgt ccagcttatt cagcatcacc
30901 tcctttcctt cctcccaact ctggtatctc agccgccttt tagctgcaaa ctttctccaa
30961 agtttaaatg ggatgtcaaa ttcctcatgt tcttgtccct ccgcacccac tatcttcata
31021 ttgttgcaga tgaaacgcgc cagaccgtct gaagacacct tcaaccccgt gtatccatat
31081 gacacagaaa ccgggcctcc aactgtgccc tttcttaccc ctccatttgt ttcacccaat
31141 ggtttccaag aaagtccccc tggagttctc tctctacgcg tctccgaacc tttggacacc
31201 tcccacggca tgcttgcgct taaaatgggc agcggtctta ccctagacaa ggccggaaac
31261 ctcacctccc aaaatgtaac cactgttact cagccactta aaaaaacaaa gtcaaacata
31321 agtttggaca cctccgcacc acttacaatt acctcaggcg ccctaacagt ggcaaccacc
31381 gctcctctga tagttactag cggcgctctt agcgtacagt cacaagcccc actgaccgtg
31441 caagactcca aactaagcat tgctactaaa gggcccatta cagtgtcaga tggaaagcta
31501 gccctgcaaa catcagcccc cctctctggc agtgacagcg acacccttac tgtaactgca
31561 tcacccccgc taactactgc cacgggtagc ttgggcatta acatggaaga tcctatttat
31621 gtaaataatg gaaaaatagg aattaaaata agcggtcctt tgcaagtagc acaaaactcc
31681 gatacactaa cagtagttac tggaccaggt gtcaccgttg aacaaaactc ccttagaacc
31741 aaagttgcag gagctattgg ttatgattca tcaaacaaca tggaaattaa aacgggcggt
31801 ggcatgcgta taaataacaa cttgttaatt ctagatgtgg attacccatt tgatgctcaa
31861 acaaaactac gtcttaaact ggggcaggga cccctgtata ttaatgcatc tcataacttg
31921 gacataaact ataacagagg cctatacctt tttaatgcat caaacaatac taaaaaactg
31981 gaagttagca taaaaaaatc cagtggacta aactttgata atactgccat agctataaat
32041 gcaggaaagg gtctggagtt tgatacaaac acatctgagt ctccagatat caacccaata
32101 aaaactaaaa ttggctctgg cattgattac aatgaaaacg gtgccatgat tactaaactt
32161 ggagcgggtt taagctttga caactcaggg gccattacaa taggaaacaa aaatgatgac
32221 aaacttaccc tgtggacaac cccagaccca tctcctaact gcagaattca ttcagataat
32281 gactgcaaat ttactttggt tcttacaaaa tgtgggagtc aagtactagc tactgtagct
32341 gctttggctg tatctggaga tctttcatcc atgacaggca ccgttgcaag tgttagtata
32401 ttccttagat ttgaccaaaa cggtgttcta atggagaact cctcacttaa aaaacattac
32461 tggaacttta gaaatgggaa ctcaactaat gcaaatccat acacaaatgc agttggattt
32521 atgcctaacc ttctagccta tccaaaaacc caaagtcaaa ctgctaaaaa taacattgtc
32581 agtcaagttt acttgcatgg tgataaaact aaacctatga tacttaccat tacacttaat
32641 ggcactagtg aatccacaga aactagcgag gtaagcactt actctatgtc ttttacatgg
32701 tcctgggaaa gtggaaaata caccactgaa acttttgcta ccaactctta caccttctcc
32761 tacattgccc aggaataaag aatcgtgaac ctgttgcatg ttatgtttca acgtgtttat
32821 ttttcaattg cagaaaattt caagtcattt ttcattcagt agtatagccc caccaccaca
32881 tagcttatat tgatcaccgt accttaatca aactcacaga accctagtat tcaacctgcc
32941 acctccctcc caacacacag agtacacagt cctttctccc cggctggcct taaaaagcat
33001 catatcatgg gtaacagaca tattcttagg tgttatattc cacacggttt cctgtcgagc
33061 caaacgctca tcagtgatat taataaactc cccgggcagc tcgcttaagt tcatgtcgct
33121 gtccagctgc tgagccacag gctgctgtcc aacttgcggt tgctcaacgg gcggcgaagg
33181 ggaagtccac gcctacatgg gggtagagtc ataatcgtgc atcaggatag ggcggtggtg
33241 ctgcagcagc gcgcgaataa actgctgccg ccgccgctcc gtcctgcagg aatacaacat
33301 ggcagtggtc tcctcagcga tgattcgcac cgcccgcagc atgagacgcc ttgtcctccg
33361 ggcacagcag cgcaccctga tctcacttaa atcagcacag taactgcagc acagcaccac
33421 aatattgttc aaaatcccac agtgcaaggc gctgtatcca aagctcatgg cggggaccac
33481 agaacccacg tggccatcat accacaagcg caggtagatt aagtggcgac ccctcataaa
33541 cacgctggac ataaacatta cctcttttgg catgttgtaa ttcaccacct cccggtacca
33601 tataaacctc tgattaaaca tggcgccatc caccaccatc ctaaaccagc tggccaaaac
33661 ctgcccgccg gctatgcact gcagggaacc gggactggaa caatgacagt ggagagccca
33721 ggactcgtaa ccatggatca tcatgctcgt catgatatca atgttggcac aacacaggca
33781 cacgtgcata cacttcctca ggattacaag ctcctcccgc gtcagaacca tatcccaggg
33841 aacaacccat tcctgaatca gcgtaaatcc cacactgcag ggaagacctc gcacgtaact
33901 cacgttgtgc attgtcaaag tgttacattc gggcagcagc ggatgatcct ccagtatggt
33961 agcgcgggtc tctgtctcaa aaggaggtag gcgatcccta ctgtacggag tgcgccgaga
34021 caaccgagat cgtgttggtc gtagtgtcat gccaaatgga acgccggacg tagtcatatt
34081 tcctgaagca aaaccaggtg cgggcgtgac aaacagatct gcgtctccgg tctcgtcgct
34141 tagctcgctc tgtgtagtag ttgtagtata tccactctct caaagcatcc aggcgccccc
34201 tggcttcggg ttctatgtaa actccttcat gcgccgctgc cctgataaca tccaccaccg
34261 cagaataagc cacacccagc caacctacac attcgttctg cgagtcacac acgggaggag
34321 cgggaagagc tggaagaacc atgttttttt tttttttatt ccaaaagatt atccaaaacc
34381 tcaaaatgaa gatctattaa gtgaacgcgc tcccctccgg tggcgtggtc aaactctaca
34441 gccaaagaac agataatggc atttgtaaga tgttgcacaa tggcttccaa aaggcaaact
34501 gccctcacgt ccaagtggac gtaaaggcta aacccttcag ggtgaatctc ctctataaac
34561 attccagcac cttcaaccat gcccaaataa ttttcatctc gccaccttat caatatgtct
34621 ctaagcaaat cccgaatatt aagtccggcc attgtaaaaa tctgctccag agcgccctcc
34681 accttcagcc tcaagcagcg aatcatgatt gcaaaaattc aggttcctca cagacctgta
34741 taagattcaa aagcggaaca ttaacaaaaa taccgcgatc ccgtaggtcc cttcgcaggg
34801 ccagctgaac ataatcgtgc aggtctgcac ggaccagcgc ggccacttcc ccgccaggaa
34861 ccatgacaaa agaacccaca ctgattatga cacgcatact cggagctatg ctaaccagcg
34921 tagcccctat gtaagcttgt tgcatgggcg gcgatataaa atgcaaggtg ctgctcaaaa
34981 aatcaggcaa agcctcgcgc aaaaaagcaa gcacatcgta gtcatgctca tgcagataaa
35041 ggcaggtaag ttccggaacc accacagaaa aagacaccat ttttctctca aacatgtctg
35101 cgggttcctg cattaaacac aaaataaaat aacaaaaaaa aacatttaaa cattagaagc
35161 ctgtcttaca acaggaaaaa caacccttat aagcataaga cggactacgg ccatgccggc
35221 gtgaccgtaa aaaaactggt caccgtgatt aaaaagcacc accgacagtt cctcggtcat
35281 gtccggagtc ataatgtaag actcggtaaa cacatcaggt tggttaacat cggtcagtgc
35341 taaaaagcga ccgaaatagc ccgggggaat acatacccgc aggcgtagag acaacattac
35401 agcccccata ggaggtataa caaaattaat aggagagaaa aacacataaa cacctgaaaa
35461 accctcctgc ctaggcaaaa tagcaccctc ccgctccaga acaacataca gcgcttccac
35521 agcggcagcc ataacagtca gccttaccag taaaaaaacc tattaaaaaa caccactcga
35581 cacggcacca gctcaatcag tcacagtgta aaaagggcca agtacagagc gagtatatat
35641 aggactaaaa aatgacgtaa cggttaaagt ccacaaaaaa cacccagaaa accgcacgcg
35701 aacctacgcc cagaaacgaa agccaaaaaa cccacaactt cctcaaatct tcacttccgt
35761 tttcccacga tacgtcactt cccattttaa aaaaactaca attcccaata catgcaagtt
35821 actccgccct aaaacctacg tcacccgccc cgttcccacg ccccgcgcca cgtcacaaac
35881 tccaccccct cattatcata ttggcttcaa tccaaaataa ggtatattat gatgatg
//