LOCUS ADRCG 35937 bp DNA linear VRL 14-MAR-1996 DEFINITION Adenovirus type 2, complete genome. ACCESSION J01917 J01918-J01957 K00086 K00394 K00395 K02367 M13004 V00007-V00020 V00023 V00024 VERSION J01917.1 GI:209811 KEYWORDS DNA polymerase; DNA-binding protein; RNA polymerase III; alternative splicing; coat protein; complete genome; genome-linked protein; glycoprotein; overlapping genes; polymerase; terminal repeat; unidentified reading frame; virus-associated RNA. SOURCE Human adenovirus 2 ORGANISM Human adenovirus 2 Viruses; dsDNA viruses, no RNA stage; Adenoviridae; Mastadenovirus; Human adenovirus C. REFERENCE 1 (bases 10610 to 10766) AUTHORS Ohe,K. and Weissman,S.M. TITLE The nucleotide sequence of a low molecular weight ribonucleic acid from cells infected with adenovirus 2 JOURNAL J. Biol. Chem. 246 (22), 6991-7009 (1971) PUBMED 4331202 REFERENCE 2 (sites) AUTHORS Jornvall,H., Ohlsson,H. and Philipson,L. TITLE An acetylated N-terminus of adenovirus type 2 hexon protein JOURNAL Biochem. Biophys. Res. Commun. 56 (2), 304-310 (1974) PUBMED 4823869 REFERENCE 3 (bases 10514 to 10680) AUTHORS Pan,J., Celma,M.L. and Weissman,S.M. TITLE Studies of low molecular weight RNA from cells infected with adenovirus 2. III. The sequence of the promoter for VA-RNA I JOURNAL J. Biol. Chem. 252 (24), 9047-9054 (1977) PUBMED 200627 REFERENCE 4 (bases 10681 to 10813) AUTHORS Celma,M.L., Pan,J. and Weissman,S.M. TITLE Studies of low molecular weight RNA from cells infected with adenovirus 2. I. The sequences at the 3' end of VA-RNA I JOURNAL J. Biol. Chem. 252 (24), 9032-9042 (1977) PUBMED 318160 REFERENCE 5 (sites) AUTHORS Celma,M.L., Pan,J. and Weissman,S.M. TITLE Studies of low molecular weight RNA from cells infected with adenovirus 2. II. Heterogeneity at the 5' end of VA-RNA I JOURNAL J. Biol. Chem. 252 (24), 9043-9046 (1977) PUBMED 925035 REFERENCE 6 (bases 30812 to 30900) AUTHORS Zain,B.S. and Roberts,R.J. TITLE Characterization and sequence analysis of a recombination site in the hybrid virus Ad2+ND JOURNAL J. Mol. Biol. 120 (1), 13-31 (1978) PUBMED 205651 REFERENCE 7 (bases 18778 to 18918) AUTHORS Akusjarvi,G. and Pettersson,U. TITLE Nucleotide sequence at the junction between the coding region of the adenovirus 2 hexon messenger RNA and its leader sequence JOURNAL Proc. Natl. Acad. Sci. U.S.A. 75 (12), 5822-5826 (1978) PUBMED 83649 REFERENCE 8 (bases 5986 to 6236) AUTHORS Ziff,E.B. and Evans,R.M. TITLE Coincidence of the promoter and capped 5' terminus of RNA from the adenovirus 2 major late transcription unit JOURNAL Cell 15 (4), 1463-1475 (1978) PUBMED 729003 REFERENCE 9 (bases 21607 to 21816) AUTHORS Akusjarvi,G. and Pettersson,U. TITLE Sequence analysis of adenovirus DNA. I. Nucleotide sequence at the carboxy-terminal end of the gene for adenovirus type 2 hexon JOURNAL Virology 91 (2), 477-480 (1978) PUBMED 741658 REFERENCE 10 (bases 1 to 156; 35804 to 35937) AUTHORS Arrand,J.R. and Roberts,R.J. TITLE The nucleotide sequences at the termini of adenovirus-2 DNA JOURNAL J. Mol. Biol. 128 (4), 577-594 (1979) PUBMED 318164 REFERENCE 11 (bases 6039 to 6079; 7101 to 7172; 9634 to 9723; 18802 to 18861) AUTHORS Akusjarvi,G. and Pettersson,U. TITLE Sequence analysis of adenovirus DNA: complete nucleotide sequence of the spliced 5' noncoding region of adenovirus 2 hexon messenger RNA JOURNAL Cell 16 (4), 841-850 (1979) PUBMED 455452 REFERENCE 12 (bases 6039 to 31095) AUTHORS Zain,S., Sambrook,J., Roberts,R.J., Keller,W., Fried,M. and Dunn,A.R. TITLE Nucleotide sequence analysis of the leader segments in a cloned copy of adenovirus 2 fiber mRNA JOURNAL Cell 16 (4), 851-861 (1979) PUBMED 455453 REFERENCE 13 (bases 1 to 110; 35835 to 35937) AUTHORS Shinagawa,M. and Padmanabhan,R. TITLE Nucleotide sequence at the inverted terminal repetition of adenovirus type 2 DNA JOURNAL Biochem. Biophys. Res. Commun. 87 (3), 671-678 (1979) PUBMED 454416 REFERENCE 14 (bases 25634 to 27376) AUTHORS Galibert,F., Herisse,J. and Courtois,G. TITLE Nucleotide sequence of the EcoRI-F fragment of adenovirus 2 genome JOURNAL Gene 6 (1), 1-22 (1979) PUBMED 478297 REFERENCE 15 (sites) AUTHORS Zain,B.S. and Roberts,R.J. TITLE Sequences from the beginning of the fiber messenger RNA of adenovirus-2 JOURNAL J. Mol. Biol. 131 (2), 341-352 (1979) PUBMED 318165 REFERENCE 16 (bases 26977 to 27178) AUTHORS Baker,C.C., Herisse,J., Courtois,G., Galibert,F. and Ziff,E. TITLE Messenger RNA for the Ad2 DNA binding protein: DNA sequences encoding the first leader and heterogenity at the mRNA 5' end JOURNAL Cell 18 (2), 569-580 (1979) PUBMED 227610 REFERENCE 17 (bases 5909 to 6178; 7023 to 7212; 9452 to 9836) AUTHORS Akusjarvi,G. and Pettersson,J. TITLE Sequence analysis of adenovirus DNA. IV. The genomic sequences encoding the common tripartite leader of late adenovirus messenger RNA JOURNAL J. Mol. Biol. 134 (1), 143-158 (1979) PUBMED 318166 REFERENCE 18 (bases 513 to 1111; 1226 to 1630) AUTHORS Perricaudet,M., Akusjarvi,G., Virtanen,A. and Pettersson,U. TITLE Structure of two spliced mRNAs from the transforming region of human subgroup C adenoviruses JOURNAL Nature 281 (5733), 694-696 (1979) PUBMED 551290 REFERENCE 19 (bases 6039 to 31080) AUTHORS Zain,S., Gingeras,T.R., Bullock,P., Wong,G. and Gelinas,R.E. TITLE Determination and analysis of adenovirus-2 DNA sequences which may include signals for late messenger RNA processing JOURNAL J. Mol. Biol. 135 (2), 413-433 (1979) PUBMED 318167 REFERENCE 20 (bases 5848 to 6578) AUTHORS Baker,C.C. and Ziff,E.B. TITLE Biogenesis, structures, and sites of encoding of the 5' termini of adenovirus-2 mRNAs JOURNAL Cold Spring Harb. Symp. Quant. Biol. 44 Pt 1,, 415-428 (1980) PUBMED 6933038 REFERENCE 21 (bases 3504 to 4109) AUTHORS Alestrom,P., Akusjarvi,G., Perricaudet,M., Mathews,M.B., Klessig,D.F. and Pettersson,U. TITLE The gene for polypeptide IX of adenovirus type 2 and its unspliced messenger RNA JOURNAL Cell 19 (3), 671-681 (1980) PUBMED 6985473 REFERENCE 22 (bases 35360 to 35937) AUTHORS Shinagawa,M., Padmanabhan,R.V. and Padmanabhan,R. TITLE The nucleotide sequence of the right-hand terminal SmaI-K fragment of adenovirus type 2 DNA JOURNAL Gene 9 (1-2), 99-114 (1980) PUBMED 6247249 REFERENCE 23 (bases 10514 to 11065) AUTHORS Akusjarvi,G., Mathews,M.B., Andersson,P., Vennstrom,B. and Pettersson,U. TITLE Structure of genes for virus-associated RNAI and RNAII of adenovirus type 2 JOURNAL Proc. Natl. Acad. Sci. U.S.A. 77 (5), 2424-2428 (1980) PUBMED 6930642 REFERENCE 24 (bases 27373 to 30050) AUTHORS Herisse,J., Courtois,G. and Galibert,F. TITLE Nucleotide sequence of the EcoRI D fragment of adenovirus 2 genome JOURNAL Nucleic Acids Res. 8 (10), 2173-2192 (1980) PUBMED 6253880 REFERENCE 25 (sites) AUTHORS Perricaudet,M., Le Moullec,J.M. and Pettersson,U. TITLE Predicted structure of two adenovirus tumor antigens JOURNAL Proc. Natl. Acad. Sci. U.S.A. 77 (7), 3778-3782 (1980) PUBMED 6253988 REFERENCE 26 (sites) AUTHORS Anderson,C.W. and Lewis,J.B. TITLE Amino-terminal sequence of adenovirus type 2 proteins: hexon, fiber, component IX, and early protein 1B-15K JOURNAL Virology 104 (1), 27-41 (1980) PUBMED 6985474 REFERENCE 27 (sites) AUTHORS Persson,H., Jornvall,H. and Zabielski,J. TITLE Multiple mRNA species for the precursor to an adenovirus-encoded glycoprotein: identification and structure of the signal sequence JOURNAL Proc. Natl. Acad. Sci. U.S.A. 77 (11), 6349-6353 (1980) PUBMED 6985478 REFERENCE 28 (sites) AUTHORS Van Etten,R.A., Walberg,M.W. and Clayton,D.A. TITLE Precise localization and nucleotide sequence of the two mouse mitochondrial rRNA genes and three immediately adjacent novel tRNA genes JOURNAL Cell 22 (1 Pt 1), 157-170 (1980) PUBMED 7428037 REFERENCE 29 (bases 22305 to 22600) AUTHORS Buttner,W. and Veres-Molnar,Z. TITLE Localization of the 3'-terminal end of the EcoRI B fragment-specific early mRNA of adenovirus type 2 JOURNAL FEBS Lett. 122 (2), 317-321 (1980) PUBMED 6258984 REFERENCE 30 (bases 21607 to 22770) AUTHORS Akusjarvi,G., Zabielski,J., Perricaudet,M. and Pettersson,U. TITLE The sequence of the 3' non-coding region of the hexon mRNA discloses a novel adenovirus gene JOURNAL Nucleic Acids Res. 9 (1), 1-17 (1981) PUBMED 6259616 REFERENCE 31 (bases 459 to 608) AUTHORS Osborne,T.F., Schell,R.E., Burch-Jaffe,E., Berget,S.J. and Berk,A.J. TITLE Mapping a eukaryotic promoter: a DNA sequence required for in vivo expression of adenovirus pre-early functions JOURNAL Proc. Natl. Acad. Sci. U.S.A. 78 (3), 1381-1385 (1981) PUBMED 6262792 REFERENCE 32 (bases 30047 to 32268) AUTHORS Herisse,J. and Galibert,F. TITLE Nucleotide sequence of the EcoRI E fragment of adenovirus 2 genome JOURNAL Nucleic Acids Res. 9 (5), 1229-1240 (1981) PUBMED 6262722 REFERENCE 33 (sites) AUTHORS Hashimoto,S., Pursley,M.H. and Green,M. TITLE Nucleotide sequences and mapping of novel heterogenous 5'-termini of adenovirus 2 early region 4 mRNA JOURNAL Nucleic Acids Res. 9 (7), 1675-1689 (1981) PUBMED 6164992 REFERENCE 34 (bases 17878 to 18918) AUTHORS Akusjarvi,G. and Persson,H. TITLE Gene and mRNA for precursor polypeptide VI from adenovirus type 2 JOURNAL J. Virol. 38 (2), 469-482 (1981) PUBMED 6985479 REFERENCE 35 (bases 5817 to 6051; 35358 to 35707) AUTHORS Baker,C.C. and Ziff,E.B. TITLE Promoters and heterogeneous 5' termini of the messenger RNAs of adenovirus serotype 2 JOURNAL J. Mol. Biol. 149 (2), 189-221 (1981) PUBMED 6927849 REFERENCE 36 (bases 18838 to 21744) AUTHORS Jornvall,H., Alestrom,P., Akusjarvi,G., von Bahr-Lindstrom,H., Philipson,L. and Pettersson,U. TITLE Order of the CNBr fragments in the adenovirus hexon protein JOURNAL J. Biol. Chem. 256 (12), 6204-6212 (1981) PUBMED 7240199 REFERENCE 37 (sites) AUTHORS Akusjarvi,G. and Persson,H. TITLE Controls of RNA splicing and termination in the major late adenovirus transcription unit JOURNAL Nature 292 (5822), 420-426 (1981) PUBMED 7254339 REFERENCE 38 (bases 32263 to 35937) AUTHORS Herisse,J., Rigolet,M., de Dinechin,S.D. and Galibert,F. TITLE Nucleotide sequence of adenovirus 2 DNA fragment encoding for the carboxylic region of the fiber protein and the entire E4 region JOURNAL Nucleic Acids Res. 9 (16), 4023-4042 (1981) PUBMED 6985482 REFERENCE 39 (sites) AUTHORS Kruijer,W., van Schaik,F.M. and Sussenbach,J.S. TITLE Structure and organization of the gene coding for the DNA binding protein of adenovirus type 5 JOURNAL Nucleic Acids Res. 9 (18), 4439-4457 (1981) PUBMED 6117824 REFERENCE 40 (sites) AUTHORS van Beveren,C.P., Maat,J., Dekker,B.M. and van Ormondt,H. TITLE The nucleotide sequence of the gene for protein IVa2 and of the 5' leader segment of the major late mRNAs of adenovirus type 5 JOURNAL Gene 16 (1-3), 179-189 (1981) PUBMED 7343420 REFERENCE 41 (sites) AUTHORS Uhlen,M., Svensson,C., Josephson,S., Alestrom,P., Chattapadhyaya,J.B., Pettersson,U. and Philipson,L. TITLE Leader arrangement in the adenovirus fiber mRNA JOURNAL EMBO J. 1 (2), 249-254 (1982) PUBMED 6897915 REFERENCE 42 (bases 1517 to 1696; 3932 to 4112; 17880 to 17975; 21142 to 28259) AUTHORS Fraser,N.W., Baker,C.C., Moore,M.A. and Ziff,E.B. TITLE Poly(A) sites of adenovirus serotype 2 transcription units JOURNAL J. Mol. Biol. 155 (3), 207-233 (1982) PUBMED 6176714 REFERENCE 43 (bases 7869 to 8420) AUTHORS Virtanen,A., Alestrom,P., Persson,H., Katze,M.G. and Pettersson,U. TITLE An adenovirus agnogene JOURNAL Nucleic Acids Res. 10 (8), 2539-2548 (1982) PUBMED 6281734 REFERENCE 44 (sites) AUTHORS Osborne,T.F., Gaynor,R.B. and Berk,A.J. TITLE The TATA homology and the mRNA 5' untranslated sequence are not required for expression of essential adenovirus E1A functions JOURNAL Cell 29 (1), 139-148 (1982) PUBMED 7105179 REFERENCE 45 (bases 22469 to 24125) AUTHORS Kruijer,W., Van Schaik,F.M. and Sussenbach,J.S. TITLE Nucleotide sequence of the gene encoding adenovirus type 2 DNA binding protein JOURNAL Nucleic Acids Res. 10 (15), 4493-4500 (1982) PUBMED 6985485 REFERENCE 46 (bases 1 to 11600; 32092 to 35937) AUTHORS Gingeras,T.R., Sciaky,D., Gelinas,R.E., Bing-Dong,J., Yen,C.E., Kelly,M.M., Bullock,P.A., Parsons,B.L., O'Neill,K.E. and Roberts,R.J. TITLE Nucleotide sequences from the adenovirus-2 genome JOURNAL J. Biol. Chem. 257 (22), 13475-13491 (1982) PUBMED 7142161 REFERENCE 47 (bases 5778 to 11560) AUTHORS Alestrom,P., Akusjarvi,G., Pettersson,M. and Pettersson,U. TITLE DNA sequence analysis of the region encoding the terminal protein and the hypothetical N-gene product of adenovirus type 2 JOURNAL J. Biol. Chem. 257 (22), 13492-13498 (1982) PUBMED 7142162 REFERENCE 48 (bases 27609 to 27980; 28376 to 29792) AUTHORS Ahmed,C.M., Chanda,R.S., Stow,N.D. and Zain,B.S. TITLE The nucleotide sequence of mRNA for the Mr 19 000 glycoprotein from early gene block III of adenovirus 2 JOURNAL Gene 20 (3), 339-346 (1982) PUBMED 6985487 REFERENCE 49 (bases 31030 to 32775) AUTHORS Boudin,M.L., Rigolet,M., Lemay,P., Galibert,F. and Boulanger,P. TITLE Biochemical and genetical characterization of a fiber-defective temperature-sensitive mutant of type 2 adenovirus JOURNAL EMBO J. 2 (11), 1921-1927 (1983) PUBMED 6571701 REFERENCE 50 (sites) AUTHORS Oosterom-Dragon,E.A. and Anderson,C.W. TITLE Polypeptide structure and encoding location of the adenovirus serotype 2 late, nonstructural 33K protein JOURNAL J. Virol. 45 (1), 251-263 (1983) PUBMED 6296434 REFERENCE 51 (bases 7929 to 8423) AUTHORS Falvey,E. and Ziff,E. TITLE Sequence arrangement and protein coding capacity of the adenovirus type 2 'i' leader JOURNAL J. Virol. 45 (1), 185-191 (1983) PUBMED 6823011 REFERENCE 52 (sites) AUTHORS Downey,J.F., Rowe,D.T., Bacchetti,S., Graham,F.L. and Bayley,S.T. TITLE Mapping of a 14,000-dalton antigen to early region 4 of the human adenovirus 5 genome JOURNAL J. Virol. 45 (2), 514-523 (1983) PUBMED 6834467 REFERENCE 53 (sites) AUTHORS Goldenberg,C.J. and Hauser,S.D. TITLE Accurate and efficient in vitro splicing of purified precursor RNAs specified by early region 2 of the adenovirus 2 genome JOURNAL Nucleic Acids Res. 11 (5), 1337-1348 (1983) PUBMED 6186991 REFERENCE 54 (sites) AUTHORS Virtanen,A. and Pettersson,U. TITLE The molecular structure of the 9S mRNA from early region 1A of adenovirus serotype 2 JOURNAL J. Mol. Biol. 165 (3), 496-499 (1983) PUBMED 6842608 REFERENCE 55 (sites) AUTHORS Stalhandske,P., Persson,H., Perricaudet,M., Philipson,L. and Pettersson,U. TITLE Structure of three spliced mRNAs from region E3 of adenovirus type 2 JOURNAL Gene 22 (2-3), 157-165 (1983) PUBMED 6307816 REFERENCE 56 (bases 15821 to 16495) AUTHORS Sung,M.T., Cao,T.M., Coleman,R.T. and Budelier,K.A. TITLE Gene and protein sequences of adenovirus protein VII, a hybrid basic chromosomal protein JOURNAL Proc. Natl. Acad. Sci. U.S.A. 80 (10), 2902-2906 (1983) PUBMED 6574459 REFERENCE 57 (sites) AUTHORS Lewis,J.B. and Anderson,C.W. TITLE Proteins encoded near the adenovirus late messenger RNA leader segments JOURNAL Virology 127 (1), 112-123 (1983) PUBMED 6857999 REFERENCE 58 (sites) AUTHORS Yeh-Kai,L., Akusjarvi,G., Alestrom,P., Pettersson,U., Tremblay,M. and Weber,J. TITLE Genetic identification of an endoproteinase encoded by the adenovirus genome JOURNAL J. Mol. Biol. 167 (1), 217-222 (1983) PUBMED 6306252 REFERENCE 59 (bases 17539 to 18177) AUTHORS Sung,M.T., Cao,T.M., Lischwe,M.A. and Coleman,R.T. TITLE Molecular processing of adenovirus proteins JOURNAL J. Biol. Chem. 258 (13), 8266-8272 (1983) PUBMED 6336325 REFERENCE 60 (sites) AUTHORS Kruijer,W., van Schaik,F.M., Speijer,J.G. and Sussenbach,J.S. TITLE Structure and function of adenovirus DNA binding protein: comparison of the amino acid sequences of the Ad5 and Ad12 proteins derived from the nucleotide sequence of the corresponding genes JOURNAL Virology 128 (1), 140-153 (1983) PUBMED 6308889 REFERENCE 61 (bases 13898 to 14231) AUTHORS Le Moullec,J.M., Akusjarvi,G., Stalhandske,P., Pettersson,U., Chambraud,B., Gilardi,P., Nasri,M. and Perricaudet,M. TITLE Polyadenylic acid addition sites in the adenovirus type 2 major late transcription unit JOURNAL J. Virol. 48 (1), 127-134 (1983) PUBMED 6136617 REFERENCE 62 (bases 18616 to 19233) AUTHORS Mautner,V. and Boursnell,M.E. TITLE Recombination in adenovirus: DNA sequence analysis of crossover sites in intertypic recombinants JOURNAL Virology 131 (1), 1-10 (1983) PUBMED 6316645 REFERENCE 63 (sites) AUTHORS Tigges,M.A. and Raskas,H.J. TITLE Splice junctions in adenovirus 2 early region 4 mRNAs: multiple splice sites produce 18 to 24 RNAs JOURNAL J. Virol. 50 (1), 106-117 (1984) PUBMED 6336328 REFERENCE 64 (sites) AUTHORS Downey,J.F., Evelegh,C.M., Branton,P.E. and Bayley,S.T. TITLE Peptide maps and N-terminal sequences of polypeptides from early region 1A of human adenovirus 5 JOURNAL J. Virol. 50 (1), 30-37 (1984) PUBMED 6699947 REFERENCE 65 (sites) AUTHORS Freyer,G.A., Katoh,Y. and Roberts,R.J. TITLE Characterization of the major mRNAs from adenovirus 2 early region 4 by cDNA cloning and sequencing JOURNAL Nucleic Acids Res. 12 (8), 3503-3519 (1984) PUBMED 6145146 REFERENCE 66 (sites) AUTHORS Leff,T., Elkaim,R., Goding,C.R., Jalinot,P., Sassone-Corsi,P., Perricaudet,M., Kedinger,C. and Chambon,P. TITLE Individual products of the adenovirus 12S and 13S EIa mRNAs stimulate viral EIIa and EIII expression at the transcriptional level JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81 (14), 4381-4385 (1984) PUBMED 6336330 REFERENCE 67 (sites) AUTHORS Padgett,R.A., Konarska,M.M., Grabowski,P.J., Hardy,S.F. and Sharp,P.A. TITLE Lariat RNA's as intermediates and products in the splicing of messenger RNA precursors JOURNAL Science 225 (4665), 898-903 (1984) PUBMED 6206566 REFERENCE 68 (sites) AUTHORS Natarajan,V., Madden,M.J. and Salzman,N.P. TITLE Proximal and distal domains that control in vitro transcription of the adenovirus IVa2 gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 81 (20), 6290-6294 (1984) PUBMED 6593702 REFERENCE 69 (sites) AUTHORS Wold,W.S., Cladaras,C., Magie,S.C. and Yacoub,N. TITLE Mapping a new gene that encodes an 11,600-molecular-weight protein in the E3 transcription unit of adenovirus 2 JOURNAL J. Virol. 52 (2), 307-313 (1984) PUBMED 6492252 REFERENCE 70 (sites) AUTHORS Mautner,V. and Mackay,N. TITLE Recombination in adenovirus: analysis of crossover sites in intertypic overlap recombinants JOURNAL Virology 139 (1), 43-52 (1984) PUBMED 6495658 REFERENCE 71 (bases 15033 to 18316) AUTHORS Alestrom,P., Akusjarvi,G., Lager,M., Yeh-kai,L. and Pettersson,U. TITLE Genes encoding the core proteins of adenovirus type 2 JOURNAL J. Biol. Chem. 259 (22), 13980-13985 (1984) PUBMED 6094534 REFERENCE 72 (bases 11601 to 15726; 23924 to 25638) AUTHORS Roberts,R.J., O'Neill,K.E. and Yen,C.T. TITLE DNA sequences from the adenovirus 2 genome JOURNAL J. Biol. Chem. 259 (22), 13968-13975 (1984) PUBMED 6334081 REFERENCE 73 (bases 18838 to 21744) AUTHORS Akusjarvi,G., Alestrom,P., Pettersson,M., Lager,M., Jornvall,H. and Pettersson,U. TITLE The gene for the adenovirus 2 hexon polypeptide JOURNAL J. Biol. Chem. 259 (22), 13976-13979 (1984) PUBMED 6501284 REFERENCE 74 (sites) AUTHORS Moore,C.L. and Sharp,P.A. TITLE Accurate cleavage and polyadenylation of exogenous RNA substrate JOURNAL Cell 41 (3), 845-855 (1985) PUBMED 2408761 REFERENCE 75 (sites) AUTHORS Roberts,R.J., Akusjaervi,G., Alestroem,P., Gelinas,R.E., Gingeras,T.R., Sciaky,D. and Pettersson,U. TITLE A consensus sequence for the adenovirus-2 genome JOURNAL (in) Doerfler,W. (Ed.); ADENOVIRUS DNA: 1-51; Martinus Nijhoff Publishing, Boston (1986) REFERENCE 76 (bases 10610 to 10711) AUTHORS Cannon,R.E., Wu,G.J. and Railey,J.F. TITLE Functions of and interactions between the A and B blocks in adenovirus type 2-specific VARNA1 gene JOURNAL Proc. Natl. Acad. Sci. U.S.A. 83 (5), 1285-1289 (1986) PUBMED 3456587 REFERENCE 77 (sites) AUTHORS Anderson,C.W., Schmitt,R.C., Smart,J.E. and Lewis,J.B. TITLE Early region 1B of adenovirus serotype 2 encodes two co-terminal proteins of 495 and 155 amino acid residues JOURNAL Unpublished (1984) REFERENCE 78 (sites) AUTHORS Virtanen,A. and Pettersson,U. JOURNAL Unpublished (1984) COMMENT [1] RNA. [2] sites; cds start for the hexon protein. [4] sites; 5' terminus of VA I RNA. [6] mRNA and DNA. [11] cDNA to hexon mRNA. [12] several fragments over this span;. [13] mRNA and DNA. [19] cDNA. [16] sites; acceptor splice site for fiber mRNA. [18] several leader fragments over this span. [27] sites; splice sites for E1b mRNAs. [28] sites; cds start for E3 19K glycoprotein. [29] sites; cds start for 15K, IX and fiber polypeptides. [37] sites; cap site for E4 mrnas. [39] sites; splice sites in E2a mRNA. [31] fragments over this span. [34] sites; splice site in 52,55K-pept mRNA. [30] sites; splice sites in IVa2 mRNA, Ad5. [43] cDNA and DNA. [42] sites; splice site for 'i' leader. [46] mRNA and DNA. [41] sites; E1a mutational analysis. [55] sites; splice sites for 33K mRNA. [56] sites; cds start for E4 11K-pept, ad5. [60] sites; cds start for the 13.6K-pept. [61] sites; splice sites for 72K and 100K mRNAs. [50] sites; splice sites for leaders; poly-A sites. [52] sites; splice sites for E1a mRNAs. [58] sites; splice sites in E2 mRNA. [53] sites; H2ts1 mutation between 57.0% and 69.0%. [49] H2ts125 strain. [68] sites; cds start for E1a proteins. [69] sites; splice sites in E4 region. [71] sites; splice sites in E4 region; poly-A site for E4 mRNAs. [63] sites; cds start for 57K-pept. [63] sites; splice sites in E4 region; poly-A site for E4 mRNAs. [63] sites; splice sites in E1b region. [(in) Doerfler,W. (Ed.);Adenovirus DNA: 1-51;Martinus Nijhoff Publishing, Boston ] review; bases 1 to 35937. [75] sites; recombination analysis of ad2 and ad5. [74] sites; splice sites in major late mRNA. [73] sites; IVa2 transcription start. [72] sites; transcription start for EIa mRNAs. [70] sites; E3 11.6 -K protein. [76] sites; L3 mRNA polyadenylation site. [78] sites; L3 mRNA polyadenylation site. Communicated on tape by R. Roberts. That tape and [(in) Doerfler, W. (Ed.);Adenovirus DNA: 1-51;Martinus Nijhoff Publishing, Boston ] are the immediate sources of the annotation herein. A consensus sequence for the l-strand of the genome is shown. Population heterogeneity as distinct from strain variation is known (35937 +/- 9 bp) [67]; both are annotated as 'variation' below. For site differences with adenovirus type 5, see loci beginning <ad5> which are arranged in the library according to the map coordinates of <ad2> where one map unit corresponds to 360 bases throughout (see [44],[67]). For mutational changes in the ad2 sequence, see the appropriate references above. The origin of replication is located in the first fifty bases from each end. Transcription is leftward off the l-strand and rightward off the r-strand; in the former case, the annotation shows '(c)' for complementary strand. Complex splicing events give rise to perhaps fifty or more distinct mRNA transcripts at early, intermediate and late times after infection, many of which are still being characterized; in particular, some transcripts are known from electron microscopy which are not yet characterized at the sequence level. To date nine mRNA start sites (cap sites) have been identified, and these define the general units of mRNAs under which all known transcripts are classified. From the r-strand, the early transcripts are E1a, E1b and E3. The 28 kb late transcript called herein 'major late mRNA' comprises five families, L1 through L5, of 3' co-terminal mRNAs. L1, and to a lesser extent L2, can be expressed at early and intermediate times [34]. Transcripts from this region contain a common tripartite leader sequence at their 5' ends: the three segments of this leader are encoded at bases 6039-6079, 7101-7172 and 9634-9723. At early and intermediate times, an extra leader segment, the 'i' leader, is frequently present (bases 7942-8381). The IX message, the only unspliced message in ad2, is intermediate, and its termination overlaps that for E1b on the same strand and that for IVa2, and most likely E2b, on the opposite strand. From the l-strand, or the 'comp strand', early expression derives from the E2a, E2b and E4 families of mRNAs, although there can be late transcription from E2a. The E2b cap sites, splice sites and termination sites have not been determined at the sequence level. From electron microscopy there is evidence that the E2b mRNAs may originate at the E2a early cap site at 27092 (c) and terminate at the poly-A addition site found for the IVa2 mRNA at 4050 (c) [44]. IVa2 is an intermediate message. The promoters for these nine classes of mRNAs can be localized and characterized to the following extent [32]: mRNA cap site possible promoter region ------ ---------- ----------------------------- E1a 498 tatttata at 468-474 E1b 1699 tatataat at 1669-1676 IX 3576 tatataa at 3545-3551 major late 6039 tataaaa at 6008-6014 E3 27609 tataa at 27580-27584 E4 35609 (c) tatatata at 35641-35633 (c) E2a early 27092 (c) no obvious sequence for 100 bases upstream E2a late 25956 (c) tacaaattt at 25985-25977 (c) IVa2 5826 (c) no obvious sequence for 100 bases upstream The mRNA responsible for the 13.6K protein encoded at 7968 has not been identified. The VA I and VA II transcripts are unique in that they are generated by RNA polymerase III; for a discussion of these low molecular weight RNAs-- the modulation of their start points, their promoters, their heterogeneity and their similarity to tRNA-- see [3],[4],[5],[26] and <ad5a2>. The proteins known to be encoded from these mRNAs are given in the Features table below, though the details of translation and processing have not been fully determined. In cases such as the IIIa peptide or the 11K peptide, the exact span of the coding awaits elucidation of the mRNA splicing. Some of these products share reading frames and therefore manifest partial homologies. The following table summarizes the unidentified reading frames ('URF') of 100 or more amino acids: initiator terminator frame protein encoded ----------- ---------- ------- ----------------- 6280 6600 1 11.6K URF 17284 17763 1 17.4K URF 23782 24138 1 12.9K URF 24481 24867 1 14.2K URF 26044 26826 1 28.6K URF(contains the N-terminus of 33K cds) 30973 32778 1 63.9K URF(contains the fiber cds) 10421 10834 2 14.4K URF 20504 20935 2 15.7K URF 27899 28222 2 12.4K URF 30059 30451 2 14.5k URF 33956 34456 2 18.8K URF 9294 9800 3 17.7K URF 23526 26525 3 110.2K URF(contains the 100K-pept cds) 30444 30830 3 14.7K URF 34470 34808 3 12.7K URF complementary strand --------------------------- 35532 35146 1 14.3K URF 34077 33193 1 34.1K URF 11109 10744 1 12.8K URF 9030 8383 1 22.8K URF 6780 6442 1 12.8K URF 31604 31290 2 10.7K URF 31211 30852 2 13.5K URF 18707 18159 2 18.9K URF 14861 14424 2 16.4K URF 14114 13728 2 13.5K URF 11618 11250 2 13.6K URF 1712 1194 2 18.1K URF 35113 34703 3 15.3K URF 34342 33998 3 13.3K URF 5674 5327 3 12.2K URF Additionally there are numerous unidentified reading frames of less than 100 amino acid residues; and further small modifications of a few of the coding sequences are possible. [7] missing data project. FEATURES Location/Qualifiers source 1..35937 /organism="Human adenovirus 2" /mol_type="genomic DNA" /db_xref="taxon:10515" repeat_region 1..102 /standard_name="inverted terminal repeat" /citation=[13] variation 8 /note="a in [44],[(in) Doerfler,W. (Ed.);Adenovirus DNA: 1-51;Martinus Nijhoff Publishing, Boston ]; aa in other strains,e.g.[13]" /replace="aa" variation 460 /note="c in [44],[63]; t in [31]" /replace="t" prim_transcript 498..1630 /citation=[18] gene 498..1613 /gene="E1a" CDS join(559..1111,1226..1542) /gene="E1a" /note="derived from the 13S mRNA" /codon_start=1 /product="32 kD protein" /protein_id="AAA92199.1" /db_xref="GI:209814" /translation="MRHIICHGGVITEEMAASLLDQLIEEVLADNLPPPSHFEPPTLH ELYDLDVTAPEDPNEEAVSQIFPESVMLAVQEGIDLFTFPPAPGSPEPPHLSRQPEQP EQRALGPVSMPNLVPEVIDLTCHEAGFPPSDDEDEEGEEFVLDYVEHPGHGCRSCHYH RRNTGDPDIMCSLCYMRTCGMFVYSPVSEPEPEPEPEPEPARPTRRPKLVPAILRRPT SPVSRECNSSTDSCDSGPSNTPPEIHPVVPLCPIKPVAVRVGGRRQAVECIEDLLNES GQPLDLSCKRPRP" CDS join(559..973,1226..1542) /gene="E1a" /note="derived from the 12S mRNA" /codon_start=1 /product="26 kD protein" /protein_id="AAA92197.1" /db_xref="GI:209813" /translation="MRHIICHGGVITEEMAASLLDQLIEEVLADNLPPPSHFEPPTLH ELYDLDVTAPEDPNEEAVSQIFPESVMLAVQEGIDLFTFPPAPGSPEPPHLSRQPEQP EQRALGPVSMPNLVPEVIDLTCHEAGFPPSDDEDEEGPVSEPEPEPEPEPEPARPTRR PKLVPAILRRPTSPVSRECNSSTDSCDSGPSNTPPEIHPVVPLCPIKPVAVRVGGRRQ AVECIEDLLNESGQPLDLSCKRPRP" CDS join(559..636,1226..1315) /gene="E1a" /note="derived from the 9S mRNA" /codon_start=1 /product="6 kD protein" /protein_id="AAA92198.1" /db_xref="GI:209812" /translation="MRHIICHGGVITEEMAASLLDQLIEEVLCLNLSLSPSQNRSLQD LPGVLNWCLLS" intron 637..1225 /gene="E1a" /note="9S-derived" /citation=[18] intron 974..1225 /gene="E1a" /note="12S-derived" /citation=[18] intron 1112..1225 /gene="E1a" /note="13S-derived" /citation=[18] polyA_signal 1608..1613 /gene="E1a" /note="4.47%; putative" prim_transcript 1699..4061 /citation=[35] gene 1699..3588 /gene="E1b_1" CDS 1711..2238 /gene="E1b_1" /note="derived from 13S mRNA" /codon_start=1 /product="20.5 kD protein" /protein_id="AAA92200.1" /db_xref="GI:209819" /translation="MEAWECLEDFSAVRNLLEQSSNSTSWFWRFLWGSSQAKLVCRIK EDYKWEFEELLKSCGELFDSLNLGHQALFQEKVIKTLDFSTPGRAAAAVAFLSFIKDK WSEETHLSGGYLLDFLAMHLWRAVVRHKNRLLLLSSVRPAIIPTEEQQQEEARRRRRQ EQSPWNPRAGLDPRE" CDS 2016..3503 /gene="E1b_1" /note="derived from the 22S mRNA" /codon_start=1 /product="transformation-associated protein" /protein_id="AAA92202.1" /db_xref="GI:209820" /translation="MERRNPSERGVPAGFSGHASVESGGETQESPATVVFRPPGNNTD GGATAGGSQAAAAAGAEPMEPESRPGPSGMNVVQVAELFPELRRILTINEDGQGLKGV KRERGASEATEEARNLTFSLMTRHRPECVTFQQIKDNCANELDLLAQKYSIEQLTTYW LQPGDDFEEAIRVYAKVALRPDCKYKISKLVNIRNCCYISGNGAEVEIDTEDRVAFRC SMINMWPGVLGMDGVVIMNVRFTGPNFSGTVFLANTNLILHGVSFYGFNNTCVEAWTD VRVRGCAFYCCWKGVVCRPKSRASIKKCLFERCTLGILSEGNSRVRHNVASDCGCFML VKSVAVIKHNMVCGNCEDRASQMLTCSDGNCHLLKTIHVASHSRKAWPVFEHNILTRC SLHLGNRRGVFLPYQCNLSHTKILLEPESMSKVNLNGVFDMTMKIWKVLRYDETRTRC RPCECGGKHIRNQPVMLDVTEELRPDHLVLACTRAEFGSSDEDTD" CDS join(2016..2249,3270..3503) /gene="E1b_1" /note="derived from the 1.26 kb mRNA" /codon_start=1 /protein_id="AAA92203.1" /db_xref="GI:209816" /translation="MERRNPSERGVPAGFSGHASVESGGETQESPATVVFRPPGNNTD GGATAGGSQAAAAAGAEPMEPESRPGPSGMNVVQPESMSKVNLNGVFDMTMKIWKVLR YDETRTRCRPCECGGKHIRNQPVMLDVTEELRPDHLVLACTRAEFGSSDEDTD" CDS join(2016..2249,3212..3256) /gene="E1b_1" /note="derived from the 1.31 kb mRNA" /codon_start=1 /protein_id="AAA92201.1" /db_xref="GI:209815" /translation="MERRNPSERGVPAGFSGHASVESGGETQESPATVVFRPPGNNTD GGATAGGSQAAAAAGAEPMEPESRPGPSGMNVVQEGGVPTLPMQFESH" intron 2250..3588 /gene="E1b_1" /citation=[25] /number=1 intron 2250..3269 /gene="E1b_1" /number=1 intron 2250..3211 /gene="E1b_1" /number=1 intron 3505..3588 /gene="E1b_1" /citation=[25] /number=2 mRNA 3576..4061 /citation=[46] gene 3576..4034 /gene="IX" CDS 3600..4022 /gene="IX" /codon_start=1 /product="hexon-associated protein" /protein_id="AAA92204.1" /db_xref="GI:209821" /translation="MSANSFDGSIVSSYLTTRMPPWAGVRQNVMGSSIDGRPVLPANS TTLTYETVSGTPLETAASAAASAAAATARGIVTDFAFLSPLASSAASRSSARDDKLTA LLAQLDSLTRELNVVSQQLLDLRQQVSALKASSPPNAV" gene 4029..4034 /gene="E1b_2" polyA_signal 4029..4034 /gene="E1b_2" /note="putative" polyA_signal 4029..4034 /gene="IX" /note="putative" gene complement(4050..27092) /gene="E2b" mRNA complement(4050..27092) /gene="E2b" /citation=[46] prim_transcript complement(4050..5826) /gene="IVa2" /citation=[46] gene complement(4050..5695) /gene="IVa2" CDS complement(join(4081..5417,5696..5708)) /gene="IVa2" /codon_start=1 /product="virion morphogenesis-associated protein" /protein_id="AAA92205.1" /db_xref="GI:209817" /translation="METRGRRPAALQHQQDQPQAHPGQRAARSAPLHRDPDYADEDPA PVERHDPGPSGRAPTTAVQRKPPQPAKRGDMLDRDAVEHVTELWDRLELLGQTLKSMP TADGLKPLKNFASLQELLSLGGERLLAHLVRENMQVRDMLNEVAPLLRDDGSCSSLNY QLQPVIGVIYGPTGCGKSQLLRNLLSSQLISPTPETVFFIAPQVDMIPPSELKAWEMQ ICEGNYAPGPDGTIIPQSGTLRPRFVKMAYDDLILEHNYDVSDPRNIFAQAAARGPIA IIMDECMENLGGHKGVSKFFHAFPSKLHDKFPKCTGYTVLVVLHNMNPRRDMAGNIAN LKIQSKMHLISPRMHPSQLNRFVNTYTKGLPLAISLLLKDIFRHHAQRSCYDWIIYNT TPQHEALQWCYLHPRDGLMPMYLNIQSHLYHVLEKIHRTLNDRDRWSRAYRARKTPK" polyA_signal complement(4085..4090) /gene="IVa2" /note="putative" CDS complement(5187..8357) /gene="DNA polymerase" /codon_start=1 /product="DNA polymerase" /protein_id="AAA92206.1" /db_xref="GI:209822" /translation="MDSSHLRDVVIKLRPPRADIWTLGSRGVVVRSTITPLEQPDGQG QAAEVEDHQPNPPGEGLKFPLCFLVRGRQVNLVQDVQPVHRCQYCARFYKSQHECSAR RRDFYFHHINSHSSNWWREIQFFPIGSHPRTERLFVTYDVETYTWMGAFGKQLVPFML VMKFGGDEPLVTAARDLAVDLGWDRWEQDPLTFYCITPEKMAIGRQFRTFRDHLQMLM ARDLWSSFVASNPHLADWALSEHGLSSPEELTYEELKKLPSIKGTPRFLELYIVGHNI NGFDEIVLAAQVINNRSEVPGPFRITRNFMPRAGKILFNDVTFALPNPRSKKRTDFLL WEQGGCDDTDFKYQYLKVMVRDTFALTHTSLRKAAQAYALPVEKGCCAYQAVNQFYML GSYRSEADGFPIQEYWKDREEFVLNRELWKKKGQDKYDIIKETLDYCALDVQVTAELV NKLRDSYASFVRDAVGLTDASFNVFQRPTISSNSHAIFRQIVFRAEQPARSNLGPDLL APSHELYDYVRASIRGGRCYPTYLGILREPLYVYDICGMYASALTHPMPWGPPLNPYE RALAARAWQQALDLQGCKIDYFDARLLPGVFTVDADPPDETQLDPLPPFCSRKGGRLC WTNERLRGEVATSVDLVTLHNRGWRVHLVPDERTTVFPEWRCVAREYVQLNIAAKERA DRDKNQTLRSIAKLLSNALYGSFATKLDNKKIVFSDQMDAATLKGITAGQVNIKSSSF LETDNLSAEVMPAFEREYSPQQLALADSDAEESEDERAPTPFYSPPSGTPGHVAYTYK PITFLDAEEGDMCLHTLERVDPLVDNDRYPSHLASFVLAWTRAFVSEWSEFLYEEDRG TPLEDRPLKSVYGDTDSLFVTERGHRLMETRGKKRIKKHGGNLVFDPERPELTWLVEC ETVCGACGADAYSPESVFLAPKLYALKSLHCPSCGASSKGKLRAKGHAAEGLDYDTMV KCYLADAQGEDRQRFSTSRTSLKRTLASAQPGAHPFTVTQTTLTRTLRPWKDMTLARL DEHRLLPYSESRPNPRNEEICWIEMP" gene complement(5187..7213) /gene="DNA polymerase" intron complement(5418..5695) /gene="IVa2" /citation=[25] prim_transcript 6039..32798 /citation=[35] gene 6039..32779 /gene="L5" prim_transcript 6039..28223 /gene="L4" /citation=[35] gene 6039..24094 /gene="L4" prim_transcript 6039..22443 /gene="L4" /citation=[30] gene 6039..20487 /gene="L3_1" prim_transcript 6039..17969 /gene="L3_1" /citation=[34] gene 6039..15856 /gene="L2_1" prim_transcript 6039..14113 /gene="L1" /citation=[35] gene 6039..14097 /gene="L1" intron 6080..7100 /gene="L1" /note="major late mRNA intron (precedes 2nd leader)" /citation=[12] old_sequence 6443 /gene="L1" /citation=[20] conflict 6574..6575 /gene="L1" /citation=[20] /replace="" intron 7173..7941 /gene="DNA polymerase" /note="major late mRNA intron (precedes 'i' leader)" /citation=[12] old_sequence 7212..7213 /gene="L1" /citation=[19] CDS 7968..8417 /gene="L1" /codon_start=1 /product="13.6 kD protein" /protein_id="AAA92207.1" /db_xref="GI:209823" /translation="MRADREELDLPPPVGGVAVDVVKVEVPATGRTLVLAFVKTCAVL AAVHGLYILHEVDLTTAHKEAEWEFEPLAWRVWLVVFYFGCLSLTVWLLEGSYGGSDH HAARAQSPDVRARRSELDDNIAQMGAVHGLELPRRQVRRELLQVYLA" intron 8382..9633 /gene="L1" /note="major late mRNA intron (precedes 3rd leader)" /citation=[12] CDS complement(8573..10534) /gene="E2b" /note="also known as Bellet protein" /codon_start=1 /product="terminal protein" /protein_id="AAA92208.1" /db_xref="GI:209824" /translation="MEHFLPLRNIWNRVRDFPRASTTAAGITWMSRYIYGYHRLMLED LAPGAPATLRWPLYRQPPPHFLVGYQYLVRTCNDYVFDSRAYSRLRYTELSQPGHQTV NWSVMANCTYTINTGAYHRFVDMDDFQSTLTQVQQAILAERVVADLALLQPMRGFGVT RMGGRGRHLRPNSAAAVAIDARDAGQEEGEEEVPVERLMQDYYKDLRRCQNEAWGMAD RLRIQQAGPKDMVLLSTIRRLKTAYFNYIISSTSARNNPDRHPLPPATVLSLPCDCDW LDAFLERFSDPVDADSLRSLGGGVPTQQLLRCIVSAVSLPHGSPPPTHNRDMTGGVFQ LRPRENGRAVTETMRRRRGEMIERFVDRLPVRRRRRRVPPPPPPPEEEEEGEALMEEE IEEEEAPVAFEREVRDTVAELIRLLEEELTVSARNSQFFNFAVDFYEAMERLEALGDI NESTLRRWVMYFFVAEHTATTLNYLFQRLRNYAVFARHVELNLAQVVMRARDAEGGVV YSRVWNEGGLNAFSQLMARISNDLAATVERAGRGDLQEEEIEQFMAEIAYQDNSGDVQ EILRQAAVNDTEIDSVELSFRFKLTGPVVFTQRRQIQEINRRVVAFASNLRAQHQLLP ARGADVPLPPLPAGPEPPLPPGARPRHRF" conflict 9315..9316 /gene="L1" /citation=[47] /replace="" variation 9382 /gene="L1" /note="c is shown; can be cttc due to population heterogeneity" /citation=[72] /replace="cttc" conflict 9633..9634 /gene="L1" /citation=[12] /replace="" intron 9724..24094 /gene="L4" /note="precedes 100K mRNA" /citation=[60] intron 9724..21649 /gene="L4" /note="precedes 23K mRNA" /citation=[30] intron 9724..18801 /gene="L3_1" /note="precedes hexon mRNA" /citation=[7] intron 9724..17999 /gene="L3_1" /note="precedes pVI mRNA" /citation=[34] intron 9724..16515 /gene="L3_1" /note="precedes pV mRNA" /citation=[71] intron 9724..14149 /gene="L2_1" /note="precedes penton mRNA" /citation=[61] intron 9724..11039 /gene="L1" /note="precedes 52,55K mRNA" /citation=[12] misc_RNA 10607..10766 /gene="VA I" /citation=[5] gene 10607..10716 /gene="VA I" misc_RNA 10610..10766 /gene="VA I" /citation=[1] conflict 10715..10716 /gene="VA I" /citation=[4] /replace="" gene 10866..11023 /gene="VA II" misc_RNA 10866..11023 /gene="VA II" /citation=[23] CDS 11040..12287 /gene="L1" /codon_start=1 /product="52,55kD protein" /protein_id="AAA92209.1" /db_xref="GI:209825" /translation="MHPVLRQMRPPPQQRQEQEQRQTCRAPSPSPTASGGATSAADAA ADGDYEPPRRRARHYLDLEEGEGLARLGAPSPERHPRVQLKRDTREAYVPRQNLFRDR EGEEPEEMRDRKFHAGRELRHGLNRERLLREEDFEPDARTGISPARAHVAAADLVTAY EQTVNQEINFQKSFNNHVRTLVAREEVAIGLMHLWDFVSALEQNPNSKPLMAQLFLIV QHSRDNEAFRDALLNIVEPEGRWLLDLINILQSIVVQERSLSLADKVAAINYSMLSLG KFYARKIYHTPYVPIDKEVKIEGFYMRMALKVLTLSDDLGVYRNERIHKAVSVSRRRE LSDRELMHSLQRALAGTGSGDREAESYFDAGADLRWAPSRRALEAAGAGPGLAVAPAR AGNVGGVEEYDEDDEYEPEDGEY" variation 11062 /gene="L1" /note="t in [43],[63]" /citation=[23] /replace="c" gene 12308..14065 /gene="IIIa" CDS 12308..14065 /gene="IIIa" /note="splice sites not sequenced" /codon_start=1 /product="peripentonal hexon-associated protein" /protein_id="AAA92210.1" /db_xref="GI:209826" /translation="MMQDATDPAVRAALQSQPSGLNSTDDWRQVMDRIMSLTARNPDA FRQQPQANRLSAILEAVVPARANPTHEKVLAIVNALAENRAIRPDEAGLVYDALLQRV ARYNSGNVQTNLDRLVGDVREAVAQRERAQQQGNLGSMVALNAFLSTQPANVPRGQED YTNFVSALRLMVTETPQSEVYQSGPDYFFQTSRQGLQTVNLSQAFKNLQGLWGVRAPT GDRATVSSLLTPNSRLLLLLIAPFTDSGSVSRDTYLGHLLTLYREAIGQAHVDEHTFQ EITSVSRALGQEDTGSLEATLNYLLTNRRQKIPSLHSLNSEEERILRYVQQSVSLNLM RDGVTPSVALDMTARNMEPGMYASNRPFINRLMDYLHRAAAVNPEYFTNAILNPHWLP PPGFYTGGFEVPEGNDGFLWDDIDDSVFSPQPQTLLELQQREQAEAALRKESFRRPSS LSDLGAAAPRSDASSPFPSLIGSFTSTRTTRPRLLGEEEYLNNSLLQPQREKNLPPAF PNNGIESLVDKMSRWKTYAQEHRDVPGPRPPTRRQRHDRQRGLVWEDDDSADDSSVLD LGGSGNPFAHLRPRLGRMF" variation 14064..14080 /gene="L1" /note="15 to 19 A residues have been observed in various populations" /citation=[72] /replace="aaaaaaaaaaaaaaaaaaa" polyA_signal 14092..14097 /gene="L1" /note="39.1%; putative" CDS 14151..15866 /gene="L3_1" /note="virion component III" /codon_start=1 /product="penton protein" /protein_id="AAA92211.1" /db_xref="GI:209827" /translation="MQRAAMYEEGPPPSYESVVSAAPVAAALGSPFDAPLDPPFVPPR YLRPTGGRNSIRYSELAPLFDTTRVYLVDNKSTDVASLNYQNDHSNFLTTVIQNNDYS PGEASTQTINLDDRSHWGGDLKTILHTNMPNVNEFMFTNKFKARVMVSRSLTKDKQVE LKYEWVEFTLPEGNYSETMTIDLMNNAIVEHYLKVGRQNGVLESDIGVKFDTRNFRLG FDPVTGLVMPGVYTNEAFHPDIILLPGCGVDFTHSRLSNLLGIRKRQPFQEGFRITYD DLEGGNIPALLDVDAYQASLKDDTEQGGDGAGGGNNSGSGAEENSNAAAAAMQPVEDM NDHAIRGDTFATRAEEKRAEAEAAAEAAAPAAQPEVEKPQKKPVIKPLTEDSKKRSYN LISNDSTFTQYRSWYLAYNYGDPQTGIRSWTLLCTPDVTCGSEQVYWSLPDMMQDPVT FRSTSQISNFPVVGAELLPVHSKSFYNDQAVYSQLIRQFTSLTHVFNRFPENQILARP PAPTITTVSENVPALTDHGTLPLRNSIGGVQRVTITDARRRTCPYVYKALGIVSPRVL SSRTF" variation 15856 /gene="L2_1" /note="g in [67],[(in) Doerfler,W. (Ed.);Adenovirus DNA: 1-51;Martinus Nijhoff Publishing, Boston ]" /citation=[56] /replace="t" CDS 15873..16469 /gene="Pro-VII" /note="precursor" /codon_start=1 /product="major core protein" /protein_id="AAA92212.1" /db_xref="GI:209828" /translation="MSILISPSNNTGWGLRFPSKMFGGAKKRSDQHPVRVRGHYRAPW GAHKRGRTGRTTVDDAIDAVVEEARNYTPTPPPVSTVDAAIQTVVRGARRYAKMKRRR RRVARRHRRRPGTAAQRAAAALLNRARRTGRRAAMRAARRLAAGIVTVPPRSRRRAAA AAAAAISAMTQGRRGNVYWVRDSVSGLRVPVRTRPPRN" gene 15873..16437 /gene="Pro-VII" variation 15914 /gene="Pro-VII" /note="c in [69],[63]" /citation=[56] /replace="t" variation 15998 /gene="Pro-VII" /note="g in [66],[63]" /citation=[56] /replace="c" conflict 16205..16208 /gene="Pro-VII" /citation=[56] /replace="" variation 16437 /gene="Pro-VII" /note="g in [66],[63]" /citation=[56] /replace="c" gene 16539..17648 /gene="pV" CDS 16539..17648 /gene="pV" /codon_start=1 /product="minor core protein" /protein_id="AAA92213.1" /db_xref="GI:209829" /translation="MSKRKIKEEMLQVIAPEIYGPPKKEEQDYKPRKLKRVKKKKKDD DDDELDDEVELLHATAPRRRVQWKGRRVRRVLRPGTTVVFTPGERSTRTYKRVYDEVY GDEDLLEQANERLGEFAYGKRHKDMLALPLDEGNPTPSLKPVTLQQVLPTLAPSEEKR GLKRESGDLAPTVQLMVPKRQRLEDVLEKMTVEPGLEPEVRVRPIKQVAPGLGVQTVD VQIPTTSSTSIATATEGMETQTSPVASAVADAAVQAAAAAASKTSTEVQTDPWMFRVS APRRPRRSRKYGTASALLPEYALHPSIAPTPGYRGYTYRPRRRATTRRRTTTGTRRRR RRRQPVLAPISVRRVAREGGRTLVLPTARYHPSIV" gene 17949..17964 /gene="L2_2" polyA_signal 17949..17954 /gene="L2_2" /note="49.94%; putative" variation 17964 /gene="L2_2" /note="g in [67],[(in) Doerfler,W. (Ed.);Adenovirus DNA: 1-51;Martinus Nijhoff Publishing, Boston ]used" /citation=[59] /replace="c" gene 18001..18753 /gene="pVI" CDS 18001..18753 /gene="pVI" /note="precursor" /codon_start=1 /product="hexon-associated protein" /protein_id="AAA92214.1" /db_xref="GI:209830" /translation="MEDINFASLAPRHGSRPFMGNWQDIGTSNMSGGAFSWGSLWSGI KNFGSTIKNYGSKAWNSSTGQMLRDKLKEQNFQQKVVDGLASGISGVVDLANQAVQNK INSKLDPRPPVEEPPPAVETVSPEGRGEKRPRPDREETLVTQIDEPPSYEEALKQGLP TTRPIAPMATGVLGQHTPVTLDLPPPADTQQKPVLPGPSAVVVTRPSRASLRRAASGP RSMRPVASGNWQSTLNSIVGLGVQSLKRRRCF" CDS 18838..21744 /gene="L4" /note="virion component II" /codon_start=1 /product="hexon protein" /protein_id="AAA92215.1" /db_xref="GI:209831" /translation="MATPSMMPQWSYMHISGQDASEYLSPGLVQFARATETYFSLNNK FRNPTVAPTHDVTTDRSQRLTLRFIPVDREDTAYSYKARFTLAVGDNRVLDMASTYFD IRGVLDRGPTFKPYSGTAYNALAPKGAPNSCEWEQTEDSGRAVAEDEEEEDEDEEEEE EEQNARDQATKKTHVYAQAPLSGETITKSGLQIGSDNAETQAKPVYADPSYQPEPQIG ESQWNEADANAAGGRVLKKTTPMKPCYGSYARPTNPFGGQSVLVPDEKGVPLPKVDLQ FFSNTTSLNDRQGNATKPKVVLYSEDVNMETPDTHLSYKPGKGDENSKAMLGQQSMPN RPNYIAFRDNFIGLMYYNSTGNMGVLAGQASQLNAVVDLQDRNTELSYQLLLDSIGDR TRYFSMWNQAVDSYDPDVRIIENHGTEDELPNYCFPLGGIGVTDTYQAIKANGNGSGD NGDTTWTKDETFATRNEIGVGNNFAMEINLNANLWRNFLYSNIALYLPDKLKYNPTNV EISDNPNTYDYMNKRVVAPGLVDCYINLGARWSLDYMDNVNPFNHHRNAGLRYRSMLL GNGRYVPFHIQVPQKFFAIKNLLLLPGSYTYEWNFRKDVNMVLQSSLGNDLRVDGASI KFDSICLYATFFPMAHNTASTLEAMLRNDTNDQSFNDYLSAANMLYPIPANATNVPIS IPSRNWAAFRGWAFTRLKTKETPSLGSGYDPYYTYSGSIPYLDGTFYLNHTFKKVAIT FDSSVSWPGNDRLLTPNEFEIKRSVDGEGYNVAQCNMTKDWFLVQMLANYNIGYQGFY IPESYKDRMYSFFRNFQPMSRQVVDDTKYKEYQQVGILHQHNNSGFVGYLAPTMREGQ AYPANVPYPLIGKTAVDSITQKKFLCDRTLWRIPFSSNFMSMGALTDLGQNLLYANSA HALDMTFEVDPMDEPTLLYVLFEVFDVVRVHQPHRGVIETVYLRTPFSAGNATT" old_sequence 18914..18915 /gene="L3_1" /citation=[34] old_sequence 18919 /gene="L3_1" /citation=[36] old_sequence 19617 /gene="L3_1" /citation=[36] old_sequence 19666 /gene="L3_1" /citation=[36] old_sequence 19823 /gene="L3_1" /citation=[36] old_sequence 20427 /gene="L3_1" /citation=[36] old_sequence 20487 /gene="L3_1" /citation=[36] gene 21778..22392 /gene="23 kD" CDS 21778..22392 /gene="23 kD" /codon_start=1 /product="endopeptidase" /protein_id="AAA92216.1" /db_xref="GI:209832" /translation="MGSSEQELKAIVKDLGCGPYFLGTYDKRFPGFVSPHKLACAIVN TAGRETGGVHWMAFAWNPRSKTCYLFEPFGFSDQRLKQVYQFEYESLLRRSAIASSPD RCITLEKSTQSVQGPNSAACGLFCCMFLHAFANWPQTPMDHNPTMNLITGVPNSMLNS PQVQPTLRRNQEQLYSFLERHSPYFRSHSAQIRSATSFCHLKNM" gene 22418..22423 /gene="L3_2" polyA_signal 22418..22423 /gene="L3_2" /note="putative" prim_transcript complement(22420..27092) /gene="E2b" /citation=[35] prim_transcript complement(22420..27091) /gene="E2b" /citation=[35] prim_transcript complement(22420..25956) /gene="E2b" /citation=[65] prim_transcript complement(22420..25954) /gene="E2b" /citation=[35] gene complement(22420..22444) /gene="E2a_1" polyA_signal complement(22439..22444) /gene="E2a_1" /note="62.43%; putative" CDS complement(22490..24079) /gene="DBP" /note="72kD protein" /codon_start=1 /product="DNA-binding protein" /protein_id="AAA92217.1" /db_xref="GI:209833" /translation="MASREEEQRETTPERGRGAARRPPTMEDVSSPSPSPPPPRAPPK KRLRRRLESEDEEDSSQDALVPRTPSPRPSTSTADLAIASKKKKKRPSPKPERPPSPE VIVDSEEEREDVALQMVGFSNPPVLIKHGKGGKRTVRRLNEDDPVARGMRTQEEKEES SEAESESTVINPLSLPIVSAWEKGMEAARALMDKYHVDNDLKANFKLLPDQVEALAAV CKTWLNEEHRGLQLTFTSNKTFVTMMGRFLQAYLQSFAEVTYKHHEPTGCALWLHRCA EIEGELKCLHGSIMINKEHVIEMDVTSENGQRALKEQSSKAKIVKNRWGRNVVQISNT DARCCVHDAACPANQFSGKSCGMFFSEGAKAQVAFKQIKAFMQALYPNAQTGHGHLLM PLRCECNSKPGHAPFLGRQLPKLTPFALSNAEDLDADLISDKSVLASVHHPALIVFQC CNPVYRNSRAQGGGPNCDFKISAPDLLNALVMVRSLWSENFTELPRMVVPEFKWSTKH QYRNVSLPVAHSDARQNPFDF" gene complement(22490..22524) /gene="DBP" variation 22524 /gene="L4" /note="t in [48],[(in) Doerfler,W. (Ed.);Adenovirus DNA: 1-51;Martinus Nijhoff Publishing, Boston ]" /citation=[29] /replace="c" gene complement(24089..25885) /gene="E2a_2" intron complement(24089..24714) /gene="E2a_2" /citation=[39] CDS 24108..26525 /gene="L5" /note="100K protein" /codon_start=1 /product="hexon assembly-associated protein" /protein_id="AAA92218.1" /db_xref="GI:209834" /translation="MESVEKEDSLTAPFEFATTASTDAANAPTTFPVEAPPLEEEEVI IEQDPGFVSEDDEDRSVPTEDKKQDQDDAEANEEQVGRGDQRHGDYLDVGDDVLLKHL QRQCAIICDALQERSDVPLAIADVSLAYERHLFSPRVPPKRQENGTCEPNPRLNFYPV FAVPEVLATYHIFFQNCKIPLSCRANRSRADKQLALRQGAVIPDIASLDEVPKIFEGL GRDEKRAANALQQENSENESHCGVLVELEGDNARLAVLKRSIEVTHFAYPALNLPPKV MSTVMSELIVRRARPLERDANLQEQTEEGLPAVGDEQLARWLETREPADLEERRKLMM AAVLVTVELECMQRFFADPEMQRKLEETLHYTFRQGYVRQACKISNVELCNLVSYLGI LHENRLGQNVLHSTLKGEARRDYVRDCVYLFLCYTWQTAMGVWQQCLEERNLKELQKL LKQNLKDLWTAFNERSVAAHLADIIFPERLLKTLQQGLPDFTSQSMLQNFRNFILERS GILPATCCALPSDFVPIKYRECPPPLWGHCYLLQLANYLAYHSDIMEDVSGDGLLECH CRCNLCTPHRSLVCNSQLLSESQIIGTFELQGPSPDEKSAAPGLKLTPGLWTSAYLRK FVPEDYHAHEIRFYEDQSRPPNAELTACVITQGHILGQLQAINKARQEFLLRKGRGVY LDPQSGEELNPIPPPPQPYQQPRALASQDGTQKEAAAAAAATHGRGGILGQSGRGGFG RGGGDDGRLGQPRRSFRGRRGVRRNTVTLGRIPLAGAPEIGNRSQHRYNLRSSGAAGT ACSPTQP" intron complement(24792..27024) /gene="E2b" /citation=[16] intron complement(24972..25885) /gene="E2a_2" /note="does not fit consensus" /citation=[60] /cons_splice=(5'site:no, 3'site:no) gene 26239..27127 /gene="33K" CDS join(26239..26551,26754..27127) /gene="33K" /codon_start=1 /product="virion morphogenesis-associated protein" /protein_id="AAA92219.1" /db_xref="GI:209818" /translation="MAPKKKLQLPPPPPTDEEEYWDSQAEEVLDEEEEMMEDWDSLDE ASEAEEVSDETPSPSVAFPSPAPQKLATVPSIATTSAPQAPPALPVRRPNRRWDTTGT RAAPTAPAAAAAAATAAVTQKQRRPDSKTLTKPKKSTAAAAAGGGALRLAPNEPVSTR ELRNRIFPTLYAIFQQSRGQEQELKIKNRSLRSLTRSCLYHKSEDQLRRTLEDAEALF SKYCALTLKD" gene 26552..26753 /gene="33kD protein" intron 26552..26753 /gene="33kD protein" /citation=[50] gene 27215..27898 /gene="pVIII" CDS 27215..27898 /gene="pVIII" /note="precursor" /codon_start=1 /product="hexon-associated protein" /protein_id="AAA92220.1" /db_xref="GI:209835" /translation="MSKEIPTPYMWSYQPQMGLAAGAAQDYSTRINYMSAGPHMISRV NGIRAHRNRILLEQAAITTTPRNNLNPRSWPAALVYQESPAPTTVVLPRDAQAEVQMT NSGAQLAGGFRHRVRSPGQGITHLKIRGRGIQLNDESVSSSLGLRPDGTFQIGGAGRS SFTPRQAILTLQTSSSEPRSGGIGTLQFIEEFVPSVYFNPFSGPPGHYPDQFIPNFDA VKDSADGYD" mRNA 27609..30864 /gene="E3-2" /citation=[55] gene 27609..30847 /gene="E3-2" mRNA 27609..29804 /gene="E3-1" /citation=[48] mRNA 27609..29801 /gene="E3-1" /citation=[48] mRNA 27609..29799 /gene="E3-1" /citation=[48] mRNA 27609..29792 /gene="E3-1" /citation=[35] gene 27609..29774 /gene="E3-1" intron 27981..28375 /gene="E3-1" /note="major late mRNA intron ('x' leader)" /citation=[71] polyA_signal 28205..28210 /gene="E3-1" /note="major late mRNA L4 78.48%; putative" old_sequence 28339 /gene="E3-1" /citation=[19] old_sequence 28350 /gene="E3-1" /citation=[19] old_sequence 28359 /gene="E3-1" /citation=[19] old_sequence 28465..28466 /gene="E3-1" /citation=[19] old_sequence 28495..28497 /gene="E3-1" /citation=[19] intron 28560..30437 /gene="E3-2" /note="major late mRNA ('y' leader)" /citation=[34] gene 28812..29773 /gene="E3" CDS 28812..29291 /gene="E3" /note="19kD protein" /codon_start=1 /product="glycosylated membrane protein" /protein_id="AAA92221.1" /db_xref="GI:209836" /translation="MRYMILGLLALAAVCSAAKKVEFKEPACNVTFKSEANECTTLIK CTTEHEKLIIRHKDKIGKYAVYAIWQPGDTNDYNVTVFQGENRKTFMYKFPFYEMCDI TMYMSKQYKLWPPQKCLENTGTFCSTALLITALALVCTLLYLKYKSRRSFIDEKKMP" CDS 29468..29773 /gene="E3" /codon_start=1 /product="11.6 kD protein" /protein_id="AAA92222.1" /db_xref="GI:209837" /translation="MTGSTIAPTTDYRNTTATGLTSALNLPQVHAFVNDWASLDMWWF SIALMFVCLIIMWLICCLKRRRARPPIYRPIIVLNPHNEKIHRLDGLKPCSLLLQYD" polyA_signal 29769..29774 /gene="E3-1" /note="82.69%; putative" intron 30583..31029 /gene="L5" /note="major late mRNA ('z' leader)" /citation=[7] polyA_signal 30842..30847 /gene="E3-2" /note="85.82%; putative" old_sequence 30980..30981 /gene="L5" /citation=[19] CDS 31030..32778 /gene="L5" /note="virion component IV" /codon_start=1 /product="fiber protein" /protein_id="AAA92223.1" /db_xref="GI:209838" /translation="MKRARPSEDTFNPVYPYDTETGPPTVPFLTPPFVSPNGFQESPP GVLSLRVSEPLDTSHGMLALKMGSGLTLDKAGNLTSQNVTTVTQPLKKTKSNISLDTS APLTITSGALTVATTAPLIVTSGALSVQSQAPLTVQDSKLSIATKGPITVSDGKLALQ TSAPLSGSDSDTLTVTASPPLTTATGSLGINMEDPIYVNNGKIGIKISGPLQVAQNSD TLTVVTGPGVTVEQNSLRTKVAGAIGYDSSNNMEIKTGGGMRINNNLLILDVDYPFDA QTKLRLKLGQGPLYINASHNLDINYNRGLYLFNASNNTKKLEVSIKKSSGLNFDNTAI AINAGKGLEFDTNTSESPDINPIKTKIGSGIDYNENGAMITKLGAGLSFDNSGAITIG NKNDDKLTLWTTPDPSPNCRIHSDNDCKFTLVLTKCGSQVLATVAALAVSGDLSSMTG TVASVSIFLRFDQNGVLMENSSLKKHYWNFRNGNSTNANPYTNAVGFMPNLLAYPKTQ SQTAKNNIVSQVYLHGDKTKPMILTITLNGTSESTETSEVSTYSMSFTWSWESGKYTT ETFATNSYTFSYIAQE" polyA_signal 32774..32779 /gene="L5" /note="91.19%; putative" prim_transcript complement(32802..35609) /citation=[35] gene complement(32802..35547) /gene="E4" polyA_signal complement(32821..32826) /gene="E4" /note="91.32%; putative" intron complement(33193..33903) /gene="E4" /citation=[65] intron complement(33284..33903) /gene="E4" /citation=[65] intron complement(33377..33903) /gene="E4" /citation=[63] intron complement(33404..33903) /gene="E4" /citation=[63] intron complement(33452..33903) /gene="E4" /citation=[63] intron complement(33610..33903) /gene="E4" /citation=[63] intron complement(33679..33903) /gene="E4" /citation=[63] intron complement(33875..33903) /gene="E4" /citation=[63] intron complement(34083..34288) /gene="E4" /citation=[65] intron complement(34242..34288) /gene="E4" /citation=[65] intron complement(34330..34605) /gene="E4" /citation=[65] variation 34344..34345 /gene="E4" /note="tt in [44],[(in) Doerfler,W. (Ed.);Adenovirus DNA: 1-51;Martinus Nijhoff Publishing, Boston ]; t in [38]" CDS complement(34356..34706) /gene="E4" /note="11kD protein; splice sites not sequenced" /codon_start=1 /product="nuclear binding protein" /protein_id="AAA92224.1" /db_xref="GI:209839" /translation="MIRCLRLKVEGALEQIFTMAGLNIRDLLRDILIRWRDENYLGMV EGAGMFIEEIHPEGFSLYVHLDVRAVCLLEAIVQHLTNAIICSLAVEFDHATGGERVH LIDLHFEVLDNLLE" intron complement(34380..34605) /gene="E4" /citation=[63] intron complement(34436..34605) /gene="E4" /citation=[65] intron complement(34736..35547) /gene="E4" /citation=[65] intron complement(35108..35547) /gene="E4" repeat_region 35836..35937 /standard_name="inverted terminal repeat" /note="99.54%" /citation=[13] variation 35930 /note="t in [44],[(in) Doerfler,W. (Ed.);Adenovirus DNA: 1-51;Martinus Nijhoff Publishing, Boston ]; tt in other strains" /replace="tt" ORIGIN 1 catcatcata atatacctta ttttggattg aagccaatat gataatgagg gggtggagtt 61 tgtgacgtgg cgcggggcgt gggaacgggg cgggtgacgt agtagtgtgg cggaagtgtg 121 atgttgcaag tgtggcggaa cacatgtaag cgccggatgt ggtaaaagtg acgtttttgg 181 tgtgcgccgg tgtatacggg aagtgacaat tttcgcgcgg ttttaggcgg atgttgtagt 241 aaatttgggc gtaaccaagt aatgtttggc cattttcgcg ggaaaactga ataagaggaa 301 gtgaaatctg aataattctg tgttactcat agcgcgtaat atttgtctag ggccgcgggg 361 actttgaccg tttacgtgga gactcgccca ggtgtttttc tcaggtgttt tccgcgttcc 421 gggtcaaagt tggcgtttta ttattatagt cagctgacgc gcagtgtatt tatacccggt 481 gagttcctca agaggccact cttgagtgcc agcgagtaga gttttctcct ccgagccgct 541 ccgacaccgg gactgaaaat gagacatatt atctgccacg gaggtgttat taccgaagaa 601 atggccgcca gtcttttgga ccagctgatc gaagaggtac tggctgataa tcttccacct 661 cctagccatt ttgaaccacc tacccttcac gaactgtatg atttagacgt gacggccccc 721 gaagatccca acgaggaggc ggtttcgcag atttttcccg agtctgtaat gttggcggtg 781 caggaaggga ttgacttatt cacttttccg ccggcgcccg gttctccgga gccgcctcac 841 ctttcccggc agcccgagca gccggagcag agagccttgg gtccggtttc tatgccaaac 901 cttgtgccgg aggtgatcga tcttacctgc cacgaggctg gctttccacc cagtgacgac 961 gaggatgaag agggtgagga gtttgtgtta gattatgtgg agcaccccgg gcacggttgc 1021 aggtcttgtc attatcaccg gaggaatacg ggggacccag atattatgtg ttcgctttgc 1081 tatatgagga cctgtggcat gtttgtctac agtaagtgaa aattatgggc agtcggtgat 1141 agagtggtgg gtttggtgtg gtaatttttt tttaattttt acagttttgt ggtttaaaga 1201 attttgtatt gtgatttttt aaaaggtcct gtgtctgaac ctgagcctga gcccgagcca 1261 gaaccggagc ctgcaagacc tacccggcgt cctaaattgg tgcctgctat cctgagacgc 1321 ccgacatcac ctgtgtctag agaatgcaat agtagtacgg atagctgtga ctccggtcct 1381 tctaacacac ctcctgagat acacccggtg gtcccgctgt gccccattaa accagttgcc 1441 gtgagagttg gtgggcgtcg ccaggctgtg gaatgtatcg aggacttgct taacgagtct 1501 gggcaacctt tggacttgag ctgtaaacgc cccaggccat aaggtgtaaa cctgtgattg 1561 cgtgtgtggt taacgccttt gtttgctgaa tgagttgatg taagtttaat aaagggtgag 1621 ataatgttta acttgcatgg cgtgttaaat ggggcggggc ttaaagggta tataatgcgc 1681 cgtgggctaa tcttggttac atctgacctc atggaggctt gggagtgttt ggaagatttt 1741 tctgctgtgc gtaacttgct ggaacagagc tctaacagta cctcttggtt ttggaggttt 1801 ctgtggggct cctcccaggc aaagttagtc tgcagaatta aggaggatta caagtgggaa 1861 tttgaagagc ttttgaaatc ctgtggtgag ctgtttgatt ctttgaatct gggtcaccag 1921 gcgcttttcc aagagaaggt catcaagact ttggattttt ccacaccggg gcgcgctgcg 1981 gctgctgttg cttttttgag ttttataaag gataaatgga gcgaagaaac ccatctgagc 2041 ggggggtacc tgctggattt tctggccatg catctgtgga gagcggtggt gagacacaag 2101 aatcgcctgc tactgttgtc ttccgtccgc ccggcaataa taccgacgga ggagcaacag 2161 caggaggaag ccaggcggcg gcggcggcag gagcagagcc catggaaccc gagagccggc 2221 ctggaccctc gggaatgaat gttgtacagg tggctgaact gtttccagaa ctgagacgca 2281 ttttaaccat taacgaggat gggcaggggc taaagggggt aaagagggag cggggggctt 2341 ctgaggctac agaggaggct aggaatctaa cttttagctt aatgaccaga caccgtcctg 2401 agtgtgttac ttttcagcag attaaggata attgcgctaa tgagcttgat ctgctggcgc 2461 agaagtattc catagagcag ctgaccactt actggctgca gccaggggat gattttgagg 2521 aggctattag ggtatatgca aaggtggcac ttaggccaga ttgcaagtac aagattagca 2581 aacttgtaaa tatcaggaat tgttgctaca tttctgggaa cggggccgag gtggagatag 2641 atacggagga tagggtggcc tttagatgta gcatgataaa tatgtggccg ggggtgcttg 2701 gcatggacgg ggtggttatt atgaatgtga ggtttactgg tcccaatttt agcggtacgg 2761 ttttcctggc caataccaat cttatcctac acggtgtaag cttctatggg tttaacaata 2821 cctgtgtgga agcctggacc gatgtaaggg ttcggggctg tgccttttac tgctgctgga 2881 agggggtggt gtgtcgcccc aaaagcaggg cttcaattaa gaaatgcctg tttgaaaggt 2941 gtaccttggg tatcctgtct gagggtaact ccagggtgcg ccacaatgtg gcctccgact 3001 gtggttgctt catgctagtg aaaagcgtgg ctgtgattaa gcataacatg gtgtgtggca 3061 actgcgagga cagggcctct cagatgctga cctgctcgga cggcaactgt cacttgctga 3121 agaccattca cgtagccagc cactctcgca aggcctggcc agtgtttgag cacaacatac 3181 tgacccgctg ttccttgcat ttgggtaaca ggaggggggt gttcctacct taccaatgca 3241 atttgagtca cactaagata ttgcttgagc ccgagagcat gtccaaggtg aacctgaacg 3301 gggtgtttga catgaccatg aagatctgga aggtgctgag gtacgatgag acccgcacca 3361 ggtgcagacc ctgcgagtgt ggcggtaaac atattaggaa ccagcctgtg atgctggatg 3421 tgaccgagga gctgaggccc gatcacttgg tgctggcctg cacccgcgct gagtttggct 3481 ctagcgatga agatacagat tgaggtactg aaatgtgtgg gcgtggctta agggtgggaa 3541 agaatatata aggtgggggt ctcatgtagt tttgtatctg ttttgcagca gccgccgcca 3601 tgagcgccaa ctcgtttgat ggaagcattg tgagctcata tttgacaacg cgcatgcccc 3661 catgggccgg ggtgcgtcag aatgtgatgg gctccagcat tgatggtcgc cccgtcctgc 3721 ccgcaaactc tactaccttg acctacgaga ccgtgtctgg aacgccgttg gagactgcag 3781 cctccgccgc cgcttcagcc gctgcagcca ccgcccgcgg gattgtgact gactttgctt 3841 tcctgagccc gcttgcaagc agtgcagctt cccgttcatc cgcccgcgat gacaagttga 3901 cggctctttt ggcacaattg gattctttga cccgggaact taatgtcgtt tctcagcagc 3961 tgttggatct gcgccagcag gtttctgccc tgaaggcttc ctcccctccc aatgcggttt 4021 aaaacataaa taaaaaccag actctgtttg gattttgatc aagcaagtgt cttgctgtct 4081 ttatttaggg gttttgcgcg cgcggtaggc ccgggaccag cggtctcggt cgttgagggt 4141 cctgtgtatt ttttccagga cgtggtaaag gtgactctgg atgttcagat acatgggcat 4201 aagcccgtct ctggggtgga ggtagcacca ctgcagagct tcatgctgcg gggtggtgtt 4261 gtagatgatc cagtcgtagc aggagcgctg ggcgtggtgc ctaaaaatgt ctttcagtag 4321 caagctgatt gccaggggca ggcccttggt gtaagtgttt acaaagcggt taagctggga 4381 tgggtgcata cgtggggata tgagatgcat cttggactgt atttttaggt tggctatgtt 4441 cccagccata tccctccggg gattcatgtt gtgcagaacc accagcacag tgtatccggt 4501 gcacttggga aatttgtcat gtagcttaga aggaaatgcg tggaagaact tggagacgcc 4561 cttgtgacct ccgagatttt ccatgcattc gtccataatg atggcaatgg gcccacgggc 4621 ggcggcctgg gcgaagatat ttctgggatc actaacgtca tagttgtgtt ccaggatgag 4681 atcgtcatag gccattttta caaagcgcgg gcggagggtg ccagactgcg gtataatggt 4741 tccatccggc ccaggggcgt agttaccctc acagatttgc atttcccacg ctttgagttc 4801 agatgggggg atcatgtcta cctgcggggc gatgaagaaa accgtttccg gggtagggga 4861 gatcagctgg gaagaaagca ggttcctgag cagctgcgac ttaccgcagc cggtgggccc 4921 gtaaatcaca cctattaccg gctgcaactg gtagttaaga gagctgcagc tgccgtcatc 4981 cctgagcagg ggggccactt cgttaagcat gtccctgact tgcatgtttt ccctgaccaa 5041 atgcgccaga aggcgctcgc cgcccagcga tagcagttct tgcaaggaag caaagttttt 5101 caacggtttg aggccgtccg ccgtaggcat gcttttgagc gtttgaccaa gcagttccag 5161 gcggtcccac agctcggtca cgtgctctac ggcatctcga tccagcatat ctcctcgttt 5221 cgcgggttgg ggcggctttc gctgtacggc agtagtcggt gctcgtccag acgggccagg 5281 gtcatgtctt tccacgggcg cagggtcctc gtcagcgtag tctgggtcac ggtgaagggg 5341 tgcgctccgg gctgcgcgct ggccagggtg cgcttgaggc tggtcctgct ggtgctgaag 5401 cgctgccggt cttcgccctg cgcgtcggcc aggtagcatt tgaccatggt gtcatagtcc 5461 agcccctccg cggcgtggcc cttggcgcgc agcttgccct tggaggaggc gccgcacgag 5521 gggcagtgca gacttttaag ggcgtagagc ttgggcgcga gaaataccga ttccggggag 5581 taggcatccg cgccgcaggc cccgcagacg gtctcgcatt ccacgagcca ggtgagctct 5641 ggccgttcgg ggtcaaaaac caggtttccc ccatgctttt tgatgcgttt cttacctctg 5701 gtttccatga gccggtgtcc acgctcggtg acgaaaaggc tgtccgtgtc cccgtataca 5761 gacttgagag gcctgtcctc gagcggtgtt ccgcggtcct cctcgtatag aaactcggac 5821 cactctgaga cgaaggctcg cgtccaggcc agcacgaagg aggctaagtg ggaggggtag 5881 cggtcgttgt ccactagggg gtccactcgc tccagggtgt gaagacacat gtcgccctct 5941 tcggcatcaa ggaaggtgat tggtttatag gtgtaggcca cgtgaccggg tgttcctgaa 6001 ggggggctat aaaagggggt gggggcgcgt tcgtcctcac tctcttccgc atcgctgtct 6061 gcgagggcca gctgttgggg tgagtactcc ctctcaaaag cgggcatgac ttctgcgcta 6121 agattgtcag tttccaaaaa cgaggaggat ttgatattca cctggcccgc ggtgatgcct 6181 ttgagggtgg ccgcgtccat ctggtcagaa aagacaatct ttttgttgtc aagcttggtg 6241 gcaaacgacc cgtagagggc gttggacagc aacttggcga tggagcgcag ggtttggttt 6301 ttgtcgcgat cggcgcgctc cttggccgcg atgtttagct gcacgtattc gcgcgcaacg 6361 caccgccatt cgggaaagac ggtggtgcgc tcgtcgggca ccaggtgcac gcgccaaccg 6421 cggttgtgca gggtgacaag gtcaacgctg gtggctacct ctccgcgtag gcgctcgttg 6481 gtccagcaga ggcggccgcc cttgcgcgaa cagaatggcg gtagtgggtc tagctgcgtc 6541 tcgtccgggg ggtctgcgtc cacggtaaag accccgggca gcaggcgcgc gtcgaagtag 6601 tctatcttgc atccttgcaa gtctagcgcc tgctgccatg cgcgggcggc aagcgcgcgc 6661 tcgtatgggt tgagtggggg accccatggc atggggtggg tgagcgcgga ggcgtacatg 6721 ccgcaaatgt cgtaaacgta gaggggctct ctgagtattc caagatatgt agggtagcat 6781 cttccaccgc ggatgctggc gcgcacgtaa tcgtatagtt cgtgcgaggg agcgaggagg 6841 tcgggaccga ggttgctacg ggcgggctgc tctgctcgga agactatctg cctgaagatg 6901 gcatgtgagt tggatgatat ggttggacgc tggaagacgt tgaagctggc gtctgtgaga 6961 cctaccgcgt cacgcacgaa ggaggcgtag gagtcgcgca gcttgttgac cagctcggcg 7021 gtgacctgca cgtctagggc gcagtagtcc agggtttcct tgatgatgtc atacttatcc 7081 tgtccctttt ttttccacag ctcgcggttg aggacaaact cttcgcggtc tttccagtac 7141 tcttggatcg gaaacccgtc ggcctccgaa cggtaagagc ctagcatgta gaactggttg 7201 acggcctggt aggcgcagca tcccttttct acgggtagcg cgtatgcctg cgcggccttc 7261 cggagcgagg tgtgggtgag cgcaaaggtg tccctaacca tgactttgag gtactggtat 7321 ttgaagtcag tgtcgtcgca tccgccctgc tcccagagca aaaagtccgt gcgctttttg 7381 gaacgcgggt ttggcagggc gaaggtgaca tcgttgaaaa gtatctttcc cgcgcgaggc 7441 ataaagttgc gtgtgatgcg gaagggtccc ggcacctcgg aacggttgtt aattacctgg 7501 gcggcgagca cgatctcgtc gaagccgttg atgttgtggc ccacgatgta aagttccaag 7561 aagcgcgggg tgcccttgat ggagggcaat tttttaagtt cctcgtaggt gagctcctca 7621 ggggagctga gcccgtgttc tgacagggcc cagtctgcaa gatgagggtt ggaagcgacg 7681 aatgagctcc acaggtcacg ggccattagc atttgcaggt ggtcgcgaaa ggtcctaaac 7741 tggcgaccta tggccatttt ttctggggtg atgcagtaga aggtaagcgg gtcttgttcc 7801 cagcggtccc atccaaggtc cacggctagg tctcgcgcgg cggtcaccag aggctcatct 7861 ccgccgaact tcataaccag catgaagggc acgagctgct tcccaaaggc ccccatccaa 7921 gtataggtct ctacatcgta ggtgacaaag agacgctcgg tgcgaggatg cgagccgatc 7981 gggaagaact ggatctcccg ccaccagttg gaggagtggc tgttgatgtg gtgaaagtag 8041 aagtccctgc gacgggccga acactcgtgc tggcttttgt aaaaacgtgc gcagtactgg 8101 cagcggtgca cgggctgtac atcctgcacg aggttgacct gacgaccgcg cacaaggaag 8161 cagagtggga atttgagccc ctcgcctggc gggtttggct ggtggtcttc tacttcggct 8221 gcttgtcctt gaccgtctgg ctgctcgagg ggagttatgg tggatcggac caccacgccg 8281 cgcgagccca aagtccagat gtccgcgcgc ggcggtcgga gcttgatgac aacatcgcgc 8341 agatgggagc tgtccatggt ctggagctcc cgcggcgaca ggtcaggcgg gagctcctgc 8401 aggtttacct cgcatagccg ggtcagggcg cgggctaggt ccaggtgata cctgatttcc 8461 aggggctggt tggtggcggc gtcgatgact tgcaagaggc cgcatccccg cggcgcgact 8521 acggtaccgc gcggcgggcg gtgggccgcg ggggtgtcct tggatgatgc atctaaaagc 8581 ggtgacgcgg gcgggccccc ggaggtaggg ggggctcggg acccgccggg agagggggca 8641 ggggcacgtc ggcgccgcgc gcgggcagga gctggtgctg cgcgcggagg ttgctggcga 8701 acgcgacgac gcggcggttg atctcctgaa tctggcgcct ctgcgtgaag acgacgggcc 8761 cggtgagctt gaacctgaaa gagagttcga cagaatcaat ttcggtgtcg ttgacggcgg 8821 cctggcgcaa aatctcctgc acgtctcctg agttgtcttg ataggcgatt tcggccatga 8881 actgctcgat ctcttcctcc tggagatctc cgcgtccggc tcgctccacg gtggcggcga 8941 ggtcgttgga gatgcgggcc atgagctgcg agaaggcgtt gaggcctccc tcgttccaga 9001 cgcggctgta gaccacgccc ccttcggcat cgcgggcgcg catgaccacc tgcgcgagat 9061 tgagctccac gtgccgggcg aagacggcgt agtttcgcag gcgctgaaag aggtagttga 9121 gggtggtggc ggtgtgttct gccacgaaga agtacataac ccagcgtcgc aacgtggatt 9181 cgttgatatc ccccaaggcc tcaaggcgct ccatggcctc gtagaagtcc acggcgaagt 9241 tgaaaaactg ggagttgcgc gccgacacgg ttaactcctc ctccagaaga cggatgagct 9301 cggcgacagt gtcgcgcacc tcgcgctcaa aggctacagg ggcctcttct tcttcaatct 9361 cctcttccat aagggcctcc ccttcttctt cttcttctgg cggcggtggg ggagggggga 9421 cacggcggcg acgacggcgc accgggaggc ggtcgacaaa gcgctcgatc atctccccgc 9481 ggcgacggcg catggtctcg gtgacggcgc ggccgttctc gcgggggcgc agttggaaga 9541 cgccgcccgt catgtcccgg ttatgggttg gcggggggct gccgtgcggc agggatacgg 9601 cgctaacgat gcatctcaac aattgttgtg taggtactcc gccaccgagg gacctgagcg 9661 agtccgcatc gaccggatcg gaaaacctct cgagaaaggc gtctaaccag tcacagtcgc 9721 aaggtaggct gagcaccgtg gcgggcggca gcgggtggcg gtcggggttg tttctggcgg 9781 aggtgctgct gatgatgtaa ttaaagtagg cggtcttgag acggcggatg gtcgacagaa 9841 gcaccatgtc cttgggtccg gcctgctgaa tgcgcaggcg gtcggccatg ccccaggctt 9901 cgttttgaca tcggcgcagg tctttgtagt agtcttgcat gagcctttct accggcactt 9961 cttcttctcc ttcctcttgt cctgcatctc ttgcatctat cgctacggcg gcggcggagt 10021 ttggccgtag gtggcgccct cttcctccca tgcgtgtgac cccgaagccc ctcatcggct 10081 gaagcagggc caggtcggcg acaacgcgct cggctaatat ggcctgctgc acctgcgtga 10141 gggtagactg gaagtcatcc atgtccacaa agcggtggta tgcgcccgtg ttgatggtgt 10201 aagtgcagtt ggccataacg gaccagttaa cggtctggtg acccggctgc gagagctcgg 10261 tgtacctgag acgcgagtaa gcccttgagt caaagacgta gtcgttgcaa gtccgcacca 10321 ggtactgata tcccaccaaa aagtgcggcg gcggctggcg gtagaggggc cagcgtaggg 10381 tggccggggc tccgggggcg aggtcttcca acataaggcg atgatatccg tagatgtacc 10441 tggacatcca ggtgatgccg gcggcggtgg tggaggcgcg cggaaagtcg cggacgcggt 10501 tccagatgtt gcgcagcggc aaaaagtgct ccatggtcgg gacgctctgg ccggtgaggc 10561 gtgcgcagtc gttgacgctc tagaccgtgc aaaaggagag cctgtaagcg ggcactcttc 10621 cgtggtctgg tggataaatt cgcaagggta tcatggcgga cgaccggggt tcgaaccccg 10681 gatccggccg tccgccgtga tccatgcggt taccgcccgc gtgtcgaacc caggtgtgcg 10741 acgtcagaca acgggggagc gctccttttg gcttccttcc aggcgcggcg gctgctgcgc 10801 tagctttttt ggccactggc cgcgcgcggc gtaagcggtt aggctggaaa gcgaaagcat 10861 taagtggctc gctccctgta gccggagggt tattttccaa gggttgagtc gcaggacccc 10921 cggttcgagt ctcgggccgg ccggactgcg gcgaacgggg gtttgcctcc ccgtcatgca 10981 agaccccgct tgcaaattcc tccggaaaca gggacgagcc ccttttttgc ttttcccaga 11041 tgcatccggt gctgcggcag atgcgccccc ctcctcagca gcggcaagag caagagcagc 11101 ggcagacatg cagggcaccc tccccttctc ctaccgcgtc aggaggggca acatccgcgg 11161 ctgacgcggc ggcagatggt gattacgaac ccccgcggcg ccgggcccgg cactacctgg 11221 acttggagga gggcgagggc ctggcgcggc taggagcgcc ctctcctgag cgacacccaa 11281 gggtgcagct gaagcgtgac acgcgcgagg cgtacgtgcc gcggcagaac ctgtttcgcg 11341 accgcgaggg agaggagccc gaggagatgc gggatcgaaa gttccacgca gggcgcgagt 11401 tgcggcatgg cctgaaccgc gagcggttgc tgcgcgagga ggactttgag cccgacgcgc 11461 ggaccgggat tagtcccgcg cgcgcacacg tggcggccgc cgacctggta accgcgtacg 11521 agcagacggt gaaccaggag attaactttc aaaaaagctt taacaaccac gtgcgcacgc 11581 ttgtggcgcg cgaggaggtg gctataggac tgatgcatct gtgggacttt gtaagcgcgc 11641 tggagcaaaa cccaaatagc aagccgctca tggcgcagct gttccttata gtgcagcaca 11701 gcagggacaa cgaggcattc agggatgcgc tgctaaacat agtagagccc gagggccgct 11761 ggctgctcga tttgataaac attctgcaga gcatagtggt gcaggagcgc agcttgagcc 11821 tggctgacaa ggtggccgcc attaactatt ccatgctcag tctgggcaag ttttacgccc 11881 gcaagatata ccatacccct tacgttccca tagacaagga ggtaaagatc gaggggttct 11941 acatgcgcat ggcgttgaag gtgcttacct tgagcgacga cctgggcgtt tatcgcaacg 12001 agcgcatcca caaggccgtg agcgtgagcc ggcggcgcga gctcagcgac cgcgagctga 12061 tgcacagcct gcaaagggcc ctggctggca cgggcagcgg cgatagagag gccgagtcct 12121 actttgacgc gggcgctgac ctgcgctggg ccccaagccg acgcgccctg gaggcagctg 12181 gggccggacc tgggctggcg gtggcacccg cgcgcgctgg caacgtcggc ggcgtggagg 12241 aatatgacga ggacgatgag tacgagccag aggacggcga gtactaagcg gtgatgtttc 12301 tgatcagatg atgcaagacg caacggaccc ggcggtgcgg gcggcgctgc agagccagcc 12361 gtccggcctt aactccacgg acgactggcg ccaggtcatg gaccgcatca tgtcgctgac 12421 tgcgcgtaac cctgacgcgt tccggcagca gccgcaggcc aaccggctct ccgcaattct 12481 ggaagcggtg gtcccggcgc gcgcaaaccc cacgcacgag aaggtgctgg cgatcgtaaa 12541 cgcgctggcc gaaaacaggg ccatccggcc cgatgaggcc ggcctggtct acgacgcgct 12601 gcttcagcgc gtggctcgtt acaacagcgg caacgtgcag accaacctgg accggctggt 12661 gggggatgtg cgcgaggccg tggcgcagcg tgagcgcgcg cagcagcagg gcaacctggg 12721 ctccatggtt gcactaaacg ccttcctgag tacacagccc gccaacgtgc cgcggggaca 12781 ggaggactac accaactttg tgagcgcact gcggctaatg gtgactgaga caccgcaaag 12841 tgaggtgtac cagtccgggc cagactattt tttccagacc agtagacaag gcctgcagac 12901 cgtaaacctg agccaggctt tcaagaactt gcaggggctg tggggggtgc gggctcccac 12961 aggcgaccgc gcgaccgtgt ctagcttgct gacgcccaac tcgcgcctgt tgctgctgct 13021 aatagcgccc ttcacggaca gtggcagcgt gtcccgggac acatacctag gtcacttgct 13081 gacactgtac cgcgaggcca taggtcaggc gcatgtggac gagcatactt tccaggagat 13141 tacaagtgtc agccgcgcgc tggggcagga ggacacgggc agcctggagg caaccctgaa 13201 ctacctgctg accaaccggc ggcagaagat cccctcgttg cacagtttaa acagcgagga 13261 ggagcgcatc ttgcgctatg tgcagcagag cgtgagcctt aacctgatgc gcgacggggt 13321 aacgcccagc gtggcgctgg acatgaccgc gcgcaacatg gaaccgggca tgtatgcctc 13381 aaaccggccg tttatcaatc gcctaatgga ctacttgcat cgcgcggccg ccgtgaaccc 13441 cgagtatttc accaatgcca tcttgaaccc gcactggcta ccgccccctg gtttctacac 13501 cgggggattt gaggtgcccg agggtaacga tggattcctc tgggacgaca tagacgacag 13561 cgtgttttcc ccgcaaccgc agaccctgct agagttgcaa cagcgcgagc aggcagaggc 13621 ggcgctgcga aaggaaagct tccgcaggcc aagcagcttg tccgatctag gcgctgcggc 13681 cccgcggtca gatgcgagta gcccatttcc aagcttgata gggtctttta ccagcactcg 13741 caccacccgc ccgcgcctgc tgggcgagga ggagtaccta aacaactcgc tgctgcagcc 13801 gcagcgcgaa aagaacctgc ctccggcatt tcccaacaac gggatagaga gcctagtgga 13861 caagatgagt agatggaaga cgtatgcgca ggagcacagg gatgtgcccg gcccgcgccc 13921 gcccacccgt cgtcaaaggc acgaccgtca gcggggtctg gtgtgggagg acgatgactc 13981 ggcagacgac agcagcgtcc tggatttggg agggagtggc aacccgtttg cgcaccttcg 14041 ccccaggctg gggagaatgt tttaaaaaaa aaaaaaaaaa gcatgatgca aaataaaaaa 14101 ctcaccaagg ccatggcacc gagcgttggt tttcttgtat tccccttagt atgcagcgcg 14161 cggcgatgta tgaggaaggt cctcctccct cctacgagag cgtggtgagc gcggcgccag 14221 tggcggcggc gctgggttcc cccttcgatg ctcccctgga cccgccgttt gtgcctccgc 14281 ggtacctgcg gcctaccggg gggagaaaca gcatccgtta ctctgagttg gcacccctat 14341 tcgacaccac ccgtgtgtac cttgtggaca acaagtcaac ggatgtggca tccctgaact 14401 accagaacga ccacagcaac tttctaacca cggtcattca aaacaatgac tacagcccgg 14461 gggaggcaag cacacagacc atcaatcttg acgaccgttc gcactggggc ggcgacctga 14521 aaaccatcct gcataccaac atgccaaatg tgaacgagtt catgtttacc aataagttta 14581 aggcgcgggt gatggtgtcg cgctcgctta ctaaggacaa acaggtggag ctgaaatatg 14641 agtgggtgga gttcacgctg cccgagggca actactccga gaccatgacc atagacctta 14701 tgaacaacgc gatcgtggag cactacttga aagtgggcag gcagaacggg gttctggaaa 14761 gcgacatcgg ggtaaagttt gacacccgca acttcagact ggggtttgac ccagtcactg 14821 gtcttgtcat gcctggggta tatacaaacg aagccttcca tccagacatc attttgctgc 14881 caggatgcgg ggtggacttc acccacagcc gcctgagcaa cttgttgggc atccgcaagc 14941 ggcaaccctt ccaggagggc tttaggatca cctacgatga cctggagggt ggtaacattc 15001 ccgcactgtt ggatgtggac gcctaccagg caagcttaaa agatgacacc gaacagggcg 15061 gggatggcgc aggcggcggc aacaacagtg gcagcggcgc ggaagagaac tccaacgcgg 15121 cagccgcggc aatgcagccg gtggaggaca tgaacgatca tgccattcgc ggcgacacct 15181 ttgccacacg ggcggaggag aagcgcgctg aggccgaggc agcggcagaa gctgccgccc 15241 ccgctgcgca acccgaggtc gagaagcctc agaagaaacc ggtgatcaaa cccctgacag 15301 aggacagcaa gaaacgcagt tacaacctaa taagcaatga cagcaccttc acccagtacc 15361 gcagctggta ccttgcatac aactacggcg accctcagac cgggatccgc tcatggaccc 15421 tcctttgcac tcctgacgta acctgcggct cggagcaggt ctactggtcg ttgccagaca 15481 tgatgcaaga ccccgtgacc ttccgctcca cgagccagat cagcaacttt ccggtggtgg 15541 gcgccgagct gttgcccgtg cactccaaga gcttctacaa cgaccaggcc gtctactccc 15601 agctcatccg ccagtttacc tctctgaccc acgtgttcaa tcgctttccc gagaaccaga 15661 ttttggcgcg cccgccagcc cccaccatca ccaccgtcag tgaaaacgtt cctgctctca 15721 cagatcacgg gacgctaccg ctgcgcaaca gcatcggagg agtccagcga gtgaccatta 15781 ctgacgccag acgccgcacc tgcccctacg tttacaaggc cctgggcata gtctcgccgc 15841 gcgtcctatc gagccgcact ttttgagcaa acatgtccat ccttatatcg cccagcaata 15901 acacaggctg gggcctgcgc ttcccaagca agatgtttgg cggggcaaag aagcgctccg 15961 accaacaccc agtgcgcgtg cgcgggcact accgcgcgcc ctggggcgcg cacaaacgcg 16021 gccgcactgg gcgcaccacc gtcgatgacg ccattgacgc ggtggtggag gaggcgcgca 16081 actacacgcc cacgccgcca ccagtgtcca cagtggacgc ggccattcag accgtggtgc 16141 gcggagcccg gcgttatgct aaaatgaaga gacggcggag gcgcgtagca cgtcgccacc 16201 gccgccgacc cggcactgcc gcccaacgcg cggcggcggc cctgcttaac cgcgcacgtc 16261 gcaccggccg acgggcggcc atgcgggccg ctcgaaggct ggccgcgggt attgtcactg 16321 tgccccccag gtccaggcga cgagcggccg ccgcagcagc cgcggccatt agtgctatga 16381 ctcagggtcg caggggcaac gtgtactggg tgcgcgactc ggttagcggc ctgcgcgtgc 16441 ccgtgcgcac ccgccccccg cgcaactaga ttgcaagaaa aaactactta gactcgtact 16501 gttgtatgta tccagcggcg gcggcgcgca acgaagctat gtccaagcgc aaaatcaaag 16561 aagagatgct ccaggtcatc gcgccggaga tctatggccc cccgaagaag gaagagcagg 16621 attacaagcc ccgaaagcta aagcgggtca aaaagaaaaa gaaagatgat gatgatgatg 16681 aacttgacga cgaggtggaa ctgctgcacg caaccgcgcc caggcggcgg gtacagtgga 16741 aaggtcgacg cgtaagacgt gttttgcgac ccggcaccac cgtagttttt acgcccggtg 16801 agcgctccac ccgcacctac aagcgcgtgt atgatgaggt gtacggcgac gaggacctgc 16861 ttgagcaggc caacgagcgc ctcggggagt ttgcctacgg aaagcggcat aaggacatgt 16921 tggcgttgcc gctggacgag ggcaacccaa cacctagcct aaagcccgtg acactgcagc 16981 aggtgctgcc cacgcttgca ccgtccgaag aaaagcgcgg cctaaagcgc gagtctggtg 17041 acttggcacc caccgtgcag ctgatggtac ccaagcgcca gcgactggaa gatgtcttgg 17101 aaaaaatgac cgtggagcct gggctggagc ccgaggtccg cgtgcggcca atcaagcagg 17161 tggcaccggg actgggcgtg cagaccgtgg acgttcagat acccaccacc agtagcacta 17221 gtattgccac tgccacagag ggcatggaga cacaaacgtc cccggttgcc tcggcggtgg 17281 cagatgccgc ggtgcaggcg gccgctgcgg ccgcgtccaa aacctctacg gaggtgcaaa 17341 cggacccgtg gatgtttcgc gtttcagccc cccggcgccc gcgccgttcc aggaagtacg 17401 gcaccgccag cgcactactg cccgaatatg ccctacatcc ttccatcgcg cctacccccg 17461 gctatcgtgg ctacacctac cgccccagaa gacgagcgac tacccgacgc cgaaccacca 17521 ctggaacccg ccgccgccgt cgccgtcgcc agcccgtgct ggccccgatt tccgtgcgca 17581 gggtggctcg cgaaggaggc aggaccctgg tgctgccaac agcgcgctac caccccagca 17641 tcgtttaaaa gccggtcttt gtggttcttg cagatatggc cctcacctgc cgcctccgtt 17701 tcccggtgcc gggattccga ggaagaatgc accgtaggag gggcatggcc ggccacggcc 17761 tgacgggcgg catgcgtcgt gcgcaccacc ggcggcggcg cgcgtcgcac cgtcgcatgc 17821 gcggcggtat cctgcccctc cttattccac tgatcgccgc ggcgattggc gccgtgcccg 17881 gaattgcatc cgtggccttg caggcgcaga gacactgatt aaaaacaagt tgcatgtgga 17941 aaaatcaaaa taaaaagtct ggagtctcac gctcgcttgg tcctgtaact attttgtaga 18001 atggaagaca tcaactttgc gtctctggcc ccgcgacacg gctcgcgccc gttcatggga 18061 aactggcaag atatcggcac cagcaatatg agcggtggcg ccttcagctg gggctcgctg 18121 tggagcggca ttaaaaattt cggttccacc attaagaact atggcagcaa ggcctggaac 18181 agcagcacag gccagatgct gagggacaag ttgaaagagc aaaatttcca acaaaaggtg 18241 gtagatggcc tggcctctgg cattagcggg gtggtggacc tggccaacca ggcagtgcaa 18301 aataagatta acagtaagct tgatccccgc cctcccgtag aggagcctcc accggccgtg 18361 gagacagtgt ctccagaggg gcgtggcgaa aagcgtccgc ggcccgacag ggaagaaact 18421 ctggtgacgc aaatagatga gcctccctcg tacgaggagg cactaaagca aggcctgccc 18481 accacccgtc ccatcgcgcc catggctacc ggagtgctgg gccagcacac acctgtaacg 18541 ctggacctgc ctccccccgc tgacacccag cagaaacctg tgctgccagg gccgtccgcc 18601 gttgttgtaa cccgccctag ccgcgcgtcc ctgcgccgtg ccgccagcgg tccgcgatcg 18661 atgcggcccg tagccagtgg caactggcaa agcacactga acagcatcgt gggtctgggg 18721 gtgcaatccc tgaagcgccg acgatgcttc taaatagcta acgtgtcgta tgtgtcatgt 18781 atgcgtccat gtcgccgcca gaggagctgc tgagccgccg tgcgcccgct ttccaagatg 18841 gctacccctt cgatgatgcc gcagtggtct tacatgcaca tctcgggcca ggacgcctcg 18901 gagtacctga gccccgggct ggtgcagttt gcccgcgcca ccgagacgta cttcagcctg 18961 aataacaagt ttagaaaccc cacggtggca cctacgcacg acgtaaccac agaccggtcc 19021 cagcgtttga cgctgcggtt catccctgtg gaccgcgagg ataccgcgta ctcgtacaaa 19081 gcgcggttca ccctggctgt gggtgacaac cgtgtgcttg atatggcttc cacgtacttt 19141 gacatccgcg gcgtgctgga cagggggcct acttttaagc cctactccgg cactgcctac 19201 aacgctctag ctcccaaggg cgctcctaac tcctgtgagt gggaacaaac cgaagatagc 19261 ggccgggcag ttgccgagga tgaagaagag gaagatgaag atgaagaaga ggaagaagaa 19321 gagcaaaacg ctcgagatca ggctactaag aaaacacatg tctatgccca ggctcctttg 19381 tctggagaaa caattacaaa aagcgggcta caaataggat cagacaatgc agaaacacaa 19441 gctaaacctg tatacgcaga tccttcctat caaccagaac ctcaaattgg cgaatctcag 19501 tggaacgaag ctgatgctaa tgcggcagga gggagagtgc ttaaaaaaac aactcccatg 19561 aaaccatgct atggatctta tgccaggcct acaaatcctt ttggtggtca atccgttctg 19621 gttccggatg aaaaaggggt gcctcttcca aaggttgact tgcaattctt ctcaaatact 19681 acctctttga acgaccggca aggcaatgct actaaaccaa aagtggtttt gtacagtgaa 19741 gatgtaaata tggaaacccc agacacacat ctgtcttaca aacctggaaa aggtgatgaa 19801 aattctaaag ctatgttggg tcaacaatct atgccaaaca gacccaatta cattgctttc 19861 agggacaatt ttattggcct aatgtattat aacagcactg gcaacatggg tgttcttgct 19921 ggtcaggcat cgcagctaaa tgccgtggta gatttgcaag acagaaacac agagctgtcc 19981 tatcaactct tgcttgattc cataggtgat agaaccagat atttttctat gtggaatcag 20041 gctgtagaca gctatgatcc agatgttaga atcattgaaa accatggaac tgaggatgaa 20101 ttgccaaatt attgttttcc tcttgggggt attggggtaa ctgacaccta tcaagctatt 20161 aaggctaatg gcaatggctc aggcgataat ggagatacta catggacaaa agatgaaact 20221 tttgcaacac gtaatgaaat aggagtgggt aacaactttg ccatggaaat taacctaaat 20281 gccaacctat ggagaaattt cctttactcc aatattgcgc tgtacctgcc agacaagcta 20341 aaatacaacc ccaccaatgt ggaaatatct gacaacccca acacctacga ctacatgaac 20401 aagcgagtgg tggctcccgg gcttgtagac tgctacatta accttggggc gcgctggtct 20461 ctggactaca tggacaacgt taatcccttt aaccaccacc gcaatgcggg cctccgttat 20521 cgctccatgt tgttgggaaa cggccgctac gtgccctttc acattcaggt gccccaaaag 20581 ttttttgcca ttaaaaacct cctcctcctg ccaggctcat atacatatga atggaacttc 20641 aggaaggatg ttaacatggt tctgcagagc tctctgggaa acgatcttag agttgacggg 20701 gctagcatta agtttgacag catttgtctt tacgccacct tcttccccat ggcccacaac 20761 acggcctcca cgctggaagc catgctcaga aatgacacca acgaccagtc ctttaatgac 20821 tacctttccg ccgccaacat gctatacccc atacccgcca acgccaccaa cgtgcccatc 20881 tccatcccat cgcgcaactg ggcagcattt cgcggttggg ccttcacacg cttgaagaca 20941 aaggaaaccc cttccctggg atcaggctac gacccttact acacctactc tggctccata 21001 ccataccttg acggaacctt ctatcttaat cacaccttta agaaggtggc cattaccttt 21061 gactcttctg ttagctggcc gggcaacgac cgcctgctta ctcccaatga gtttgagatt 21121 aaacgctcag ttgacgggga gggctacaac gtagctcagt gcaacatgac caaggactgg 21181 ttcctggtgc agatgttggc caactacaat attggctacc agggcttcta cattccagaa 21241 agctacaagg accgcatgta ctcgttcttc agaaacttcc agcccatgag ccggcaagtg 21301 gttgacgata ctaaatacaa ggagtatcag caggttggaa ttcttcacca gcataacaac 21361 tcaggattcg taggctacct cgctcccacc atgcgcgagg gacaggctta ccccgccaac 21421 gtgccctacc cactaatagg caaaaccgcg gttgacagta ttacccagaa aaagtttctt 21481 tgcgatcgca ccctttggcg catcccattc tccagtaact ttatgtccat gggcgcactc 21541 acagacctgg gccaaaacct tctctacgcc aactccgccc acgcgctaga catgactttt 21601 gaggtggatc ccatggacga gcccaccctt ctttatgttt tgtttgaagt ctttgacgtg 21661 gtccgtgtgc accagccgca ccgcggcgtc atcgagaccg tgtacctgcg cacgcccttc 21721 tcggccggca acgccacaac ataaaagaag caagcaacat caacaacagc tgccgccatg 21781 ggctccagtg agcaggaact gaaagccatt gtcaaagatc ttggttgtgg gccatatttt 21841 ttgggcacct atgacaagcg ctttccaggc tttgtttctc cacacaagct cgcctgcgcc 21901 atagtcaata cggccggtcg cgagactggg ggcgtacact ggatggcctt tgcctggaac 21961 ccgcgctcaa aaacatgcta cctctttgag ccctttggct tttctgacca acgactcaag 22021 caggtttacc agtttgagta cgagtcactc ctgcgccgta gcgccattgc ttcttccccc 22081 gaccgctgta taacgctgga aaagtccacc caaagcgtgc aggggcccaa ctcggccgcc 22141 tgtggactat tctgctgcat gtttctccac gcctttgcca actggcccca aactcccatg 22201 gatcacaacc ccaccatgaa ccttattacc ggggtaccca actccatgct taacagtccc 22261 caggtacagc ccaccctgcg tcgcaaccag gaacagctct acagcttcct ggagcgccac 22321 tcgccctact tccgcagcca cagtgcgcag attaggagcg ccacttcttt ttgtcacttg 22381 aaaaacatgt aaaaataatg tactaggaga cactttcaat aaaggcaaat gtttttattt 22441 gtacactctc gggtgattat ttacccccca cccttgccgt ctgcgccgtt taaaaatcaa 22501 aggggttctg ccgcgcatcg ctatgcgcca ctggcaggga cacgttgcga tactggtgtt 22561 tagtgctcca cttaaactca ggcacaacca tccgcggcag ctcggtgaag ttttcactcc 22621 acaggctgcg caccatcacc aacgcgttta gcaggtcggg cgccgatatc ttgaagtcgc 22681 agttggggcc tccgccctgc gcgcgcgagt tgcgatacac agggttgcag cactggaaca 22741 ctatcagcgc cgggtggtgc acgctggcca gcacgctctt gtcggagatc agatccgcgt 22801 ccaggtcctc cgcgttgctc agggcgaacg gagtcaactt tggtagctgc cttcccaaaa 22861 agggtgcatg cccaggcttt gagttgcact cgcaccgtag tggcatcaga aggtgaccgt 22921 gcccggtctg ggcgttagga tacagcgcct gcatgaaagc cttgatctgc ttaaaagcca 22981 cctgagcctt tgcgccttca gagaagaaca tgccgcaaga cttgccggaa aactgattgg 23041 ccggacaggc cgcgtcatgc acgcagcacc ttgcgtcggt gttggagatc tgcaccacat 23101 ttcggcccca ccggttcttc acgatcttgg ccttgctaga ctgctccttc agcgcgcgct 23161 gcccgttttc gctcgtcaca tccatttcaa tcacgtgctc cttatttatc ataatgctcc 23221 cgtgtagaca cttaagctcg ccttcgatct cagcgcagcg gtgcagccac aacgcgcagc 23281 ccgtgggctc gtggtgcttg taggttacct ctgcaaacga ctgcaggtac gcctgcagga 23341 atcgccccat catcgtcaca aaggtcttgt tgctggtgaa ggtcagctgc aacccgcggt 23401 gctcctcgtt tagccaggtc ttgcatacgg ccgccagagc ttccacttgg tcaggcagta 23461 gcttgaagtt tgcctttaga tcgttatcca cgtggtactt gtccatcaac gcgcgcgcag 23521 cctccatgcc cttctcccac gcagacacga tcggcaggct cagcgggttt atcaccgtgc 23581 tttcactttc cgcttcactg gactcttcct tttcctcttg cgtccgcata ccccgcgcca 23641 ctgggtcgtc ttcattcagc cgccgcaccg tgcgcttacc tcccttgccg tgcttgatta 23701 gcaccggtgg gttgctgaaa cccaccattt gtagcgccac atcttctctt tcttcctcgc 23761 tgtccacgat cacctctggg gatggcgggc gctcgggctt gggagagggg cgcttctttt 23821 tctttttgga cgcaatggcc aaatccgccg tcgaggtcga tggccgcggg ctgggtgtgc 23881 gcggcaccag cgcatcttgt gacgagtctt cttcgtcctc ggactcgaga cgccgcctca 23941 gccgcttttt tgggggcgcg cggggaggcg gcggcgacgg cgacggggac gacacgtcct 24001 ccatggttgg tggacgtcgc gccgcaccgc gtccgcgctc gggggtggtt tcgcgctgct 24061 cctcttcccg actggccatt tccttctcct ataggcagaa aaagatcatg gagtcagtcg 24121 agaaggagga cagcctaacc gccccctttg agttcgccac caccgcctcc accgatgccg 24181 ccaacgcgcc taccaccttc cccgtcgagg cacccccgct tgaggaggag gaagtgatta 24241 tcgagcagga cccaggtttt gtaagcgaag acgacgagga tcgctcagta ccaacagagg 24301 ataaaaagca agaccaggac gacgcagagg caaacgagga acaagtcggg cggggggacc 24361 aaaggcatgg cgactaccta gatgtgggag acgacgtgct gttgaagcat ctgcagcgcc 24421 agtgcgccat tatctgcgac gcgttgcaag agcgcagcga tgtgcccctc gccatagcgg 24481 atgtcagcct tgcctacgaa cgccacctgt tctcaccgcg cgtacccccc aaacgccaag 24541 aaaacggcac atgcgagccc aacccgcgcc tcaacttcta ccccgtattt gccgtgccag 24601 aggtgcttgc cacctatcac atctttttcc aaaactgcaa gataccccta tcctgccgtg 24661 ccaaccgcag ccgagcggac aagcagctgg ccttgcggca gggcgctgtc atacctgata 24721 tcgcctcgct cgacgaagtg ccaaaaatct ttgagggtct tggacgcgac gagaaacgcg 24781 cggcaaacgc tctgcaacaa gaaaacagcg aaaatgaaag tcactgtgga gtgctggtgg 24841 aacttgaggg tgacaacgcg cgcctagccg tgctgaaacg cagcatcgag gtcacccact 24901 ttgcctaccc ggcacttaac ctacccccca aggttatgag cacagtcatg agcgagctga 24961 tcgtgcgccg tgcacgaccc ctggagaggg atgcaaactt gcaagaacaa accgaggagg 25021 gcctacccgc agttggcgat gagcagctgg cgcgctggct tgagacgcgc gagcctgccg 25081 acttggagga gcgacgcaag ctaatgatgg ccgcagtgct tgttaccgtg gagcttgagt 25141 gcatgcagcg gttctttgct gacccggaga tgcagcgcaa gctagaggaa acgttgcact 25201 acacctttcg ccagggctac gtgcgccagg cctgcaaaat ttccaacgtg gagctctgca 25261 acctggtctc ctaccttgga attttgcacg aaaaccgcct cgggcaaaac gtgcttcatt 25321 ccacgctcaa gggcgaggcg cgccgcgact acgtccgcga ctgcgtttac ttatttctgt 25381 gctacacctg gcaaacggcc atgggcgtgt ggcagcaatg cctggaggag cgcaacctaa 25441 aggagctgca gaagctgcta aagcaaaact tgaaggacct atggacggcc ttcaacgagc 25501 gctccgtggc cgcgcacctg gcggacatta tcttccccga acgcctgctt aaaaccctgc 25561 aacagggtct gccagacttc accagtcaaa gcatgttgca aaactttagg aactttatcc 25621 tagagcgttc aggaattctg cccgccacct gctgtgcgct tcctagcgac tttgtgccca 25681 ttaagtaccg tgaatgccct ccgccgcttt ggggtcactg ctaccttctg cagctagcca 25741 actaccttgc ctaccactcc gacatcatgg aagacgtgag cggtgacggc ctactggagt 25801 gtcactgtcg ctgcaaccta tgcaccccgc accgctccct ggtctgcaat tcgcaactgc 25861 ttagcgaaag tcaaattatc ggtacctttg agctgcaggg tccctcgcct gacgaaaagt 25921 ccgcggctcc ggggttgaaa ctcactccgg ggctgtggac gtcggcttac cttcgcaaat 25981 ttgtacctga ggactaccac gcccacgaga ttaggttcta cgaagaccaa tcccgcccgc 26041 caaatgcgga gcttaccgcc tgcgtcatta cccagggcca catccttggc caattgcaag 26101 ccatcaacaa agcccgccaa gagtttctgc tacgaaaggg acggggggtt tacctggacc 26161 cccagtccgg cgaggagctc aacccaatcc ccccgccgcc gcagccctat cagcagccgc 26221 gggcccttgc ttcccaggat ggcacccaaa aagaagctgc agctgccgcc gccgccaccc 26281 acggacgagg aggaatactg ggacagtcag gcagaggagg ttttggacga ggaggaggag 26341 atgatggaag actgggacag cctagacgaa gcttccgagg ccgaagaggt gtcagacgaa 26401 acaccgtcac cctcggtcgc attcccctcg ccggcgcccc agaaattggc aaccgttccc 26461 agcatcgcta caacctccgc tcctcaggcg ccgccggcac tgcctgttcg ccgacccaac 26521 cgtagatggg acaccactgg aaccagggcc ggtaagtcta agcagccgcc gccgttagcc 26581 caagagcaac aacagcgcca aggctaccgc tcgtggcgcg ggcacaagaa cgccatagtt 26641 gcttgcttgc aagactgtgg gggcaacatc tccttcgccc gccgctttct tctctaccat 26701 cacggcgtgg ccttcccccg taacatcctg cattactacc gtcatctcta cagcccctac 26761 tgcaccggcg gcagcggcag cggcagcaac agcagcggtc acacagaagc aaaggcgacc 26821 ggatagcaag actctgacaa agcccaagaa atccacagcg gcggcagcag caggaggagg 26881 agcgctgcgt ctggcgccca acgaacccgt atcgacccgc gagcttagaa ataggatttt 26941 tcccactctg tatgctatat ttcaacaaag caggggccaa gaacaagagc tgaaaataaa 27001 aaacaggtct ctgcgctccc tcacccgcag ctgcctgtat cacaaaagcg aagatcagct 27061 tcggcgcacg ctggaagacg cggaggctct cttcagcaaa tactgcgcgc tgactcttaa 27121 ggactagttt cgcgcccttt ctcaaattta agcgcgaaaa ctacgtcatc tccagcggcc 27181 acacccggcg ccagcacctg tcgtcagcgc cattatgagc aaggaaattc ccacgcccta 27241 catgtggagt taccagccac aaatgggact tgcggctgga gctgcccaag actactcaac 27301 ccgaataaac tacatgagcg cgggacccca catgatatcc cgggtcaacg gaatccgcgc 27361 ccaccgaaac cgaattctcc tcgaacaggc ggctattacc accacacctc gtaataacct 27421 taatccccgt agttggcccg ctgccctggt gtaccaggaa agtcccgctc ccaccactgt 27481 ggtacttccc agagacgccc aggccgaagt tcagatgact aactcagggg cgcagcttgc 27541 gggcggcttt cgtcacaggg tgcggtcgcc cgggcagggt ataactcacc tgaaaatcag 27601 agggcgaggt attcagctca acgacgagtc ggtgagctcc tctcttggtc tccgtccgga 27661 cgggacattt cagatcggcg gcgctggccg ctcttcattt acgccccgtc aggcgatcct 27721 aactctgcag acctcgtcct cggagccgcg ctccggaggc attggaactc tacaatttat 27781 tgaggagttc gtgccttcgg tttacttcaa ccccttttct ggacctcccg gccactaccc 27841 ggaccagttt attcccaact ttgacgcggt gaaagactcg gcggacggct acgactgaat 27901 gaccagtgga gaggcagagc gactgcgcct gacacacctc gaccactgcc gccgccacaa 27961 gtgctttgcc cgcggctccg gtgagttttg ttactttgaa ttgcccgaag agcatatcga 28021 gggcccggcg cacggcgtcc ggctcaccac ccaggtagag cttacacgta gcctgattcg 28081 ggagtttacc aagcgccccc tgctagtgga gcgggagcgg ggtccctgtg ttctgaccgt 28141 ggtttgcaac tgtcctaacc ctggattaca tcaagatctt tgttgtcatc tctgtgctga 28201 gtataataaa tacagaaatt agaatctact ggggctcctg tcgccatcct gtgaacgcca 28261 ccgtttttac ccacccaaag cagaccaaag caaacctcac ctccggtttg cacaagcggg 28321 ccaataagta ccttacctgg tactttaacg gctcttcatt tgtaatttac aacagtttcc 28381 agcgagacga agtaagtttg ccacacaacc ttctcggctt caactacacc gtcaagaaaa 28441 acaccaccac caccaccctc ctcacctgcc gggaacgtac gagtgcgtca ccggttgctg 28501 cgcccacacc tacagcctga gcgtaaccag acattactcc catttttcca aaacaggagg 28561 tgagctcaac tcccggaact caggtcaaaa aagcattttg cggggtgctg ggatttttta 28621 attaagtata tgagcaattc aagtaactct acaagcttgt ctaatttttc tggaattggg 28681 gtcggggtta tccttactct tgtaattctg tttattctta tactagcact tctgtgcctt 28741 agggttgccg cctgctgcac gcacgtttgt acctattgtc agctttttaa acgctggggg 28801 caacatccaa gatgaggtac atgattttag gcttgctcgc ccttgcggca gtctgcagcg 28861 ctgccaaaaa ggttgagttt aaggaaccag cttgcaatgt tacatttaaa tcagaagcta 28921 atgaatgcac tactcttata aaatgcacca cagaacatga aaagcttatt attcgccaca 28981 aagacaaaat tggcaagtat gctgtatatg ctatttggca gccaggtgac actaacgact 29041 ataatgtcac agtcttccaa ggtgaaaatc gtaaaacttt tatgtataaa tttccatttt 29101 atgaaatgtg cgatattacc atgtacatga gcaaacagta caagttgtgg cccccacaaa 29161 agtgtttaga gaacactggc accttttgtt ccaccgctct gcttattaca gcgcttgctt 29221 tggtatgtac cttactttat ctcaaataca aaagcagacg cagttttatt gatgaaaaga 29281 aaatgccttg attttccgct tgcttgtatt cccctggaca atttactcta tgtgggatat 29341 gctccaggcg ggcaagatta tacccacaac cttcaaatca aactttcctg gacgttagcg 29401 cctgatttct gccagcgcct gcactgcaaa tttgatcaaa cccagcttca gcttgcctgc 29461 tccagagatg accggctcaa ccatcgcgcc cacaacggac tatcgcaaca ccactgctac 29521 cggactaaca tctgccctaa atttacccca agttcatgcc tttgtcaatg actgggcgag 29581 cttggacatg tggtggtttt ccatagcgct tatgtttgtt tgccttatta ttatgtggct 29641 tatttgttgc ctaaagcgca gacgcgccag accccccatc tataggccta tcattgtgct 29701 caacccacac aatgaaaaaa ttcatagatt ggacggtctg aaaccatgtt ctcttctttt 29761 acagtatgat taaatgagac atgattcctc gagttcttat attattgacc cttgttgcgc 29821 ttttctgtgc gtgctctaca ttggccgcgg tcgctcacat cgaagtagat tgcatcccac 29881 ctttcacagt ttacctgctt tacggatttg tcacccttat cctcatctgc agcctcgtca 29941 ctgtagtcat cgccttcatt cagttcattg actgggtttg tgtgcgcatt gcgtacctca 30001 ggcaccatcc gcaatacaga gacaggacta tagctgatct tctcagaatt ctttaattat 30061 gaaacggagt gtcatttttg ttttgctgat tttttgcgcc ctacctgtgc tttgctccca 30121 aacctcagcg cctcccaaaa gacatatttc ctgcagattc actcaaatat ggaacattcc 30181 cagctgctac aacaaacaga gcgatttgtc agaagcctgg ttatacgcca tcatctctgt 30241 catggttttt tgcagtacca tttttgccct agccatatat ccataccttg acattggctg 30301 gaatgccata gatgccatga accaccctac tttcccagtg cccgctgtca taccactgca 30361 acaggttatt gccccaatca atcagcctcg ccccccttct cccaccccca ctgagattag 30421 ctactttaat ttgacaggtg gagatgactg aatctctaga tctagaattg gatggaatta 30481 acaccgaaca gcgcctacta gaaaggcgca aggcggcgtc cgagcgagaa cgcctaaaac 30541 aagaagttga agacatggtt aacctacacc agtgtaaaag aggtatcttt tgtgtggtca 30601 agcaggccaa acttacctac gaaaaaacca ctaccggcaa ccgcctcagc tacaagctac 30661 ccacccagcg ccaaaaactg gtgcttatgg tgggagaaaa acctatcacc gtcacccagc 30721 actcggcaga aacagagggc tgcctgcact tcccctatca gggtccagag gacctctgca 30781 ctcttattaa aaccatgtgt ggtattagag atcttattcc attcaactaa cataaacaca 30841 caataaatta cttacttaaa atcagtcagc aaatctttgt ccagcttatt cagcatcacc 30901 tcctttcctt cctcccaact ctggtatctc agccgccttt tagctgcaaa ctttctccaa 30961 agtttaaatg ggatgtcaaa ttcctcatgt tcttgtccct ccgcacccac tatcttcata 31021 ttgttgcaga tgaaacgcgc cagaccgtct gaagacacct tcaaccccgt gtatccatat 31081 gacacagaaa ccgggcctcc aactgtgccc tttcttaccc ctccatttgt ttcacccaat 31141 ggtttccaag aaagtccccc tggagttctc tctctacgcg tctccgaacc tttggacacc 31201 tcccacggca tgcttgcgct taaaatgggc agcggtctta ccctagacaa ggccggaaac 31261 ctcacctccc aaaatgtaac cactgttact cagccactta aaaaaacaaa gtcaaacata 31321 agtttggaca cctccgcacc acttacaatt acctcaggcg ccctaacagt ggcaaccacc 31381 gctcctctga tagttactag cggcgctctt agcgtacagt cacaagcccc actgaccgtg 31441 caagactcca aactaagcat tgctactaaa gggcccatta cagtgtcaga tggaaagcta 31501 gccctgcaaa catcagcccc cctctctggc agtgacagcg acacccttac tgtaactgca 31561 tcacccccgc taactactgc cacgggtagc ttgggcatta acatggaaga tcctatttat 31621 gtaaataatg gaaaaatagg aattaaaata agcggtcctt tgcaagtagc acaaaactcc 31681 gatacactaa cagtagttac tggaccaggt gtcaccgttg aacaaaactc ccttagaacc 31741 aaagttgcag gagctattgg ttatgattca tcaaacaaca tggaaattaa aacgggcggt 31801 ggcatgcgta taaataacaa cttgttaatt ctagatgtgg attacccatt tgatgctcaa 31861 acaaaactac gtcttaaact ggggcaggga cccctgtata ttaatgcatc tcataacttg 31921 gacataaact ataacagagg cctatacctt tttaatgcat caaacaatac taaaaaactg 31981 gaagttagca taaaaaaatc cagtggacta aactttgata atactgccat agctataaat 32041 gcaggaaagg gtctggagtt tgatacaaac acatctgagt ctccagatat caacccaata 32101 aaaactaaaa ttggctctgg cattgattac aatgaaaacg gtgccatgat tactaaactt 32161 ggagcgggtt taagctttga caactcaggg gccattacaa taggaaacaa aaatgatgac 32221 aaacttaccc tgtggacaac cccagaccca tctcctaact gcagaattca ttcagataat 32281 gactgcaaat ttactttggt tcttacaaaa tgtgggagtc aagtactagc tactgtagct 32341 gctttggctg tatctggaga tctttcatcc atgacaggca ccgttgcaag tgttagtata 32401 ttccttagat ttgaccaaaa cggtgttcta atggagaact cctcacttaa aaaacattac 32461 tggaacttta gaaatgggaa ctcaactaat gcaaatccat acacaaatgc agttggattt 32521 atgcctaacc ttctagccta tccaaaaacc caaagtcaaa ctgctaaaaa taacattgtc 32581 agtcaagttt acttgcatgg tgataaaact aaacctatga tacttaccat tacacttaat 32641 ggcactagtg aatccacaga aactagcgag gtaagcactt actctatgtc ttttacatgg 32701 tcctgggaaa gtggaaaata caccactgaa acttttgcta ccaactctta caccttctcc 32761 tacattgccc aggaataaag aatcgtgaac ctgttgcatg ttatgtttca acgtgtttat 32821 ttttcaattg cagaaaattt caagtcattt ttcattcagt agtatagccc caccaccaca 32881 tagcttatat tgatcaccgt accttaatca aactcacaga accctagtat tcaacctgcc 32941 acctccctcc caacacacag agtacacagt cctttctccc cggctggcct taaaaagcat 33001 catatcatgg gtaacagaca tattcttagg tgttatattc cacacggttt cctgtcgagc 33061 caaacgctca tcagtgatat taataaactc cccgggcagc tcgcttaagt tcatgtcgct 33121 gtccagctgc tgagccacag gctgctgtcc aacttgcggt tgctcaacgg gcggcgaagg 33181 ggaagtccac gcctacatgg gggtagagtc ataatcgtgc atcaggatag ggcggtggtg 33241 ctgcagcagc gcgcgaataa actgctgccg ccgccgctcc gtcctgcagg aatacaacat 33301 ggcagtggtc tcctcagcga tgattcgcac cgcccgcagc atgagacgcc ttgtcctccg 33361 ggcacagcag cgcaccctga tctcacttaa atcagcacag taactgcagc acagcaccac 33421 aatattgttc aaaatcccac agtgcaaggc gctgtatcca aagctcatgg cggggaccac 33481 agaacccacg tggccatcat accacaagcg caggtagatt aagtggcgac ccctcataaa 33541 cacgctggac ataaacatta cctcttttgg catgttgtaa ttcaccacct cccggtacca 33601 tataaacctc tgattaaaca tggcgccatc caccaccatc ctaaaccagc tggccaaaac 33661 ctgcccgccg gctatgcact gcagggaacc gggactggaa caatgacagt ggagagccca 33721 ggactcgtaa ccatggatca tcatgctcgt catgatatca atgttggcac aacacaggca 33781 cacgtgcata cacttcctca ggattacaag ctcctcccgc gtcagaacca tatcccaggg 33841 aacaacccat tcctgaatca gcgtaaatcc cacactgcag ggaagacctc gcacgtaact 33901 cacgttgtgc attgtcaaag tgttacattc gggcagcagc ggatgatcct ccagtatggt 33961 agcgcgggtc tctgtctcaa aaggaggtag gcgatcccta ctgtacggag tgcgccgaga 34021 caaccgagat cgtgttggtc gtagtgtcat gccaaatgga acgccggacg tagtcatatt 34081 tcctgaagca aaaccaggtg cgggcgtgac aaacagatct gcgtctccgg tctcgtcgct 34141 tagctcgctc tgtgtagtag ttgtagtata tccactctct caaagcatcc aggcgccccc 34201 tggcttcggg ttctatgtaa actccttcat gcgccgctgc cctgataaca tccaccaccg 34261 cagaataagc cacacccagc caacctacac attcgttctg cgagtcacac acgggaggag 34321 cgggaagagc tggaagaacc atgttttttt tttttttatt ccaaaagatt atccaaaacc 34381 tcaaaatgaa gatctattaa gtgaacgcgc tcccctccgg tggcgtggtc aaactctaca 34441 gccaaagaac agataatggc atttgtaaga tgttgcacaa tggcttccaa aaggcaaact 34501 gccctcacgt ccaagtggac gtaaaggcta aacccttcag ggtgaatctc ctctataaac 34561 attccagcac cttcaaccat gcccaaataa ttttcatctc gccaccttat caatatgtct 34621 ctaagcaaat cccgaatatt aagtccggcc attgtaaaaa tctgctccag agcgccctcc 34681 accttcagcc tcaagcagcg aatcatgatt gcaaaaattc aggttcctca cagacctgta 34741 taagattcaa aagcggaaca ttaacaaaaa taccgcgatc ccgtaggtcc cttcgcaggg 34801 ccagctgaac ataatcgtgc aggtctgcac ggaccagcgc ggccacttcc ccgccaggaa 34861 ccatgacaaa agaacccaca ctgattatga cacgcatact cggagctatg ctaaccagcg 34921 tagcccctat gtaagcttgt tgcatgggcg gcgatataaa atgcaaggtg ctgctcaaaa 34981 aatcaggcaa agcctcgcgc aaaaaagcaa gcacatcgta gtcatgctca tgcagataaa 35041 ggcaggtaag ttccggaacc accacagaaa aagacaccat ttttctctca aacatgtctg 35101 cgggttcctg cattaaacac aaaataaaat aacaaaaaaa aacatttaaa cattagaagc 35161 ctgtcttaca acaggaaaaa caacccttat aagcataaga cggactacgg ccatgccggc 35221 gtgaccgtaa aaaaactggt caccgtgatt aaaaagcacc accgacagtt cctcggtcat 35281 gtccggagtc ataatgtaag actcggtaaa cacatcaggt tggttaacat cggtcagtgc 35341 taaaaagcga ccgaaatagc ccgggggaat acatacccgc aggcgtagag acaacattac 35401 agcccccata ggaggtataa caaaattaat aggagagaaa aacacataaa cacctgaaaa 35461 accctcctgc ctaggcaaaa tagcaccctc ccgctccaga acaacataca gcgcttccac 35521 agcggcagcc ataacagtca gccttaccag taaaaaaacc tattaaaaaa caccactcga 35581 cacggcacca gctcaatcag tcacagtgta aaaagggcca agtacagagc gagtatatat 35641 aggactaaaa aatgacgtaa cggttaaagt ccacaaaaaa cacccagaaa accgcacgcg 35701 aacctacgcc cagaaacgaa agccaaaaaa cccacaactt cctcaaatct tcacttccgt 35761 tttcccacga tacgtcactt cccattttaa aaaaactaca attcccaata catgcaagtt 35821 actccgccct aaaacctacg tcacccgccc cgttcccacg ccccgcgcca cgtcacaaac 35881 tccaccccct cattatcata ttggcttcaa tccaaaataa ggtatattat gatgatg //