Supplementary Data

  1. Lung Adenocarcinoma Probe Dataset (LAPD) Files
  2. AffyMAPSDetector input and output files
  3. Supporting data for Figures and Tables
  4. Data analysis confirming behavior of SNP-containing probes (intensity distribution figures/supporting-data)
  5. Data analysis confirming behavior of probes without SNPs (intensity distribution figures/supporting-data)
  6. Examples: SNPs affecting binding efficiencies of PM and MM probes
  7. Additional Data (from human, mouse, and rat gene chips)
  8. Additional Data Files (also uploaded at BMC Bioinformatics)

1. Lung Adenocarcinoma Probe Dataset (LAPD) Files

Go to top

  1. LAPD - Raw PM/MM data for probes without SNPs (>2GB).
  2. LAPD - Raw PM/MM data for probes with SNP at the 13th position (~3MB).
  3. LAPD - Raw PM/MM data for probes with SNP NOT at the 13th position (~80MB).

2. AffyMAPSDetector input and output files

Go to top

AffyMAPSDetector requires two ASCII text files as input data sources: "NetAffx Annotation File" and "Sequence File". Both of these files are available for download from the Affymetrix support page under "NetAffx Annotation File" and "Sequence Files" respectively. However, Affymetrix requires registration before you can download the annotation files. Here we refer to the "NetAffx Annotation File" as the gene information file (GIF) and the "Sequence File" as the probe-set information file (PIF). GIF and PIF, from HG-U95Av2 GeneChipTM that were used to characterize SNPs are provided below for easy access, however, Affymetrix NetAffxTM analysis center is recommended for obtaining an upto date version of these files.

  1. Tab delimited gene information file in ASCII text.
  2. Tab delimited probe information file in ASCII text. (download PIF in FASTA format.)
The output files generated by AffyMAPSDetector (based on dbSNP build 123) are:
  1. HG-U95Av2_Probes_With_SNPs.xls: The HG-U95Av2 probes that were found to have documented SNPS.
  2. HG-U95Av2_Genes_Without_Locus_Link.xls: List of those genes for which either LocusLink information was not provided in the gene-information file or AffyMAPSDetector could not parse LocusLink as a positive integer.
  3. HG-U95Av2_Probes_Without_Snps.xls: List of genes and probe-sets for which no documented SNPs were found.
  4. HG-U95Av2_Genes_Info_From_Web.xls: This file contains the gene description and the mRNA sequences of genes that were collected by AffyMAPSDetector from the NCBI nucleotide database.
  5. HG-U95Av2_Snps_Info_From_Web.xls: This file contains additional SNP information including: "Nucleotide Accession Number of Gene", "SNP position with respect to mRNA sequence", "Genomic Axis Orientation", "dbSNP Reference Cluster ID - rs#", "Protein Accession Number", "Function", "SNP Class", "Heterozygosity", and "Allele".
  6. Log File: This file contains the output log messages from AffyMAPSDetector run.
Note that all the above files, except the log file, are tab delimited ASCII text files. As a convenience, you can also download all the output files zipped together.

The output files generated by AffyMAPSDetector (based on dbSNP build 126) are:

  1. HG-U95Av2_Probes_With_SNPs.xls: The HG-U95Av2 probes that were found to have documented SNPS.
  2. HG-U95Av2_Genes_Without_Locus_Link.xls: List of those genes for which either LocusLink information was not provided in the gene-information file or AffyMAPSDetector could not parse LocusLink as a positive integer.
  3. HG-U95Av2_Probes_Without_Snps.xls: List of genes and probe-sets for which no documented SNPs were found.
  4. HG-U95Av2_Genes_Info_From_Web.xls: This file contains the gene description and the mRNA sequences of genes that were collected by AffyMAPSDetector from the NCBI nucleotide database.
  5. HG-U95Av2_Snps_Info_From_Web.xls: This file contains additional SNP information including: "Nucleotide Accession Number of Gene", "SNP position with respect to mRNA sequence", "Genomic Axis Orientation", "dbSNP Reference Cluster ID - rs#", "Protein Accession Number", "Function", "SNP Class", "Heterozygosity", and "Allele".
  6. Log File: This file contains the output log messages from AffyMAPSDetector run.
Note that all the above files, except the log file, are tab delimited ASCII text files. As a convenience, you can also download all the output files zipped together.

Comparison of the output results between dbSNP builds 123 and 126 from AffyMAPSDetector run on HG-U95Av2:
The GeneChipTM HG-U95Av2 contains 199,084 probes belonging to 12,625 probe-sets (or 11,302 unique genes).
  dbSNP Build 123 dbSNP Build 126
Number of probes that contain documented SNPs 7,286 probes from 2,582 probe-sets (or 2,479 unique genes) 8,758 probes from 3,002 probe-sets (or 2,858 unique genes)
Number of SNP containing probes involving 13th position 325 probes 409 probes
Number of SNP containing probes NOT involving 13thposition 6,961 probes 8,349 probes
Number of SNP containing probes involving 13th position only 251 probes 332 probes
Number of SNP containing probes involving 13th position and atleast one more position 74 probes 77 probes
Number of probe-sets (or unique genes) without documented SNPs 8,474 probe-sets (or 7,662 unique genes) 9,450 probe-sets (or 8,533 unique genes)
Number of probes NOT mapped into their respective reference mRNA sequence 15,269 probes belonging to 2,304 probe-sets (or 2,249 unique genes) 16,753 probes belonging to 2,168 probe-sets (or 2,017 unique genes)


Note: At the time of submission of the draft for this publication, only dbSNP-build-123 was available, therefore, analysis of the data presented below is based on dbSNP-build-123.

3. Supporting data for Figures and Tables

Go to top

4. Data analysis confirming behavior of SNP-containing probes (intensity distribution figures/supporting-data)

Go to top

5. Data analysis confirming behavior of probes without SNPs (intensity distribution figures/supporting-data)

Go to top

6. Examples: SNPs affecting binding efficiencies of PM and MM probes

Go to top

Click here to see a number of examples where presence of one or more SNPs in a probe affects the binding efficiencies of the related Perfect Match (PM) and Mis-Match (MM) probes.

7. Additional Data (from human, mouse, and rat gene chips)

Go to top

In addition to HG-U95Av2, below are the AffyMAPSDetector results (using dbSNP-build-123) for other expression chips from human, mouse and rat genomes.

Human Expression Array GeneChip™ HG-U133
Description Download File Size
HG-U133A: AffyMAPSDetector Results HG-U133A_AffyMapsDetectorResults.zip 30 MB
HG-U133A gene information file (trimmed version) HG-U133A_gene_info (Tab delimited) 578 KB
HG-U133A probe sequence information file HG-U133A_probe_tab or HG-U133A_probe_fasta 14 MB / 23.8 MB
HG-U133B gene annotation/information file HG-U133B_gene_info 18.2 MB
HG-U133B probe sequence information file HG-U133B_probe_tab or HG-U133B_probe_fasta 14 MB / 23.8 MB

 

Mouse Expression Array GeneChip™ MG-430A2
Description Download File Size
MG-430A2: AffyMAPSDetector Results MG-430A2_AffyMapsDetectorResults.zip 6.5 MB
MG-430A2 gene information file (trimmed version) Mouse430A2_gene_info (Tab delimited) 982 KB
MG-430A2 probe sequence information file Mouse430A2_probe_tab 14.2 MB

 

Rat Expression Array GeneChip™ Rat-230
Description Download File Size
RAE-230A: AffyMAPSDetector Results Rat-230A_AffyMapsDetectorResults.zip 3.2 MB
RAE-230B: AffyMAPSDetector Results Rat-230B_AffyMapsDetectorResults.zip 744 KB
RAE-230A gene information file (trimmed version) RAE-230A_gene_info (Tab delimited) 493 KB
RAE-230A probe sequence information file RAE-230A_probe_tab 9.87 MB
RAE-230B gene information file (trimmed version) RAE-230B_gene_info 450 KB
RAE-230B probe sequence information file RAE-230B_probe_tab 9.44 MB

8. Additional Data Files (also uploaded at BMC Bioinformatics)

  1. file 1 - Complete SNP output file.
  2. file 2 - Probes having SNP at mismatch location.
  3. file 3 - Probe-sets without SNPs.
  4. file 4 - Genes with undefined LocusLink.
  5. file 5 - HG-U95Av2 genes mRNA sequence.
  6. file 6 - Additional SNP information for Probes having SNPs.
  7. file 7 - AffyMAPSDetector execution log.
  8. file 8 - Behavior of SNP-containing probes with respect to PM and MM binding efficiencies.
  9. file 9 - Behavior of SNP-containing probes with respect to PM and MM binding efficiencies.
  10. file 10 - Example of probes affecting probe-set detection calls.
  11. file 11 - SNP-containing probes’ PM/MM ratio data file for expression genotype.
  12. file 12 - AffyMAPSDetector v1 distribution package (compiled code).
  13. file 13 - AffyMAPSDetector v1 source code.
  14. file 14 - Intensity distribution profiles' confirmation.
Go to top