Best Practice & Research Clinical Haematology
Volume 22, Issue 2 , Pages 271-282 , June 2009

Analysis of DNA microarray expression data

  • Richard Simon, D.Sc. (Chief, Biometric Research Branch)

      Affiliations

    • Corresponding Author InformationTel.: +1 301 496 0975; Fax: +1 301 402 0560.

References 

  1. Dupuy A, Simon R. Critical review of published microarray studies for cancer outcome and guidelines on statistical analysis and reporting. J Natl Cancer Inst. 2007;99:147–157
  2. Simon R, Lam A, Li MC, et al. Analysis of gene expression data using BRB-ArrayTools. Cancer Inform. 2007;2:11–17
  3. Irizarry RA, Wu Z, Jaffee HA. Comparison of Affymetrix GeneChip expression measures. Bioinformatics. 2006;22(7):789–794
  4. Bolstad BM, Irizarry RA, Astrand M, et al. A comparison of normalization methods for high density oligonucleotide array data based on variance and bias. Bioinformatics. 2003;19:185–193
  5. Park T, Yi SG, Kang SH, et al. Evaluation of normalization methods for microarray data. BMC Bioinformatics. 2003;4:33
  6. Simon R, Radmacher MD, Dobbin K, et al. Pitfalls in the use of DNA microarray data for diagnostic and prognostic classification. J Natl Cancer Inst. 2003;95:14–18
  7. Simon R, Korn EL, McShane LM, et al. Design and analysis of DNA microarray investigations. New York: Springer Verlag; 2003;
  8. Korn EL, McShane LM, Troendle JF, et al. Identifying pre-post chemotherapy differences in gene expression in breast tumors: a statistical method appropriate for this aim. Br J Cancer. 2002;86:1093–1096
  9. Yang SX, Simon RM, Tan AR, et al. Gene expression patterns and profile changes pre- and post-erlotinib treatment in patients with metastatic breast cancer. Clin Cancer Res. 2005;11:6226–6232
  10. Sotiriou C, Neo SY, McShane LM, et al. Breast cancer classification and prognosis based on gene expression profiles from a population based study. Proc Natl Acad Sci U S A. 2003;100(18):10393–10398
  11. Desai KV, Xiao N, Wang W, et al. Initiating oncogenic event determines gene-expression patterns of human breast cancer models. Proc Natl Acad Sci USA. 2002;99:6967–6972
  12. Simon R. Development and validation of therapeutically relevant multi-gene biomarker classifiers. J Natl Cancer Inst. 2005;97:866–867
  13. van'tVeer LJ, Dai H, Vijver MJvd, et al. Gene expression profiling predicts clinical outcome of breast cancer. Nature. 2002;415:530–536
  14. van-de-Vijver MJ, He YD, Veer LJvt, et al. A gene expression signature as a predictor of survival in breast cancer. N Engl J Med. 2002;347(25):1999–2009
  15. Ma XJ, Wang Z, Ryan PD, et al. A two-gene expression ratio predicts clinical outcome in breast cancer patients treated with tamoxifen. Cancer Cell. 2004;5:1–10
  16. Ayers M, Symmans WF, Stec J, et al. Gene expression profiles predict complete pathologic response to neoadjuvant paclitaxel and fluorouracil, doxorubicin and cyclophosphamide chemotherapy in breast cancer. J Clin Oncol. 2005;22(12):2284–2293
  17. Jansen MPHM, Foekens JA, Staveren ILv, et al. Molecular classification of tamoxifen-resistant breast carcinomas by gene expression profiling. J Clin Oncol. 2005;23(4):732–740
  18. Perou CM, Serlie T, Eisen MB, et al. Molecular portraits of human breast tumors. Nature. 2000;406:747–752
  19. McShane LM, Radmacher MD, Freidlin B, et al. Methods of assessing reproducibility of clustering patterns observed in analyses of microarray data. Bioinformatics. 2002;18:1462–1469
  20. Klebanov L, Yakovlev A. Is there an alternative to increasing the sample size in microarray studies?. Bioinformation. 2007;1(10):429–431
  21. Shih JH, Michalowska AM, Dobbin K, et al. Effects of pooling mRNA in microarray class comparison. Bioinformatics. 2004;20:3318–3325
  22. Dobbin K, Shih J, Simon R. Questions and answers on design of dual-label microarrays for identifying differentially expressed genes. J Natl Cancer Inst. 2003;95:1362–1369
  23. Dobbin K, Simon R. Sample size determination in microarray experiments for class comparison and prognostic classification. Biostatistics. 2005;6:27–38
  24. Dobbin K, Simon R. Sample size planning for developing classifiers using high dimensional DNA expression data. Biostatistics. 2007;8:101–117
  25. Wright GW, Simon R. A random variance model for detection of differential gene expression in small microarray experiments. Bioinformatics. 2003;19:2448–2455
  26. Baldi P, Long AD. A Bayesian framework for the analysis of microarray expression data: regularized t test and statistical inferences of gene changes. Bioinformatics. 2001;17:509–519
  27. Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Series B. 1995;57:289–300
  28. Korn E, Troendle JF, McShane LM, et al. Controlling the number of false discoveries: application to high-dimensional genomic data. J Stat Plan Inference. 2004;124:379–398
  29. Korn EL, Li MC, McShane LM, et al. An investigation of SAM and the multivariate permutation test for controlling the false discovery proportion. Statistics in Medicine. 2007;26:4428–4440
  30. Subramanian A, Tamayo P, Mootha VK. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A. 2005;102(43):15545–15550
  31. Tian L, Greenberg SA, Kong SW, et al. Discovering statistically significant pathways in expression profiling studies. Proc Natl Acad Sci U S A. 2005;102(38):13544–13549
  32. Xu X, Zhao Y, Simon R. Gene sets expression comparison in BRB-ArrayTools. Bioinformatics. 2008;24:137–139
  33. Storey JD, Xiao W, Leek JT, et al. Significance analysis of time course microarray experiments. Proc Natl Acad Sci U S A. 2005;102:12837–12842
  34. Leek JT, Monsen E, Dabney AR, et al. EDGE: extraction and analysis of differential gene expression. Bioinformatics. 2006;22:507–508
  35. Simon R. When is a genomic classifier ready for prime time?. Nat Clin Pract Oncol. 2004;1(1):2–3
  36. Radmacher MD, McShane LM, Simon R. A paradigm for class prediction using gene expression profiles. J Comput Biol. 2002;9:505–512
  37. Dudoit S, Fridlyand J. Classification in microarray experiments. In:  Speed T editors. Statistical analysis of gene expression microarray data. London, New York, Washington D.C.: Boca Raton;: Chapman & Hall/CRC; 2003;
  38. Dudoit S, Fridlyand J, Speed TP. Comparison of discrimination methods for the classification of tumors using gene expression data. J Am Stat Assoc. 2002;97:77–87
  39. Golub TR, Slonim DK, Tamayo P, et al. Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science. 1999;286:531–537
  40. Ramaswamy S, Tamayo P, Rifkin R, et al. Multiclass cancer diagnosis using tumor gene expression signatures. Proc Natl Acad Sci U S A. 2001;98:15149–15154
  41. Ben-Dor A, Bruhn L, Friedman N, et al. Tissue classification with gene expression profiles. J Comput Biol. 2000;7:559–584
  42. Tibshirani R, Hastie T, Narasimhan B, et al. Diagnosis of multiple cancer types by shrunken centroids of gene expression. Proc Natl Acad Sci U S A. 2002;99:6567–6572
  43. Bo TH, Jonassen I. New feature subset selection procedures for classification of expression profiles. Genome Biol. 2002;3(4):0017.1–0017.11
  44. Ooi CH, Tan P. Genetic algorithms applied to multi-class prediction for the analysis of gene expression data. Bioinformatics. 2003;19:37–44
  45. Deutsch JM. Evolutionary algorithms for finding optimal gene sets in microarray prediction. Bioinformatics. 2003;19:45–54
  46. Kim S, Dougherty ER, Barrera J, et al. Strong feature sets from small samples. J Comput Biol. 2002;9:127–146
  47. Lai C, Reinders MJT, Veer LJvt, et al. A comparison of univariate and multivariate gene selection techniques for classification of cancer datasets. BMC Bioinformatics. 2006;7:235
  48. Lecocke M, Hess K. An empirical study of univariate and genetic algorithm-based feature selection in binary classification with microarray data. Cancer Inform. 2006;2:313–327
  49. Dobbin K, Beer DG, Meyerson M, et al. Inter-laboratory comparability study of cancer gene expression analysis using oligonucleotide microarrays. Clin Cancer Res. 2005;11:565–572
  50. Paik S, Shak S, Tang G, et al. A multigene assay to predict recurrence of tamoxifen-treated, node-negative breast cancer. N Engl J Med. 2004;351:2817–2826
  51. Rosenwald A, Wright G, Chan WC, et al. The use of molecular profiling to predict survival after chemotherapy for diffuse large-B-cell lymphoma. N Engl J Med. 2002;346:1937–1947
  52. Michiels S, Koscielny S, Hill C. Prediction of cancer outcome with microarrays: a multiple validation strategy. Lancet. 2005;365:488–492
  53. Molinaro AM, Simon R, Pfeiffer RM. Prediction error estimation: a comparison of resampling methods. Bioinformatics. 2005;21(15):3301–3307
  54. Simon R, Altman DG. Statistical aspects of prognostic factor studies in oncology. Br J Cancer. 1994;69:979–985
  55. Bast RC, Ravdin P, Hayes DF, et al. 2000 update of recommendations for the use of tumor markers in breast and colorectal cancer: clinical practice guidelines of the American Society of Clinical Oncology. J Clin Oncol. 2001;19:1865–1878
  56. Shipp MA, Ross KN, Tamayo P, et al. Diffuse large B-cell lymphoma outcome prediction by gene-expression profiling and supervised machine learning. Nat Med. 2002;8:68–74
  57. Simon R. A roadmap for developing and validating therapeutically relevant genomic classifiers. J Clin Oncol. 2005;23:7332–7341
  58. Simon R, Maitournam A. Evaluating the efficiency of targeted designs for randomized clinical trials. Clin Cancer Res. 2005;10:6759–6763
  59. Simon R. Using genomics in clinical trial design. Clin Cancer Res. 2008;14:5984–5993
  60. Sargent DJ, Conley BA, Allegra C. Clinical trial designs for predictive marker validation in cancer treatment trials. J Clin Oncol. 2005;23(9):2020–2027
  61. Pusztai L, Hess KR. Clinical trial design for microarray predictive marker discovery and assessment. Ann Oncol. 2004;15:1731–1737
  62. Bogaerts J, Cardoso F, Buyse M, et al. Gene signature evaluation as a prognostic tool: challenges in the design of the MINDACT trial. Nat Clin Pract Oncol. 2006;3(10):540–551
  63. Simon R, Wang SJ. Use of genomic signatures in therapeutics development. Pharmacogenomics J. 2006;6:166–173

PII: S1521-6926(09)00035-8

doi: 10.1016/j.beha.2009.07.001

Best Practice & Research Clinical Haematology
Volume 22, Issue 2 , Pages 271-282 , June 2009