Statistics versus machine learning

Nat Methods. 2018 Apr;15(4):233-234. doi: 10.1038/nmeth.4642. Epub 2018 Apr 3.

Danilo Bzdok¹, Naomi Altman², Martin Krzywinski³

Affiliations

¹ Department of Psychiatry, RWTH Aachen University, Germany, and a Visiting Professor at INRIA/Neurospin Saclay in France.
² Statistics at The Pennsylvania State University.
³ Canada's Michael Smith Genome Sciences Centre.

PMID: 30100822
PMCID: PMC6082636
DOI: 10.1038/nmeth.4642

Danilo Bzdok et al. Nat Methods. 2018 Apr.

No abstract available

Figures

**Figure 1**
Simulated expression and RNA-seq read counts for 40 genes, of which the last 10 (A–J) are differentially expressed between two phenotypes (−/+). Simulated quantities and heat maps are log-scaled. (a) Simulated log mean expression levels for the genes, sampled from a normal distribution with mean 4 and s.d. 2. In the + phenotype, differential expression of genes A–J was created by adding a standard normal deviate to each gene's mean expression in the − phenotype. (b) Simulated RNA-seq read counts for ten subjects in each phenotype, generated from an overdispersed Poisson distribution based on the mean expression in panel a, with biological variation. The heat map shows z-scores of the read counts, normalized across all 20 subjects for a given gene.
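
The simulation described in this caption can be sketched roughly as follows. This is a minimal illustration, not the authors' code. Several details are assumptions because the caption does not state them: the log is taken to be natural, overdispersion is modelled as a gamma-Poisson mixture (negative binomial), the `dispersion` value is hypothetical, and z-scores are computed on log(count + 1). The function names are also made up for this example.

```python
import numpy as np


def simulate_counts(n_genes=40, n_de=10, n_subjects=10, dispersion=0.1, seed=0):
    """Simulate log mean expression and read counts for two phenotypes.

    `dispersion` is a hypothetical parameter; the caption gives no value.
    """
    rng = np.random.default_rng(seed)

    # Log mean expression in the - phenotype: Normal(mean=4, s.d.=2).
    log_mean_minus = rng.normal(4.0, 2.0, size=n_genes)

    # + phenotype: add a standard normal draw to the last n_de genes only.
    log_mean_plus = log_mean_minus.copy()
    log_mean_plus[-n_de:] += rng.normal(0.0, 1.0, size=n_de)

    def draw(log_mean):
        mu = np.exp(log_mean)  # assumption: natural log scale
        # Gamma-Poisson mixture: each subject gets its own rate with mean mu
        # and variance dispersion * mu**2, then a Poisson count is drawn.
        shape = 1.0 / dispersion
        rates = rng.gamma(shape, mu[None, :] / shape,
                          size=(n_subjects, log_mean.size))
        return rng.poisson(rates)

    # Rows are subjects (first the - phenotype, then the +), columns are genes.
    counts = np.vstack([draw(log_mean_minus), draw(log_mean_plus)])
    labels = np.repeat([0, 1], n_subjects)
    return log_mean_minus, log_mean_plus, counts, labels


def zscore_by_gene(counts):
    """Z-score log(count + 1) across all subjects, separately for each gene."""
    logc = np.log(counts + 1.0)
    sd = logc.std(axis=0)
    sd[sd == 0] = 1.0  # avoid division by zero for constant genes
    return (logc - logc.mean(axis=0)) / sd
```

Calling `simulate_counts()` returns a 20 × 40 count matrix, and `zscore_by_gene(counts)` gives the kind of matrix a heat map like the one in panel b would display.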

**Figure 2**
Analysis of gene ranking by classical inference and ML. (a) Unadjusted log-scaled P values from statistical differential expression analysis as a function of effect size, measured by fold change in expression. (b) Log-scaled P values from panel a as a function of gene importance from random forest classification. In panels a and b, red circles identify the ten differentially expressed genes from Figure 1; the remaining genes are indicated by open circles. (c) Distribution of the number of dysregulated genes correctly identified in 1,000 simulations by inference (gray fill) and random forest (black line).
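
The two rankings compared in this figure can be illustrated with a short sketch. The caption does not name the test or the forest settings, so the choices here are stand-ins: Welch's t-test on log2(count + 1) for the inference side, and scikit-learn's `RandomForestClassifier` with impurity-based importances for the ML side. The toy data use plain (not overdispersed) Poisson counts to keep the example short, and the helper name `rank_genes` is hypothetical.

```python
import numpy as np
from scipy import stats
from sklearn.ensemble import RandomForestClassifier


def rank_genes(counts, labels, n_trees=500, seed=0):
    """Return per-gene log2 fold change, t-test P value and forest importance."""
    logc = np.log2(counts + 1.0)
    minus, plus = logc[labels == 0], logc[labels == 1]

    # Effect size: difference of mean log2 expression (log2 fold change).
    log2_fc = plus.mean(axis=0) - minus.mean(axis=0)

    # Classical inference: one unadjusted Welch t-test per gene.
    _, p = stats.ttest_ind(plus, minus, axis=0, equal_var=False)
    p = np.nan_to_num(p, nan=1.0)  # constant genes give NaN; treat as P = 1

    # ML: fit a forest to predict phenotype, then read off gene importances.
    forest = RandomForestClassifier(n_estimators=n_trees, random_state=seed)
    forest.fit(logc, labels)
    return log2_fc, p, forest.feature_importances_


# Toy data: 40 genes, the last 10 shifted in the + phenotype (assumed setup).
rng = np.random.default_rng(0)
labels = np.repeat([0, 1], 10)
log_mean = rng.normal(4.0, 2.0, size=40)
shift = np.zeros(40)
shift[-10:] = rng.normal(0.0, 1.0, size=10)
counts = rng.poisson(np.exp(log_mean[None, :] + labels[:, None] * shift[None, :]))

log2_fc, p, importance = rank_genes(counts, labels)

# Take the ten top-ranked genes from each method and count true positives.
true_de = set(range(30, 40))
top_by_p = np.argsort(p)[:10]
top_by_rf = np.argsort(importance)[::-1][:10]
n_correct_p = len(set(top_by_p.tolist()) & true_de)
n_correct_rf = len(set(top_by_rf.tolist()) & true_de)
```

Repeating the data generation and ranking over many random seeds and tabulating `n_correct_p` and `n_correct_rf` would give distributions of the kind shown in panel c.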

Cited by

Pediatric Intensive Care Unit Length of Stay Prediction by Machine Learning.
Ganatra HA, Latifi SQ, Baloglu O. Bioengineering (Basel). 2024 Sep 26;11(10):962. doi: 10.3390/bioengineering11100962. PMID: 39451338.
Combining metabolomics and machine learning to discover biomarkers for early-stage breast cancer diagnosis.
Anh NK, Lee A, Phat NK, Yen NTH, Thu NQ, Tien NTN, Kim HS, Kim TH, Kim DH, Kim HY, Phuoc Long N. PLoS One. 2024 Oct 21;19(10):e0311810. doi: 10.1371/journal.pone.0311810. eCollection 2024. PMID: 39432469.
Construction and validation of a risk prediction model for extrauterine growth restriction in preterm infants born at gestational age less than 34 weeks.
Xie Y, Zhang Z, Luo M, Mo Y, Wei Q, Wang L, Zhang R, Zhong H, Li Y. Front Pediatr. 2024 Sep 18;12:1381193. doi: 10.3389/fped.2024.1381193. eCollection 2024. PMID: 39359744.
Accuracy of machine learning in predicting outcomes post-percutaneous coronary intervention: a systematic review.
Wee CF, Tan CJ, Yau CE, Teo YH, Go R, Teo YN, Jyn BK, Syn NL, Sim HW, Chen JZ, Wong RCC, Yip JW, Tan HC, Yeo TC, Chai P, Li TYW, Yeung WL, Djohan AH, Sia CH. AsiaIntervention. 2024 Sep 27;10(3):219-232. doi: 10.4244/AIJ-D-23-00023. eCollection 2024 Sep. PMID: 39347111.
Comparative Performance of Autoencoders and Traditional Machine Learning Algorithms in Clinical Data Analysis for Predicting Post-Staged GKRS Tumor Dynamics.
Volovăț SR, Popa TO, Rusu D, Ochiuz L, Vasincu D, Agop M, Buzea CG, Volovăț CC. Diagnostics (Basel). 2024 Sep 21;14(18):2091. doi: 10.3390/diagnostics14182091. PMID: 39335770.

References

1. Bzdok D. Front Neurosci. 2017;11:543.
2. Bzdok D, Krzywinski M, Altman N. Nat Methods. 2017;14:1119–1120.
3. Krzywinski M, Altman N. Nat Methods. 2014;11:355–356.
4. Lever J, Krzywinski M, Altman N. Nat Methods. 2016;13:603–604.
5. Altman N, Krzywinski M. Nat Methods. 2017;14:933–934.

Grants and funding

R01 MH074457/MH/NIMH NIH HHS/United States
