Jeffrey T. Leek
|Alma mater||University of Washington|
|Known for||Biostatistics and Data Science|
|Institutions||Johns Hopkins Bloomberg School of Public Health|
|Doctoral advisor||John D. Storey|
Jeffrey Tullis Leek is an American biostatistician and data scientist working as a professor at the Johns Hopkins Bloomberg School of Public Health. He is an author of the Simply Statistics blog and teaches several online courses through Coursera as part of its Data Science Specialization. His most popular course is The Data Scientist's Toolbox, which he taught along with Roger Peng and Brian Caffo. Leek is best known for his contributions to genomic data analysis and for his critical view of research and of the accuracy of popular statistical methods.
Leek graduated from Utah State University in 2003 with a Bachelor of Science. He then studied at the University of Washington, earning a master's degree in 2005 and completing a PhD in biostatistics in 2007 under the guidance of John D. Storey.
Research and career
Leek joined Johns Hopkins University as an assistant professor in Biostatistics in 2009, working at the Bloomberg School of Public Health. In 2014 he became an associate professor in Biostatistics and Oncology.
Leek has given several talks at universities and research institutions, including a colloquium at Harvard and a lecture at the New York Genome Center titled “Building a Comprehensive Resource for the Study of Human Gene Expression with Machine Learning and Data Science” as part of its lecture series.
He is considered an expert on reproducibility and replication, and his work and opinions have been published in scientific and medical journals such as Nature and the Proceedings of the National Academy of Sciences. Leek also self-published a book, The Elements of Data Analytic Style.
His highly cited works include:
- "Capturing Heterogeneity in Gene Expression Studies by Surrogate Variable Analysis"
- "Tackling the Widespread and Critical Impact of Batch Effects in High-Throughput Data"
References
- "Faculty - Johns Hopkins".
- "About". Simply Statistics.
- Diane Peters (2018-02-22). "MOOCs are not dead, but evolving". University Affairs.
- Steven Salzberg (2015-04-13). "How Disruptive Are MOOCs? Hopkins Genomics MOOC Launches In June". Forbes.
- "Coursera - Data Scientists Toolbox".
- "Jeff Leek". LinkedIn.
- "Center for Computational Biology". Johns Hopkins University.
- "Software developed by Jeffrey Leek".
- "Software developed by The Center for Computation Biology".
- "Simply Statistics".
- Jeff Leek. "Is Most Published Research Really False?".
- "What Can 20,000+ RNA-seq Samples Tell Us About How Much Of The Genome Is Transcribed?". Harvard Colloquium Seminar.
- Jeff Leek. "Building a Comprehensive Resource for the Study of Human Gene Expression with Machine Learning and Data Science". New York Genome Center Lecture.
- Leek, Jeff; Peng, Roger (2015-04-28). "Statistics: P values are just the tip of the iceberg". Nature. 520 (7549).
- Leek, Jeff; McShane, Blakeley; Gelman, Andrew; Colquhoun, David; Nuijten, Michele; Goodman, Steven (2017-11-28). "Five Ways to Fix Statistics". Nature.
- "The Elements of Data Analytic Style".
- Karen Nitkin (2017-11-07). "Could you repeat that? Fixing the 'replication crisis' in biomedical research has become top priority". Hub.
- Leek, Jeff; Storey, John (2007-09-28). "Capturing Heterogeneity in Gene Expression Studies by Surrogate Variable Analysis". PLOS Genetics. 3 (9).
- Leek, Jeff; Scharpf, Robert; Corrado Bravo, Hector; Simcha, David; Langmead, Benjamin; Johnson, Evan; Geman, Donald; Baggerly, Keith; Irizarry, Rafael (2010-10-01). "Tackling the Widespread and Critical Impact of Batch Effects in High-Throughput Data". Nature Reviews Genetics. 11 (10).