Professional Documents
Culture Documents
know:
A Taxonomic Bibliography
Contents & Bibliographic Taxonomy
Preface and Preamble
Section 1: Elementary & Introductory Texts
Section 2: Thinking Critically with Statistical and Probabilistic Reasoning
Section 3: Probability: Statistics by Any Other Name is Still Important
Section 3: Subject Specific Texts
Section 4: Some Important Studies on Critical Issues
Section 5: History & Development of Statistics
Section 6: Statistical & Computational Software
Section 7: Some Useful and Important Series of Monographs, Conference
Proceedings, and Volumes
Section 8: Sample of Texts: Not Your Grandmothers Statistics
Preface and Preamble:
Mathematics is the language of the sciences.i However, one area of mathematics dominates in
scientific research: statistics. From the statistical mechanics to machine learning and social
psychology, the degree of overlap in the statistical methods, measures, and models is rather
astounding considering how different typical methods used in these and other fields are, let alone
the fields themselves. For example, the following is from a paper by two experienced researchers
(one with both an MD and PhD) in a volume on pain research methods
Standard tests such as analysis of variance (ANOVA), Scheffs F test, and serial ttests...should be used to ensure statistical significance (p < 0.05)ii.
What these standard tests are is far less important than that they are standard. But standard
for whom? Naturally, the authors mean standard for their target audience:
target readers include beginners in pain research who may have substantial training and
experiences in other fields; and pain researchers who may have extensive knowledge and
experience in a specific field, but who may want to extend their research to a new level
(p. v).
Clearly, business researchers dont qualify. Yet looking at a textbook on business research
methodsiii we find ANOVA, t-tests, & F-tests all on one page in a chapter summary of Key
Concepts (p. 548). So despite the radical differences between beta testing products and
classifying nociceptive neurons in the peripheral nervous system, the same statistical tests are
standard.
Unfortunately, these standard tests are frequently inferior to readily available and easily used
alternatives, and are employed without a sufficient appreciation of their underlying logic and
assumptions. The literature on problems with the uses of statistics across different sciences is
vast, but the first point at least can be illustrated with a neutral statement based on
mathematically demonstrable fact:
Many of the statistical methods routinely used in contemporary research are based on a
compromise with the ideal... The compromise is represented by most statistical tests in
common use, such as the t and F testsiv
The most widely used statistical tests (including the t and F statistics) were developed by
Pearson, Galton, Edgeworth, Gosset, & Fisher in the late 19th and early 20th centuries.v Of
course, being old isnt the issue. After all, ideal [methods]represented by permutation tests,
such as Fishers exact testvi include those that have been around since the 30s. It was not until
computers that many of these methods could be used due to their computational demands.
Today, most researchers carry mobile phones that are more powerful than personal computers
from the 80s and 90s, while Fishers exact test was formulated roughly a decade prior to perhaps
the first rigorously defined theoretical computer.vii Meanwhile, far too many researchers us
powerful computers equipped with expensive statistical software only to continue using the
statistical tests that, as noted above, were compromises almost a century ago. We have possessed
the capability to employ superior methods we do not for far too long. It is time to start exploiting
the computational power we have at our disposal.
By exploiting their power I do not mean the garbage in, garbage out approach that seems to
necessarily follow from standard curricula. Even in top universities, such as Boston College, we
find evidence that graduate curricula is trending toward (if not already at) the point of actually
designing statistics and research methods courses around this approach. For example, at BC, we
find that the multivariate statistics course Sociology 703 uses the textbook Statistics and Data
Analysis for Nursing Research. Are there truly no adequate textbooks that explain the same
methods but in a context fit for students of sociology? More troublesome is how graduate level
research & statistics courses across the sciences have increasingly become more classes on how
to use software than courses on statistics. In other words, we are teaching future researchers how
to use utilize software such as SPSS in order to perform statistical techniques they do not
understand.
The reason for this is not just because such software enables researchers to perform complicated
analyses or construct sophisticated models with minimal understanding. Another major
contributing factor is easily found. We need look no farther than e.g., the 3rd edition of The
Linear Algebra a Beginning Graduate Student Ought to Know, as linear algebra is the foundation
for basically all of modern statistics:
Everitt, B., & Hothorn, T. (2011). An Introduction to Applied Multivariate Analysis with R (Use
R!). Springer.
Foster, J. J., Barkus, E., & Yavorsky, C. (2005). Understanding and Using Advanced Statistics.
Sage.
Gentle, J. E. (2007). Matrix Algebra: Theory, Computations, and Applications in Statistics
(Springer Texts in Statistics). Springer.
Harville, D. A. (2008). Matrix Algebra From a Statistician's Perspective. Springer. [This book
has a companion Exercises and Solutions text]
Lynch, Scott M. Introduction to Applied Bayesian Statistics and Estimation for Social Scientists
(Statistics for Social and Behavioral Sciences). Springer, 2007.
Schwarz, W. (2007). 40 Puzzles and Problems in Probability and Mathematical Statistics.
Springer.
Wickens, T. D. (1995). The Geometry of Multivariate Statistics. Psychology Press.
Wilcox, R. R. (2009). Basic Statistics: Understanding Conventional Methods and Modern
Insights. Oxford University Press.
Wilcox, R. (2011). Modern statistics for the social and behavioral sciences: A practical
introduction. CRC press.
Wilcox, R. R. (2012). Introduction to Robust Estimation and Hypothesis Testing (3rd Ed.).
Academic Press.
Young, G. A., & Smith, R. L. (2005). Essentials of statistical inference (Cambridge Series in
Statistical and Probabilistic Mathematics Vol. 16). Cambridge University Press.
Sect. 2: Thinking Critically with Statistical and Probabilistic Reasoning
Aliseda, A. (2006). Abductive Reasoning: Logical Investigations into Discovery and Explanation
(Synthese Library Vol. 330). Dordrecht: Springer.
Alon, N., & Spencer, J. H. (2000). The Probabilistic Method. (2nd Ed.) (Wiley-Interscience
Series in Discrete Mathematics and Optimization). Wiley.
Berger, J. O. (1985). Statistical Decision Theory and Bayesian Analysis (Springer Series in
Statistics). Springer.
Chen, M. H., Mller, P., Sun, D., Ye, K., & Dey, D. K. (2010). Frontiers of Statistical Decision
Making and Bayesian Analysis: In Honor of James O. Berger. Springer.
Courgeau, D. (2012). Probability and Social Science: Methodological Relationships between the
two Approaches (Methodos Series Vol. 10). Springer Science & Business Media.
Das, S. K. (2008). Foundations Of Decision-Making Agents: Logic, Probability and Modality.
World Scientific.
Evans, M. J., & Rosenthal, J. S. (2010). Probability And Statistics: The Science Of Uncertainty.
W. H. Freeman & Co.
Hacking, Ian. An introduction to probability and inductive logic. Cambridge University Press,
2001.
Haenni, R., Romeijn, J-W., Wheeler, G., & Williamson, J. (2011). Probabilistic Logics and
Probabilistic Networks (Synthese Library Vol. 350). Springer.
Howson, C., & Urbach, P. (2006). Scientific Reasoning: The Bayesian Approach (3rd Ed.). Open
Court Publishing.
Jaynes, E. T. (2003). Probability Theory: The Logic of Science. Cambridge University Press.
Jeffrey, R. (2004). Subjective Probability: The Real Thing. Cambridge University Press.
Morgan, S. L., & Winship, C. (2007). Counterfactuals and Causal Inference- Methods and
Principles for Social Research (Analytical Methods for Social Research). Cambridge University
Press.
Morrison, M. (2000). Unifying Scientific Theories: Physical Concepts and Mathematical
Structures. Cambridge University Press.
Nickerson, R. (2011). Mathematical reasoning: Patterns, problems, conjectures, and proofs.
Psychology Press.
Press, S. J. (2009). Subjective and Objective Bayesian Statistics: Principles, Models, and
Applications (Wiley Series in Probability and Statistics Vol. 590). Wiley.
Perea, A. (2012). Epistemic game theory: reasoning and choice. Cambridge University Press.
Walley, P. (1991). Statistical Reasoning with Imprecise Probabilities (Monographs on Statistics
and Applied Probability Vol. 42). Chapman & Hall.
Sect. 3: Probability Theory: Statistics by Any Other Name is Still Important
I added this section for two reasons:
1) I have found that more and more researchers who rely on statistical methods, tests, models,
etc., appeal to probability distributions without having studied probability
2) Even those who have taken a course in probability theory remain unfamiliar with measuretheoretic probability. Although its hardly necessary for most researchers to understand Lebesque
integration, functional spaces, and so forth, understanding probability measure at some level is
actually important.
Athreya, K. B., & Lahiri, S. N. (2006). Measure Theory and Probability Theory (Springer Texts
in Statistics). Springer.
Billingsley, P. (1995). Probability and Measure (Wiley Series in Probability and Mathematical
Statistics) (3rd Ed.). Wiley.
Bobrowski, A. (2005). Functional Analysis for Probability and Stochastic Processes: An
Introduction. Cambridge University Press.
Gut, A. (2005). Probability: A Graduate Course (Springer Texts in Statistics). Springer.
Heyer, H. (2010). Structural Aspects in the Theory of Probability (2nd Ed.) (Series on
Multivariate Analysis Vol. 8). World Scientific.
Leadbetter, R., Cambanis, S., & Pipiras, V. (2014). A Basic Course in Measure and Probability:
Theory for Applications. Cambridge University Press.
Pollard, D. (2002). A User's Guide to Measure Theoretic Probability (Cambridge Series in
Statistical and Probabilistic Mathematics). Cambridge University Press.
Roussas, G. G. (2014). An Introduction to Measure-Theoretic Probability (2nd Ed.). Academic
Press.
Sect. 4: Subject Specific Texts
Angrist, J. D., & Pischke, J. S. (2008). Mostly harmless econometrics: An empiricist's
companion. Princeton university press.
Aschwanden, M. (2011). Self-Organized Criticality in Astrophysics: The Statistics of Nonlinear
Processes in the Universe. Springer.
Baecher, G. B., & Christian, J. T. (2005). Reliability and Statistics in Geotechnical Engineering.
Wiley.
Barcel, J. A. (2009). Computational Intelligence in Archaeology. Information Science
Reference.
Beran, J. (2004). Statistics in Musicology. CRC Press.
Chandler, R., & Scott, M. (2011). Statistical Methods for Trend Detection and Analysis in the
Environmental Sciences (Statistics in Practice). Wiley.
Chattopadhyay, A. K., & Chattopadhyay, T. (2014). Statistical Methods for Astronomical Data
Analysis (Springer Series in Astrostatistics Vol. 3). Springer.
Drennan, R. D. (2009). Statistics for Archaeologists: A Common Sense Approach
(Interdisciplinary Contributions to Archaeology). Springer.
Deutsch, A., Brusch, L., Byrne, H., de Vries, G., & Herzel, H. (2007). Mathematical Modeling of
Biological Systems, Vol. I: Cellular Biophysics, Regulatory Networks, Development,
Biomedicine, and Data Analysis (Modeling and Simulation in Science, Engineering and
Technology). Birkhuser
Diggle, P., & Ribeiro, P. J. (2007). Model-based Geostatistics (Springer Series in Statistics).
Springer.
Feeman, T. G. (2010). The Mathematics of Medical Imaging: A Beginner's Guide (SUMAT).
Springer.
Feigelson, E. D., & Babu, G. J. (2012). Modern Statistical Methods for Astronomy: With R
Applications. Cambridge University Press.
Gelfand, A. E., Diggle, P., Guttorp, P., & Fuentes, M. (Eds.). (2010). Handbook of Spatial
Statistics (Chapman & Hall CRC Handbook of Modern Statistical Methods). CRC Press.
Gregory, P. (2005). Bayesian Logical Data Analysis for the Physical Sciences: A Comparative
Approach with Mathematica Support. Cambridge University Press.
Hsieh, W. W. (2009). Machine Learning Methods in the Environmental Sciences: Neural
Networks and Kernels. Cambridge University Press.
Lamm, E., & Unger, R. (2011). Biological Computation (Mathematical and Computational
Biology Series). CRC Press.
Landau, D. P., & Binder, K. (2009). A Guide to Monte Carlo Simulations in Statistical Physics
(3rd ed.). Cambridge University Press.
Loy, G. (2006). Musimathics: The Mathematical Foundations of Music (Vol. I). MIT Press.
Loy, G. (2007). Musimathics: The Mathematical Foundations of Music (Vol. II). MIT Press.
Mussardo, G. (2010). Statistical Field Theory: An Introduction to Exactly Solved Models in
Statistical Physics (Oxford Graduate Texts). Oxford University Press.
Dale, A. I. (1999). A History of Inverse Probability: From Thomas Bayes to Karl Pearson (2nd
ed.) (Sources and Studies in the History of Mathematics and Physical Sciences). Springer.
David, F. N. (1998). Games, Gods And Gambling: The Origins And History Of Probability And
Statistical Ideas From The Earliest Times To The Newtonian Era. Hafner Publishing.
Fischer, H. (2010). A History of the Central Limit Theorem:From Classical to Modern
Probability Theory (Sources and Studies in the History of Mathematics and Physical Sciences).
Springer.
Gigerenzer, G., & Swijtink, Z. (1989). The Empire of Chance: How Probability Changed
Science and Everyday Life (Ideas in Context). Cambridge University Press.
Godin, B. (2005). Measurement and Statistics on Science and Technology: 1920 to the Present
(Routledge Studies in the History of Science, Technology, and Medicine). Routledge.
Hald, A. (2003) A History of Probability and Statistics and Their Applications before 1750
(Wiley Series in Probability and Statistics). Wiley.
Hald, A. (2007). A History of Parametric Statistical Inference from Bernoulli to Fisher, 17131935 (Sources and Studies in the History of Mathematics and Physical Sciences). Springer.
Johnson, N. L., & Kotz, S. (Eds.). (1997). Leading Personalities in Statistical Sciences: From
the Seventeenth Century to the Present (Wiley Series in Probability and Statistics). Wiley.
Lehmann, E. L. (2011). Fisher, Neyman, and the Creation of Classical Statistics. Springer
Science.
Von Plato, J. (1994). Creating Modern Probability: Its Mathematics, Physics and Philosophy in
Historical Perspective. Cambridge University Press.
Stigler, S. M. (1986). The History of Statistics: The Measurement of Uncertainty before 1900.
Harvard University Press.
Stigler, S. M. (1999). Statistics on the Table: The History of Statistical Concepts and Methods.
Harvard University Press.
Tabak, J. (2004). Probability And Statistics: The Science Of Uncertainty (History of
Mathematics). Facts on File.
Sect. 7: Statistical & Computational Software
I have deliberately avoided including general introductions to software packages, one simple
reason being that these abound and are too much alike for me to recommend those I think best
written. Instead, I have tried to include sources that are specific to a subject or topic, or in
general texts covering applications that general introductions (and giant reference books) lack. I
have also deliberately not included any SPSS books (as I find SPSS to be a major factor in the
current deficit in statistics education for many graduate students) and so too with Excel (which is
just SPSS without the pre-loaded statistical methods).
Borgo, M., Soranzo, A., & Grassi, M. (2012). MATLAB for Psychologists. Springer.
Chekanov, S. V. (2010). Scientific data analysis using Jython Scripting and Java. Springer.
Chen, D. G., & Peace, K. E. (2011). Clinical Trial Data Analysis Using R (Biostatistics Series).
CRC Press.
Chihara, L. M., & Hesterberg, T. C. (2012). Mathematical Statistics with Resampling and R.
Wiley.
Everitt, B. S., & Rabe-Hesketh, S. (2006). Handbook of Statistical Analyses Using Stata. CRC
Press.
Foulkes, A. S. (2009). Applied Statistical Genetics with R: For Population-based Association
Studies (Use R!). Springer.
Haneberg, W. C. (2004). Computational Geosciences with Mathematica. Springer.
Hastings, K. J. (2010). Introduction to Probability with Mathematica (2nd ed.) (Textbooks in
Mathematics). CRC Press.
Jones, O., Maillardet, R., & Robinson, A. (2012). Introduction to Scientific Programming and
Simulation Using R. CRC Press.
Kay, S. M. (2006). Intuitive Probability and Random Processes using MATLAB. Springer.
Li, Y., & Baron, J. (2012). Behavioral Research Data Analysis with R (Use R!). Springer.
Martinez, W. L., & Martinez, A. R. (2002). Computational statistics handbook with MATLAB.
CRC press.
Martinez, W. L., & Martinez, A. R. (2005). Exploratory Data Analysis with MATLAB (Series in
Computer Science and Data Analysis). CRC Press.
Marin, J. M., & Robert, C. P. (2014). Bayesian Essentials with R (Springer Texts in Statistics).
Springer.
Mathur, S. K. (2010). Statistical Bioinformatics with R. Academic Press.
Pitt-Francis, J., & Whiteley, J. (2012). Guide to Scientific Computing in C++ (Undergraduate
Topics in Computer Science). Springer.
Ruskeep, H. (2009). Mathematica Navigator: Mathematics, Statistics and Graphics (3rd ed.).
Academic Press.
Suess, E. A., & Trumbo, B. E. (2010). Introduction to Probability Simulation and Gibbs
Sampling with R (Use R!). Springer.
Sumathi, S., & Paneerselvam, S. (2010). Computational Intelligence Paradigms: Theory &
Applications using MATLAB. CRC Press.
Torrence, B. F., & Torrence, E. A. (2009). The Student's Introduction to Mathematica: A
Handbook for Precalculus, Calculus, and Linear Algebra (2nd ed.). Cambridge University Press.
Wallisch, P., Lusignan, M. E., Benayoun, M. D., Baker, T. I., Dickey, A. S., & Hatsopoulos, N.
G. (2014). Matlab for Neuroscientists: An Introduction to Scientific Computing for MATLAB (2nd
ed.). Academic Press.
Williams, G. (2011). Data Mining with Rattle and R: The Art of Excavating Data for Knowledge
Discovery (Use R!). Springer.
Zhao, Y. (2012). R and Data Mining: Examples and Case Studies. Academic Press.
Zieffler, A. S., Harring, J. R., & Long, J. D. (2011). Comparing Groups: Randomization and
Bootstrap Methods Using R. Wiley.
Zuur, A., Ieno, E. N., Walker, N., Saveliev, A. A., & Smith, G. M. (2009). Mixed Effects Models
and Extensions in Ecology with R. Springer.
Sect. 7: Some Useful and Important Series of Monographs, Conference Proceedings, and
Volumes
*Series names are given followed by further identification information. Usually, this is merely
the publisher, but can include e.g., the series that a particular series is published under (Lecture
Notes in Computer Science (LNCS), for example, publishes hundreds of conference proceedings
subseries) or other information to ensure the reader can easily look up the series in question.*
Advances in Computational Intelligence. Communications in Computer and Information Science
(CCIS). Springer.
Advances in Knowledge Discovery and Data Mining. Lecture notes in Computer Science
(Subseries Lecture Notes in Artificial Intellegence). Springer.
ASA-SIAM Series on Statistics and Applied Probability (Society for Industrial and Applied
Mathematics)
Contributions to Statistics (Springer)
Grafarend, E. W. (2006). Linear and Nonlinear Models: Fixed Effects, Random Effects, and
Mixed Models. Walter de Gruyter.
Grigoletto, M., Lisi, F., & Petrone, S. (2013). Complex Models and Computational Methods in
Statistics (Contributions to Statistics). Springer.
Gut, A. (2005). Probability: a graduate course (Springer Series in Statistics). Springer.
Hastie, T., Tibshirani, R., Friedman, J., & Franklin, J. (2009) The Elements of Statistical
Learning: Data Mining, Inference, and Prediction (2nd ed.) (Springer Series in Statistics).
Springer.
Heyer, H. (2009). Structural Aspects in the Theory of Probability (2nd Enlarged ed.) (Series on
Multivariate Analysis, Vol. 8). World Scientific.
Hilbe, J. (2011). Negative Binomial Regression (2nd Ed.). Cambridge University Press.
Huber, C., Limnios, N., Mesbah, M., & Nikulin, M. (Eds.). (2010). Mathematical Methods in
Survival Analysis, Reliability and Quality of Life (Applied Stochastic Methods Series). Wiley.
Izenman, A. J. (2009). Modern multivariate statistical techniques: regression, classification, and
manifold learning (Springer Texts in Statistics). Springer.
Kogan, J., Nicholas, C., & Teboulle, M. (2006). Grouping Multidimensional Data: Recent
Advances in Clustering. Springer.
Krzanowski, W. J., & Hand, D. J. (2009). ROC Curves for Continuous Data (Monographs on
Statistics and Applied Probability Vol. 111). CRC Press.
Kulkarni, S., & Harman, G. (2011). An elementary introduction to statistical learning theory
(Wiley Series in Probability and Statistics). Wiley.
Li, F., & Klette, R. (2011). Euclidean Shortest Paths: Exact or Approximate Algorithms.
Springer.
Liu, H., & Motoda, H. (Eds.). (2007). Computational Methods of Feature Selection. CRC Press.
Lomax, R. G., & Schumacker, R. E. (2010). A beginner's guide to structural equation modeling.
(3rd ed.) Routledge.
Mantovan, P. (Ed.). (2010). Complex Data Modeling and Computationally Intensive Statistical
Methods (Contributions to Statistics). Springer.
Mielke, P. W., & Berry, K. J. (2007). Permutation methods: a distance function approach (2nd
ed.) (Springer Series in Statistics). Springer.
FormanyitisalsoascienceorevenTHEscience.Forexample,inhis1997monographMathematicsasaScience
ofPatterns,ResnikcallsmathematicsthequeenofthesciencesbutdoesnotciteGauss(whencecomesthe
quote,albeitintheformDieMathematikistdieKniginderWissenschaften).Apparently,itisnolongera
quotebutaproverboraphorism/apothegmandhencerequiresnocitation.
ii
Jasmin,L.,&Ohara,P.T.(2004).Anatomicalidentificationofneuronsresponsivetonociceptivestimuli.InZ.
D.Luo(Ed.)PainResearch(pp.167188).HumanaPress.
iii
Zikmund,W.,Babin,B.,Carr,J.,&Griffin,M.(2009).BusinessResearchMethods(8thEd.).CengageLearning.
iv
Mielke,P.W.,&Berry,K.J.(2007).PermutationMethods:ADistanceFunctionApproach(2ndEd.).Springer.
v
Seesection4
vi
Mielke&Berry,2007.
vii
Fisher,R.A.(1922).Ontheinterpretationof2fromcontingencytables,andthecalculationofP.Journalofthe
RoyalStatisticalSociety,8794.;Turing,A.M.(1936),"Oncomputablenumbers,withanapplicationtothe
Entcheidungproblem",Proc.Lond.Math.Soc.IISer.,Vol.42,pp.23065.