Foundations of Statistics for Data Scientists: With R and Python

Foundations of Statistics for Data Scientists: With R and Python
ISBN-10
1000462919
ISBN-13
9781000462913
Category
Business & Economics
Pages
486
Language
English
Published
2021-11-22
Publisher
CRC Press
Authors
Alan Agresti, Maria Kateri

Description

Foundations of Statistics for Data Scientists: With R and Python is designed as a textbook for a one- or two-term introduction to mathematical statistics for students training to become data scientists. It is an in-depth presentation of the topics in statistical science with which any data scientist should be familiar, including probability distributions, descriptive and inferential statistical methods, and linear modeling. The book assumes knowledge of basic calculus, so the presentation can focus on "why it works" as well as "how to do it." Compared to traditional "mathematical statistics" textbooks, however, the book has less emphasis on probability theory and more emphasis on using software to implement statistical methods and to conduct simulations to illustrate key concepts. All statistical analyses in the book use R software, with an appendix showing the same analyses with Python. The book also introduces modern topics that do not normally appear in mathematical statistics texts but are highly relevant for data scientists, such as Bayesian inference, generalized linear models for non-normal responses (e.g., logistic regression and Poisson loglinear models), and regularized model fitting. The nearly 500 exercises are grouped into "Data Analysis and Applications" and "Methods and Concepts." Appendices introduce R and Python and contain solutions for odd-numbered exercises. The book's website has expanded R, Python, and Matlab appendices and all data sets from the examples and exercises.

Other editions

Similar books

  • Statistical Foundations of Data Science
    By Runze Li, Cun-Hui Zhang, Jianqing Fan

    It includes ample exercises that involve both theoretical studies as well as empirical applications. The book begins with an introduction to the stylized features of big data and their impacts on statistical analysis.

  • Statistical Foundations of Data Science
    By Runze Li, Cun-Hui Zhang, Jianqing Fan

    It includes ample exercises that involve both theoretical studies as well as empirical applications. The book begins with an introduction to the stylized features of big data and their impacts on statistical analysis.

  • Foundations of Data Science
    By Avrim Blum, John Hopcroft, Ravindran Kannan

    This book provides an introduction to the mathematical and algorithmic foundations of data science, including machine learning, high-dimensional geometry, and analysis of large networks.

  • Practical Statistics for Data Scientists: 50 Essential Concepts
    By Peter Bruce, Andrew Bruce

    With this book, you’ll learn: Why exploratory data analysis is a key preliminary step in data science How random sampling can reduce bias and yield a higher quality dataset, even with big data How the principles of experimental design ...

  • Statistical Foundations, Reasoning and Inference: For Science and Data Science
    By Christian Heumann, Göran Kauermann, Helmut Küchenhoff

    Moreover, the book addresses statistical ideas that are useful in modern data analytics, including bootstrapping, modeling of multivariate distributions, missing data analysis, causality as well as principles of experimental design.

  • Probability and Statistics for Data Science: Math + R + Data
    By Norman Matloff

    His book Statistical Regression and Classification: From Linear Models to Machine Learning was the recipient of the Ziegel Award for the best book reviewed in Technometrics in 2017.

  • Mathematical Foundations of Data Science Using R
    By Matthias Dehmer, Frank Emmert-Streib, Salissou Moutari

    The aim of the book is to help students become data scientists.

  • Doing Data Science: Straight Talk from the Frontline
    By Cathy O'Neil, Rachel Schutt

    But how can you get started working in a wide-ranging, interdisciplinary field that’s so clouded in hype? This insightful book, based on Columbia University’s Introduction to Data Science class, tells you what you need to know.

  • Introduction to Data Science: Data Analysis and Prediction Algorithms with R
    By Rafael A. Irizarry

    This book is a textbook for a first course in data science. No previous knowledge of R is necessary, although some experience with programming may be helpful.

  • Statistics and Data Science for Teachers
    By Christine Franklin, Anna Bargagliotti

    In supporting the spirit of Pre-K-12 Guidelines for Assessment and Instruction in Statistics Education II (GAISE II), this book presents statistical ideas through investigations and engagement with the statistical problem-solving process of ...