Foundations of Data Science

ISBN-10: 1108617360
ISBN-13: 9781108617369
Category: Computers
Language: English
Published: 2020-01-23
Publisher: Cambridge University Press
Authors: Avrim Blum, John Hopcroft, Ravindran Kannan

Description

This book provides an introduction to the mathematical and algorithmic foundations of data science, including machine learning, high-dimensional geometry, and analysis of large networks. Topics include the counterintuitive nature of data in high dimensions, important linear algebraic techniques such as singular value decomposition, the theory of random walks and Markov chains, the fundamentals of and important algorithms for machine learning, algorithms and analysis for clustering, probabilistic models for large networks, representation learning including topic modelling and non-negative matrix factorization, wavelets and compressed sensing. Important probabilistic techniques are developed including the law of large numbers, tail inequalities, analysis of random projections, generalization guarantees in machine learning, and moment methods for analysis of phase transitions in large random graphs. Additionally, important structural and complexity measures are discussed such as matrix norms and VC-dimension. This book is suitable for both undergraduate and graduate courses in the design and analysis of algorithms for data.

Other editions

Foundations of Data Science
- 2020-01-23
- 433 pages
- Paperback
- Cambridge University Press

Similar books

Statistical Foundations of Data Science
By Runze Li, Cun-Hui Zhang, Jianqing Fan
It includes ample exercises that involve both theoretical studies and empirical applications. The book begins with an introduction to the stylized features of big data and their impacts on statistical analysis.
Mathematical Foundations of Data Science Using R
By Matthias Dehmer, Frank Emmert-Streib, Salissou Moutari
The aim of the book is to help students become data scientists.
Foundations of Statistics for Data Scientists: With R and Python
By Alan Agresti, Maria Kateri
Appendices introduce R and Python and contain solutions for odd-numbered exercises. The book's website (http://stat4ds.rwth-aachen.de/) has expanded R, Python, and Matlab appendices and all data sets from the examples and exercises.
Mathematical Foundations for Data Analysis
By Jeff M. Phillips
In particular, this book was designed and written as preparation for students planning to take rigorous Machine Learning and Data Mining courses.
Data Science Foundations: Geometry and Topology of Complex Hierarchic Systems and Big Data Analytics
By Fionn Murtagh
This book is designed to provide a new framework for Data Science, based on a solid foundation in mathematics and computational science.
Foundations of Data Science for Engineering Problem Solving
By Gitanjali Rahul Shinde, Parikshit Narendra Mahalle, Priya Dudhale Pise
This book is a one-stop shop offering essential information that readers can apply in real-time business expansions to solve engineering problems in various disciplines.
Mathematical Foundations of Big Data Analytics
By Vladimir Shikhman, David Müller
On the Epistemology of Data Science: Conceptual Tools for a New Inductivism
By Wolfgang Pietsch
This book addresses controversies concerning the epistemological foundations of data science: Is it a genuine science?
Introduction to Data Science: Data Analysis and Prediction Algorithms with R
By Rafael A. Irizarry
This book is a textbook for a first course in data science. No previous knowledge of R is necessary, although some experience with programming may be helpful.