Mathematics of Machine Learning

By: Tivadar Danka
Overview of this book

Mathematics of Machine Learning provides a rigorous yet accessible introduction to the mathematical underpinnings of machine learning, designed for engineers, developers, and data scientists ready to elevate their technical expertise. With this book, you’ll explore the core disciplines of linear algebra, calculus, and probability theory essential for mastering advanced machine learning concepts.

PhD mathematician turned ML engineer Tivadar Danka—known for his intuitive teaching style that has attracted 100k+ followers—guides you through complex concepts with clarity, providing the structured guidance you need to deepen your theoretical knowledge and enhance your ability to solve complex machine learning problems.

Balancing theory with application, this book offers clear explanations of mathematical constructs and their direct relevance to machine learning tasks. Through practical Python examples, you’ll learn to implement and use these ideas in real-world scenarios, such as training machine learning models with gradient descent or working with vectors, matrices, and tensors. By the end of this book, you’ll have gained the confidence to engage with advanced machine learning literature and tailor algorithms to meet specific project requirements.
Table of Contents (36 chapters)

2. Part 1: Linear Algebra
11. References
12. Part 2: Calculus
19. References
20. Part 3: Multivariable Calculus
24. References
25. Part 4: Probability Theory
29. References
30. Part 5: Appendix
31. Other Books You May Enjoy
32. Index

16. Derivatives and Gradients

Now that we understand why multivariate functions and high-dimensional spaces are more complex than the single-variable case we studied earlier, it’s time to see how to do things in the general case.

To recap quickly, our goal in machine learning is to optimize functions with millions of variables. For instance, think about a neural network N(x,w) trained for binary classification, where

  • x ∈ ℝⁿ is the input data,
  • w ∈ ℝᵐ is the vector compressing all of the weight parameters,
  • and N(x,w) ∈ [0,1] is the prediction, representing the probability of belonging to the positive class.
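
To make the notation concrete, here is a minimal Python sketch of one possible N(x, w). It is not the book's architecture, just a hypothetical stand-in: a single linear layer followed by a sigmoid, so the output lands in [0, 1] and the weight vector has the same dimension as the input (m = n in this toy case).

import numpy as np

def sigmoid(z):
    # Squash any real number into the open interval (0, 1).
    return 1.0 / (1.0 + np.exp(-z))

def N(x, w):
    # Toy stand-in for the network: a single linear layer + sigmoid,
    # so N(x, w) can be read as the probability of the positive class.
    return sigmoid(np.dot(w, x))

x = np.array([0.5, -1.2, 3.0])   # one input sample, n = 3
w = np.array([0.1, 0.4, -0.2])   # weight vector, m = n = 3 in this toy model
print(N(x, w))                   # a number in (0, 1)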

In the case of, say, binary cross-entropy loss, we have the loss function

L(w) = -\sum_{i=1}^{d} y_i \log N(x_i, w),

where xᵢ is the i-th data point with ground truth yᵢ ∈ {0,1}. See, I told you that we have to write much more in multivariable calculus. (We’ll talk about binary cross-entropy loss in Chapter 20.)
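
As a quick sanity check on the formula, here is a hedged sketch that evaluates this loss on a tiny made-up dataset, reusing the toy stand-in for N(x, w) from above; the data and weights are purely illustrative.

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def N(x, w):
    # Same toy stand-in as before: linear layer + sigmoid.
    return sigmoid(np.dot(w, x))

def loss(w, X, y):
    # L(w) = -sum_{i=1}^{d} y_i * log N(x_i, w), as displayed above.
    preds = np.array([N(x_i, w) for x_i in X])
    return -np.sum(y * np.log(preds))

# d = 4 toy data points in R^3 with binary ground-truth labels.
X = np.array([[ 0.5, -1.2,  3.0],
              [ 1.0,  0.3, -0.5],
              [-2.0,  0.7,  1.1],
              [ 0.0,  1.5, -1.0]])
y = np.array([1, 0, 1, 0])
w = np.array([0.1, 0.4, -0.2])

print(loss(w, X, y))  # a single non-negative number measuring the fit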

Training the neural network is the same as finding a...
