This repository contains my Jupyter Notebook files for the neural networks course taught by Andrej Karpathy. The course starts with neural network basics and progresses to more advanced topics. Each lecture is covered by its own Jupyter Notebook file (.ipynb).
Lecture 1: The spelled-out intro to neural networks and backpropagation: building micrograd
- Introduction to gradients and calculating the slope of a function using small increments (numerical differentiation)
- Recreating micrograd (the Value class) to build mathematical expressions that can be automatically backpropagated (a minimal sketch follows this list)
- Visualizing mathematical expressions with a computational graph composed of the operations tracked by micrograd
- Manual backpropagation for a simple neuron model and its activation function (tanh)
- Backpropagating through the same neuron using PyTorch's autograd for comparison (sketched after this list)
- Building a basic neural network (multi-layer perceptron) from scratch using micrograd and applying the tanh activation function.
- Training the neural network on a simple dataset: repeated forward and backward passes to minimize a squared-error loss via gradient descent (a training-loop sketch follows this list).
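
A minimal sketch of the Value-class idea from this lecture, supporting only addition, multiplication, and tanh; the single-neuron example at the end uses illustrative values:

```python
import math

class Value:
    """A minimal micrograd-style scalar that tracks its computation graph."""
    def __init__(self, data, _children=(), _op=''):
        self.data = data
        self.grad = 0.0
        self._backward = lambda: None
        self._prev = set(_children)
        self._op = _op

    def __add__(self, other):
        other = other if isinstance(other, Value) else Value(other)
        out = Value(self.data + other.data, (self, other), '+')
        def _backward():
            self.grad += out.grad          # d(out)/d(self) = 1
            other.grad += out.grad         # d(out)/d(other) = 1
        out._backward = _backward
        return out

    def __mul__(self, other):
        other = other if isinstance(other, Value) else Value(other)
        out = Value(self.data * other.data, (self, other), '*')
        def _backward():
            self.grad += other.data * out.grad
            other.grad += self.data * out.grad
        out._backward = _backward
        return out

    def tanh(self):
        t = math.tanh(self.data)
        out = Value(t, (self,), 'tanh')
        def _backward():
            self.grad += (1 - t ** 2) * out.grad   # d(tanh x)/dx = 1 - tanh(x)^2
        out._backward = _backward
        return out

    def backward(self):
        # topological order over the graph, then apply the chain rule in reverse
        topo, visited = [], set()
        def build(v):
            if v not in visited:
                visited.add(v)
                for child in v._prev:
                    build(child)
                topo.append(v)
        build(self)
        self.grad = 1.0
        for v in reversed(topo):
            v._backward()

# a single tanh neuron: out = tanh(x1*w1 + x2*w2 + b)
x1, x2 = Value(2.0), Value(0.0)
w1, w2 = Value(-3.0), Value(1.0)
b = Value(6.8813735870195432)
out = (x1 * w1 + x2 * w2 + b).tanh()
out.backward()
print(out.data, x1.grad, w1.grad)
```

The same neuron can be backpropagated with PyTorch's autograd to check the gradients computed above (tensor values mirror the sketch):

```python
import torch

# same neuron as above, but letting PyTorch compute the backward pass
x1 = torch.tensor(2.0, requires_grad=True)
x2 = torch.tensor(0.0, requires_grad=True)
w1 = torch.tensor(-3.0, requires_grad=True)
w2 = torch.tensor(1.0, requires_grad=True)
b  = torch.tensor(6.8813735870195432, requires_grad=True)

out = torch.tanh(x1 * w1 + x2 * w2 + b)
out.backward()
print(out.item(), x1.grad.item(), w1.grad.item())
```

A compressed sketch of the training loop, written here with plain PyTorch tensors rather than the notebook's from-scratch Value/MLP classes; the toy dataset, layer sizes, learning rate, and step count are illustrative:

```python
import torch

# toy dataset in the spirit of the lecture: 4 inputs with 3 features each
xs = torch.tensor([[2.0, 3.0, -1.0],
                   [3.0, -1.0, 0.5],
                   [0.5, 1.0, 1.0],
                   [1.0, 1.0, -1.0]])
ys = torch.tensor([1.0, -1.0, -1.0, 1.0])

# a 3 -> 4 -> 4 -> 1 MLP with tanh activations and random initial weights
g = torch.Generator().manual_seed(42)
W1 = torch.randn((3, 4), generator=g, requires_grad=True)
b1 = torch.randn(4, generator=g, requires_grad=True)
W2 = torch.randn((4, 4), generator=g, requires_grad=True)
b2 = torch.randn(4, generator=g, requires_grad=True)
W3 = torch.randn((4, 1), generator=g, requires_grad=True)
b3 = torch.randn(1, generator=g, requires_grad=True)
params = [W1, b1, W2, b2, W3, b3]

for step in range(100):
    # forward pass
    h1 = torch.tanh(xs @ W1 + b1)
    h2 = torch.tanh(h1 @ W2 + b2)
    ypred = torch.tanh(h2 @ W3 + b3).squeeze()
    loss = ((ypred - ys) ** 2).sum()       # squared-error loss

    # backward pass
    for p in params:
        p.grad = None
    loss.backward()

    # gradient descent update
    with torch.no_grad():
        for p in params:
            p -= 0.05 * p.grad

print(loss.item(), ypred.tolist())
```
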
Lecture 2: The spelled-out intro to language modeling: building makemore
- Bigram Generation:
- Creating bigrams from the dataset and counting their occurrences.
- Visualizing the bigram frequency using a heatmap.
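
A minimal sketch of the bigram counting and heatmap steps, assuming the names.txt dataset used in the lecture (lowercase words, one per line, a 26-letter alphabet plus a '.' start/end token):

```python
import torch
import matplotlib.pyplot as plt

# assumes a names.txt file with one lowercase word per line
words = open('names.txt', 'r').read().splitlines()

# character vocabulary with '.' as the start/end token at index 0
chars = sorted(set(''.join(words)))
stoi = {s: i + 1 for i, s in enumerate(chars)}
stoi['.'] = 0
itos = {i: s for s, i in stoi.items()}

# count every bigram (ch1, ch2) into a 27x27 matrix
N = torch.zeros((27, 27), dtype=torch.int32)
for w in words:
    chs = ['.'] + list(w) + ['.']
    for ch1, ch2 in zip(chs, chs[1:]):
        N[stoi[ch1], stoi[ch2]] += 1

# heatmap of bigram frequencies
plt.figure(figsize=(16, 16))
plt.imshow(N, cmap='Blues')
plt.show()
```
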
- Probability Calculations:
- Initializing probability matrix 'P' based on bigram counts for each character.
- Smoothing the counts to avoid zero probabilities (optional).
- Generating new words using the trained model.
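
Continuing from the count matrix N above, a sketch of the probability matrix, the +1 smoothing, and word generation; the seed and number of samples are arbitrary:

```python
# turn counts into row-normalized probabilities; the +1 is the (optional) smoothing
P = (N + 1).float()
P /= P.sum(dim=1, keepdim=True)

# sample new words by walking the bigram chain until '.' is produced again
g = torch.Generator().manual_seed(2147483647)
for _ in range(5):
    out, ix = [], 0
    while True:
        ix = torch.multinomial(P[ix], num_samples=1, replacement=True, generator=g).item()
        if ix == 0:
            break
        out.append(itos[ix])
    print(''.join(out))
```
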
- Model Quality Evaluation:
- Calculating the likelihood and the negative log likelihood of the training data under the bigram model.
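
A sketch of this evaluation step, reusing words, stoi, and the probability matrix P from the sketches above:

```python
# average negative log likelihood of the data under the bigram model
log_likelihood = 0.0
n = 0
for w in words:
    chs = ['.'] + list(w) + ['.']
    for ch1, ch2 in zip(chs, chs[1:]):
        prob = P[stoi[ch1], stoi[ch2]]
        log_likelihood += torch.log(prob)
        n += 1
nll = -log_likelihood / n
print(f'average nll: {nll.item():.4f}')   # lower is better
```
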
- Neural Network Approach - Bigrams:
- Building a simple neural network for bigram prediction.
- Performing a forward pass using random weights and one-hot encoding.
- Exponentiating log counts to obtain counts and converting to probabilities using softmax.
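
A sketch of the single-layer network's forward pass, reusing words and stoi from above; the seed is arbitrary:

```python
import torch.nn.functional as F

# training set of bigrams: xs holds the first character index, ys the one to predict
xs, ys = [], []
for w in words:
    chs = ['.'] + list(w) + ['.']
    for ch1, ch2 in zip(chs, chs[1:]):
        xs.append(stoi[ch1])
        ys.append(stoi[ch2])
xs, ys = torch.tensor(xs), torch.tensor(ys)

# a single linear layer with random weights: 27 inputs -> 27 outputs
g = torch.Generator().manual_seed(2147483647)
W = torch.randn((27, 27), generator=g, requires_grad=True)

# forward pass: one-hot encode, compute "log counts" (logits), softmax to probabilities
xenc = F.one_hot(xs, num_classes=27).float()
logits = xenc @ W          # interpreted as log counts
counts = logits.exp()      # analogous to the count matrix N
probs = counts / counts.sum(dim=1, keepdim=True)   # softmax
loss = -probs[torch.arange(len(ys)), ys].log().mean()
print(loss.item())
```
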
- Optimization:
- Implementing gradient descent to optimize the neural network.
- Incorporating regularization into the loss function.
- Sampling from the trained neural network to generate new words.
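
A sketch of the optimization loop and of sampling from the trained network, continuing from the forward-pass sketch above; the regularization strength, learning rate, and step count are illustrative:

```python
# gradient descent on the one-layer network, with a small L2-style regularization term
for step in range(200):
    xenc = F.one_hot(xs, num_classes=27).float()
    logits = xenc @ W
    counts = logits.exp()
    probs = counts / counts.sum(dim=1, keepdim=True)
    loss = -probs[torch.arange(len(ys)), ys].log().mean() + 0.01 * (W ** 2).mean()

    W.grad = None
    loss.backward()
    with torch.no_grad():
        W -= 50 * W.grad   # a large step size works here because the problem is tiny

# sampling from the trained network mirrors sampling from the count-based P matrix
g = torch.Generator().manual_seed(2147483647)
ix, out = 0, []
while True:
    p = (F.one_hot(torch.tensor([ix]), num_classes=27).float() @ W).exp()
    p = p / p.sum(dim=1, keepdim=True)
    ix = torch.multinomial(p, num_samples=1, replacement=True, generator=g).item()
    if ix == 0:
        break
    out.append(itos[ix])
print(''.join(out))
```
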