Notes
Why Create This Page
I was getting tired of taking notes that involve a lot of math equations using the
Lexical Rich Text Editor, so I am creating this page, where I can upload a
file and create an article purely from the
markup. I am also creating this page to practice writing in hopes of improving my knowledge of the markup language.
I plan on using this page to provide links to notes on things, mostly math-heavy textbooks and research papers, that I took using and to keep track of research papers or math-heavy textbooks that I plan to read in the future. I plan on using
Tex4ht, combined with some post-processing, to convert the uploaded documents to HTML. You can use the document below to test out this to HTML system.

LEDITS++: Limitless Image Editing Using Text-to-Image Models
This paper goes over image editing using Text-to-Image models.

TikTok Recommendation System Paper
Monolith: Real Time Recommendation System With Collisionless Embedding Table
I heard that TikTok published a paper on their recommendation engine sometime ago, and since I know some things about recommendation systems and plan ...

Gradient-Based Learning Applied to Document Recognition
Gradient-Based Learning Applied to Document Recognition Paper
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

Dropout: A Simple Way to Prevent Neural Networks from Overfitting
Dropout: A Simple Way to Prevent Neural Networks from Overfitting Paper
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

Neural Image Caption Generation with Visual Attention
Neural Image Caption Generation with Visual Attention Paper
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

YOLOv4: An Incremental Improvement
YOLOv4: An Incremental Improvement Paper
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

YOLO9000: Better, Faster, Stronger
YOLO9000: Better, Faster, Stronger Paper
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

You Only Look Once: Unified, Real-Time Object Detection
You Only Look Once: Unified, Real-Time Object Detection Paper
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

Fully Convolutional Networks for Semantic Segmentation
Fully Convolutional Networks for Semantic Segmentation Paper
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

Crowdsourcing in Computer Vision
Crowdsourcing in Computer Vision Paper
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

Squeeze-and-Excitation Network
Squeeze-and-Excitation Network Paper
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

Xception: Deep Learning with Depthwise Separable Convolutions
Xception: Deep Learning with Depthwise Separable Convolutions Paper
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

Deep Residual Learning for Image Recognition
Deep Residual Learning for Image Recognition Paper
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

Very Deep Convlutional Networks for Large-Scale Image Recognition
Very Deep Convlutional Networks for Large-Scale Image Recognition Paper
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

Going Deeper with Convolutions
Going Deeper with Convolutions Paper
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

ImageNet Classification with Deep Convolutional Neural Networks
ImageNet Classification with Deep Convolutional Neural Networks Paper
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

neocognitron: A Self-Organizing Neural Network Model for a Mechanism of Pattern Recognition Unaffected by Shift in Position
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

Receptive Fields and Function Architecture of Monkey Striate Cortex
Receptive Fields and Function Architecture of Monkey Striate Cortex Paper
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

Receptive Fields of Single Neurones in The Cat's Striate Cortex
Receptive Fields of Single Neurones in The Cat's Striate Cortex Paper
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

Single Unit Activity in Striate Cortex of Unrestrained Cats
Single Unit Activity in Striate Cortex of Unrestrained Cats Paper
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

Distributed Representations of Words and Phrases and their Compositionality
Distributed Representations of Words and Phrases and their Compositionality Paper
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning
Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning Paper
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

Improving Neural Networks by Preventing Co-Adaptation of Feature Detectors
Improving Neural Networks by Preventing Co-Adaptation of Feature Detectors Paper
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

Adam: A Method for Stochastic Optimization
Adam: A Method for Stochastic Optimization Paper
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

Adaptive Subgradient Methods for Online Learning and Stochastic Optimization
Adaptive Subgradient Methods for Online Learning and Stochastic Optimization Paper
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

Some methods of speeding up the convergence of iteration methods
Some methods of speeding up the convergence of iteration methods Paper
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift Paper
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

Self-Normalizing Neural Networks
Self-Normalizing Neural Networks Paper
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

Fast and Accurate Deep Learning by Exponential Linear Units (ELUs)
Fast and Accurate Deep Learning by Exponential Linear Units (ELUs) Paper
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

Empirical Evaluation of Rectified Activations in Convolutional Network
Empirical Evaluation of Rectified Activations in Convolutional Network Paper
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification
Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification Paper
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

Practical recommendations for gradient-based training of deep architectures
Practical recommendations for gradient-based training of deep architectures Paper
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

Using Evolutionary AutoML to Discover Neural Network Architectures
Using Evolutionary AutoML to Discover Neural Network Architectures Paper
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

Population Based Training of Neural Networks
Population Based Training of Neural Networks Paper
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

Wide & Deep Learning for Recommender Systems
Wide & Deep Learning for Recommender Systems Paper
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

Web-scale k-means clustering
Web-scale k-means clustering Paper
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

Least Square Quantization in PCM
Least Square Quantization in PCM Paper
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

Fast Kernel Classifiers with Online and Active Learning
Fast Kernel Classifiers with Online and Active Learning Paper
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

A Dual Coordinate Descent Method for Large-scale Linear SVM
A Dual Coordinate Descent Method for Large-scale Linear SVM Paper
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

Stacked Generalization
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting
A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting Paper
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

A Mathematical Theory of Communication
A Mathematical Theory of Communication Paper
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of ...

A Tutorial Introduction to the Minimum Description Length Principle
I am reading this paper because it was recommended as part of Ilya Sutskever's approx. 30 papers that he recommended to John Carmack to learn what rea...

Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
I am reading this paper because it was recommended as part of Ilya Sutskever's approx. 30 papers that he recommended to John Carmack to learn what rea...

Neural Turing Machines
I am reading this paper because it was recommended as part of Ilya Sutskever's approx. 30 papers that he recommended to John Carmack to learn what rea...

Quantifying the Rise and Fall of Complexity in Closed Systems: The Coffee Automaton
I am reading this paper because it was recommended as part of Ilya Sutskever's approx. 30 papers that he recommended to John Carmack to learn what rea...

Variational Lossy Autoencoder
I am reading this paper because it was recommended as part of Ilya Sutskever's approx. 30 papers that he recommended to John Carmack to learn what rea...

A Simple NN Module for Relational Reasoning
I am reading this paper because it was recommended as part of Ilya Sutskever's approx. 30 papers that he recommended to John Carmack to learn what rea...

Identity Mappings in Deep Residual Networks
I am reading this paper because it was recommended as part of Ilya Sutskever's approx. 30 papers that he recommended to John Carmack to learn what rea...

Neural Machine Translation by Jointly Learning to Align and Translate
I am reading this paper because it was recommended as part of Ilya Sutskever's approx. 30 papers that he recommended to John Carmack to learn what rea...

Neural Message Passing for Quantum Chemistry
I am reading this paper because it was recommended as part of Ilya Sutskever's approx. 30 papers that he recommended to John Carmack to learn what rea...

Multi-Scale Context Aggregation by Dilated Convolutions
I am reading this paper because it was recommended as part of Ilya Sutskever's approx. 30 papers that he recommended to John Carmack to learn what rea...

GPipe: Efficient Training of Giant Neural Networks using Pipeline Parallelism
I am reading this paper because it was recommended as part of Ilya Sutskever's approx. 30 papers that he recommended to John Carmack to learn what rea...

Keeping Neural Networks Simple by Minimizing the Description Length of the Weights
I am reading this paper because it was recommended as part of Ilya Sutskever's approx. 30 papers that he recommended to John Carmack to learn what rea...
Notes
January, 2025

A Logical Calculus of Ideas Immanent in Nervous Activity
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of those papers. These papers are mainly on machine learning and deep learning topics....

Using the Triangle Inequality to Accelerate k-Means
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of those papers. These papers are mainly on machine learning and deep learning topics....

Sequential Minimal Optimization: A fast Algorithm for Training Support Vector Machines
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of those papers. These papers are mainly on machine learning and deep learning topics....

Incremental and Decremental SVM Learning
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of those papers. These papers are mainly on machine learning and deep learning topics....

Nonlinear Dimensionality Reduction by Locally Linear Embedding
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of those papers. These papers are mainly on machine learning and deep learning topics....

MTEB: Massive Text Embedding Benchmark
I need to do a lot with text embeddings, so I am going to read this paper. This paper establishes a benchmark to test text embeddings.

ARCING THE EDGE
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of those papers. These papers are mainly on machine learning and deep learning topics....

Understanding the Difficulty of Training Deep Feedforward Neural Networks
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of those papers. These papers are mainly on machine learning and deep learning topics....

Kernel Principal Component Analysis
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of those papers. These papers are mainly on machine learning and deep learning topics....

Diffusion quality diffuses in the digital public square
I wanted to read this comment because it should affect the process by which comments are initially displayed on a post.

Extremely Randomized Trees
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of those papers. These papers are mainly on machine learning and deep learning topics....

Random Decision Forests
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of those papers. These papers are mainly on machine learning and deep learning topics....

The Random Subspace Method for Constructing Decision Forests
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of those papers. These papers are mainly on machine learning and deep learning topics....

Ensembles on Random Patches
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of those papers. These papers are mainly on machine learning and deep learning topics....

Pasting Small Votes for Classification in Large Databases and On-Line
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of those papers. These papers are mainly on machine learning and deep learning topics....

Bagging Predictors
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of those papers. These papers are mainly on machine learning and deep learning topics....

The Lack of A Priori Distinctions Between Learning Algorithms
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of those papers. These papers are mainly on machine learning and deep learning topics....

API design for machine learning software: experiences from the scikit-learn project
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of those papers. These papers are mainly on machine learning and deep learning topics....

Scaling to very very large corpora for natural language disambiguation
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of those papers. These papers are mainly on machine learning and deep learning topics....

The Unreasonable Effectiveness of Data
Looking for more research papers to read, I scanned my Hands-On Machine Learning notes for the many papers that were referenced there. This is one of those papers. These papers are mainly on machine learning and deep learning topics....

Pointer Networks
I am reading this paper because it was recommended as part of Ilya Sutskever's approx. 30 papers that he recommended to John Carmack to learn what really matters for machine learning / AI today. This paper introduces a new neural architecture to learn the conditional probability of an output sequence with elements that are discrete tokens corresponding to positions in an input sequence. ...

Order Matters: Sequence to Sequence for Sets
I am reading this paper because it was recommended as part of Ilya Sutskever's approx. 30 papers that he recommended to John Carmack to learn what really matters for machine learning / AI today. This paper shows that the order in which input/output data sequences are organized matters significantly when learning an underlying model....

Relational Recurrent Neural Networks
I am reading this paper because it was recommended as part of Ilya Sutskever's approx. 30 papers that he recommended to John Carmack to learn what really matters for machine learning / AI today. This paper confirms suspicions that standard memory architectures may stuggle at tasks that involve understanding the ways in which entities are connected and then improves upon the problem by using a new memory module - a Relational Memory Core....

Deep Residual Learning for Image Recognition
I am reading this paper because it was recommended as part of Ilya Sutskever's approx. 30 papers that he recommended to John Carmack to learn what really matters for machine learning / AI today. This paper "presents a residual training framework to ease the training of networks that are substantially deeper than those used previously"....

Attention Is All You Need
I am reading this paper because it was recommended as part of Ilya Sutskever's approx. 30 papers that he recommended to John Carmack to learn what really matters for machine learning / AI today. This paper introduces a new simple architecture, the Transformer, based solely on attention mechanisms, dispensing with recurrence and convolutions entirely....

Recurrent Neural Network Regularization
I am reading this paper because it was recommended as part of Ilya Sutskever's approx. 30 papers that he recommended to John Carmack to learn what really matters for machine learning / AI today. This paper shows how to apply the dropout regularization technique to LSTMs, and it shows that this application of dropout substantially reduces overfitting on a variety of tasks. ...

Understanding LSTM Networks
I am reading this paper because it was recommended as part of Ilya Sutskever's approx. 30 papers that he recommended to John Carmack to learn what really matters for machine learning / AI today. This blog post, by Christopher Olah, is about better understanding LSTM Networks....

ImageNet Classification with Deep Convolutional Neural Networks
I am reading this paper because it was recommended as part of Ilya Sutskever's approx. 30 papers that he recommended to John Carmack to learn what really matters for machine learning / AI today. This paper describes the state of the art deep CNN approach taken to the ImageNet task which achieved state of the art results....

The Unreasonable Effectiveness of Recurrent Neural Networks
I am reading this paper because it was recommended as part of Ilya Sutskever's approx. 30 papers that he recommended to John Carmack to learn what really matters for machine learning / AI today. This blog post is about Karpathy sharing the "magic" of Recurrent Neural Networks (RNNs)....

Scaling Laws for Neural Language Models
I am reading this paper because it was recommended as part of Ilya Sutskever's approx. 30 papers that he recommended to John Carmack to learn what really matters for machine learning / AI today. This paper studies empirical scaling laws for language model performance on the cross entropy loss. ...