learning
- [Being Human 3/n]: moving on from previous unmet goals... Jun 27, 2025
- [Being Human 2/n] Being scrappy shows we are Human in this Brave New World Jun 27, 2025
- [IA Series 7/n] Building a Self-Consistency LLM-Agent: From PEAS Analysis to Production Code Jun 26, 2025
- [IA Series 5/n] The Evolution from Logic to Probability to Deep Learning: A course correction to Transformers May 20, 2025
- [IA Series 4/n] A Big Question: Why Study Logic in a World of Probabilistic AI? May 19, 2025
- [IA Series 3/n] Intelligent Agents Term Sheet May 16, 2025
- Building an Intelligent Agent May 10, 2025
- [zero-RL] Summarising what LUFFY offers Apr 29, 2025
- [zero-RL] where is the exploration? Apr 29, 2025
- [zero-RL] LUFFY: Learning to reason Under oFF policY guidance Apr 28, 2025
- [zero-RL] what is it? Apr 28, 2025
- [zero-RL] When you SFT a smaller LM on the reasoning traces of a larger LM Apr 28, 2025
- Notes and links on SVMs (WIP) Apr 26, 2025
- [IA Series 2/n] Search Algorithms and Intelligent Agents Apr 24, 2025
- [IA Series 1/n] AI Search - Terms and Algorithms Apr 24, 2025
- Mar 22, 2025
- [NN Series 5/n] Regularisation: reducing the complexity of a model without compromising accuracy Mar 17, 2025
- [NN Series 4/n] Feature Normalisation Mar 6, 2025
- [NN Series 3/n] Calculating the error before quantisation: Gradient Descent Feb 25, 2025
- [NN Series 2/n] Circuits that can be trained to match patterns: The Adaline Feb 24, 2025
- #BeingHuman - look after your << self >>: love is all it needs. Feb 23, 2025
- [NN Series 1/n] From Neurons to Neural Networks: The Perceptron Feb 12, 2025
- [RL Series 2/n] From Animals to Agents: Linking Psychology, Behaviour, Mathematics, and Decision Making Feb 7, 2025
- [RL Series 1/n] Defining Artificial Intelligence and Reinforcement Learning Jan 31, 2025
- What is Off-Policy learning? Jan 31, 2025
- Dopamine as temporal difference errors !! 🤯 Jan 15, 2025