Notes

January 1, 2025
Synthetic Data in 2024 - Progress, Opportunities and Challenges
synthetic-data responsible-ai data-science notes
A review of key developments in the synthetic data landscape over the past few years, driven by advances in generative AI and falling costs, and a practitioner's perspective on the opportunities and challenges ahead.
Read article
December 16, 2024
Executing ComfyUI Workflows as Standalone Scripts
data-science generative-ai comfyui notes
A guide on how to execute ComfyUI workflows as standalone scripts.
Read article
October 19, 2024
Progressively Updating UI with FastAPI and Streamed Structured Responses
data-science llm generative-ai musings next-js notes
Walkthrough on streaming structured objects to create progressively updating interfaces with FastAPI and Next.js.
Read article
November 8, 2023
Generating Structured Output from LLMs
data-science llm generative-ai notes
A survey of the methodologies used to generate structured output from LLMs, from model fine-tuning to domain-specific languages and schema engineering.
Read article
June 18, 2023
SQLite WASM with Custom Extensions
sqlite wasm notes
A concise guide on building SQLite WASM on Ubuntu Linux with custom extensions. Run SQLite in the browser and enable new possibilities by providing an interface to other C libraries through custom extensions.
Read article
January 12, 2022
Building open source projects - Lessons learned (2021)
open-source musings notes
A review of the things I have learned from building in the open over the past year. Thoughts and reflections on what it takes to grow a project and the difficulty of translating open-source success into commercial success.
Read article
August 29, 2021
Creating a Rehype Syntax Highlighting Plugin
mdx javascript markdown notes
An exploration of markdown and HTML syntax trees. Documenting my experience creating rehype-prism-plus, a syntax highlighting plugin that creates pretty code blocks.
Read article
February 21, 2021
Growth Hacking Github - How to Get Github Stars
marketing musings notes
A good project is only one part of the puzzle. Getting stars is really all about marketing and promoting it. A guide on growth hacking a GitHub project.
Read article
August 1, 2020
Schelling's Segregation Model in Julia
julia learning-julia notes agent-based-models
Learn Julia by implementing Schelling's famous segregation model. You will see many similarities to Python (it's a dynamic language, so no types need to be specified) and pick up some nice syntactic properties of Julia.
Read article
May 10, 2020
Benchmark of popular graph/network packages v2
benchmarks networks notes python r julia
A revised benchmark of graph / network computation packages featuring an updated methodology and more comprehensive testing. Find out how NetworkX, igraph, graph-tool, NetworKit, SNAP and LightGraphs perform.
Read article
March 29, 2020
Efficient Large Graph Propagation Algorithm
gcp networks notes python crypto
How we engineered a large scale label propagation algorithm at Cylynx
Read article
January 22, 2020
Serverless Machine Learning with R on Cloud Run
notes r visualization gcp serverless
The serverless way - using Google Cloud Platform to deploy simple machine learning models via Cloud Run. A fun weekend project that analyses the twitter-verse
Read article
December 17, 2019
Speeding up R Plotly web apps - R x Javascript
notes javascript r visualisation Dashboard
Tips and tricks to speed up R and plotly based web apps
Read article
May 5, 2019
Benchmark of popular graph/network packages
benchmarks networks notes python r julia
Benchmark of 5 popular graph/network packages - NetworkX, igraph, graph-tool, NetworKit and SNAP
Read article
February 11, 2019
Binance hackathon - 2nd place solution
javascript react visualisation networks notes crypto
Technical overview of our 2nd place solution and my experience at the Binance hackathon
Read article
January 5, 2019
Cleaning openstreetmap intersections in python
python spatial visualisation notes
In this post, I explore the problem of simplifying route intersections and document some Python code that can be used to clean and visualize OpenStreetMap data as a network representation
Read article
October 14, 2018
Visualising Networks in ASOIAF - Part II
r notes visualisation graph-theory networks
Part II in the network exploration of the Game of Thrones series. In this post, we combine the plots together and use gganimate to visualise relationships across all 5 books
Read article
September 9, 2018
Visualising Networks in ASOIAF
r notes visualisation graph-theory networks
A network exploration on the links between characters in the Game of Thrones series with the help of igraph and tidygraph
Read article
August 9, 2018
Applications of DAGs in Causal Inference
r dags notes musings causal-inference
Chains, forks, colliders, paths and d-separation - how DAGs can contribute to better causal inference
Read article
February 26, 2018
Notes on Regression - Approximation of the Conditional Expectation Function
regression ols notes
Deriving the OLS formula as a means of approximating the conditional expectation function
Read article
December 25, 2017
Notes on Graphs and Spectral Properties
graph-theory notes
A reference cheatsheet on the adjacency matrix, incidence matrix, Laplacian matrix and the basics of algebraic graph theory
Read article
November 18, 2017
Choosing a Control Group in a RCT with Multiple Treatment Periods
R notes simulation metrics
How should we choose the control group in a situation where we have multiple treatments and time periods? A simple statistical simulation exercise
Read article
October 21, 2017
Notes on Regression - Singular Value Decomposition
regression ols notes
Applying the SVD to the regression framework
Read article
October 1, 2017
Comparing the Population and Group Level Regression
regression notes
To what extent do the coefficients obtained from a regression carried out at the group level correspond to the estimates at the individual level?
Read article
September 21, 2017
Notes on Regression - Maximum Likelihood
regression ols notes
Deriving the OLS estimator via the maximum likelihood approach
Read article
September 13, 2017
Using Leaflet in R - Tutorial
Singapore R spatial visualisation notes
A tutorial on using Leaflet in R for geospatial visualisation
Read article
August 31, 2017
Notes on Regression - Method of Moments
regression ols notes
Establishing the OLS formula via the method of moments approach
Read article
August 23, 2017
Notes on Regression - Projection
regression ols notes
Deriving the OLS estimator - projection method
Read article
August 16, 2017
Notes on Regression - OLS
regression ols notes
This post is the first in a series of my study notes on regression techniques. It covers regression as a solution to the least squares minimisation problem
Read article