Recently Updated Pages
BigQuery Analytics Hub
Analytics Hub is a mechanism for sharing datasets between BigQuery users. Google's official produ...
The Copying Task
The Copying Task is a benchmarking task in NLP that assesses recurrent models (and other sequenti...
PyLLMCore
PyLLMCore is a python library for working with a variety of LLM models and it supports both OpenA...
Embeddings and Llama.cpp
SQLite VSS - Lightweight Vector DB SQLite VSS is a SQLite extension that adds vector search on...
LangChain and Zephyr
Zephyr is pretty powerful and it will quite happily use tools if you prompt it correctly. Zephy...
Local LLMs
LLM Utility I'm a big fan of Simon Willison's llm package. It works nicely with llama-cpp. Inst...
Variance
Variance essentially refers to how spread out your data is relative to its mean. In the diagra...
Assessing Data Quality
One of the biggest difficulties with ML is dealing with messy data. This is a common and reoccurr...
Exploratory Data Analysis (EDA)
There are a number of powerful tools like Pandas Profiling and SweetViz that can make EDA fast an...
Model Registry
A model registry is a service that provides version-control-like behaviour for ML models. There a...
DVC
DVC or Data Version Control is an open source tool for managing data assets. It is very useful bu...
Galaxy S3 Tab
Galaxy Tab S3 is a 10 inch (9.7") tablet released in 2017. Installing TWRP 1. To install TWRP on ...
NeoVim
NeoVim is a new super configurable version of the VIM editor. I've been learning to configure and...
Hugo Static Site Generation
I use Hugo to maintain most of my websites. Extended Edition Hugo has an extended version which i...
Firefox on Ubuntu 22.04 Non Snap
In the latest Ubuntu they made Firefox a snap instead of just installing via deb. This walkthru t...
Online Reading and Feeds
RSS I use FreshRSS to manage my feeds for me and the associated Android client for on the go. On ...
Security
- The OWASP API Top 10 security measures may be a good place to start when trying to decide what ...
Intro to Microcosm
Warning: This page is very much a work in progress Microcosm is a tiny and lightweight micropub s...
Planning
Move configuration out into yaml file - the mishmash of environment variables is pretty gross Ad...
Being a CTO
Being a CTO is interesting and is probably different in every company. It is also a role that cha...