Skip to main content

Recently Updated Pages

BigQuery Analytics Hub

Google Cloud Platform

Analytics Hub is a mechanism for sharing datasets between BigQuery users. Google's official produ...

Updated 1 year ago by James

The Copying Task

AI Benchmarks and Exercises

The Copying Task is a benchmarking task in NLP that assesses recurrent models (and other sequenti...

Updated 1 year ago by James

PyLLMCore

Working with LLMs

PyLLMCore is a python library for working with a variety of LLM models and it supports both OpenA...

Updated 1 year ago by James

Embeddings and Llama.cpp

Working with LLMs

SQLite VSS - Lightweight Vector DB SQLite VSS is a SQLite extension that adds vector search on...

Updated 1 year ago by James

LangChain and Zephyr

Working with LLMs

Zephyr is pretty powerful and it will quite happily use tools if you prompt it correctly.  Zephy...

Updated 1 year ago by James

Local LLMs

Working with LLMs

LLM Utility I'm a big fan of Simon Willison's llm package. It works nicely with llama-cpp. Inst...

Updated 1 year ago by James

Variance

Data Quality and Preparation

Variance essentially refers to how spread out your data is relative to its mean. In the diagra...

Updated 1 year ago by James

Assessing Data Quality

Data Quality and Preparation

One of the biggest difficulties with ML is dealing with messy data. This is a common and reoccurr...

Updated 1 year ago by James

Exploratory Data Analysis (EDA)

Data Quality and Preparation

There are a number of powerful tools like Pandas Profiling and SweetViz that can make EDA fast an...

Updated 1 year ago by James

Model Registry

Data Engineering and MLOps

A model registry is a service that provides version-control-like behaviour for ML models. There a...

Updated 1 year ago by James

DVC

Data Engineering and MLOps

DVC or Data Version Control is an open source tool for managing data assets. It is very useful bu...

Updated 1 year ago by James

Galaxy S3 Tab

Devices and Tech

Galaxy Tab S3 is a 10 inch (9.7") tablet released in 2017. Installing TWRP 1. To install TWRP on ...

Updated 1 year ago by James

NeoVim

Devices and Tech

NeoVim is a new super configurable version of the VIM editor. I've been learning to configure and...

Updated 1 year ago by James

Hugo Static Site Generation

Devices and Tech

I use Hugo to maintain most of my websites. Extended Edition Hugo has an extended version which i...

Updated 1 year ago by James

Firefox on Ubuntu 22.04 Non Snap

Devices and Tech

In the latest Ubuntu they made Firefox a snap instead of just installing via deb. This walkthru t...

Updated 1 year ago by James

Online Reading and Feeds

Devices and Tech

RSS I use FreshRSS to manage my feeds for me and the associated Android client for on the go. On ...

Updated 1 year ago by James

Security

Software Misc

- The OWASP API Top 10 security measures may be a good place to start when trying to decide what ...

Updated 1 year ago by James

Intro to Microcosm

Microcosm

Warning: This page is very much a work in progress Microcosm is a tiny and lightweight micropub s...

Updated 1 year ago by James

Planning

Microcosm

Move configuration out into yaml file - the mishmash of environment variables is pretty gross Ad...

Updated 1 year ago by James

Being a CTO

Engineering Leadership and CTO

Being a CTO is interesting and is probably different in every company. It is also a role that cha...

Updated 1 year ago by James