Skip to main content

Recently Updated Pages

Core Scientific Concepts (CoreSC)

AI and ML

Core Scientific Concepts (CoreSC) is an annotation scheme used to delineate different parts of sc...

Updated 2 years ago by James

Large Scale Multi-Label Learning

AI and ML Tasks

The Keras website has a tutorial on how to do multi-label learning with a large number of labels:...

Updated 2 years ago by James

SpaCy CoRef

AI and ML

Spacy Coref is an experimental coreference resolution model in spacy The project repository is he...

Updated 2 years ago by James

Federated Learning

AI and ML Tasks

Flower is a federated learning framework with compatibility with Torch, Tensorflow and others

Updated 2 years ago by James

SpaCy GPU

AI and ML

Set Up Environment It's relatively easy to use SpaCy with a GPU these days. First set up your con...

Updated 2 years ago by James

ML Best Practices

AI and ML

Machine learning is a complex and multifaceted activity that requires the combination of a number...

Updated 2 years ago by James

Explainability

AI and ML Explainability and Model Analysis

Explainability is a big challenge in machine learning. I wrote a blog post about the ELI5 library...

Updated 2 years ago by James

Model Confidence Scores

AI and ML Explainability and Model Analysis

Many ML classification models can provide a confidence score which tells the user how confident t...

Updated 2 years ago by James

Learning with Limited Data

AI and ML Machine Learning with Limited Data

Good machine learning is heavily dependent on good data. A few more good data-points is likely to...

Updated 2 years ago by James

Relationship Extraction

AI and ML Tasks

Relationship Extraction (RE) is a task that is related to Coreference Resolution but with a focus...

Updated 2 years ago by James

Pattern Exploitative Training

AI and ML Machine Learning with Limited Data

PET or Pattern Exploitative Training @article{schick2020exploiting, title={Exploiting Cloze Qu...

Updated 2 years ago by James

Design Frameworks

Software Misc

Design frameworks provide out of the box styling and components for use in websites. Many framewo...

Updated 2 years ago by James

Mental Health Primer

Mental Health

May I have the serenity to accept the things I cannot change,the courage to change the things I...

Updated 2 years ago by James

Exporting Issues from Linear

JIRA Migrating from Linear to JIRA

The first step we need to take is to export our issues from linear. The easiest way to do this is...

Updated 2 years ago by James

Deploying Django Apps

Python Django

Packaging a Django App in Docker I wrote a blog about packaging django apps up for shipping in d...

Updated 2 years ago by James

Django and PostgreSQL

Python Django

When working with Django and PostgreSQL it is typically best to use the psycopg[binary] package: ...

Updated 2 years ago by James

DBT

Data Engineering and MLOps

DBT is a data transformation tool with a SaaS platform and an open-core command line tool. The to...

Updated 2 years ago by James

Data Wrangling

Data Engineering and MLOps

DuckDB DuckDB is a lightweight OLAP type database system written in C++ and designed to be used f...

Updated 2 years ago by James

Data Loading with Airbyte

Data Engineering and MLOps

Airbyte is a FOSS tool for mass data import and export when working with common flavours of SQL a...

Updated 2 years ago by James

Keyword Extraction

AI and ML Tasks

Graph-Based Keyword Extraction Graph-based approaches like TextRank allows the extraction of keyw...

Updated 2 years ago by James