Skip to main content

AI and ML

Cover photo by Conny Schneider https://unsplash.com/@choys_

ML Introduction

Welcome to my Machine Learning and AI notebook. 🧱 AI and ML Fundamentals AI Best Practices - ...

ML Best Practices

Machine learning is a complex and multifaceted activity that requires the combination of a number...

Model Quantization

Deploying models that are performant (obviously statistically but in this context I primarily mea...

SpaCy GPU

Set Up Environment It's relatively easy to use SpaCy with a GPU these days. First set up your con...

SpaCy CoRef

Spacy Coref is an experimental coreference resolution model in spacy The project repository is he...

Core Scientific Concepts (CoreSC)

Core Scientific Concepts (CoreSC) is an annotation scheme used to delineate different parts of sc...

Stratified Sampling in Pandas

Use groupby on the label column to create sub-frames for each label and then use the sample() fu...

From Crowd Ratings to Predictive Models of Newsworthiness to Support Science Journalism

Paper Link Authors: Sachita Nishal Nicholas Diakopoulos Notes Their work comes at the pr...

Stable Diffusion

Web UI API Run the web UI with: ./webui.sh --api   Example Python Script import webuiapi ...

AI Causing Chaos

So much AI turns out to be low-waged people in a call center in the Global South pretending to ...

Tasks

Tasks within ML and NLP

Machine Learning with Limited Data

DeBERTa Zero Shot

The DeBERTa zero-shot model is an NLP zero-shot classifier trained by Moritz Laurer and made publ...

Explainability and Model Analysis

Argilla

Argilla is a data annotation tool with an API that allows you to use external models to automatic...