Recently Updated Pages
Logseq HTTP API
LogSeq provides a HTTP API for developing plugins. The documentation is not particularly intuitiv...
Python and Gitea
Tips and tricks for Python and Gitea and CI stuff. Use Drone CI for Gitea I use Drone + Gitea for...
Music Production Home
I am an amateur musician and I like to mess around in GarageBand and make music of my own.
AKAI MPK mini plus
The AKAI MPK mini plus is a small MIDI controller with a three octave keyboard and drum pads.
Dell XPS 13 9305
I had one as my daily driver at work from late 2020 to late 2023. I was so pleased with it I boug...
Availability
As tempting as it is, there is no need to be always available to others via instant messenger. Al...
Argilla
Argilla is a data annotation tool with an API that allows you to use external models to automatic...
Model Quantization
Deploying models that are performant (obviously statistically but in this context I primarily mea...
AI Causing Chaos
So much AI turns out to be low-waged people in a call center in the Global South pretending to ...
DeBERTa Zero Shot
The DeBERTa zero-shot model is an NLP zero-shot classifier trained by Moritz Laurer and made publ...
Stable Diffusion
Web UI API Run the web UI with: ./webui.sh --api Example Python Script import webuiapi ...
From Crowd Ratings to Predictive Models of Newsworthiness to Support Science Journalism
Paper Link Authors: Sachita Nishal Nicholas Diakopoulos Notes Their work comes at the pr...
Stratified Sampling in Pandas
Use groupby on the label column to create sub-frames for each label and then use the sample() fu...
Core Scientific Concepts (CoreSC)
Core Scientific Concepts (CoreSC) is an annotation scheme used to delineate different parts of sc...
Large Scale Multi-Label Learning
The Keras website has a tutorial on how to do multi-label learning with a large number of labels:...
SpaCy CoRef
Spacy Coref is an experimental coreference resolution model in spacy The project repository is he...
Federated Learning
Flower is a federated learning framework with compatibility with Torch, Tensorflow and others
SpaCy GPU
Set Up Environment It's relatively easy to use SpaCy with a GPU these days. First set up your con...
ML Best Practices
Machine learning is a complex and multifaceted activity that requires the combination of a number...
Explainability
Explainability is a big challenge in machine learning. I wrote a blog post about the ELI5 library...