koaning.io

Blog of a data person.

Blog of a data person.

2024-11-01 Mousetrap.js - TIL
2024-10-22 Minimal Spanning Trees - BLOG
2024-10-20 Best practices stink - BLOG
2024-10-16 Stackoverflow: artifical artifical intelligence - TIL
2024-10-15 Seeing Faces - TIL
2024-10-15 Rock, Paper, Swirlies - BLOG
2024-10-11 Label propagation - BLOG
2024-09-30 Bacteria from the 7th Guest - BLOG
2024-09-28 Rock Paper Scissors Lizard Spock - BLOG
2024-09-27 Lanchesters' Battle Simulator - BLOG
2024-09-25 Pyperclip - TIL
2024-09-07 MatplotPong - TIL
2024-09-06 Instructor classification with fields - SCRIPT
2024-09-05 ZeroShot NER with GliNER - SCRIPT
2024-09-01 Rename NER - SCRIPT
2024-08-30 Embedding some texts - SCRIPT
2024-08-30 spaCy phrases/noun chunks - SCRIPT
2024-08-30 spaCy disable - SCRIPT
2024-08-29 Today I Scripted - BLOG
2024-08-01 Video game sales in the 80s - TIL
2024-08-01 The making of Diablo - TIL
2024-06-08 SciNCL - TIL
2024-06-08 Tetris is NP-Hard - TIL
2024-06-08 Searching for a better search box - BLOG
2024-05-27 Documint - TIL
2024-05-20 Keyboards, keyboards, keyboards - BLOG
2024-04-20 Garage Points - TIL
2024-04-20 Interpreting Eomji - TIL
2024-03-27 The roll, yaw and pitch of strawberries. - TIL
2023-11-22 Detecting Chessboards - TIL
2023-10-05 Machine UnLearning for Harry Potter - TIL
2023-09-29 Invasive species - TIL
2023-09-22 Citrus Fruits - TIL
2023-09-22 One Dimensional Word2Vec - TIL
2023-09-08 Classification via Segmented Attention - TIL
2023-09-08 Doppelganger Buildings - TIL
2023-07-26 text2fabric - TIL
2023-07-18 Plant datasets - TIL
2023-07-17 The worst kind of duplicate - TIL
2023-07-17 Sleep vs. Code - TIL
2023-06-02 Memes and other strange images - TIL
2023-05-26 Large Disagreement Modelling - BLOG
2023-04-21 Rubik's TSNE - TIL
2023-04-21 Human Label Variation Datasets - TIL
2023-04-19 Wearables as a Multi-Model dataset - TIL
2023-03-29 Low Light Computer Vision - TIL
2023-03-13 Colorizing Mobile Websites - TIL
2023-03-12 Automating Esports Commentary - TIL
2023-03-11 Finding Text in Comic Books - TIL
2023-02-23 Spreadsheet Risk Management - TIL
2023-01-16 The Corrupted Blood Incident - TIL
2023-01-14 Orcs of the Office - BLOG
2022-12-20 Agree not to Disagree - BLOG
2022-12-07 Open Sanctions - TIL
2022-12-04 Angry AI Birds - TIL
2022-12-04 Typo/Spelling Error Dataset - TIL
2022-11-20 Playtesting Candycrush - TIL
2022-11-18 Bot Bowl - TIL
2022-11-02 Missing Pedestrians - TIL
2022-11-01 Ascent - TIL
2022-10-27 Game Time Distribution - TIL
2022-10-26 Minecraft Diffusion - TIL
2022-10-22 Only 7 Percent - TIL
2022-10-21 Zelda Street View - TIL
2022-10-08 Data Duplications - TIL
2022-10-06 Annotation Datasets - TIL
2022-10-05 Punderstanding - TIL
2022-09-09 DALC - TIL
2022-07-28 Being a Research Advocate. - BLOG
2022-07-21 Generating Receipts - TIL
2022-07-13 Annotators vs. Tasks - TIL
2022-05-17 Won't Predict via Disagreement - TIL
2022-05-13 Interactive Confusion Matrices - TIL
2022-05-02 Active Churning - TIL
2022-04-23 Active Street Signs - TIL
2022-04-22 Perfect Fit - TIL
2022-04-21 Active, but Visual, Learning - TIL
2022-01-16 The Story Theory - TIL
2022-01-08 Enjoy the Silence - BLOG
2021-12-20 Vulnerable Contributions at Scale - TIL
2021-12-05 VADER - TIL
2021-12-03 Linkrot - TIL
2021-10-29 Learning to Place - TIL
2021-10-21 Beyond Broken - BLOG
2021-10-13 Optimal Seeds - TIL
2021-10-12 1.4 Million Jupyter Notebooks - TIL
2021-09-27 Sentiment and Bias - TIL
2021-09-26 Gorilla Hypotheses - TIL
2021-09-13 Scots Wikipedia - TIL
2021-09-02 Bad Labels - BLOG
2021-09-01 Analytics Providers - TIL
2021-08-27 poke2vec - TIL
2021-08-10 Pandas Format - TIL
2021-08-06 Stopwords - TIL
2021-07-29 Dixit Data - TIL
2021-07-22 Label Errors - TIL
2021-07-17 DnD Data - TIL
2021-07-17 Shaded Screenshots - TIL
2021-07-16 Copilot & Pytest - TIL
2021-07-15 metatags.io - TIL
2021-07-08 Copilot & Submodules - TIL
2021-06-25 Github Actions as a Number - TIL
2021-06-23 Plenty of Bad Labels - TIL
2021-06-18 Recursive HTML - TIL
2021-06-13 Urban Dictionary Embeddings - TIL
2021-06-05 Tesla vs. Stoplights - TIL
2021-06-03 Kolektor - TIL
2021-06-01 Flight Simulatoops - TIL
2021-05-05 Naive Bias[tm] and Fairness Tooling - BLOG
2021-03-03 A Loop to Stop Writing. - BLOG
2020-09-11 Oops and Optimality - BLOG
2020-08-24 Uncommon Contributions - BLOG
2020-06-28 Mean Squared Terror - BLOG
2020-04-25 Sharing is Caring - BLOG
2020-02-15 Introduction to Inference - BLOG
2020-01-16 Theoretical Dependence - BLOG
2019-12-07 Roman Reasoning - BLOG
2019-11-25 What Overfitting Looks Like - BLOG
2019-11-16 Parallel Grid - BLOG
2019-10-16 Goodhart, Bad Metric - BLOG
2019-05-22 High on Probability, Low on Confidence - BLOG
2019-05-14 The Future of Data Science is Past - BLOG
2018-08-01 Gaussian Auto Embeddings - BLOG
2017-11-01 Feed Forward Posteriors - BLOG
2017-08-29 Vary Very Optimally - BLOG
2017-01-14 Switching to Sampling in Order to Switch - BLOG
2016-11-19 Bayesian/Streaming Algorithms - BLOG
2016-10-26 Hello DeepQ - BLOG
2016-10-20 Avoiding, and Preventing, Joins - BLOG
2014-12-18 Variable Selection in Machine Learning - BLOG
2013-10-14 Digital Nomad - BLOG
2012-09-23 Vanity Metrics - BLOG