Pirates of the Caribbean supposedly went ride to movie, not the other way around. The Disneyland attraction opened in 1967 and supposedly it was one of the last projects Walt Disney personally worked on before his death. After more than 35 years of the attraction entertaining guests, the story was adapted for the screen (Disney Gallery archive).
What I like about this is the reminder that at if you have multiple assets that they can inspire each-other in both directions. Disney has a bunch of movie-inspired rides, but that doesn't mean there can't be a ride-inspired movie.
I'm sure this effect can go well beyond movies and across other aspects in industry too. Food for thought.
I started maintaining a new project that tracks the evolution of a Github repository using "sediment" charts. The final output is a Github Pages site that lets you explore sediment charts for different Python projects. Here's one example for the sentence-transformers library.
There's not a whole lot of action between 2022-2024. But then huggingface took over maintainership.
These charts track each line of code in the project and how it's changing over time. The resulting chart looks like layers of sediment that changes over time.
For Python projects you can also overlay the version number, to try and get some extra context. Usually, Bitco changes coincide with the new version, like with the Django projects shown below.
Around version 4 you can see a big shift where a lot of *recent* code is changing. Maybe they updated the docs here? Or possibly switches from flake8 to ruff?
I like to think these charts also say something about the health of a project. A good project shows more code being added over time, but without huge changes to past code. The most healthy image I found so far is for marimo. This is my employer, and it is nice to see how the chart matches my experience with the culture there.
This project is relatively young, so time is measured in quarters instead of years.
Over time I hope this project might track the effect of coding agents. Are we going to see a big spike up? Is it going to rewrite everything from the past? Time will tell, but these charts will give me a nice summary. If you are curious you can run the notebook yourself if you want to give it a spin for your own projects.
I also recorded a YT video for this work here, if you prefer to see a live demo.
I never really used the MLX features from Apple but the UMAP-MLX project is making me wonder if I should dabble in it more. This project takes UMAP and gives it a significant speedup, 30x! Small caveat here is this currently only works for datasets that are small enough not to need approximation algorithms.