andersource

Spraying Digital Graffiti

2024-05-30T12:00:00+00:00

Background

My partner has an upcoming show she’s promoting. She has a couple of nice pictures with colored walls, and I thought it could be cool to spray “digital graffiti” on the walls with info about the show. My options were:

Plaster semi-transparent text on the wall, which would look cheap and lousy
Fiddle with some gen AI, but where’s the sport in that
Spend half a day to figure out how to do that computationally

Since we’re here, we all know which option I chose!

Let’s get rolling

Say we have this nice picture of a textured wall (Image by kues1 on Freepik):

First, using image color replacement from a while back, we can paint the wall pink, which already looks neat:

Now, suppose we also have a binary mask depicting the graffiti we’d like to spray:

We can use the mask to mix between the two wall images (original and painted):

That looks pretty nice, but the mask boundary is too smooth and detached from the texture. Real paint would behave slightly differently based on the bumps in the cement. How do we modify the boundary?

Pathfinding to the rescue

We want to modify the mask’s contour, such that it follows the original contour pretty closely, but also tries to avoid crossing “high-energy” areas of the image. We can use edge detection to define these high-energy areas.

Defining the path

To define the path, we sample the contour uniformly, but then wiggle the indices a bit to start from low-energy areas of the image.

Next we stitch the new contour as a series of paths using skimage’s graph.route_through_array, which finds the minimum-cost-path through a cost landscape. To define the cost landscape, we compose the edge map with the distsance transform of the original contour. We can vary the weighing between the distance transform and the edge map to balance how strictly we want the new contour to adhere to the original one.

Here’s an example cost landscape:

Here’s the minimal cost path in this case:

Finally, after we obtain the new contour, we can use it to define a new mask and mix the images:

The effect is subtle but, in my opinion, makes the result a lot more realistic.

Gimme the code

Image color replacement is here
Rest of the process described is here. This code is specifically tailored to what I wanted, but it shouldn’t be hard to make it more general

BFS zero-to-hero, part 5: the 15-puzzle

2024-04-15T21:14:00+00:00

Part 1 | Part 2 | Parts 3 & 4

Challenge 5: the 15-puzzle

The 15-puzzle is a famous sliding puzzle. Your aim is to slide the numbered squares around, using a single empty square, to finally achieve an orderly board.

The puzzle is amenable to BFS, although the search space gets big quickly so bigger boards can require minutes (or more!) to solve.

I think an elegant way to represent a solution is to trace the imaginary path of the empty square, which can have repeat positions. However, in this puzzle’s case, the state represents the entire board’s configuration.

Give it a shot

Clone the repo and crack it. The last test case might take several seconds to compute (but shouldn’t take too long.)

BFS zero-to-hero, parts 3 & 4

2024-04-12T07:30:00+00:00

Part 1 | Part 2

It’s been a while! The crazy war situation in Israel knocked me off track, but here we are.

Challenge 3: water jugs

In this challenge you need to write a general solution to the water pouring problem: given the maximum capacity of two water jugs and a goal amount, concoct a plan to fill / empty / pour the jugs to achieve the desired amount (assuming a large source of water is available). The catch: there’s no way to measure water amounts in the jugs other than completely empty / full. Although this problem can be solved with more specialized math, it’s also efficiently solvable using BFS.

Challenge 4: escape from the fort

Olivia, Amelia and Lucas have been imprisoned in a fort by a dragon. They have access to a pulley with large baskets at the ends that could allow them to escape. Alas, the pulley has been magicked by the dragon to be lowerable only when the weight difference between the upper end and the lower end is exactly 25 kg.

Amelia weighs 50 kg, Olivia weighs 75 kg, and Lucas weighs 125 kg. Aditionally, they’ve found a 25 kg weight to aid them.

Your challenge is to implement a search that finds the quickest way to escape the fort.

Try it out

Clone the repo and hack away! Pay attention to all the implementation instructions.

This post from a while back might be helpful too. Enjoy :)

Python Turtle Bingo

2024-04-07T20:30:00+00:00

Write turtle code to recreate simple pictures

BFS zero-to-hero, part 2: snake

2023-10-15T18:50:00+00:00

This is part 2 of the BFS challenge series, in which you’ll need to apply BFS to progressively more abstract problems.

Challenge 2: playing snake

In this challenge you’re tasked with assisting a self-playing snake game in planning the snake’s route.

A familiar problem with a twist

This seems like a simple chaining of the previous part’s solution - whenever the food is eaten, plan a route from the snake’s head to the new food position with the snake and game boundaries as walls. However, this case is slightly different - as the snake moves, some walls become empty cells, and vice versa. Can you think of an appropriate adaptation to the algorithm? Solution (spoiler alert):

Fortunately we don't have to account for the newly-created walls, as there's no reason to return to a previously-visited cell. This is fortunate because otherwise formulating the solution with BFS would have been much more difficult or even infeasible.

Regarding the vacated walls, an elegant way to utilize them is to include in the search space state the time that elapsed since planning the route. This way, for each state popped from the BFS queue, we can use the elapsed-time value to "trim" the snake's tail and determine exactly which snake cells constitute as walls and which don't.

Have a go!

To try your hand at this challenge, clone the same repo from the previous part (if you haven’t yet), install requirements.txt, and open 2-snake/main.py. The code has implementation instructions for the pathfinding function.

Note the direction representation scheme and the solution structure (list of direction indices).

Good luck, and enjoy!

#StandWithUs

BFS zero-to-hero, part 1: intro & maze

2023-09-30T13:00:00+00:00

Intro

In the last year or so I’ve been working with a program providing tech-ed to Israeli youth in disadvantaged areas. We’re doing a lot of fun learning activities and challenges, and there was one idea that had me particularly excited. Unfortunately it didn’t make it into the curriculum, but I was so hyped about it I decided to do it anyway and post it here. The idea is to create a series of BFS (breadth-first search) application challenges, where:

The challenges get progressively more abstract, allowing the students to both gain a deep understanding of how the algorithm works, and learn how to identify problems where a seemingly unrelated algorithm can be applied.
Solutions are animated using pygame, allowing students to visually see the result of their implementation and experience the satisfaction of solving the problem.

The abstraction progression happens by starting with problems where the state space strongly corresponds to a physical space (like a 2D maze) and gradually moving to problems where the state space is more abstract.

This seems like a fun thing to make, so here’s my go at it!

Challenge 1: solving a maze

The first challenge is straightforward - solving a maze, i.e. getting from a starting point to an end point on a 2D cell grid, moving only between adjacent cells, where some cells are blocked (“walls”). The graph nodes are all the (non-wall) maze cells, and edges exist between any two adjacent cells.

Getting started

To try your hand at this challenge, clone this repo, install requirements.txt, and open 1-maze/main.py. The code has implementation instructions for the pathfinding function.

Good luck, and enjoy!

Procedural boiderflies

2022-12-31T21:45:00+00:00

Boiderflies, hur hur. Also, Happy New Year!

The exhaustive TODO List

2022-11-25T09:00:00+00:00

I hate productivity porn. Whenever a “use this one trick to stop procrastinating” post does the rounds, I itch to respond with “maybe you shouldn’t want to stop procrastinating?” Fortunately the banner is held by many more qualified than me to explain why issues with procrastination are less about pushing yourself harder and more about emotional regulation and figuring out what you want to prioritize in life.

And so it is with surprise, and no small sense of hypocrisy, that I find myself wanting to share my own “one trick” to overcoming procrastination. But I have to concede my purist approach - reality is nuanced, and life is such that sometimes I find myself in an emotional rut with no immediate way out. There are still bills to pay and things that need to get done. In such cases I’ve found a particular approach works quite well - the exhaustive TODO list.

The idea is quite simple: split the task to tiny action items, each taking no more than 5-10 minutes to complete (including the context switch from whatever I was doing before). I use a simple text-based TODO list. This list then serves as the absolute guide for what to do when I want to make some progress; I update the list frequently to reflect changes in requirements or the way I see the task. If creating such an exhaustive, detailed list is an intimidating task on its own, I’ll create an outline TODO list with action items to expand each topic. If I don’t have all the info to make a complete TODO list, I’ll make action items to find out the relevant info and update the list accordingly.

The reason this works ties back to the emotional regulation aspect of procrastination: in the first place, the reason for procrastinating on something is not having enough emotional energy to commit to working on it. But procrastination has its own emotional cost, fostering feelings of guilt and draining mental energy. Having the list offers a third alternative: I don’t have to commit to working on the task for hours, but I also don’t have to guilty-scroll social media until I’m completely drained - I can simply tick off just one action item and be done with it. This means I’m always at most 10 minutes away from having done my part for the day. And sometimes the one thing I do gets me in the mood, and I cross off a whole bunch of items. There’s a small virtuous cycle there - getting things done, even if slowly, alleviates some of the guilt I associate with the task, which means working on it is less of an emotional drain.

Of course, this isn’t a magic solution to procrastination, and unsustainable for too many tasks or for too long. Again, if you’re wondering how to stop procrastinating, I’d recommend asking yourself if you’re missing something and there’s a deeper cause for either lack of emotional energy to do things you want, or unfulfilled (maybe unacknowledged) desires manifesting as the desire to be more productive (our society’s solution to all maladies /s).

And now, if you’ll excuse me, there’s this thing I’ve been procrastinating on…

Interactive data exploration

2022-06-05T16:30:00+00:00

One of the first priorities when approaching a new data task is getting to know the data, and visualizations are an integral part of the process. For me, interactive visualizations (with libraries such as plotly, bokeh or d3.js) are especially powerful in bringing the data “closer” and making it almost physically tangible.

In a few distinct cases, the standard visualizations weren’t enough for me to feel I properly grok the data, and I wanted something more. In those cases I ended up creating custom interactive visualizations to explore the data. A key aspect of those visualizations was that they contained all the samples in some condensed form, a way to interact with the samples, and additional visualizations that accompany the interaction.

In this post I’ll walk through a demo of such a utility, to explore data of taxi rides in NYC.

If you’re on desktop, you can go ahead and try the demo here, though I’d recommend skimming the post to understand what’s going on.

NYC taxi dataset

In the demo you can explore a small subset (a little less than 10K) of the New York City Taxi Ride Dataset from 2016, downloaded from Kaggle. An extensive exploratory data analysis of the dataset (with the goal of predicting the ride duration, as per the Kaggle competition), by Kaggle user Heads or Tails, can be found here.

The features I focused on in the demo are:

Pickup location (latitude and longitude)
Dropoff location
Pickup time (day, hour)
Ride duration

Here are a few samples from the dataset:

pickup_day	pickup_time	duration_hours	pickup_lon	pickup_lat	dropoff_lon	dropoff_lat
3	7.68	0.31	-73.96...	40.78...	-73.98...	40.76...
4	9.83	0.37	-73.95...	40.78...	-73.98...	40.74...
0	22.02	0.11	-73.99...	40.72...	-73.99...	40.73...
1	15.82	0.02	-73.91...	40.77...	-73.91...	40.77...
0	13.97	0.13	-73.98...	40.76...	-74.00...	40.76...

Dimensionality reduction

My primary approach for including all samples in the visualization is using some dimensionality reduction technique.

For the demo I used UMAP, which gave the following 2D embedding of the rides:

Interactive exploration

The interactive exploration utility is composed of two main areas: a visualization of the embedded samples, and another section with visualizations of the interactions.

The sample area supports zoom and pan.

The “side” visualizations show the distributions of features for the full dataset as well as for highlighted samples:

Map of all pickup-dropoff pairs
Count plot for days
Histograms for time and duration
Compass-like arrows for showing the dominant ride direction

Sample and selection inspection

Two related modes involve highlighting a set of samples, and visualizing the feature distributions of the selection compared to the full dataset.

One mode (which I coined “inspection”, the one with the magnifying glass icon) provides highlighting by hovering.

The other mode (“selection”, brush icon) provides highlighting by selecting samples with a brush (by pressing the ctrl/cmd keys).

Using these we can quickly see that UMAP created a big blob for each day of the week, with some smaller blobs for specific types of rides.

The day-blobs are organized such that across their length they correspond to the pickup time, and the perpendicular direction roughly corresponds to pickup / dropoff location in Manhattan.

Each day-blob has a slightly separated portion for late-travellers from the day before (or very early?), with the size of the portion increasing as we get closer to the weekend.

Additionally, there are some smaller blobs for airport rides (JFK and LGA), some of which are also organized by time of day; these blobs seem to be split by:

to / from for JFK (as indicated by the direction arrows)
day of week for LGA
- Interestingly, the weekend rides from LGA have been annexed to the rest of Sunday’s rides
- Also interesting to note that rides to LGA have been mixed with the rest of the rides (unlike rides to JFK, which have a blob of their own)

The peculiarities can indicate interesting patterns in the data, but they can also be a result of the way the chosen dimensionality reduction technique works (more on that soon).

Projection tool

The projection tool (enabled only when samples are selected with the brush) allows us to specify an axis and observe how the selected samples project onto the axis, by coloring them and showing a scatterplot of pickup time and duration by projection.

This allows us to inspect the way blobs are organized, more easily than the hover inspection tool. For example, in the animation above we select the Saturday blob, first projecting it along its long axis, which we see corresponds to the pickup time; then we project it along the perpendicular axis, and see that it roughly corresponds to the ride location.

Discussion and variations

It is evident that the exploration hinges on the dimensionality reduction; a random projection, for example, would be no better (and arguably worse) than looking at random subsets of the data. Thus it’s important to choose a proper dimensionality reduction approach, and maybe even provide interactivity of the dimensionality reduction itself.

Some approaches to dimensionality reduction:

Playing with different DR techniques (e.g. see scikit-learn’s page on manifold learning or PyMDE)
Giving different weights to different features (up to completely removing features) before running through a DR technique
- This can be done interactively (though might require lots of pre-computation or fast realtime-ish DR)
If sample similarity is easy to obtain, can use multidimensional scaling or matrix factorization
If there’s a DL model involved, can use DR on last layer representations

The main sample area doesn’t have to be a 2D embedding of the samples either. In one project with compositional data we used DR to 1 dimension (PCA), and displayed the samples as a stacked percentage chart.

Another bunch of interactive tools could allow the user to filter or highlight samples according to some criteria. This can enable a sort of ping-pong game of generating hypotheses by highlighting samples and validating them by applying criteria and inspecting the resulting patterns.

Empowering other roles in the org

Apart from using such utilities myself to play with the data, these tools were of interest to other people in the team:

When the data was navigation flow through a mobile app, the product manager used the utility to gain a better understanding of user experience and behavior
When the data was user interaction with content items, content moderators used the utility to understand the items as experienced by users, and uncover specific issues with certain items that led to unexpected behavior

Of course, hacking something for your own usage is very different from developing a tool used by other people (even if it’s for internal use only), so there’s obviously an effort trade-off here.

Performance considerations

Naturally, when visualizing large datasets, performance can be an issue, even a blocker.

Some tricks that can be used for performance:

Dividing the data into pre-filtered subsets
Not re-calculating the full subset stats but updating them based on added / removed samples
Using an appropriate data structure (e.g. k-d tree et al.) for detecting selected samples
Moving heavy operations server-side
Utilizing GPU with WebGL or, hopefully soon, WebGPU - e.g. see this great post

It’s important to choose the right sort of interactivity for the tools. For example, if calculating subset distributions took a long time, doing that for every mouse move would have been a very bad idea, and a better choice could have been box or lasso selection.

Code & Disclaimers

Code for the demo can be found here.

I’m not a frontend dev, and hacked this demo over a few weekends, so some disclaimers:

The code could certainly use a refactor, there’s a lot of global state management, code duplication etc.
Might not look good or work smoothly on different browsers / screens
There are probably a few bugs lurking around
The design could use some refinement

Overall though I’m pretty happy with how it turned out.

Call for interesting data

I’m curious to learn how this idea can be applied across diverse domains. If you have data you’re struggling to grok, and think such a utility could provide value, ping me (hi@andersource.dev) - I might like to give it a shot!

Lasers in Space

2022-04-21T08:20:00+00:00

Remote-friendly physical cooperation game developed in 24 hours for a hackathon