From Agents to Coordination: Reframing Distributed AI as a Systems Problem
What feels new about agentic AI is the tooling. What isn’t new is the problem.
Short posts about building and operating software. Mostly cloud systems, data pipelines, and geospatial work.
Written for future me, shared publicly. I hope something here saves you time.
Exactly-once execution is a myth. The only workable approach is at-least-once delivery with idempotent effects.
Most agent systems don’t fail because models are weak; they fail because coordination is underspecified, and the failures are silent.
Discovery isn't a one-time onboarding step; it's a recurring tax that compounds across agent-driven tasks.
OIDC-backed SSH certificates, a Rust signing gateway, and Traefik multi-port routing for ephemeral workspaces.
A hybrid Rust + Python library for fast EO primitives (indices, masking, temporal stats) that plays nicely with NumPy, XArray, and Dask.
Running over localhost in a Kubernetes Job Pod; can be adapted for other environments.
Creating georeferenced bounding-box sample areas from an AOI using grid offsets.
Saving a GeoPandas GeoDataFrame (with Parquet and Feather examples) to a remote cloud storage bucket and then reading it back from the bucket, demonstrating Storage Client IO.
A few considerations with Dask Futures to improve computation efficiency and data management.
There are many reasons why snow mapping is beneficial. During spring, we can use it to support flood risk mitigation, especially for areas prone to flooding. During winter, it cou
We will use the Cartopy library, available for Conda environments. This library allows us to project and transform coordinates and images between map projections.
Curating the scheme of task allocation and its impact on the system
The effects and approaches to consider when dealing with straggling workers
This article uses the PyTorch framework to develop an Autoencoder to detect corrupted (anomalous) MNIST data.
The abstraction of distributed systems has become more prevalent as services like Amazon AWS, Google Cloud, and Microsoft Azure take the reins away from developers and let them