About — shilovskii.dev

The work

I split my time between independent product work — Apple-platform apps focused on running language models locally — and a small amount of consulting for teams shipping on-device ML features.

The thread connecting everything is a single bet: that as small models get good enough, the right place to run them is the device they're already on, not a datacenter the user can't see. It's faster, more private, and — once you've felt it — surprisingly hard to give up.

Background

13+ years of software engineering, primarily focused on AI and distributed systems. Experience spans:

Architecting AI infrastructure for autonomous multi-modal agents — large language models, vision transformers, speech-to-text and text-to-speech systems
Building fault-tolerant distributed backends in Rust using the actor model
Designing transactional graph-based memory systems for real-time retrieval-augmented generation
Establishing simulation-first learning pipelines for AI agent validation
Leading R&D teams from research prototype through industrial-grade deployment

I'm now applying that background to consumer apps where on-device inference replaces network calls, bringing the engineering rigor of industrial-grade AI systems to tools people use daily.

Stack

Daily drivers: Swift, Rust, llama.cpp, MLX, Metal.

Comfortable across Python, C++, TypeScript, and Go when the situation calls for them. Long history with C# and Java from earlier in my career.

Working together

I take on a small number of consulting engagements each year — typically helping teams architect on-device ML features, optimize Apple Silicon inference, or design Swift ↔ Rust bridges. Engagements are usually 4–8 weeks.

If that sounds useful, send a note via the form with a sentence or two about what you're building. I read every message and reply within a couple of days.

Elsewhere

GitHub — the open-source side of things
LinkedIn — full professional history
georgii@shilovskii.dev — for serious correspondence

Hi, I'm Georgii.
