Doeon Kwon

Background

SPACE0Former Co-founder / CPO / Engineer

A multiplayer voxel world running one Manifold Dual Contouring engine in Rust across web, iOS, macOS, and Windows. I shipped native ports, embodied agents, a photo-to-3D asset pipeline, multiplayer infrastructure, and the analytics and guest-to-owner activation funnel, and ran shortform GTM that drove 560K organic views (TikTok 496K, YouTube 55K, Reels 9K)

2025.04-2026.06

DisquietFounding Member / Growth

A Korean career networking community for people in tech. Raised ₩1.4B at Pre-A (~$1.0M), scaled to ~130K MAU with 40% M1 retention. I ran Product Maker's Club cohorts of 80, 60, and 160 teams, grew monthly posts from under 100 to ~1,500 at peak, and helped it through its acquisition by Relate (YC S22). I wrote about it here

2022.01-2025.03

JunctionHead of People

Ran JunctionX Seoul 2021 and Junction Asia 2022

2021.01-2022.08

J2KBFounder

A nonprofit community teaching non-majors to code. Grew it to ~300 members and a 30-person crew across four 10-week cohorts, ran the Uni-Con and Pony-Con project competitions, and was selected for a regional nonprofit startup program

2020.04-2021.10

University of Technology SydneyInformation Technology

Left after my first year

2019.02-2020.01

Selected work

TLDR

An LLM can join a live multiplayer 3D world as a persistent player: it perceives its surroundings, remembers places, chats, moves, builds, and comes back with its memory intact across sessions.

Built

I built the embodiment layer that let an LLM act as a player inside a live SPACE0 world. A stateless MCP server exposed the full embodied loop; 3D-anchored long-term memory carried a soul document and brain-state checkpoints that survived restarts. I added a Voyager-pattern skill library, and wrote the server-side movement and collision the relay needed because it had no physics engine.

Signals

60 MCP tools across 8 modules (presence, build, memory, identity, commitment, skill, brain_state, media), the surface enforced by a manifest-drift test
Live on the public MCP registry as a Cloudflare Worker; tested with two agent brains, one embedded local model and one cloud model
Memories persist in pgvector with dual spatial anchors (observer and subject); soul revisions are logged and revertible
Builds draw from a 146-material palette; tool calls fan out to 23 authenticated relay actions

How it works

Embodiment parity: there is no privileged model-only write API. A world mutation issues the identical box-brush primitive humans use, and agent reach is pinned to the human client's 15m interaction raycast, so the relay re-derives server-side the placement physics the human client handles by default: reach, no floating blocks, claim containment.
Renderer-less symbolic perception: every look_around is built from the terrain SDF plus the sparse edit overlay through a fixed grid-to-world transform, an 11x11x7 density probe and a marched gaze ray, no GPU and no pixels; the grid cell ships beside the world position so a small model never hand-converts coordinates.
A prompt-injection fence treats all other-authored, in-world content as untrusted data, never instructions; human identity is set server-side, never self-declared.

Try the MCP

A spatial memory design document posted inside a live SPACE0 world, with a read prompt.

The memory design doc, posted inside the world.

The spatial memory design document opened in the in-world reading overlay, its thesis paragraphs readable.

The same doc opened in the reading overlay: index, placement, organization.

A wireframe view of the spatial memory regions laid over a live SPACE0 world.

Memory regions shown as a spatial index over the live world.

TLDR

A measured study of what an LLM agent's spatial memory actually needs to store: not just locations, but enough geometry to compute occlusion.

Built

I built the spatial-memory evaluation system on the live SPACE0 world and wrote it up as a paper: the recall-versus-visibility separation, a minimum-representation schema per query type, a ray-versus-voxel DDA visibility predicate, and a pre-registered live confirmation.

Signals

The shipped memory-palace blend failed its own pre-set test (mean ΔHit@5 -0.0375, at a position-blind baseline) while geometry-led weighting won (+0.32): geometry must lead recall when the query regime is spatial
On 849 behind-wall targets, text and a live FoV cone could not distinguish visible from occluded (0.000); adding the ray-versus-voxel DDA raycast games commonly use reached 0.982
Live confirmation with the criteria locked before the run: 8 scripted worlds, 96 behind-wall targets, false-visible 1.000 to 0.000, surfacing and fixing a relay anchor defect

How it works

The test is definitional: if a non-spatial text or vector index could answer the query, it is not a spatial-memory test. BM25, RAG, GraphRAG, and HippoRAG miss geometry questions not because the algorithms are weak, but because nothing spatial is stored; a minimum-representation schema pairs each query type with what to store and what to compute.
Recall and visibility are different problems: the fridge behind the wall should stay remembered (recall is occlusion-blind by design), while 'is it visible from here' is computed separately over stored geometry; one line of the DDA raycast games already use recovers occlusion.
The result: storage, not a renderer, is the irreducible piece. Handed coordinates and occluders as text, an LLM computes occlusion at parity with ray-marching (0.99 vs 0.985), so the contribution is measurement and isolation, and the robustness checks narrow the claim rather than inflate it.

arxiv paper

TLDR

One Manifold Dual Contouring voxel engine in Rust, extracted behind a typed boundary and shipped across four native runtimes (a web app, an App Store iOS client, and native macOS and Windows desktop apps) plus an agent relay, kept in sync by an FFI, TOML codegen, and a parity gate.

Built

I extracted an 18.9K-LOC Rust engine kernel out of the Next.js monolith into versioned packages. I carried it to native through a ~5,100-LOC C FFI xcframework (the iOS C++ to Rust swap) and a wgpu/Slint desktop client. I also built the TOML codegen that keeps physics constants, copy, design tokens, and analytics events in sync across TypeScript, Swift, Rust, and Slint.

Signals

An 18.9K-LOC engine kernel extracted from the Next.js monolith: 1,189+ import sites codemodded in one compile-green commit
An atomic C++ to Rust engine swap behind a 3-slice C-FFI xcframework: 120/120 XCTests, blake3 byte-parity, legacy engine deleted the same day
iOS: a Swift 6 / TCA / Metal App Store client; desktop: a native Rust client (wgpu, Slint) with PKCE auth and signed Sparkle/WinSparkle auto-updates
TOML codegen single-sources physics constants, copy, design tokens, and analytics events into web, Swift, Rust, and Slint
QA across two languages: Rust proptest, Kani formal-verification, cargo-mutants, TypeScript fast-check, and a record/replay divergence harness

How it works

The engine kernel was lifted out of the app monolith into versioned workspace packages, proven app-independent by a leak audit, then a custom ESLint rule machine-enforces the dependency graph.
iOS runs the engine on-device through a hand-written C FFI packaged as a three-slice xcframework (Swift 6 strict concurrency, TCA, Metal); desktop is a fully native Rust client (wgpu, Slint), no embedded browser, with PKCE OAuth and signed auto-updates.
One typed TOML source fans physics constants, copy, design tokens, and analytics events into four platforms, or the build fails. QA spans Rust proptest, Kani, and cargo-mutants on the Rust side and TypeScript fast-check on the web side, with a record/replay divergence harness on the push gate.

Try on the web App Store

More work

SPACE0 iOS camera pointed at a real bouquet, ready to capture it as a 3D object.

Snap a photo, and the pipeline job kicks off.

A real product captured into a 3D model, shown in the app with its detail card.

The finished model, ready to drop into any world.

Side by side: a traditional lacquered table photographed in a museum, and the generated 3D model of it.

A museum table, captured into a placeable asset.

TLDR

A single photo becomes a rigged, animated, compressed, game-ready 3D asset, with GPU workers, validation, rig merge, material maps, sound, and web/iOS/desktop export handled by the server-side pipeline.

Built

I built the pipeline that turns a single phone photo into a rigged, compressed, game-ready 3D asset. Self-hosted Hunyuan3D 2.1 runs on GPU workers, UniRig handles auto-rigging, and material maps are generated. The meshopt/KTX2/USDZ output runs behind a Supabase RPC job queue with atomic claims.

Signals

Self-hosted Hunyuan3D 2.1 on SaladCloud RTX 4090 workers, with weights baked into the image so cold workers start without downloading checkpoints
Material maps (normal, height, roughness, AO) generated from a single albedo; output as meshopt/KTX2/USDZ with skeletal animation and generated sound
A Redis-free GPU job queue on Supabase, claimed atomically by RPC, scaling independently of the app

How it works

Self-hosted Hunyuan3D 2.1 as Python GPU workers on SaladCloud RTX 4090s, with model weights baked into the container image for faster cold starts.
Auto-rigging with UniRig, merging the predicted rig back onto the full-resolution mesh so topology, UVs, and textures all survive.
A production asset path with validation and failure handling around mesh generation, rig merge, material-map generation, compression, and export, so bad outputs fail as jobs rather than leaking into the world.

TLDR

A browser editor that carves 3D Gaussian splat scenes at voxel resolution: per-fragment masking cuts cube-shaped holes in a live splat render, without forking the renderer.

Built

I built splatcarve, an open-source WebGL editor on Spark and Three.js: a voxel occupancy mask evaluated per fragment in the splat shader, voxel-level picking and snapping, undo/redo, and experimental block-stack and first-person modes on top of the carved scene.

Signals

Per-fragment carving holds p95 9.6ms at 256 carves on a 177K-splat scene; voxel picking answers in p95 5.3ms
The shader hook injects through Spark's public onBeforeCompile API: no fork, no custom rasterizer, one O(1) sampler3D lookup per fragment
15.7K LOC of TypeScript with 197 passing tests and reproducible latency benchmarks

How it works

Most 3DGS editors delete whole splats, which leaves fuzzy halos because neighboring Gaussians still cover the hole; splatcarve masks per fragment against a 3D occupancy texture, so carved holes get clean cube boundaries.
A clip-to-local matrix computed once per frame moves the voxel test into the fragment shader cheaply: a bounds check, one texture sample, discard.
Carves are visual-only masks over immutable splat data: an edit history gives full undo/redo, and the same voxel grid drives the experimental stack and first-person collision modes.

Live demo GitHub

TLDR

One Cloudflare Durable Object per world carries presence, edits, chat, and agent actions, so humans and agents share the same live space.

Built

I built the shared live layer: one Cloudflare Durable Object per SPACE0 world carries presence, edits, chat, and agent actions. Short-lived signed tokens authorize every connection.

Signals

One Cloudflare Durable Object per world, holding presence and session state in memory
Short-lived signed tokens minted by the web app authorize every connection
Web, iOS, macOS, and Windows clients all connect to the same multiplayer server

How it works

Every read is local to the world's Durable Object, and writes propagate instantly to all connected clients.
Server-side validation rejects malformed or out-of-bounds action payloads before they reach shared world state, and repeat offenders are denylisted.
Short-lived signed tokens minted by the web app gate every WebSocket connection, so the relay never trusts claimed identity.

Framed image posts placed on a wooden structure in a SPACE0 world.

Framed posts sitting on a real surface, on web.

A media card with a track list displayed on a surface in a SPACE0 world.

A media card with real depth and clipping, on web.

TLDR

Posts sit on surfaces inside the world rather than in a panel, so user content sits in the scene, with text, image, and video cards rendering across web, desktop, and iOS from one backend.

Built

I shipped in-world media posts on three platforms: a card of text, image, or video placed on a surface renders as a depth-correct decal on web (Three.js), desktop (a GPU-accelerated Slint canvas over wgpu), and iOS (SwiftUI).

Signals

Decals projected with correct depth, occlusion, and surface alignment on every client (web/desktop/iOS)

How it works

Posts are placed in world-space, not a sidebar panel: they sit on surfaces as depth-correct decals with occlusion, so a post is part of the scene rather than an overlay.
One backend serves three distinct renderers (Three.js web, wgpu desktop, SwiftUI iOS) without platform-specific divergence in the post schema.

The slintcn registry components page showing 56 components with live WASM previews.

Every component is browseable and live right in the page.

The slintcn Button component page with a live WASM preview and cargo and npm install tabs.

Each page: a live preview plus copy-paste cargo and npm install.

TLDR

slintcn is an open-source design system for Slint native apps, shipping 56 components with npm and crates.io installers, an MCP server, and live Slint-WASM docs.

Built

I built slintcn: a shadcn-style component registry for Slint native apps with 56 components plus 8 blocks, npm and crates.io installers sharing one registry, an MCP server for AI agents, and live Slint-WASM docs.

Signals

56 components and 8 blocks, installable via npm or crates.io from the same registry
A 62K-view r/rust launch post
MCP server lets AI agents browse, install, and compose components

How it works

Web tooling and native Rust clients share the same component source, without duplication.
The MCP server exposes the registry to AI agents, letting them browse available components, read docs, and install by name into a project.
The component system grew out of the SPACE0 desktop client and was dogfooded back into it, so every component shipped in a live product before it reached the registry.

Live demo Docs npm package Rust crate

The simulated planet seen from space, volumetric clouds drifting over landmasses and a dark ocean, half in night.

The whole planet from space, volumetric clouds and a day-night terminator.

A ground-level view across forested ridges and water on the simulated planet, the sun low through layered cloud.

Ground level: forested ridges, water, the sky.

A top-down view of the voxel terrain, rocky land split by dark channels of water.

The terrain from above: voxel land and water.

TLDR

A kilometer-scale voxel planet simulated live in a browser tab, with real atmospheric circulation, an ocean that obeys real oceanography, 36 climate-derived biomes, volumetric weather, and a metabolism engine that runs the ecosystem.

Built

I built the systems that run the planet. A Rust-WASM climate model drives three-cell global circulation, Coriolis, an ITCZ, and a -6.5C/1000m lapse rate. An oceanography-driven Gerstner ocean in TSL adds the M2 lunar tide, Ekman and thermohaline currents, and polar ice. A 36-biome layer turns climate into terrain and color. An 8,418-line volumetric cloud and weather system sits on top, and a five-phase metabolism engine runs the ecosystem.

Signals

Rust-WASM climate: three-cell global circulation (Hadley/Ferrel/Polar), Coriolis, ITCZ, and lapse-rate temperature, compiled to WASM and run in the browser
An ocean that simulates real oceanography: an 8-wave Gerstner cascade with analytical normals on the spherical planet, a 12.42-hour M2 lunar tide, Ekman and thermohaline currents, Beer-Lambert depth color, and polar ice past 70 degrees latitude
36 Whittaker-classified biomes derived from the simulated climate (temperature, precipitation, altitude), each driving its own terrain erosion, surface palette, and cloud physics
Volumetric cloud and weather: SDF raymarching, an offline imposter baker, and chunked streaming

How it works

The climate is real physics, not a texture: a wind module implements three-cell global circulation with Coriolis deflection, and a spherical-climate module models the ITCZ, storm tracks, and a -6.5C/1000m lapse rate, all running in WASM.
Every term in the ocean shader comes from the simulation: waves driven by the global wind field, tides on the true M2 period, with currents, depth color, and polar ice all derived from the climate model.
A 36-biome classification layer turns the climate field into the visible world: Whittaker temperature-precipitation classes with altitude bands, so terrain, color, and clouds all follow from the climate.

Recognition

Ray Vibe Awards 2025, Best Social Vibe winnerJury included an a16z games investment partner2025

Asan Voyager, cohort 1Asan Nanum Foundation US go-to-market program, selected with the Disquiet team2023

Chung Ju-yung Startup Competition, Excellence AwardAsan Nanum Foundation award named for Hyundai's founder, won with the Disquiet team2022

TIPS₩500M (~$360K) from Korea's national startup R&D program for a Graph Neural Network social networking platform; with the Disquiet team2022

JunctionX Seoul 2020, Rakuten Rapid API Track 2nd placeHackathon, team entry2020

Writing

Personal reasons for creating the game Reflections on Disquiet Agents, virtual worlds, and Minecraft Beautiful and useful things Thoughts on the Taste of Work Real Growth

Contact

Email GitHub LinkedIn X Substack

Open to founding / staff engineering roles in AI + spatial computing.