Pipeline Architecture

This document describes the pipeline architecture, which uses a Directed Acyclic Graph (DAG) to orchestrate artifact generation. The pipeline transforms raw design data into final outputs for visualization and manufacturing, with dependency-aware scheduling and efficient artifact caching.

Core Concepts

Artifact Nodes and the Dependency Graph

The pipeline uses a Directed Acyclic Graph (DAG) to model artifacts and their dependencies. Each artifact is represented as an ArtifactNode in the graph.

ArtifactNode

Each node contains:

ArtifactKey: A unique identifier consisting of an ID and a group type (workpiece, step, job, or view)
State: The current lifecycle state of the node
Dependencies: List of nodes this node depends on (children)
Dependents: List of nodes that depend on this node (parents)

Node States

Nodes progress through four states:

State	Description
`DIRTY`	The artifact needs to be regenerated
`PROCESSING`	A task is currently generating the artifact
`VALID`	The artifact is ready and up-to-date
`ERROR`	Generation failed

When a node is marked as dirty, all its dependents are also marked dirty, propagating invalidation up the graph.

PipelineGraph

The PipelineGraph is built from the Doc model and contains:

One node for each (WorkPiece, Step) pair
One node for each Step
One node for the Job

Dependencies are established:

Steps depend on their (WorkPiece, Step) pair nodes
Job depends on all Steps

DagScheduler

The DagScheduler is the central orchestrator of the pipeline. It owns the PipelineGraph and is responsible for:

Building the graph from the Doc model
Identifying ready nodes (DIRTY with all VALID dependencies)
Launching tasks to generate artifacts
Tracking state through the generation process
Notifying consumers when artifacts are ready

The scheduler works with generation IDs to track which artifacts belong to which document version, allowing reuse of valid artifacts across generations.

Key behaviors:

When the graph is built, the scheduler syncs node states with the artifact manager to identify cached artifacts that can be reused
Artifacts from the previous generation can be reused if they remain valid
The scheduler tracks which generation IDs have running tasks to preserve artifacts during generation transitions
Invalidations are tracked even before graph rebuild and re-applied after

ArtifactManager

The ArtifactManager is a pure cache manager for artifact handles. It:

Stores and retrieves artifact handles
Manages reference counting for cleanup
Handles lifecycle (creation, retention, release)
Does NOT track state (state is managed by the DAG scheduler)

Shared Memory Lifecycle

Artifacts are stored in shared memory (multiprocessing.shared_memory) for efficient inter-process communication between worker processes and the main process. The ArtifactStore manages the lifecycle of these memory blocks.

Ownership Patterns

Local Ownership: The creating process owns the handle and releases it when done. This is the simplest pattern.

Inter-Process Handoff: A worker creates an artifact, sends it to the main process via IPC, and transfers ownership. The worker "forgets" the handle (closes its file descriptor without unlinking the memory), while the main process "adopts" it and becomes responsible for eventual release.

Reference Counting

The ArtifactStore maintains reference counts for each shared memory block. Multiple callers can retain() a handle, and the block is only unlinked when the count reaches zero. This is used by the ViewManager for progressive rendering where multiple callbacks may access the same artifact.

Pipeline Stages

The pipeline stages (WorkPiecePipelineStage, StepPipelineStage, JobPipelineStage) now serve as interfaces rather than task launchers:

They handle invalidation requests from the UI
They delegate task launching to the DagScheduler
They provide access to cached artifacts
They forward signals from the scheduler to the UI

InvalidationScope

The InvalidationScope enum defines the scope of invalidation for downstream artifacts:

Scope	Description
`FULL_REPRODUCTION`	Invalidates workpieces, which cascades to steps and then to the job. Used for changes that require artifact regeneration (geometry, parameters, size changes).
`STEP_ONLY`	Invalidates steps directly, which cascades to the job. Used for position/rotation-only transform changes where workpiece geometry remains unchanged.

Detailed Breakdown

Input

The process begins with the Doc Model, which contains:

WorkPieces: Individual design elements (SVGs, images) placed on canvas
Steps: Processing instructions (Contour, Raster) with settings

Pipeline Core

Pipeline (Orchestrator)

The Pipeline class is the high-level conductor that:

Listens to the Doc model for changes
Coordinates with the DagScheduler to trigger regeneration
Manages the overall processing state
Connects signals between components

DagScheduler

The DagScheduler:

Builds and maintains the PipelineGraph
Identifies nodes ready for processing
Launches tasks via the TaskManager
Tracks node state transitions
Emits signals when artifacts are ready

ArtifactManager

The ArtifactManager:

Caches artifact handles in shared memory
Manages reference counting for cleanup
Provides lookup by ArtifactKey and generation ID

Artifact Generation

WorkPieceArtifacts

Generated for each (WorkPiece, Step) combination, containing:

Toolpaths (Ops) in local coordinate system
Vertex data for lines
Texture data for raster fills

Processing sequence:

Modifiers: (Optional) Image conditioning (grayscale, etc.)
Producer: Creates raw toolpaths (Ops)
Transformers: Per-workpiece modifications (Tabs, Smooth)
Vertex Encoder: Creates GPU-friendly data

StepArtifacts

Generated for each Step, consuming all related WorkPieceArtifacts:

**StepRenderArtifact:

Combined vertex and texture data for all workpieces
Transformed to world-space coordinates
Optimized for 3D canvas rendering

**StepOpsArtifact:

Combined Ops for all workpieces
Transformed to world-space coordinates
Includes per-step transformers (Optimize, Multi-Pass)

JobArtifact

Generated on demand when G-code is needed, consuming all StepOpsArtifacts:

Final G-code for the entire job
Complete vertex data for simulation
High-fidelity time estimate

ViewManager (Separated)

The ViewManager is decoupled from the data pipeline. It handles rendering for the 2D canvas based on UI state:

RenderContext

Contains the current view parameters:

Pixels per millimeter (zoom level)
Viewport offset (pan)
Display options (show travel moves, etc.)

WorkPieceViewArtifacts

The ViewManager creates WorkPieceViewArtifacts that:

Rasterize WorkPieceArtifacts to screen space
Apply the current RenderContext
Are cached and updated when context or source changes

Lifecycle

ViewManager tracks source WorkPieceArtifact handles
When render context changes, ViewManager triggers re-rendering
When source artifact changes, ViewManager triggers re-rendering
Throttling prevents excessive updates during continuous changes

The ViewManager indexes views by (workpiece_uid, step_uid) to support visualizing intermediate states of a workpiece across multiple steps.

Consumers

Consumer	Uses	Purpose
2D Canvas	WorkPieceViewArtifacts	Renders workpieces in screen space
3D Canvas	StepRenderArtifacts	Renders full step in world space
Simulator	JobArtifact	Accurate simulation of machine path
Machine	JobArtifact G-code	Manufacturing output

Key Differences from Previous Architecture

DAG-based Scheduling: Instead of sequential stages, artifacts are generated as their dependencies become available.
State Management: Node state is tracked in the DAG graph, not in individual components.
ViewManager Separation: Rendering for the 2D canvas is now handled by a separate ViewManager, not as part of the data pipeline.
Generation IDs: Artifacts are tracked with generation IDs, allowing efficient reuse across document versions.
Centralized Orchestration: The DagScheduler is the single point of control for task launching and state tracking.
Pure Cache Manager: The ArtifactManager is now a simple cache, delegating all state management to the DAG scheduler.
Invalidation Tracking: Keys marked dirty before graph rebuild are preserved and re-applied after rebuild.
Pending Work Detection: Only PROCESSING nodes count as pending work; DIRTY nodes may have unsatisfied dependencies (e.g., no view context).

Core Concepts

Artifact Nodes and the Dependency Graph​

ArtifactNode​

Node States​

PipelineGraph​

DagScheduler​

ArtifactManager​

Shared Memory Lifecycle​

Ownership Patterns​

Reference Counting​

Pipeline Stages​

InvalidationScope​

Detailed Breakdown

Input​

Pipeline Core​

Pipeline (Orchestrator)​

DagScheduler​

ArtifactManager​

Artifact Generation​

WorkPieceArtifacts​

StepArtifacts​

JobArtifact​

ViewManager (Separated)​

RenderContext​

WorkPieceViewArtifacts​

Lifecycle​

Consumers​

Key Differences from Previous Architecture

Artifact Nodes and the Dependency Graph

ArtifactNode

Node States

PipelineGraph

DagScheduler

ArtifactManager

Shared Memory Lifecycle

Ownership Patterns

Reference Counting

Pipeline Stages

InvalidationScope

Input

Pipeline Core

Pipeline (Orchestrator)

DagScheduler

ArtifactManager

Artifact Generation

WorkPieceArtifacts

StepArtifacts

JobArtifact

ViewManager (Separated)

RenderContext

WorkPieceViewArtifacts

Lifecycle

Consumers