Interactive Simulation

Earth-Space Distributed AI

Explore how RotaStellar coordinates federated learning, model partitioning, and synchronization across Earth and orbital infrastructure.

Federated Learning Simulation
Round 1,247 · 00:04:32

RotaStellar vs Naive

Bandwidth -99.0%
Convergence +23.4%
Fault tolerance +∞
Accuracy delta -0.31%
[Diagram: LEO constellation at 550 km feeding a ground AGGREGATOR holding Global Model v1247. Ground nodes: us-west-2 (computing), eu-west-1 (syncing), ap-south-1 (ready), ap-northeast (computing). Orbital nodes: orbital-1 (sunlight, ISL relay), orbital-2 (eclipse, 47 s remaining). Compressed gradient ∇′ = 42 KB via 100× TopK + quantization; throughput 1.2 MB/s vs 120 MB/s naive.]
Configuration
Live Metrics
Accuracy
94.72%
Target: 95.0%
Loss
0.0523
Bandwidth
1.2 MB/s
99% reduction
Sync Rate
2.4/min
Async aggregation
Node Status
🖥️ us-west: computing
🖥️ eu-west: syncing
🖥️ ap-south: ready
🖥️ ap-ne: computing
🛰️ orb-1: sunlight
🛰️ orb-2: eclipse
Event Stream
04:32.1 Round 1247 aggregated
04:31.8 eu-west gradient: 38 KB
04:31.2 TopK sparsification: 99.1%
04:30.5 orbital-2 entering eclipse
04:29.8 🛰 ISL handover complete
Training Convergence
RotaStellar (compressed)
Naive (full gradients)
Centralized baseline
[Chart: accuracy (0–100%) over 1,000 training rounds for the three runs above, annotated at 94.7% and "Bandwidth limited".]
Technical Deep Dive: Gradient Compression

Our gradient compression achieves 100× reduction while maintaining convergence by combining Top-K sparsification with stochastic quantization. This is critical for space links where bandwidth is measured in KB/s, not GB/s.

∇′_compressed = Q_8bit( TopK(∇ + e_accumulated, k = 0.01) )

The key insight is error feedback: compression errors are accumulated locally and added to the next round's gradients, ensuring no information is permanently lost.

# RotaStellar gradient compression
import torch

def decompress(quantized, indices, scale, shape):
    # Rebuild the dense gradient from the transmitted Top-K values
    dense = torch.zeros(shape).flatten()
    dense[indices] = quantized.float() * scale
    return dense.view(shape)

def compress_gradient(gradient, error_feedback=0.0, k_ratio=0.01):
    # Error feedback: fold in the residual left over from the previous round
    gradient = gradient + error_feedback

    # Top-K sparsification: keep only the top 1% of entries by magnitude
    flat = gradient.flatten()
    k = max(1, int(flat.numel() * k_ratio))
    _, indices = torch.topk(flat.abs(), k)
    kept = flat[indices]

    # 8-bit stochastic quantization (floor + uniform noise is unbiased)
    scale = kept.abs().max().clamp(min=1e-12) / 127
    quantized = torch.floor(kept / scale + torch.rand_like(kept)).clamp(-127, 127).to(torch.int8)

    # Residual accumulated for the next round so no information is permanently lost
    error = gradient - decompress(quantized, indices, scale, gradient.shape)
    return quantized, indices, scale, error

# Compression ratio: 32-bit × N params → 8-bit × 0.01N values + indices
# For 70B params: 280GB → 2.8GB (indices) + 0.7GB (values) ≈ 100×
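
As a usage sketch (illustrative, not taken from the page), the residual returned each round can be fed back into the next call so that compression errors are eventually transmitted. The loop below assumes the compress_gradient and decompress helpers above; the random stand-in gradient is purely for demonstration.

# Illustrative round loop threading the error-feedback residual (stand-in gradient, not a real training step)
error = 0.0                                    # accumulated residual, starts empty
for round_idx in range(3):
    gradient = torch.randn(1_000_000)          # placeholder for the real local gradient
    q, idx, scale, error = compress_gradient(gradient, error_feedback=error)
    # Only q (int8 values), idx, and scale cross the space link;
    # the aggregator rebuilds the update with decompress(q, idx, scale, gradient.shape)
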
100×
Compression Ratio
<0.5%
Accuracy Loss
+23%
Faster Convergence
Comparison: RotaStellar vs Alternatives
Metric                            | Naive Sync | FedAvg  | RotaStellar
Bandwidth per round               | 280 GB     | 280 GB  | 2.8 GB
Handles intermittent connectivity | No         | Partial | Yes (async)
Eclipse resilience                | Fails      | Stalls  | Continues
Convergence (rounds to 95%)       | N/A        | 1,800   | 1,250
Final accuracy                    | N/A        | 94.2%   | 94.7%
Supports orbital nodes            | No         | No      | Native
Model Partitioning Optimizer
Inference latency: 127ms
[Diagram: LLaMA-70B layer distribution]
GROUND (us-west-2): Embedding (4.2B params), Layers 0-23 (28.4B params), Layers 24-47 (28.4B params); 60.8B params, 45 ms compute
Link: 2.1 MB activations, 12 ms transfer
ORBITAL (orbital-1): Layers 48-63 (18.9B params), Output Head (1.2B params); 20.1B params, 70 ms compute; solar surplus +340 W available
Why this partition? Minimize activation transfer (split at a natural layer boundary) and use the orbital solar surplus for the compute-heavy final layers.
Latency: 127 ms total (vs 180 ms ground-only, vs 95 ms centralized)
Latency breakdown: 45 ms ground compute + 12 ms transfer + 70 ms orbital compute = 127 ms
Partition Config
Performance
End-to-end
127ms
vs Ground-only
-29%
Bandwidth
2.1 MB
Energy saved
+18%
Constraints
Ground: 8× A100 (80GB), 2.4 PFLOPS
Orbital: 4× H100 (80GB), 1.2 PFLOPS, solar-powered
Link: 1.2 Gbps, 12ms RTT (during pass)
Technical Deep Dive: Optimal Partitioning

Finding the optimal model partition is a constrained optimization problem. We minimize end-to-end latency subject to memory, bandwidth, and energy constraints.

min_s [ T_compute(0:s) + T_transfer(s) + T_compute(s:L) ]

Where s is the split layer and L the total number of layers; the split must satisfy the constraints below (a minimal search sketch follows the list):

  • Memory(0:s) ≤ Ground_VRAM
  • Memory(s:L) ≤ Orbital_VRAM
  • Activation_size(s) / Link_bandwidth + RTT ≤ Latency_budget
  • Compute(s:L) ≤ Orbital_energy_budget
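
A minimal brute-force sketch of this search, assuming each candidate layer carries a profiled memory footprint, activation size, per-side compute time, and orbital energy cost; the dictionary keys and the best_split name are illustrative assumptions, not RotaStellar's actual optimizer.

# Illustrative split search over per-layer profiles (assumed data layout, not RotaStellar internals)
def best_split(layers, link_bw_bps, rtt_s, ground_vram, orbital_vram, orbital_energy_j, latency_budget_s):
    # layers: list of dicts with mem_bytes, act_bytes, t_ground_s, t_orbital_s, e_orbital_j
    best = None
    for s in range(1, len(layers)):                        # layers [0:s] on ground, [s:L] in orbit
        ground, orbital = layers[:s], layers[s:]
        t_transfer = ground[-1]["act_bytes"] * 8 / link_bw_bps + rtt_s
        latency = (sum(l["t_ground_s"] for l in ground) + t_transfer
                   + sum(l["t_orbital_s"] for l in orbital))
        feasible = (sum(l["mem_bytes"] for l in ground) <= ground_vram
                    and sum(l["mem_bytes"] for l in orbital) <= orbital_vram
                    and t_transfer <= latency_budget_s
                    and sum(l["e_orbital_j"] for l in orbital) <= orbital_energy_j)
        if feasible and (best is None or latency < best[1]):
            best = (s, latency)
    return best                                            # (split layer s, end-to-end latency) or None
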
-29%
Latency vs Ground-only
+34%
vs Naive Split
340W
Solar Surplus Used
Ground Station Pass Scheduler
Next pass: 2m 34s
[Chart: 24-hour ground station pass schedule (00:00-24:00) for Svalbard, Alaska, and Singapore, with the current time marked NOW]
Sync queue (priority-ordered): P1 Gradients (2.4 MB), P2 Checkpoint (18 MB), P3 Telemetry (4 MB), P4 Logs (12 MB), Deferred (45 MB)
Optimization impact: data freshness 98.2% (vs 67% naive); bandwidth utilization 94.7% (vs 52% naive); priority adherence 100% (critical data first)
Current Pass
Station
Svalbard
Duration
8m 42s
Max elev.
72°
Bandwidth
1.8 Gbps
Queue Status
P1 Critical 2.4 MB
P2 Important 18 MB
P3 Normal 4 MB
P4 Low 57 MB
Event Stream
12:34:12 P1 gradients synced
12:34:08 Pass started (Svalbard)
12:33:45 Queue reordered (P1 priority)
12:32:20 Alaska pass skipped (weather)
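
A minimal sketch of how the priority-ordered queue might be drained within a single pass window, using the Svalbard pass figures above; the plan_pass helper and the 90% link-efficiency factor are assumptions for illustration, not the actual scheduler.

# Illustrative priority-queue drain for one ground-station pass (assumed helper, not RotaStellar's scheduler)
def plan_pass(queue, pass_seconds, link_bps, efficiency=0.9):
    # queue: list of (priority, name, size_bytes); lower priority number = more critical
    budget = pass_seconds * link_bps * efficiency / 8      # usable bytes this pass
    sent, deferred = [], []
    for prio, name, size in sorted(queue, key=lambda item: item[0]):
        if size <= budget:
            budget -= size
            sent.append(name)
        else:
            deferred.append(name)                          # waits for the next pass
    return sent, deferred

# Example with the queue shown above: an 8m42s Svalbard pass at 1.8 Gbps easily clears all four priorities
sent, deferred = plan_pass(
    [(1, "gradients", 2_400_000), (2, "checkpoint", 18_000_000),
     (3, "telemetry", 4_000_000), (4, "logs", 57_000_000)],
    pass_seconds=522, link_bps=1_800_000_000,
)
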

Built on Open Research

Every capability demonstrated here is grounded in our published research, open datasets, and benchmarks.

Models

gradient-compress, model-partition, sync-scheduler, checkpoint-optimizer, bandwidth-predict

View 15 models →

Datasets

Link Budget Archive, ISL Topology, Space Network Traces, Federated Training Logs

View 13 datasets →

Benchmarks

FedSpace, PartitionBench, SyncEfficiency, CheckpointOpt, MeshRoute

View 15 benchmarks →

Ready to build Earth-space AI?

Get early access to distributed compute capabilities and start coordinating AI workloads across ground and orbital infrastructure.

Get Early Access · Talk to Us