RL-Ops:
Production RL

A metaclass-driven, registry-first RL framework. Hash-locked RLExperimentSpec, RLRuntime, and Iceberg trajectory store for deterministic, replayable training and evaluation.

RL Lab Mockup

The PRUDEX Evaluation Framework

Moving beyond Sharpe ratio. Our Phase 9 PRUDEX-Compass framework provides 17 independent measures and 5 advanced visualizations to truly understand agent behavior before deploying capital.

Iceberg Trajectory Store

Every step, observation, and reward is persisted to an Iceberg warehouse for forensic analysis and replay.

Weight-Centric Pipeline

FinRL-X inspired pipeline (f_S → f_A → f_T → f_R) for robust portfolio-level decision making.

Pre-built Agents

EIIE (Ensemble of Identical Independent Experts)
DeepTrader (Asset scoring + risk control)
Investor Imitator (Inverse RL)
DeepScalper (HFT execution)
PPO In-house (Optimized for low-latency)
FinAgent (LLM-hybrid adapter)

RL-Ops:Production RL

The PRUDEX Evaluation Framework

Iceberg Trajectory Store

Weight-Centric Pipeline

Pre-built Agents

RL-Ops:
Production RL