RL-Ops:
Production RL

A metaclass-driven, registry-first RL framework. Hash-locked RLExperimentSpec, RLRuntime, and Iceberg trajectory store for deterministic, replayable training and evaluation.

RL Lab Mockup

The PRUDEX Evaluation Framework

Moving beyond Sharpe ratio. Our Phase 9 PRUDEX-Compass framework provides 17 independent measures and 5 advanced visualizations to truly understand agent behavior before deploying capital.

Iceberg Trajectory Store

Every step, observation, and reward is persisted to an Iceberg warehouse for forensic analysis and replay.

Weight-Centric Pipeline

FinRL-X inspired pipeline (f_S → f_A → f_T → f_R) for robust portfolio-level decision making.

Pre-built Agents

  • EIIE (Ensemble of Identical Independent Experts)
  • DeepTrader (Asset scoring + risk control)
  • Investor Imitator (Inverse RL)
  • DeepScalper (HFT execution)
  • PPO In-house (Optimized for low-latency)
  • FinAgent (LLM-hybrid adapter)