RL-Ops:
Production RL
A metaclass-driven, registry-first RL framework. Hash-locked RLExperimentSpec, RLRuntime, and Iceberg trajectory store for deterministic, replayable training and evaluation.
RL Lab Mockup
The PRUDEX Evaluation Framework
Moving beyond Sharpe ratio. Our Phase 9 PRUDEX-Compass framework provides 17 independent measures and 5 advanced visualizations to truly understand agent behavior before deploying capital.
Iceberg Trajectory Store
Every step, observation, and reward is persisted to an Iceberg warehouse for forensic analysis and replay.
Weight-Centric Pipeline
FinRL-X inspired pipeline (f_S → f_A → f_T → f_R) for robust portfolio-level decision making.
Pre-built Agents
- EIIE (Ensemble of Identical Independent Experts)
- DeepTrader (Asset scoring + risk control)
- Investor Imitator (Inverse RL)
- DeepScalper (HFT execution)
- PPO In-house (Optimized for low-latency)
- FinAgent (LLM-hybrid adapter)