Dynamic Dual-Policy Optimization