Concave-Augmented Pareto Q-Learning (CAPQL)