Active Preference-Based Gaussian Process Regression for Reward Learning | doi.page