publication venue for Q-learning via deep learning-based Buckley-James method for non-linear censored data. 32. 2026