Stress Testing Machine Learning in Astronomy