The bbox input are the bounding boxes (i.e.