Appendix A Object Query Generation

Neural Information Processing Systems 

The text-guided object detection network, as described in Section 3.1.1, As mentioned in Section 3.1.2, The computation of these spatial relation features is explained in detail below. The orientation between two objects is represented by encoding the angle values of the line that connects their centers in the spherical coordinate system. The above calculation results are combined as the spatial relation features of "Distance & Orientation".