Abstract: Text-to-image pedestrian retrieval involves querying a database of images based on a given textual description to find matching images. The main challenge lies in learning how to map visual ...