-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
About LargeScaleJitter #3
Comments
Thanks. Pretrained-Pix2Seq/playground/pix2seq/pix2seq.py Lines 88 to 90 in 7d908d4
The conversion of relative coordinate in transforms.py is because we use the same dataloader as DETR. And for the large Scale jittering, we basically follow the same pipeline proposed in CopyPaste which is cited in Pix2seq. |
Hello, does using absolute coordinate gets a better AP? |
The only difference between relative and absolute coordinate is the normalization factor ? Absolute will normalize by the longest image size instead of the actual image size, which relative coordinate does, am I right ? |
hi, great work! We also try to reimplement the Pix2Seq, we find the absolute coordinate is useful, which is similar to your LargeScaleJitter (pad or crop the image to the fix desired size),
the absolute coordinate means that normalized the position by dividing the fix size.
boxes = boxes / 1333.
instead ofboxes = boxes / torch.tensor([w, h, w, h], dtype=torch.float32)
,Then, padding or croppinf the image to the fix desired size is not necessary.The text was updated successfully, but these errors were encountered: