Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Any ideas about modifying model to detect multiple objects which are described by one query? #56

Open
Shuaicong97 opened this issue Jun 17, 2024 · 0 comments

Comments

@Shuaicong97
Copy link

Hi,

I want to extend this model to match the following situation:

Based on one text query, e.g. "a person skateboarding", I want to search in the video clip to find out all objects which match this query. For example, person 1 is skateboarding between frame 5 to frame 15, person 2 is skateboarding between frame 7 to frame 24. => Then on the output, it will show person 1 between frame 5 to frame 6, person 1 and 2 between frame 7 to frame 15, person 2 between frame 16 to frame 24.

For now, in inference_ytvos.py, it uses

max_scores, _ = pred_scores.max(-1)  # [q,] 
_, max_ind = max_scores.max(-1)  # [1,] 

to get the maximum one. How to modify them and related files?

Any ideas?

Thank you in advance!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant