Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

What does uNet_test.npy do? #37

Open
mrlihellohorld opened this issue Dec 31, 2020 · 8 comments
Open

What does uNet_test.npy do? #37

mrlihellohorld opened this issue Dec 31, 2020 · 8 comments

Comments

@mrlihellohorld
Copy link

Many thanks to the author for the excellent open source code.
I have learned thatuNet_test.npy is the classification of each test video, but what is the specific function of uNet_test.npy in post-processing?

@frostinassiky
Copy link
Owner

Hello @mrlihellohorld , please check the post-processing code that uses this file:

# sort video classification

The video classification scores help to assign class labels and reweight the G-TAD prediction.

@mrlihellohorld
Copy link
Author

mrlihellohorld commented Jan 5, 2021

Hello @mrlihellohorld , please check the post-processing code that uses this file:

# sort video classification

The video classification scores help to assign class labels and reweight the G-TAD prediction.

Thank you very much for your reply. I am puzzled by this sentence“The video classification scores help to assign class labels “. Because according to this code

tmp_proposal["label"] = thumos_class[int(unet_classes[k])]
, I understand is to assign the tag of the whole video to the tag of the tmp_proposal,right?

@frostinassiky
Copy link
Owner

Sorry for the confusing sentence. Your understand is correct.
1, To decide the class label (or tag), we need to find the top-k classification scores.
2, From the indexes of the top-k scores, we can decide the video tag by thumos_class variable.

Please let me know if you have more questions. J

@mrlihellohorld
Copy link
Author

mrlihellohorld commented Jan 5, 2021

Sorry for the confusing sentence. Your understand is correct.
1, To decide the class label (or tag), we need to find the top-k classification scores.
2, From the indexes of the top-k scores, we can decide the video tag by thumos_class variable.

Please let me know if you have more questions. J

Thank you very much for your patient answer. According to your answer, I have described my understanding and hope you can correct me,THX.
a. ' From the indexes of the top-k scores, we can decide the video tag by thumos_class variable.' If I understand you correctly, here is the classification of tmp_proposal(snippet video),so the 'video' mean ‘snippet video’?
b, In my own data set, each video has multiple actions. According to the data set, I first train a video classification model to get video level scores, and then classify each snippet according to the video scores. Do I understand that right?If correct, is it reasonable to classify snippet video by the whole video (including multiple actions)?
I don't understand this part very well. I am looking forward to your reply. thanks!!!

@frostinassiky
Copy link
Owner

Hello @mrlihellohorld

The video means the untrimmed video that includes the snippet.
If your video has multiple actions that belong to different classes, a reasonable setting is classifying those tmp_proposal via a separate branch. A new classification score ( e.g. this line ) can be added on G-TAD to achieve this.

@mrlihellohorld
Copy link
Author

I don't quite understand you said. Could you please explain it more clearly? Thank you

@frostinassiky
Copy link
Owner

Sorry I didn't explain the questions clearly.
If you are very interested in G-TAD, we can schedule a time and chat about it. My WeChat ID is FrostXuMengmeng.

@ASMIftekhar
Copy link

I am curious, did anyone actually implement action class prediction from another branch in gtad?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants