Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Could you tell me the kinetics's details in accuracy of you project #3

Open
Violetgyw opened this issue May 11, 2021 · 5 comments
Open
Labels
enhancement New feature or request

Comments

@Violetgyw
Copy link

No description provided.

@v-iashin
Copy link
Owner

v-iashin commented May 11, 2021

I haven't evaluated the pre-trained models on Kinetics. The feature extraction relies on the original pre-trained checkpoints. Therefore, the quality should be the same as of the original models.

In case of R(2+1)D RGB-only, it should match the reported one in here. One thing, however, it would be difficult to obtain the same quality of features for I3D because in the original paper, the model uses TVL1 algorithm for optical flow frame calculation. In this repo, there are two faster options to calculate the optical flow: PWC and RAFT.

I would be nice if someone could evaluate the quality of the methods from this repo by training the classification layer, esp I3D with different flow extractors.

@v-iashin
Copy link
Owner

By the way, another option on how to evaluate the performance would be to use --show_pred argument from here which shows the predictions on the kinetics (classes and probs). But one still needs to change the script and download the dataset

@Violetgyw
Copy link
Author

By the way, another option on how to evaluate the performance would be to use --show_pred argument from here which shows the predictions on the kinetics (classes and probs). But one still needs to change the script and download the dataset

Thank you for your answer, I tried to use TV-L1, RAFT, PWC, the sample v_CricketShot_g04_c01.mp4 provided at https://github.com/deepmind/kinetics-i3d, using the same processing method, I found only TV-L1 Which category can be accurately distinguished, the official i3d uses TV-L1, other optical flow distributions are different, the effect is not good as it should be

@v-iashin
Copy link
Owner

Can you provide a code snippet? I am afraid, I have a hard time understanding what you mean.

@varungupta31
Copy link

@Violetgyw Please share which repo you used for I3D with TV-L1?

@v-iashin v-iashin added the enhancement New feature or request label Jan 21, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants