Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to evaluate the accuracy? #30

Open
Deep-I opened this issue Jul 19, 2020 · 1 comment
Open

How to evaluate the accuracy? #30

Deep-I opened this issue Jul 19, 2020 · 1 comment

Comments

@Deep-I
Copy link

Deep-I commented Jul 19, 2020

The dataset is consist of video and each video has one class(target).
The video is captured by frame and the captured image is the input of the model.
So can I ask the evaluation method of video classification?
When evaluating the video classification model, I have to measure the accuracy of the label when one video is input?
Or I have to measure the accuracy of each frame of video when each frame is input?

@a-hassanzadeh
Copy link

a-hassanzadeh commented Aug 14, 2020

Take CRNN for example. The input to the model is in form of 28x3x224x224 where 28 is the number of frames extracted from a video, 3 is the number of channels and 224x224 is the resized frame, from the original video. the target for this input is 1 label. As explained in the readme, the CNN (encoder) takes in this input and generates and encoding (feature vector) and passes it to RNN (decoder) which takes into account the temporal resolution of the video.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants