
Inference: only images without audio #6

Closed
Oktai15 opened this issue Mar 11, 2019 · 3 comments

Comments


Oktai15 commented Mar 11, 2019

Hello, @miha-skalic, great work!

Can I use your model without audio features? For example, I want to test your model on my own video, but I don't have a feature extractor for audio (because it was not published). Is there a way to try your model? If so, how?

miha-skalic (Owner) commented

Hi @Oktai15 ,

Unfortunately, Google has not yet released the audio feature extraction part. I'm guessing that one could use a vector of zeros for the audio features. Note that we have not tested this, so we cannot say anything about the impact on performance.
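A minimal sketch of the zero-vector workaround, assuming the model expects YouTube-8M-style 128-dimensional audio embeddings per frame (the frame count below is just an illustrative placeholder):

```python
import numpy as np

# Assumed dimensions: YouTube-8M audio embeddings are 128-dim per frame.
# The number of frames here is an arbitrary placeholder for one video.
num_frames = 300
audio_dim = 128

# Stand-in audio features: all zeros, matching the expected shape/dtype.
audio_features = np.zeros((num_frames, audio_dim), dtype=np.float32)

print(audio_features.shape)  # (300, 128)
```

These zero vectors could then be concatenated with the video features wherever the model normally consumes the audio stream; as noted above, the effect on accuracy is untested.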

Oktai15 (Author) commented Mar 16, 2019

Thank you, @miha-skalic!

@Oktai15 Oktai15 closed this as completed Mar 16, 2019
ideaRunner commented Apr 18, 2019

Regarding the audio feature extraction, I found this code: https://github.com/tensorflow/models/tree/master/research/audioset#output-embeddings
There is also someone who has used it and gotten good results: antoine77340/Youtube-8M-WILLOW#28

The released AudioSet embeddings were postprocessed before release by applying a PCA transformation (which performs both PCA and whitening) as well as quantization to 8 bits per embedding element. This was done to be compatible with the YouTube-8M project which has released visual and audio embeddings for millions of YouTube videos in the same PCA/whitened/quantized format.
We provide a Python implementation of the postprocessing which can be applied to batches of embeddings produced by VGGish. vggish_inference_demo.py shows how the postprocessor can be run after inference.
If you don't need to use the released embeddings or YouTube-8M, then you could skip postprocessing and use raw embeddings.
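The postprocessing described above can be sketched in a few lines. This is only a conceptual illustration, not the released implementation: the PCA matrix, means, and clipping range below are placeholders (in the real AudioSet release these parameters ship with the model files, and the `Postprocessor` in `vggish_postprocess.py` applies them):

```python
import numpy as np

dim = 128  # VGGish/YouTube-8M embedding dimensionality

# Placeholder PCA/whitening parameters; the real ones are fitted offline
# and distributed alongside the VGGish checkpoint.
pca_matrix = np.eye(dim, dtype=np.float32)
pca_means = np.zeros((dim, 1), dtype=np.float32)

# Illustrative clipping range applied before 8-bit quantization.
quant_min, quant_max = -2.0, 2.0

def postprocess(embeddings):
    """Apply PCA/whitening, clip, and quantize embeddings to uint8."""
    # PCA + whitening: subtract the mean, project with the fitted matrix.
    x = np.dot(pca_matrix, (embeddings.T - pca_means)).T
    # Clip to the expected range, then map linearly onto [0, 255].
    x = np.clip(x, quant_min, quant_max)
    q = (x - quant_min) * (255.0 / (quant_max - quant_min))
    return q.astype(np.uint8)

rng = np.random.default_rng(0)
raw = rng.standard_normal((10, dim)).astype(np.float32)
quantized = postprocess(raw)
print(quantized.shape, quantized.dtype)  # (10, 128) uint8
```

As the quoted README notes, this step is only needed for compatibility with the released YouTube-8M embeddings; raw VGGish embeddings can be used directly otherwise.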
