Sequence to Sequence - Video to Text Sequence to Sequence - Video to Text
Paper summary It is a nice paper on video captioning. They exploit LSTM ability to learn long term dependencies to modeling the problem of translating video sequence to language sequence. The new thing in this paper is that they have two LSTM layers for modeling frames in videos and also words in sentences.

Summary by beahacker 4 years ago
Your comment: allows researchers to publish paper summaries that are voted on and ranked!

Sponsored by: and