Akira's Machine Learning news -- #26
In the following sections, I will introduce various articles and papers not only on the above contents but also on the following five topics. MERLOT: Multimodal Neural Script Knowledge Models Using as much as 6 million video data and accompanying subtitles, MERIOT is proposed to perform self-supervised learning on both temporal and spatial tasks. It does not use any label information but can achieve SotA performance. Also, the accuracy of the pre-training continues to increase even with 6 million data, which is considered a promising research direction for the future.
Sep-1-2021, 05:46:31 GMT
- Technology: