Online Spatiotemporal Action Detection and Prediction via Causal Representations

Aug-31-2020–arXiv.org Artificial Intelligence

In this thesis, we focus on video action understanding problems from an online and real-time processing point of view. We start with the conversion of the traditional offline spatiotemporal action detection pipeline into an online spatiotemporal action tube detection system. An action tube is a set of bounding connected over time, which bounds an action instance in space and time. Next, we explore the future prediction capabilities of such detection methods by extending the an existing action tube into the future by regression. Later, we seek to establish that online/causal representations can achieve similar performance to that of offline three dimensional (3D) convolutional neural networks (CNNs) on various tasks, including action recognition, temporal action segmentation and early prediction.

artificial intelligence, deep learning, machine learning, (18 more...)

arXiv.org Artificial Intelligence

Aug-31-2020

arXiv.org PDF

Add feedback

Country:
- North America
  - Canada (0.04)
  - United States
    - New York (0.04)
    - Utah > Salt Lake County
      - Salt Lake City (0.04)
- Europe
  - Czechia > Prague (0.04)
  - United Kingdom > England
    - Oxfordshire > Oxford (0.04)
  - Spain > Galicia
    - Madrid (0.04)
  - Netherlands > North Holland
    - Amsterdam (0.04)
  - France > Auvergne-Rhône-Alpes
    - Isère > Grenoble (0.04)
- Asia
  - Middle East > Jordan (0.04)
  - India > Karnataka
    - Bengaluru (0.04)
  - China > Beijing
    - Beijing (0.04)

Genre:
- Workflow (1.00)
- Overview (0.92)
- Summary/Review (0.92)
- Research Report > New Finding (0.67)

Industry:
- Information Technology (1.00)
- Health & Medicine (0.92)
- Media (0.67)
- Leisure & Entertainment > Sports (0.45)

Technology:
- Information Technology
  - Communications > Networks (1.00)
  - Sensing and Signal Processing > Image Processing (0.92)
  - Artificial Intelligence
    - Vision (1.00)
    - Robots (1.00)
    - Representation & Reasoning (1.00)
    - Cognitive Science (1.00)
    - Machine Learning
      - Statistical Learning (1.00)
      - Neural Networks > Deep Learning (1.00)
      - Learning Graphical Models > Undirected Networks
        Markov Models (0.46)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found