Chrome Extension
WeChat Mini Program
Use on ChatGLM

OpenPifPaf: Composite Fields for Semantic Keypoint Detection and Spatio-Temporal Association

IEEE Transactions on Intelligent Transportation Systems(2022)

Ecole Polytech Fed Lausanne

Cited 94|Views36
Abstract
Many image-based perception tasks can be formulated as detecting, associating and tracking semantic keypoints, e.g. , human body pose estimation and tracking. In this work, we present a general framework that jointly detects and forms spatio-temporal keypoint associations in a single stage, making this the first real-time pose detection and tracking algorithm. We present a generic neural network architecture that uses Composite Fields to detect and construct a spatio-temporal pose which is a single, connected graph whose nodes are the semantic keypoints ( e.g ., a person’s body joints) in multiple frames. For the temporal associations, we introduce the Temporal Composite Association Field (TCAF) which requires an extended network architecture and training method beyond previous Composite Fields. Our experiments show competitive accuracy while being an order of magnitude faster on multiple publicly available datasets such as COCO, CrowdPose and the PoseTrack 2017 and 2018 datasets. We also show that our method generalizes to any class of semantic keypoints such as car and animal parts to provide a holistic perception framework that is well suited for urban mobility such as self-driving cars and delivery robots.
More
Translated text
Key words
Pose estimation,Automobiles,Animals,Semantics,Autonomous automobiles,Task analysis,Three-dimensional displays,Composite fields,pose estimation,pose tracking
PDF
Bibtex
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Related Papers
Data Disclaimer
The page data are from open Internet sources, cooperative publishers and automatic analysis results through AI technology. We do not make any commitments and guarantees for the validity, accuracy, correctness, reliability, completeness and timeliness of the page data. If you have any questions, please contact us by email: report@aminer.cn
Chat Paper

要点】:本文提出了一种名为OpenPifPaf的框架,通过使用复合字段在单个阶段实现语义关键点的检测和时空关联,实现了实时姿态检测和跟踪,并扩展到不同类别对象的关键点检测。

方法】:作者设计了一个通用的神经网络架构,采用复合字段检测并构建时空姿态图,图中节点为多帧中的语义关键点,并引入了时间复合关联字段(TCAF)来处理时间上的关联。

实验】:研究在COCO、CrowdPose以及PoseTrack 2017和2018等多个公开数据集上进行了测试,结果表明该方法在保持竞争力准确度的同时,速度提高了几个数量级。