Chrome Extension

WeChat Mini Program

Use on ChatGLM

Log in

Academic Profile User Profile

My Following Paper Collections Browse History

OpenPifPaf: Composite Fields for Semantic Keypoint Detection and Spatio-Temporal Association

Sven Kreiss,Lorenzo Bertoni,Alexandre Alahi

IEEE Transactions on Intelligent Transportation Systems（2022）

Ecole Polytech Fed Lausanne

Cited 94|Views36

Abstract

Many image-based perception tasks can be formulated as detecting, associating and tracking semantic keypoints, e.g. , human body pose estimation and tracking. In this work, we present a general framework that jointly detects and forms spatio-temporal keypoint associations in a single stage, making this the first real-time pose detection and tracking algorithm. We present a generic neural network architecture that uses Composite Fields to detect and construct a spatio-temporal pose which is a single, connected graph whose nodes are the semantic keypoints ( e.g ., a person’s body joints) in multiple frames. For the temporal associations, we introduce the Temporal Composite Association Field (TCAF) which requires an extended network architecture and training method beyond previous Composite Fields. Our experiments show competitive accuracy while being an order of magnitude faster on multiple publicly available datasets such as COCO, CrowdPose and the PoseTrack 2017 and 2018 datasets. We also show that our method generalizes to any class of semantic keypoints such as car and animal parts to provide a holistic perception framework that is well suited for urban mobility such as self-driving cars and delivery robots.

More

Translated text

Key words

Pose estimation,Automobiles,Animals,Semantics,Autonomous automobiles,Task analysis,Three-dimensional displays,Composite fields,pose estimation,pose tracking

Bibtex

AI Read Science

Must-Reading Tree

Example

Generate MRT to find the research sequence of this paper

Related Papers

Reference papers

Cited Papers

Microsoft COCO: Common Objects in Context

Tsung-Yi Lin,Michael Maire,Serge Belongie,James Hays,Pietro Perona,Deva Ramanan,Piotr Dollar,C. Lawrence Zitnick

2014

被引用58932 | 浏览

DeepPose: Human Pose Estimation Via Deep Neural Networks

Alexander Toshev,Christian Szegedy

2014

被引用4240 | 浏览

2D Human Pose Estimation: New Benchmark and State of the Art Analysis

Mykhaylo Andriluka,Leonid Pishchulin,Peter Gehler,Bernt Schiele

2014

被引用3552 | 浏览

Beyond PASCAL: A Benchmark for 3D Object Detection in the Wild

Yu Xiang,Roozbeh Mottaghi,Silvio Savarese

2014

被引用999 | 浏览

Large-Scale Machine Learning with Stochastic Gradient Descent

2010

被引用8244 | 浏览

Stacked Hourglass Networks for Human Pose Estimation.

Alejandro Newell,Kaiyu Yang,Jia Deng

2016

被引用6868 | 浏览

Deep Residual Learning for Image Recognition

Kaiming He,Xiangyu Zhang,Shaoqing Ren,Jian Sun

2016

被引用268379 | 浏览

DeepCut: Joint Subset Partition and Labeling for Multi Person Pose Estimation.

Leonid Pishchulin,Eldar Insafutdinov,Siyu Tang,Bjoern Andres,Mykhaylo Andriluka,Peter Gehler,Bernt Schiele

2016

被引用1439 | 浏览

Optimizing Distributed Actor Systems for Dynamic Interactive Services

Andrew Newell,Gabriel Kliot,Ishai Menache,Aditya Gopalan,Soramichi Akiyama,Mark Silberstein

2016

被引用602 | 浏览

Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network

Wenzhe Shi,Jose Caballero,Ferenc Huszar,Johannes Totz,Andrew P. Aitken,Rob Bishop,Daniel Rueckert,Zehan Wang

2016

被引用8199 | 浏览

Towards Accurate Multi-person Pose Estimation in the Wild.

George Papandreou,Tyler Zhu,Nori Kanazawa,Alexander Toshev,Jonathan Tompson,Chris Bregler,Kevin Murphy

2017

被引用1139 | 浏览

PersonLab: Person Pose Estimation and Instance Segmentation with a Bottom-Up, Part-Based, Geometric Embedding Model

George Papandreou,Tyler Zhu,Liang-Chieh Chen,Spyros Gidaris,Jonathan Tompson,Kevin Murphy

2018

被引用818 | 浏览

Simple Baselines for Human Pose Estimation and Tracking

Bin Xiao,Haiping Wu,Yichen Wei

被引用2523 | 浏览

MultiPoseNet: Fast Multi-Person Pose Estimation Using Pose Residual Network

Muhammed Kocabas,Salih Karagoz,Emre Akbas

2018

被引用392 | 浏览

ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design

Ningning Ma,Xiangyu Zhang,Hai-Tao Zheng,Jian Sun

2018

被引用7546 | 浏览

CarFusion: Combining Point Tracking and Part Detection for Dynamic 3D Reconstruction of Vehicles

N. Dinesh Reddy,Minh Vo,Srinivasa G. Narasimhan

2018

被引用97 | 浏览

Lions and Tigers and Bears: Capturing Non-Rigid, 3D, Articulated Shape from Images.

Silvia Zuffi,Angjoo Kanazawa,Michael J. Black

2018

被引用150 | 浏览

ApolloCar3D: A Large 3D Car Instance Understanding Benchmark for Autonomous Driving

Xibin Song,Peng Wang,Dingfu Zhou,Rui Zhu,Chenye Guan,Yuchao Dai,Hao Su,Hongdong Li,Ruigang Yang

2019

被引用223 | 浏览

Efficient Online Multi-Person 2D Pose Tracking with Recurrent Spatio-Temporal Affinity Fields

Yaadhav Raaj,Haroon Idrees,Gines Hidalgo,Yaser Sheikh

2018

被引用152 | 浏览

DeepLabCut: Markerless Pose Estimation of User-Defined Body Parts with Deep Learning

Alexander Mathis,Pranav Mamidanna,Kevin M. Cury,Taiga Abe,Venkatesh N. Murthy,Mackenzie Weygandt Mathis,Matthias Bethge

2018

被引用4583 | 浏览

Fast Animal Pose Estimation Using Deep Neural Networks.

Talmo D. Pereira,Diego E. Aldarondo,Lindsay Willmore,Mikhail Kislin,Samuel S.-H. Wang,Mala Murthy,Joshua W. Shaevitz

2018

被引用448 | 浏览

CrowdPose: Efficient Crowded Scenes Pose Estimation and a New Benchmark

Jiefeng Li,Can Wang,Hao Zhu,Yihuan Mao,Hao-Shu Fang,Cewu Lu

2018

被引用694 | 浏览

Trajectories of Sleep Problems among Adolescents after the Wenchuan Earthquake: the Role of Posttraumatic Stress Disorder Symptoms.

Xiao Zhou,Rui Zhen,Xinchun Wu

2019

被引用642 | 浏览

Occlusion-Net: 2D/3D Occluded Keypoint Localization Using Graph Networks.

N. Dinesh Reddy,Minh Vo,Srinivasa G. Narasimhan

2019

被引用91 | 浏览

Spatiotemporal Trends in Reference Evapotranspiration over South Korea

Ju Ha Hwang,Muhammad Azam,Maeng Seung Jin, Yong Ho Kang,Jae Eun Lee,Muhammad Latif,Rehan Ahmed,Muhammad Umar,Muhammad Zaffar Hashmi

2019

被引用8 | 浏览

Learning from Synthetic Animals

Jiteng Mu,Weichao Qiu,Gregory Hager,Alan Yuille

2020

被引用150 | 浏览

SMOKE: Single-Stage Monocular 3D Object Detection Via Keypoint Estimation

Zechen Liu,Zizhang Wu,Roland Toth

2020

被引用473 | 浏览

A Novel Finite-Time Control for Nonstrict Feedback Saturated Nonlinear Systems with Tracking Error Constraint

Kangkang Sun,Jianbin Qiu,Hamid Reza Karimi,Huijun Gao

2019

被引用1063 | 浏览

Who Left the Dogs Out? 3D Animal Reconstruction with Expectation Maximization in the Loop.

Benjamin Biggs,Oliver Boyne,James Charles,Andrew Fitzgibbon,Roberto Cipolla

2020

被引用123 | 浏览

GSNet: Joint Vehicle Pose and Shape Reconstruction with Geometrical and Scene-Aware Supervision.

Lei Ke,Shichao Li,Yanan Sun,Yu-Wing Tai,Chi-Keung Tang

2020

被引用63 | 浏览

Data Disclaimer

The page data are from open Internet sources, cooperative publishers and automatic analysis results through AI technology. We do not make any commitments and guarantees for the validity, accuracy, correctness, reliability, completeness and timeliness of the page data. If you have any questions, please contact us by email: report@aminer.cn

Chat Paper

【要点】：本文提出了一种名为OpenPifPaf的框架，通过使用复合字段在单个阶段实现语义关键点的检测和时空关联，实现了实时姿态检测和跟踪，并扩展到不同类别对象的关键点检测。

【方法】：作者设计了一个通用的神经网络架构，采用复合字段检测并构建时空姿态图，图中节点为多帧中的语义关键点，并引入了时间复合关联字段（TCAF）来处理时间上的关联。

【实验】：研究在COCO、CrowdPose以及PoseTrack 2017和2018等多个公开数据集上进行了测试，结果表明该方法在保持竞争力准确度的同时，速度提高了几个数量级。

去 AI 文献库对话