Tracklet

Tracklet

Tracklet[1] is a novel method for optimizing tracklet consistency, which directly takes the prediction errors into account. There are some details of reading and implementing it.

Contents


Paper & Code & note


Paper: Multi-object Tracking via End-to-end Tracklet Searching and Ranking(arXiv 2020 paper)
Code: [Pytorch][Updating]
Note: Tracklet

Paper


Abstract

Tracklet_Abstract.png

  1. Recent work use sequence model to calculate the similarity score between the detections and the previous tracklets, but the forced exposure to ground-truth in the training stage leads to the training-inference discrepancy problem.
  2. This paper directly takes the prediction errors into account to optimize tracklet consistency.
  3. It havs achieved state-of-the-art in MOT15-17 challenge benchmarks using public detection and online settings.

Problem Description

Tracklet_PD1.png
Tracklet_PD2.png

  1. pairwise-detection matching based on affinity model: It has limited capability to associate long-term consistent trajectories.
  2. affinity model on sequence model: tracklet representative feature for matching can somewhat be ill-posed and ideal assumption brings up a potential vulnerability.

Problem Solution

Tracklet_PS.png

  1. They propose a global score to measure the inner appearance consistency of tracklet.
  2. It optimizes the whole tracklet with a margin loss.
  3. a novel algorithm has been established to simulate the prediction data distribution on training by introducing realistic discombobulated candidates to model.

Conceptual Understanding

Tracklet_CU.png

  1. Tracklet-level based tracking: It constructs an affinity model on the tracklet level and then uses it to associate the tracklet with detection or connect short tracklets.
  2. Pair-wise association methods: They establish an affinity model on the isolated detections, and then generate tracking results from the bottom up.
    The common concern of these two types of methods is to guarantee the consistency of the entire associated trajectories.

Core Conception

Tracklet_Network.png

  1. Training procedure: It follows a “searching-learning-ranking-pruning” pipeline.
  2. Scoring Network: The appearance feature of each detection are extracted with CNN(ResNet-50), and the appearance embedding of tracklet are obtained through encoder(LSTM).
  3. It trained by online hypothesis tracklet searching with margin loss and rank loss, details as follow.

Tracklet_CC.png

Experiments

Tracklet_Results.png

They report the quantitative results on the three datasets in MOT Challenge Benchmark.

Code


Tracklet_SBTO.png

Note


More details of Tracklet optimization and the like can be found in [2].

References


[1] Hu, T., Huang, L., & Shen, H. (2020). Multi-object Tracking via End-to-end Tracklet Searching and Ranking. arXiv preprint arXiv:2003.02795.
[2] Change_ZH. “Tracklet: MOT Scoring Network.” https://blog.csdn.net/qq_36449741/article/details/104815321?depth_1-utm_source=distribute.pc_relevant.none-task&utm_source=distribute.pc_relevant.none-task.


  DLMOTTracking

Comments

Your browser is out-of-date!

Update your browser to view this website correctly. Update my browser now

×