PGNet(Pyramid Graph Networks)[1] modeled structured segmentation data with
graphsand further proposed apyramid-likestructure that models different sizes of image regions as graph nodes. There are some details of reading and implementing it.
PV-RCNN[1] is a 3D Object Detection framework to integrate
3D voxel CNNandPointNet-based set abstractionto learn more discriminative point cloud features. The most contributions in this papar is two-stage strategy including thevoxel-to-keypoint3D scene encoding and thekeypoint-to-gridRoI feature abstraction. There are some details of reading and implementing it.
FairMOT[1] is a one-shot tracker to fuse object detection and re-identification in a single network. The most contributions in this papar are
anchor-freeRe-ID feture extraction, multi-layerfeature aggregationandlower-dimensionalre-ID fetures. There are some details of reading and implementing it.
Image Transformer[1] is a sequence modeling formulation of image generation generalized by
Transformer, which restricting the self-attention mechanism to attend to local neighborhoods, while maintaininglarge receptive field. There are some details of reading and implementing it.
Update your browser to view this website correctly. Update my browser now