源码实现-Normalization

2026-05-08 2026-05-08

DeepLearning / PyTorch

12 minutes read (About 1756 words)

本文整理 BatchNorm / LayerNorm / RMSNorm 的作用与差异，并给出与 PyTorch 思路一致的简化实现（dummy），便于对照官方源码阅读。

normalization, pytorch, source_code

源码实现-MobileNet

2024-03-26 2026-05-09

DeepLearning / PyTorch

6 minutes read (About 872 words)

本文整理 MobileNetV1 / V2 （Depthwise Separable、Inverted Residual）中与标准卷积的差异，并给出与常见实现思路一致的 PyTorch 极简模块，便于对照 timm 等源码阅读。

mobilenet, pytorch, source_code

PPNet

2020-12-02 2023-03-10

DeepLearning / Few-Shot Segmentation / PPNet

2 minutes read (About 322 words)

PPNet(Part-aware Prototype Network for Few-shot Semantic Segmentation)[1] decompose the holistic class representation into a set of part-aware prototypes, and leverage unlabeled data to better modeling of intra-class variations. Besides, graph neural network model is used to generate and enhance the proposed part-aware prototypes. There are some details of reading and implementing it.

DL, FSS, PPNet

PANet

2020-12-02 2023-03-10

DeepLearning / Few-Shot Segmentation / PANet

2 minutes read (About 299 words)

PANet(PANet: Few-Shot Image Semantic Segmentation with Prototype Alignment)[1] learns class-specific prototype representations for images and matches each pixel to the learned prototypes. There are some details of reading and implementing it.

DL, FSS, PANet

CANet

2020-10-20 2023-03-10

DeepLearning / Few-Shot Segmentation / CANet

3 minutes read (About 503 words)

CANet(CANet: Class-Agnostic Segmentation Networks with Iterative Refinement and Attentive Few-Shot Learning)[1] consists of a two-branch dense comparison module which performs multi-level feature comparison, and an iterative optimization module which iteratively refines the predicted results. There are some details of reading and implementing it.

CANet, DL, FSS

SG-One

2020-10-20 2023-03-10

DeepLearning / Few-Shot Segmentation / SG-One

3 minutes read (About 483 words)

SG-One(SG-One: Similarity Guidance Network for One-Shot Semantic Segmentation)[1] adopt a masked average pooling strategy for producing the guidance features, then leverage the cosine similarity to build the relationship. There are some details of reading and implementing it.

DL, FSS, SG-One

co-FCN

2020-10-19 2023-03-10

DeepLearning / Few-Shot Segmentation / co-FCN

2 minutes read (About 306 words)

co-FCN(Conditional Networks for Few-Shot Semantic Segmentation)[1] handle sparse pixel-wise annotations to achieve nearly the same accuracy. There are some details of reading and implementing it.

DL, FSS, co-FCN

OSLSM

2020-10-19 2023-03-10

DeepLearning / Few-Shot Segmentation / OSLSM

3 minutes read (About 382 words)

OSLSM(One-Shot Learning for Semantic Segmentation)[1] firstly proposed two-branch approach to one-shot semantic segmentation. Conditioning branch trains a network to get parameter $\theta$, and Segmentaion branch outputs the final mask based on parameter $\theta$. There are some details of reading and implementing it.

DL, FSS, OSLSM

Mask R-CNN

2020-08-17 2023-03-10

DeepLearning / Object Segmentation / Mask R-CNN

3 minutes read (About 428 words)

Mask R-CNN[1] is a framework for object instance segmentation, which adds a branch for predicting an object mask in parallel with the existing branch for bounding box recognition of Faster R-CNN. There are some details of reading and implementing it.

DL, Mask R-CNN, Segmentation

LTM

2020-07-29 2023-03-10

DeepLearning / Few-Shot Segmentation / LTM

3 minutes read (About 478 words)

LTM(Local Transformation Module)[1] focus on the relationship of the local features. It uses linear transformation of the relationship matrix in a high-dimensional metric embedding space to accomplish the transformation. There are some details of reading and implementing it.

DL, FSS, LTM

源码实现-Normalization

源码实现-MobileNet

PPNet

PANet

CANet

SG-One

co-FCN

OSLSM

Mask R-CNN

LTM

Tag Cloud

Categories

Recent

Archives

Recent

Archives

Your browser is out-of-date!