Skip to content

Commit

Permalink
[Docs] (CodeCamp #76) translate add_transforms.md and conventions.md (#…
Browse files Browse the repository at this point in the history
…674)

* translate add_transforms.md and conventions.md

* translate add_transforms.md and conventions.md

* update add_translate.md format

* update add_translate.md format

* update

Co-authored-by: fangyixiao18 <fangyx18@hotmail.com>
  • Loading branch information
Caczhtus and fangyixiao18 authored Jan 16, 2023
1 parent 386f66a commit c951bb6
Showing 3 changed files with 181 additions and 4 deletions.
4 changes: 2 additions & 2 deletions docs/en/advanced_guides/add_transforms.md
Original file line number Diff line number Diff line change
@@ -66,11 +66,11 @@ class NewTransform(BaseTransform):
return results
```

**Note:** For the implementation of transforms, you could apply functions in [mmcv](https://github.com/open-mmlab/mmcv/tree/dev-2.x/mmcv/image).
**Note:** For the implementation of transforms, you could apply functions in [mmcv](https://github.com/open-mmlab/mmcv/tree/2.x/mmcv/image).

### Step 2: Add NewTransform to \_\_init\_\_py

Then, add the transform to [\_\_init\_\_.py](https://github.com/open-mmlab/mmselfsup/blob/1.x/mmselfsup/datasets/transforms/__init__.py).
Then, add the transform to [\_\_init\_\_.py](https://github.com/open-mmlab/mmselfsup/blob/dev-1.x/mmselfsup/datasets/transforms/__init__.py).

```python
...
119 changes: 118 additions & 1 deletion docs/zh_cn/advanced_guides/add_transforms.md
Original file line number Diff line number Diff line change
@@ -1 +1,118 @@
# Add Transforms
# 添加数据变换

在本教程中, 我们将介绍创建自定义转换的基本步骤。在学习创建自定义转换之前, 建议先了解文件 [transforms.md](transforms.md) 中转换的基本概念。

- [添加数据变换](#添加数据变换)
- [管道概述](#管道概述)
- [在管道中创建新转换](#在管道中创建新转换)
- [步骤 1: 创建转换](#步骤-1-创建转换)
- [步骤 2: 将新转换添加到 \_\_init\_\_py](#步骤-2-将新转换添加到-__init__py)
- [步骤 3: 修改配置文件](#步骤-3-修改配置文件)

## 管道概述

`Dataset` 中, `Pipeline` 是中的一个重要组件, 主要负责对图像应用一系列数据增强, 例如: `RandomResizedCrop`, `RandomFlip` 等操作。

以下代码是 `Pipeline` 用于 `SimCLR` 训练的配置示例:

```python
view_pipeline = [
dict(type='RandomResizedCrop', size=224, backend='pillow'),
dict(type='RandomFlip', prob=0.5),
dict(
type='RandomApply',
transforms=[
dict(
type='ColorJitter',
brightness=0.8,
contrast=0.8,
saturation=0.8,
hue=0.2)
],
prob=0.8),
dict(
type='RandomGrayscale',
prob=0.2,
keep_channels=True,
channel_weights=(0.114, 0.587, 0.2989)),
dict(type='RandomGaussianBlur', sigma_min=0.1, sigma_max=2.0, prob=0.5),
]

train_pipeline = [
dict(type='LoadImageFromFile', file_client_args=file_client_args),
dict(type='MultiView', num_views=2, transforms=[view_pipeline]),
dict(type='PackSelfSupInputs', meta_keys=['img_path'])
]
```

在这个 `Pipeline` 中, 每个数据增强接收一个 `dict` , 它们作为输入和输出时刻, 包含图像增强以及其他相关信息的 `dict`

## 在管道中创建新转换

以下是创建新转换的步骤。

### 步骤 1: 创建转换

[processing.py](https://github.com/open-mmlab/mmselfsup/tree/dev-1.x/mmselfsup/datasets/transforms/processing.py) 中编写一个新的转换类, 并在类中覆盖这个 `transform` 函数, 这个函数接收一个 `dict` 的对象, 并返回一个 `dict` 对象

```python
@TRANSFORMS.register_module()
class NewTransform(BaseTransform):
"""Docstring for transform.
"""

def transform(self, results: dict) -> dict:
# apply transform
return results
```

**注意**: 对于这些转换的实现, 您可以应用 [mmcv](https://github.com/open-mmlab/mmcv/tree/2.x/mmcv/image) 中的函数。

### 步骤 2: 将新转换添加到 \_\_init\_\_py

然后, 将转换添加到 [\_\_init\_\_.py](https://github.com/open-mmlab/mmselfsup/blob/dev-1.x/mmselfsup/datasets/transforms/__init__.py)

```python
...
from .processing import NewTransform, ...

__all__ = [
..., 'NewTransform'
]
```

### 步骤 3: 修改配置文件

要使用新添加的 `NewTransform`, 你可以按以下的方式修改配置文件:

```python
view_pipeline = [
dict(type='RandomResizedCrop', size=224, backend='pillow'),
dict(type='RandomFlip', prob=0.5),
# add `NewTransform`
dict(type='NewTransform'),
dict(
type='RandomApply',
transforms=[
dict(
type='ColorJitter',
brightness=0.8,
contrast=0.8,
saturation=0.8,
hue=0.2)
],
prob=0.8),
dict(
type='RandomGrayscale',
prob=0.2,
keep_channels=True,
channel_weights=(0.114, 0.587, 0.2989)),
dict(type='RandomGaussianBlur', sigma_min=0.1, sigma_max=2.0, prob=0.5),
]

train_pipeline = [
dict(type='LoadImageFromFile', file_client_args=file_client_args),
dict(type='MultiView', num_views=2, transforms=[view_pipeline]),
dict(type='PackSelfSupInputs', meta_keys=['img_path'])
]
```
62 changes: 61 additions & 1 deletion docs/zh_cn/advanced_guides/conventions.md
Original file line number Diff line number Diff line change
@@ -1 +1,61 @@
# Conventions
# 约定

- [约定](#约定)
- [损失](#损失)

如果您想将 MMSelfSup 修改为您自己的项目, 请检查以下约定。

## 损失

当算法实现时, 函数 `loss` 返回的损失应该是 `dict` 类型。

举个 `MAE` 的例子:

```python
class MAE(BaseModel):
"""MAE.
Implementation of `Masked Autoencoders Are Scalable Vision Learners
<https://arxiv.org/abs/2111.06377>`_.
"""

def extract_feat(self, inputs: List[torch.Tensor],
**kwarg) -> Tuple[torch.Tensor]:
...

def loss(self, inputs: List[torch.Tensor],
data_samples: List[SelfSupDataSample],
**kwargs) -> Dict[str, torch.Tensor]:
"""The forward function in training.
Args:
inputs (List[torch.Tensor]): The input images.
data_samples (List[SelfSupDataSample]): All elements required
during the forward function.
Returns:
Dict[str, torch.Tensor]: A dictionary of loss components.
"""
# ids_restore: the same as that in original repo, which is used
# to recover the original order of tokens in decoder.
latent, mask, ids_restore = self.backbone(inputs[0])
pred = self.neck(latent, ids_restore)
loss = self.head(pred, inputs[0], mask)
losses = dict(loss=loss)
return losses

```

`MAE` 模型正向传播期间, 这个 `MAE.loss()` 函数将被调用用于计算损失并返回这个损失值。

默认情况下, 只有 `dict` 中的键包含的 `loss` 值时, 才会进行反向传播, 如果你的算法需要多个损失值, 你可以用多个键打包损失字典。

```python
class YourAlgorithm(BaseModel):

def loss():
...

losses['loss_1'] = loss_1
losses['loss_2'] = loss_2
```

0 comments on commit c951bb6

Please sign in to comment.