DeepGlobe Land Cover Classification Challenge 土地利用分类竞赛

English | 简体中文

数据集

数据

训练数据集包括803张卫星图片，RGB格式，尺寸2448 * 2448
图像分辨率为50cm，由 DigitalGlobe's 卫星提供
可通过下载页点击"Starting Kit"下载数据。(根据主办方的版权声明，下载数据请到原竞赛网页，即使竞赛已经结束，仍然可以参加)

标注

每张卫星图片有一张与之对应的标注图片。这张标注图片也是RGB格式，一共分为7类，每类对应的图像(R,G,B)编码对应关系如下：
- 城市土地： 0，255，255，浅蓝色，人造建筑（可以忽略道路）
- 农业用地： 255，255，0，黄色，农田，任何计划中（定期）的种植、农田、果园、葡萄园、苗圃、观赏性园艺以及养殖区
- 牧场： 255，0，255，紫色，除了森林，农田之外的绿色土地，草地
- 森林：0，255，0，绿色，任何土地上有x%的树冠密度。
- 水系：0，0，255，深蓝色，江河湖海湿地
- 荒地：255，255，255, 白色，山地，沙漠，戈壁，沙滩，没有植被的地方
- 未知土地： 0，0，0，黑色，云层遮盖或其他因素
卫星图片和与其对应的标注图片的命名格式为 id_sat.jpg和id_mask.png,id 是一个随机的整数。
需要注意：
- 由于压缩，标注图像的值可能不是准确的目标颜色值。当转换到标签时，请将每个R/G/B通道按128阈值二值化。
- 高分辨率卫星图像的土地覆被分割仍然是一个探索性的任务，由于标注多类分割的代价很大，标签还远远不够完善。此外，我们故意不标注道路，因为它已经包含在一个单独的道路提取挑战赛中。

评价指标

我们将采用像素级别的平均IoU分数作为评价指标。
- IoU的定义是：交集/并集，公式：预测准确的面积/(预测准确面积 + 没有预测出来的面积 + 预测错误的面积)
- 平均IoU由各个类别的IoU取均值得到
需要注意的是"未知土地"(0,0,0)并不是一个真实的类别，在这个计算中也没有起到作用。所以有效的mIoU是前6个类别IoU的均值。

结果展示

致谢

rishizek's repo tensorflow-deeplab-v3-plus
DeepGlobe_2018_A_CVPR_2018_paper
deepglobe.

如何复现此工作

文件结构组织

deepglobe_land
- dataset
  - land_train (存放下载下来的原始数据 )
  - onechannel_label (内部数据通过运行 rgb2label.py 生成)
  - voc_train_all.record (通过运行 create_tf_record_all.py 生成 )
- ini_checkpoints
  - resnet_v2_101 (resnet_v2_101.ckpt and train.graph)
- utils (工具包)
- rgb2label.py
- create_tf_record_all.py
- deeplab_model.py

代码执行顺序与注意事项

reg2label.py 数据标注文件 id_mask.png 是 RGB 三通道图像，输入训练集的时候必须首先转为单通道图像，故该代码用于执行，图像到单通道图像的转换，此处原始卫星图像 id_sat.jpg, 标注图像 id_mask.png，单通道标注图像 id_label.png 。 这段代码可以作为一段工具代码在其他地方使用。

代码中
```
 rgb_label_path = 'dataset/land_train'
 onechannel_path = 'dataset/onechannel_label'
```
这两行指定了输入输出路径。

create_tf_record_all.py 建立 tfrecord 文件，该代码执行后，直接将 land_train文件夹里的所有 id_sat.jpg 文件与 onechannel_label 里的 id_label 文件打包生成一个 voc_train_all.record文件，该文件中包含了数据集和验证集，不需要手动区分数据集验证集，训练过程中会随机选择一部分作为数据集，另一部分作为验证集。

parser.add_argument('--data_dir', type=str, default='./dataset/',
                    help='Path to the directory containing the PASCAL VOC data.')

parser.add_argument('--output_path', type=str, default='./dataset',
                    help='Path to the directory to create TFRecords outputs.')

parser.add_argument('--image_data_dir', type=str, default='land_train',
                    help='The directory containing the image data.')

parser.add_argument('--label_data_dir', type=str, default='onechannel_label',
                    help='The directory containing the augmented label data.')

这一段设置了默认的输入输出路径，如需修改可直接改源代码，或在命令行指定参数。

train.py 需要在里面进行一些参数设置，类别，训练集合验证集的数据比，这里我们设置训练集723张图片，验证集80张。 参数解释 epoch:

In Deep Learning, an epoch is a hyperparameter which is defined before training a model. One epoch is when an entire dataset is passed both forward and backward through the neural network only once.

1个 epoch 意味着所有数据通过神经网络一遍 batch_size:

A batch is the total number of training examples present in a single batch and an iteration is the number of batches needed to complete one epoch.

训练数据不可能一次性全部通过神经网络，所以需要分为不同的 batch。batch_size 大小将影响占用内存的数量。 iteration:

If we divide a dataset of 2000 training examples into 500 batches, then 4 iterations will complete 1 epoch

迭代几个循环

max_iter:

最大迭代次数，也是训练停止的条件。

learning rate:

Learning rate might be the most important hyper parameter in deep learning, as learning rate decides how much gradient to be back propagated.

学习率，决定模型收敛速度的一个重要超参数，学习率大，下降快，但可能陷入震荡，学习率小，下降慢，却容易获得收敛。

learning_rate_policy: 学习率函数，可选参数为 poly(polynomial function 多项式函数), piece wise
inference.py 将模型结果应用到新的图片上，实现分割。

支撑代码

utils 自定义的 package，包含两个工具文件
deeplab_model.py deeplab模型基本不用动函数 atrous_spatial_pyramid_pooling:空洞卷积空间金字塔池化 atrous_rate 空洞卷积膨胀率，空洞卷积的核心指标。

函数 deeplab_v3_plus_generator deeplabv3+ 网络生成器

函数 deeplabv3_plus_model_fn 模型方程
tensorboard 可视化
```
tensorboard --logdir MODEL_DIR
```
如果在服务器上运行，需要在本地浏览器查看的话，可以在 ssh 链接的时候将服务器的一个端口和本地端口绑定。

参考 Stack Overflow

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

readme_ch.md

readme_ch.md

DeepGlobe Land Cover Classification Challenge 土地利用分类竞赛

数据集

数据

标注

评价指标

结果展示

致谢

如何复现此工作

文件结构组织

代码执行顺序与注意事项

支撑代码

Files

readme_ch.md

Latest commit

History

readme_ch.md

File metadata and controls

DeepGlobe Land Cover Classification Challenge 土地利用分类竞赛

数据集

数据

标注

评价指标

结果展示

致谢

如何复现此工作

文件结构组织

代码执行顺序与注意事项

支撑代码