Caffe深度学习入门（5）—— caffenet 微调网络训练自己的数据并测试训练的模型

释放双眼，带上耳机，听听看~！

微调网络，通常我们有一个初始化的模型参数文件，这里是不同于training from scratch，scrachtch指的是我们训练一个新的网络，在训练过程中，这些参数都被随机初始化，而fine-tuning，是我们可以在ImageNet上1000类分类训练好的参数的基础上，根据我们的分类识别任务进行特定的微调

这里我以一个车的识别为例，假设我们有1种车需要识别，我的任务对象是车，现在有ImageNet的模型参数文件，在这里使用的网络模型是CaffeNet，是一个小型的网络，其实别的网络如GoogleNet也是一样的原理。那么这个任务的变化可以表示为：

任务：分类类别数目：1000（ImageNet上1000类的分类任务）——> 1(自己的特定数据集的分类任务车)

那么在网络的微调中，我们的整个流程分为以下几步：


1
2
3
4
5
6
7
11. 依然是准备好我们的训练数据和测试数据

22. 计算数据集的均值文件，因为集中特定领域的图像均值文件会跟ImageNet上比较General的数据的均值不太一样

33. 修改网络最后一层的输出类别，并且需要加快最后一层的参数学习速率

44. 调整Solver的配置参数，通常学习速率和步长，迭代次数都要适当减少

55. 启动训练，并且需要加载pretrained模型的参数

6

7

1.准备数据集
这一点就不用说了，准备两个txt文件，放成list的形式，可以参考caffe下的example，图像路径之后一个空格之后跟着类别的ID，如下，这里记住ID必须从0开始，要连续，否则会出错，loss不下降，按照要求写就OK。
这个是训练的图像label，测试的也同理


1
2
3
11. 创建lmdb文件，使用caffe下的convert_imageset 工具，具体命令如下：

2

3


1
2
3
1./build/tools/convert_imageset /media/***/801328a5-39c6-4e08-b070-19fc662a5236/resnet/caffe/data/cartest/  data/cartest/carlist.txt data/cartest/train_car_lmdb -resize_width=227 -resize_height=227 -check_size -shuffle true

2

3

其中第一个参数是基地址路径用来拼接的，第二个是label的文件，第三个是生成的数据库文件支持leveldb或者lmdb，接着是resize的大小，最后是否随机图片顺序

计算均值，使用caffe下的convert_imageset 工具，具体命令


1
2
3
1./build/tools/compute_image_mean /media/***/801328a5-39c6-4e08-b070-19fc662a5236/resnet/caffe/data/cartest/train_car_lmdb/ data/carmean.binaryproto

2

3

第一个参数是基地址路径用来拼接的，第二个是lmdb文件，第三个是生成的均值文件carmean.binaryproto

3.调整网络层参数
参照Caffe上的例程，我用的是CaffeNet，首先在输入层data层，修改我们的source 和 meanfile，根据之前生成的lmdb 和mean.binaryproto修改即可。

最后输出层是fc8，

1.首先修改名字，这样预训练模型赋值的时候这里就会因为名字不匹配从而重新训练，也就达成了我们适应新任务的目的。
2.调整学习速率，因为最后一层是重新学习，因此需要有更快的学习速率相比较其他层，因此我们将，weight和bias的学习速率加快10倍。

修改./models/bvlc_reference_caffenet/train_cal_resnet_lily.prototxt中 train和test对应的相关路径


1
2
3
4
1mean_file: &quot;/media/***/801328a5-39c6-4e08-b070-19fc662a5236/resnet/caffe/data/cartest/carmean.binaryproto

2source: &quot;/media/***/801328a5-39c6-4e08-b070-19fc662a5236/resnet/caffe/data/cartest/train_car_lmdb&quot;

3

4

修改./models/bvlc_reference_caffenet/solver_resnet_lily.prototxt


1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
1net: &quot;models/bvlc_reference_caffenet/train_val_resnet_lily.prototxt&quot;

2test_iter: 100

3test_interval: 1000

4base_lr: 0.001

5lr_policy: &quot;step&quot;

6gamma: 0.1

7stepsize: 20000

8display: 20

9max_iter: 50000

10momentum: 0.9

11weight_decay: 0.0005

12snapshot: 10000

13snapshot_prefix: &quot;models/bvlc_reference_caffenet/caffenet_resnet_model_lily&quot;

14solver_mode: GPU

15

16

原来是fc8，记得把跟fc8连接的名字都要修改掉，修改修改./models/bvlc_reference_caffenet/train_val_resnet_lily.prototxt 后如下


1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
1layer {

2  name: &quot;fc8_comp_model&quot;

3  type: &quot;InnerProduct&quot;

4  bottom: &quot;fc7&quot;

5  top: &quot;fc8_comp_model&quot;

6  param {

7    lr_mult: 1

8    decay_mult: 1

9  }

10  param {

11    lr_mult: 2

12    decay_mult: 0

13  }

14  inner_product_param {

15    num_output: 1000

16    weight_filler {

17      type: &quot;gaussian&quot;

18      std: 0.01

19    }

20    bias_filler {

21      type: &quot;constant&quot;

22      value: 0

23    }

24  }

25}

26layer {

27  name: &quot;accuracy&quot;

28  type: &quot;Accuracy&quot;

29  bottom: &quot;fc8_comp_model&quot;

30  bottom: &quot;label&quot;

31  top: &quot;accuracy&quot;

32  include {

33    phase: TEST

34  }

35}

36layer {

37  name: &quot;loss&quot;

38  type: &quot;SoftmaxWithLoss&quot;

39  bottom: &quot;fc8_comp_model&quot;

40  bottom: &quot;label&quot;

41  top: &quot;loss&quot;

42}

43

44

主要的调整有：test_iter从1000改为了100，因为数据量减少了，base_lr从0.01变成了0.001，这个很重要，微调时的基本学习速率不能太大，学习策略没有改变，步长从原来的100000变成了20000，最大的迭代次数也从450000变成了50000，动量和权重衰减项都没有修改，依然是GPU模型，网络模型文件和快照的路径根据自己修改

train_val_resnet_lily.prototxt完整文件为：


1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
1name: &quot;CaffeNet&quot;

2layer {

3  name: &quot;data&quot;

4  type: &quot;Data&quot;

5  top: &quot;data&quot;

6  top: &quot;label&quot;

7  include {

8    phase: TRAIN

9  }

10  transform_param {

11    mirror: true

12    crop_size: 227

13    mean_file: &quot;/media/***/801328a5-39c6-4e08-b070-19fc662a5236/resnet/caffe/data/cartest/carmean.binaryproto&quot;

14  }

15# mean pixel / channel-wise mean instead of mean image

16#  transform_param {

17#    crop_size: 227

18#    mean_value: 104

19#    mean_value: 117

20#    mean_value: 123

21#    mirror: true

22#  }

23  data_param {

24    source: &quot;/media/***/801328a5-39c6-4e08-b070-19fc662a5236/resnet/caffe/data/cartest/train_car_lmdb&quot;

25    batch_size: 256

26    backend: LMDB

27  }

28}

29layer {

30  name: &quot;data&quot;

31  type: &quot;Data&quot;

32  top: &quot;data&quot;

33  top: &quot;label&quot;

34  include {

35    phase: TEST

36  }

37  transform_param {

38    mirror: false

39    crop_size: 227

40    mean_file: &quot;/media/***/801328a5-39c6-4e08-b070-19fc662a5236/resnet/caffe/data/cartest/carmean.binaryproto&quot;

41  }

42# mean pixel / channel-wise mean instead of mean image

43#  transform_param {

44#    crop_size: 227

45#    mean_value: 104

46#    mean_value: 117

47#    mean_value: 123

48#    mirror: false

49#  }

50  data_param {

51    source: &quot;/media/***/801328a5-39c6-4e08-b070-19fc662a5236/resnet/caffe/data/cartest/train_car_lmdb&quot;

52    batch_size: 50

53    backend: LMDB

54  }

55}

56layer {

57  name: &quot;conv1&quot;

58  type: &quot;Convolution&quot;

59  bottom: &quot;data&quot;

60  top: &quot;conv1&quot;

61  param {

62    lr_mult: 1

63    decay_mult: 1

64  }

65  param {

66    lr_mult: 2

67    decay_mult: 0

68  }

69  convolution_param {

70    num_output: 96

71    kernel_size: 11

72    stride: 4

73    weight_filler {

74      type: &quot;gaussian&quot;

75      std: 0.01

76    }

77    bias_filler {

78      type: &quot;constant&quot;

79      value: 0

80    }

81  }

82}

83layer {

84  name: &quot;relu1&quot;

85  type: &quot;ReLU&quot;

86  bottom: &quot;conv1&quot;

87  top: &quot;conv1&quot;

88}

89layer {

90  name: &quot;pool1&quot;

91  type: &quot;Pooling&quot;

92  bottom: &quot;conv1&quot;

93  top: &quot;pool1&quot;

94  pooling_param {

95    pool: MAX

96    kernel_size: 3

97    stride: 2

98  }

99}

100layer {

101  name: &quot;norm1&quot;

102  type: &quot;LRN&quot;

103  bottom: &quot;pool1&quot;

104  top: &quot;norm1&quot;

105  lrn_param {

106    local_size: 5

107    alpha: 0.0001

108    beta: 0.75

109  }

110}

111layer {

112  name: &quot;conv2&quot;

113  type: &quot;Convolution&quot;

114  bottom: &quot;norm1&quot;

115  top: &quot;conv2&quot;

116  param {

117    lr_mult: 1

118    decay_mult: 1

119  }

120  param {

121    lr_mult: 2

122    decay_mult: 0

123  }

124  convolution_param {

125    num_output: 256

126    pad: 2

127    kernel_size: 5

128    group: 2

129    weight_filler {

130      type: &quot;gaussian&quot;

131      std: 0.01

132    }

133    bias_filler {

134      type: &quot;constant&quot;

135      value: 1

136    }

137  }

138}

139layer {

140  name: &quot;relu2&quot;

141  type: &quot;ReLU&quot;

142  bottom: &quot;conv2&quot;

143  top: &quot;conv2&quot;

144}

145layer {

146  name: &quot;pool2&quot;

147  type: &quot;Pooling&quot;

148  bottom: &quot;conv2&quot;

149  top: &quot;pool2&quot;

150  pooling_param {

151    pool: MAX

152    kernel_size: 3

153    stride: 2

154  }

155}

156layer {

157  name: &quot;norm2&quot;

158  type: &quot;LRN&quot;

159  bottom: &quot;pool2&quot;

160  top: &quot;norm2&quot;

161  lrn_param {

162    local_size: 5

163    alpha: 0.0001

164    beta: 0.75

165  }

166}

167layer {

168  name: &quot;conv3&quot;

169  type: &quot;Convolution&quot;

170  bottom: &quot;norm2&quot;

171  top: &quot;conv3&quot;

172  param {

173    lr_mult: 1

174    decay_mult: 1

175  }

176  param {

177    lr_mult: 2

178    decay_mult: 0

179  }

180  convolution_param {

181    num_output: 384

182    pad: 1

183    kernel_size: 3

184    weight_filler {

185      type: &quot;gaussian&quot;

186      std: 0.01

187    }

188    bias_filler {

189      type: &quot;constant&quot;

190      value: 0

191    }

192  }

193}

194layer {

195  name: &quot;relu3&quot;

196  type: &quot;ReLU&quot;

197  bottom: &quot;conv3&quot;

198  top: &quot;conv3&quot;

199}

200layer {

201  name: &quot;conv4&quot;

202  type: &quot;Convolution&quot;

203  bottom: &quot;conv3&quot;

204  top: &quot;conv4&quot;

205  param {

206    lr_mult: 1

207    decay_mult: 1

208  }

209  param {

210    lr_mult: 2

211    decay_mult: 0

212  }

213  convolution_param {

214    num_output: 384

215    pad: 1

216    kernel_size: 3

217    group: 2

218    weight_filler {

219      type: &quot;gaussian&quot;

220      std: 0.01

221    }

222    bias_filler {

223      type: &quot;constant&quot;

224      value: 1

225    }

226  }

227}

228layer {

229  name: &quot;relu4&quot;

230  type: &quot;ReLU&quot;

231  bottom: &quot;conv4&quot;

232  top: &quot;conv4&quot;

233}

234layer {

235  name: &quot;conv5&quot;

236  type: &quot;Convolution&quot;

237  bottom: &quot;conv4&quot;

238  top: &quot;conv5&quot;

239  param {

240    lr_mult: 1

241    decay_mult: 1

242  }

243  param {

244    lr_mult: 2

245    decay_mult: 0

246  }

247  convolution_param {

248    num_output: 256

249    pad: 1

250    kernel_size: 3

251    group: 2

252    weight_filler {

253      type: &quot;gaussian&quot;

254      std: 0.01

255    }

256    bias_filler {

257      type: &quot;constant&quot;

258      value: 1

259    }

260  }

261}

262layer {

263  name: &quot;relu5&quot;

264  type: &quot;ReLU&quot;

265  bottom: &quot;conv5&quot;

266  top: &quot;conv5&quot;

267}

268layer {

269  name: &quot;pool5&quot;

270  type: &quot;Pooling&quot;

271  bottom: &quot;conv5&quot;

272  top: &quot;pool5&quot;

273  pooling_param {

274    pool: MAX

275    kernel_size: 3

276    stride: 2

277  }

278}

279layer {

280  name: &quot;fc6&quot;

281  type: &quot;InnerProduct&quot;

282  bottom: &quot;pool5&quot;

283  top: &quot;fc6&quot;

284  param {

285    lr_mult: 1

286    decay_mult: 1

287  }

288  param {

289    lr_mult: 2

290    decay_mult: 0

291  }

292  inner_product_param {

293    num_output: 4096

294    weight_filler {

295      type: &quot;gaussian&quot;

296      std: 0.005

297    }

298    bias_filler {

299      type: &quot;constant&quot;

300      value: 1

301    }

302  }

303}

304layer {

305  name: &quot;relu6&quot;

306  type: &quot;ReLU&quot;

307  bottom: &quot;fc6&quot;

308  top: &quot;fc6&quot;

309}

310layer {

311  name: &quot;drop6&quot;

312  type: &quot;Dropout&quot;

313  bottom: &quot;fc6&quot;

314  top: &quot;fc6&quot;

315  dropout_param {

316    dropout_ratio: 0.5

317  }

318}

319layer {

320  name: &quot;fc7&quot;

321  type: &quot;InnerProduct&quot;

322  bottom: &quot;fc6&quot;

323  top: &quot;fc7&quot;

324  param {

325    lr_mult: 1

326    decay_mult: 1

327  }

328  param {

329    lr_mult: 2

330    decay_mult: 0

331  }

332  inner_product_param {

333    num_output: 4096

334    weight_filler {

335      type: &quot;gaussian&quot;

336      std: 0.005

337    }

338    bias_filler {

339      type: &quot;constant&quot;

340      value: 1

341    }

342  }

343}

344layer {

345  name: &quot;relu7&quot;

346  type: &quot;ReLU&quot;

347  bottom: &quot;fc7&quot;

348  top: &quot;fc7&quot;

349}

350layer {

351  name: &quot;drop7&quot;

352  type: &quot;Dropout&quot;

353  bottom: &quot;fc7&quot;

354  top: &quot;fc7&quot;

355  dropout_param {

356    dropout_ratio: 0.5

357  }

358}

359layer {

360  name: &quot;fc8_comp_model&quot;

361  type: &quot;InnerProduct&quot;

362  bottom: &quot;fc7&quot;

363  top: &quot;fc8_comp_model&quot;

364  param {

365    lr_mult: 1

366    decay_mult: 1

367  }

368  param {

369    lr_mult: 2

370    decay_mult: 0

371  }

372  inner_product_param {

373    num_output: 1000

374    weight_filler {

375      type: &quot;gaussian&quot;

376      std: 0.01

377    }

378    bias_filler {

379      type: &quot;constant&quot;

380      value: 0

381    }

382  }

383}

384layer {

385  name: &quot;accuracy&quot;

386  type: &quot;Accuracy&quot;

387  bottom: &quot;fc8_comp_model&quot;

388  bottom: &quot;label&quot;

389  top: &quot;accuracy&quot;

390  include {

391    phase: TEST

392  }

393}

394layer {

395  name: &quot;loss&quot;

396  type: &quot;SoftmaxWithLoss&quot;

397  bottom: &quot;fc8_comp_model&quot;

398  bottom: &quot;label&quot;

399  top: &quot;loss&quot;

400}

401

402

403训练的指令如下：

404./build/tools/caffe train --solver ./models/bvlc_reference_caffenet/solver_resnet_lily.prototxt --weights ./models/bvlc_reference_caffenet/bvlc_reference_caffenet.caffemodel --gpu 0

405

406

测试指令：


1
2
3
1python ./python/classify02.py --model_def ./models/bvlc_reference_caffenet/train_val_test_resnet_lily.prototxt --pretrained_model ./models/bvlc_reference_caffenet/caffenet_resnet_model_lily_iter_50000.caffemodel --labels_file ./data/cartest/cartest.txt --center_only ./data/cartest/JPEGImages/crk201706301341.jpg foo

2

3

注意这里的train_val_test_resnet_lily.prototxt文件与训练时的文件train_val_resnet_lily.prototxt文件是不一样的。

train_val_resnet_lily.prototxt文件为：


1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
1name: &quot;train_resnet_lily&quot;

2

3layer {

4  name: &quot;data&quot;

5  type: &quot;Input&quot;

6  top: &quot;data&quot;

7  input_param { shape: { dim: 1 dim: 3 dim: 227 dim: 227 } }

8}

9

10layer {

11  name: &quot;conv1&quot;

12  type: &quot;Convolution&quot;

13  bottom: &quot;data&quot;

14  top: &quot;conv1&quot;

15  param {

16    lr_mult: 1

17    decay_mult: 1

18  }

19  param {

20    lr_mult: 2

21    decay_mult: 0

22  }

23  convolution_param {

24    num_output: 96

25    kernel_size: 11

26    stride: 4

27    weight_filler {

28      type: &quot;gaussian&quot;

29      std: 0.01

30    }

31    bias_filler {

32      type: &quot;constant&quot;

33      value: 0

34    }

35  }

36}

37layer {

38  name: &quot;relu1&quot;

39  type: &quot;ReLU&quot;

40  bottom: &quot;conv1&quot;

41  top: &quot;conv1&quot;

42}

43layer {

44  name: &quot;pool1&quot;

45  type: &quot;Pooling&quot;

46  bottom: &quot;conv1&quot;

47  top: &quot;pool1&quot;

48  pooling_param {

49    pool: MAX

50    kernel_size: 3

51    stride: 2

52  }

53}

54layer {

55  name: &quot;norm1&quot;

56  type: &quot;LRN&quot;

57  bottom: &quot;pool1&quot;

58  top: &quot;norm1&quot;

59  lrn_param {

60    local_size: 5

61    alpha: 0.0001

62    beta: 0.75

63  }

64}

65layer {

66  name: &quot;conv2&quot;

67  type: &quot;Convolution&quot;

68  bottom: &quot;norm1&quot;

69  top: &quot;conv2&quot;

70  param {

71    lr_mult: 1

72    decay_mult: 1

73  }

74  param {

75    lr_mult: 2

76    decay_mult: 0

77  }

78  convolution_param {

79    num_output: 256

80    pad: 2

81    kernel_size: 5

82    group: 2

83    weight_filler {

84      type: &quot;gaussian&quot;

85      std: 0.01

86    }

87    bias_filler {

88      type: &quot;constant&quot;

89      value: 1

90    }

91  }

92}

93layer {

94  name: &quot;relu2&quot;

95  type: &quot;ReLU&quot;

96  bottom: &quot;conv2&quot;

97  top: &quot;conv2&quot;

98}

99layer {

100  name: &quot;pool2&quot;

101  type: &quot;Pooling&quot;

102  bottom: &quot;conv2&quot;

103  top: &quot;pool2&quot;

104  pooling_param {

105    pool: MAX

106    kernel_size: 3

107    stride: 2

108  }

109}

110layer {

111  name: &quot;norm2&quot;

112  type: &quot;LRN&quot;

113  bottom: &quot;pool2&quot;

114  top: &quot;norm2&quot;

115  lrn_param {

116    local_size: 5

117    alpha: 0.0001

118    beta: 0.75

119  }

120}

121layer {

122  name: &quot;conv3&quot;

123  type: &quot;Convolution&quot;

124  bottom: &quot;norm2&quot;

125  top: &quot;conv3&quot;

126  param {

127    lr_mult: 1

128    decay_mult: 1

129  }

130  param {

131    lr_mult: 2

132    decay_mult: 0

133  }

134  convolution_param {

135    num_output: 384

136    pad: 1

137    kernel_size: 3

138    weight_filler {

139      type: &quot;gaussian&quot;

140      std: 0.01

141    }

142    bias_filler {

143      type: &quot;constant&quot;

144      value: 0

145    }

146  }

147}

148layer {

149  name: &quot;relu3&quot;

150  type: &quot;ReLU&quot;

151  bottom: &quot;conv3&quot;

152  top: &quot;conv3&quot;

153}

154layer {

155  name: &quot;conv4&quot;

156  type: &quot;Convolution&quot;

157  bottom: &quot;conv3&quot;

158  top: &quot;conv4&quot;

159  param {

160    lr_mult: 1

161    decay_mult: 1

162  }

163  param {

164    lr_mult: 2

165    decay_mult: 0

166  }

167  convolution_param {

168    num_output: 384

169    pad: 1

170    kernel_size: 3

171    group: 2

172    }

173}

174layer {

175  name: &quot;relu4&quot;

176  type: &quot;ReLU&quot;

177  bottom: &quot;conv4&quot;

178  top: &quot;conv4&quot;

179}

180layer {

181  name: &quot;conv5&quot;

182  type: &quot;Convolution&quot;

183  bottom: &quot;conv4&quot;

184  top: &quot;conv5&quot;

185  param {

186    lr_mult: 1

187    decay_mult: 1

188  }

189  param {

190    lr_mult: 2

191    decay_mult: 0

192  }

193  convolution_param {

194    num_output: 256

195    pad: 1

196    kernel_size: 3

197    group: 2

198    

199  }

200}

201layer {

202  name: &quot;relu5&quot;

203  type: &quot;ReLU&quot;

204  bottom: &quot;conv5&quot;

205  top: &quot;conv5&quot;

206}

207layer {

208  name: &quot;pool5&quot;

209  type: &quot;Pooling&quot;

210  bottom: &quot;conv5&quot;

211  top: &quot;pool5&quot;

212  pooling_param {

213    pool: MAX

214    kernel_size: 3

215    stride: 2

216  }

217}

218layer {

219  name: &quot;fc6&quot;

220  type: &quot;InnerProduct&quot;

221  bottom: &quot;pool5&quot;

222  top: &quot;fc6&quot;

223  param {

224    lr_mult: 1

225    decay_mult: 1

226  }

227  param {

228    lr_mult: 2

229    decay_mult: 0

230  }

231  inner_product_param {

232    num_output: 4096

233    weight_filler {

234      type: &quot;gaussian&quot;

235      std: 0.005

236    }

237    bias_filler {

238      type: &quot;constant&quot;

239      value: 1

240    }

241  }

242}

243layer {

244  name: &quot;relu6&quot;

245  type: &quot;ReLU&quot;

246  bottom: &quot;fc6&quot;

247  top: &quot;fc6&quot;

248}

249layer {

250  name: &quot;drop6&quot;

251  type: &quot;Dropout&quot;

252  bottom: &quot;fc6&quot;

253  top: &quot;fc6&quot;

254  dropout_param {

255    dropout_ratio: 0.5

256  }

257}

258layer {

259  name: &quot;fc7&quot;

260  type: &quot;InnerProduct&quot;

261  bottom: &quot;fc6&quot;

262  top: &quot;fc7&quot;

263  param {

264    lr_mult: 1

265    decay_mult: 1

266  }

267  param {

268    lr_mult: 2

269    decay_mult: 0

270  }

271  inner_product_param {

272    num_output: 4096

273  

274    }

275}

276layer {

277  name: &quot;relu7&quot;

278  type: &quot;ReLU&quot;

279  bottom: &quot;fc7&quot;

280  top: &quot;fc7&quot;

281}

282layer {

283  name: &quot;drop7&quot;

284  type: &quot;Dropout&quot;

285  bottom: &quot;fc7&quot;

286  top: &quot;fc7&quot;

287  dropout_param {

288    dropout_ratio: 0.5

289  }

290}

291layer {

292  name: &quot;fc8_comp_model&quot;

293  type: &quot;InnerProduct&quot;

294  bottom: &quot;fc7&quot;

295  top: &quot;fc8_comp_model&quot;

296  param {

297    lr_mult: 1

298    decay_mult: 1

299  }

300  param {

301    lr_mult: 2

302    decay_mult: 0

303  }

304  inner_product_param {

305    num_output: 1000

306    

307  }

308}

309layer {

310  name: &quot;prob&quot;

311  type: &quot;Softmax&quot;

312  bottom: &quot;fc8_comp_model&quot;

313  top: &quot;prob&quot;

314}

315

316

报错及解决方案

在最后一步测试的时候运行报错

报错：


1
2
3
4
5
6
7
8
9
1File &quot;python/classify.py&quot;, line 138, in &lt;module&gt;

2    main(sys.argv)

3  File &quot;python/classify.py&quot;, line 110, in main

4    channel_swap=channel_swap)

5  File &quot;/media/futurus/801328a5-39c6-4e08-b070-19fc662a5236/resnet/caffe/python/caffe/classifier.py&quot;, line 29, in __init__

6    in_ = self.inputs[0]

7IndexError: list index out of range

8

9

参考解决方案;
加入：


1
2
3
4
5
6
7
8
9
10
1net: &quot;train_resnet_lily&quot;

2input: &quot;data&quot;

3input_shape {

4  dim: 10

5  dim: 3

6  dim: 224

7  dim: 224

8}

9

10

加入之后又报了其他错误：


1
2
3
4
5
6
1[libprotobuf ERROR google/protobuf/text_format.cc:274] Error parsing text-format caffe.NetParameter: 1:4: Message type &quot;caffe.NetParameter&quot; has no field named &quot;net&quot;.

2F0125 11:48:14.708683 42586 upgrade_proto.cpp:88] Check failed: ReadProtoFromTextFile(param_file, param) Failed to parse NetParameter file: ./models/bvlc_reference_caffenet/train_val_test_resnet_lily.prototxt

3*** Check failure stack trace: ***

4Aborted (core dumped)

5

6

根据网友的博客修改加入为


1
2
3
4
5
6
7
8
1net: &quot;train_resnet_lily&quot;

2input: &quot;data&quot;

3input_dim: 10

4input_dim: 3

5input_dim: 224

6input_dim: 224

7

8

测试还不不行，还是报错。

最终解决方案如下,运行通过。加入


1
2
3
4
5
6
7
8
9
1name: &quot;train_resnet_lily&quot;

2layer {

3  name: &quot;data&quot;

4  type: &quot;Input&quot;

5  top: &quot;data&quot;

6  input_param { shape: { dim: 1 dim: 3 dim: 227 dim: 227 } }

7}

8

9

{{userData.name}}已认证

Caffe深度学习入门（5）—— caffenet 微调网络训练自己的数据并测试训练的模型

MongoDB数据建模小案例：朋友圈评论内容管理

Ubuntu上NFS的安装配置

{{userData.name}}已认证

Related posts:

MongoDB数据建模小案例：朋友圈评论内容管理

Ubuntu上NFS的安装配置

Docker与Kubernetes系列(二): Docker的基本用法

python 操作redis

超级全面的MySQL优化面试解析

Caffe 深度学习框架上手教程