-
Notifications
You must be signed in to change notification settings - Fork 18
/
train_mixup_in_data.log
3927 lines (3927 loc) · 262 KB
/
train_mixup_in_data.log
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
nohup: ignoring input
INFO:root:start with arguments Namespace(batch_size=128, benchmark=0, data_nthreads=4, data_train='data/cifar10_train.rec', data_train_idx='', data_val='data/cifar10_val.rec', data_val_idx='', disp_batches=20, dtype='float32', gpus='0,1', image_shape='3,28,28', is_train=True, kv_store='device', load_epoch=None, lr=0.7, lr_factor=0.1, lr_step_epochs='10,100,200', max_random_aspect_ratio=0, max_random_h=36, max_random_l=50, max_random_rotate_angle=0, max_random_s=50, max_random_scale=1, max_random_shear_ratio=0, min_random_scale=1, model_prefix='models/mix', mom=0.9, monitor=0, network='resnet_mixup', num_classes=10, num_epochs=300, num_examples=50000, num_layers=50, optimizer='sgd', pad_size=4, random_crop=1, random_mirror=1, rgb_mean='123.68,116.779,103.939', test_io=0, top_k=0, wd=0.0001)
{'alpha': '0.2', 'mix_rate': '0.7', 'batch_size': '128', 'num_classes': '10'}
{}
[03:00:06] src/io/iter_image_recordio_2.cc:169: ImageRecordIOParser2: data/cifar10_train.rec, use 4 threads for decoding..
[03:00:06] src/io/iter_image_recordio_2.cc:169: ImageRecordIOParser2: data/cifar10_val.rec, use 4 threads for decoding..
[03:00:07] src/operator/././cudnn_algoreg-inl.h:106: Running performance tests to find the best convolution algorithm, this can take a while... (setting env variable MXNET_CUDNN_AUTOTUNE_DEFAULT to 0 to disable)
INFO:root:Epoch[0] Batch [20] Speed: 3582.98 samples/sec Loss=2.342777
INFO:root:Epoch[0] Batch [40] Speed: 3694.52 samples/sec Loss=2.246202
INFO:root:Epoch[0] Batch [60] Speed: 3871.72 samples/sec Loss=2.155675
INFO:root:Epoch[0] Batch [80] Speed: 3693.45 samples/sec Loss=2.065198
INFO:root:Epoch[0] Batch [100] Speed: 3726.89 samples/sec Loss=1.974102
INFO:root:Epoch[0] Batch [120] Speed: 3835.81 samples/sec Loss=1.946788
INFO:root:Epoch[0] Batch [140] Speed: 3695.40 samples/sec Loss=1.911029
INFO:root:Epoch[0] Batch [160] Speed: 3744.27 samples/sec Loss=1.919498
INFO:root:Epoch[0] Batch [180] Speed: 3585.76 samples/sec Loss=1.873903
INFO:root:Epoch[0] Batch [200] Speed: 3745.46 samples/sec Loss=1.853349
INFO:root:Epoch[0] Batch [220] Speed: 3887.81 samples/sec Loss=1.839257
INFO:root:Epoch[0] Batch [240] Speed: 3639.04 samples/sec Loss=1.877601
INFO:root:Epoch[0] Batch [260] Speed: 3546.74 samples/sec Loss=1.793658
INFO:root:Epoch[0] Batch [280] Speed: 3669.19 samples/sec Loss=1.813744
INFO:root:Epoch[0] Batch [300] Speed: 3601.04 samples/sec Loss=1.801585
INFO:root:Epoch[0] Batch [320] Speed: 3840.77 samples/sec Loss=1.765409
INFO:root:Epoch[0] Batch [340] Speed: 3609.26 samples/sec Loss=1.798288
INFO:root:Epoch[0] Batch [360] Speed: 3689.70 samples/sec Loss=1.765062
INFO:root:Epoch[0] Batch [380] Speed: 3713.74 samples/sec Loss=1.750561
INFO:root:Epoch[0] Train-Loss=1.752431
INFO:root:Epoch[0] Time cost=13.999
INFO:root:Saved checkpoint to "models/mix-0001.params"
INFO:root:Epoch[0] Validation-Loss=1.658290
INFO:root:Epoch[1] Batch [20] Speed: 3764.77 samples/sec Loss=1.727725
INFO:root:Epoch[1] Batch [40] Speed: 3701.91 samples/sec Loss=1.708446
INFO:root:Epoch[1] Batch [60] Speed: 3684.39 samples/sec Loss=1.695748
INFO:root:Epoch[1] Batch [80] Speed: 3810.10 samples/sec Loss=1.725637
INFO:root:Epoch[1] Batch [100] Speed: 3518.31 samples/sec Loss=1.701827
INFO:root:Epoch[1] Batch [120] Speed: 3827.50 samples/sec Loss=1.657070
INFO:root:Epoch[1] Batch [140] Speed: 3799.21 samples/sec Loss=1.655193
INFO:root:Epoch[1] Batch [160] Speed: 3800.25 samples/sec Loss=1.657749
INFO:root:Epoch[1] Batch [180] Speed: 3772.47 samples/sec Loss=1.611735
INFO:root:Epoch[1] Batch [200] Speed: 3629.09 samples/sec Loss=1.643090
INFO:root:Epoch[1] Batch [220] Speed: 3590.80 samples/sec Loss=1.621215
INFO:root:Epoch[1] Batch [240] Speed: 3952.25 samples/sec Loss=1.570741
INFO:root:Epoch[1] Batch [260] Speed: 3873.54 samples/sec Loss=1.598520
INFO:root:Epoch[1] Batch [280] Speed: 3886.83 samples/sec Loss=1.574801
INFO:root:Epoch[1] Batch [300] Speed: 3921.26 samples/sec Loss=1.573884
INFO:root:Epoch[1] Batch [320] Speed: 3636.03 samples/sec Loss=1.592597
INFO:root:Epoch[1] Batch [340] Speed: 3747.92 samples/sec Loss=1.546124
INFO:root:Epoch[1] Batch [360] Speed: 3800.45 samples/sec Loss=1.542879
INFO:root:Epoch[1] Batch [380] Speed: 3746.41 samples/sec Loss=1.548682
INFO:root:Epoch[1] Train-Loss=1.552988
INFO:root:Epoch[1] Time cost=13.309
INFO:root:Saved checkpoint to "models/mix-0002.params"
INFO:root:Epoch[1] Validation-Loss=1.503490
INFO:root:Epoch[2] Batch [20] Speed: 3823.58 samples/sec Loss=1.508840
INFO:root:Epoch[2] Batch [40] Speed: 3715.68 samples/sec Loss=1.473463
INFO:root:Epoch[2] Batch [60] Speed: 3969.44 samples/sec Loss=1.509732
INFO:root:Epoch[2] Batch [80] Speed: 3711.14 samples/sec Loss=1.484333
INFO:root:Epoch[2] Batch [100] Speed: 3838.04 samples/sec Loss=1.433627
INFO:root:Epoch[2] Batch [120] Speed: 3630.67 samples/sec Loss=1.514890
INFO:root:Epoch[2] Batch [140] Speed: 3721.24 samples/sec Loss=1.502105
INFO:root:Epoch[2] Batch [160] Speed: 3765.57 samples/sec Loss=1.438914
INFO:root:Epoch[2] Batch [180] Speed: 3481.78 samples/sec Loss=1.388227
INFO:root:Epoch[2] Batch [200] Speed: 3716.46 samples/sec Loss=1.434197
INFO:root:Epoch[2] Batch [220] Speed: 3958.74 samples/sec Loss=1.447680
INFO:root:Epoch[2] Batch [240] Speed: 3972.69 samples/sec Loss=1.396764
INFO:root:Epoch[2] Batch [260] Speed: 3862.75 samples/sec Loss=1.426825
INFO:root:Epoch[2] Batch [280] Speed: 3875.86 samples/sec Loss=1.336821
INFO:root:Epoch[2] Batch [300] Speed: 3754.70 samples/sec Loss=1.403687
INFO:root:Epoch[2] Batch [320] Speed: 3922.22 samples/sec Loss=1.345291
INFO:root:Epoch[2] Batch [340] Speed: 3810.30 samples/sec Loss=1.394461
INFO:root:Epoch[2] Batch [360] Speed: 3684.74 samples/sec Loss=1.371990
INFO:root:Epoch[2] Batch [380] Speed: 3760.44 samples/sec Loss=1.387426
INFO:root:Epoch[2] Train-Loss=1.384350
INFO:root:Epoch[2] Time cost=13.177
INFO:root:Saved checkpoint to "models/mix-0003.params"
INFO:root:Epoch[2] Validation-Loss=1.516316
INFO:root:Epoch[3] Batch [20] Speed: 3586.05 samples/sec Loss=1.343441
INFO:root:Epoch[3] Batch [40] Speed: 3964.31 samples/sec Loss=1.309117
INFO:root:Epoch[3] Batch [60] Speed: 4060.71 samples/sec Loss=1.321634
INFO:root:Epoch[3] Batch [80] Speed: 3724.27 samples/sec Loss=1.319371
INFO:root:Epoch[3] Batch [100] Speed: 3875.52 samples/sec Loss=1.307463
INFO:root:Epoch[3] Batch [120] Speed: 4037.20 samples/sec Loss=1.296695
INFO:root:Epoch[3] Batch [140] Speed: 3922.73 samples/sec Loss=1.338026
INFO:root:Epoch[3] Batch [160] Speed: 3674.01 samples/sec Loss=1.317023
INFO:root:Epoch[3] Batch [180] Speed: 3635.92 samples/sec Loss=1.318409
INFO:root:Epoch[3] Batch [200] Speed: 3695.47 samples/sec Loss=1.291207
INFO:root:Epoch[3] Batch [220] Speed: 3976.98 samples/sec Loss=1.306005
INFO:root:Epoch[3] Batch [240] Speed: 3761.62 samples/sec Loss=1.293276
INFO:root:Epoch[3] Batch [260] Speed: 3917.31 samples/sec Loss=1.308702
INFO:root:Epoch[3] Batch [280] Speed: 3995.84 samples/sec Loss=1.255389
INFO:root:Epoch[3] Batch [300] Speed: 3578.21 samples/sec Loss=1.248143
INFO:root:Epoch[3] Batch [320] Speed: 3744.53 samples/sec Loss=1.268043
INFO:root:Epoch[3] Batch [340] Speed: 3389.54 samples/sec Loss=1.242926
INFO:root:Epoch[3] Batch [360] Speed: 3743.04 samples/sec Loss=1.269126
INFO:root:Epoch[3] Batch [380] Speed: 3600.54 samples/sec Loss=1.310697
INFO:root:Epoch[3] Train-Loss=1.270178
INFO:root:Epoch[3] Time cost=13.243
INFO:root:Saved checkpoint to "models/mix-0004.params"
INFO:root:Epoch[3] Validation-Loss=1.281809
INFO:root:Epoch[4] Batch [20] Speed: 3882.58 samples/sec Loss=1.250262
INFO:root:Epoch[4] Batch [40] Speed: 3737.02 samples/sec Loss=1.245812
INFO:root:Epoch[4] Batch [60] Speed: 3695.76 samples/sec Loss=1.264684
INFO:root:Epoch[4] Batch [80] Speed: 3811.17 samples/sec Loss=1.250917
INFO:root:Epoch[4] Batch [100] Speed: 3712.39 samples/sec Loss=1.191108
INFO:root:Epoch[4] Batch [120] Speed: 3723.86 samples/sec Loss=1.232084
INFO:root:Epoch[4] Batch [140] Speed: 3614.87 samples/sec Loss=1.218612
INFO:root:Epoch[4] Batch [160] Speed: 3618.46 samples/sec Loss=1.243120
INFO:root:Epoch[4] Batch [180] Speed: 3790.13 samples/sec Loss=1.233880
INFO:root:Epoch[4] Batch [200] Speed: 3840.46 samples/sec Loss=1.266944
INFO:root:Epoch[4] Batch [220] Speed: 3594.61 samples/sec Loss=1.241611
INFO:root:Epoch[4] Batch [240] Speed: 3740.23 samples/sec Loss=1.197197
INFO:root:Epoch[4] Batch [260] Speed: 3558.51 samples/sec Loss=1.240909
INFO:root:Epoch[4] Batch [280] Speed: 3632.23 samples/sec Loss=1.173882
INFO:root:Epoch[4] Batch [300] Speed: 3682.38 samples/sec Loss=1.236662
INFO:root:Epoch[4] Batch [320] Speed: 3862.26 samples/sec Loss=1.198326
INFO:root:Epoch[4] Batch [340] Speed: 3661.70 samples/sec Loss=1.141737
INFO:root:Epoch[4] Batch [360] Speed: 3860.32 samples/sec Loss=1.200049
INFO:root:Epoch[4] Batch [380] Speed: 3655.92 samples/sec Loss=1.176281
INFO:root:Epoch[4] Train-Loss=1.226745
INFO:root:Epoch[4] Time cost=13.449
INFO:root:Saved checkpoint to "models/mix-0005.params"
INFO:root:Epoch[4] Validation-Loss=1.191472
INFO:root:Epoch[5] Batch [20] Speed: 3825.23 samples/sec Loss=1.200922
INFO:root:Epoch[5] Batch [40] Speed: 3817.81 samples/sec Loss=1.164515
INFO:root:Epoch[5] Batch [60] Speed: 3985.51 samples/sec Loss=1.197263
INFO:root:Epoch[5] Batch [80] Speed: 3687.44 samples/sec Loss=1.197212
INFO:root:Epoch[5] Batch [100] Speed: 3591.94 samples/sec Loss=1.169588
INFO:root:Epoch[5] Batch [120] Speed: 3749.69 samples/sec Loss=1.171624
INFO:root:Epoch[5] Batch [140] Speed: 3703.62 samples/sec Loss=1.149759
INFO:root:Epoch[5] Batch [160] Speed: 3726.20 samples/sec Loss=1.143298
INFO:root:Epoch[5] Batch [180] Speed: 3573.85 samples/sec Loss=1.164518
INFO:root:Epoch[5] Batch [200] Speed: 3657.96 samples/sec Loss=1.218354
INFO:root:Epoch[5] Batch [220] Speed: 3492.17 samples/sec Loss=1.162091
INFO:root:Epoch[5] Batch [240] Speed: 3627.14 samples/sec Loss=1.131930
INFO:root:Epoch[5] Batch [260] Speed: 3690.31 samples/sec Loss=1.165203
INFO:root:Epoch[5] Batch [280] Speed: 3655.85 samples/sec Loss=1.126168
INFO:root:Epoch[5] Batch [300] Speed: 3611.08 samples/sec Loss=1.145882
INFO:root:Epoch[5] Batch [320] Speed: 3682.01 samples/sec Loss=1.162660
INFO:root:Epoch[5] Batch [340] Speed: 3706.03 samples/sec Loss=1.106049
INFO:root:Epoch[5] Batch [360] Speed: 3885.32 samples/sec Loss=1.152234
INFO:root:Epoch[5] Batch [380] Speed: 3714.64 samples/sec Loss=1.133865
INFO:root:Epoch[5] Train-Loss=1.113487
INFO:root:Epoch[5] Time cost=13.477
INFO:root:Saved checkpoint to "models/mix-0006.params"
INFO:root:Epoch[5] Validation-Loss=1.102339
INFO:root:Epoch[6] Batch [20] Speed: 3760.46 samples/sec Loss=1.110440
INFO:root:Epoch[6] Batch [40] Speed: 3835.87 samples/sec Loss=1.110398
INFO:root:Epoch[6] Batch [60] Speed: 3771.96 samples/sec Loss=1.117684
INFO:root:Epoch[6] Batch [80] Speed: 4008.75 samples/sec Loss=1.095932
INFO:root:Epoch[6] Batch [100] Speed: 3732.52 samples/sec Loss=1.131535
INFO:root:Epoch[6] Batch [120] Speed: 3942.28 samples/sec Loss=1.117673
INFO:root:Epoch[6] Batch [140] Speed: 3662.26 samples/sec Loss=1.094104
INFO:root:Epoch[6] Batch [160] Speed: 3873.39 samples/sec Loss=1.112694
INFO:root:Epoch[6] Batch [180] Speed: 3705.29 samples/sec Loss=1.109740
INFO:root:Epoch[6] Batch [200] Speed: 3573.88 samples/sec Loss=1.131038
INFO:root:Epoch[6] Batch [220] Speed: 3703.72 samples/sec Loss=1.125734
INFO:root:Epoch[6] Batch [240] Speed: 3852.51 samples/sec Loss=1.103684
INFO:root:Epoch[6] Batch [260] Speed: 3767.41 samples/sec Loss=1.067906
INFO:root:Epoch[6] Batch [280] Speed: 3437.49 samples/sec Loss=1.090297
INFO:root:Epoch[6] Batch [300] Speed: 3734.64 samples/sec Loss=1.068026
INFO:root:Epoch[6] Batch [320] Speed: 3653.07 samples/sec Loss=1.081505
INFO:root:Epoch[6] Batch [340] Speed: 3743.08 samples/sec Loss=1.080152
INFO:root:Epoch[6] Batch [360] Speed: 3704.77 samples/sec Loss=1.064086
INFO:root:Epoch[6] Batch [380] Speed: 3689.70 samples/sec Loss=1.108270
INFO:root:Epoch[6] Train-Loss=1.156017
INFO:root:Epoch[6] Time cost=13.391
INFO:root:Saved checkpoint to "models/mix-0007.params"
INFO:root:Epoch[6] Validation-Loss=1.055257
INFO:root:Epoch[7] Batch [20] Speed: 3644.92 samples/sec Loss=1.076870
INFO:root:Epoch[7] Batch [40] Speed: 3573.06 samples/sec Loss=1.093684
INFO:root:Epoch[7] Batch [60] Speed: 3614.53 samples/sec Loss=1.088447
INFO:root:Epoch[7] Batch [80] Speed: 3687.66 samples/sec Loss=1.107528
INFO:root:Epoch[7] Batch [100] Speed: 3516.63 samples/sec Loss=1.038797
INFO:root:Epoch[7] Batch [120] Speed: 3628.45 samples/sec Loss=1.112034
INFO:root:Epoch[7] Batch [140] Speed: 3579.50 samples/sec Loss=1.054336
INFO:root:Epoch[7] Batch [160] Speed: 3628.80 samples/sec Loss=1.075198
INFO:root:Epoch[7] Batch [180] Speed: 3750.80 samples/sec Loss=1.070821
INFO:root:Epoch[7] Batch [200] Speed: 3758.67 samples/sec Loss=1.103515
INFO:root:Epoch[7] Batch [220] Speed: 3761.69 samples/sec Loss=1.057294
INFO:root:Epoch[7] Batch [240] Speed: 3574.35 samples/sec Loss=1.087323
INFO:root:Epoch[7] Batch [260] Speed: 3594.90 samples/sec Loss=1.016295
INFO:root:Epoch[7] Batch [280] Speed: 3745.60 samples/sec Loss=1.073605
INFO:root:Epoch[7] Batch [300] Speed: 3716.44 samples/sec Loss=1.115436
INFO:root:Epoch[7] Batch [320] Speed: 3750.53 samples/sec Loss=1.055417
INFO:root:Epoch[7] Batch [340] Speed: 3730.84 samples/sec Loss=1.022167
INFO:root:Epoch[7] Batch [360] Speed: 3863.56 samples/sec Loss=1.029583
INFO:root:Epoch[7] Batch [380] Speed: 3804.20 samples/sec Loss=1.047977
INFO:root:Epoch[7] Train-Loss=1.069231
INFO:root:Epoch[7] Time cost=13.547
INFO:root:Saved checkpoint to "models/mix-0008.params"
INFO:root:Epoch[7] Validation-Loss=1.039080
INFO:root:Epoch[8] Batch [20] Speed: 3659.16 samples/sec Loss=1.040619
INFO:root:Epoch[8] Batch [40] Speed: 3674.42 samples/sec Loss=1.093843
INFO:root:Epoch[8] Batch [60] Speed: 3958.45 samples/sec Loss=1.076453
INFO:root:Epoch[8] Batch [80] Speed: 3902.69 samples/sec Loss=1.068553
INFO:root:Epoch[8] Batch [100] Speed: 3640.48 samples/sec Loss=1.044856
INFO:root:Epoch[8] Batch [120] Speed: 3519.69 samples/sec Loss=1.094577
INFO:root:Epoch[8] Batch [140] Speed: 3432.99 samples/sec Loss=1.092502
INFO:root:Epoch[8] Batch [160] Speed: 3780.72 samples/sec Loss=1.022132
INFO:root:Epoch[8] Batch [180] Speed: 3790.16 samples/sec Loss=1.055206
INFO:root:Epoch[8] Batch [200] Speed: 3637.15 samples/sec Loss=1.043726
INFO:root:Epoch[8] Batch [220] Speed: 3819.40 samples/sec Loss=1.104035
INFO:root:Epoch[8] Batch [240] Speed: 3444.19 samples/sec Loss=1.030135
INFO:root:Epoch[8] Batch [260] Speed: 3662.58 samples/sec Loss=1.035520
INFO:root:Epoch[8] Batch [280] Speed: 3777.17 samples/sec Loss=1.060214
INFO:root:Epoch[8] Batch [300] Speed: 3788.60 samples/sec Loss=1.048097
INFO:root:Epoch[8] Batch [320] Speed: 3866.83 samples/sec Loss=1.043763
INFO:root:Epoch[8] Batch [340] Speed: 3762.22 samples/sec Loss=1.016095
INFO:root:Epoch[8] Batch [360] Speed: 3750.99 samples/sec Loss=1.018011
INFO:root:Epoch[8] Batch [380] Speed: 3879.84 samples/sec Loss=1.015541
INFO:root:Epoch[8] Train-Loss=1.037940
INFO:root:Epoch[8] Time cost=13.518
INFO:root:Saved checkpoint to "models/mix-0009.params"
INFO:root:Epoch[8] Validation-Loss=1.043047
INFO:root:Epoch[9] Batch [20] Speed: 3691.56 samples/sec Loss=1.002852
INFO:root:Epoch[9] Batch [40] Speed: 3912.60 samples/sec Loss=1.021921
INFO:root:Epoch[9] Batch [60] Speed: 3696.57 samples/sec Loss=1.033873
INFO:root:Epoch[9] Batch [80] Speed: 3700.53 samples/sec Loss=1.062614
INFO:root:Epoch[9] Batch [100] Speed: 3782.79 samples/sec Loss=1.030338
INFO:root:Epoch[9] Batch [120] Speed: 3615.58 samples/sec Loss=1.019048
INFO:root:Epoch[9] Batch [140] Speed: 3690.63 samples/sec Loss=1.021395
INFO:root:Epoch[9] Batch [160] Speed: 3862.56 samples/sec Loss=1.053888
INFO:root:Epoch[9] Batch [180] Speed: 3747.37 samples/sec Loss=1.053811
INFO:root:Epoch[9] Batch [200] Speed: 3798.09 samples/sec Loss=0.988264
INFO:root:Epoch[9] Batch [220] Speed: 3756.56 samples/sec Loss=1.019958
INFO:root:Epoch[9] Batch [240] Speed: 3877.94 samples/sec Loss=1.023912
INFO:root:Epoch[9] Batch [260] Speed: 3988.01 samples/sec Loss=1.055933
INFO:root:Epoch[9] Batch [280] Speed: 3920.16 samples/sec Loss=0.994646
INFO:root:Epoch[9] Batch [300] Speed: 3646.93 samples/sec Loss=1.031748
INFO:root:Epoch[9] Batch [320] Speed: 3682.19 samples/sec Loss=0.958390
INFO:root:Epoch[9] Batch [340] Speed: 3681.14 samples/sec Loss=1.050418
INFO:root:Epoch[9] Batch [360] Speed: 3448.60 samples/sec Loss=1.032891
INFO:root:Epoch[9] Batch [380] Speed: 3532.95 samples/sec Loss=1.028187
INFO:root:Update[3901]: Change learning rate to 7.00000e-02
INFO:root:Epoch[9] Train-Loss=1.092662
INFO:root:Epoch[9] Time cost=13.375
INFO:root:Saved checkpoint to "models/mix-0010.params"
INFO:root:Epoch[9] Validation-Loss=1.259423
INFO:root:Epoch[10] Batch [20] Speed: 3824.43 samples/sec Loss=1.072831
INFO:root:Epoch[10] Batch [40] Speed: 3832.36 samples/sec Loss=1.006503
INFO:root:Epoch[10] Batch [60] Speed: 3800.07 samples/sec Loss=0.946848
INFO:root:Epoch[10] Batch [80] Speed: 3420.93 samples/sec Loss=0.920415
INFO:root:Epoch[10] Batch [100] Speed: 3752.17 samples/sec Loss=0.882000
INFO:root:Epoch[10] Batch [120] Speed: 3413.74 samples/sec Loss=0.860655
INFO:root:Epoch[10] Batch [140] Speed: 3712.80 samples/sec Loss=0.863218
INFO:root:Epoch[10] Batch [160] Speed: 3764.45 samples/sec Loss=0.875472
INFO:root:Epoch[10] Batch [180] Speed: 3687.23 samples/sec Loss=0.886291
INFO:root:Epoch[10] Batch [200] Speed: 3530.63 samples/sec Loss=0.866454
INFO:root:Epoch[10] Batch [220] Speed: 3781.96 samples/sec Loss=0.864715
INFO:root:Epoch[10] Batch [240] Speed: 3252.59 samples/sec Loss=0.847931
INFO:root:Epoch[10] Batch [260] Speed: 3731.46 samples/sec Loss=0.854609
INFO:root:Epoch[10] Batch [280] Speed: 3417.71 samples/sec Loss=0.785217
INFO:root:Epoch[10] Batch [300] Speed: 3791.15 samples/sec Loss=0.818709
INFO:root:Epoch[10] Batch [320] Speed: 3827.50 samples/sec Loss=0.832540
INFO:root:Epoch[10] Batch [340] Speed: 3830.00 samples/sec Loss=0.804382
INFO:root:Epoch[10] Batch [360] Speed: 3645.56 samples/sec Loss=0.823427
INFO:root:Epoch[10] Batch [380] Speed: 3748.28 samples/sec Loss=0.794204
INFO:root:Epoch[10] Train-Loss=0.843919
INFO:root:Epoch[10] Time cost=13.608
INFO:root:Saved checkpoint to "models/mix-0011.params"
INFO:root:Epoch[10] Validation-Loss=0.688972
INFO:root:Epoch[11] Batch [20] Speed: 3778.63 samples/sec Loss=0.830588
INFO:root:Epoch[11] Batch [40] Speed: 3633.23 samples/sec Loss=0.843395
INFO:root:Epoch[11] Batch [60] Speed: 3786.71 samples/sec Loss=0.809238
INFO:root:Epoch[11] Batch [80] Speed: 3771.64 samples/sec Loss=0.815546
INFO:root:Epoch[11] Batch [100] Speed: 3817.19 samples/sec Loss=0.817988
INFO:root:Epoch[11] Batch [120] Speed: 3715.36 samples/sec Loss=0.834008
INFO:root:Epoch[11] Batch [140] Speed: 3556.03 samples/sec Loss=0.819849
INFO:root:Epoch[11] Batch [160] Speed: 3799.46 samples/sec Loss=0.802552
INFO:root:Epoch[11] Batch [180] Speed: 3725.76 samples/sec Loss=0.811564
INFO:root:Epoch[11] Batch [200] Speed: 3817.29 samples/sec Loss=0.834967
INFO:root:Epoch[11] Batch [220] Speed: 3711.38 samples/sec Loss=0.769836
INFO:root:Epoch[11] Batch [240] Speed: 3785.67 samples/sec Loss=0.758995
INFO:root:Epoch[11] Batch [260] Speed: 3681.27 samples/sec Loss=0.796761
INFO:root:Epoch[11] Batch [280] Speed: 3736.48 samples/sec Loss=0.797585
INFO:root:Epoch[11] Batch [300] Speed: 3747.76 samples/sec Loss=0.758434
INFO:root:Epoch[11] Batch [320] Speed: 3584.94 samples/sec Loss=0.745678
INFO:root:Epoch[11] Batch [340] Speed: 3761.45 samples/sec Loss=0.767868
INFO:root:Epoch[11] Batch [360] Speed: 3663.50 samples/sec Loss=0.765282
INFO:root:Epoch[11] Batch [380] Speed: 3679.43 samples/sec Loss=0.785436
INFO:root:Epoch[11] Train-Loss=0.791905
INFO:root:Epoch[11] Time cost=13.414
INFO:root:Saved checkpoint to "models/mix-0012.params"
INFO:root:Epoch[11] Validation-Loss=0.647994
INFO:root:Epoch[12] Batch [20] Speed: 3724.08 samples/sec Loss=0.748379
INFO:root:Epoch[12] Batch [40] Speed: 3681.99 samples/sec Loss=0.798602
INFO:root:Epoch[12] Batch [60] Speed: 3700.73 samples/sec Loss=0.820248
INFO:root:Epoch[12] Batch [80] Speed: 3663.45 samples/sec Loss=0.799700
INFO:root:Epoch[12] Batch [100] Speed: 3566.31 samples/sec Loss=0.752740
INFO:root:Epoch[12] Batch [120] Speed: 3688.42 samples/sec Loss=0.792050
INFO:root:Epoch[12] Batch [140] Speed: 3596.77 samples/sec Loss=0.774302
INFO:root:Epoch[12] Batch [160] Speed: 3428.60 samples/sec Loss=0.774190
INFO:root:Epoch[12] Batch [180] Speed: 3697.28 samples/sec Loss=0.773912
INFO:root:Epoch[12] Batch [200] Speed: 3529.68 samples/sec Loss=0.803792
INFO:root:Epoch[12] Batch [220] Speed: 3486.14 samples/sec Loss=0.767819
INFO:root:Epoch[12] Batch [240] Speed: 3604.28 samples/sec Loss=0.755899
INFO:root:Epoch[12] Batch [260] Speed: 3657.84 samples/sec Loss=0.781743
INFO:root:Epoch[12] Batch [280] Speed: 3579.39 samples/sec Loss=0.769826
INFO:root:Epoch[12] Batch [300] Speed: 3545.24 samples/sec Loss=0.742524
INFO:root:Epoch[12] Batch [320] Speed: 3665.77 samples/sec Loss=0.747874
INFO:root:Epoch[12] Batch [340] Speed: 3738.53 samples/sec Loss=0.731062
INFO:root:Epoch[12] Batch [360] Speed: 3549.62 samples/sec Loss=0.756822
INFO:root:Epoch[12] Batch [380] Speed: 3576.27 samples/sec Loss=0.779100
INFO:root:Epoch[12] Train-Loss=0.768203
INFO:root:Epoch[12] Time cost=13.824
INFO:root:Saved checkpoint to "models/mix-0013.params"
INFO:root:Epoch[12] Validation-Loss=0.641387
INFO:root:Epoch[13] Batch [20] Speed: 3482.34 samples/sec Loss=0.729837
INFO:root:Epoch[13] Batch [40] Speed: 3656.77 samples/sec Loss=0.759371
INFO:root:Epoch[13] Batch [60] Speed: 3867.54 samples/sec Loss=0.750554
INFO:root:Epoch[13] Batch [80] Speed: 3747.22 samples/sec Loss=0.739489
INFO:root:Epoch[13] Batch [100] Speed: 3785.23 samples/sec Loss=0.734682
INFO:root:Epoch[13] Batch [120] Speed: 3545.98 samples/sec Loss=0.714964
INFO:root:Epoch[13] Batch [140] Speed: 3607.35 samples/sec Loss=0.768107
INFO:root:Epoch[13] Batch [160] Speed: 3465.56 samples/sec Loss=0.760062
INFO:root:Epoch[13] Batch [180] Speed: 3647.24 samples/sec Loss=0.768595
INFO:root:Epoch[13] Batch [200] Speed: 3668.25 samples/sec Loss=0.777000
INFO:root:Epoch[13] Batch [220] Speed: 3485.12 samples/sec Loss=0.769774
INFO:root:Epoch[13] Batch [240] Speed: 3526.43 samples/sec Loss=0.737425
INFO:root:Epoch[13] Batch [260] Speed: 3613.01 samples/sec Loss=0.765103
INFO:root:Epoch[13] Batch [280] Speed: 3659.39 samples/sec Loss=0.791556
INFO:root:Epoch[13] Batch [300] Speed: 3602.52 samples/sec Loss=0.773750
INFO:root:Epoch[13] Batch [320] Speed: 3747.88 samples/sec Loss=0.739108
INFO:root:Epoch[13] Batch [340] Speed: 3777.99 samples/sec Loss=0.700182
INFO:root:Epoch[13] Batch [360] Speed: 3902.30 samples/sec Loss=0.733064
INFO:root:Epoch[13] Batch [380] Speed: 3882.00 samples/sec Loss=0.718806
INFO:root:Epoch[13] Train-Loss=0.763809
INFO:root:Epoch[13] Time cost=13.602
INFO:root:Saved checkpoint to "models/mix-0014.params"
INFO:root:Epoch[13] Validation-Loss=0.632581
INFO:root:Epoch[14] Batch [20] Speed: 3943.88 samples/sec Loss=0.737646
INFO:root:Epoch[14] Batch [40] Speed: 3717.81 samples/sec Loss=0.727235
INFO:root:Epoch[14] Batch [60] Speed: 3850.70 samples/sec Loss=0.738299
INFO:root:Epoch[14] Batch [80] Speed: 3454.78 samples/sec Loss=0.753188
INFO:root:Epoch[14] Batch [100] Speed: 3701.91 samples/sec Loss=0.739876
INFO:root:Epoch[14] Batch [120] Speed: 3789.76 samples/sec Loss=0.747028
INFO:root:Epoch[14] Batch [140] Speed: 3779.89 samples/sec Loss=0.735578
INFO:root:Epoch[14] Batch [160] Speed: 3785.88 samples/sec Loss=0.755958
INFO:root:Epoch[14] Batch [180] Speed: 3905.94 samples/sec Loss=0.763476
INFO:root:Epoch[14] Batch [200] Speed: 3673.27 samples/sec Loss=0.746332
INFO:root:Epoch[14] Batch [220] Speed: 3870.79 samples/sec Loss=0.727565
INFO:root:Epoch[14] Batch [240] Speed: 3811.92 samples/sec Loss=0.711658
INFO:root:Epoch[14] Batch [260] Speed: 3931.66 samples/sec Loss=0.745204
INFO:root:Epoch[14] Batch [280] Speed: 3685.10 samples/sec Loss=0.754717
INFO:root:Epoch[14] Batch [300] Speed: 3632.85 samples/sec Loss=0.715948
INFO:root:Epoch[14] Batch [320] Speed: 3548.66 samples/sec Loss=0.730964
INFO:root:Epoch[14] Batch [340] Speed: 3583.27 samples/sec Loss=0.720178
INFO:root:Epoch[14] Batch [360] Speed: 3630.42 samples/sec Loss=0.708451
INFO:root:Epoch[14] Batch [380] Speed: 3607.91 samples/sec Loss=0.754175
INFO:root:Epoch[14] Train-Loss=0.750200
INFO:root:Epoch[14] Time cost=13.427
INFO:root:Saved checkpoint to "models/mix-0015.params"
INFO:root:Epoch[14] Validation-Loss=0.641079
INFO:root:Epoch[15] Batch [20] Speed: 3706.26 samples/sec Loss=0.705884
INFO:root:Epoch[15] Batch [40] Speed: 3897.82 samples/sec Loss=0.730481
INFO:root:Epoch[15] Batch [60] Speed: 3763.07 samples/sec Loss=0.709593
INFO:root:Epoch[15] Batch [80] Speed: 3677.81 samples/sec Loss=0.735911
INFO:root:Epoch[15] Batch [100] Speed: 3620.19 samples/sec Loss=0.727126
INFO:root:Epoch[15] Batch [120] Speed: 3596.10 samples/sec Loss=0.732506
INFO:root:Epoch[15] Batch [140] Speed: 3513.87 samples/sec Loss=0.722601
INFO:root:Epoch[15] Batch [160] Speed: 3710.83 samples/sec Loss=0.788057
INFO:root:Epoch[15] Batch [180] Speed: 3797.93 samples/sec Loss=0.742027
INFO:root:Epoch[15] Batch [200] Speed: 3736.34 samples/sec Loss=0.762324
INFO:root:Epoch[15] Batch [220] Speed: 3612.49 samples/sec Loss=0.709811
INFO:root:Epoch[15] Batch [240] Speed: 3486.94 samples/sec Loss=0.728954
INFO:root:Epoch[15] Batch [260] Speed: 3688.00 samples/sec Loss=0.702826
INFO:root:Epoch[15] Batch [280] Speed: 3801.33 samples/sec Loss=0.732500
INFO:root:Epoch[15] Batch [300] Speed: 3768.41 samples/sec Loss=0.748458
INFO:root:Epoch[15] Batch [320] Speed: 3564.55 samples/sec Loss=0.702013
INFO:root:Epoch[15] Batch [340] Speed: 3773.67 samples/sec Loss=0.719786
INFO:root:Epoch[15] Batch [360] Speed: 3797.33 samples/sec Loss=0.713632
INFO:root:Epoch[15] Batch [380] Speed: 3754.94 samples/sec Loss=0.735334
INFO:root:Epoch[15] Train-Loss=0.719853
INFO:root:Epoch[15] Time cost=13.514
INFO:root:Saved checkpoint to "models/mix-0016.params"
INFO:root:Epoch[15] Validation-Loss=0.600065
INFO:root:Epoch[16] Batch [20] Speed: 3968.98 samples/sec Loss=0.735932
INFO:root:Epoch[16] Batch [40] Speed: 3832.86 samples/sec Loss=0.694314
INFO:root:Epoch[16] Batch [60] Speed: 3583.32 samples/sec Loss=0.740239
INFO:root:Epoch[16] Batch [80] Speed: 3469.80 samples/sec Loss=0.728967
INFO:root:Epoch[16] Batch [100] Speed: 3815.01 samples/sec Loss=0.676705
INFO:root:Epoch[16] Batch [120] Speed: 3537.87 samples/sec Loss=0.672961
INFO:root:Epoch[16] Batch [140] Speed: 3530.33 samples/sec Loss=0.696404
INFO:root:Epoch[16] Batch [160] Speed: 3878.79 samples/sec Loss=0.723448
INFO:root:Epoch[16] Batch [180] Speed: 3510.64 samples/sec Loss=0.728308
INFO:root:Epoch[16] Batch [200] Speed: 3903.58 samples/sec Loss=0.723730
INFO:root:Epoch[16] Batch [220] Speed: 3917.34 samples/sec Loss=0.742159
INFO:root:Epoch[16] Batch [240] Speed: 3342.49 samples/sec Loss=0.710198
INFO:root:Epoch[16] Batch [260] Speed: 3585.34 samples/sec Loss=0.736356
INFO:root:Epoch[16] Batch [280] Speed: 3165.84 samples/sec Loss=0.725822
INFO:root:Epoch[16] Batch [300] Speed: 3655.67 samples/sec Loss=0.700933
INFO:root:Epoch[16] Batch [320] Speed: 3733.53 samples/sec Loss=0.708660
INFO:root:Epoch[16] Batch [340] Speed: 3534.36 samples/sec Loss=0.694317
INFO:root:Epoch[16] Batch [360] Speed: 3718.98 samples/sec Loss=0.700591
INFO:root:Epoch[16] Batch [380] Speed: 3673.23 samples/sec Loss=0.709709
INFO:root:Epoch[16] Train-Loss=0.665619
INFO:root:Epoch[16] Time cost=13.789
INFO:root:Saved checkpoint to "models/mix-0017.params"
INFO:root:Epoch[16] Validation-Loss=0.603529
INFO:root:Epoch[17] Batch [20] Speed: 3613.28 samples/sec Loss=0.685660
INFO:root:Epoch[17] Batch [40] Speed: 3674.73 samples/sec Loss=0.727806
INFO:root:Epoch[17] Batch [60] Speed: 3591.75 samples/sec Loss=0.718608
INFO:root:Epoch[17] Batch [80] Speed: 3402.17 samples/sec Loss=0.719782
INFO:root:Epoch[17] Batch [100] Speed: 3571.56 samples/sec Loss=0.730632
INFO:root:Epoch[17] Batch [120] Speed: 3698.25 samples/sec Loss=0.708644
INFO:root:Epoch[17] Batch [140] Speed: 3454.98 samples/sec Loss=0.697764
INFO:root:Epoch[17] Batch [160] Speed: 3522.98 samples/sec Loss=0.711294
INFO:root:Epoch[17] Batch [180] Speed: 3465.95 samples/sec Loss=0.723910
INFO:root:Epoch[17] Batch [200] Speed: 3569.33 samples/sec Loss=0.715741
INFO:root:Epoch[17] Batch [220] Speed: 3547.06 samples/sec Loss=0.722815
INFO:root:Epoch[17] Batch [240] Speed: 3528.79 samples/sec Loss=0.697838
INFO:root:Epoch[17] Batch [260] Speed: 3427.45 samples/sec Loss=0.694388
INFO:root:Epoch[17] Batch [280] Speed: 3652.58 samples/sec Loss=0.671311
INFO:root:Epoch[17] Batch [300] Speed: 3781.38 samples/sec Loss=0.712014
INFO:root:Epoch[17] Batch [320] Speed: 3691.18 samples/sec Loss=0.695748
INFO:root:Epoch[17] Batch [340] Speed: 3927.78 samples/sec Loss=0.677600
INFO:root:Epoch[17] Batch [360] Speed: 3903.56 samples/sec Loss=0.704156
INFO:root:Epoch[17] Batch [380] Speed: 3884.50 samples/sec Loss=0.720849
INFO:root:Epoch[17] Train-Loss=0.686994
INFO:root:Epoch[17] Time cost=13.783
INFO:root:Saved checkpoint to "models/mix-0018.params"
INFO:root:Epoch[17] Validation-Loss=0.607191
INFO:root:Epoch[18] Batch [20] Speed: 3663.51 samples/sec Loss=0.695825
INFO:root:Epoch[18] Batch [40] Speed: 3709.85 samples/sec Loss=0.747032
INFO:root:Epoch[18] Batch [60] Speed: 3779.13 samples/sec Loss=0.723795
INFO:root:Epoch[18] Batch [80] Speed: 3532.83 samples/sec Loss=0.689386
INFO:root:Epoch[18] Batch [100] Speed: 3724.69 samples/sec Loss=0.677721
INFO:root:Epoch[18] Batch [120] Speed: 3809.13 samples/sec Loss=0.710812
INFO:root:Epoch[18] Batch [140] Speed: 3529.58 samples/sec Loss=0.706600
INFO:root:Epoch[18] Batch [160] Speed: 3372.63 samples/sec Loss=0.690961
INFO:root:Epoch[18] Batch [180] Speed: 3799.84 samples/sec Loss=0.700694
INFO:root:Epoch[18] Batch [200] Speed: 3558.18 samples/sec Loss=0.708840
INFO:root:Epoch[18] Batch [220] Speed: 3808.00 samples/sec Loss=0.704469
INFO:root:Epoch[18] Batch [240] Speed: 3887.61 samples/sec Loss=0.691975
INFO:root:Epoch[18] Batch [260] Speed: 3667.17 samples/sec Loss=0.729394
INFO:root:Epoch[18] Batch [280] Speed: 3824.70 samples/sec Loss=0.679403
INFO:root:Epoch[18] Batch [300] Speed: 3696.64 samples/sec Loss=0.723979
INFO:root:Epoch[18] Batch [320] Speed: 3780.03 samples/sec Loss=0.701124
INFO:root:Epoch[18] Batch [340] Speed: 3748.37 samples/sec Loss=0.696224
INFO:root:Epoch[18] Batch [360] Speed: 3602.01 samples/sec Loss=0.663673
INFO:root:Epoch[18] Batch [380] Speed: 3663.83 samples/sec Loss=0.702105
INFO:root:Epoch[18] Train-Loss=0.696073
INFO:root:Epoch[18] Time cost=13.533
INFO:root:Saved checkpoint to "models/mix-0019.params"
INFO:root:Epoch[18] Validation-Loss=0.632939
INFO:root:Epoch[19] Batch [20] Speed: 3823.49 samples/sec Loss=0.699279
INFO:root:Epoch[19] Batch [40] Speed: 3394.97 samples/sec Loss=0.699084
INFO:root:Epoch[19] Batch [60] Speed: 3718.27 samples/sec Loss=0.721312
INFO:root:Epoch[19] Batch [80] Speed: 3603.45 samples/sec Loss=0.668331
INFO:root:Epoch[19] Batch [100] Speed: 3831.17 samples/sec Loss=0.724077
INFO:root:Epoch[19] Batch [120] Speed: 3724.12 samples/sec Loss=0.711194
INFO:root:Epoch[19] Batch [140] Speed: 3543.97 samples/sec Loss=0.689208
INFO:root:Epoch[19] Batch [160] Speed: 3646.01 samples/sec Loss=0.680505
INFO:root:Epoch[19] Batch [180] Speed: 3705.99 samples/sec Loss=0.716271
INFO:root:Epoch[19] Batch [200] Speed: 3671.01 samples/sec Loss=0.717763
INFO:root:Epoch[19] Batch [220] Speed: 3720.40 samples/sec Loss=0.717338
INFO:root:Epoch[19] Batch [240] Speed: 3656.30 samples/sec Loss=0.725808
INFO:root:Epoch[19] Batch [260] Speed: 3622.15 samples/sec Loss=0.729790
INFO:root:Epoch[19] Batch [280] Speed: 3711.79 samples/sec Loss=0.705218
INFO:root:Epoch[19] Batch [300] Speed: 3242.95 samples/sec Loss=0.683478
INFO:root:Epoch[19] Batch [320] Speed: 3648.69 samples/sec Loss=0.684137
INFO:root:Epoch[19] Batch [340] Speed: 2979.12 samples/sec Loss=0.679259
INFO:root:Epoch[19] Batch [360] Speed: 3774.20 samples/sec Loss=0.682486
INFO:root:Epoch[19] Batch [380] Speed: 3733.67 samples/sec Loss=0.681961
INFO:root:Epoch[19] Train-Loss=0.683937
INFO:root:Epoch[19] Time cost=13.873
INFO:root:Saved checkpoint to "models/mix-0020.params"
INFO:root:Epoch[19] Validation-Loss=0.608800
INFO:root:Epoch[20] Batch [20] Speed: 3732.10 samples/sec Loss=0.669203
INFO:root:Epoch[20] Batch [40] Speed: 3781.98 samples/sec Loss=0.675592
INFO:root:Epoch[20] Batch [60] Speed: 3730.67 samples/sec Loss=0.681351
INFO:root:Epoch[20] Batch [80] Speed: 3733.52 samples/sec Loss=0.690328
INFO:root:Epoch[20] Batch [100] Speed: 3772.05 samples/sec Loss=0.690922
INFO:root:Epoch[20] Batch [120] Speed: 3757.09 samples/sec Loss=0.682715
INFO:root:Epoch[20] Batch [140] Speed: 3820.35 samples/sec Loss=0.656756
INFO:root:Epoch[20] Batch [160] Speed: 3729.84 samples/sec Loss=0.681585
INFO:root:Epoch[20] Batch [180] Speed: 3673.78 samples/sec Loss=0.708750
INFO:root:Epoch[20] Batch [200] Speed: 3857.04 samples/sec Loss=0.705855
INFO:root:Epoch[20] Batch [220] Speed: 3768.47 samples/sec Loss=0.670684
INFO:root:Epoch[20] Batch [240] Speed: 3773.01 samples/sec Loss=0.696488
INFO:root:Epoch[20] Batch [260] Speed: 3656.47 samples/sec Loss=0.642127
INFO:root:Epoch[20] Batch [280] Speed: 3805.74 samples/sec Loss=0.683180
INFO:root:Epoch[20] Batch [300] Speed: 3622.31 samples/sec Loss=0.704840
INFO:root:Epoch[20] Batch [320] Speed: 3720.89 samples/sec Loss=0.634942
INFO:root:Epoch[20] Batch [340] Speed: 3866.70 samples/sec Loss=0.685557
INFO:root:Epoch[20] Batch [360] Speed: 3760.15 samples/sec Loss=0.680847
INFO:root:Epoch[20] Batch [380] Speed: 3798.38 samples/sec Loss=0.651922
INFO:root:Epoch[20] Train-Loss=0.722950
INFO:root:Epoch[20] Time cost=13.331
INFO:root:Saved checkpoint to "models/mix-0021.params"
INFO:root:Epoch[20] Validation-Loss=0.604060
INFO:root:Epoch[21] Batch [20] Speed: 3864.72 samples/sec Loss=0.672576
INFO:root:Epoch[21] Batch [40] Speed: 3613.15 samples/sec Loss=0.673750
INFO:root:Epoch[21] Batch [60] Speed: 3899.19 samples/sec Loss=0.693991
INFO:root:Epoch[21] Batch [80] Speed: 3828.25 samples/sec Loss=0.685390
INFO:root:Epoch[21] Batch [100] Speed: 3819.01 samples/sec Loss=0.674414
INFO:root:Epoch[21] Batch [120] Speed: 3622.09 samples/sec Loss=0.674786
INFO:root:Epoch[21] Batch [140] Speed: 3655.64 samples/sec Loss=0.666751
INFO:root:Epoch[21] Batch [160] Speed: 3844.02 samples/sec Loss=0.675219
INFO:root:Epoch[21] Batch [180] Speed: 3836.07 samples/sec Loss=0.686592
INFO:root:Epoch[21] Batch [200] Speed: 3698.93 samples/sec Loss=0.742442
INFO:root:Epoch[21] Batch [220] Speed: 3739.43 samples/sec Loss=0.664004
INFO:root:Epoch[21] Batch [240] Speed: 3902.35 samples/sec Loss=0.664971
INFO:root:Epoch[21] Batch [260] Speed: 3677.65 samples/sec Loss=0.659759
INFO:root:Epoch[21] Batch [280] Speed: 3830.59 samples/sec Loss=0.716063
INFO:root:Epoch[21] Batch [300] Speed: 3764.41 samples/sec Loss=0.685093
INFO:root:Epoch[21] Batch [320] Speed: 3661.88 samples/sec Loss=0.693950
INFO:root:Epoch[21] Batch [340] Speed: 3683.39 samples/sec Loss=0.666198
INFO:root:Epoch[21] Batch [360] Speed: 3574.56 samples/sec Loss=0.684066
INFO:root:Epoch[21] Batch [380] Speed: 3867.50 samples/sec Loss=0.682298
INFO:root:Epoch[21] Train-Loss=0.703121
INFO:root:Epoch[21] Time cost=13.265
INFO:root:Saved checkpoint to "models/mix-0022.params"
INFO:root:Epoch[21] Validation-Loss=0.606019
INFO:root:Epoch[22] Batch [20] Speed: 3602.77 samples/sec Loss=0.684511
INFO:root:Epoch[22] Batch [40] Speed: 3700.50 samples/sec Loss=0.669285
INFO:root:Epoch[22] Batch [60] Speed: 3563.08 samples/sec Loss=0.652102
INFO:root:Epoch[22] Batch [80] Speed: 3681.94 samples/sec Loss=0.707289
INFO:root:Epoch[22] Batch [100] Speed: 3653.30 samples/sec Loss=0.690833
INFO:root:Epoch[22] Batch [120] Speed: 3651.66 samples/sec Loss=0.685126
INFO:root:Epoch[22] Batch [140] Speed: 3488.30 samples/sec Loss=0.709031
INFO:root:Epoch[22] Batch [160] Speed: 3812.29 samples/sec Loss=0.679855
INFO:root:Epoch[22] Batch [180] Speed: 3685.57 samples/sec Loss=0.687029
INFO:root:Epoch[22] Batch [200] Speed: 3645.75 samples/sec Loss=0.722735
INFO:root:Epoch[22] Batch [220] Speed: 3574.07 samples/sec Loss=0.646748
INFO:root:Epoch[22] Batch [240] Speed: 3721.64 samples/sec Loss=0.656498
INFO:root:Epoch[22] Batch [260] Speed: 3643.38 samples/sec Loss=0.696684
INFO:root:Epoch[22] Batch [280] Speed: 3615.66 samples/sec Loss=0.687397
INFO:root:Epoch[22] Batch [300] Speed: 3498.55 samples/sec Loss=0.638067
INFO:root:Epoch[22] Batch [320] Speed: 3576.55 samples/sec Loss=0.658924
INFO:root:Epoch[22] Batch [340] Speed: 3679.85 samples/sec Loss=0.651855
INFO:root:Epoch[22] Batch [360] Speed: 3779.10 samples/sec Loss=0.681393
INFO:root:Epoch[22] Batch [380] Speed: 3708.76 samples/sec Loss=0.656917
INFO:root:Epoch[22] Train-Loss=0.708877
INFO:root:Epoch[22] Time cost=13.719
INFO:root:Saved checkpoint to "models/mix-0023.params"
INFO:root:Epoch[22] Validation-Loss=0.595694
INFO:root:Epoch[23] Batch [20] Speed: 3748.77 samples/sec Loss=0.681722
INFO:root:Epoch[23] Batch [40] Speed: 3736.62 samples/sec Loss=0.657191
INFO:root:Epoch[23] Batch [60] Speed: 3701.03 samples/sec Loss=0.653217
INFO:root:Epoch[23] Batch [80] Speed: 3652.69 samples/sec Loss=0.679831
INFO:root:Epoch[23] Batch [100] Speed: 3830.13 samples/sec Loss=0.665441
INFO:root:Epoch[23] Batch [120] Speed: 3783.29 samples/sec Loss=0.673747
INFO:root:Epoch[23] Batch [140] Speed: 3615.37 samples/sec Loss=0.670311
INFO:root:Epoch[23] Batch [160] Speed: 3602.59 samples/sec Loss=0.676587
INFO:root:Epoch[23] Batch [180] Speed: 3580.63 samples/sec Loss=0.629261
INFO:root:Epoch[23] Batch [200] Speed: 3595.17 samples/sec Loss=0.697040
INFO:root:Epoch[23] Batch [220] Speed: 3557.66 samples/sec Loss=0.664504
INFO:root:Epoch[23] Batch [240] Speed: 3549.79 samples/sec Loss=0.651793
INFO:root:Epoch[23] Batch [260] Speed: 3688.16 samples/sec Loss=0.648848
INFO:root:Epoch[23] Batch [280] Speed: 3618.77 samples/sec Loss=0.681512
INFO:root:Epoch[23] Batch [300] Speed: 3546.33 samples/sec Loss=0.659922
INFO:root:Epoch[23] Batch [320] Speed: 3582.01 samples/sec Loss=0.664819
INFO:root:Epoch[23] Batch [340] Speed: 3617.69 samples/sec Loss=0.640244
INFO:root:Epoch[23] Batch [360] Speed: 3461.88 samples/sec Loss=0.647905
INFO:root:Epoch[23] Batch [380] Speed: 3701.65 samples/sec Loss=0.669959
INFO:root:Epoch[23] Train-Loss=0.712996
INFO:root:Epoch[23] Time cost=13.726
INFO:root:Saved checkpoint to "models/mix-0024.params"
INFO:root:Epoch[23] Validation-Loss=0.598990
INFO:root:Epoch[24] Batch [20] Speed: 3584.32 samples/sec Loss=0.633332
INFO:root:Epoch[24] Batch [40] Speed: 3566.54 samples/sec Loss=0.665724
INFO:root:Epoch[24] Batch [60] Speed: 3610.03 samples/sec Loss=0.700031
INFO:root:Epoch[24] Batch [80] Speed: 3623.13 samples/sec Loss=0.657693
INFO:root:Epoch[24] Batch [100] Speed: 3513.39 samples/sec Loss=0.693871
INFO:root:Epoch[24] Batch [120] Speed: 3729.43 samples/sec Loss=0.644958
INFO:root:Epoch[24] Batch [140] Speed: 3537.08 samples/sec Loss=0.659392
INFO:root:Epoch[24] Batch [160] Speed: 3761.81 samples/sec Loss=0.650677
INFO:root:Epoch[24] Batch [180] Speed: 3686.20 samples/sec Loss=0.668892
INFO:root:Epoch[24] Batch [200] Speed: 3800.22 samples/sec Loss=0.690001
INFO:root:Epoch[24] Batch [220] Speed: 3597.87 samples/sec Loss=0.657516
INFO:root:Epoch[24] Batch [240] Speed: 3616.71 samples/sec Loss=0.658360
INFO:root:Epoch[24] Batch [260] Speed: 3582.03 samples/sec Loss=0.649148
INFO:root:Epoch[24] Batch [280] Speed: 3776.12 samples/sec Loss=0.682126
INFO:root:Epoch[24] Batch [300] Speed: 3773.03 samples/sec Loss=0.654196
INFO:root:Epoch[24] Batch [320] Speed: 3623.88 samples/sec Loss=0.673249
INFO:root:Epoch[24] Batch [340] Speed: 3808.76 samples/sec Loss=0.684663
INFO:root:Epoch[24] Batch [360] Speed: 3731.43 samples/sec Loss=0.642683
INFO:root:Epoch[24] Batch [380] Speed: 3859.04 samples/sec Loss=0.671540
INFO:root:Epoch[24] Train-Loss=0.612882
INFO:root:Epoch[24] Time cost=13.641
INFO:root:Saved checkpoint to "models/mix-0025.params"
INFO:root:Epoch[24] Validation-Loss=0.607116
INFO:root:Epoch[25] Batch [20] Speed: 3581.60 samples/sec Loss=0.690174
INFO:root:Epoch[25] Batch [40] Speed: 3777.81 samples/sec Loss=0.686793
INFO:root:Epoch[25] Batch [60] Speed: 3478.33 samples/sec Loss=0.653801
INFO:root:Epoch[25] Batch [80] Speed: 3737.82 samples/sec Loss=0.671012
INFO:root:Epoch[25] Batch [100] Speed: 3840.27 samples/sec Loss=0.662007
INFO:root:Epoch[25] Batch [120] Speed: 3858.65 samples/sec Loss=0.651625
INFO:root:Epoch[25] Batch [140] Speed: 3767.64 samples/sec Loss=0.655797
INFO:root:Epoch[25] Batch [160] Speed: 3925.09 samples/sec Loss=0.656160
INFO:root:Epoch[25] Batch [180] Speed: 3514.81 samples/sec Loss=0.663374
INFO:root:Epoch[25] Batch [200] Speed: 3703.45 samples/sec Loss=0.664415
INFO:root:Epoch[25] Batch [220] Speed: 3596.13 samples/sec Loss=0.637962
INFO:root:Epoch[25] Batch [240] Speed: 3503.77 samples/sec Loss=0.648526
INFO:root:Epoch[25] Batch [260] Speed: 3557.34 samples/sec Loss=0.649939
INFO:root:Epoch[25] Batch [280] Speed: 3556.56 samples/sec Loss=0.649157
INFO:root:Epoch[25] Batch [300] Speed: 3637.16 samples/sec Loss=0.663754
INFO:root:Epoch[25] Batch [320] Speed: 3606.29 samples/sec Loss=0.682470
INFO:root:Epoch[25] Batch [340] Speed: 3690.03 samples/sec Loss=0.641545
INFO:root:Epoch[25] Batch [360] Speed: 3682.92 samples/sec Loss=0.655575
INFO:root:Epoch[25] Batch [380] Speed: 3610.11 samples/sec Loss=0.622896
INFO:root:Epoch[25] Train-Loss=0.644857
INFO:root:Epoch[25] Time cost=13.672
INFO:root:Saved checkpoint to "models/mix-0026.params"
INFO:root:Epoch[25] Validation-Loss=0.597118
INFO:root:Epoch[26] Batch [20] Speed: 3612.98 samples/sec Loss=0.637669
INFO:root:Epoch[26] Batch [40] Speed: 3699.13 samples/sec Loss=0.625864
INFO:root:Epoch[26] Batch [60] Speed: 3691.28 samples/sec Loss=0.684270
INFO:root:Epoch[26] Batch [80] Speed: 3707.72 samples/sec Loss=0.671037
INFO:root:Epoch[26] Batch [100] Speed: 3715.28 samples/sec Loss=0.625016
INFO:root:Epoch[26] Batch [120] Speed: 3671.39 samples/sec Loss=0.659072
INFO:root:Epoch[26] Batch [140] Speed: 3643.68 samples/sec Loss=0.641622
INFO:root:Epoch[26] Batch [160] Speed: 3692.86 samples/sec Loss=0.645970
INFO:root:Epoch[26] Batch [180] Speed: 3699.29 samples/sec Loss=0.648427
INFO:root:Epoch[26] Batch [200] Speed: 3619.46 samples/sec Loss=0.698852
INFO:root:Epoch[26] Batch [220] Speed: 3804.66 samples/sec Loss=0.675220
INFO:root:Epoch[26] Batch [240] Speed: 3875.25 samples/sec Loss=0.635191
INFO:root:Epoch[26] Batch [260] Speed: 3723.09 samples/sec Loss=0.661473
INFO:root:Epoch[26] Batch [280] Speed: 3527.42 samples/sec Loss=0.675411
INFO:root:Epoch[26] Batch [300] Speed: 3771.89 samples/sec Loss=0.677173
INFO:root:Epoch[26] Batch [320] Speed: 3588.31 samples/sec Loss=0.653368
INFO:root:Epoch[26] Batch [340] Speed: 3553.74 samples/sec Loss=0.643478
INFO:root:Epoch[26] Batch [360] Speed: 3624.51 samples/sec Loss=0.653358
INFO:root:Epoch[26] Batch [380] Speed: 3696.95 samples/sec Loss=0.642817
INFO:root:Epoch[26] Train-Loss=0.596165
INFO:root:Epoch[26] Time cost=13.529
INFO:root:Saved checkpoint to "models/mix-0027.params"
INFO:root:Epoch[26] Validation-Loss=0.591280
INFO:root:Epoch[27] Batch [20] Speed: 3643.41 samples/sec Loss=0.648418
INFO:root:Epoch[27] Batch [40] Speed: 3805.28 samples/sec Loss=0.662840
INFO:root:Epoch[27] Batch [60] Speed: 3615.87 samples/sec Loss=0.671189
INFO:root:Epoch[27] Batch [80] Speed: 3670.15 samples/sec Loss=0.632504
INFO:root:Epoch[27] Batch [100] Speed: 3767.40 samples/sec Loss=0.653318
INFO:root:Epoch[27] Batch [120] Speed: 3667.82 samples/sec Loss=0.643342
INFO:root:Epoch[27] Batch [140] Speed: 3797.40 samples/sec Loss=0.643276
INFO:root:Epoch[27] Batch [160] Speed: 3527.00 samples/sec Loss=0.645022
INFO:root:Epoch[27] Batch [180] Speed: 3690.73 samples/sec Loss=0.672473
INFO:root:Epoch[27] Batch [200] Speed: 3437.43 samples/sec Loss=0.666129
INFO:root:Epoch[27] Batch [220] Speed: 3402.66 samples/sec Loss=0.656062
INFO:root:Epoch[27] Batch [240] Speed: 3524.99 samples/sec Loss=0.631983
INFO:root:Epoch[27] Batch [260] Speed: 3475.86 samples/sec Loss=0.645308
INFO:root:Epoch[27] Batch [280] Speed: 3551.04 samples/sec Loss=0.674180
INFO:root:Epoch[27] Batch [300] Speed: 3682.21 samples/sec Loss=0.647653
INFO:root:Epoch[27] Batch [320] Speed: 3485.67 samples/sec Loss=0.615759
INFO:root:Epoch[27] Batch [340] Speed: 3633.70 samples/sec Loss=0.653333
INFO:root:Epoch[27] Batch [360] Speed: 3505.79 samples/sec Loss=0.655787
INFO:root:Epoch[27] Batch [380] Speed: 3444.17 samples/sec Loss=0.639463
INFO:root:Epoch[27] Train-Loss=0.694287
INFO:root:Epoch[27] Time cost=13.905
INFO:root:Saved checkpoint to "models/mix-0028.params"
INFO:root:Epoch[27] Validation-Loss=0.646846
INFO:root:Epoch[28] Batch [20] Speed: 3647.99 samples/sec Loss=0.672475
INFO:root:Epoch[28] Batch [40] Speed: 3579.98 samples/sec Loss=0.663809
INFO:root:Epoch[28] Batch [60] Speed: 3746.06 samples/sec Loss=0.670974
INFO:root:Epoch[28] Batch [80] Speed: 3759.60 samples/sec Loss=0.686594
INFO:root:Epoch[28] Batch [100] Speed: 3601.15 samples/sec Loss=0.607449
INFO:root:Epoch[28] Batch [120] Speed: 3743.25 samples/sec Loss=0.630542
INFO:root:Epoch[28] Batch [140] Speed: 3746.58 samples/sec Loss=0.606512
INFO:root:Epoch[28] Batch [160] Speed: 3651.31 samples/sec Loss=0.648239
INFO:root:Epoch[28] Batch [180] Speed: 3803.27 samples/sec Loss=0.678287
INFO:root:Epoch[28] Batch [200] Speed: 3686.86 samples/sec Loss=0.655637
INFO:root:Epoch[28] Batch [220] Speed: 3811.32 samples/sec Loss=0.635419
INFO:root:Epoch[28] Batch [240] Speed: 3814.87 samples/sec Loss=0.645598
INFO:root:Epoch[28] Batch [260] Speed: 3617.71 samples/sec Loss=0.655814
INFO:root:Epoch[28] Batch [280] Speed: 3659.86 samples/sec Loss=0.652595
INFO:root:Epoch[28] Batch [300] Speed: 3856.57 samples/sec Loss=0.656809
INFO:root:Epoch[28] Batch [320] Speed: 3543.10 samples/sec Loss=0.662520
INFO:root:Epoch[28] Batch [340] Speed: 3583.25 samples/sec Loss=0.670149
INFO:root:Epoch[28] Batch [360] Speed: 3557.59 samples/sec Loss=0.624298
INFO:root:Epoch[28] Batch [380] Speed: 3513.89 samples/sec Loss=0.645003
INFO:root:Epoch[28] Train-Loss=0.622348
INFO:root:Epoch[28] Time cost=13.594
INFO:root:Saved checkpoint to "models/mix-0029.params"
INFO:root:Epoch[28] Validation-Loss=0.610760
INFO:root:Epoch[29] Batch [20] Speed: 3849.49 samples/sec Loss=0.649731
INFO:root:Epoch[29] Batch [40] Speed: 3457.60 samples/sec Loss=0.683057
INFO:root:Epoch[29] Batch [60] Speed: 3726.70 samples/sec Loss=0.660960
INFO:root:Epoch[29] Batch [80] Speed: 3520.72 samples/sec Loss=0.679623
INFO:root:Epoch[29] Batch [100] Speed: 3655.47 samples/sec Loss=0.625817
INFO:root:Epoch[29] Batch [120] Speed: 3549.00 samples/sec Loss=0.619498
INFO:root:Epoch[29] Batch [140] Speed: 3616.33 samples/sec Loss=0.663334
INFO:root:Epoch[29] Batch [160] Speed: 3660.79 samples/sec Loss=0.646300
INFO:root:Epoch[29] Batch [180] Speed: 3740.42 samples/sec Loss=0.670646
INFO:root:Epoch[29] Batch [200] Speed: 3632.70 samples/sec Loss=0.684066
INFO:root:Epoch[29] Batch [220] Speed: 3642.26 samples/sec Loss=0.638562
INFO:root:Epoch[29] Batch [240] Speed: 3743.15 samples/sec Loss=0.642081
INFO:root:Epoch[29] Batch [260] Speed: 3621.48 samples/sec Loss=0.636738
INFO:root:Epoch[29] Batch [280] Speed: 3424.10 samples/sec Loss=0.626941
INFO:root:Epoch[29] Batch [300] Speed: 3589.47 samples/sec Loss=0.633008
INFO:root:Epoch[29] Batch [320] Speed: 3799.08 samples/sec Loss=0.650467
INFO:root:Epoch[29] Batch [340] Speed: 3808.70 samples/sec Loss=0.620939
INFO:root:Epoch[29] Batch [360] Speed: 3591.60 samples/sec Loss=0.610961
INFO:root:Epoch[29] Batch [380] Speed: 3820.25 samples/sec Loss=0.640997
INFO:root:Epoch[29] Train-Loss=0.608110
INFO:root:Epoch[29] Time cost=13.647
INFO:root:Saved checkpoint to "models/mix-0030.params"
INFO:root:Epoch[29] Validation-Loss=0.617081
INFO:root:Epoch[30] Batch [20] Speed: 3796.07 samples/sec Loss=0.607404
INFO:root:Epoch[30] Batch [40] Speed: 3812.75 samples/sec Loss=0.658630
INFO:root:Epoch[30] Batch [60] Speed: 3794.30 samples/sec Loss=0.670070
INFO:root:Epoch[30] Batch [80] Speed: 3741.16 samples/sec Loss=0.651859
INFO:root:Epoch[30] Batch [100] Speed: 3820.76 samples/sec Loss=0.660285
INFO:root:Epoch[30] Batch [120] Speed: 3596.22 samples/sec Loss=0.631769
INFO:root:Epoch[30] Batch [140] Speed: 3822.70 samples/sec Loss=0.627549
INFO:root:Epoch[30] Batch [160] Speed: 3786.60 samples/sec Loss=0.692115
INFO:root:Epoch[30] Batch [180] Speed: 3823.50 samples/sec Loss=0.629304
INFO:root:Epoch[30] Batch [200] Speed: 3579.56 samples/sec Loss=0.687995
INFO:root:Epoch[30] Batch [220] Speed: 3472.30 samples/sec Loss=0.618445
INFO:root:Epoch[30] Batch [240] Speed: 3598.06 samples/sec Loss=0.616496
INFO:root:Epoch[30] Batch [260] Speed: 3650.22 samples/sec Loss=0.645008
INFO:root:Epoch[30] Batch [280] Speed: 3606.84 samples/sec Loss=0.667066
INFO:root:Epoch[30] Batch [300] Speed: 3527.16 samples/sec Loss=0.641928
INFO:root:Epoch[30] Batch [320] Speed: 3593.47 samples/sec Loss=0.665880
INFO:root:Epoch[30] Batch [340] Speed: 3610.10 samples/sec Loss=0.610400
INFO:root:Epoch[30] Batch [360] Speed: 3729.79 samples/sec Loss=0.617079
INFO:root:Epoch[30] Batch [380] Speed: 3719.46 samples/sec Loss=0.662507
INFO:root:Epoch[30] Train-Loss=0.638191
INFO:root:Epoch[30] Time cost=13.547
INFO:root:Saved checkpoint to "models/mix-0031.params"
INFO:root:Epoch[30] Validation-Loss=0.764387
INFO:root:Epoch[31] Batch [20] Speed: 3537.91 samples/sec Loss=0.660907
INFO:root:Epoch[31] Batch [40] Speed: 3875.47 samples/sec Loss=0.651465
INFO:root:Epoch[31] Batch [60] Speed: 3743.84 samples/sec Loss=0.663917
INFO:root:Epoch[31] Batch [80] Speed: 3638.65 samples/sec Loss=0.676285
INFO:root:Epoch[31] Batch [100] Speed: 3412.42 samples/sec Loss=0.670103
INFO:root:Epoch[31] Batch [120] Speed: 3627.04 samples/sec Loss=0.659204
INFO:root:Epoch[31] Batch [140] Speed: 3616.38 samples/sec Loss=0.658986
INFO:root:Epoch[31] Batch [160] Speed: 3697.62 samples/sec Loss=0.639175
INFO:root:Epoch[31] Batch [180] Speed: 3590.19 samples/sec Loss=0.663754
INFO:root:Epoch[31] Batch [200] Speed: 3650.31 samples/sec Loss=0.673908
INFO:root:Epoch[31] Batch [220] Speed: 3758.14 samples/sec Loss=0.616215
INFO:root:Epoch[31] Batch [240] Speed: 3712.63 samples/sec Loss=0.627443
INFO:root:Epoch[31] Batch [260] Speed: 3761.92 samples/sec Loss=0.637336
INFO:root:Epoch[31] Batch [280] Speed: 3688.99 samples/sec Loss=0.670167
INFO:root:Epoch[31] Batch [300] Speed: 3612.09 samples/sec Loss=0.639677
INFO:root:Epoch[31] Batch [320] Speed: 3737.07 samples/sec Loss=0.667430
INFO:root:Epoch[31] Batch [340] Speed: 3663.81 samples/sec Loss=0.599611
INFO:root:Epoch[31] Batch [360] Speed: 3742.88 samples/sec Loss=0.623044
INFO:root:Epoch[31] Batch [380] Speed: 3722.85 samples/sec Loss=0.639271
INFO:root:Epoch[31] Train-Loss=0.659612
INFO:root:Epoch[31] Time cost=13.610
INFO:root:Saved checkpoint to "models/mix-0032.params"
INFO:root:Epoch[31] Validation-Loss=0.562825
INFO:root:Epoch[32] Batch [20] Speed: 3431.35 samples/sec Loss=0.651906
INFO:root:Epoch[32] Batch [40] Speed: 3727.31 samples/sec Loss=0.643506
INFO:root:Epoch[32] Batch [60] Speed: 3721.82 samples/sec Loss=0.659033
INFO:root:Epoch[32] Batch [80] Speed: 3639.86 samples/sec Loss=0.639989
INFO:root:Epoch[32] Batch [100] Speed: 3759.77 samples/sec Loss=0.633572
INFO:root:Epoch[32] Batch [120] Speed: 3514.94 samples/sec Loss=0.596856
INFO:root:Epoch[32] Batch [140] Speed: 3699.95 samples/sec Loss=0.634029
INFO:root:Epoch[32] Batch [160] Speed: 3688.11 samples/sec Loss=0.641810
INFO:root:Epoch[32] Batch [180] Speed: 3877.21 samples/sec Loss=0.635941
INFO:root:Epoch[32] Batch [200] Speed: 3984.24 samples/sec Loss=0.670311
INFO:root:Epoch[32] Batch [220] Speed: 3809.81 samples/sec Loss=0.664873
INFO:root:Epoch[32] Batch [240] Speed: 3608.66 samples/sec Loss=0.615863
INFO:root:Epoch[32] Batch [260] Speed: 3698.44 samples/sec Loss=0.644148
INFO:root:Epoch[32] Batch [280] Speed: 3644.11 samples/sec Loss=0.667939
INFO:root:Epoch[32] Batch [300] Speed: 3434.88 samples/sec Loss=0.647372
INFO:root:Epoch[32] Batch [320] Speed: 3620.86 samples/sec Loss=0.639525
INFO:root:Epoch[32] Batch [340] Speed: 3637.06 samples/sec Loss=0.643836
INFO:root:Epoch[32] Batch [360] Speed: 3616.21 samples/sec Loss=0.616895
INFO:root:Epoch[32] Batch [380] Speed: 3538.94 samples/sec Loss=0.637095
INFO:root:Epoch[32] Train-Loss=0.642991
INFO:root:Epoch[32] Time cost=13.711
INFO:root:Saved checkpoint to "models/mix-0033.params"
INFO:root:Epoch[32] Validation-Loss=0.605553
INFO:root:Epoch[33] Batch [20] Speed: 3515.24 samples/sec Loss=0.610355
INFO:root:Epoch[33] Batch [40] Speed: 3838.91 samples/sec Loss=0.627178
INFO:root:Epoch[33] Batch [60] Speed: 3839.88 samples/sec Loss=0.664483
INFO:root:Epoch[33] Batch [80] Speed: 3805.15 samples/sec Loss=0.641994
INFO:root:Epoch[33] Batch [100] Speed: 3748.69 samples/sec Loss=0.641636
INFO:root:Epoch[33] Batch [120] Speed: 3640.00 samples/sec Loss=0.662607
INFO:root:Epoch[33] Batch [140] Speed: 3823.52 samples/sec Loss=0.625638
INFO:root:Epoch[33] Batch [160] Speed: 3696.25 samples/sec Loss=0.659712
INFO:root:Epoch[33] Batch [180] Speed: 3764.36 samples/sec Loss=0.629803
INFO:root:Epoch[33] Batch [200] Speed: 3634.40 samples/sec Loss=0.651264
INFO:root:Epoch[33] Batch [220] Speed: 3733.36 samples/sec Loss=0.634479
INFO:root:Epoch[33] Batch [240] Speed: 3692.86 samples/sec Loss=0.644774
INFO:root:Epoch[33] Batch [260] Speed: 3642.85 samples/sec Loss=0.627686
INFO:root:Epoch[33] Batch [280] Speed: 3680.75 samples/sec Loss=0.628490
INFO:root:Epoch[33] Batch [300] Speed: 3819.72 samples/sec Loss=0.634190
INFO:root:Epoch[33] Batch [320] Speed: 3498.07 samples/sec Loss=0.577682
INFO:root:Epoch[33] Batch [340] Speed: 3681.27 samples/sec Loss=0.626454
INFO:root:Epoch[33] Batch [360] Speed: 3589.59 samples/sec Loss=0.611474
INFO:root:Epoch[33] Batch [380] Speed: 3633.06 samples/sec Loss=0.644315
INFO:root:Epoch[33] Train-Loss=0.613029
INFO:root:Epoch[33] Time cost=13.525
INFO:root:Saved checkpoint to "models/mix-0034.params"
INFO:root:Epoch[33] Validation-Loss=0.631390
INFO:root:Epoch[34] Batch [20] Speed: 3731.37 samples/sec Loss=0.617963
INFO:root:Epoch[34] Batch [40] Speed: 3668.02 samples/sec Loss=0.625897
INFO:root:Epoch[34] Batch [60] Speed: 3445.33 samples/sec Loss=0.650314
INFO:root:Epoch[34] Batch [80] Speed: 3575.61 samples/sec Loss=0.611559
INFO:root:Epoch[34] Batch [100] Speed: 3761.16 samples/sec Loss=0.635225
INFO:root:Epoch[34] Batch [120] Speed: 3553.87 samples/sec Loss=0.621304
INFO:root:Epoch[34] Batch [140] Speed: 3560.76 samples/sec Loss=0.633505
INFO:root:Epoch[34] Batch [160] Speed: 3729.03 samples/sec Loss=0.630885
INFO:root:Epoch[34] Batch [180] Speed: 3634.79 samples/sec Loss=0.625552
INFO:root:Epoch[34] Batch [200] Speed: 3415.51 samples/sec Loss=0.666531
INFO:root:Epoch[34] Batch [220] Speed: 3200.19 samples/sec Loss=0.646385
INFO:root:Epoch[34] Batch [240] Speed: 3708.74 samples/sec Loss=0.627044
INFO:root:Epoch[34] Batch [260] Speed: 3633.40 samples/sec Loss=0.628010
INFO:root:Epoch[34] Batch [280] Speed: 3664.75 samples/sec Loss=0.599993
INFO:root:Epoch[34] Batch [300] Speed: 3764.53 samples/sec Loss=0.603265
INFO:root:Epoch[34] Batch [320] Speed: 3849.49 samples/sec Loss=0.630331
INFO:root:Epoch[34] Batch [340] Speed: 3646.71 samples/sec Loss=0.604625
INFO:root:Epoch[34] Batch [360] Speed: 3765.33 samples/sec Loss=0.608363
INFO:root:Epoch[34] Batch [380] Speed: 3775.06 samples/sec Loss=0.609993
INFO:root:Epoch[34] Train-Loss=0.617057
INFO:root:Epoch[34] Time cost=13.740
INFO:root:Saved checkpoint to "models/mix-0035.params"
INFO:root:Epoch[34] Validation-Loss=0.543230
INFO:root:Epoch[35] Batch [20] Speed: 3648.10 samples/sec Loss=0.620362
INFO:root:Epoch[35] Batch [40] Speed: 3589.43 samples/sec Loss=0.649720
INFO:root:Epoch[35] Batch [60] Speed: 3677.47 samples/sec Loss=0.622409
INFO:root:Epoch[35] Batch [80] Speed: 3499.49 samples/sec Loss=0.635384
INFO:root:Epoch[35] Batch [100] Speed: 3673.20 samples/sec Loss=0.634381
INFO:root:Epoch[35] Batch [120] Speed: 3531.14 samples/sec Loss=0.607341
INFO:root:Epoch[35] Batch [140] Speed: 3705.79 samples/sec Loss=0.647321
INFO:root:Epoch[35] Batch [160] Speed: 3623.59 samples/sec Loss=0.619249
INFO:root:Epoch[35] Batch [180] Speed: 3714.81 samples/sec Loss=0.643755
INFO:root:Epoch[35] Batch [200] Speed: 3696.18 samples/sec Loss=0.632040
INFO:root:Epoch[35] Batch [220] Speed: 3689.97 samples/sec Loss=0.604241
INFO:root:Epoch[35] Batch [240] Speed: 3512.29 samples/sec Loss=0.582286
INFO:root:Epoch[35] Batch [260] Speed: 3925.72 samples/sec Loss=0.653569
INFO:root:Epoch[35] Batch [280] Speed: 3776.59 samples/sec Loss=0.640028
INFO:root:Epoch[35] Batch [300] Speed: 3665.02 samples/sec Loss=0.629414
INFO:root:Epoch[35] Batch [320] Speed: 3542.72 samples/sec Loss=0.632983
INFO:root:Epoch[35] Batch [340] Speed: 3719.19 samples/sec Loss=0.631958
INFO:root:Epoch[35] Batch [360] Speed: 3565.17 samples/sec Loss=0.577585
INFO:root:Epoch[35] Batch [380] Speed: 3590.86 samples/sec Loss=0.606110
INFO:root:Epoch[35] Train-Loss=0.644853
INFO:root:Epoch[35] Time cost=13.721
INFO:root:Saved checkpoint to "models/mix-0036.params"
INFO:root:Epoch[35] Validation-Loss=0.605589
INFO:root:Epoch[36] Batch [20] Speed: 3681.24 samples/sec Loss=0.616620
INFO:root:Epoch[36] Batch [40] Speed: 3769.36 samples/sec Loss=0.598550
INFO:root:Epoch[36] Batch [60] Speed: 3666.42 samples/sec Loss=0.638782
INFO:root:Epoch[36] Batch [80] Speed: 3844.71 samples/sec Loss=0.634773
INFO:root:Epoch[36] Batch [100] Speed: 3863.49 samples/sec Loss=0.647214
INFO:root:Epoch[36] Batch [120] Speed: 3551.13 samples/sec Loss=0.610662
INFO:root:Epoch[36] Batch [140] Speed: 3872.93 samples/sec Loss=0.635986
INFO:root:Epoch[36] Batch [160] Speed: 3677.57 samples/sec Loss=0.621816
INFO:root:Epoch[36] Batch [180] Speed: 3363.77 samples/sec Loss=0.602946
INFO:root:Epoch[36] Batch [200] Speed: 3464.20 samples/sec Loss=0.607844
INFO:root:Epoch[36] Batch [220] Speed: 3771.98 samples/sec Loss=0.642001
INFO:root:Epoch[36] Batch [240] Speed: 3804.44 samples/sec Loss=0.621052
INFO:root:Epoch[36] Batch [260] Speed: 3571.06 samples/sec Loss=0.640810
INFO:root:Epoch[36] Batch [280] Speed: 3527.78 samples/sec Loss=0.616237
INFO:root:Epoch[36] Batch [300] Speed: 3399.35 samples/sec Loss=0.624266
INFO:root:Epoch[36] Batch [320] Speed: 3668.86 samples/sec Loss=0.630198
INFO:root:Epoch[36] Batch [340] Speed: 3801.75 samples/sec Loss=0.629346
INFO:root:Epoch[36] Batch [360] Speed: 3491.06 samples/sec Loss=0.651795
INFO:root:Epoch[36] Batch [380] Speed: 3804.30 samples/sec Loss=0.626377
INFO:root:Epoch[36] Train-Loss=0.614603
INFO:root:Epoch[36] Time cost=13.645
INFO:root:Saved checkpoint to "models/mix-0037.params"
INFO:root:Epoch[36] Validation-Loss=0.553874
INFO:root:Epoch[37] Batch [20] Speed: 3579.90 samples/sec Loss=0.624029
INFO:root:Epoch[37] Batch [40] Speed: 3670.04 samples/sec Loss=0.638932
INFO:root:Epoch[37] Batch [60] Speed: 3577.11 samples/sec Loss=0.664953
INFO:root:Epoch[37] Batch [80] Speed: 3795.10 samples/sec Loss=0.629435
INFO:root:Epoch[37] Batch [100] Speed: 3686.99 samples/sec Loss=0.631905
INFO:root:Epoch[37] Batch [120] Speed: 3708.69 samples/sec Loss=0.608359
INFO:root:Epoch[37] Batch [140] Speed: 3593.35 samples/sec Loss=0.618092
INFO:root:Epoch[37] Batch [160] Speed: 3758.70 samples/sec Loss=0.621798
INFO:root:Epoch[37] Batch [180] Speed: 3783.22 samples/sec Loss=0.644885
INFO:root:Epoch[37] Batch [200] Speed: 3903.39 samples/sec Loss=0.621410
INFO:root:Epoch[37] Batch [220] Speed: 3836.95 samples/sec Loss=0.635255
INFO:root:Epoch[37] Batch [240] Speed: 3432.33 samples/sec Loss=0.617733
INFO:root:Epoch[37] Batch [260] Speed: 3541.28 samples/sec Loss=0.613967
INFO:root:Epoch[37] Batch [280] Speed: 3666.02 samples/sec Loss=0.609691
INFO:root:Epoch[37] Batch [300] Speed: 3621.21 samples/sec Loss=0.615544
INFO:root:Epoch[37] Batch [320] Speed: 3531.43 samples/sec Loss=0.596671
INFO:root:Epoch[37] Batch [340] Speed: 3689.79 samples/sec Loss=0.585624
INFO:root:Epoch[37] Batch [360] Speed: 3659.44 samples/sec Loss=0.614442
INFO:root:Epoch[37] Batch [380] Speed: 3790.81 samples/sec Loss=0.617310
INFO:root:Epoch[37] Train-Loss=0.610130
INFO:root:Epoch[37] Time cost=13.577
INFO:root:Saved checkpoint to "models/mix-0038.params"
INFO:root:Epoch[37] Validation-Loss=0.629493
INFO:root:Epoch[38] Batch [20] Speed: 3810.34 samples/sec Loss=0.628641
INFO:root:Epoch[38] Batch [40] Speed: 3627.08 samples/sec Loss=0.615861
INFO:root:Epoch[38] Batch [60] Speed: 3802.55 samples/sec Loss=0.639903
INFO:root:Epoch[38] Batch [80] Speed: 3756.67 samples/sec Loss=0.589169
INFO:root:Epoch[38] Batch [100] Speed: 3428.70 samples/sec Loss=0.615855
INFO:root:Epoch[38] Batch [120] Speed: 3596.78 samples/sec Loss=0.624697
INFO:root:Epoch[38] Batch [140] Speed: 3330.72 samples/sec Loss=0.610806
INFO:root:Epoch[38] Batch [160] Speed: 3671.21 samples/sec Loss=0.602508
INFO:root:Epoch[38] Batch [180] Speed: 3663.19 samples/sec Loss=0.607976
INFO:root:Epoch[38] Batch [200] Speed: 3524.48 samples/sec Loss=0.616392
INFO:root:Epoch[38] Batch [220] Speed: 3585.25 samples/sec Loss=0.600533
INFO:root:Epoch[38] Batch [240] Speed: 3481.13 samples/sec Loss=0.615638
INFO:root:Epoch[38] Batch [260] Speed: 3580.15 samples/sec Loss=0.619860
INFO:root:Epoch[38] Batch [280] Speed: 3536.32 samples/sec Loss=0.635634
INFO:root:Epoch[38] Batch [300] Speed: 3800.88 samples/sec Loss=0.632592
INFO:root:Epoch[38] Batch [320] Speed: 3646.60 samples/sec Loss=0.609665
INFO:root:Epoch[38] Batch [340] Speed: 3650.88 samples/sec Loss=0.618043
INFO:root:Epoch[38] Batch [360] Speed: 3724.47 samples/sec Loss=0.618110
INFO:root:Epoch[38] Batch [380] Speed: 3668.37 samples/sec Loss=0.629343
INFO:root:Epoch[38] Train-Loss=0.575469
INFO:root:Epoch[38] Time cost=13.800
INFO:root:Saved checkpoint to "models/mix-0039.params"
INFO:root:Epoch[38] Validation-Loss=0.599305
INFO:root:Epoch[39] Batch [20] Speed: 3619.22 samples/sec Loss=0.614072
INFO:root:Epoch[39] Batch [40] Speed: 3590.49 samples/sec Loss=0.658848
INFO:root:Epoch[39] Batch [60] Speed: 3869.74 samples/sec Loss=0.600813
INFO:root:Epoch[39] Batch [80] Speed: 3632.00 samples/sec Loss=0.588729
INFO:root:Epoch[39] Batch [100] Speed: 3695.13 samples/sec Loss=0.615350
INFO:root:Epoch[39] Batch [120] Speed: 3864.10 samples/sec Loss=0.582576
INFO:root:Epoch[39] Batch [140] Speed: 3682.38 samples/sec Loss=0.628465
INFO:root:Epoch[39] Batch [160] Speed: 3767.82 samples/sec Loss=0.656351
INFO:root:Epoch[39] Batch [180] Speed: 3682.80 samples/sec Loss=0.605801
INFO:root:Epoch[39] Batch [200] Speed: 3630.00 samples/sec Loss=0.636667
INFO:root:Epoch[39] Batch [220] Speed: 3585.84 samples/sec Loss=0.604659
INFO:root:Epoch[39] Batch [240] Speed: 3404.33 samples/sec Loss=0.630919
INFO:root:Epoch[39] Batch [260] Speed: 3470.14 samples/sec Loss=0.604603
INFO:root:Epoch[39] Batch [280] Speed: 3624.21 samples/sec Loss=0.602092
INFO:root:Epoch[39] Batch [300] Speed: 3734.34 samples/sec Loss=0.620085
INFO:root:Epoch[39] Batch [320] Speed: 3712.65 samples/sec Loss=0.629461
INFO:root:Epoch[39] Batch [340] Speed: 3609.11 samples/sec Loss=0.576444
INFO:root:Epoch[39] Batch [360] Speed: 3603.55 samples/sec Loss=0.602214
INFO:root:Epoch[39] Batch [380] Speed: 3554.57 samples/sec Loss=0.629962
INFO:root:Epoch[39] Train-Loss=0.650105
INFO:root:Epoch[39] Time cost=13.707
INFO:root:Saved checkpoint to "models/mix-0040.params"
INFO:root:Epoch[39] Validation-Loss=0.623355
INFO:root:Epoch[40] Batch [20] Speed: 3519.97 samples/sec Loss=0.606179
INFO:root:Epoch[40] Batch [40] Speed: 3542.76 samples/sec Loss=0.626663
INFO:root:Epoch[40] Batch [60] Speed: 3339.11 samples/sec Loss=0.632914
INFO:root:Epoch[40] Batch [80] Speed: 3466.16 samples/sec Loss=0.631073
INFO:root:Epoch[40] Batch [100] Speed: 3698.03 samples/sec Loss=0.622487
INFO:root:Epoch[40] Batch [120] Speed: 3773.01 samples/sec Loss=0.607072
INFO:root:Epoch[40] Batch [140] Speed: 3850.18 samples/sec Loss=0.606900
INFO:root:Epoch[40] Batch [160] Speed: 3484.04 samples/sec Loss=0.604208
INFO:root:Epoch[40] Batch [180] Speed: 3630.98 samples/sec Loss=0.624021
INFO:root:Epoch[40] Batch [200] Speed: 3635.10 samples/sec Loss=0.602561
INFO:root:Epoch[40] Batch [220] Speed: 3601.22 samples/sec Loss=0.620399
INFO:root:Epoch[40] Batch [240] Speed: 3786.54 samples/sec Loss=0.625347
INFO:root:Epoch[40] Batch [260] Speed: 3804.32 samples/sec Loss=0.612511
INFO:root:Epoch[40] Batch [280] Speed: 3547.98 samples/sec Loss=0.601645
INFO:root:Epoch[40] Batch [300] Speed: 3679.93 samples/sec Loss=0.622798
INFO:root:Epoch[40] Batch [320] Speed: 3518.99 samples/sec Loss=0.633115
INFO:root:Epoch[40] Batch [340] Speed: 3570.15 samples/sec Loss=0.605688
INFO:root:Epoch[40] Batch [360] Speed: 3788.63 samples/sec Loss=0.614590
INFO:root:Epoch[40] Batch [380] Speed: 3733.17 samples/sec Loss=0.624161
INFO:root:Epoch[40] Train-Loss=0.629093
INFO:root:Epoch[40] Time cost=13.850
INFO:root:Saved checkpoint to "models/mix-0041.params"
INFO:root:Epoch[40] Validation-Loss=0.562761
INFO:root:Epoch[41] Batch [20] Speed: 3703.74 samples/sec Loss=0.621762
INFO:root:Epoch[41] Batch [40] Speed: 3745.37 samples/sec Loss=0.632858
INFO:root:Epoch[41] Batch [60] Speed: 3799.68 samples/sec Loss=0.610056
INFO:root:Epoch[41] Batch [80] Speed: 3827.44 samples/sec Loss=0.615784
INFO:root:Epoch[41] Batch [100] Speed: 3668.94 samples/sec Loss=0.616343
INFO:root:Epoch[41] Batch [120] Speed: 3409.10 samples/sec Loss=0.592828
INFO:root:Epoch[41] Batch [140] Speed: 3603.66 samples/sec Loss=0.619982
INFO:root:Epoch[41] Batch [160] Speed: 3597.01 samples/sec Loss=0.625751
INFO:root:Epoch[41] Batch [180] Speed: 3551.10 samples/sec Loss=0.588424
INFO:root:Epoch[41] Batch [200] Speed: 3852.43 samples/sec Loss=0.619360
INFO:root:Epoch[41] Batch [220] Speed: 3763.75 samples/sec Loss=0.590968
INFO:root:Epoch[41] Batch [240] Speed: 3663.76 samples/sec Loss=0.592819
INFO:root:Epoch[41] Batch [260] Speed: 3693.00 samples/sec Loss=0.636206
INFO:root:Epoch[41] Batch [280] Speed: 3713.03 samples/sec Loss=0.627927
INFO:root:Epoch[41] Batch [300] Speed: 3552.08 samples/sec Loss=0.606657
INFO:root:Epoch[41] Batch [320] Speed: 3643.01 samples/sec Loss=0.605326
INFO:root:Epoch[41] Batch [340] Speed: 3530.85 samples/sec Loss=0.608690
INFO:root:Epoch[41] Batch [360] Speed: 3709.56 samples/sec Loss=0.606310
INFO:root:Epoch[41] Batch [380] Speed: 3800.64 samples/sec Loss=0.609468
INFO:root:Epoch[41] Train-Loss=0.590857
INFO:root:Epoch[41] Time cost=13.605
INFO:root:Saved checkpoint to "models/mix-0042.params"
INFO:root:Epoch[41] Validation-Loss=0.602958
INFO:root:Epoch[42] Batch [20] Speed: 3647.81 samples/sec Loss=0.599917
INFO:root:Epoch[42] Batch [40] Speed: 3666.03 samples/sec Loss=0.605443
INFO:root:Epoch[42] Batch [60] Speed: 3726.55 samples/sec Loss=0.596275
INFO:root:Epoch[42] Batch [80] Speed: 3739.16 samples/sec Loss=0.605156
INFO:root:Epoch[42] Batch [100] Speed: 3729.11 samples/sec Loss=0.614010
INFO:root:Epoch[42] Batch [120] Speed: 3673.63 samples/sec Loss=0.603715
INFO:root:Epoch[42] Batch [140] Speed: 3690.19 samples/sec Loss=0.611314
INFO:root:Epoch[42] Batch [160] Speed: 3706.34 samples/sec Loss=0.607734
INFO:root:Epoch[42] Batch [180] Speed: 3589.31 samples/sec Loss=0.614211
INFO:root:Epoch[42] Batch [200] Speed: 3603.84 samples/sec Loss=0.681233
INFO:root:Epoch[42] Batch [220] Speed: 3674.73 samples/sec Loss=0.626490
INFO:root:Epoch[42] Batch [240] Speed: 3409.50 samples/sec Loss=0.617589
INFO:root:Epoch[42] Batch [260] Speed: 3505.47 samples/sec Loss=0.601415
INFO:root:Epoch[42] Batch [280] Speed: 3511.52 samples/sec Loss=0.616769
INFO:root:Epoch[42] Batch [300] Speed: 3491.38 samples/sec Loss=0.609785
INFO:root:Epoch[42] Batch [320] Speed: 3488.80 samples/sec Loss=0.594979
INFO:root:Epoch[42] Batch [340] Speed: 3758.83 samples/sec Loss=0.595842
INFO:root:Epoch[42] Batch [360] Speed: 3595.52 samples/sec Loss=0.590721
INFO:root:Epoch[42] Batch [380] Speed: 3814.83 samples/sec Loss=0.594909
INFO:root:Epoch[42] Train-Loss=0.629987
INFO:root:Epoch[42] Time cost=13.750
INFO:root:Saved checkpoint to "models/mix-0043.params"
INFO:root:Epoch[42] Validation-Loss=0.618437
INFO:root:Epoch[43] Batch [20] Speed: 3592.63 samples/sec Loss=0.592418
INFO:root:Epoch[43] Batch [40] Speed: 3537.50 samples/sec Loss=0.632597
INFO:root:Epoch[43] Batch [60] Speed: 3519.21 samples/sec Loss=0.635551