我自己train了一下，为什么出来的结果不像你们训练出的那样智能？ #42

profection · 2022-04-03T16:41:21Z

我训练的是wp，代码是你们的源码，训练后相比训练前的ckpt很奇怪
第一个表现，不管是地主还是农民，在出牌预测时，胜率都变为不高于50%
第二个表现，总是出大牌，压制对方，即使自己是农民，也会压制队友，导致最后无牌可出
第三个表现，有炸会拆着走，比如自己手牌剩下最大的2炸和一个对三，会直接走四带二
这是我训练了一小时之后尝试的效果，loss是0.9
所以请问你们训练多久？loss到多少算成功？是需要什么trick吗？

daochenzha · 2022-04-03T21:03:28Z

@profection 我们按每秒6000 frame的速度训练了大约两个月

profection · 2022-04-04T00:57:52Z

@profection 我们按每秒6000 frame的速度训练了大约两个月

哦，那时间有点长的啊，两月后最好的loss到什么程度了？
代码我稍微修改了一些，原有的代码逻辑是没有main.tar时自己生成ckpt，但我直接改成加载你们的ckpt，但使用你们提供的ckpt后，为什么评估loss是0.9，而且train了几分钟后得到的ckpt出牌预测时胜率变为不高于50%？很奇怪啊

profection · 2022-04-04T01:14:26Z

补充一下，还有一个很奇怪的现象，如果出牌中有QQQ9997766635，对家出了8884，这边会直接出999Q，导致对家出TTT4后自己无牌可出，这是什么bug？能修复吗？

profection · 2022-04-04T01:19:03Z

QQQ9997766635
这是生成的可出牌序列
[[3, 12, 12, 12], [5, 12, 12, 12], [6, 12, 12, 12], [7, 12, 12, 12], [9, 12, 12, 12], [3, 9, 9, 9], [5, 9, 9, 9], [6, 9, 9, 9], [7, 9, 9, 9], [9, 9, 9, 12]]

daochenzha · 2022-04-04T06:47:58Z

@profection loss并不是越低越好得看胜率。这种情况就是没学好，神经网络不能保证百分之百对

profection · 2022-04-04T08:01:09Z

@profection loss并不是越低越好得看胜率。这种情况就是没学好，神经网络不能保证百分之百对

哦，那你们怎么判断什么时候算已经训练完了？或者什么时候该结束训练？

daochenzha · 2022-04-04T14:51:00Z

@profection 这个只能靠和baseline的胜率判断

profection · 2022-04-04T15:05:54Z

@profection 这个只能靠和baseline的胜率判断

和我想一块去了，刚改了代码，三个角色有两个角色用baseline当老师，另一个当学生，我跑跑试试

profection · 2022-04-04T15:08:38Z

另外有个关于神经网络结构的问题
为什么不用resnet，不用prelu，为什么lstm只用一层，为什么没有用dropout？只用6层linear是不是少了点？现在gpt都24层了

daochenzha · 2022-04-04T15:46:10Z

@profection 复杂的网络比如resnet效果会更好。只是我们没有怎么调网络结构。

profection · 2022-04-05T01:06:43Z

哦，我还以为你们都试过，现在的网络结构是排除出来的，因为我试了一下改网络结构，训练出来效果不是很好

profection · 2022-04-07T00:08:48Z

我又回来了= =，大神还有个问题，我训练这么久，loss一直在0.6徘徊（训练的是wp），为什么啊？这个loss不收敛吗？
PS：不管是现有的模型还是已经改过的模型，训练都不收敛

daochenzha · 2022-04-09T03:12:12Z

@profection 强化学习是这样的，loss不会掉，得根据得分判断学习进程

profection · 2022-04-16T05:18:04Z

@profection 强化学习是这样的，loss不会掉，得根据得分判断学习进程

明白了，谢谢，我再多训练几天看看

cxk555 · 2023-12-14T06:59:54Z

请问是如何改的用baseline当老师啊，能否告知一下

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

我自己train了一下，为什么出来的结果不像你们训练出的那样智能？ #42

我自己train了一下，为什么出来的结果不像你们训练出的那样智能？ #42

profection commented Apr 3, 2022

daochenzha commented Apr 3, 2022

profection commented Apr 4, 2022

profection commented Apr 4, 2022

profection commented Apr 4, 2022

daochenzha commented Apr 4, 2022

profection commented Apr 4, 2022

daochenzha commented Apr 4, 2022

profection commented Apr 4, 2022

profection commented Apr 4, 2022

daochenzha commented Apr 4, 2022

profection commented Apr 5, 2022

profection commented Apr 7, 2022 •

edited

Loading

daochenzha commented Apr 9, 2022

profection commented Apr 16, 2022

cxk555 commented Dec 14, 2023

我自己train了一下，为什么出来的结果不像你们训练出的那样智能？ #42

我自己train了一下，为什么出来的结果不像你们训练出的那样智能？ #42

Comments

profection commented Apr 3, 2022

daochenzha commented Apr 3, 2022

profection commented Apr 4, 2022

profection commented Apr 4, 2022

profection commented Apr 4, 2022

daochenzha commented Apr 4, 2022

profection commented Apr 4, 2022

daochenzha commented Apr 4, 2022

profection commented Apr 4, 2022

profection commented Apr 4, 2022

daochenzha commented Apr 4, 2022

profection commented Apr 5, 2022

profection commented Apr 7, 2022 • edited Loading

daochenzha commented Apr 9, 2022

profection commented Apr 16, 2022

cxk555 commented Dec 14, 2023

profection commented Apr 7, 2022 •

edited

Loading