Commit Graph

52 Commits

Author SHA1 Message Date
weixin_46229132
a375832b6c Add Q-learning TSP 2025-03-28 10:53:41 +08:00
weixin_46229132
1485fb2bd6 Update q_table 2025-03-27 21:48:07 +08:00
weixin_46229132
6f8fcd15b7 Add Q-learning 2025-03-27 20:50:46 +08:00
weixin_46229132
6f44d142bc Fix simulated annealing bug 2025-03-24 19:28:24 +08:00
weixin_46229132
fe37f7ac0f Modify hyperparameter settings 2025-03-24 17:09:51 +08:00
weixin_46229132
61be8ad37c Modify Monte Carlo output 2025-03-24 16:11:38 +08:00
weixin_46229132
9599215e2e Fine-tune the partition with simulated annealing 2025-03-24 15:42:42 +08:00
weixin_46229132
d9d1214f7c Minor changes 2025-03-22 21:44:06 +08:00
weixin_46229132
8e8d9a25df Fix GA bug 2025-03-22 21:43:11 +08:00
weixin_46229132
17acfa5409 Fix GA bug 2025-03-22 17:24:45 +08:00
weixin_46229132
c9db9244b3 Add traversal-genetic algorithm solver 2025-03-22 17:16:58 +08:00
weixin_46229132
a9ee5ceec7 Add delay_time to the environment 2025-03-22 09:47:52 +08:00
weixin_46229132
5b468deb9d SAC 2025-03-21 16:04:42 +08:00
weixin_46229132
67c7a9d6c7 DQN: let it see the answer first 2025-03-20 14:05:15 +08:00
weixin_46229132
f4fb963c06 Modify env parameters 2025-03-20 09:29:30 +08:00
weixin_46229132
c5023fb360 Add mask for value evaluation 2025-03-19 21:52:33 +08:00
weixin_46229132
3dba6e4a53 Modify discrete environment; penalize staying still on consecutive steps 2025-03-19 20:58:34 +08:00
weixin_46229132
4972306ca7 Update env_dis 2025-03-19 20:40:35 +08:00
weixin_46229132
ff23b5e745 Adjust reward 2025-03-19 16:31:23 +08:00
weixin_46229132
d364a1e4df Fix PPO bug 2025-03-19 15:23:55 +08:00
weixin_46229132
6dc285d3f8 Add PPO code 2025-03-19 15:12:52 +08:00
weixin_46229132
7ca5ce08b1 Modify environment 2025-03-19 14:22:24 +08:00
weixin_46229132
e35dd10326 Add output in the validation phase; update reward 2025-03-19 11:29:02 +08:00
weixin_46229132
c96c36d4cd Adjust eval output 2025-03-19 10:58:43 +08:00
weixin_46229132
2362de4c54 Modify DQN 2025-03-19 01:04:03 +08:00
weixin_46229132
f19e8fbdbf Add DQN algorithm 2025-03-18 21:16:48 +08:00
weixin_46229132
343008bc9f Simplify maze initialization 2025-03-18 17:27:49 +08:00
weixin_46229132
55e45fe14e Minor changes to ddpg main 2025-03-18 14:45:50 +08:00
weixin_46229132
b3812a3193 format ddpg_main 2025-03-18 14:30:41 +08:00
weixin_46229132
19f8b6246a test 2025-03-18 14:29:16 +08:00
weixin_46229132
75e5237272 Modify DDPG 2025-03-14 16:06:59 +08:00
weixin_46229132
ab51727253 Add DDPG code 2025-03-14 15:27:05 +08:00
weixin_46229132
4fdb8aa152 Minor adjustments to env code 2025-03-14 11:17:12 +08:00
weixin_46229132
dfec68e122 Modify Monte Carlo sampling method 2025-03-14 11:01:02 +08:00
weixin_46229132
b3b5e597b8 Add requirements.txt 2025-03-14 10:10:09 +08:00
weixin_46229132
c1eb9d9528 Just train the network on CPU 2025-03-14 09:45:46 +08:00
weixin_46229132
64935bf92f Add manual control; fix environment bug 2025-03-14 09:42:56 +08:00
weixin_46229132
db890f83cf Change the network's activation function 2025-03-14 09:22:40 +08:00
weixin_46229132
3086413171 Modify car_pos 2025-03-13 21:28:30 +08:00
weixin_46229132
ee914ff930 Adjust reward 2025-03-13 15:55:14 +08:00
weixin_46229132
aecd86b245 Modify env data structure 2025-03-13 15:09:58 +08:00
weixin_46229132
1f18d9d96f Modify algorithm output; split the visualization module out on its own 2025-03-13 11:18:58 +08:00
weixin_46229132
b1851ac489 Fix bug 2025-03-13 10:46:28 +08:00
weixin_46229132
d53eda2570 Fix PPO bug 2025-03-12 16:09:19 +08:00
weixin_46229132
fe4e754cc4 Add greedy solver code 2025-03-12 11:33:35 +08:00
weixin_46229132
3818343085 PPO now runs 2025-03-11 19:43:04 +08:00
weixin_46229132
4474a33cba Add yaml file 2025-03-11 16:40:20 +08:00
weixin_46229132
1058f37be6 Add PPO code 2025-03-11 16:01:07 +08:00
weixin_46229132
e7a4395340 Save current state 2025-03-11 15:46:11 +08:00
weixin_46229132
01c6a71b4f Solve the multiple traveling salesman problem with a genetic algorithm 2025-03-09 16:53:01 +08:00