Commit Graph

74 Commits

Author SHA1 Message Date
weixin_46229132
6a82010112 改成50_50_3场景 2025-04-12 22:55:01 +08:00
weixin_46229132
d64ec83042 可视化reward 2025-04-08 15:49:22 +08:00
weixin_46229132
90ad3e829d dqn 100_100_6 2025-04-05 11:06:08 +08:00
weixin_46229132
c6c7cb47f1 跑完100_100_6的实验 2025-04-05 10:36:03 +08:00
weixin_46229132
87ee65087f 修改100_100_6的dqn场景 2025-04-04 10:59:31 +08:00
weixin_46229132
23aafc2998 修改划分列举方法 2025-04-03 17:24:54 +08:00
weixin_46229132
adaf8cc50e dqn跑通一个场景 2025-04-03 14:20:27 +08:00
weixin_46229132
0be9fa596a 修改dqn参数 2025-04-02 21:33:40 +08:00
weixin_46229132
981681c1bd 修改dqn bug 2025-04-01 20:45:13 +08:00
weixin_46229132
db04a87ffd 修改dqn奖励 2025-04-01 17:46:23 +08:00
weixin_46229132
58952f1fdb 添加greedy算法 2025-04-01 10:24:52 +08:00
weixin_46229132
27829c5d48 修改场景 2025-03-31 14:23:29 +08:00
weixin_46229132
dab8f4fd8f 调整奖励函数 2025-03-31 11:12:01 +08:00
weixin_46229132
84f69f4293 离散情况 2025-03-29 21:28:39 +08:00
weixin_46229132
3e6887c655 每一个加一个奖励 2025-03-29 16:53:03 +08:00
weixin_46229132
f347ca8276 微调分区 2025-03-29 16:28:30 +08:00
along
f05f8400fb ddpg求解env_part 2025-03-29 12:00:26 +08:00
along
0cf336c96d ppo内层加入ga 2025-03-29 11:43:04 +08:00
weixin_46229132
ff2b914eb5 修复env_partion bug 2025-03-29 10:48:47 +08:00
weixin_46229132
2c88915112 跑通PPO partition 2025-03-28 21:37:31 +08:00
weixin_46229132
8d79e8cc66 mTSP代码 2025-03-28 19:57:44 +08:00
weixin_46229132
656e822528 format 2025-03-28 15:13:23 +08:00
weixin_46229132
a375832b6c 添加q-learning TSP 2025-03-28 10:53:41 +08:00
weixin_46229132
1485fb2bd6 更新q_table 2025-03-27 21:48:07 +08:00
weixin_46229132
6f8fcd15b7 加入q learning 2025-03-27 20:50:46 +08:00
weixin_46229132
6f44d142bc 修改模拟退火bug 2025-03-24 19:28:24 +08:00
weixin_46229132
fe37f7ac0f 修改超参数设置 2025-03-24 17:09:51 +08:00
weixin_46229132
61be8ad37c 修改蒙特卡洛的输出 2025-03-24 16:11:38 +08:00
weixin_46229132
9599215e2e 模拟退火微调分割 2025-03-24 15:42:42 +08:00
weixin_46229132
d9d1214f7c 小改 2025-03-22 21:44:06 +08:00
weixin_46229132
8e8d9a25df 修改GA bug 2025-03-22 21:43:11 +08:00
weixin_46229132
17acfa5409 修改GA bug 2025-03-22 17:24:45 +08:00
weixin_46229132
c9db9244b3 添加遍历-遗传算法求解 2025-03-22 17:16:58 +08:00
weixin_46229132
a9ee5ceec7 环境增加delay_time 2025-03-22 09:47:52 +08:00
weixin_46229132
5b468deb9d SAC 2025-03-21 16:04:42 +08:00
weixin_46229132
67c7a9d6c7 DQN让它先看答案 2025-03-20 14:05:15 +08:00
weixin_46229132
f4fb963c06 修改env参数 2025-03-20 09:29:30 +08:00
weixin_46229132
c5023fb360 添加价值评估的mask 2025-03-19 21:52:33 +08:00
weixin_46229132
3dba6e4a53 修改离散环境,连续不动给惩罚 2025-03-19 20:58:34 +08:00
weixin_46229132
4972306ca7 更新env_dis 2025-03-19 20:40:35 +08:00
weixin_46229132
ff23b5e745 调整奖励 2025-03-19 16:31:23 +08:00
weixin_46229132
d364a1e4df 修ppo bug 2025-03-19 15:23:55 +08:00
weixin_46229132
6dc285d3f8 加入PPO代码 2025-03-19 15:12:52 +08:00
weixin_46229132
7ca5ce08b1 修改环境 2025-03-19 14:22:24 +08:00
weixin_46229132
e35dd10326 验证阶段加输出,更新奖励 2025-03-19 11:29:02 +08:00
weixin_46229132
c96c36d4cd 调整eval的输出 2025-03-19 10:58:43 +08:00
weixin_46229132
2362de4c54 修改dqn 2025-03-19 01:04:03 +08:00
weixin_46229132
f19e8fbdbf 加入dqn算法 2025-03-18 21:16:48 +08:00
weixin_46229132
343008bc9f 简化初始化迷宫的方式 2025-03-18 17:27:49 +08:00
weixin_46229132
55e45fe14e 小改ddpg main 2025-03-18 14:45:50 +08:00