当前位置：首页 > news >正文

人工智能（Educoder）-- 搜索技术 -- 启发式搜索

news 2026/2/9 21:41:48

任务描述

本关任务：八数码问题是在一个3×3的棋盘上有1−8位数字随机分布，以及一个空格，与空格相连的棋子可以滑动到空格中，问题的解是通过空格滑动，使得棋盘转化为目标状态，如下图所示。

为了简化问题的输入，首先将空格用数字0表示，然后将3×3的棋盘用9位长的字符串表示，则上图的初始状态为724506831，目标状态为012345678，本关卡所有目标状态均为012345678，也保证初始状态到目标状态有解。

对于上图的初始状态，将数字2移动到空格，称之为u操作（空格上移），将数字3移动到空格，称之为d操作（空格下移），将数字5移动到空格，称之为l操作（空格左移），将数字6移动到空格，称之为r操作（空格右移），则一个合法移动路径为lurdrdllurrdllurrulldrrull。

编程要求

本关的编程任务是补全右侧代码片段 salvePuzzle 、 calcDistH 和 moveMap 中 Begin 至 End 中间的代码，具体要求如下：

在 salvePuzzle 中，根据输入参数init（初始状态，如724506831）和targ（目标状态，均为012345678），实现 A* 搜索算法，返回八数码问题的移动路径，如上图的移动路径：lurdrdllurrdllurrulldrrull。
在 calcDistH 中，计算当前状态（参数srcmap，如724506831）到目标状态（参数destmap，如012345678）的启发式函数值h(n)，并返回h(n)。
在 moveMap 中，实现行动转换，并返回下一个状态，例如当前状态为参数curmap=724506831，当前 8 数码状态curmap中空格 0 的位置索引i=4，移动空格到位置j=3，则返回的新状态为newmap=724056831。

测试说明

平台将自动编译补全后的代码，并生成若干组测试数据，接着测试程序会调用上述函数，并判断函数返回的路径是否为合法解，若是则输出 Accepted 表示程序正确，否则程序错误。

以下是平台的测试样例：

测试输入： 724506831

预期输出： Accepted

代码

# -*- coding:utf-8 -*-class Solution:def salvePuzzle(self, init, targ):''' 求解8数码问题参数：init - 初始状态 例如'123046758'targ - 目标状态 均为'012345678'返回值：clf - 由udlr组成的移动路径字符串'''#请在这里补充代码，完成本关任务#********** Begin **********#clf = ''  # 初始化移动路径字符串state_open = []  # 初始化开放列表state_close = []  # 初始化关闭列表state_open.append([init,99,'test',init,0])  # 将初始状态加入开放列表fn = 2  # 初始化启发式函数的权重flag = 1  # 初始化标志位while True:cur_state = state_open.pop(0)  # 取出开放列表中的第一个状态state_close.append(cur_state)  # 将当前状态加入关闭列表if cur_state[0] == targ:  # 如果当前状态等于目标状态while 1:clf += cur_state[2]  # 将当前状态的移动方向加入移动路径字符串if cur_state[3] == init:  # 如果当前状态的父状态等于初始状态breakfor id,item in enumerate(state_close[1:]):  # 遍历关闭列表中的状态if item[0] == cur_state[3]:  # 如果找到父状态cur_state = item  # 更新当前状态为父状态return  clf[::-1]  # 返回逆序的移动路径字符串i = cur_state[0].find('0')  # 找到空格0的位置索引flag = 1  # 重置标志位if str(i) not in '036':  # 如果空格0不在第一行、第三行和第六行tmp_map = self.moveMap(cur_state[0],i,i-1)  # 尝试将空格0向左移动if tmp_map not in [tmp[0] for tmp in state_close]:  # 如果新状态不在关闭列表中for id,item in enumerate(state_open):  # 遍历开放列表中的状态if item[0] == tmp_map:  # 如果找到新状态if item[1] + item[4] > self.calcDistH(tmp_map,targ) + cur_state[4] + fn:  # 如果新状态的代价大于当前状态的代价state_open[id] = [tmp_map,self.calcDistH(tmp_map,targ),'l',cur_state[0],cur_state[4]+fn]  # 更新开放列表中的状态flag = 0  # 设置标志位为0breakbreakif flag == 1:  # 如果标志位为1state_open.append([tmp_map,self.calcDistH(tmp_map,targ),'l',cur_state[0],cur_state[4]+fn])  # 将新状态加入开放列表flag = 1  # 重置标志位if str(i) not in '258':  # 如果空格0不在第二行、第五行和第八行tmp_map = self.moveMap(cur_state[0],i,i+1)  # 尝试将空格0向右移动if tmp_map not in [tmp[0] for tmp in state_close]:  # 如果新状态不在关闭列表中for id,item in enumerate(state_open):  # 遍历开放列表中的状态if item[0] == tmp_map:  # 如果找到新状态if item[1] + item[4] > self.calcDistH(tmp_map,targ) + cur_state[4] + fn:  # 如果新状态的代价大于当前状态的代价state_open[id] = [tmp_map,self.calcDistH(tmp_map,targ),'r',cur_state[0],cur_state[4]+fn]  # 更新开放列表中的状态flag = 0  # 设置标志位为0breakbreakif flag ==1:  # 如果标志位为1state_open.append([tmp_map,self.calcDistH(tmp_map,targ),'r',cur_state[0],cur_state[4]+fn])  # 将新状态加入开放列表flag = 1  # 重置标志位if i-3>=0:  # 如果空格0不在最左边的三列tmp_map = self.moveMap(cur_state[0],i,i-3)  # 尝试将空格0向上移动if tmp_map not in [tmp[0] for tmp in state_close]:  # 如果新状态不在关闭列表中for id,item in enumerate(state_open):  # 遍历开放列表中的状态if item[0] == tmp_map:  # 如果找到新状态if item[1] + item[4] > self.calcDistH(tmp_map,targ) + cur_state[4] + fn:  # 如果新状态的代价大于当前状态的代价state_open[id] = [tmp_map,self.calcDistH(tmp_map,targ),'u',cur_state[0],cur_state[4]+fn]  # 更新开放列表中的状态flag = 0  # 设置标志位为0breakbreakif flag ==1:  # 如果标志位为1state_open.append([tmp_map,self.calcDistH(tmp_map,targ),'u',cur_state[0],cur_state[4]+fn])  # 将新状态加入开放列表flag = 1  # 重置标志位if i+3<=8:  # 如果空格0不在最右边的三列tmp_map = self.moveMap(cur_state[0],i,i+3)  # 尝试将空格0向下移动if tmp_map not in [tmp[0] for tmp in state_close]:  # 如果新状态不在关闭列表中for id,item in enumerate(state_open):  # 遍历开放列表中的状态if item[0] == tmp_map:  # 如果找到新状态if item[1] + item[4] > self.calcDistH(tmp_map,targ) + cur_state[4] + fn:  # 如果新状态的代价大于当前状态的代价state_open[id] = [tmp_map,self.calcDistH(tmp_map,targ),'d',cur_state[0],cur_state[4]+fn]  # 更新开放列表中的状态flag = 0  # 设置标志位为0breakbreakif flag ==1:  # 如果标志位为1state_open.append([tmp_map,self.calcDistH(tmp_map,targ),'d',cur_state[0],cur_state[4]+fn])  # 将新状态加入开放列表state_open.sort(key=lambda x : x[1] + x[4])  # 根据代价对开放列表进行排序#********** End **********#def calcDistH(self, src_map, dest_map):'''启发式函数h(n)参数：src_map  - 当前8数码状态dest_map - 目标8数码状态返回值：clf - 当前状态到目标状态的启发式函数值'''#请在这里补充代码，完成本关任务#********** Begin **********#if src_map is None or dest_map is None:return 0 clf = 0for i in range(9):clf += abs(int(src_map[i])-int(dest_map[i]))return clf#********** End **********#def moveMap(self, cur_map, i, j):'''状态转换（交换位置i和j）参数：cur_map - 当前8数码状态i - 当前8数码状态中空格0的位置索引j - 将空格0的位置i移动到位置j，位置j移动到位置i返回值：clf - 新的8数码状态'''#请在这里补充代码，完成本关任务#********** Begin **********#if i>j:i,j=j,itmp_i = cur_map[i]tmp_j = cur_map[j]tmp_map = cur_map[:i]+tmp_j+cur_map[i+1:j]+tmp_i+cur_map[j+1:]return tmp_map#********** End **********#

任务描述

相关知识

评估函数

贪婪最佳优先搜索

A*搜索：缩小总评估代价

求解思路

编程要求

测试说明

代码

相关文章：