从列表中抓取后无法操作字符串

我希望删除用于解析的规则中的最后一条语句。语句用@字符封装，规则本身用模式标签封装。我想要做的只是删除最后一条规则语句。我目前实现这一目标的想法是这样的：打开规则文件，将每一行作为一个元素保存到列表中。选择包含正确规则 ID 的行，然后将规则模式另存为新字符串。反转保存的规则模式。删除最后一条规则语句。重新反转规则模式。添加尾随模式标记。所以输入看起来像：<pattern>@this is a statement@ @this is also a statement@</pattern>输出将如下所示：<pattern>@this is a statement@ </pattern>我目前的尝试是这样的：with open(rules) as f: lines = f.readlines()string = ""for line in lines: if ruleid in line: position = lines.index(line) string = lines[position + 2] # the rule pattern will be two lines down # from where the rule-id is located, hence # the position + 2def reversed_string(a_string): #reverses the string return a_string[::-1] def remove_at(x): #removes everything until the @ character return re.sub('^.*?@','',x) print(reversed_string(remove_at(remove_at(reversed_string(string)))))这将反转字符串，但不会删除最后一条规则语句，一旦它被反转。仅运行该reversed_string()函数将成功反转字符串，但尝试通过该remove_at()函数运行相同的字符串将根本不起作用。但是，如果您手动创建输入字符串（到相同的规则模式），并放弃打开和抓取规则模式，它将成功删除尾随的规则语句。成功的代码如下所示：string = '<pattern>@this is a statement@ @this is also a statement@</pattern>'def reversed_string(a_string): #reverses the string return a_string[::-1] def remove_at(x): #removes everything until the @ character return re.sub('^.*?@','',x) print(reversed_string(remove_at(remove_at(reversed_string(string)))))另外，删除完成后如何添加模式标签？

查看完整描述

1 回答

HUH函数

TA贡献1836条经验获得超4个赞

您正在阅读的行可能\n在末尾有一个，这就是您的替代品不起作用的原因。这个问题可以指导您阅读没有换行的文件。

在这些选项中，一个可能是像这样删除\nusing rstrip() ：

string = lines[position + 2].rstrip("\n")

现在，关于替换，我认为你可以使用这个正则表达式来简化它：

@[^@]+@(?!.*@)

它由以下部分组成：

@[^@]+@匹配一个@后跟一个或多个不是an的字符@，然后是另一个@。
(?!.*@)是一个否定的先行检查，以检查@前面没有找到任何其他字符的零次或多次出现。

此表达式应与最后一条语句匹配，您不需要反转字符串：

re.sub("@[^@]+@(?!.*@)", "", string)

反对回复 2023-05-16

热搜

最近搜索清空

从列表中抓取后无法操作字符串

从列表中抓取后无法操作字符串

1 回答

添加回答