正则表达式在 c++ 中不匹配?

Regular Expressions are not match in c++?

本文关键字:不匹配 c++ 正则表达式      更新时间:2023-10-16

我在正则表达式中遇到了问题。我有一个字符串,字符串是用正则表达式验证的,它正在使用 python 脚本,但不能使用 c++。

工作蟒蛇代码:

import re
txt = "x01msvc-serverx1Cmsvc-xyzy4x02<?xml version="1.0" encoding="UTF-8"?><SVCMessage currency="INR" hostName="msvc-xyz4" language="US-en" retransmit="N" sequence="00" timeout="90" version="8"><Amount>0.01</Amount><BusinessDate>20190506</BusinessDate><CheckNumber>0</CheckNumber><LocalDate>20170506</LocalDate><LocalTime>160722</LocalTime><RequestCode>POINT_REDEMPTION</RequestCode><RevenueCenter>0</RevenueCenter><TerminalID>21</TerminalID><TraceID>190506860722N000000</TraceID><Track2>1161111112</Track2><TransactionEmployee>0</TransactionEmployee></SVCMessage>x03x04"
matcher = re.compile(r".*x01([A-Za-z0-9_-]*)x1C([A-Za-z0-9_-]*)x02([^x00-x1Fx7F]*)x03x04.*")
results = matcher.match(txt)
if results == None:
print ('Invalid query , closed')
else:
print ('sucess')

我的 c++ 代码:

#include <iostream>
#include <regex>
using namespace std;
int main()
{
string a = "x01msvc-serverx1Cmsvc-xyzy4x02<?xml version="1.0" encoding="UTF-8"?><SVCMessage currency="INR" hostName="msvc-xyz4" language="US-en" retransmit="N" sequence="00" timeout="90" version="8"><Amount>0.01</Amount><BusinessDate>20190506</BusinessDate><CheckNumber>0</CheckNumber><LocalDate>20170506</LocalDate><LocalTime>160722</LocalTime><RequestCode>POINT_REDEMPTION</RequestCode><RevenueCenter>0</RevenueCenter><TerminalID>21</TerminalID><TraceID>190506860722N000000</TraceID><Track2>1161111112</Track2><TransactionEmployee>0</TransactionEmployee></SVCMessage>x03x04";
// Here b is object of regex- Regular Expression
regex b(".*x01([A-Za-z0-9_-]*)x1C([A-Za-z0-9_-]*)x02([^x00-x1Fx7F]*)x03x04.*");
cout<< a << endl;

if( regex_match(a, b)){
cout << "String is matches Reguler Expreation " << endl;
}else{
cout << "String are not match" << endl;
}
return 0;
}

预期的结果是 - 字符串匹配...在 C++ 中

将正则表达式字符串中的加倍怎么样?

//.........VV...................VV...................VV......VV.........VV......VV...VV
regex b(".*\x01([A-Za-z0-9-_]*)\x1C([A-Za-z0-9-_]*)\x02([^\x00-\x1F\x7F]*)\x03\x04.*");

否则,您可以使用原始文本字符串

// .....VVV...........................................................................VV
regex b(R"(.*x01([A-Za-z0-9-_]*)x1C([A-Za-z0-9-_]*)x02([^x00-x1Fx7F]*)x03x04.*)");

题外建议:避免使用std::coutstd::stringstd::regex等进行using namespace std;和明确的std

最有可能的问题出在终止正则表达式字符串x00字符中。

您可以使用调用重载explicit basic_regex( const CharT* s, flag_type f = std::regex_constants::ECMAScript );字符串文本来初始化b

为避免此问题,您可以尝试使用std::string初始化它,而这反过来又可以像这样初始化:

char re[] = ".*x01([A-Za-z0-9_-]*)x1C([A-Za-z0-9_-]*)x02([^x00-x1Fx7F]*)x03x04.*";
std::string re_str(re, sizeof(re));