使用boost::spirit phrase_Parse解析字符串

Parse string with boost::spirit phrase_parse

本文关键字:Parse 字符串 phrase boost spirit 使用      更新时间:2023-10-16

我想解析字符串

    std::string entry = "127.0.0.1 - [16/Aug/2012:01:50:02 +0000] "GET /check.htm HTTP/1.1" 200 17 "AgentName/0.1 libwww-perl/5.833""

具有以下内容:

    ip_rule %= lexeme[(+char_("0-9."))[ref(ip) = _1]];
    timestamp_rule %= lexeme[('[' >> +(char_ - ']') >> ']')[ref(timestamp) = _1]];
    user_rule %= lexeme[(+char_)[ref(user) = _1]];
    request_rule %= lexeme[('"' >> +(char_ - '"') >> '"')[ref(req) = _1]];
    referer_rule %= lexeme[('"' >> +(char_ - '"') >> '"')[ref(referer) = _1]];
    bool r = phrase_parse(first, last,
    ip_rule >> user_rule >> timestamp_rule >> request_rule >> uint_[ref(status) = _1]
    >> uint_[ref(transferred_bytes) = _1] >> referer_rule, space);

但它并不匹配。如果我从字符串中删除"-",当然还有规则"user_rule",那么它就会匹配。你能告诉我如何将这个字符串与"-"匹配吗?

您的user_rule"吃掉"文本的其余部分。这样定义它:+~qi::char_("[")),这样它就会停在'['字符处。以下代码按预期工作:

#include <boost/spirit/include/qi.hpp> 
using namespace boost::spirit::qi;
int main()
{
    std::string ip, user, timestamp, req, referer;
    unsigned status, transferred_bytes;
    std::string entry = "127.0.0.1 - [16/Aug/2012:01:50:02 +0000] "GET /check.htm HTTP/1.1" 200 17 "AgentName/0.1 libwww-perl/5.833"";
    bool r = phrase_parse(entry.begin(), entry.end(), 
    lexeme[+char_("0-9.")] >> 
        +~char_("[") >> 
        lexeme[('[' >> +~char_("]") >> ']')] >> 
        lexeme[('"' >> +~char_(""") >> '"')] >> 
        uint_ >> 
        uint_ >> 
        lexeme[('"' >> +~char_(""") >> '"')], space, ip, user, timestamp, req, status, transferred_bytes, referer); 
}