为什么 Python 的 I/O 比 C++ 慢得多

Why is Pythons I/O so much slower than C++?

本文关键字:C++ Python 为什么      更新时间:2023-10-16

我创建了一些用于在Python和C++中进行测试的代码,其中我从一个文件中读取两个矩阵并打印一些东西。似乎Python需要大约两倍的I/O时间:

$ ./test.sh -i Testing/2000.in -p "C++/read-write-only.out" -n 2
Executing: C++/read-write-only.out -i Testing/2000.in > TMPcurrentFileResult.out
It took 8 seconds for 2 executions
MIN: 4 seconds
MAX: 4 seconds
$ ./test.sh -i Testing/2000.in -p "python Python/read-write-only.py" -n 2
Executing: python Python/read-write-only.py -i Testing/2000.in > TMP..Results.out
It took 16 seconds for 2 executions
MIN: 8 seconds
MAX: 8 seconds

这是我用于 Python 的代码:

#!/usr/bin/python
# -*- coding: utf-8 -*-
from optparse import OptionParser
parser = OptionParser()
parser.add_option("-i", dest="filename", default="bigMatrix.in",
     help="input file with two matrices", metavar="FILE")
(options, args) = parser.parse_args()
def read(filename):
    lines = open(filename, 'r').read().splitlines()
    A = []
    B = []
    matrix = A
    for line in lines:
        if line != "":
            matrix.append(map(int, line.split("t")))
        else:
            matrix = B
    return A, B
def printMatrix(matrix):
    for line in matrix:
        print "t".join(map(str,line))
A, B = read(options.filename)
# Do something
printMatrix(B)

这是C++代码

#include <sstream>
#include <string>
#include <fstream>
#include <iostream>
#include <vector>
#include <algorithm>
using namespace std;
int getMatrixSize(string filename) {
    string line;
    ifstream infile;
    infile.open (filename.c_str());
    getline(infile, line);
    return count(line.begin(), line.end(), 't') + 1;
}
void read(string filename, vector< vector<int> > &A, vector< vector<int> > &B){
    string line;
    FILE* matrixfile = freopen(filename.c_str(), "r", stdin);
    int i = 0, j, a;
    while (getline(cin, line) && !line.empty()) {
        istringstream iss(line);
        j = 0;
        while (iss >> a) {
            A[i][j] = a;
            j++;
        }
        i++;
    }
    i = 0;
    while (getline(cin, line)) {
        istringstream iss(line);
        j = 0;
        while (iss >> a) {
            B[i][j] = a;
            j++;
        }
        i++;
    }
    fclose (matrixfile);
}
void printMatrix(vector< vector<int> > matrix, int n) {
    for (int i=0; i < n; i++) {
        for (int j=0; j < n; j++) {
            if (j != 0) {
                cout << "t";
            }
            cout << matrix[i][j];
        }
        cout << endl;
    }
}
int main (int argc, char* argv[]) {
    string filename;
    if (argc < 3) {
        filename = "bigMatrix.in";
    } else {
        filename = argv[2];
    }
    int n = getMatrixSize(filename);
    vector<int> inner (n);
    vector< vector<int> > A(n, inner), B(n, inner), C(n, inner);
    read (filename, A, B);
    // do something with the matrices
    printMatrix(C, n);
    return 0;
}

是否有可能像 I/O 一样快地获得 Python C++?如何改进 Python/C++ 的 I/O?

(我听说scanf应该比cin快。为什么它应该更快?

这是包含所有代码的 GIT 存储库。

启动 python 解释器需要一段时间。运行测试时要考虑到这一点。

当您不混合使用 C 和 C++ 文件操作例程时,应关闭与 stdio 的同步。

http://www.cplusplus.com/reference/iostream/ios_base/sync_with_stdio/

cincout通常应该比它们的 C 对应项更快(关闭同步)。

至于Python的缓慢,好吧,你为什么不检查I/O函数的实现呢?

必须缓冲任何语言的 I/O。

你为什么要比较Python和C++?

Python是一种解释型语言,而C++是编译的。