Python逐行从子进程捕获标准输出

Python capture stdout from subprocess line by line

本文关键字:标准输出 子进程 逐行 Python      更新时间:2023-10-16

我已经阅读了许多与此相关的问题,并学到了很多,但我仍然无法解决我的问题。我正在构建一个wxPython应用程序,它运行一个c++可执行文件,并实时显示该可执行文件的标准输出。我在尝试这个方法时遇到了几个奇怪的结果。以下是我当前的设置/问题:

//test.cc (compiled as test.out with gcc 4.5.2)
#include <stdio.h>
int main()
{
  FILE* fh = fopen("output.txt", "w");
  for (int i = 0; i < 10000; i++)
  {
      printf("Outputting: %dn", i);
      fprintf(fh, "Outputting: %dn", i);
  }
  fclose(fh);
  return 0;
}
#wxPythonScript.py (running on 2.7 interpreter)
def run(self):
  self.externalBinary = subprocess.Popen(['./test.out'], shell=False, stdout=subprocess.PIPE)
  while not self.wantAbort:
      line = self.externalBinary.stdout.readline()
      wx.PostEvent(self.notifyWindow, Result_Event(line, Result_Event.EVT_STDOUT_ID))
    print('Subprocess still running')
  print('Subprocess aborted smoothly')

如果我运行上面的代码,子进程需要很长时间才能完成,即使它所要做的只是写出数据并退出。但是,如果我运行以下命令,它会很快完成:

#wxPythonScript.py (running on 2.7 interpreter)
def run(self):
  outFile = open('output.txt', 'r+')
  self.externalBinary = subprocess.Popen(['./test.out'], shell=False, stdout=outFile)
  while not self.wantAbort:
      #line = self.externalBinary.stdout.readline()
      #wx.PostEvent(self.notifyWindow, Result_Event(line, Result_Event.EVT_STDOUT_ID))
    print('Subprocess still running')
  print('Subprocess aborted smoothly')

所以基本上每当我将标准输出从子进程重定向到PIPE时,它就会减慢/挂起,但如果我将其写入文件或根本不重定向,那么它就没事了。为什么呢?

我只在Windows上测试过,但它在2.6.6,2.7.2和3.2.1中都可以工作:

from __future__ import print_function
from subprocess import PIPE, Popen
from threading  import Thread
import sys
try:
    from Queue import Queue, Empty
except ImportError:
    from queue import Queue, Empty  # python 3.x
ON_POSIX = 'posix' in sys.builtin_module_names
def enqueue_output(out, queue):
    for line in iter(out.readline, b''):
        line = line.decode(sys.stdout.encoding)
        queue.put(line)
    out.close()
def main():
    p = Popen(['c/main.exe'], stdout=PIPE, bufsize=1, close_fds=ON_POSIX)
    q = Queue()
    t = Thread(target=enqueue_output, args=(p.stdout, q))
    t.daemon = True # thread dies with the program
    t.start()
    #initially the queue is empty and stdout is open
    #stdout is closed when enqueue_output finishes
    #then continue printing until the queue is empty 
    while not p.stdout.closed or not q.empty():
        try:
            line = q.get_nowait()
        except Empty:
            continue
        else:
            print(line, end='')
    return 0
if __name__ == '__main__':
    sys.exit(main())
输出:

Outputting: 0
Outputting: 1
Outputting: 2
...
Outputting: 9997
Outputting: 9998
Outputting: 9999
编辑:

readline()将阻塞,直到程序的标准输出缓冲区刷新,如果数据流是间歇性的,这可能需要很长时间。如果可以编辑源代码,一种选择是手动调用fflush(stdout),或者可以在程序开始时使用setvbuf禁用缓冲。例如:

#include <stdio.h>
int main() {
    setvbuf(stdout, NULL, _IONBF, 0);
    FILE* fh = fopen("output.txt", "w");
    int i;
    for (i = 0; i < 10; i++) {
        printf("Outputting: %dn", i);
        fprintf(fh, "Outputting: %dn", i);
        sleep(1);
    }
    fclose(fh);
    return 0;
}

还可以考虑使用unbuffer或stdbuf来修改现有程序的输出流。