Python: Reading all buffered bytes without blocking

707 views

Is it possible to say to a BufferedReader stream "give me all the bytes you have available in the buffer, or do one OS call and give me everything you get back"? The problem is that the "number of bytes" argument to read1() isn't optional, so I can't do available_bytes = fd.read1().

I need this because I want to decode the returned bytes from UTF-8, and I *might* get a character split across the boundary of any arbitrary block size I choose. (I'm happy to ignore the possibility that the *source* did a flush part-way through a character).

I don't really want to have to do incremental encoding if I can avoid it - it looks hard...

posted Mar 3, 2015 by Ankit

Looking for an answer? Promote on:

Just specify large size.

commented Mar 3, 2015 by Majula Joshi

Thanks. Looking at the source, it appears that a large size will allocate a buffer that size for the data even if the amount actually read is small (thinking about it, of couse it has to, doh, because the syscall needs it).

Anyway, it's a pretty microscopic risk in practice, and when I looked at them, the incremental codecs (codecs.iterdecode) really aren't that hard to use, so I can do it that way if it matters enough.

For what it's worth, in case anyone wants to know, incremental decoding looks like this:

def get():
while True:
data = process.stdout.read(1000)
if not data:
break
yield data
for data in codecs.iterdecode(get(), encoding):
sys.stdout.write(data)
sys.stdout.flush()

commented Mar 3, 2015 by anonymous

import subprocess p = subprocess.Popen("D:PythonPython27Scriptspip.exe list -o", stdout=subprocess.PIPE, stderr=subprocess.STDOUT, bufsize=1, universal_newlines=True, shell=False) for line in p.stdout: print line

Python: Reading all buffered bytes without blocking

Your comment on this post:

Your answer

Preview