Skip to content

Crash when calling textiowrapper_iternext and writing to a text file simultaneously in ft build #134908

Closed
@duaneg

Description

@duaneg

Running this script under free-threading build:

import os
import tempfile
import threading

N=2
COUNT=100

def writer(file, barrier):
    barrier.wait()
    for _ in range(COUNT):
        f.write("x")

def reader(file, stopping):
    while not stopping.is_set():
        for line in file:
            assert line == ""

stopping = threading.Event()
with tempfile.NamedTemporaryFile("w+") as f:
    reader = threading.Thread(target=reader, args=(f, stopping))
    reader.start()

    barrier = threading.Barrier(N)
    writers = [threading.Thread(target=writer, args=(f, barrier))
               for _ in range(N)]
    for t in writers:
        t.start()
    for t in writers:
        t.join()
    stopping.set()
    reader.join()

    f.flush()
    assert(os.stat(f.name).st_size == COUNT * N)

...results in a crash:

python: ./Modules/_io/textio.c:1751: _io_TextIOWrapper_write_impl: Assertion `self->pending_bytes_count == 0' failed.
Aborted (core dumped)

The textiowrapper_iternext method is not protected by a critical section and calls _textiowrapper_readline, which calls _textiowrapper_writeflush, which relies on the GIL or critical section to synchronise access to its internal data. In a free-threading build this means iterating over lines in a text file is not thread-safe and can crash if it races with writes or other operations.

It looks like all other entry points that could call _textiowrapper_writeflush are protected by the critical section, so the easiest fix is probably to just likewise protect textiowrapper_iternext.

CPython versions tested on:

CPython main branch

Operating systems tested on:

Linux

Output from running 'python -VV' on the command line:

No response

Linked PRs

Activity

added
type-crashA hard crash of the interpreter, possibly with a core dump
on May 30, 2025
added a commit that references this issue on May 30, 2025

pythongh-134908: protect ``textiowrapper_iternext`` with critical sec…

duaneg

duaneg commented on May 30, 2025

@duaneg
ContributorAuthor

Note that I found this while investigating #118138, however I think it is a separate bug that should be addressed under a separate issue.

CC @cmaloney who seems to be looking at related issues in this area already 👋

ZeroIntensity

ZeroIntensity commented on May 30, 2025

@ZeroIntensity
Member

I'm pretty sure we don't support concurrent calls to __next__. You need to manually serialize the calls yourself. See gh-124397.

duaneg

duaneg commented on May 30, 2025

@duaneg
ContributorAuthor

See gh-124397.

That is a very interesting discussion, thanks for linking!

I'm pretty sure we don't support concurrent calls to __next__. You need to manually serialize the calls yourself.

I totally agree that in general iterators should not be assumed to be thread-safe, in the broader sense. However they should not crash, which is happening here. In the description I reported an assertion failure, but the repro script will crash with assertions disabled too, just in more "interesting" ways.

ZeroIntensity

ZeroIntensity commented on May 30, 2025

@ZeroIntensity
Member

You can get similar crashes from pure-Python by concurrently calling an object's __init__. I don't think there's much we can do without substantially hurting performance.

duaneg

duaneg commented on May 30, 2025

@duaneg
ContributorAuthor

I'm sure there are places where fixing crashes without substantially hurting performance might be difficult or impossible, but I don't think this is one of them. It should have no impact on GIL performance and minimal in free-threading.

The third bullet point in #124397 is:

Other iterators implemented in C will get only the minimal changes necessary to cause them to not crash in a free-threaded build. The edits should be made in a way that does not impact existing semantics or performance (i.e. do not damage the standard GIL build).

I think this fix is exactly that...or at least, that was exactly what I intended when I wrote it. Of course, I may well have missed something!

ZeroIntensity

ZeroIntensity commented on May 30, 2025

@ZeroIntensity
Member

I'll let others weigh in here, but my personal stance is that concurrent iteration without synchronization by the caller is not a real use-case. The example you sent would suffer from so much lock contention that it would be no better than calling it in a single thread :(

I think that last point is in regards to making the iterator thread-safe when others are calling separate methods on it, not necessarily concurrent calls to __next__. I think this needs clarification, though.

colesbury

colesbury commented on May 30, 2025

@colesbury
Contributor

From looking at the repro code, this looks like something we should fix.

ZeroIntensity

ZeroIntensity commented on May 30, 2025

@ZeroIntensity
Member

Ok, thanks Sam. For clarity: when should we fix these cases?

colesbury

colesbury commented on May 30, 2025

@colesbury
Contributor

This looks like concurrent operations on a NamedTemporaryFile. I think I/O should generally be internally synchronized. That's mostly true for Python already, it's usually true at the OS level, as well as in other language runtimes.

I get that there is a crash in iteration, but it doesn't look like there's an iterator shared between threads.

15 remaining items

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Metadata

Metadata

Assignees

No one assigned

    Labels

    interpreter-core(Objects, Python, Grammar, and Parser dirs)topic-free-threadingtype-crashA hard crash of the interpreter, possibly with a core dump

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions

      Crash when calling `textiowrapper_iternext` and writing to a text file simultaneously in ft build · Issue #134908 · python/cpython

      Follow Lee on X/Twitter - Father, Husband, Serial builder creating AI, crypto, games & web tools. We are friends :) AI Will Come To Life!

      Check out: eBank.nz (Art Generator) | Netwrck.com (AI Tools) | Text-Generator.io (AI API) | BitBank.nz (Crypto AI) | ReadingTime (Kids Reading) | RewordGame | BigMultiplayerChess | WebFiddle | How.nz | Helix AI Assistant