> when doing IO calls in Python so GIL is usually released so the kernel can already schedule another thread while waiting for IO
This is true, but scheduling another thread through the kernel can have higher overhead since it requires context switches. Running multiple threads also has other potential issues with lock contention; how problematic they are will depend on the use case.
The potential advantage of scheduling another thread is, of course, that it can do CPU-bound work; but in Python, unfortunately, doing that means the GIL doesn't get released, so that thread will prevent any further network I/O while it's running, the same as would happen in an async framework if a worker did a lot of CPU work. So Python doesn't really let you realize the advantages of threads in this context.
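A minimal sketch of that limitation (assuming CPython; the loop size and thread count are arbitrary): two pure-Python CPU-bound threads take roughly as long as running the same work twice sequentially, because the GIL serializes them.

```python
import threading
import time

def cpu_bound(n):
    # Pure-Python arithmetic holds the GIL, so two of these threads
    # largely serialize instead of running on two cores.
    total = 0
    for i in range(n):
        total += i * i
    return total

for workers in (1, 2):
    start = time.perf_counter()
    threads = [threading.Thread(target=cpu_bound, args=(10_000_000,))
               for _ in range(workers)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    print(f"{workers} thread(s): {time.perf_counter() - start:.2f}s")
```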
> doing that means the GIL doesn't get released, so that thread will prevent any further network I/O while it's running, the same as would happen in an async framework if a worker did a lot of CPU work. So Python doesn't really let you realize the advantages of threads in this context.
> Computing intensive, no. Code that is doing a CPU intensive computation but makes no system calls will never release the GIL.
Any code that does not involve Python objects can release the GIL, whether or not it makes system calls.
For example, NumPy, the most popular scientific computation package in Python, on which many other popular packages like Pandas are built, releases the GIL when doing operations on arrays. This is documented at https://numpy.org/doc/stable/reference/internals.code-explan...:
> If NPY_ALLOW_THREADS is defined during compilation, then as long as no object arrays are involved, the Python Global Interpreter Lock (GIL) is released prior to calling the loops. It is re-acquired if necessary to handle error conditions.
And that does not involve running Python bytecode. Yes, NumPy and other packages that provide C extensions do this when they are doing computations that don't require running Python bytecode.
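A small sketch of what that buys you (array size and iteration count are arbitrary, and the exact speedup depends on the machine and the NumPy build): an elementwise ufunc on a non-object float array runs its inner loop in C with the GIL released, so two threads can actually overlap on separate cores, unlike the pure-Python case above.

```python
import threading
import time
import numpy as np

x = np.random.rand(20_000_000)

def work():
    # np.sqrt on a float (non-object) array: the inner loop runs in C
    # with the GIL released, so threads can run it concurrently.
    for _ in range(10):
        np.sqrt(x)

for workers in (1, 2):
    start = time.perf_counter()
    threads = [threading.Thread(target=work) for _ in range(workers)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    print(f"{workers} thread(s): {time.perf_counter() - start:.2f}s")
```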
There is an advantage to threads in the CPU-bound case, which is that the work of other threads will not be blocked by a CPU-intensive operation. With an IO-event-based scheduler, your CPU-bound task will not context switch, causing network logic elsewhere to simply time out. A particularly acute example is something like a network library logging into a MySQL database, which gives the client a ten-second window to respond to the initial security challenge. It was both an extremely difficult bug for me to diagnose and helpful for my role at work that I was able to track that one down in OpenStack :). A sketch of the failure mode and the usual workaround follows below.
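This is a minimal illustration, not the actual OpenStack code (the names are made up, and cpu_bound stands in for any pure-Python heavy computation): called directly inside a coroutine, the busy work never yields to the event loop, so the heartbeat task, standing in for protocol traffic with a deadline, stalls completely. Handing the work to a thread via run_in_executor keeps the loop ticking, because the OS preempts the thread and CPython drops the GIL at its switch interval, so other tasks run slower but are not starved.

```python
import asyncio
import time

def cpu_bound(seconds):
    # Pure-Python busy work: run directly in a coroutine, it never
    # yields to the event loop, so every other task stalls.
    deadline = time.perf_counter() + seconds
    while time.perf_counter() < deadline:
        pass

async def heartbeat():
    # Stands in for protocol traffic with a deadline, like the
    # ten-second MySQL handshake window described above.
    while True:
        print("tick", time.monotonic())
        await asyncio.sleep(1)

async def main():
    hb = asyncio.create_task(heartbeat())
    # Calling cpu_bound(3) here would freeze heartbeat() for 3 seconds.
    # Running it in a worker thread keeps the loop responsive: the OS
    # preempts the thread, so the heartbeats slow down but don't stop.
    await asyncio.get_running_loop().run_in_executor(None, cpu_bound, 3)
    hb.cancel()

asyncio.run(main())
```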