Main Contexts
=============

Summary
-------

- Use ``g_main_context_invoke_full()`` to invoke functions in other threads,
  assuming every thread has a thread default main context which runs throughout
  the lifetime of that thread
- Use ``GTask`` to run a function in the background without caring about the
  specific thread used
- Liberally use assertions to check which context executes each function, and
  add these assertions when first writing the code
- Explicitly document contexts a function is expected to be called in, a
  callback will be invoked in, or a signal will be emitted in
- Beware of ``g_idle_add()`` and similar functions which implicitly use the
  global-default main context

What is GMainContext?
---------------------

``GMainContext`` is a generalized implementation of an event loop, useful for
implementing polled file I/O or event-based widget systems (such as GTK). It is
at the core of almost every GLib application. Understanding ``GMainContext``
requires understanding ``poll()`` and polled I/O.

A ``GMainContext`` has a set of sources which are ‘attached’ to it, each of
which can be thought of as an expected event with an associated callback
function which will be invoked when that event is received; or equivalently as a
set of file descriptors (FDs) to check. An event could be a timeout or data
being received on a socket, for example. One iteration of the event loop will:

- Prepare sources, determining if any of them are ready to dispatch immediately.
- Poll the sources, blocking the current thread until an event is received for
  one of the sources.
- Check which of the sources received an event (several could have).
- Dispatch callbacks from those sources.

This is explained very well in the `GLib documentation
<https://docs.gtk.org/glib/main-loop.html>`__.
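
As a minimal sketch of these stages in practice, the code below attaches a
timeout source to a private context and iterates that context by hand until the
callback has been dispatched. The names ``timeout_fired_cb()`` and
``iterate_until_timeout()`` are hypothetical helpers for illustration, not GLib
API. Each call to ``g_main_context_iteration()`` performs one prepare, poll,
check and dispatch cycle.

.. code-block:: c

   #include <glib.h>

   /* Callback for the timeout source: record that it was dispatched. */
   static gboolean
   timeout_fired_cb (gpointer user_data)
   {
     gboolean *fired = user_data;

     *fired = TRUE;

     return G_SOURCE_REMOVE;
   }

   /* Hypothetical helper: iterate a private context until a timeout of
    * @interval_ms milliseconds has been dispatched. */
   static gboolean
   iterate_until_timeout (guint interval_ms)
   {
     GMainContext *context = g_main_context_new ();
     GSource *source = g_timeout_source_new (interval_ms);
     gboolean fired = FALSE;

     g_source_set_callback (source, timeout_fired_cb, &fired, NULL);
     g_source_attach (source, context);
     g_source_unref (source);

     /* Each iteration blocks in the poll stage until a source is ready. */
     while (!fired)
       g_main_context_iteration (context, TRUE);

     g_main_context_unref (context);

     return fired;
   }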

At its core, ``GMainContext`` is just a ``poll()`` loop, with the preparation,
check and dispatch stages of the loop corresponding to the normal preamble and
postamble in a typical ``poll()`` loop implementation. Typically, some
complexity is needed in non-trivial ``poll()``-using applications to track the
lists of FDs which are being polled. Additionally, ``GMainContext`` adds a lot
of useful functionality which vanilla ``poll()`` doesn’t support. Most
importantly, it adds thread safety.

``GMainContext`` is completely thread safe, meaning that a ``GSource`` can be
created in one thread and attached to a ``GMainContext`` running in another
thread. A typical use for this might be to allow worker threads to control which
sockets are being listened to by a ``GMainContext`` in a central I/O thread.
Each ``GMainContext`` is ‘acquired’ by a thread for each iteration it’s put
through. Other threads cannot iterate a ``GMainContext`` without acquiring it,
which guarantees that a ``GSource`` and its FDs will only be polled by one
thread at once (since each ``GSource`` is attached to at most one
``GMainContext``). A ``GMainContext`` can be swapped between threads across
iterations, but this is expensive.
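
Ownership can be observed directly. The following sketch, which uses a
hypothetical helper named ``context_ownership_demo()``, acquires a freshly
created context in the calling thread and checks
``g_main_context_is_owner()`` before, during and after. It assumes that no
other thread is using the context.

.. code-block:: c

   #include <glib.h>

   /* Hypothetical helper: returns TRUE if ownership of a new context
    * changes as expected around an acquire/release pair. */
   static gboolean
   context_ownership_demo (void)
   {
     GMainContext *context = g_main_context_new ();
     gboolean owned_before, owned_during, owned_after;

     /* Nobody has acquired the new context yet. */
     owned_before = g_main_context_is_owner (context);

     if (!g_main_context_acquire (context))
       {
         g_main_context_unref (context);
         return FALSE;
       }

     /* The calling thread is now the owner and may iterate the context. */
     owned_during = g_main_context_is_owner (context);

     g_main_context_release (context);
     owned_after = g_main_context_is_owner (context);

     g_main_context_unref (context);

     return !owned_before && owned_during && !owned_after;
   }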

``GMainContext`` is used instead of ``poll()`` mostly for convenience, as it
transparently handles dynamically managing the array of FDs to pass to
``poll()``, especially when operating over multiple threads. This is done by
encapsulating file descriptors inside a ``GSource``, which decides whether those
FDs should be passed to the ``poll()`` call on each ‘prepare’ stage of the main
context iteration.
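
To illustrate how an FD is wrapped in a ``GSource``, the sketch below polls the
read end of a pipe through a private context. It is Unix-only and assumes
``g_unix_fd_source_new()`` from ``glib-unix.h`` and the ``G_SOURCE_FUNC()``
macro (GLib 2.58 or later) are available. ``PipeReadData``,
``pipe_readable_cb()`` and ``read_byte_via_source()`` are hypothetical names.

.. code-block:: c

   #include <glib.h>
   #include <glib-unix.h>
   #include <unistd.h>

   typedef struct {
     gboolean done;
     gchar byte;
   } PipeReadData;

   /* Dispatched when the read end of the pipe becomes readable. */
   static gboolean
   pipe_readable_cb (gint         fd,
                     GIOCondition condition,
                     gpointer     user_data)
   {
     PipeReadData *data = user_data;

     if (read (fd, &data->byte, 1) != 1)
       data->byte = '\0';
     data->done = TRUE;

     return G_SOURCE_REMOVE;
   }

   /* Hypothetical helper: write @byte to a pipe and wait for it using an
    * FD source. Returns the byte read, or '\0' on failure. */
   static gchar
   read_byte_via_source (gchar byte)
   {
     PipeReadData data = { FALSE, '\0' };
     int fds[2];

     if (pipe (fds) != 0)
       return '\0';

     if (write (fds[1], &byte, 1) == 1)
       {
         GMainContext *context = g_main_context_new ();
         GSource *source = g_unix_fd_source_new (fds[0], G_IO_IN);

         g_source_set_callback (source, G_SOURCE_FUNC (pipe_readable_cb),
                                &data, NULL);
         g_source_attach (source, context);
         g_source_unref (source);

         /* The context passes the FD to poll() on our behalf. */
         while (!data.done)
           g_main_context_iteration (context, TRUE);

         g_main_context_unref (context);
       }

     close (fds[0]);
     close (fds[1]);

     return data.byte;
   }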

What is GMainLoop?
------------------

``GMainLoop`` is essentially the following few lines of code, once reference
counting and locking have been removed:

.. code-block:: c

   loop->is_running = TRUE;
   while (loop->is_running)
     {
       if (quit_condition)
         loop->is_running = FALSE;

       g_main_context_iteration (context, TRUE);
     }


Setting ``quit_condition`` to ``TRUE`` will cause the loop to terminate once the
current main context iteration ends.

Hence, ``GMainLoop`` is a convenient, thread-safe way of running a
``GMainContext`` to process events until a desired exit condition is met, at
which point ``g_main_loop_quit()`` should be called. Typically, in a UI program,
this will be the user clicking ‘exit’. In a socket handling program, this might
be the final socket closing.

It is important not to confuse main contexts with main loops. Main contexts do
the bulk of the work: preparing source lists, waiting for events, and
dispatching callbacks. A main loop simply iterates a context.
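
The division of labour can be sketched as follows: the context holds a timeout
source, while the loop only iterates the context until
``g_main_loop_quit()`` is called from the callback. ``TickData``,
``tick_cb()`` and ``run_three_ticks()`` are hypothetical names, and the 5 ms
interval is an arbitrary choice.

.. code-block:: c

   #include <glib.h>

   typedef struct {
     GMainLoop *loop;
     guint n_ticks;
   } TickData;

   /* Count each dispatch and quit the loop after the third one. */
   static gboolean
   tick_cb (gpointer user_data)
   {
     TickData *data = user_data;

     data->n_ticks++;

     if (data->n_ticks == 3)
       {
         g_main_loop_quit (data->loop);
         return G_SOURCE_REMOVE;
       }

     return G_SOURCE_CONTINUE;
   }

   /* Hypothetical helper: run a main loop on a private context until
    * the exit condition is met, and return the number of ticks seen. */
   static guint
   run_three_ticks (void)
   {
     GMainContext *context = g_main_context_new ();
     TickData data = { NULL, 0 };
     GSource *source = g_timeout_source_new (5);

     data.loop = g_main_loop_new (context, FALSE);

     g_source_set_callback (source, tick_cb, &data, NULL);
     g_source_attach (source, context);
     g_source_unref (source);

     /* Blocks, iterating @context, until g_main_loop_quit() is called. */
     g_main_loop_run (data.loop);

     g_main_loop_unref (data.loop);
     g_main_context_unref (context);

     return data.n_ticks;
   }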

Default Contexts
----------------

One of the important features of ``GMainContext`` is its support for ‘default’
contexts. There are two levels of default context: the thread-default, and the
global-default. The global-default (accessed using ``g_main_context_default()``) is
run by GTK when ``g_application_run()`` is called. It’s also used for timeouts
(``g_timeout_add()``) and idle callbacks (``g_idle_add()``) — these won’t be dispatched
unless the default context is running!

Thread-default contexts are generally used for I/O operations which need to run
and dispatch callbacks in a thread. By calling
``g_main_context_push_thread_default()`` before starting an I/O operation, the
thread-default context is set and the I/O operation can add its sources to that
context. The context can then be run in a new main loop in an I/O thread,
causing the callbacks to be dispatched on that thread’s stack rather than on the
stack of the thread running the global-default main context. This allows I/O
operations to be run entirely in a separate thread without explicitly passing a
specific ``GMainContext`` pointer around everywhere.
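
A small sketch of how the thread-default changes around a push and pop pair is
given below. ``thread_default_demo()`` is a hypothetical helper. Code called
between the push and the pop can find the context with
``g_main_context_get_thread_default()`` instead of receiving it as a
parameter.

.. code-block:: c

   #include <glib.h>

   /* Hypothetical helper: returns TRUE if a new context is the
    * thread-default only between the push and the pop. */
   static gboolean
   thread_default_demo (void)
   {
     GMainContext *context = g_main_context_new ();
     gboolean is_default_during, is_default_after;

     g_main_context_push_thread_default (context);

     /* Any operation started here can look up @context implicitly. */
     is_default_during = (g_main_context_get_thread_default () == context);

     g_main_context_pop_thread_default (context);

     /* The previous thread-default (possibly none) is restored. */
     is_default_after = (g_main_context_get_thread_default () == context);

     g_main_context_unref (context);

     return is_default_during && !is_default_after;
   }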

Conversely, by starting a long-running operation with a specific thread-default
context set, the calling code can guarantee that the operation’s callbacks will
be dispatched in that context, even if the operation itself runs in a worker
thread. This is the principle behind ``GTask``: when a new ``GTask`` is created, it
stores a reference to the current thread-default context, and dispatches its
completion callback in that context, even if the task itself is run using
``g_task_run_in_thread()``.

For example, the code below will run a ``GTask`` which performs two writes in
parallel from a thread. The callbacks for the writes will be dispatched in the
worker thread, whereas the callback from the task as a whole will be dispatched
in ``interesting_context``.

.. code-block:: c

   typedef struct {
     GMainLoop *main_loop;
     guint n_remaining;
   } WriteData;
   
   /* This is always called in the same thread as thread_cb() because
    * it’s always dispatched in the @worker_context. */
   static void
   write_cb (GObject      *source_object,
             GAsyncResult *result,
             gpointer      user_data)
   {
     WriteData *write_data = user_data;
     GOutputStream *stream = G_OUTPUT_STREAM (source_object);
     GError *error = NULL;
   
     /* Finish the write. The number of bytes written is ignored here. */
     g_output_stream_write_finish (stream, result, &error);
     if (error != NULL)
       {
         g_warning ("Error: %s", error->message);
         g_clear_error (&error);
       }
   
     /* Check whether all parallel operations have finished. */
     write_data->n_remaining--;
   
     if (write_data->n_remaining == 0)
       {
         g_main_loop_quit (write_data->main_loop);
       }
   }
   
   /* This is called in a new thread. */
   static void
   thread_cb (GTask        *task,
              gpointer      source_object,
              gpointer      task_data,
              GCancellable *cancellable)
   {
     /* These streams come from somewhere else in the program: */
     GOutputStream *output_stream1, *output_stream2;
     GMainContext *worker_context;
     WriteData write_data;
     GBytes *data;
     const guint8 *buf;
     gsize len;
   
     /* Set up a worker context for the writes’ callbacks. */
     worker_context = g_main_context_new ();
     g_main_context_push_thread_default (worker_context);
   
     /* Set up the writes. */
     write_data.n_remaining = 2;
     write_data.main_loop = g_main_loop_new (worker_context, FALSE);
   
     data = g_task_get_task_data (task);
     buf = g_bytes_get_data (data, &len);
   
     g_output_stream_write_async (output_stream1, buf, len,
                                  G_PRIORITY_DEFAULT, NULL, write_cb,
                                  &write_data);
     g_output_stream_write_async (output_stream2, buf, len,
                                  G_PRIORITY_DEFAULT, NULL, write_cb,
                                  &write_data);
   
     /* Run the main loop until both writes have finished. */
     g_main_loop_run (write_data.main_loop);
     g_task_return_boolean (task, TRUE);  /* ignore errors */
   
     g_main_loop_unref (write_data.main_loop);
   
     g_main_context_pop_thread_default (worker_context);
     g_main_context_unref (worker_context);
   }
   
   /* This can be called from any thread. Its @callback will always be
    * dispatched in the thread which currently owns
    * @interesting_context. */
   void
   parallel_writes_async (GBytes              *data,
                          GMainContext        *interesting_context,
                          GCancellable        *cancellable,
                          GAsyncReadyCallback  callback,
                          gpointer             user_data)
   {
     GTask *task;
   
     g_main_context_push_thread_default (interesting_context);
   
     task = g_task_new (NULL, cancellable, callback, user_data);
     g_task_set_task_data (task, data,
                           (GDestroyNotify) g_bytes_unref);
     g_task_run_in_thread (task, thread_cb);
     g_object_unref (task);
   
     g_main_context_pop_thread_default (interesting_context);
   }


Implicit Use of the Global-Default Main Context
-----------------------------------------------

Several functions implicitly add sources to the global-default main context.
They should not be used in threaded code. Instead, use ``g_source_attach()``
with the ``GSource`` created by the replacement function from the table below.

Implicit use of the global-default main context means the callback functions are
invoked in the main thread, typically resulting in work being brought back from
a worker thread into the main thread. 

.. list-table::
   :widths: 50 50
   :header-rows: 1

   * - Do not use
     - Use instead
   * - ``g_timeout_add()``
     - ``g_timeout_source_new()``
   * - ``g_idle_add()``
     - ``g_idle_source_new()``
   * - ``g_child_watch_add()``
     - ``g_child_watch_source_new()``

So to delay some computation in a worker thread, use the following code:

.. code-block:: c

   static gboolean do_computation (gpointer user_data);

   static guint
   schedule_computation (guint delay_seconds)
   {
     /* Get the calling context. This is NULL if no thread-default
      * context has been pushed, in which case the source is attached
      * to the global-default context. */
     GMainContext *context = g_main_context_get_thread_default ();
   
     GSource *source = g_timeout_source_new_seconds (delay_seconds);
     g_source_set_callback (source, do_computation, NULL, NULL);

     guint id = g_source_attach (source, context);

     g_source_unref (source);
   
     /* The ID can be used with the same @context to
      * cancel the scheduled computation if needed. */
     return id;
   }
   
   static gboolean
   do_computation (gpointer user_data)
   {
     /* ... */

     /* Only run the computation once. */
     return G_SOURCE_REMOVE;
   }


Using GMainContext in a Library
-------------------------------

At a high level, library code must not make changes to main contexts which could
affect the execution of an application using the library, for example by
changing when the application’s sources are dispatched. There are various best
practices which can be followed to aid this.

Never iterate a context created outside the library, including the
global-default or thread-default contexts. Otherwise, sources created in the
application may be dispatched when the application is not expecting it, causing
re-entrancy problems for the application code.

Always remove sources from a main context before dropping the library’s last
reference to the context, especially if it may have been exposed to the
application (for example, as a thread-default). Otherwise the application may
keep a reference to the main context and continue iterating it after the library
has returned, potentially causing unexpected source dispatches in the library.
This is equivalent to not assuming that dropping the library’s last reference to
a main context will finalize that context.

If the library is designed to be used from multiple threads, or in a
context-aware fashion, always document which context each callback will be
dispatched in. For example, “callbacks will always be dispatched in the context
which is the thread-default at the time of the object’s construction”.
Developers using the library’s API need to know this information.

Use ``g_main_context_invoke()`` to ensure callbacks are dispatched in the right
context. It’s much easier than manually using ``g_idle_source_new()`` to
transfer work between contexts.

Libraries should never use ``g_main_context_default()`` (or, equivalently, pass
NULL to a ``GMainContext``-typed parameter). Always store and explicitly use a
specific ``GMainContext``, even if it often points to some default context. This
makes the code easier to split out into threads in future, if needed, without
causing hard-to-debug problems caused by callbacks being invoked in the wrong
context.
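
One way to follow this advice is to capture a context when an object is
constructed and keep it for the lifetime of the object. The sketch below
assumes a hypothetical library type, ``MyWorker``, and uses
``g_main_context_ref_thread_default()``, which returns a new reference to the
thread-default context, or to the global-default context if no thread-default
is set.

.. code-block:: c

   #include <glib.h>

   /* Hypothetical library object which remembers the context it was
    * constructed in. */
   typedef struct {
     GMainContext *context;  /* owned */
   } MyWorker;

   static MyWorker *
   my_worker_new (void)
   {
     MyWorker *worker = g_new0 (MyWorker, 1);

     /* Store a specific context rather than looking one up later. */
     worker->context = g_main_context_ref_thread_default ();

     return worker;
   }

   static void
   my_worker_free (MyWorker *worker)
   {
     g_main_context_unref (worker->context);
     g_free (worker);
   }

Callbacks can then be scheduled with ``worker->context`` explicitly, whichever
thread the scheduling code happens to run in.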

Write things asynchronously internally (using ``GTask`` where appropriate), and
keep synchronous wrappers at the very top level of an API, where they can be
implemented by calling ``g_main_context_iteration()`` on a specific
``GMainContext``. Again, this makes future refactoring easier. This is
demonstrated in the previous example: the thread uses
``g_output_stream_write_async()`` rather than ``g_output_stream_write()``. A
worker thread may be used instead, and this can simplify the callback chain for
long series of asynchronous calls; but at the cost of increased complexity in
verifying the code is race-free.
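
A synchronous wrapper of this kind might look like the sketch below. The
asynchronous operation, ``compute_answer_async()``, is a hypothetical stand-in
which completes a ``GTask`` immediately; the wrapper creates its own context,
so it never iterates a context belonging to the application.

.. code-block:: c

   #include <gio/gio.h>

   typedef struct {
     gboolean done;
     gint result;
   } SyncData;

   /* Hypothetical asynchronous operation. GTask defers @callback to the
    * thread-default context, even though the result is ready at once. */
   static void
   compute_answer_async (GAsyncReadyCallback callback,
                         gpointer            user_data)
   {
     GTask *task = g_task_new (NULL, NULL, callback, user_data);

     g_task_return_int (task, 42);
     g_object_unref (task);
   }

   static void
   compute_answer_cb (GObject      *source_object,
                      GAsyncResult *result,
                      gpointer      user_data)
   {
     SyncData *data = user_data;

     data->result = (gint) g_task_propagate_int (G_TASK (result), NULL);
     data->done = TRUE;
   }

   /* Synchronous wrapper: iterate a private context until the
    * asynchronous operation has called back. */
   static gint
   compute_answer_sync (void)
   {
     GMainContext *context = g_main_context_new ();
     SyncData data = { FALSE, 0 };

     g_main_context_push_thread_default (context);

     compute_answer_async (compute_answer_cb, &data);

     while (!data.done)
       g_main_context_iteration (context, TRUE);

     g_main_context_pop_thread_default (context);
     g_main_context_unref (context);

     return data.result;
   }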

Always match pushes and pops of the thread-default main context:
``g_main_context_push_thread_default()`` and
``g_main_context_pop_thread_default()``.

Ensuring Functions are Called in the Right Context
--------------------------------------------------

The ‘right context’ is the thread-default main context of the thread the
function should be executing in. This assumes the typical case that every thread
has a single main context running in a main loop. A main context effectively
provides a work or message queue for the thread — something which the thread can
periodically check to determine if there is work pending from another thread.
Putting a message on this queue – invoking a function in another main context –
will result in it eventually being dispatched in that thread.

For example, if an application does a long and CPU-intensive computation it
should schedule this in a background thread so that UI updates in the main
thread are not blocked. The results of the computation, however, might need to
be displayed in the UI, so some UI update function must be called in the main
thread once the computation’s complete.

Furthermore, if the computation function can be limited to a single thread, it
becomes easy to eliminate the need for locking a lot of the data it accesses.
This assumes that other threads are implemented similarly and hence most data is
only accessed by a single thread, with threads communicating by message passing.
This allows each thread to update its data at its leisure, which significantly
simplifies locking.

For some functions, there might be no reason to care which context they’re
executed in, perhaps because they’re asynchronous and hence do not block the
context. However, it is still advisable to be explicit about which context is
used, since those functions may emit signals or invoke callbacks, and for
reasons of thread safety it’s necessary to know which threads those signal
handlers or callbacks are going to be invoked in.

For example, the progress callback in ``g_file_copy_async()`` is documented as
being called in the thread-default main context at the time of the initial call.

Principles of Invocation
------------------------

The core principle of invoking a function in a specific context is simple, and
is walked through below to explain the concepts. In practice, the convenience
method ``g_main_context_invoke_full()`` should be used instead.

A ``GSource`` has to be added to the target ``GMainContext``, which will invoke
the function when it’s dispatched. This ``GSource`` should almost always be an
idle source created with ``g_idle_source_new()``, but this doesn’t have to be
the case. It could be a timeout source so that the function is executed after a
delay, for example.

The ``GSource`` will be dispatched as soon as it’s ready, calling the function
on the thread’s stack. In the case of an idle source, this will be as soon as
all sources at a higher priority have been dispatched — this can be tweaked
using the idle source’s priority parameter with ``g_source_set_priority()``. The
source will typically then be destroyed so the function is only executed once
(though again, this doesn’t have to be the case).

Data can be passed between threads as the ``user_data`` passed to the
``GSource``’s callback. This is set on the source using
``g_source_set_callback()``, along with the callback function to invoke. Only a
single pointer is provided, so if multiple data fields need passing, they must
be wrapped in an allocated structure.

The example below demonstrates the underlying principles, but there are
convenience methods explained below which simplify things.

.. code-block:: c

   /* Main function for the background thread, thread1. */
   static gpointer
   thread1_main (gpointer user_data)
   {
     GMainContext *thread1_main_context = user_data;
     GMainLoop *main_loop;
   
     /* Set up the thread’s context and run it forever. */
     g_main_context_push_thread_default (thread1_main_context);
   
     main_loop = g_main_loop_new (thread1_main_context, FALSE);
     g_main_loop_run (main_loop);
     g_main_loop_unref (main_loop);
   
     g_main_context_pop_thread_default (thread1_main_context);
     g_main_context_unref (thread1_main_context);
   
     return NULL;
   }
   
   /* A data closure structure to carry multiple variables between
    * threads. */
   typedef struct {
     gchar   *some_string;  /* owned */
     guint    some_int;
     GObject *some_object;  /* owned */
   } MyFuncData;
   
   static void
   my_func_data_free (MyFuncData *data)
   {
     g_free (data->some_string);
     g_clear_object (&data->some_object);
     g_free (data);
   }
   
   static void
   my_func (const gchar *some_string,
            guint        some_int,
            GObject     *some_object)
   {
     /* Do something long and CPU intensive! */
   }
   
   /* Convert an idle callback into a call to my_func(). */
   static gboolean
   my_func_idle (gpointer user_data)
   {
     MyFuncData *data = user_data;
   
     my_func (data->some_string, data->some_int, data->some_object);
   
     return G_SOURCE_REMOVE;
   }
   
   /* Function to be called in the main thread to schedule a call to
    * my_func() in thread1, passing the given parameters along. */
   static void
   invoke_my_func (GMainContext *thread1_main_context,
                   const gchar  *some_string,
                   guint         some_int,
                   GObject      *some_object)
   {
     GSource *idle_source;
     MyFuncData *data;
   
     /* Create a data closure to pass all the desired variables
      * between threads. */
     data = g_new0 (MyFuncData, 1);
     data->some_string = g_strdup (some_string);
     data->some_int = some_int;
     data->some_object = g_object_ref (some_object);
   
     /* Create a new idle source, set my_func() as the callback with
      * some data to be passed between threads, bump up the priority
      * and schedule it by attaching it to thread1’s context. */
     idle_source = g_idle_source_new ();
     g_source_set_callback (idle_source, my_func_idle, data,
                            (GDestroyNotify) my_func_data_free);
     g_source_set_priority (idle_source, G_PRIORITY_DEFAULT);
     g_source_attach (idle_source, thread1_main_context);
     g_source_unref (idle_source);
   }
   
   /* Main function for the main thread. */
   int
   main (void)
   {
     GThread *thread1;
     GMainContext *thread1_main_context;
     GObject *some_object;
   
     /* Spawn a background thread and pass it a reference to its
      * GMainContext. Retain a reference for use in this thread
      * too. */
     thread1_main_context = g_main_context_new ();
     thread1 = g_thread_new ("thread1", thread1_main,
                             g_main_context_ref (thread1_main_context));
   
     /* Maybe set up your UI here, for example. An arbitrary object
      * stands in for real application data. */
     some_object = g_object_new (G_TYPE_OBJECT, NULL);
   
     /* Invoke my_func() in the other thread. */
     invoke_my_func (thread1_main_context,
                     "some data which needs passing between threads",
                     123456, some_object);
   
     /* Continue doing other work. */
   
     g_object_unref (some_object);
     g_main_context_unref (thread1_main_context);
     g_thread_unref (thread1);
   
     return 0;
   }


This invocation is uni-directional: it calls ``my_func()`` in ``thread1``, but
there’s no way to return a value to the main thread. To do that, the same
principle needs to be used again, invoking a callback function in the main
thread. It’s a straightforward extension which isn’t covered here.

To maintain thread safety, data which is potentially accessed by multiple
threads must make those accesses mutually exclusive using a mutex. In this
example, two pieces of data are potentially accessed by multiple threads:
``thread1_main_context``, passed to ``thread1_main`` in the ``g_thread_new()``
call; and ``some_object``, a reference to which is passed in the data closure.
Critically, GLib guarantees that ``GMainContext`` is thread safe, so sharing
``thread1_main_context`` between threads is safe. The example assumes that
other code accessing ``some_object`` is thread safe.

Note that ``some_string`` and ``some_int`` cannot be accessed from both threads,
because copies of them are passed to ``thread1``, rather than the originals.
This is a standard technique for making cross-thread calls thread safe without
requiring locking. It also avoids the problem of synchronizing freeing
``some_string``.

Similarly, a reference to ``some_object`` is transferred to ``thread1``, which
works around the issue of synchronizing destruction of the object.

``g_idle_source_new()`` is used rather than the simpler ``g_idle_add()`` so the
``GMainContext`` to attach to can be specified.

Convenience Method: g_main_context_invoke_full()
------------------------------------------------

This is simplified greatly by the convenience method,
``g_main_context_invoke_full()``. It invokes a callback so that the specified
``GMainContext`` is owned during the invocation. Owning a main context is almost
always equivalent to running it, and hence the function is invoked in the thread
for which the specified context is the thread-default.

``g_main_context_invoke()`` can be used instead if the user data does not need
to be freed by a ``GDestroyNotify`` callback after the invocation returns.

Modifying the earlier example, the ``invoke_my_func()`` function can be replaced
by the following:

.. code-block:: c

   static void
   invoke_my_func (GMainContext *thread1_main_context,
                   const gchar  *some_string,
                   guint         some_int,
                   GObject      *some_object)
   {
     MyFuncData *data;
   
     /* Create a data closure to pass all the desired variables
      * between threads. */
     data = g_new0 (MyFuncData, 1);
     data->some_string = g_strdup (some_string);
     data->some_int = some_int;
     data->some_object = g_object_ref (some_object);
   
     /* Invoke the function. */
     g_main_context_invoke_full (thread1_main_context,
                                 G_PRIORITY_DEFAULT, my_func_idle,
                                 data,
                                 (GDestroyNotify) my_func_data_free);
   }


Consider what happens if ``invoke_my_func()`` were called from ``thread1``,
rather than from the main thread. With the original implementation, the idle
source would be added to ``thread1``’s context and dispatched on the context’s
next iteration (assuming no pending dispatches with higher priorities). With the
improved implementation, ``g_main_context_invoke_full()`` will notice that the
specified context is already owned by the thread (or ownership can be acquired
by it), and will call ``my_func_idle()`` directly, rather than attaching a
source to the context and delaying the invocation to the next context iteration.

This subtle behavior difference doesn’t matter in most cases, but is worth
bearing in mind since it can affect blocking behavior (``invoke_my_func()``
would go from taking negligible time, to taking the same amount of time as
``my_func()`` before returning).
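
The difference can be observed with the sketch below, which uses hypothetical
helpers named ``mark_called()`` and ``invoke_runs_immediately()``. When the
calling thread owns the context, the callback has already run by the time
``g_main_context_invoke()`` returns; otherwise it only runs once the context is
iterated.

.. code-block:: c

   #include <glib.h>

   static gboolean
   mark_called (gpointer user_data)
   {
     gboolean *called = user_data;

     *called = TRUE;

     return G_SOURCE_REMOVE;
   }

   /* Hypothetical helper: returns TRUE if the callback had already run
    * when g_main_context_invoke() returned. If @own_context is TRUE, the
    * calling thread owns the context during the invocation. */
   static gboolean
   invoke_runs_immediately (gboolean own_context)
   {
     GMainContext *context = g_main_context_new ();
     gboolean called = FALSE;
     gboolean immediately;

     if (own_context)
       g_main_context_push_thread_default (context);

     g_main_context_invoke (context, mark_called, &called);
     immediately = called;

     /* Dispatch the callback now if it was deferred to an idle source. */
     while (!called)
       g_main_context_iteration (context, TRUE);

     if (own_context)
       g_main_context_pop_thread_default (context);

     g_main_context_unref (context);

     return immediately;
   }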

Checking Threading
------------------

It is useful to document which thread each function should be called in, in the
form of an assertion:

.. code-block:: c

   g_assert (g_main_context_is_owner (expected_main_context));


If that’s put at the top of each function, any assertion failure will highlight
a case where a function has been called from the wrong thread. It is much easier
to write these assertions when initially developing code, rather than debugging
race conditions which can easily result from a function being called in the
wrong thread.

This technique can also be applied to signal emissions and callbacks, improving
type safety as well as asserting the right context is used. Note that signal
emission via ``g_signal_emit()`` is synchronous, and doesn’t involve a main
context at all.

For example, instead of using the following when emitting a signal:

.. code-block:: c

   guint param1;  /* arbitrary example parameters */
   gchar *param2;
   guint retval = 0;

   g_signal_emit_by_name (my_object,
                          "some-signal",
                          param1,
                          param2,
                          &retval);


The following can be used:

.. code-block:: c

   static guint
   emit_some_signal (GObject     *my_object,
                     guint        param1,
                     const gchar *param2)
   {
     guint retval = 0;
   
     g_assert (g_main_context_is_owner (expected_main_context));
   
     g_signal_emit_by_name (my_object, "some-signal",
                            param1, param2, &retval);
   
     return retval;
   }


GTask
-----

``GTask`` provides a slightly different approach to invoking functions in other
threads, which is more suited to the case where a function should be executed in
some background thread, but not a specific one.

``GTask`` takes a data closure and a function to execute, and provides ways to
return the result from this function. It handles everything necessary to run
that function in an arbitrary thread belonging to some thread pool internal to
GLib.
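
As a minimal sketch of this, the code below squares an integer in a thread
from the ``GTask`` thread pool and receives the result in the context which was
the thread-default when the task was created. ``SquareData``,
``square_thread()``, ``square_done_cb()`` and ``square_in_background()`` are
hypothetical names, and the private context and main loop exist only to keep
the sketch self-contained.

.. code-block:: c

   #include <gio/gio.h>

   typedef struct {
     GMainLoop *loop;
     gint result;
   } SquareData;

   /* Runs in an arbitrary thread from the pool internal to GLib. */
   static void
   square_thread (GTask        *task,
                  gpointer      source_object,
                  gpointer      task_data,
                  GCancellable *cancellable)
   {
     gint value = GPOINTER_TO_INT (task_data);

     g_task_return_int (task, value * value);
   }

   /* Dispatched in the context the task was created in. */
   static void
   square_done_cb (GObject      *source_object,
                   GAsyncResult *result,
                   gpointer      user_data)
   {
     SquareData *data = user_data;

     data->result = (gint) g_task_propagate_int (G_TASK (result), NULL);
     g_main_loop_quit (data->loop);
   }

   static gint
   square_in_background (gint value)
   {
     GMainContext *context = g_main_context_new ();
     SquareData data = { NULL, 0 };
     GTask *task;

     g_main_context_push_thread_default (context);
     data.loop = g_main_loop_new (context, FALSE);

     task = g_task_new (NULL, NULL, square_done_cb, &data);
     g_task_set_task_data (task, GINT_TO_POINTER (value), NULL);
     g_task_run_in_thread (task, square_thread);
     g_object_unref (task);

     /* Wait for square_done_cb() to be dispatched in @context. */
     g_main_loop_run (data.loop);

     g_main_loop_unref (data.loop);
     g_main_context_pop_thread_default (context);
     g_main_context_unref (context);

     return data.result;
   }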

By combining ``g_main_context_invoke_full()`` and ``GTask``, it is possible to
run a task in a specific context and effortlessly return its result to the
current context:

.. code-block:: c

   /* This will be invoked in thread1. */
   static gboolean
   my_func_idle (gpointer user_data)
   {
     GTask *task = G_TASK (user_data);
     MyFuncData *data;
     gboolean retval;
   
     /* Call my_func() and propagate its returned boolean to
      * the main thread. This assumes my_func() has been changed
      * to return a gboolean. */
     data = g_task_get_task_data (task);
     retval = my_func (data->some_string, data->some_int,
                       data->some_object);
     g_task_return_boolean (task, retval);
   
     return G_SOURCE_REMOVE;
   }
   
   /* The @callback will be invoked in the thread-default main context
    * of whichever thread this function is called in, once my_func()
    * has finished and returned a result. */
   static void
   invoke_my_func_with_result (GMainContext        *thread1_main_context,
                               const gchar         *some_string,
                               guint                some_int,
                               GObject             *some_object,
                               GAsyncReadyCallback  callback,
                               gpointer             user_data)
   {
     MyFuncData *data;
     GTask *task;
   
     /* Create a data closure to pass all the desired variables
      * between threads. */
     data = g_new0 (MyFuncData, 1);
     data->some_string = g_strdup (some_string);
     data->some_int = some_int;
     data->some_object = g_object_ref (some_object);
   
     /* Create a GTask to handle returning the result to the current
      * thread-default main context. */
     task = g_task_new (NULL, NULL, callback, user_data);
     g_task_set_task_data (task, data,
                           (GDestroyNotify) my_func_data_free);
   
     /* Invoke the function. */
     g_main_context_invoke_full (thread1_main_context,
                                 G_PRIORITY_DEFAULT, my_func_idle,
                                 task,
                                 (GDestroyNotify) g_object_unref);
   }