/home/docs/checkouts/readthedocs.org/user_builds/advanced-micro-devices-composable-kernel/checkouts/3643/include/ck_tile/core/utility/persistent_async_input_scheduler.hpp Source File
persistent_async_input_scheduler.hpp
Scheduler for persistent GEMM kernels with asynchronous input streaming.
Definition: persistent_async_input_scheduler.hpp:27
uint32_t tiles_per_chunk_m
Number of M-dimension tiles grouped into each chunk. Grouping tiles balances synchronization overhead...
Definition: persistent_async_input_scheduler.hpp:31
int32_t tile_idx_pivot_m
Pivot offset for rotating the chunk assignment. Allows shifting which tiles map to which chunks,...
Definition: persistent_async_input_scheduler.hpp:41
uint32_t * chunk_signals
Device pointer to array of signal values (uint32_t), one per chunk. Producer sets signals to coordina...
Definition: persistent_async_input_scheduler.hpp:36
uint32_t num_chunks
Number of signal chunks allocated. Must equal ceil((tiles_m + tile_idx_pivot_m) / tiles_per_chunk_m)....
Definition: persistent_async_input_scheduler.hpp:46
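The sizing rule for num_chunks can be sketched on the host side as follows. This is a minimal illustration, not code from ck_tile: the helper name compute_num_chunks is hypothetical, and tiles_m is assumed to be the total number of tiles along M. It only evaluates the documented expression ceil((tiles_m + tile_idx_pivot_m) / tiles_per_chunk_m) in integer arithmetic.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper (not part of ck_tile): evaluates
// ceil((tiles_m + tile_idx_pivot_m) / tiles_per_chunk_m) without floating point.
// Assumes tiles_m is the total number of M-dimension tiles.
inline uint32_t compute_num_chunks(uint32_t tiles_m,
                                   int32_t tile_idx_pivot_m,
                                   uint32_t tiles_per_chunk_m)
{
    assert(tiles_per_chunk_m > 0);

    // Widen to 64 bits so a negative pivot or a large tile count cannot overflow.
    const int64_t total = static_cast<int64_t>(tiles_m) + tile_idx_pivot_m;
    assert(total >= 0);

    const int64_t per_chunk = static_cast<int64_t>(tiles_per_chunk_m);

    // Integer ceiling division for non-negative operands.
    return static_cast<uint32_t>((total + per_chunk - 1) / per_chunk);
}
```

With such a helper, the caller would allocate that many uint32_t entries for chunk_signals and pass the same value as num_chunks, so the two stay consistent.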