DAMON-based LRU-lists Sorting

DAMON-based LRU-lists Sorting (DAMON_LRU_SORT) is a static kernel module that aimed to be used for proactive and lightweight data access pattern based (de)prioritization of pages on their LRU-lists for making LRU-lists a more trusworthy data access pattern source.

Where Proactive LRU-lists Sorting is Required?

As page-granularity access checking overhead could be significant on huge systems, LRU lists are normally not proactively sorted but partially and reactively sorted for special events including specific user requests, system calls and memory pressure. As a result, LRU lists are sometimes not so perfectly prepared to be used as a trustworthy access pattern source for some situations including reclamation target pages selection under sudden memory pressure.

Because DAMON can identify access patterns of best-effort accuracy while inducing only user-specified range of overhead, proactively running DAMON_LRU_SORT could be helpful for making LRU lists more trustworthy access pattern source with low and controlled overhead.

How It Works?

DAMON_LRU_SORT finds hot pages (pages of memory regions that showing access rates that higher than a user-specified threshold) and cold pages (pages of memory regions that showing no access for a time that longer than a user-specified threshold) using DAMON, and prioritizes hot pages while deprioritizing cold pages on their LRU-lists. To avoid it consuming too much CPU for the prioritizations, a CPU time usage limit can be configured. Under the limit, it prioritizes and deprioritizes more hot and cold pages first, respectively. System administrators can also configure under what situation this scheme should automatically activated and deactivated with three memory pressure watermarks.

Its default parameters for hotness/coldness thresholds and CPU quota limit are conservatively chosen. That is, the module under its default parameters could be widely used without harm for common situations while providing a level of benefits for systems having clear hot/cold access patterns under memory pressure while consuming only a limited small portion of CPU time.

Interface: Module Parameters

To use this feature, you should first ensure your system is running on a kernel that is built with CONFIG_DAMON_LRU_SORT=y.

To let sysadmins enable or disable it and tune for the given system, DAMON_LRU_SORT utilizes module parameters. That is, you can put damon_lru_sort.<parameter>=<value> on the kernel boot command line or write proper values to /sys/module/damon_lru_sort/parameters/<parameter> files.

Below are the description of each parameter.

enabled

Enable or disable DAMON_LRU_SORT.

You can enable DAMON_LRU_SORT by setting the value of this parameter as Y. Setting it as N disables DAMON_LRU_SORT. Note that DAMON_LRU_SORT could do no real monitoring and LRU-lists sorting due to the watermarks-based activation condition. Refer to below descriptions for the watermarks parameter for this.

commit_inputs

Make DAMON_LRU_SORT reads the input parameters again, except enabled.

Input parameters that updated while DAMON_LRU_SORT is running are not applied by default. Once this parameter is set as Y, DAMON_LRU_SORT reads values of parametrs except enabled again. Once the re-reading is done, this parameter is set as N. If invalid parameters are found while the re-reading, DAMON_LRU_SORT will be disabled.

hot_thres_access_freq

Access frequency threshold for hot memory regions identification in permil.

If a memory region is accessed in frequency of this or higher, DAMON_LRU_SORT identifies the region as hot, and mark it as accessed on the LRU list, so that it could not be reclaimed under memory pressure. 50% by default.

cold_min_age

Time threshold for cold memory regions identification in microseconds.

If a memory region is not accessed for this or longer time, DAMON_LRU_SORT identifies the region as cold, and mark it as unaccessed on the LRU list, so that it could be reclaimed first under memory pressure. 120 seconds by default.

quota_ms

Limit of time for trying the LRU lists sorting in milliseconds.

DAMON_LRU_SORT tries to use only up to this time within a time window (quota_reset_interval_ms) for trying LRU lists sorting. This can be used for limiting CPU consumption of DAMON_LRU_SORT. If the value is zero, the limit is disabled.

10 ms by default.

quota_reset_interval_ms

The time quota charge reset interval in milliseconds.

The charge reset interval for the quota of time (quota_ms). That is, DAMON_LRU_SORT does not try LRU-lists sorting for more than quota_ms milliseconds or quota_sz bytes within quota_reset_interval_ms milliseconds.

1 second by default.

wmarks_interval

The watermarks check time interval in microseconds.

Minimal time to wait before checking the watermarks, when DAMON_LRU_SORT is enabled but inactive due to its watermarks rule. 5 seconds by default.

wmarks_high

Free memory rate (per thousand) for the high watermark.

If free memory of the system in bytes per thousand bytes is higher than this, DAMON_LRU_SORT becomes inactive, so it does nothing but periodically checks the watermarks. 200 (20%) by default.

wmarks_mid

Free memory rate (per thousand) for the middle watermark.

If free memory of the system in bytes per thousand bytes is between this and the low watermark, DAMON_LRU_SORT becomes active, so starts the monitoring and the LRU-lists sorting. 150 (15%) by default.

wmarks_low

Free memory rate (per thousand) for the low watermark.

If free memory of the system in bytes per thousand bytes is lower than this, DAMON_LRU_SORT becomes inactive, so it does nothing but periodically checks the watermarks. 50 (5%) by default.

sample_interval

Sampling interval for the monitoring in microseconds.

The sampling interval of DAMON for the cold memory monitoring. Please refer to the DAMON documentation (Detailed Usages) for more detail. 5ms by default.

aggr_interval

Aggregation interval for the monitoring in microseconds.

The aggregation interval of DAMON for the cold memory monitoring. Please refer to the DAMON documentation (Detailed Usages) for more detail. 100ms by default.

min_nr_regions

Minimum number of monitoring regions.

The minimal number of monitoring regions of DAMON for the cold memory monitoring. This can be used to set lower-bound of the monitoring quality. But, setting this too high could result in increased monitoring overhead. Please refer to the DAMON documentation (Detailed Usages) for more detail. 10 by default.

max_nr_regions

Maximum number of monitoring regions.

The maximum number of monitoring regions of DAMON for the cold memory monitoring. This can be used to set upper-bound of the monitoring overhead. However, setting this too low could result in bad monitoring quality. Please refer to the DAMON documentation (Detailed Usages) for more detail. 1000 by defaults.

monitor_region_start

Start of target memory region in physical address.

The start physical address of memory region that DAMON_LRU_SORT will do work against. By default, biggest System RAM is used as the region.

monitor_region_end

End of target memory region in physical address.

The end physical address of memory region that DAMON_LRU_SORT will do work against. By default, biggest System RAM is used as the region.

kdamond_pid

PID of the DAMON thread.

If DAMON_LRU_SORT is enabled, this becomes the PID of the worker thread. Else, -1.

nr_lru_sort_tried_hot_regions

Number of hot memory regions that tried to be LRU-sorted.

bytes_lru_sort_tried_hot_regions

Total bytes of hot memory regions that tried to be LRU-sorted.

nr_lru_sorted_hot_regions

Number of hot memory regions that successfully be LRU-sorted.

bytes_lru_sorted_hot_regions

Total bytes of hot memory regions that successfully be LRU-sorted.

nr_hot_quota_exceeds

Number of times that the time quota limit for hot regions have exceeded.

nr_lru_sort_tried_cold_regions

Number of cold memory regions that tried to be LRU-sorted.

bytes_lru_sort_tried_cold_regions

Total bytes of cold memory regions that tried to be LRU-sorted.

nr_lru_sorted_cold_regions

Number of cold memory regions that successfully be LRU-sorted.

bytes_lru_sorted_cold_regions

Total bytes of cold memory regions that successfully be LRU-sorted.

nr_cold_quota_exceeds

Number of times that the time quota limit for cold regions have exceeded.

Example

Below runtime example commands make DAMON_LRU_SORT to find memory regions having >=50% access frequency and LRU-prioritize while LRU-deprioritizing memory regions that not accessed for 120 seconds. The prioritization and deprioritization is limited to be done using only up to 1% CPU time to avoid DAMON_LRU_SORT consuming too much CPU time for the (de)prioritization. It also asks DAMON_LRU_SORT to do nothing if the system's free memory rate is more than 50%, but start the real works if it becomes lower than 40%. If DAMON_RECLAIM doesn't make progress and therefore the free memory rate becomes lower than 20%, it asks DAMON_LRU_SORT to do nothing again, so that we can fall back to the LRU-list based page granularity reclamation.

# cd /sys/module/damon_lru_sort/parameters
# echo 500 > hot_thres_access_freq
# echo 120000000 > cold_min_age
# echo 10 > quota_ms
# echo 1000 > quota_reset_interval_ms
# echo 500 > wmarks_high
# echo 400 > wmarks_mid
# echo 200 > wmarks_low
# echo Y > enabled