~ [ source navigation ] ~ [ diff markup ] ~ [ identifier search ] ~

TOMOYO Linux Cross Reference
Linux/Documentation/dev-tools/kfence.rst

Version: ~ [ linux-6.12-rc7 ] ~ [ linux-6.11.7 ] ~ [ linux-6.10.14 ] ~ [ linux-6.9.12 ] ~ [ linux-6.8.12 ] ~ [ linux-6.7.12 ] ~ [ linux-6.6.60 ] ~ [ linux-6.5.13 ] ~ [ linux-6.4.16 ] ~ [ linux-6.3.13 ] ~ [ linux-6.2.16 ] ~ [ linux-6.1.116 ] ~ [ linux-6.0.19 ] ~ [ linux-5.19.17 ] ~ [ linux-5.18.19 ] ~ [ linux-5.17.15 ] ~ [ linux-5.16.20 ] ~ [ linux-5.15.171 ] ~ [ linux-5.14.21 ] ~ [ linux-5.13.19 ] ~ [ linux-5.12.19 ] ~ [ linux-5.11.22 ] ~ [ linux-5.10.229 ] ~ [ linux-5.9.16 ] ~ [ linux-5.8.18 ] ~ [ linux-5.7.19 ] ~ [ linux-5.6.19 ] ~ [ linux-5.5.19 ] ~ [ linux-5.4.285 ] ~ [ linux-5.3.18 ] ~ [ linux-5.2.21 ] ~ [ linux-5.1.21 ] ~ [ linux-5.0.21 ] ~ [ linux-4.20.17 ] ~ [ linux-4.19.323 ] ~ [ linux-4.18.20 ] ~ [ linux-4.17.19 ] ~ [ linux-4.16.18 ] ~ [ linux-4.15.18 ] ~ [ linux-4.14.336 ] ~ [ linux-4.13.16 ] ~ [ linux-4.12.14 ] ~ [ linux-4.11.12 ] ~ [ linux-4.10.17 ] ~ [ linux-4.9.337 ] ~ [ linux-4.4.302 ] ~ [ linux-3.10.108 ] ~ [ linux-2.6.32.71 ] ~ [ linux-2.6.0 ] ~ [ linux-2.4.37.11 ] ~ [ unix-v6-master ] ~ [ ccs-tools-1.8.12 ] ~ [ policy-sample ] ~
Architecture: ~ [ i386 ] ~ [ alpha ] ~ [ m68k ] ~ [ mips ] ~ [ ppc ] ~ [ sparc ] ~ [ sparc64 ] ~

Diff markup

Differences between /Documentation/dev-tools/kfence.rst (Architecture sparc) and /Documentation/dev-tools/kfence.rst (Architecture alpha)


  1 .. SPDX-License-Identifier: GPL-2.0                 1 .. SPDX-License-Identifier: GPL-2.0
  2 .. Copyright (C) 2020, Google LLC.                  2 .. Copyright (C) 2020, Google LLC.
  3                                                     3 
  4 Kernel Electric-Fence (KFENCE)                      4 Kernel Electric-Fence (KFENCE)
  5 ==============================                      5 ==============================
  6                                                     6 
  7 Kernel Electric-Fence (KFENCE) is a low-overhe      7 Kernel Electric-Fence (KFENCE) is a low-overhead sampling-based memory safety
  8 error detector. KFENCE detects heap out-of-bou      8 error detector. KFENCE detects heap out-of-bounds access, use-after-free, and
  9 invalid-free errors.                                9 invalid-free errors.
 10                                                    10 
 11 KFENCE is designed to be enabled in production     11 KFENCE is designed to be enabled in production kernels, and has near zero
 12 performance overhead. Compared to KASAN, KFENC     12 performance overhead. Compared to KASAN, KFENCE trades performance for
 13 precision. The main motivation behind KFENCE's     13 precision. The main motivation behind KFENCE's design, is that with enough
 14 total uptime KFENCE will detect bugs in code p     14 total uptime KFENCE will detect bugs in code paths not typically exercised by
 15 non-production test workloads. One way to quic     15 non-production test workloads. One way to quickly achieve a large enough total
 16 uptime is when the tool is deployed across a l     16 uptime is when the tool is deployed across a large fleet of machines.
 17                                                    17 
 18 Usage                                              18 Usage
 19 -----                                              19 -----
 20                                                    20 
 21 To enable KFENCE, configure the kernel with::      21 To enable KFENCE, configure the kernel with::
 22                                                    22 
 23     CONFIG_KFENCE=y                                23     CONFIG_KFENCE=y
 24                                                    24 
 25 To build a kernel with KFENCE support, but dis     25 To build a kernel with KFENCE support, but disabled by default (to enable, set
 26 ``kfence.sample_interval`` to non-zero value),     26 ``kfence.sample_interval`` to non-zero value), configure the kernel with::
 27                                                    27 
 28     CONFIG_KFENCE=y                                28     CONFIG_KFENCE=y
 29     CONFIG_KFENCE_SAMPLE_INTERVAL=0                29     CONFIG_KFENCE_SAMPLE_INTERVAL=0
 30                                                    30 
 31 KFENCE provides several other configuration op     31 KFENCE provides several other configuration options to customize behaviour (see
 32 the respective help text in ``lib/Kconfig.kfen     32 the respective help text in ``lib/Kconfig.kfence`` for more info).
 33                                                    33 
 34 Tuning performance                                 34 Tuning performance
 35 ~~~~~~~~~~~~~~~~~~                                 35 ~~~~~~~~~~~~~~~~~~
 36                                                    36 
 37 The most important parameter is KFENCE's sampl     37 The most important parameter is KFENCE's sample interval, which can be set via
 38 the kernel boot parameter ``kfence.sample_inte     38 the kernel boot parameter ``kfence.sample_interval`` in milliseconds. The
 39 sample interval determines the frequency with      39 sample interval determines the frequency with which heap allocations will be
 40 guarded by KFENCE. The default is configurable     40 guarded by KFENCE. The default is configurable via the Kconfig option
 41 ``CONFIG_KFENCE_SAMPLE_INTERVAL``. Setting ``k     41 ``CONFIG_KFENCE_SAMPLE_INTERVAL``. Setting ``kfence.sample_interval=0``
 42 disables KFENCE.                                   42 disables KFENCE.
 43                                                    43 
 44 The sample interval controls a timer that sets     44 The sample interval controls a timer that sets up KFENCE allocations. By
 45 default, to keep the real sample interval pred     45 default, to keep the real sample interval predictable, the normal timer also
 46 causes CPU wake-ups when the system is complet     46 causes CPU wake-ups when the system is completely idle. This may be undesirable
 47 on power-constrained systems. The boot paramet     47 on power-constrained systems. The boot parameter ``kfence.deferrable=1``
 48 instead switches to a "deferrable" timer which     48 instead switches to a "deferrable" timer which does not force CPU wake-ups on
 49 idle systems, at the risk of unpredictable sam     49 idle systems, at the risk of unpredictable sample intervals. The default is
 50 configurable via the Kconfig option ``CONFIG_K     50 configurable via the Kconfig option ``CONFIG_KFENCE_DEFERRABLE``.
 51                                                    51 
 52 .. warning::                                       52 .. warning::
 53    The KUnit test suite is very likely to fail     53    The KUnit test suite is very likely to fail when using a deferrable timer
 54    since it currently causes very unpredictabl     54    since it currently causes very unpredictable sample intervals.
 55                                                    55 
 56 By default KFENCE will only sample 1 heap allo     56 By default KFENCE will only sample 1 heap allocation within each sample
 57 interval. *Burst mode* allows to sample succes     57 interval. *Burst mode* allows to sample successive heap allocations, where the
 58 kernel boot parameter ``kfence.burst`` can be      58 kernel boot parameter ``kfence.burst`` can be set to a non-zero value which
 59 denotes the *additional* successive allocation     59 denotes the *additional* successive allocations within a sample interval;
 60 setting ``kfence.burst=N`` means that ``1 + N`     60 setting ``kfence.burst=N`` means that ``1 + N`` successive allocations are
 61 attempted through KFENCE for each sample inter     61 attempted through KFENCE for each sample interval.
 62                                                    62 
 63 The KFENCE memory pool is of fixed size, and i     63 The KFENCE memory pool is of fixed size, and if the pool is exhausted, no
 64 further KFENCE allocations occur. With ``CONFI     64 further KFENCE allocations occur. With ``CONFIG_KFENCE_NUM_OBJECTS`` (default
 65 255), the number of available guarded objects      65 255), the number of available guarded objects can be controlled. Each object
 66 requires 2 pages, one for the object itself an     66 requires 2 pages, one for the object itself and the other one used as a guard
 67 page; object pages are interleaved with guard      67 page; object pages are interleaved with guard pages, and every object page is
 68 therefore surrounded by two guard pages.           68 therefore surrounded by two guard pages.
 69                                                    69 
 70 The total memory dedicated to the KFENCE memor     70 The total memory dedicated to the KFENCE memory pool can be computed as::
 71                                                    71 
 72     ( #objects + 1 ) * 2 * PAGE_SIZE               72     ( #objects + 1 ) * 2 * PAGE_SIZE
 73                                                    73 
 74 Using the default config, and assuming a page      74 Using the default config, and assuming a page size of 4 KiB, results in
 75 dedicating 2 MiB to the KFENCE memory pool.        75 dedicating 2 MiB to the KFENCE memory pool.
 76                                                    76 
 77 Note: On architectures that support huge pages     77 Note: On architectures that support huge pages, KFENCE will ensure that the
 78 pool is using pages of size ``PAGE_SIZE``. Thi     78 pool is using pages of size ``PAGE_SIZE``. This will result in additional page
 79 tables being allocated.                            79 tables being allocated.
 80                                                    80 
 81 Error reports                                      81 Error reports
 82 ~~~~~~~~~~~~~                                      82 ~~~~~~~~~~~~~
 83                                                    83 
 84 A typical out-of-bounds access looks like this     84 A typical out-of-bounds access looks like this::
 85                                                    85 
 86     ==========================================     86     ==================================================================
 87     BUG: KFENCE: out-of-bounds read in test_ou     87     BUG: KFENCE: out-of-bounds read in test_out_of_bounds_read+0xa6/0x234
 88                                                    88 
 89     Out-of-bounds read at 0xffff8c3f2e291fff (     89     Out-of-bounds read at 0xffff8c3f2e291fff (1B left of kfence-#72):
 90      test_out_of_bounds_read+0xa6/0x234            90      test_out_of_bounds_read+0xa6/0x234
 91      kunit_try_run_case+0x61/0xa0                  91      kunit_try_run_case+0x61/0xa0
 92      kunit_generic_run_threadfn_adapter+0x16/0     92      kunit_generic_run_threadfn_adapter+0x16/0x30
 93      kthread+0x176/0x1b0                           93      kthread+0x176/0x1b0
 94      ret_from_fork+0x22/0x30                       94      ret_from_fork+0x22/0x30
 95                                                    95 
 96     kfence-#72: 0xffff8c3f2e292000-0xffff8c3f2     96     kfence-#72: 0xffff8c3f2e292000-0xffff8c3f2e29201f, size=32, cache=kmalloc-32
 97                                                    97 
 98     allocated by task 484 on cpu 0 at 32.91933     98     allocated by task 484 on cpu 0 at 32.919330s:
 99      test_alloc+0xfe/0x738                         99      test_alloc+0xfe/0x738
100      test_out_of_bounds_read+0x9b/0x234           100      test_out_of_bounds_read+0x9b/0x234
101      kunit_try_run_case+0x61/0xa0                 101      kunit_try_run_case+0x61/0xa0
102      kunit_generic_run_threadfn_adapter+0x16/0    102      kunit_generic_run_threadfn_adapter+0x16/0x30
103      kthread+0x176/0x1b0                          103      kthread+0x176/0x1b0
104      ret_from_fork+0x22/0x30                      104      ret_from_fork+0x22/0x30
105                                                   105 
106     CPU: 0 PID: 484 Comm: kunit_try_catch Not     106     CPU: 0 PID: 484 Comm: kunit_try_catch Not tainted 5.13.0-rc3+ #7
107     Hardware name: QEMU Standard PC (i440FX +     107     Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.14.0-2 04/01/2014
108     ==========================================    108     ==================================================================
109                                                   109 
110 The header of the report provides a short summ    110 The header of the report provides a short summary of the function involved in
111 the access. It is followed by more detailed in    111 the access. It is followed by more detailed information about the access and
112 its origin. Note that, real kernel addresses a    112 its origin. Note that, real kernel addresses are only shown when using the
113 kernel command line option ``no_hash_pointers`    113 kernel command line option ``no_hash_pointers``.
114                                                   114 
115 Use-after-free accesses are reported as::         115 Use-after-free accesses are reported as::
116                                                   116 
117     ==========================================    117     ==================================================================
118     BUG: KFENCE: use-after-free read in test_u    118     BUG: KFENCE: use-after-free read in test_use_after_free_read+0xb3/0x143
119                                                   119 
120     Use-after-free read at 0xffff8c3f2e2a0000     120     Use-after-free read at 0xffff8c3f2e2a0000 (in kfence-#79):
121      test_use_after_free_read+0xb3/0x143          121      test_use_after_free_read+0xb3/0x143
122      kunit_try_run_case+0x61/0xa0                 122      kunit_try_run_case+0x61/0xa0
123      kunit_generic_run_threadfn_adapter+0x16/0    123      kunit_generic_run_threadfn_adapter+0x16/0x30
124      kthread+0x176/0x1b0                          124      kthread+0x176/0x1b0
125      ret_from_fork+0x22/0x30                      125      ret_from_fork+0x22/0x30
126                                                   126 
127     kfence-#79: 0xffff8c3f2e2a0000-0xffff8c3f2    127     kfence-#79: 0xffff8c3f2e2a0000-0xffff8c3f2e2a001f, size=32, cache=kmalloc-32
128                                                   128 
129     allocated by task 488 on cpu 2 at 33.87132    129     allocated by task 488 on cpu 2 at 33.871326s:
130      test_alloc+0xfe/0x738                        130      test_alloc+0xfe/0x738
131      test_use_after_free_read+0x76/0x143          131      test_use_after_free_read+0x76/0x143
132      kunit_try_run_case+0x61/0xa0                 132      kunit_try_run_case+0x61/0xa0
133      kunit_generic_run_threadfn_adapter+0x16/0    133      kunit_generic_run_threadfn_adapter+0x16/0x30
134      kthread+0x176/0x1b0                          134      kthread+0x176/0x1b0
135      ret_from_fork+0x22/0x30                      135      ret_from_fork+0x22/0x30
136                                                   136 
137     freed by task 488 on cpu 2 at 33.871358s:     137     freed by task 488 on cpu 2 at 33.871358s:
138      test_use_after_free_read+0xa8/0x143          138      test_use_after_free_read+0xa8/0x143
139      kunit_try_run_case+0x61/0xa0                 139      kunit_try_run_case+0x61/0xa0
140      kunit_generic_run_threadfn_adapter+0x16/0    140      kunit_generic_run_threadfn_adapter+0x16/0x30
141      kthread+0x176/0x1b0                          141      kthread+0x176/0x1b0
142      ret_from_fork+0x22/0x30                      142      ret_from_fork+0x22/0x30
143                                                   143 
144     CPU: 2 PID: 488 Comm: kunit_try_catch Tain    144     CPU: 2 PID: 488 Comm: kunit_try_catch Tainted: G    B             5.13.0-rc3+ #7
145     Hardware name: QEMU Standard PC (i440FX +     145     Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.14.0-2 04/01/2014
146     ==========================================    146     ==================================================================
147                                                   147 
148 KFENCE also reports on invalid frees, such as     148 KFENCE also reports on invalid frees, such as double-frees::
149                                                   149 
150     ==========================================    150     ==================================================================
151     BUG: KFENCE: invalid free in test_double_f    151     BUG: KFENCE: invalid free in test_double_free+0xdc/0x171
152                                                   152 
153     Invalid free of 0xffff8c3f2e2a4000 (in kfe    153     Invalid free of 0xffff8c3f2e2a4000 (in kfence-#81):
154      test_double_free+0xdc/0x171                  154      test_double_free+0xdc/0x171
155      kunit_try_run_case+0x61/0xa0                 155      kunit_try_run_case+0x61/0xa0
156      kunit_generic_run_threadfn_adapter+0x16/0    156      kunit_generic_run_threadfn_adapter+0x16/0x30
157      kthread+0x176/0x1b0                          157      kthread+0x176/0x1b0
158      ret_from_fork+0x22/0x30                      158      ret_from_fork+0x22/0x30
159                                                   159 
160     kfence-#81: 0xffff8c3f2e2a4000-0xffff8c3f2    160     kfence-#81: 0xffff8c3f2e2a4000-0xffff8c3f2e2a401f, size=32, cache=kmalloc-32
161                                                   161 
162     allocated by task 490 on cpu 1 at 34.17532    162     allocated by task 490 on cpu 1 at 34.175321s:
163      test_alloc+0xfe/0x738                        163      test_alloc+0xfe/0x738
164      test_double_free+0x76/0x171                  164      test_double_free+0x76/0x171
165      kunit_try_run_case+0x61/0xa0                 165      kunit_try_run_case+0x61/0xa0
166      kunit_generic_run_threadfn_adapter+0x16/0    166      kunit_generic_run_threadfn_adapter+0x16/0x30
167      kthread+0x176/0x1b0                          167      kthread+0x176/0x1b0
168      ret_from_fork+0x22/0x30                      168      ret_from_fork+0x22/0x30
169                                                   169 
170     freed by task 490 on cpu 1 at 34.175348s:     170     freed by task 490 on cpu 1 at 34.175348s:
171      test_double_free+0xa8/0x171                  171      test_double_free+0xa8/0x171
172      kunit_try_run_case+0x61/0xa0                 172      kunit_try_run_case+0x61/0xa0
173      kunit_generic_run_threadfn_adapter+0x16/0    173      kunit_generic_run_threadfn_adapter+0x16/0x30
174      kthread+0x176/0x1b0                          174      kthread+0x176/0x1b0
175      ret_from_fork+0x22/0x30                      175      ret_from_fork+0x22/0x30
176                                                   176 
177     CPU: 1 PID: 490 Comm: kunit_try_catch Tain    177     CPU: 1 PID: 490 Comm: kunit_try_catch Tainted: G    B             5.13.0-rc3+ #7
178     Hardware name: QEMU Standard PC (i440FX +     178     Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.14.0-2 04/01/2014
179     ==========================================    179     ==================================================================
180                                                   180 
181 KFENCE also uses pattern-based redzones on the    181 KFENCE also uses pattern-based redzones on the other side of an object's guard
182 page, to detect out-of-bounds writes on the un    182 page, to detect out-of-bounds writes on the unprotected side of the object.
183 These are reported on frees::                     183 These are reported on frees::
184                                                   184 
185     ==========================================    185     ==================================================================
186     BUG: KFENCE: memory corruption in test_kma    186     BUG: KFENCE: memory corruption in test_kmalloc_aligned_oob_write+0xef/0x184
187                                                   187 
188     Corrupted memory at 0xffff8c3f2e33aff9 [ 0    188     Corrupted memory at 0xffff8c3f2e33aff9 [ 0xac . . . . . . ] (in kfence-#156):
189      test_kmalloc_aligned_oob_write+0xef/0x184    189      test_kmalloc_aligned_oob_write+0xef/0x184
190      kunit_try_run_case+0x61/0xa0                 190      kunit_try_run_case+0x61/0xa0
191      kunit_generic_run_threadfn_adapter+0x16/0    191      kunit_generic_run_threadfn_adapter+0x16/0x30
192      kthread+0x176/0x1b0                          192      kthread+0x176/0x1b0
193      ret_from_fork+0x22/0x30                      193      ret_from_fork+0x22/0x30
194                                                   194 
195     kfence-#156: 0xffff8c3f2e33afb0-0xffff8c3f    195     kfence-#156: 0xffff8c3f2e33afb0-0xffff8c3f2e33aff8, size=73, cache=kmalloc-96
196                                                   196 
197     allocated by task 502 on cpu 7 at 42.15930    197     allocated by task 502 on cpu 7 at 42.159302s:
198      test_alloc+0xfe/0x738                        198      test_alloc+0xfe/0x738
199      test_kmalloc_aligned_oob_write+0x57/0x184    199      test_kmalloc_aligned_oob_write+0x57/0x184
200      kunit_try_run_case+0x61/0xa0                 200      kunit_try_run_case+0x61/0xa0
201      kunit_generic_run_threadfn_adapter+0x16/0    201      kunit_generic_run_threadfn_adapter+0x16/0x30
202      kthread+0x176/0x1b0                          202      kthread+0x176/0x1b0
203      ret_from_fork+0x22/0x30                      203      ret_from_fork+0x22/0x30
204                                                   204 
205     CPU: 7 PID: 502 Comm: kunit_try_catch Tain    205     CPU: 7 PID: 502 Comm: kunit_try_catch Tainted: G    B             5.13.0-rc3+ #7
206     Hardware name: QEMU Standard PC (i440FX +     206     Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.14.0-2 04/01/2014
207     ==========================================    207     ==================================================================
208                                                   208 
209 For such errors, the address where the corrupt    209 For such errors, the address where the corruption occurred as well as the
210 invalidly written bytes (offset from the addre    210 invalidly written bytes (offset from the address) are shown; in this
211 representation, '.' denote untouched bytes. In    211 representation, '.' denote untouched bytes. In the example above ``0xac`` is
212 the value written to the invalid address at of    212 the value written to the invalid address at offset 0, and the remaining '.'
213 denote that no following bytes have been touch    213 denote that no following bytes have been touched. Note that, real values are
214 only shown if the kernel was booted with ``no_    214 only shown if the kernel was booted with ``no_hash_pointers``; to avoid
215 information disclosure otherwise, '!' is used     215 information disclosure otherwise, '!' is used instead to denote invalidly
216 written bytes.                                    216 written bytes.
217                                                   217 
218 And finally, KFENCE may also report on invalid    218 And finally, KFENCE may also report on invalid accesses to any protected page
219 where it was not possible to determine an asso    219 where it was not possible to determine an associated object, e.g. if adjacent
220 object pages had not yet been allocated::         220 object pages had not yet been allocated::
221                                                   221 
222     ==========================================    222     ==================================================================
223     BUG: KFENCE: invalid read in test_invalid_    223     BUG: KFENCE: invalid read in test_invalid_access+0x26/0xe0
224                                                   224 
225     Invalid read at 0xffffffffb670b00a:           225     Invalid read at 0xffffffffb670b00a:
226      test_invalid_access+0x26/0xe0                226      test_invalid_access+0x26/0xe0
227      kunit_try_run_case+0x51/0x85                 227      kunit_try_run_case+0x51/0x85
228      kunit_generic_run_threadfn_adapter+0x16/0    228      kunit_generic_run_threadfn_adapter+0x16/0x30
229      kthread+0x137/0x160                          229      kthread+0x137/0x160
230      ret_from_fork+0x22/0x30                      230      ret_from_fork+0x22/0x30
231                                                   231 
232     CPU: 4 PID: 124 Comm: kunit_try_catch Tain    232     CPU: 4 PID: 124 Comm: kunit_try_catch Tainted: G        W         5.8.0-rc6+ #7
233     Hardware name: QEMU Standard PC (i440FX +     233     Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.13.0-1 04/01/2014
234     ==========================================    234     ==================================================================
235                                                   235 
236 DebugFS interface                                 236 DebugFS interface
237 ~~~~~~~~~~~~~~~~~                                 237 ~~~~~~~~~~~~~~~~~
238                                                   238 
239 Some debugging information is exposed via debu    239 Some debugging information is exposed via debugfs:
240                                                   240 
241 * The file ``/sys/kernel/debug/kfence/stats``     241 * The file ``/sys/kernel/debug/kfence/stats`` provides runtime statistics.
242                                                   242 
243 * The file ``/sys/kernel/debug/kfence/objects`    243 * The file ``/sys/kernel/debug/kfence/objects`` provides a list of objects
244   allocated via KFENCE, including those alread    244   allocated via KFENCE, including those already freed but protected.
245                                                   245 
246 Implementation Details                            246 Implementation Details
247 ----------------------                            247 ----------------------
248                                                   248 
249 Guarded allocations are set up based on the sa    249 Guarded allocations are set up based on the sample interval. After expiration
250 of the sample interval, the next allocation th    250 of the sample interval, the next allocation through the main allocator (SLAB or
251 SLUB) returns a guarded allocation from the KF    251 SLUB) returns a guarded allocation from the KFENCE object pool (allocation
252 sizes up to PAGE_SIZE are supported). At this     252 sizes up to PAGE_SIZE are supported). At this point, the timer is reset, and
253 the next allocation is set up after the expira    253 the next allocation is set up after the expiration of the interval.
254                                                   254 
255 When using ``CONFIG_KFENCE_STATIC_KEYS=y``, KF    255 When using ``CONFIG_KFENCE_STATIC_KEYS=y``, KFENCE allocations are "gated"
256 through the main allocator's fast-path by rely    256 through the main allocator's fast-path by relying on static branches via the
257 static keys infrastructure. The static branch     257 static keys infrastructure. The static branch is toggled to redirect the
258 allocation to KFENCE. Depending on sample inte    258 allocation to KFENCE. Depending on sample interval, target workloads, and
259 system architecture, this may perform better t    259 system architecture, this may perform better than the simple dynamic branch.
260 Careful benchmarking is recommended.              260 Careful benchmarking is recommended.
261                                                   261 
262 KFENCE objects each reside on a dedicated page    262 KFENCE objects each reside on a dedicated page, at either the left or right
263 page boundaries selected at random. The pages     263 page boundaries selected at random. The pages to the left and right of the
264 object page are "guard pages", whose attribute    264 object page are "guard pages", whose attributes are changed to a protected
265 state, and cause page faults on any attempted     265 state, and cause page faults on any attempted access. Such page faults are then
266 intercepted by KFENCE, which handles the fault    266 intercepted by KFENCE, which handles the fault gracefully by reporting an
267 out-of-bounds access, and marking the page as     267 out-of-bounds access, and marking the page as accessible so that the faulting
268 code can (wrongly) continue executing (set ``p    268 code can (wrongly) continue executing (set ``panic_on_warn`` to panic instead).
269                                                   269 
270 To detect out-of-bounds writes to memory withi    270 To detect out-of-bounds writes to memory within the object's page itself,
271 KFENCE also uses pattern-based redzones. For e    271 KFENCE also uses pattern-based redzones. For each object page, a redzone is set
272 up for all non-object memory. For typical alig    272 up for all non-object memory. For typical alignments, the redzone is only
273 required on the unguarded side of an object. B    273 required on the unguarded side of an object. Because KFENCE must honor the
274 cache's requested alignment, special alignment    274 cache's requested alignment, special alignments may result in unprotected gaps
275 on either side of an object, all of which are     275 on either side of an object, all of which are redzoned.
276                                                   276 
277 The following figure illustrates the page layo    277 The following figure illustrates the page layout::
278                                                   278 
279     ---+-----------+-----------+-----------+--    279     ---+-----------+-----------+-----------+-----------+-----------+---
280        | xxxxxxxxx | O :       | xxxxxxxxx |      280        | xxxxxxxxx | O :       | xxxxxxxxx |       : O | xxxxxxxxx |
281        | xxxxxxxxx | B :       | xxxxxxxxx |      281        | xxxxxxxxx | B :       | xxxxxxxxx |       : B | xxxxxxxxx |
282        | x GUARD x | J : RED-  | x GUARD x | R    282        | x GUARD x | J : RED-  | x GUARD x | RED-  : J | x GUARD x |
283        | xxxxxxxxx | E :  ZONE | xxxxxxxxx |      283        | xxxxxxxxx | E :  ZONE | xxxxxxxxx |  ZONE : E | xxxxxxxxx |
284        | xxxxxxxxx | C :       | xxxxxxxxx |      284        | xxxxxxxxx | C :       | xxxxxxxxx |       : C | xxxxxxxxx |
285        | xxxxxxxxx | T :       | xxxxxxxxx |      285        | xxxxxxxxx | T :       | xxxxxxxxx |       : T | xxxxxxxxx |
286     ---+-----------+-----------+-----------+--    286     ---+-----------+-----------+-----------+-----------+-----------+---
287                                                   287 
288 Upon deallocation of a KFENCE object, the obje    288 Upon deallocation of a KFENCE object, the object's page is again protected and
289 the object is marked as freed. Any further acc    289 the object is marked as freed. Any further access to the object causes a fault
290 and KFENCE reports a use-after-free access. Fr    290 and KFENCE reports a use-after-free access. Freed objects are inserted at the
291 tail of KFENCE's freelist, so that the least r    291 tail of KFENCE's freelist, so that the least recently freed objects are reused
292 first, and the chances of detecting use-after-    292 first, and the chances of detecting use-after-frees of recently freed objects
293 is increased.                                     293 is increased.
294                                                   294 
295 If pool utilization reaches 75% (default) or a    295 If pool utilization reaches 75% (default) or above, to reduce the risk of the
296 pool eventually being fully occupied by alloca    296 pool eventually being fully occupied by allocated objects yet ensure diverse
297 coverage of allocations, KFENCE limits current    297 coverage of allocations, KFENCE limits currently covered allocations of the
298 same source from further filling up the pool.     298 same source from further filling up the pool. The "source" of an allocation is
299 based on its partial allocation stack trace. A    299 based on its partial allocation stack trace. A side-effect is that this also
300 limits frequent long-lived allocations (e.g. p    300 limits frequent long-lived allocations (e.g. pagecache) of the same source
301 filling up the pool permanently, which is the     301 filling up the pool permanently, which is the most common risk for the pool
302 becoming full and the sampled allocation rate     302 becoming full and the sampled allocation rate dropping to zero. The threshold
303 at which to start limiting currently covered a    303 at which to start limiting currently covered allocations can be configured via
304 the boot parameter ``kfence.skip_covered_thres    304 the boot parameter ``kfence.skip_covered_thresh`` (pool usage%).
305                                                   305 
306 Interface                                         306 Interface
307 ---------                                         307 ---------
308                                                   308 
309 The following describes the functions which ar    309 The following describes the functions which are used by allocators as well as
310 page handling code to set up and deal with KFE    310 page handling code to set up and deal with KFENCE allocations.
311                                                   311 
312 .. kernel-doc:: include/linux/kfence.h            312 .. kernel-doc:: include/linux/kfence.h
313    :functions: is_kfence_address                  313    :functions: is_kfence_address
314                kfence_shutdown_cache              314                kfence_shutdown_cache
315                kfence_alloc kfence_free __kfen    315                kfence_alloc kfence_free __kfence_free
316                kfence_ksize kfence_object_star    316                kfence_ksize kfence_object_start
317                kfence_handle_page_fault           317                kfence_handle_page_fault
318                                                   318 
319 Related Tools                                     319 Related Tools
320 -------------                                     320 -------------
321                                                   321 
322 In userspace, a similar approach is taken by `    322 In userspace, a similar approach is taken by `GWP-ASan
323 <http://llvm.org/docs/GwpAsan.html>`_. GWP-ASa    323 <http://llvm.org/docs/GwpAsan.html>`_. GWP-ASan also relies on guard pages and
324 a sampling strategy to detect memory unsafety     324 a sampling strategy to detect memory unsafety bugs at scale. KFENCE's design is
325 directly influenced by GWP-ASan, and can be se    325 directly influenced by GWP-ASan, and can be seen as its kernel sibling. Another
326 similar but non-sampling approach, that also i    326 similar but non-sampling approach, that also inspired the name "KFENCE", can be
327 found in the userspace `Electric Fence Malloc     327 found in the userspace `Electric Fence Malloc Debugger
328 <https://linux.die.net/man/3/efence>`_.           328 <https://linux.die.net/man/3/efence>`_.
329                                                   329 
330 In the kernel, several tools exist to debug me    330 In the kernel, several tools exist to debug memory access errors, and in
331 particular KASAN can detect all bug classes th    331 particular KASAN can detect all bug classes that KFENCE can detect. While KASAN
332 is more precise, relying on compiler instrumen    332 is more precise, relying on compiler instrumentation, this comes at a
333 performance cost.                                 333 performance cost.
334                                                   334 
335 It is worth highlighting that KASAN and KFENCE    335 It is worth highlighting that KASAN and KFENCE are complementary, with
336 different target environments. For instance, K    336 different target environments. For instance, KASAN is the better debugging-aid,
337 where test cases or reproducers exists: due to    337 where test cases or reproducers exists: due to the lower chance to detect the
338 error, it would require more effort using KFEN    338 error, it would require more effort using KFENCE to debug. Deployments at scale
339 that cannot afford to enable KASAN, however, w    339 that cannot afford to enable KASAN, however, would benefit from using KFENCE to
340 discover bugs due to code paths not exercised     340 discover bugs due to code paths not exercised by test cases or fuzzers.
                                                      

~ [ source navigation ] ~ [ diff markup ] ~ [ identifier search ] ~

kernel.org | git.kernel.org | LWN.net | Project Home | SVN repository | Mail admin

Linux® is a registered trademark of Linus Torvalds in the United States and other countries.
TOMOYO® is a registered trademark of NTT DATA CORPORATION.

sflogo.php