~ [ source navigation ] ~ [ diff markup ] ~ [ identifier search ] ~

TOMOYO Linux Cross Reference
Linux/Documentation/driver-api/cxl/access-coordinates.rst

Version: ~ [ linux-6.12-rc7 ] ~ [ linux-6.11.7 ] ~ [ linux-6.10.14 ] ~ [ linux-6.9.12 ] ~ [ linux-6.8.12 ] ~ [ linux-6.7.12 ] ~ [ linux-6.6.60 ] ~ [ linux-6.5.13 ] ~ [ linux-6.4.16 ] ~ [ linux-6.3.13 ] ~ [ linux-6.2.16 ] ~ [ linux-6.1.116 ] ~ [ linux-6.0.19 ] ~ [ linux-5.19.17 ] ~ [ linux-5.18.19 ] ~ [ linux-5.17.15 ] ~ [ linux-5.16.20 ] ~ [ linux-5.15.171 ] ~ [ linux-5.14.21 ] ~ [ linux-5.13.19 ] ~ [ linux-5.12.19 ] ~ [ linux-5.11.22 ] ~ [ linux-5.10.229 ] ~ [ linux-5.9.16 ] ~ [ linux-5.8.18 ] ~ [ linux-5.7.19 ] ~ [ linux-5.6.19 ] ~ [ linux-5.5.19 ] ~ [ linux-5.4.285 ] ~ [ linux-5.3.18 ] ~ [ linux-5.2.21 ] ~ [ linux-5.1.21 ] ~ [ linux-5.0.21 ] ~ [ linux-4.20.17 ] ~ [ linux-4.19.323 ] ~ [ linux-4.18.20 ] ~ [ linux-4.17.19 ] ~ [ linux-4.16.18 ] ~ [ linux-4.15.18 ] ~ [ linux-4.14.336 ] ~ [ linux-4.13.16 ] ~ [ linux-4.12.14 ] ~ [ linux-4.11.12 ] ~ [ linux-4.10.17 ] ~ [ linux-4.9.337 ] ~ [ linux-4.4.302 ] ~ [ linux-3.10.108 ] ~ [ linux-2.6.32.71 ] ~ [ linux-2.6.0 ] ~ [ linux-2.4.37.11 ] ~ [ unix-v6-master ] ~ [ ccs-tools-1.8.12 ] ~ [ policy-sample ] ~
Architecture: ~ [ i386 ] ~ [ alpha ] ~ [ m68k ] ~ [ mips ] ~ [ ppc ] ~ [ sparc ] ~ [ sparc64 ] ~

Diff markup

Differences between /Documentation/driver-api/cxl/access-coordinates.rst (Architecture sparc) and /Documentation/driver-api/cxl/access-coordinates.rst (Architecture i386)


  1 .. SPDX-License-Identifier: GPL-2.0                 1 .. SPDX-License-Identifier: GPL-2.0
  2 .. include:: <isonum.txt>                           2 .. include:: <isonum.txt>
  3                                                     3 
  4 ==================================                  4 ==================================
  5 CXL Access Coordinates Computation                  5 CXL Access Coordinates Computation
  6 ==================================                  6 ==================================
  7                                                     7 
  8 Shared Upstream Link Calculation                    8 Shared Upstream Link Calculation
  9 ================================                    9 ================================
 10 For certain CXL region construction with endpo     10 For certain CXL region construction with endpoints behind CXL switches (SW) or
 11 Root Ports (RP), there is the possibility of t     11 Root Ports (RP), there is the possibility of the total bandwidth for all
 12 the endpoints behind a switch being more than      12 the endpoints behind a switch being more than the switch upstream link.
 13 A similar situation can occur within the host,     13 A similar situation can occur within the host, upstream of the root ports.
 14 The CXL driver performs an additional pass aft     14 The CXL driver performs an additional pass after all the targets have
 15 arrived for a region in order to recalculate t     15 arrived for a region in order to recalculate the bandwidths with possible
 16 upstream link being a limiting factor in mind.     16 upstream link being a limiting factor in mind.
 17                                                    17 
 18 The algorithm assumes the configuration is a s     18 The algorithm assumes the configuration is a symmetric topology as that
 19 maximizes performance. When asymmetric topolog     19 maximizes performance. When asymmetric topology is detected, the calculation
 20 is aborted. An asymmetric topology is detected     20 is aborted. An asymmetric topology is detected during topology walk where the
 21 number of RPs detected as a grandparent is not     21 number of RPs detected as a grandparent is not equal to the number of devices
 22 iterated in the same iteration loop. The assum     22 iterated in the same iteration loop. The assumption is made that subtle
 23 asymmetry in properties does not happen and al     23 asymmetry in properties does not happen and all paths to EPs are equal.
 24                                                    24 
 25 There can be multiple switches under an RP. Th     25 There can be multiple switches under an RP. There can be multiple RPs under
 26 a CXL Host Bridge (HB). There can be multiple      26 a CXL Host Bridge (HB). There can be multiple HBs under a CXL Fixed Memory
 27 Window Structure (CFMWS).                          27 Window Structure (CFMWS).
 28                                                    28 
 29 An example hierarchy:                              29 An example hierarchy:
 30                                                    30 
 31 >                CFMWS 0                           31 >                CFMWS 0
 32 >                  |                               32 >                  |
 33 >         _________|_________                      33 >         _________|_________
 34 >        |                   |                     34 >        |                   |
 35 >    ACPI0017-0          ACPI0017-1                35 >    ACPI0017-0          ACPI0017-1
 36 > GP0/HB0/ACPI0016-0   GP1/HB1/ACPI0016-1          36 > GP0/HB0/ACPI0016-0   GP1/HB1/ACPI0016-1
 37 >    |          |        |           |             37 >    |          |        |           |
 38 >   RP0        RP1      RP2         RP3            38 >   RP0        RP1      RP2         RP3
 39 >    |          |        |           |             39 >    |          |        |           |
 40 >  SW 0       SW 1     SW 2        SW 3            40 >  SW 0       SW 1     SW 2        SW 3
 41 >  |   |      |   |    |   |       |   |           41 >  |   |      |   |    |   |       |   |
 42 > EP0 EP1    EP2 EP3  EP4  EP5    EP6 EP7          42 > EP0 EP1    EP2 EP3  EP4  EP5    EP6 EP7
 43                                                    43 
 44 Computation for the example hierarchy:             44 Computation for the example hierarchy:
 45                                                    45 
 46 Min (GP0 to CPU BW,                                46 Min (GP0 to CPU BW,
 47      Min(SW 0 Upstream Link to RP0 BW,             47      Min(SW 0 Upstream Link to RP0 BW,
 48          Min(SW0SSLBIS for SW0DSP0 (EP0), EP0      48          Min(SW0SSLBIS for SW0DSP0 (EP0), EP0 DSLBIS, EP0 Upstream Link) +
 49          Min(SW0SSLBIS for SW0DSP1 (EP1), EP1      49          Min(SW0SSLBIS for SW0DSP1 (EP1), EP1 DSLBIS, EP1 Upstream link)) +
 50      Min(SW 1 Upstream Link to RP1 BW,             50      Min(SW 1 Upstream Link to RP1 BW,
 51          Min(SW1SSLBIS for SW1DSP0 (EP2), EP2      51          Min(SW1SSLBIS for SW1DSP0 (EP2), EP2 DSLBIS, EP2 Upstream Link) +
 52          Min(SW1SSLBIS for SW1DSP1 (EP3), EP3      52          Min(SW1SSLBIS for SW1DSP1 (EP3), EP3 DSLBIS, EP3 Upstream link))) +
 53 Min (GP1 to CPU BW,                                53 Min (GP1 to CPU BW,
 54      Min(SW 2 Upstream Link to RP2 BW,             54      Min(SW 2 Upstream Link to RP2 BW,
 55          Min(SW2SSLBIS for SW2DSP0 (EP4), EP4      55          Min(SW2SSLBIS for SW2DSP0 (EP4), EP4 DSLBIS, EP4 Upstream Link) +
 56          Min(SW2SSLBIS for SW2DSP1 (EP5), EP5      56          Min(SW2SSLBIS for SW2DSP1 (EP5), EP5 DSLBIS, EP5 Upstream link)) +
 57      Min(SW 3 Upstream Link to RP3 BW,             57      Min(SW 3 Upstream Link to RP3 BW,
 58          Min(SW3SSLBIS for SW3DSP0 (EP6), EP6      58          Min(SW3SSLBIS for SW3DSP0 (EP6), EP6 DSLBIS, EP6 Upstream Link) +
 59          Min(SW3SSLBIS for SW3DSP1 (EP7), EP7      59          Min(SW3SSLBIS for SW3DSP1 (EP7), EP7 DSLBIS, EP7 Upstream link))))
 60                                                    60 
 61 The calculation starts at cxl_region_shared_up     61 The calculation starts at cxl_region_shared_upstream_perf_update(). A xarray
 62 is created to collect all the endpoint bandwid     62 is created to collect all the endpoint bandwidths via the
 63 cxl_endpoint_gather_bandwidth() function. The      63 cxl_endpoint_gather_bandwidth() function. The min() of bandwidth from the
 64 endpoint CDAT and the upstream link bandwidth      64 endpoint CDAT and the upstream link bandwidth is calculated. If the endpoint
 65 has a CXL switch as a parent, then min() of ca     65 has a CXL switch as a parent, then min() of calculated bandwidth and the
 66 bandwidth from the SSLBIS for the switch downs     66 bandwidth from the SSLBIS for the switch downstream port that is associated
 67 with the endpoint is calculated. The final ban     67 with the endpoint is calculated. The final bandwidth is stored in a
 68 'struct cxl_perf_ctx' in the xarray indexed by     68 'struct cxl_perf_ctx' in the xarray indexed by a device pointer. If the
 69 endpoint is direct attached to a root port (RP     69 endpoint is direct attached to a root port (RP), the device pointer would be an
 70 RP device. If the endpoint is behind a switch,     70 RP device. If the endpoint is behind a switch, the device pointer would be the
 71 upstream device of the parent switch.              71 upstream device of the parent switch.
 72                                                    72 
 73 At the next stage, the code walks through one      73 At the next stage, the code walks through one or more switches if they exist
 74 in the topology. For endpoints directly attach     74 in the topology. For endpoints directly attached to RPs, this step is skipped.
 75 If there is another switch upstream, the code      75 If there is another switch upstream, the code takes the min() of the current
 76 gathered bandwidth and the upstream link bandw     76 gathered bandwidth and the upstream link bandwidth. If there's a switch
 77 upstream, then the SSLBIS of the upstream swit     77 upstream, then the SSLBIS of the upstream switch.
 78                                                    78 
 79 Once the topology walk reaches the RP, whether     79 Once the topology walk reaches the RP, whether it's direct attached endpoints
 80 or walking through the switch(es), cxl_rp_gath     80 or walking through the switch(es), cxl_rp_gather_bandwidth() is called. At
 81 this point all the bandwidths are aggregated p     81 this point all the bandwidths are aggregated per each host bridge, which is
 82 also the index for the resulting xarray.           82 also the index for the resulting xarray.
 83                                                    83 
 84 The next step is to take the min() of the per      84 The next step is to take the min() of the per host bridge bandwidth and the
 85 bandwidth from the Generic Port (GP). The band     85 bandwidth from the Generic Port (GP). The bandwidths for the GP is retrieved
 86 via ACPI tables SRAT/HMAT. The min bandwidth a     86 via ACPI tables SRAT/HMAT. The min bandwidth are aggregated under the same
 87 ACPI0017 device to form a new xarray.              87 ACPI0017 device to form a new xarray.
 88                                                    88 
 89 Finally, the cxl_region_update_bandwidth() is      89 Finally, the cxl_region_update_bandwidth() is called and the aggregated
 90 bandwidth from all the members of the last xar     90 bandwidth from all the members of the last xarray is updated for the
 91 access coordinates residing in the cxl region      91 access coordinates residing in the cxl region (cxlr) context.
                                                      

~ [ source navigation ] ~ [ diff markup ] ~ [ identifier search ] ~

kernel.org | git.kernel.org | LWN.net | Project Home | SVN repository | Mail admin

Linux® is a registered trademark of Linus Torvalds in the United States and other countries.
TOMOYO® is a registered trademark of NTT DATA CORPORATION.

sflogo.php