~ [ source navigation ] ~ [ diff markup ] ~ [ identifier search ] ~

TOMOYO Linux Cross Reference
Linux/Documentation/mm/split_page_table_lock.rst

Version: ~ [ linux-6.12-rc7 ] ~ [ linux-6.11.7 ] ~ [ linux-6.10.14 ] ~ [ linux-6.9.12 ] ~ [ linux-6.8.12 ] ~ [ linux-6.7.12 ] ~ [ linux-6.6.60 ] ~ [ linux-6.5.13 ] ~ [ linux-6.4.16 ] ~ [ linux-6.3.13 ] ~ [ linux-6.2.16 ] ~ [ linux-6.1.116 ] ~ [ linux-6.0.19 ] ~ [ linux-5.19.17 ] ~ [ linux-5.18.19 ] ~ [ linux-5.17.15 ] ~ [ linux-5.16.20 ] ~ [ linux-5.15.171 ] ~ [ linux-5.14.21 ] ~ [ linux-5.13.19 ] ~ [ linux-5.12.19 ] ~ [ linux-5.11.22 ] ~ [ linux-5.10.229 ] ~ [ linux-5.9.16 ] ~ [ linux-5.8.18 ] ~ [ linux-5.7.19 ] ~ [ linux-5.6.19 ] ~ [ linux-5.5.19 ] ~ [ linux-5.4.285 ] ~ [ linux-5.3.18 ] ~ [ linux-5.2.21 ] ~ [ linux-5.1.21 ] ~ [ linux-5.0.21 ] ~ [ linux-4.20.17 ] ~ [ linux-4.19.323 ] ~ [ linux-4.18.20 ] ~ [ linux-4.17.19 ] ~ [ linux-4.16.18 ] ~ [ linux-4.15.18 ] ~ [ linux-4.14.336 ] ~ [ linux-4.13.16 ] ~ [ linux-4.12.14 ] ~ [ linux-4.11.12 ] ~ [ linux-4.10.17 ] ~ [ linux-4.9.337 ] ~ [ linux-4.4.302 ] ~ [ linux-3.10.108 ] ~ [ linux-2.6.32.71 ] ~ [ linux-2.6.0 ] ~ [ linux-2.4.37.11 ] ~ [ unix-v6-master ] ~ [ ccs-tools-1.8.12 ] ~ [ policy-sample ] ~
Architecture: ~ [ i386 ] ~ [ alpha ] ~ [ m68k ] ~ [ mips ] ~ [ ppc ] ~ [ sparc ] ~ [ sparc64 ] ~

  1 =====================
  2 Split page table lock
  3 =====================
  4 
  5 Originally, mm->page_table_lock spinlock protected all page tables of the
  6 mm_struct. But this approach leads to poor page fault scalability of
  7 multi-threaded applications due high contention on the lock. To improve
  8 scalability, split page table lock was introduced.
  9 
 10 With split page table lock we have separate per-table lock to serialize
 11 access to the table. At the moment we use split lock for PTE and PMD
 12 tables. Access to higher level tables protected by mm->page_table_lock.
 13 
 14 There are helpers to lock/unlock a table and other accessor functions:
 15 
 16  - pte_offset_map_lock()
 17         maps PTE and takes PTE table lock, returns pointer to PTE with
 18         pointer to its PTE table lock, or returns NULL if no PTE table;
 19  - pte_offset_map_nolock()
 20         maps PTE, returns pointer to PTE with pointer to its PTE table
 21         lock (not taken), or returns NULL if no PTE table;
 22  - pte_offset_map()
 23         maps PTE, returns pointer to PTE, or returns NULL if no PTE table;
 24  - pte_unmap()
 25         unmaps PTE table;
 26  - pte_unmap_unlock()
 27         unlocks and unmaps PTE table;
 28  - pte_alloc_map_lock()
 29         allocates PTE table if needed and takes its lock, returns pointer to
 30         PTE with pointer to its lock, or returns NULL if allocation failed;
 31  - pmd_lock()
 32         takes PMD table lock, returns pointer to taken lock;
 33  - pmd_lockptr()
 34         returns pointer to PMD table lock;
 35 
 36 Split page table lock for PTE tables is enabled compile-time if
 37 CONFIG_SPLIT_PTLOCK_CPUS (usually 4) is less or equal to NR_CPUS.
 38 If split lock is disabled, all tables are guarded by mm->page_table_lock.
 39 
 40 Split page table lock for PMD tables is enabled, if it's enabled for PTE
 41 tables and the architecture supports it (see below).
 42 
 43 Hugetlb and split page table lock
 44 =================================
 45 
 46 Hugetlb can support several page sizes. We use split lock only for PMD
 47 level, but not for PUD.
 48 
 49 Hugetlb-specific helpers:
 50 
 51  - huge_pte_lock()
 52         takes pmd split lock for PMD_SIZE page, mm->page_table_lock
 53         otherwise;
 54  - huge_pte_lockptr()
 55         returns pointer to table lock;
 56 
 57 Support of split page table lock by an architecture
 58 ===================================================
 59 
 60 There's no need in special enabling of PTE split page table lock: everything
 61 required is done by pagetable_pte_ctor() and pagetable_pte_dtor(), which
 62 must be called on PTE table allocation / freeing.
 63 
 64 Make sure the architecture doesn't use slab allocator for page table
 65 allocation: slab uses page->slab_cache for its pages.
 66 This field shares storage with page->ptl.
 67 
 68 PMD split lock only makes sense if you have more than two page table
 69 levels.
 70 
 71 PMD split lock enabling requires pagetable_pmd_ctor() call on PMD table
 72 allocation and pagetable_pmd_dtor() on freeing.
 73 
 74 Allocation usually happens in pmd_alloc_one(), freeing in pmd_free() and
 75 pmd_free_tlb(), but make sure you cover all PMD table allocation / freeing
 76 paths: i.e X86_PAE preallocate few PMDs on pgd_alloc().
 77 
 78 With everything in place you can set CONFIG_ARCH_ENABLE_SPLIT_PMD_PTLOCK.
 79 
 80 NOTE: pagetable_pte_ctor() and pagetable_pmd_ctor() can fail -- it must
 81 be handled properly.
 82 
 83 page->ptl
 84 =========
 85 
 86 page->ptl is used to access split page table lock, where 'page' is struct
 87 page of page containing the table. It shares storage with page->private
 88 (and few other fields in union).
 89 
 90 To avoid increasing size of struct page and have best performance, we use a
 91 trick:
 92 
 93  - if spinlock_t fits into long, we use page->ptr as spinlock, so we
 94    can avoid indirect access and save a cache line.
 95  - if size of spinlock_t is bigger then size of long, we use page->ptl as
 96    pointer to spinlock_t and allocate it dynamically. This allows to use
 97    split lock with enabled DEBUG_SPINLOCK or DEBUG_LOCK_ALLOC, but costs
 98    one more cache line for indirect access;
 99 
100 The spinlock_t allocated in pagetable_pte_ctor() for PTE table and in
101 pagetable_pmd_ctor() for PMD table.
102 
103 Please, never access page->ptl directly -- use appropriate helper.

~ [ source navigation ] ~ [ diff markup ] ~ [ identifier search ] ~

kernel.org | git.kernel.org | LWN.net | Project Home | SVN repository | Mail admin

Linux® is a registered trademark of Linus Torvalds in the United States and other countries.
TOMOYO® is a registered trademark of NTT DATA CORPORATION.

sflogo.php