Skip to content
Snippets Groups Projects
  1. Aug 30, 2005
    • Nick Piggin's avatar
      [PATCH] Lazy page table copies in fork() · d992895b
      Nick Piggin authored
      
      Defer copying of ptes until fault time when it is possible to reconstruct
      the pte from backing store. Idea from Andi Kleen and Nick Piggin.
      
      Thanks to input from Rik van Riel and Linus and to Hugh for correcting
      my blundering.
      
      Ray Fucillo <fucillo@intersystems.com> reports:
      
        "I applied this latest patch to a 2.6.12 kernel and found that it does
         resolve the problem.  Prior to the patch on this machine, I was
         seeing about 23ms spent in fork for ever 100MB of shared memory
         segment.
      
         After applying the patch, fork is taking about 1ms regardless of the
         shared memory size."
      
      Signed-off-by: default avatarNick Piggin <npiggin@suse.de>
      Signed-off-by: default avatarLinus Torvalds <torvalds@osdl.org>
      d992895b
  2. Aug 20, 2005
    • Linus Torvalds's avatar
      Fix nasty ncpfs symlink handling bug. · cc314eef
      Linus Torvalds authored
      
      This bug could cause oopses and page state corruption, because ncpfs
      used the generic page-cache symlink handlign functions.  But those
      functions only work if the page cache is guaranteed to be "stable", ie a
      page that was installed when the symlink walk was started has to still
      be installed in the page cache at the end of the walk.
      
      We could have fixed ncpfs to not use the generic helper routines, but it
      is in many ways much cleaner to instead improve on the symlink walking
      helper routines so that they don't require that absolute stability.
      
      We do this by allowing "follow_link()" to return a error-pointer as a
      cookie, which is fed back to the cleanup "put_link()" routine.  This
      also simplifies NFS symlink handling.
      
      Signed-off-by: default avatarLinus Torvalds <torvalds@osdl.org>
      cc314eef
  3. Aug 05, 2005
    • David Gibson's avatar
      [PATCH] Fix hugepage crash on failing mmap() · c7546f8f
      David Gibson authored
      
      This patch fixes a crash in the hugepage code.  unmap_hugepage_area() was
      assuming that (due to prefault) PTEs must exist for all the area in
      question.  However, this may not be the case, if mmap() encounters an error
      before the prefault and calls unmap_region() to clean up any partial
      mapping.
      
      Depending on the hugepage configuration, this crash can be triggered by an
      unpriveleged user.
      
      Signed-off-by: default avatarDavid Gibson <david@gibson.dropbear.id.au>
      Cc: William Lee Irwin III <wli@holomorphy.com>
      Signed-off-by: default avatarAndrew Morton <akpm@osdl.org>
      Signed-off-by: default avatarLinus Torvalds <torvalds@osdl.org>
      c7546f8f
    • Simon Derr's avatar
      [PATCH] __vm_enough_memory() signedness fix · 2f60f8d3
      Simon Derr authored
      
      We have found what seems to be a small bug in __vm_enough_memory() when
      sysctl_overcommit_memory is set to OVERCOMMIT_NEVER.
      
      When this bug occurs the systems fails to boot, with /sbin/init whining
      about fork() returning ENOMEM.
      
      We hunted down the problem to this:
      
      The deferred update mecanism used in vm_acct_memory(), on a SMP system,
      allows the vm_committed_space counter to have a negative value.
      
      This should not be a problem since this counter is known to be inaccurate.
      
      But in __vm_enough_memory() this counter is compared to the `allowed'
      variable, which is an unsigned long.  This comparison is broken since it
      will consider the negative values of vm_committed_space to be huge positive
      values, resulting in a memory allocation failure.
      
      Signed-off-by: default avatar <Jean-Marc.Saffroy@ext.bull.net>
      Signed-off-by: default avatar <Simon.Derr@bull.net>
      Signed-off-by: default avatarAndrew Morton <akpm@osdl.org>
      Signed-off-by: default avatarLinus Torvalds <torvalds@osdl.org>
      2f60f8d3
  4. Aug 04, 2005
  5. Aug 03, 2005
    • Linus Torvalds's avatar
      Fix up recent get_user_pages() handling · a68d2ebc
      Linus Torvalds authored
      
      The VM_FAULT_WRITE thing is an extra bit, not a valid return value, and
      has to be treated as such by get_user_pages().
      
      Signed-off-by: default avatarLinus Torvalds <torvalds@osdl.org>
      a68d2ebc
    • Nick Piggin's avatar
      [PATCH] fix get_user_pages bug · f33ea7f4
      Nick Piggin authored
      
      Checking pte_dirty instead of pte_write in __follow_page is problematic
      for s390, and for copy_one_pte which leaves dirty when clearing write.
      
      So revert __follow_page to check pte_write as before, and make
      do_wp_page pass back a special extra VM_FAULT_WRITE bit to say it has
      done its full job: once get_user_pages receives this value, it no longer
      requires pte_write in __follow_page.
      
      But most callers of handle_mm_fault, in the various architectures, have
      switch statements which do not expect this new case.  To avoid changing
      them all in a hurry, make an inline wrapper function (using the old
      name) that masks off the new bit, and use the extended interface with
      double underscores.
      
      Yes, we do have a call to do_wp_page from do_swap_page, but no need to
      change that: in rare case it's needed, another do_wp_page will follow.
      
      Signed-off-by: default avatarHugh Dickins <hugh@veritas.com>
      [ Cleanups by Nick Piggin ]
      Signed-off-by: default avatarLinus Torvalds <torvalds@osdl.org>
      f33ea7f4
  6. Aug 02, 2005
  7. Aug 01, 2005
    • Linus Torvalds's avatar
      Fix get_user_pages() race for write access · 4ceb5db9
      Linus Torvalds authored
      
      There's no real guarantee that handle_mm_fault() will always be able to
      break a COW situation - if an update from another thread ends up
      modifying the page table some way, handle_mm_fault() may end up
      requiring us to re-try the operation.
      
      That's normally fine, but get_user_pages() ended up re-trying it as a
      read, and thus a write access could in theory end up losing the dirty
      bit or be done on a page that had not been properly COW'ed.
      
      This makes get_user_pages() always retry write accesses as write
      accesses by making "follow_page()" require that a writable follow has
      the dirty bit set.  That simplifies the code and solves the race: if the
      COW break fails for some reason, we'll just loop around and try again.
      
      Signed-off-by: default avatarLinus Torvalds <torvalds@osdl.org>
      4ceb5db9
  8. Jul 30, 2005
  9. Jul 27, 2005
    • Andy Whitcroft's avatar
      [PATCH] Remove bogus warning in page_alloc.c · 12b1c5f3
      Andy Whitcroft authored
      
      Originally __free_pages_bulk used the relative page number within a zone to
      define its buddies.  This meant that to maintain the "maximally aligned"
      requirements (that an allocation of size N will be aligned at least to N
      physically) zones had to also be aligned to 1<<MAX_ORDER pages.  When
      __free_pages_bulk was updated to use the relative page frame numbers of the
      free'd pages to pair buddies this released the alignment constraint on the
      'left' edge of the zone.  This allows _either_ edge of the zone to contain
      partial MAX_ORDER sized buddies.  These simply never will have matching
      buddies and thus will never make it to the 'top' of the pyramid.
      
      The patch below removes a now redundant check ensuring that the mem_map was
      aligned to MAX_ORDER.
      
      Signed-off-by: default avatarAndy Whitcroft <apw@shadowen.org>
      Cc: Christoph Lameter <christoph@lameter.com>
      Signed-off-by: default avatarAndrew Morton <akpm@osdl.org>
      Signed-off-by: default avatarLinus Torvalds <torvalds@osdl.org>
      12b1c5f3
    • suzuki's avatar
      [PATCH] madvise() does not always return -EBADF on non-file mapped area · 165cd402
      suzuki authored
      
      The madvise() system call returns -EBADF for areas which does not map to
      files, only for *behaviour* request MADV_WILLNEED.
      
      According to man pages, madvise returns :
      
      EBADF - the map exists, but the area maps something that isn't a file.
      
      Fixes bug 2995.
      
      Signed-off-by: default avatarSuzuki K P <suzuki@in.ibm.com>
      Signed-off-by: default avatarAndrew Morton <akpm@osdl.org>
      Signed-off-by: default avatarLinus Torvalds <torvalds@osdl.org>
      165cd402
    • Andrew Morton's avatar
      [PATCH] check_user_page_readable() deadlock fix · 1aaf18ff
      Andrew Morton authored
      
      Fix bug identifued by Richard Purdie <rpurdie@rpsys.net>.
      
      oprofile calls check_user_page_readable() from interrupt context, so we
      deadlock over various VFS locks.
      
      But check_user_page_readable() doesn't imply either a read or a write of the
      page's contents.  Change __follow_page() so that check_user_page_readable()
      can tell __follow_page() that we're not accessing the page's contents, and use
      that info to avoid the troublesome lock-takings.
      
      Also, make follow_page() inline for the single callsite in memory.c to save a
      bit of stack space.
      
      Signed-off-by: default avatarAndrew Morton <akpm@osdl.org>
      Signed-off-by: default avatarLinus Torvalds <torvalds@osdl.org>
      1aaf18ff
    • Andi Kleen's avatar
      [PATCH] Undo mempolicy shared policy rbtree microoptimization · 90c5029e
      Andi Kleen authored
      
      All mempolicy changes must be inside the spinlock and readding the rb_erase
      prevents a crash while doing:
      
      > echo "1" > /tmp/numatest
      > numactl --length=0x4000 --shm /tmp/numatest --localalloc
      > numactl --length=0x2000 --offset=0 --shm /tmp/numatest --membind=0
      > numactl --length=0x2000 --offset=0x2000 --shm /tmp/numatest --membind=1
      > ipcs
      > ipcrm -M "the_key_value_of_this_shm_area"
      
      Based on a patch by John Blackwood
      
      Cc: <john.blackwood@ccur.com>
      Cc: <andrea@suse.de>
      Signed-off-by: default avatarAndi Kleen <ak@suse.de>
      Signed-off-by: default avatarAndrew Morton <akpm@osdl.org>
      Signed-off-by: default avatarLinus Torvalds <torvalds@osdl.org>
      90c5029e
  10. Jul 15, 2005
    • Carsten Otte's avatar
      [PATCH] execute-in-place fixes · afa597ba
      Carsten Otte authored
      
      This patch includes feedback from Andrew and Christoph. Thanks for
      taking time to review.
      
      Use of empty_zero_page was eliminated to fix compilation for architectures
      that don't have it.
      
      This patch removes setting pages up-to-date in ext2_get_xip_page and all
      bug checks to verify that the page is indeed up to date.  Setting the page
      state on mapping to userland is bogus.  None of the code patchs involved
      with these pages in mm cares about the page state.
      
      still on my ToDo list: identify a place outside second extended where
      __inode_direct_access should reside
      
      Signed-off-by: default avatarCarsten Otte <cotte@de.ibm.com>
      Signed-off-by: default avatarAndrew Morton <akpm@osdl.org>
      Signed-off-by: default avatarLinus Torvalds <torvalds@osdl.org>
      afa597ba
  11. Jul 12, 2005
  12. Jul 08, 2005
  13. Jul 06, 2005
  14. Jun 29, 2005
  15. Jun 27, 2005
  16. Jun 26, 2005
    • Christoph Lameter's avatar
      [PATCH] Cleanup patch for process freezing · 3e1d1d28
      Christoph Lameter authored
      
      1. Establish a simple API for process freezing defined in linux/include/sched.h:
      
         frozen(process)		Check for frozen process
         freezing(process)		Check if a process is being frozen
         freeze(process)		Tell a process to freeze (go to refrigerator)
         thaw_process(process)	Restart process
         frozen_process(process)	Process is frozen now
      
      2. Remove all references to PF_FREEZE and PF_FROZEN from all
         kernel sources except sched.h
      
      3. Fix numerous locations where try_to_freeze is manually done by a driver
      
      4. Remove the argument that is no longer necessary from two function calls.
      
      5. Some whitespace cleanup
      
      6. Clear potential race in refrigerator (provides an open window of PF_FREEZE
         cleared before setting PF_FROZEN, recalc_sigpending does not check
         PF_FROZEN).
      
      This patch does not address the problem of freeze_processes() violating the rule
      that a task may only modify its own flags by setting PF_FREEZE. This is not clean
      in an SMP environment. freeze(process) is therefore not SMP safe!
      
      Signed-off-by: default avatarChristoph Lameter <christoph@lameter.com>
      Signed-off-by: default avatarLinus Torvalds <torvalds@osdl.org>
      3e1d1d28
  17. Jun 25, 2005
  18. Jun 24, 2005
  19. Jun 23, 2005
Loading