kernel_optimize_test

History

Dave Hansen c4e1be9ec1 mm, sparsemem: break out of loops early There are a number of times that we loop over NR_MEM_SECTIONS, looking for section_present() on each section. But, when we have very large physical address spaces (large MAX_PHYSMEM_BITS), NR_MEM_SECTIONS becomes very large, making the loops quite long. With MAX_PHYSMEM_BITS=46 and a section size of 128MB, the current loops are 512k iterations, which we barely notice on modern hardware. But, raising MAX_PHYSMEM_BITS higher (like we will see on systems that support 5-level paging) makes this 64x longer and we start to notice, especially on slower systems like simulators. A 10-second delay for 512k iterations is annoying. But, a 640- second delay is crippling. This does not help if we have extremely sparse physical address spaces, but those are quite rare. We expect that most of the "slow" systems where this matters will also be quite small and non-sparse. To fix this, we track the highest section we've ever encountered. This lets us know when we will never see another section_present(), and lets us break out of the loops earlier. Doing the whole for_each_present_section_nr() macro is probably overkill, but it will ensure that any future loop iterations that we grow are more likely to be correct. Kirrill said "It shaved almost 40 seconds from boot time in qemu with 5-level paging enabled for me". Link: http://lkml.kernel.org/r/20170504174434.C45A4735@viggo.jf.intel.com Signed-off-by: Dave Hansen <dave.hansen@linux.intel.com> Tested-by: Kirill A. Shutemov <kirill.shutemov@linux.intel.com> Signed-off-by: Andrew Morton <akpm@linux-foundation.org> Signed-off-by: Linus Torvalds <torvalds@linux-foundation.org>		2017-07-06 16:24:31 -07:00
..
acpi	arm64 updates for 4.13:	2017-07-05 17:09:27 -07:00
asm-generic	clocksource/drivers: Rename CLKSRC_OF to TIMER_OF	2017-06-14 12:01:03 +02:00
clocksource
crypto	crypto: engine - replace pr_xxx by dev_xxx	2017-06-19 14:19:54 +08:00
drm
dt-bindings	ARM: 64-bit DT updates	2017-07-04 14:50:59 -07:00
keys
kvm
linux	mm, sparsemem: break out of loops early	2017-07-06 16:24:31 -07:00
math-emu
media
memory
misc
net	Merge branch 'for-4.13' of git://git.kernel.org/pub/scm/linux/kernel/git/tj/percpu	2017-07-06 08:59:41 -07:00
pcmcia
ras	trace, ras: add ARM processor error trace event	2017-06-22 18:22:05 +01:00
rdma	Merge branch 'next' of git://git.kernel.org/pub/scm/linux/kernel/git/jmorris/linux-security	2017-07-05 11:26:35 -07:00
rxrpc
scsi	Merge branch 'for-4.13' of git://git.kernel.org/pub/scm/linux/kernel/git/tj/libata	2017-07-06 09:41:58 -07:00
soc	Merge git://git.kernel.org/pub/scm/linux/kernel/git/davem/net-next	2017-07-05 12:31:59 -07:00
sound
target
trace	Merge branch 'for-4.13' of git://git.kernel.org/pub/scm/linux/kernel/git/tj/percpu	2017-07-06 08:59:41 -07:00
uapi	ocfs2: use magic.h	2017-07-06 16:24:30 -07:00
video
xen