Update to jemalloc 4.0.3 #14

fbernier · 2015-10-21T17:50:58Z

As discussed in rust-lang/rust#29196. I guess you should just push my branch upstream instead of merging the pull request into dev.

This resolves jemalloc#150.

The size of the source allocation is known at this point, so reading the chunk header can be avoided for the small size class fast path. This is not very useful right now, but it provides a significant performance boost with an alternate ralloc entry point taking the old size.

use sized deallocation internally for ralloc

Fix variable declaration with no type in the configure script.

This cleans up the fast path a bit more by moving away more code.

* use sized deallocation in iralloct_realign * iralloc and ixalloc always need the old size, so pass it in from the caller where it's often already calculated

It has no use for the arena_t since unlike rallocx it never makes a new memory allocation. It's just an unused parameter in ixalloc_helper.

Unlike the preceeding attempted fix, this version avoids the potential for converting an invalid bin index to a size class.

It is possible for the thread's tdata to be NULL late during thread destruction, so take care not to dereference a NULL pointer in such cases.

Fix quarantine to actually update tsd when expanding, and to avoid double initialization (leaking the first quarantine) due to recursive initialization. This resolves jemalloc#161.

Reported by Denis Denisov.

Reported by Guilherme Gonçalves. This resolves jemalloc#166.

This provides in-place expansion of huge allocations when the end of the allocation is at the end of the sbrk heap. There's already the ability to extend in-place via recycled chunks but this handles the initial growth of the heap via repeated vector / string reallocations. A possible future extension could allow realloc to go from the following: | huge allocation | recycled chunks | ^ dss_end To a larger allocation built from recycled *and* new chunks: | huge allocation | ^ dss_end Doing that would involve teaching the chunk recycling code to request new chunks to satisfy the request. The chunk_dss code wouldn't require any further changes. #include <stdlib.h> int main(void) { size_t chunk = 4 * 1024 * 1024; void *ptr = NULL; for (size_t size = chunk; size < chunk * 128; size *= 2) { ptr = realloc(ptr, size); if (!ptr) return 1; } } dss:secondary: 0.083s dss:primary: 0.083s After: dss:secondary: 0.083s dss:primary: 0.003s The dss heap grows in the upwards direction, so the oldest chunks are at the low addresses and they are used first. Linux prefers to grow the mmap heap downwards, so the trick will not work in the *current* mmap chunk allocator as a huge allocation will only be at the top of the heap in a contrived case.

Fix OOM cleanup in huge_palloc() to call idalloct() rather than base_node_dalloc(). This bug is a result of incomplete refactoring, and has no impact other than leaking memory during OOM.

This eliminates the malloc tunables as tools for an attacker. Closes jemalloc#173

In addition to true/false, opt.junk can now be either "alloc" or "free", giving applications the possibility of junking memory only on allocation or deallocation. This resolves jemalloc#172.

Currently pprof will print output for all threads if a single thread is not specified, but this doesn't play well with many output formats (e.g., any of the dot-based formats). Instead, default to printing just the overall profile when no specific thread is requested. This resolves jemalloc#157.

This resolves jemalloc#274.

This resolves jemalloc#257.

Systems that do not support chunk split/merge cannot shrink/grow huge allocations in place.

Fix ixallocx_prof_sample() to never modify nor create sampled small allocations. xallocx() is in general incapable of moving small allocations, so this fix removes buggy code without loss of generality.

Fix irallocx_prof_sample() to always allocate large regions, even when alignment is non-zero.

Simplify imallocx_prof_sample() to always operate on usize rather than sometimes using size. This avoids redundant usize computations and more closely fits the style adopted by i[rx]allocx_prof_sample() to fix sampling bugs.

Fix prof_alloc_rollback() to read tdata from thread-specific data rather than dereferencing a potentially invalid tctx.

Run integration tests with MALLOC_CONF="prof:true,prof_active:false" in addition to MALLOC_CONF="prof:true".

Fix prof_tctx_dump_iter() to filter out nodes that were created after heap profile dumping started. Prior to this fix, spurious entries with arbitrary object/byte counts could appear in heap profiles, which resulted in jeprof inaccuracies or failures.

Zero all trailing bytes of large allocations when --enable-cache-oblivious configure option is enabled. This regression was introduced by 8a03cf0 (Implement cache index randomization for large allocations.). Zero trailing bytes of huge allocations when resizing from/to a size class that is not a multiple of the chunk size.

Make mallocx() OOM testing work correctly even on systems that can allocate the majority of virtual address space in a single contiguous region.

Work around a potentially bad thread-specific data initialization interaction with NPTL (glibc's pthreads implementation). This resolves jemalloc#283.

In addition to depending on map coalescing, the test depended on munmap() being disabled so that chunk recycling would always succeed.

rust-highfive · 2015-10-21T17:51:12Z

Thanks for the pull request, and welcome! The Rust team is excited to review your changes, and you should hear from @alexcrichton (or someone else) soon.

If any changes to this PR are deemed necessary, please add them as extra commits. This ensures that the reviewer can see what has changed since they last reviewed the code. The way Github handles out-of-date commits, this should also make it reasonably obvious what issues have or haven't been addressed. Large or tricky changes may require several passes of review and changes.

Please see the contribution instructions for more information.

rust-highfive · 2015-10-21T17:51:13Z

Warning

Pull requests are usually filed against the master branch for this repo, but this one is against dev. Please double check that you specified the right target!

alexcrichton · 2015-10-21T19:58:16Z

Thanks! Merged as https://github.com/rust-lang/jemalloc/tree/rust-2015-10-21

Jason Evans and others added 30 commits October 16, 2014 12:33

Initialize chunks_mtx for all configurations.

c83bccd

This resolves jemalloc#150.

Merge pull request jemalloc#151 from thestinger/ralloc

8f47e3d

use sized deallocation internally for ralloc

Fix variable declaration with no type in the configure script.

79725aa

Merge pull request jemalloc#154 from guilherme-pg/implicit-int

1f725eb

Fix variable declaration with no type in the configure script.

Use JEMALLOC_INLINE_C everywhere it's appropriate.

af1f592

Fix prof_{enter,leave}() calls to pass tdata_self.

c93ed81

mark huge allocations as unlikely

809b0ac

This cleans up the fast path a bit more by moving away more code.

avoid redundant chunk header reads

d33f834

* use sized deallocation in iralloct_realign * iralloc and ixalloc always need the old size, so pass it in from the caller where it's often already calculated

Miscellaneous cleanups.

cfc5706

rm unused arena wrangling from xallocx

dc65213

It has no use for the arena_t since unlike rallocx it never makes a new memory allocation. It's just an unused parameter in ixalloc_helper.

Fix arena_sdalloc() to use promoted size.

6da2e9d

Fix arena_sdalloc() to use promoted size (second attempt).

d7a9bab

Unlike the preceeding attempted fix, this version avoids the potential for converting an invalid bin index to a size class.

Don't dereference NULL tdata in prof_{enter,leave}().

82cb603

It is possible for the thread's tdata to be NULL late during thread destruction, so take care not to dereference a NULL pointer in such cases.

Disable arena_dirty_count() validation.

2b2f6dc

Fix two quarantine regressions.

c002a5c

Fix quarantine to actually update tsd when expanding, and to avoid double initialization (leaking the first quarantine) due to recursive initialization. This resolves jemalloc#161.

Make quarantine_init() static.

9cf2be0

Fix pointer arithmetic undefined behavior.

2012d5a

Reported by Denis Denisov.

Fix more pointer arithmetic undefined behavior.

d49cb68

Reported by Guilherme Gonçalves. This resolves jemalloc#166.

Remove extra definition of je_tsd_boot on win32.

a213602

Fix test_stats_arenas_bins for 32-bit builds.

f79e01f

Fix OOM cleanup in huge_palloc().

1036ddb

Fix OOM cleanup in huge_palloc() to call idalloct() rather than base_node_dalloc(). This bug is a result of incomplete refactoring, and has no impact other than leaking memory during OOM.

Style fixes.

a18c2b1

Add a C11 atomics-based implementation of atomic.h API.

59cd80e

Style and spelling fixes.

e12eaf9

Ignore MALLOC_CONF in set{uid,gid,cap} binaries.

b74041f

This eliminates the malloc tunables as tools for an attacker. Closes jemalloc#173

Introduce two new modes of junk filling: "alloc" and "free".

2c5cb61

In addition to true/false, opt.junk can now be either "alloc" or "free", giving applications the possibility of junking memory only on allocation or deallocation. This resolves jemalloc#172.

Move variable declaration to the top its block for MSVC compatibility.

9c6a8d3

Dmitry-Me and others added 25 commits September 15, 2015 11:19

Reduce variable scope.

78ae1ac

This resolves jemalloc#274.

Add more xallocx() overflow tests.

aca490f

Centralize xallocx() size[+extra] overflow checks.

9a505b7

Link test to librt if it contains clock_gettime(2).

345c1b0

This resolves jemalloc#257.

Loosen expected xallocx() results.

65b940a

Systems that do not support chunk split/merge cannot shrink/grow huge allocations in place.

Update ChangeLog for 4.0.1.

1d7540c

Merge branch 'dev'

9898051

Fix ixallocx_prof_sample().

38e2c8f

Fix ixallocx_prof_sample() to never modify nor create sampled small allocations. xallocx() is in general incapable of moving small allocations, so this fix removes buggy code without loss of generality.

Fix irallocx_prof_sample().

4be9c79

Fix irallocx_prof_sample() to always allocate large regions, even when alignment is non-zero.

Simplify imallocx_prof_sample().

3263be6

Simplify imallocx_prof_sample() to always operate on usize rather than sometimes using size. This avoids redundant usize computations and more closely fits the style adopted by i[rx]allocx_prof_sample() to fix sampling bugs.

Fix prof_alloc_rollback().

3ca0cf6

Fix prof_alloc_rollback() to read tdata from thread-specific data rather than dereferencing a potentially invalid tctx.

Expand check_integration_prof testing.

4d0e162

Run integration tests with MALLOC_CONF="prof:true,prof_active:false" in addition to MALLOC_CONF="prof:true".

Add mallocx() OOM tests.

2152329

Make arena_dalloc_large_locked_impl() static.

e56b24e

Fix tsd_boot1() to use explicit 'void' parameter list.

66814c1

Update ChangeLog for 4.0.2.

b8e966f

Merge branch 'dev'

486d249

Make mallocx() OOM test more robust.

03eb37e

Make mallocx() OOM testing work correctly even on systems that can allocate the majority of virtual address space in a single contiguous region.

Work around an NPTL-specific TSD issue.

d36c7eb

Work around a potentially bad thread-specific data initialization interaction with NPTL (glibc's pthreads implementation). This resolves jemalloc#283.

Remove fragile xallocx() test case.

044047f

In addition to depending on map coalescing, the test depended on munmap() being disabled so that chunk recycling would always succeed.

Update ChangeLog for 4.0.3.

0270968

Merge branch 'dev'

e9192ea

Check-in the generated configure script

0242aee

rust-highfive assigned alexcrichton Oct 21, 2015

alexcrichton closed this Oct 21, 2015

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Update to jemalloc 4.0.3 #14

Update to jemalloc 4.0.3 #14

Uh oh!

fbernier commented Oct 21, 2015

Uh oh!

rust-highfive commented Oct 21, 2015

Uh oh!

rust-highfive commented Oct 21, 2015

Uh oh!

alexcrichton commented Oct 21, 2015

Uh oh!

Uh oh!

Update to jemalloc 4.0.3 #14

Update to jemalloc 4.0.3 #14

Uh oh!

Conversation

fbernier commented Oct 21, 2015

Uh oh!

rust-highfive commented Oct 21, 2015

Uh oh!

rust-highfive commented Oct 21, 2015

Uh oh!

alexcrichton commented Oct 21, 2015

Uh oh!

Uh oh!