Store token trees in contiguous Vec instead of as a tree #18327
Conversation
Related: https://rust-lang.zulipchat.com/#narrow/channel/185405-t-compiler.2Frust-analyzer/topic/mbe.20transcription

I tried this before (I also saw no perf impact like you are seeing, with a slight memory usage decrease IIRC, but the branch seems to be lost now), yet for some reason I was unable to actually integrate it without rewriting the mbe crate at the time. Can't recall why that was. But neat either way. Though I guess this will heavily clash with #17830 now 😬 (wonder when I'll be able to get back to that PR, probably not this year)
Perhaps that was the hardness of traversing the tree correctly? It was very ugly in my early attempts, but once I discovered the secret sauce (the right helpers) it became easy.
Might very well have been the traversal, yes.
Force-pushed from 4bda4a9 to 3796848.
☔ The latest upstream changes (presumably #17954) made this pull request unmergeable. Please resolve the merge conflicts.
Force-pushed from 3796848 to 9ff252b.
☔ The latest upstream changes (presumably #18366) made this pull request unmergeable. Please resolve the merge conflicts.
Force-pushed from 448b361 to a84450b.
I am generally fine with the changes, just one question. The iterator/cursor stuff does clean up some things as well, which is nice!
I expected this to be faster (due to fewer allocations and better cache locality), but benchmarked it is not (nor is it slower). Memory usage, however, drops by ~50mb (for `analysis-stats .`). I guess tt construction is just not hot. This also makes it easier to use even less memory for token trees by compressing equal spans, which I plan to do right after. Some workflows are more easily expressed with a flat tt, while some are better expressed with a tree. With the right helpers, though (finding them was mostly a matter of trial and error), even the worst workflows become very easy indeed.
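For context, here is a minimal sketch of the flat layout and a cursor over it. All names (`FlatToken`, `TopSubtree`, `Cursor`, `Span`) are hypothetical stand-ins, not the actual `tt` crate API; the key trick is that each subtree records how many buffer entries it spans, so a cursor can either step over it in O(1) or descend into it:

```rust
#![allow(dead_code)] // illustrative sketch, not the real `tt` types

#[derive(Debug, Clone, Copy)]
struct Span(u32); // stand-in for the real span type

#[derive(Debug, Clone)]
enum FlatToken {
    Ident(String, Span),
    Punct(char, Span),
    // A subtree stores how many buffer entries it occupies (including
    // itself), so traversal can step over it without recursing.
    SubtreeOpen { delimiter: char, len: u32, span: Span },
}

// All token trees of one expansion live in a single contiguous buffer.
struct TopSubtree(Vec<FlatToken>);

// A cursor over the flat buffer that understands subtree boundaries.
struct Cursor<'a> {
    tokens: &'a [FlatToken],
    pos: usize,
}

impl<'a> Cursor<'a> {
    fn new(top: &'a TopSubtree) -> Self {
        Cursor { tokens: &top.0, pos: 0 }
    }

    // Next token at the current nesting level, skipping whole subtrees.
    fn bump(&mut self) -> Option<&'a FlatToken> {
        let token = self.tokens.get(self.pos)?;
        self.pos += match token {
            FlatToken::SubtreeOpen { len, .. } => *len as usize,
            _ => 1,
        };
        Some(token)
    }

    // Descend into a subtree instead of stepping over it.
    fn enter(&mut self) -> Option<&'a FlatToken> {
        let token = self.tokens.get(self.pos)?;
        self.pos += 1;
        Some(token)
    }
}

fn main() {
    // `(a + b)` as a flat buffer: the subtree has len = 4, itself plus
    // the three tokens it contains.
    let top = TopSubtree(vec![
        FlatToken::SubtreeOpen { delimiter: '(', len: 4, span: Span(0) },
        FlatToken::Ident("a".into(), Span(1)),
        FlatToken::Punct('+', Span(2)),
        FlatToken::Ident("b".into(), Span(3)),
    ]);
    let mut cursor = Cursor::new(&top);
    // `bump` skips the whole subtree, so this prints only `SubtreeOpen`.
    while let Some(token) = cursor.bump() {
        println!("{token:?}");
    }
}
```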
Force-pushed from a84450b to ceba289.
We should immediately mark them as finished, on the first entry.

The funny (or sad) part is that this bug was pre-existing, but prior to rust-lang#18327 it caused us to generate bindings non-stop, 65535 of them, until we hit the hardcoded repetition limit, and then throw it all away. And it was so Blazingly Fast that nobody noticed. With rust-lang#18327, however, this is still what happens, except that now, instead of *merging* the fragments into the result, we write them on demand. Meaning that by the time we hit the limit, we've already written all previous entries.

"This is a minor change," I thought to myself when I was writing it, "and it's actually for the better, so who cares." Minor change? Not so fast. This caused us to emit 65535 repetitions, all of which the MBE infra needs to handle when calling other macros with the expansion, convert to a rowan tree, etc., which resulted in a *massive* hang.

The test (and also `analysis-stats`) used to crash with a stack overflow on this macro, because we were dropping some crazily deep rowan tree. Now they work properly.

Because I am lazy, and also because I could not find the exact conditions that cause a macro to match but with a missing binding, I just copied all the macros from tracing. Easy.
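To illustrate the control-flow difference, here is a hedged sketch with made-up helper names (the real transcriber in the `mbe` crate is far more involved):

```rust
const REPETITION_LIMIT: usize = 65535;

// Old behavior: fragments were accumulated and merged into the result
// after the loop, so hitting the limit just threw them all away.
// Wasteful, but Blazingly Fast and invisible from the outside.
fn transcribe_buffered(binding_finished: impl Fn(usize) -> bool) -> Option<Vec<u32>> {
    let mut fragments = Vec::new();
    for i in 0..=REPETITION_LIMIT {
        if binding_finished(i) {
            return Some(fragments);
        }
        if i == REPETITION_LIMIT {
            return None; // limit hit: discard everything accumulated
        }
        fragments.push(i as u32);
    }
    unreachable!()
}

// New behavior: fragments are written into the output buffer on demand,
// so by the time the limit is hit, 65535 entries are already in `out`,
// and everything downstream (nested macro calls, rowan conversion) has
// to chew through them.
fn transcribe_on_demand(out: &mut Vec<u32>, binding_finished: impl Fn(usize) -> bool) {
    for i in 0..=REPETITION_LIMIT {
        if binding_finished(i) || i == REPETITION_LIMIT {
            return;
        }
        out.push(i as u32);
    }
}

fn main() {
    // The bug: a missing binding never reports "finished".
    let never_finished = |_i: usize| false;

    assert_eq!(transcribe_buffered(never_finished), None);

    let mut out = Vec::new();
    transcribe_on_demand(&mut out, never_finished);
    assert_eq!(out.len(), REPETITION_LIMIT); // all 65535 entries survive
}
```

Hence the fix: report the missing binding as finished on the very first entry, so neither path spins up to the limit at all.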
As part of my attempts to optimize #18074, I noticed that token trees account for a significant portion of our memory usage (around 200mb out of 1650mb). As such, I went to optimize them.

`TokenTree` is quite large (64 bytes), and a significant part of this is spans. I came up with an idea to shrink spans, but it will be easier to do if token trees are contiguous in memory. Given that this seemed like something worth doing perf-wise anyway, I went to do this :)

I expected this to be faster (due to fewer allocations and better cache locality), but benchmarked it is not (nor is it slower). I guess tt construction is just not hot. Memory usage, however, drops by ~50mb (for `analysis-stats .`), probably due to `TokenTree` shrinking to 48 bytes.

Some workflows are more easily expressed with a flat tt, while some are better expressed with a tree. With the right helpers, though (finding them was mostly a matter of trial and error), even the worst workflows become very easy indeed.
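As for the planned span compression, one plausible shape for it is run-length encoding spans in a vec parallel to the token buffer, since consecutive tokens very often share a span. This is purely a hypothetical sketch under that assumption, not what this PR or any follow-up actually implements:

```rust
// Hypothetical sketch: instead of one span per token, store
// (span, run_length) pairs in a vec parallel to the token buffer.

#[derive(Debug, Clone, Copy, PartialEq)]
struct Span(u32); // stand-in for the real span type

#[derive(Default)]
struct SpanRuns {
    runs: Vec<(Span, u32)>, // (span, number of consecutive tokens with it)
}

impl SpanRuns {
    // Append the span of the next token, extending the last run if equal.
    fn push(&mut self, span: Span) {
        match self.runs.last_mut() {
            Some((last, count)) if *last == span => *count += 1,
            _ => self.runs.push((span, 1)),
        }
    }

    // Look up the span of the token at `index` in the flat buffer.
    fn get(&self, mut index: usize) -> Option<Span> {
        for &(span, count) in &self.runs {
            if index < count as usize {
                return Some(span);
            }
            index -= count as usize;
        }
        None
    }
}

fn main() {
    let mut spans = SpanRuns::default();
    for s in [Span(7), Span(7), Span(7), Span(9)] {
        spans.push(s);
    }
    assert_eq!(spans.get(2), Some(Span(7)));
    assert_eq!(spans.get(3), Some(Span(9)));
    // 4 tokens, but only 2 stored entries.
    assert_eq!(spans.runs.len(), 2);
}
```

A real implementation would likely binary-search prefix sums rather than scan linearly, but the point stands: a contiguous token buffer makes a parallel, compressed span table straightforward.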