Revamp empty special-casing in LINQ #96602

stephentoub · 2024-01-08T03:06:48Z

Enumerable.Empty used to return Array.Empty. Towards the beginning of .NET Core, LINQ was imbued with an internal "partition" concept for flowing more information around between operators, and as part of that, Empty was changed to return a singleton instance of a specialized partition implementation. The upside of this was that methods typed to return IPartition could return the same singleton as Empty. There are multiple downsides, however. For one, the whole IPartition concept is only built into a "speed-optimized" build of LINQ; builds that care more about size (e.g. browser) end up not having it, and thus Empty there ends up being Array.Empty, such that a different type ends up being returned based on the build, which is not ideal. Further, any paths that check for empty now effectively have two things to check for: the empty partition or an empty array, making those checks more expensive, if they're even done at all, or in some cases missing out on possible optimization. This is more pronounced today, now that [] with collection expressions will produce Array.Empty, and it'd be really nice if there wasn't a difference between Enumerable.Empty and [] assigned to IEnumerable<T>.

This change puts Enumerable.Empty back to always being Array.Empty. The internal IPartition-based APIs that drove us to need the EmptyPartition are changed to just use null as an indication of empty. Places we were already checking for is EmptyPartition are changed to check for an empty array (if they weren't already), and other APIs that weren't checking at all now have a check if it makes sense to do so (I audited all of the APIs, and didn't include checks in ones where it could meaningfully affect semantics, e.g. a fast path that might cause us not to get an enumerator from a secondary enumerable input).

Enumerable.Empty used to return Array.Empty. Towards the beginning of .NET Core, LINQ was imbued with an internal "partition" concept for flowing more information around between operators, and as part of that, Empty was changed to return a singleton instance of a specialized partition implementation. The upside of this was that methods typed to return IPartition could return the same singleton as Empty. There are multiple downsides, however. For one, the whole IPartition concept is only built into a "speed-optimized" build of LINQ; builds that care more about size (e.g. browser) end up not having it, and thus Empty there ends up being Array.Empty, such that a different type ends up being returned based on the build, which is not ideal. Further, any paths that check for empty now effectively have two things to check for: the empty partition or an empty array, making those checks more expensive, if they're even done at all, or in some cases missing out on possible optimization. This is more pronounced today, now that `[]` with collection expressions will produce Array.Empty, and it'd be really nice if there wasn't a difference between Enumerable.Empty and `[]` assigned to `IEnumerable<T>`. This change puts Enumerable.Empty back to always being Array.Empty. The internal IPartition-based APIs that drove us to need the EmptyPartition are changed to just use null as an indication of empty. Places we were already checking for `is EmptyPartition` are changed to check for an empty array (if they weren't already), and other APIs that weren't checking at all now have a check if it makes sense to do so (I audited all of the APIs, and didn't include checks in ones where it could meaningfully affect semantics, e.g. a fast path that might cause us not to get an enumerator from a secondary enumerable input).

ghost · 2024-01-08T03:06:56Z

Tagging subscribers to this area: @dotnet/area-system-linq
See info in area-owners.md if you want to be subscribed.

Issue Details

Enumerable.Empty used to return Array.Empty. Towards the beginning of .NET Core, LINQ was imbued with an internal "partition" concept for flowing more information around between operators, and as part of that, Empty was changed to return a singleton instance of a specialized partition implementation. The upside of this was that methods typed to return IPartition could return the same singleton as Empty. There are multiple downsides, however. For one, the whole IPartition concept is only built into a "speed-optimized" build of LINQ; builds that care more about size (e.g. browser) end up not having it, and thus Empty there ends up being Array.Empty, such that a different type ends up being returned based on the build, which is not ideal. Further, any paths that check for empty now effectively have two things to check for: the empty partition or an empty array, making those checks more expensive, if they're even done at all, or in some cases missing out on possible optimization. This is more pronounced today, now that [] with collection expressions will produce Array.Empty, and it'd be really nice if there wasn't a difference between Enumerable.Empty and [] assigned to IEnumerable<T>.

This change puts Enumerable.Empty back to always being Array.Empty. The internal IPartition-based APIs that drove us to need the EmptyPartition are changed to just use null as an indication of empty. Places we were already checking for is EmptyPartition are changed to check for an empty array (if they weren't already), and other APIs that weren't checking at all now have a check if it makes sense to do so (I audited all of the APIs, and didn't include checks in ones where it could meaningfully affect semantics, e.g. a fast path that might cause us not to get an enumerator from a secondary enumerable input).

Author:	stephentoub
Assignees:	-
Labels:	`area-System.Linq`, `tenet-performance`
Milestone:	9.0.0

eiriktsarpalis

Seems to be causing test failures?

src/libraries/System.Linq/src/System/Linq/Enumerable.cs

src/libraries/System.Linq/src/System/Linq/Range.cs

The tests were validating the underlying type of the operator returned for Concat, which is not material.

The check needs to be moved to before the count <= 0 check... otherwise calling Skip on an empty array with a count of 0 would still allocate an iterator.

stephentoub · 2024-01-08T14:23:35Z

Seems to be causing test failures?

Yup. Fixed.

stephentoub added area-System.Linq tenet-performance Performance related issue labels Jan 8, 2024

stephentoub added this to the 9.0.0 milestone Jan 8, 2024

stephentoub requested a review from eiriktsarpalis January 8, 2024 03:06

ghost assigned stephentoub Jan 8, 2024

build-analysis bot mentioned this pull request Jan 8, 2024

Checkout failure: "Git fetch failed with exit code 128" dotnet/arcade#9009

Open

2 tasks

eiriktsarpalis reviewed Jan 8, 2024

View reviewed changes

src/libraries/System.Linq/src/System/Linq/Enumerable.cs Outdated Show resolved Hide resolved

eiriktsarpalis reviewed Jan 8, 2024

View reviewed changes

src/libraries/System.Linq/src/System/Linq/Range.cs Show resolved Hide resolved

stephentoub added 4 commits January 8, 2024 09:09

Merge branch 'main' into linqempty

14063af

Rename IsImmutableEmpty to IsEmptyArray per PR feedback

da95bbf

Remove bogus PLINQ tests

bc1c21a

The tests were validating the underlying type of the operator returned for Concat, which is not material.

Fix Skip special-casing

b21c3ad

The check needs to be moved to before the count <= 0 check... otherwise calling Skip on an empty array with a count of 0 would still allocate an iterator.

eiriktsarpalis approved these changes Jan 8, 2024

View reviewed changes

stephentoub mentioned this pull request Jan 8, 2024

"Unexpected end of archive" while unzipping test assets #96627

Closed

stephentoub merged commit 8ffc96c into dotnet:main Jan 8, 2024

stephentoub deleted the linqempty branch January 8, 2024 16:32

cincuranet mentioned this pull request Jan 18, 2024

[Perf] Linux/arm64: 3 Regressions on 1/8/2024 10:15:44 PM dotnet/perf-autofiling-issues#27497

Closed

cincuranet mentioned this pull request Feb 1, 2024

[Perf] Linux/arm64: 3 Regressions on 1/8/2024 10:15:44 PM dotnet/perf-autofiling-issues#28329

Closed

github-actions bot locked and limited conversation to collaborators Feb 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Revamp empty special-casing in LINQ #96602

Revamp empty special-casing in LINQ #96602

Uh oh!

stephentoub commented Jan 8, 2024

Uh oh!

ghost commented Jan 8, 2024

Uh oh!

eiriktsarpalis left a comment

Uh oh!

Uh oh!

Uh oh!

stephentoub commented Jan 8, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Revamp empty special-casing in LINQ #96602

Revamp empty special-casing in LINQ #96602

Uh oh!

Conversation

stephentoub commented Jan 8, 2024

Uh oh!

ghost commented Jan 8, 2024

Uh oh!

eiriktsarpalis left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

stephentoub commented Jan 8, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants