-
Notifications
You must be signed in to change notification settings - Fork 13.6k
Document become
keyword
#113095
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: master
Are you sure you want to change the base?
Document become
keyword
#113095
Conversation
(rustbot has picked a reviewer for you, use r? to override) |
This comment has been minimized.
This comment has been minimized.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
A few small nits.
This comment has been minimized.
This comment has been minimized.
library/std/src/keyword_docs.rs
Outdated
/// is part of a recursive cycle in the call graph. | ||
/// | ||
/// For example note that the functions `halt` and `halt_loop` below are | ||
/// identical, they both do nothing, forever. However, `stack_overflow` is |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/// identical, they both do nothing, forever. However, `stack_overflow` is | |
/// identical: they both do nothing, forever. However, `stack_overflow` is |
library/std/src/keyword_docs.rs
Outdated
/// | ||
/// For example note that the functions `halt` and `halt_loop` below are | ||
/// identical, they both do nothing, forever. However, `stack_overflow` is | ||
/// different from them, even though it is written almost identically to |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/// different from them, even though it is written almost identically to | |
/// different from them. Even though it is written almost identically to |
library/std/src/keyword_docs.rs
Outdated
/// For example note that the functions `halt` and `halt_loop` below are | ||
/// identical, they both do nothing, forever. However, `stack_overflow` is | ||
/// different from them, even though it is written almost identically to | ||
/// `halt`, `stack_overflow` exhausts the stack and so causes a stack | ||
/// overflow, instead of running forever. | ||
/// | ||
/// | ||
/// ``` | ||
/// #![feature(explicit_tail_calls)] | ||
/// | ||
/// # #[allow(unreachable_code)] | ||
/// fn halt() -> ! { | ||
/// become halt() | ||
/// } | ||
/// | ||
/// fn halt_loop() -> ! { | ||
/// loop {} | ||
/// } | ||
/// | ||
/// # #[allow(unconditional_recursion)] | ||
/// fn stack_overflow() -> ! { | ||
/// stack_overflow() // implicit return | ||
/// } | ||
/// ``` |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This discusses a function that is "obviously wrong", which means it does not make it clear why one wants to use become
in "real" code. I think we can do slightly better than this, as the documentation should focus on improving the good cases, like e.g. writing "natural" recursive merge-sorts. The example improvement can still be contrived, however.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That makes sense, hmm. I guess the problem (similarly to the discussions on the RFC) is that there is no concise example where using tail calls makes sense in rust — most, if not all, small examples can be written just as good with a loop.
Maybe it would make sense to have two examples? One a bit silly, maybe a slice fold, and the other longer one with something like an interpreter?
Reading it now I see that this is a bad example, but I'm not sure what example would be good.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
A silly fold would be good! I'm not looking for "a loop wouldn't be just as good", just something that actually feels like something a human would want to write.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
How about a basic fibonacci sequence example? Technically this is a simple fold.
The first version presented could be the naive recursive version that is extremely inefficient and quickly hits a wall:
/// Returns the n-th fibonacci number. (using recursion)
fn fib_rec(n: i64) -> i64 {
if n <= 1 {
return n
}
fib_rec(n - 1) + fib_rec(n - 2)
}
Then we could introduce a tail-call based version that is way more efficient:
/// Returns the n-th fibonacci number. (using tail-calls)
fn fib_tail(n: i64) -> i64 {
fn fib_tail_acc(n: i64, a: i64, b: i64) -> i64 {
if n == 0 {
return a
}
become fib_tail_acc(n - 1, b, a + b)
}
become fib_tail_acc(n, 0, 1)
}
Note that a naive iteration based version isn't much more concise:
/// Returns the n-th fibonacci number. (using iteration)
fn fib_iter(n: i64) -> i64 {
if n <= 1 {
return n
}
let mut a = 1;
let mut b = 1;
for _ in 2..n {
let tmp = a + b;
a = b;
b = tmp;
}
b
}
Finally test that everything works:
#[test]
fn test_fib() {
for n in 0..30 {
assert_eq!(fib_iter(n), fib_tail(n));
assert_eq!(fib_iter(n), fib_rec(n));
}
}
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't think you would write fib_rec
like that, specifically because it re-does a lot of work. A more fare comparison would be to
fn fib_rec(n: i64) -> i64 {
fn fib_rec_acc(n: i64, a: i64, b: i64) -> i64 {
if n == 0 {
return a
}
fib_rec_acc(n - 1, b, a + b)
}
fib_rec_acc(n, 0, 1)
}
At which point it's all a bit moot... But either way it's not like we can actually show the problem with stack overflow in these simple examples.
I usually prefer factorial
, because it's less awkward since it doesn't require two previous values. But factorial
grows so fast that stack overflowing is not an actual problem soo idk...
Also just a nitpick: become
in fib_tail
won't compile since the signatures don't match :')
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I was previously thinking of writing a fold
, but 🤷
I think whatever example we put, it won't be perfect and the difference would need additional explanation of "this may stack overflow and this can't".
What we surely need is an example that shows difference in drop order and explains how without it LLVM/the optimizer can't necessarily do this as an optimization.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't think you would write fib_rec like that, specifically because it re-does a lot of work.
Yes, the more comparable solution was indeed the fib_iter
to start with.
Also just a nitpick: become in fib_tail won't compile since the signatures don't match :')
Good catch!
So a fold like this could serve as an example?
pub fn fold<T, U>(init: T, mut f: impl FnMut(T, U) -> T, iter: impl IntoIterator<Item = U>) -> T {
let mut iter = iter.into_iter();
match iter.next() {
None => init,
Some(item) => fold(f(init, item), f, iter),
}
}
I just tested it locally and with sufficiently large iterators it causes a stack overflow on my system whereas tail calls would prevent this by putting become
in front of the recursive fold
call. I tested the above fold with this little function:
#[test]
fn test_fold() {
let iterations = 100_000;
let output = fold(
String::new(),
|mut s, n| {
const HEX: [char; 16] = [
'0', '1', '2', '3', '4', '5', '6', '7', '8', '9', 'A', 'B', 'C', 'D', 'E', 'F',
];
s.push(HEX[n % HEX.len()]);
s
},
(0..).take(iterations),
);
assert_eq!(output.len(), iterations);
}
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I was thinking more of something like
pub fn fold<T, B>(slice: &[T], init: B, mut f: impl FnMut(B, &T) -> B) -> B {
match slice {
[] => init,
[first, rest @ ..] => fold(rest, f(init, first), f),
}
}
recursion + slice patterns looks nice
library/std/src/keyword_docs.rs
Outdated
/// For example note that the functions `halt` and `halt_loop` below are | ||
/// identical, they both do nothing, forever. However, `stack_overflow` is | ||
/// different from them, even though it is written almost identically to | ||
/// `halt`, `stack_overflow` exhausts the stack and so causes a stack | ||
/// overflow, instead of running forever. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Isn't LLVM allowed to optimize stack_overflow()
into loop { }
? I know we don't allow it to remove infinite loops, but...
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It is allowed, but it also is allowed not to.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yeah. I guess it's allowed to do it in all these cases, right?
I guess what I'm concerned about is the example being so trivial that it doesn't hold up to even trivial examination.
library/std/src/keyword_docs.rs
Outdated
// | ||
/// Perform a tail-call of a function. | ||
/// | ||
/// A `become` transfers the execution flow to a function in such a way, that |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/// A `become` transfers the execution flow to a function in such a way, that | |
/// A `become` transfers the execution flow to a function in such a way that |
Unsure about the direction of this documentation focusing on the absurd case. @rustbot author |
@WaffleLapkin |
@JohnCSimon this is blocked on actually implementing the feature (it would be silly to document a feature one can't use). Also I need to address the review comments above. Note for future self: example of |
@WaffleLapkin With #144232 merged, time to get back to this? |
0888a94
to
46ffda7
Compare
@workingjubilee I rewrote the documentation, could you do a general vibe check? does this direction seem better than the old one? |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Some nits
/// This means that as long as a loop in a call graph only uses tail calls, the | ||
/// stack growth will be bounded. | ||
/// | ||
/// This is useful for writing functional-style code (since it prevent recursion |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/// This is useful for writing functional-style code (since it prevent recursion | |
/// This is useful for writing functional-style code (since it prevents recursion |
/// fn fold<T: Copy, S>(slice: &[T], init: S, f: impl Fn(S, T) -> S) -> S { | ||
/// match slice { | ||
/// // without `become`, on big inputs this could easily overflow the | ||
/// // stack. using a tail call guarantees that the stack will not grow unboundly |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/// // stack. using a tail call guarantees that the stack will not grow unboundly | |
/// // stack. using a tail call guarantees that the stack will not grow unboundedly |
/// } | ||
/// ``` | ||
/// | ||
/// Compiler can already perform "tail call optimization" -- it can replace |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/// Compiler can already perform "tail call optimization" -- it can replace | |
/// The compiler can already perform "tail call optimization" -- it can replace |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm not sure I agree with this, can you elaborate?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
"compiler" is singular and countable, I believe such nouns require a determiner
but if we drop the formalities, I just feel that way and sentence does looks more complete with "the"
/// normal calls with tail calls (although no guarantees if it will perform it). | ||
/// However, to perform TCO, the call needs to be the last thing that happens | ||
/// in the functions and be returned from it. This requirement is often broken | ||
/// by drop code for locals, which is run after computing the return expression: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/// by drop code for locals, which is run after computing the return expression: | |
/// by drop code for locals, which runs after computing the return expression: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is run seems fine to me?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yeah, looking at this now, this is fine as it is
/// fn help() {} | ||
/// ``` | ||
/// | ||
/// For this reason `become` also changes the drop order, such that locals are |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/// For this reason `become` also changes the drop order, such that locals are | |
/// For this reason, `become` also changes the drop order, such that locals are |
/// | ||
/// In order to guarantee that the compiler can perform a tail call, `become` | ||
/// currently has these requirements: | ||
/// 1. callee and caller must have the same ABI, arguments and return type |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/// 1. callee and caller must have the same ABI, arguments and return type | |
/// 1. callee and caller must have the same ABI, arguments, and return type |
I believe it is Serial comma case here
/// | ||
/// fn fold<T: Copy, S>(slice: &[T], init: S, f: impl Fn(S, T) -> S) -> S { | ||
/// match slice { | ||
/// // without `become`, on big inputs this could easily overflow the |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/// // without `become`, on big inputs this could easily overflow the | |
/// // without `become`, on large inputs this could easily overflow the |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can you elaborate?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
There is no actual rule for it, but I will try to explain this: both "big" and "large" are pretty the same in casual speech, but in techical/scientific writings "large" is more preffered term, it just sounds more neutral or formal
/// 1. callee and caller must have the same ABI, arguments and return type | ||
/// 2. callee and caller must not have varargs | ||
/// 3. callee and caller must not be marked with `#[track_caller]` | ||
/// 4. callee and caller cannot be a closure |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/// 4. callee and caller cannot be a closure | |
/// 4. callee and caller cannot be a closures |
Shouldn’t it be plural, since varargs
above are plural? Not sure about this honestly
The feature is not yet implemented, so I'm not sure if we should merge this right away, promoting an incomplete feature is probably not the best idea. But the docs can be reviewed while the implementation work is being done.