More

Groxx · 2026-05-02T20:06:02 1777752362

That's true for signed numbers too though? `int_min - 2 > int_min`

I agree they're a bit more error-prone in practice, but I suspect a huge part of that is because people are so used to signed numbers because they're usually the default (and thus most examples assume signed, if they handle extreme values correctly at all (much example code does not)). And, legitimately, zero is a more commonly-encountered value... but that can push errors to occur sooner, which is generally a desirable thing.

pron · 2026-05-02T21:15:41 1777756541

> That's true for signed numbers too though? `int_min - 2 > int_min`

As someone else already pointed out, that's undefined behaviour in C and C++ (in Java they wrap), but the more important point is that the vast majority of integers used in programs are much closer to zero than to int_min/max. Sizes of buffers etc. tend to be particularly small. There are, of course, overflow problems with signed integers, but they're not as common.

oasisaimlessly · 2026-05-02T20:28:55 1777753735

> That's true for signed numbers too though? `int_min - 2 > int_min`

No, that's undefined behavior in C, and if you care about correctness, you run at least your testsuite in CI with -ftrapv so it turns into an abort().

Groxx · 2026-05-02T20:31:33 1777753893

Which makes them even less safe than unsigned, where it is defined, yes? The optimizations that can lead to are incredibly hard to predict.

Besides, for safety there are much clearer options, like wrapping_add / saturating_add. Aborting is great as a safety tool though, agreed - it'd be nice if more code used it.

uecker · 2026-05-03T09:42:46 1777801366

You can have the trap during production, and then it is safer. If you need to catch the problem at run-time, there are checked integer options in C that you can use.

Groxx · 2026-05-02T19:20:57 1777749657

>If sizes are unsigned, like in C, C++, Rust and Zig – then it follows that anything involving indexing into data will need to either be all unsigned or require casts. With C’s loose semantics, the problem is largely swept under the rug, but for Rust it meant that you’d regularly need to cast back and forth when dealing with sizes.

TBH I've had very little struggle with this at all. As long as you keep your values and types separate, the unsigned type that you got a number from originally feeds just fine into the unsigned type that you send it to next. Needing casting then becomes a very clear sign that you're mixing sources and there be dragons, back up and fix the types or stop using the wrong variable. It's a low-cost early bug detector.

Implicitly casting between integer types though... yeah, that's an absolute freaking nightmare.

kibwen · 2026-05-02T20:16:30 1777752990

> As long as you keep your values and types separate, the unsigned type that you got a number from originally feeds just fine into the unsigned type that you send it to next.

Part of me feels like direct numeric array indexing is one of the last holdouts of a low-level operation screaming for some standardized higher-level abstraction. I'm not saying to get rid of the ability to index directly, but if the error-resistant design here is to use numeric array indices as though they were opaque handles, maybe we just need to start building support for opaque handles into our languages, rather than just handing out numeric indices and using thoughts and prayers to stop people from doing math on them.

For analogy, it's like how standardizing on iterators means that Rust code rarely needs to index directly, in contrast with C, where the design of for-loops encourages indexing. But Rust could still benefit from opaque handles to take care of those in-between cases where iterators are too limiting and yet where numeric indices are more powerful than needed.

bradrn · 2026-05-02T21:01:42 1777755702

> Part of me feels like direct numeric array indexing is one of the last holdouts of a low-level operation screaming for some standardized higher-level abstraction.

This paragraph reminds me a bit of Dex: https://arxiv.org/abs/2104.05372

dnmc · 2026-05-02T20:46:07 1777754767

Maybe this isn't what you're suggesting, but it's already possible to make an interface that prevents callers from doing math on indices in Rust — just return a struct that has a private member for the index. The caller can pass the value back at which point you can unwrap it and do index arithmetic.

kibwen · 2026-05-03T00:54:34 1777769674

More than that, in theory an opaque handle would also do things like statically prevent a handle taken from one array from being used to access a different array. I feel like this should be possible in Rust with type-level shenanigans (e.g. GhostCell).

thezipcreator · 2026-05-04T00:54:59 1777856099

this is possible in rust, albeit with a lot of shenanigans. See this article where someone made a GC in rust where the external references are bound to a specific GC via a unique lifetime: https://kyju.org/blog/tokioconf-2026/

Groxx · 2026-05-02T20:21:44 1777753304

You do need to store those if they're totally opaque though, e.g. how do you represent a range without holding N tokens? Often I like it, and it allows changing the underlying storage to be e.g. generational with ~no changes, but it kinda can't be enforced for runtime-cost reasons.

Using a unique type per array instance though, that I quite like, and in index-heavy code I often build that explicitly (e.g. in Go) because it makes writing the code trivially correct in many cases. Indices are only very rarely shared between arrays, and exceptions can and should look different because they need careful scrutiny compared to "this is definitely intended for that array".

Groxx · 2026-05-02T19:10:42 1777749042

Better than the current standard for AV, which is "what floor?"

loeg · 2026-05-02T19:16:56 1777749416

Is it? https://www.vice.com/en/article/california-dmv-suspends-crui...

HDThoreaun · 2026-05-02T19:24:52 1777749892

Cruise was entirely shut down because of an incident that didnt even result in a death. Thats way worse than what people tend to get

scarmig · 2026-05-02T20:04:14 1777752254

IIRC Cruise got into the most trouble not because of the accident itself, but because it tried to hide evidence from and deceive regulators.

HALtheWise · 2026-05-03T13:28:04 1777814884

In the context of this thread, it's worth pointing out that "trying to deceive regulators" is quite normal behavior for individual human drivers involved in car incidents, and iirc the Cruise collision itself also involved a human driver performing a hit-and-run who didn't afaict ever get prosecuted or come forward to police.

oceanplexian · 2026-05-03T04:31:59 1777782719

They have to operate in California though, so I don't blame them.

This is a state that made me a criminal for putting the wrong air filter on my car (Clearly my bad for putting on the 49 State legal version that makes the tailpipe emissions cleaner).

fragmede · 2026-05-02T22:38:32 1777761512

"it"; Kyle Vogt, their CEO at the time, is the person that decided to do it.

sagarm · 2026-05-02T23:10:52 1777763452

An incident, by the way, triggered by a human driver hitting a pedestrian and knocking them into Cruise's path.

That driver was never found. It's not clear what efforts, if any, were made to find them. After all the Cruise is covered in cameras.

input_sh · 2026-05-02T21:37:18 1777757838

It wasn't "because of an incident", it was because they were required to submit a report about that (or any other) incident, did so, and then the security footage proved that they straight up lied in the report about that particular incident.

If they just told the truth, they wouldn't lose their licence, but they couldn't even oblige by this piss-poor regulatory action in which they were required to do nothing but self-report any incident.

roenxi · 2026-05-03T02:19:25 1777774765

I believe you, but that really highlights how dangerous small regulatory overheads are. One - quite reasonable - frame on what you're saying is that there was no problem with Cruise except they failed to engage with the bureaucracy properly on some relatively minor points.

That sort of behaviour should be an aggravating factor if they're actually misbehaving. If they aren't, then it is poor policy to try and put them out of business over paperwork.

Groxx · 2026-05-01T21:33:45 1777671225

Definitely. At the very least, given the slow change in which ones are accepted, a cheap rental setup seems like the baseline that should exist... but everyone had to buy their own for my schools.

Groxx · 2026-05-01T16:39:17 1777653557

I can't tell if my main source of hate there is the homogeneity itself, or that it's excessively marketing-flavored (fluffy and aggrandizing, mere inches from "our incredible journey..." at all times).

Groxx · 2026-05-01T07:24:45 1777620285

And making them worse, let's not forget that.

Groxx · 2026-04-30T21:13:25 1777583605

>That's 3 ms per second = 0.3% CPU time wasted for every waiting thread.

I suspect that's actually "per process, per database (usually 1)", and not based on number of threads or tables. `data_version` semantics mean there's no need for more than one connection polling it, and it's being used as a relatively lightweight "DB has changed, check queues" check (that's pretty much its whole purpose).

Also I believe this is mostly intended for multi-process use, e.g. out-of-process workers, so an in-process dirty tracker (e.g. just check after insert/update/delete) isn't sufficient.

So I do think it's somewhat crazy, but it is at least very simple. fsnotify-like monitoring seems like a fairly obvious improvement tho, not sure why that isn't part of it. Maybe it's slower? I haven't tried to do anything actually-performant-or-reliable with fs notifications, dunno what dragons lie in wait.

Groxx · 2026-04-29T18:03:45 1777485825

Yeah, I really can't stand every vscode has done to the ecosystem for settings. JSON as a storage format for config is entirely fine, but it's a truly awful UX for changing things. But they're successful and it's easy to build, so everyone mindlessly copies them.

spockz · 2026-04-29T20:53:35 1777496015

Mind you that vscode does have an ui to change the settings. Also it has code completion when editing the json because of the linked json schema.

Groxx · 2026-04-29T21:47:19 1777499239

It has a UI to change some of the settings. And not even all of the settings with autocomplete help - I have no idea why those aren't equivalent to each other at least as a fallback, but they very clearly are not. And plugins frequently have major settings documentation gaps, including absolutely massively used ones like Go.

Compare that to something like a JetBrains product.

DangitBobby · 2026-04-29T20:53:18 1777495998

Zed is highly configurable via JSON (so already puts them in a tier above many editors) and also for some things like shortcuts provides an actual interface for editing that JSON. I imagine as time goes on they'll expose more interfaces for editing configurations. For now I'll take JSON over nothing/gui-only.

Groxx · 2026-04-28T19:49:11 1777405751

>even really strong development teams still occasionally let bugs like this slip through

agreed, though I think you'd be hard-pressed to find anyone who uses healthcare-related software professionally who thinks any "really strong development team" was involved in its creation.

Groxx · 2026-04-28T19:44:33 1777405473

Seems like this would probably be solved if github returned a patch file formatted like `git show` provides, specifically with the commit message indented? I do see that `git format-patch` doesn't do this indentation though.

In any case, agreed that it's not a great "feature" to use in-band signaling of when patch data starts, with no escaping. Confusion and misbehavior is pretty much guaranteed.

thaumasiotes · 2026-04-28T21:06:06 1777410366

> Seems like this would probably be solved if github returned a patch file formatted like `git show` provides, specifically with the commit message indented? I do see that `git format-patch` doesn't do this indentation though.

This would be "solved" if the patch file only included the patch. That's pretty straightforward. The file github provides includes fake email headers for no particular reason. The commit message appears to be part of the subject header. The subject header is never terminated, so arguably applying this patch shouldn't do anything. (Because the actual patch data is also part of the email headers.) The other headers aren't terminated either, so actually there is no subject header. This shouldn't really matter, because the patch file isn't email, but it does seem to want to pretend to be.

The usual question to ask here would be "why are you applying patch files from an untrusted source?". If patch(1) was stricter about the format of its input files... applying patches from an untrusted source would still be a good way to get owned. If you think I can get you to patch inappropriate files by writing a fake diff into my commit message... wait until you see what I can do by writing those same changes into the real diff.

quuxplusone · 2026-04-28T23:13:07 1777417987

> fake email headers

That's the output you get from `git format-patch --stdout -1 dd28283`. The idea is that it's suitable for emailing to a mailing list for review (hence the subject line beginning with [PATCH], and so on).

If you ask for colorized output with `git format-patch --color=always --stdout -1 dd28283` you'll see that `git format-patch` itself knows which bits are the commit message and which bits are the diff. (Well, of course it does, I guess.)

I suspect that if you sent a patch like this to the mailing list, they'd get mad at you. So `git format-patch` is working OK for its intended use-case. Arguably it's GitHub causing the problem here by "misusing" `git format-patch` as a way to deliver patches that are (these days) expected to be machine-readable — something you can just curl and pipe into `patch`. `git format-patch` doesn't do that.

That said, yeah, it's amusing that (as TFA says)

    git format-patch -1 HEAD --stdout > 0001 ;
    git checkout HEAD~ ;
    git am 0001

isn't a clean round-trip. `git am` applies the fake diff.

> If you think I can get you to patch inappropriate files by writing a fake diff into my commit message... wait until you see what I can do by writing those same changes into the real diff.

Well put. :)

quuxplusone · 2026-04-29T15:02:48 1777474968

jolmg points out that if you use a GitHub URL ending in .diff instead of .patch, you get something much more suitable to feed mechanically into `patch`. (And probably not so exploitable.)

Therefore I retract my claim that this is even a "misuse" of `git format-patch` by GitHub. Seems like GitHub provides both a git-am-able endpoint and a (less exploitable) patch-able endpoint, and the issue is just that OP chose the less suitable one of those two endpoints.

thaumasiotes · 2026-04-29T12:20:05 1777465205

> The idea is that it's suitable for emailing to a mailing list for review

It doesn't comply with RFC 5322 ( https://www.rfc-editor.org/rfc/rfc5322#section-2.2 ), which requires that email headers terminate in CRLF.