How Integers Should Work (In Systems Programming Languages) (regehr.org)
72 points by rcfox on Dec 4, 2011 | 37 comments


Propagating NaNs?? Gah, at least exceptions provide a backtrace.

What's needed is a dependent type system (like that of ATS or Ada) that allows the range of allowable values for an integer to be declared. A halfway-smart compiler can then statically check (using e.g. abstract interpretation) that the intermediate results of arithmetic performed on such integers do not exceed the system integer size, and that the results do not exceed the declared size of the result variable.

(In fact, if the programmer places runtime checks at the right places, such range declarations aren't even necessary as they can be inferred by the compiler.)

Such a system would statically guarantee that no overflow will occur. This means no spurious hard-to-track-down NaN results at runtime.
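For instance, a minimal C sketch of the runtime-check idea (hypothetical function, bounds chosen for illustration):

    #include <assert.h>

    /* The assert doubles as a range declaration: a range-propagating
       compiler can bound x to 0..999 and prove that x * 10 + 9 stays
       within 9..9999, so no overflow check is needed. */
    int scale(int x) {
        assert(0 <= x && x <= 999);
        return x * 10 + 9;
    }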


This is interesting but I've never seen a dependent type system (even Coq) that can do this completely. For example, let's say I'd like to turn a pointer into an integer (a reasonably common systems operation) so I define a type addr bounded between 0 and 2^32-1. If I add 100 to this value I don't see how I could statically prove that no overflow could occur. For that to work, wouldn't I need some way of defining the type of my address as 0 to (2^32-1)-100? If the address is returned from a call to malloc how can I prove that it not only has type addr, but also type addr_100_restricted at compile time?


To add 100 to a pointer, you would need a pointer to an allocation of at least 100 bytes.

Thus, your pointer would have a type that provides evidence that it points to at least 100 bytes. This would make pointer arithmetic adding 100 bytes possible. Something like:

  addPtr :: Address (size : Nat) -> (x : Fin size) -> Address (size - x)


Wouldn't that better be:

  addPtr :: Address (presize : Nat, size : Nat) -> (x : Fin size) -> Address (presize + x, size - x)
presize = number of 'slots' available before the current value.

size = number of 'slots' available after the current value.

That would allow for later subtraction, too.


Yes, indeed :-)

I did not mean to write an actual implementation, just to make a small point.

In an actual implementation, I'd probably want the type to convey information about the content stored at the address, too.


Can you explain the semantics of that notation? I can glean some of its meaning, but I'd like to understand it in full.


    addPtr : Address (size : Nat) -> (x : Fin size) -> Address (size - x)
(the :: above should have been a single colon) Reads as:

   "addPtr" is a value of the following type:
Function from two args: "Address (size : Nat)" and "(x : Fin size)" to result "Address (size - x)"

"Address (size : Nat)" is the "Address" type applied to one Natural number parameter named "size" (indicating the allocation size the value of that Address points at).

(x : Fin size) means it takes an "x" of type "Fin size". The "Fin param" type is a type that can hold a natural number 0..param-1. The resulting Address has a smaller size it points at.
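C has no dependent types, but a rough runtime analogue of that signature (hypothetical names, with the "Fin size" constraint checked at run time instead of compile time) might look like:

    #include <assert.h>
    #include <stddef.h>

    /* Fat pointer carrying the remaining allocation size, standing in
       for "Address (size : Nat)". */
    struct addr {
        unsigned char *ptr;
        size_t size;          /* bytes still valid at ptr */
    };

    /* Runtime analogue of addPtr: x must satisfy x < size (the
       "Fin size" constraint), and the result's size shrinks by x. */
    struct addr add_ptr(struct addr a, size_t x) {
        assert(x < a.size);
        return (struct addr){ a.ptr + x, a.size - x };
    }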


Ah, very clever construction, I like it. Thanks for the input.


ATS has similar constructs. Once you have allocated memory you can't do pointer arithmetic outside the bounds of the size of the memory allocated, checked at compile time.


This was posted as a comment on the article, where his comment system mangled my shifts thinking I was trying to use HTML:

I'm not sure I like allowing invalid intermediate results. It means that there are many expressions whose correctness is dependent on a compiler, and more specifically a compilation target. It's not hard to come up with an example of an expression that is valid, but only calculable on specific architectures.

If none of the examples that Arseniy came up with satisfy you, think of something absurd like (x << 20000) >> 19999. As it stands, no computer architecture I know of could calculate the proper result. But with a clever compiler you can derive an equivalent program that will execute correctly. If you have the clever compiler and allow invalid intermediate results, your program will execute correctly. If you have the same constraints and a slightly dumber compiler, then you will trap.
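For concreteness, the reduction a clever compiler might derive (a sketch; note the unreduced form is undefined behavior in real C, since the shift count exceeds the type width):

    #include <stdint.h>

    /* (x << 20000) >> 19999 is mathematically x << 1, provided the
       final result fits the type. A compiler permitting wide
       intermediates could rewrite the former into the latter; real
       hardware can only execute the latter. */
    uint64_t shift_reduced(uint64_t x) {
        return x << 1;   /* no 20000-bit intermediate required */
    }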

There are probably examples that are flat out impossible to execute without trapping on particular architectures, but fine on others. As a result, architecture and compiler details will start bubbling up and affecting program correctness. The ways in which this will happen will be far more perfidious (if less common) than the architecture details that programmers must already consider, such as word length.


Most of the time you'd want to use these sorts of integers you wouldn't use shift operators - as the author said,

> Of course there are specialized domains like DSP that want saturation and crypto codes that want wraparound; there’s nothing wrong with having special datatypes to support those semantics.

More to the point, I think you'd want to standardise on the optimisations a compliant compiler can/must implement. Algebraic rearrangements might be the way to do it, but I'm not 100% sure even they're possible. For the example given:

    result = result * 10 + cursor - '0';
If `result` is unsigned then I think the compiler has to assume that `cursor` is at least the character literal for zero. That's obviously true to us, but I'm not sure that sort of thing could be (reliably) implemented programmatically, even for simple cases.


> More to the point, I think you'd want to standardise on the optimisations a compliant compiler can/must implement.

That has two problems.

1. Once you start formalizing optimizations that must occur, you run the risk that some optimizations will be impossible or have horrible performance on some compilation targets. For example saying something like "an expression made up of only 32-bit addition operations must successfully evaluate if the result fits in a 32-bit integer" might be really painful to implement on an embedded system with only a few 32-bit registers.

2. Even if the correctness of an expression can be determined statically and reliably by the language standard, if the correctness of an expression depends on optimizations performed, then someone reading a program needs to know all of those optimizations. For example, it is often quite a laborious task to tell whether a C expression will evaluate correctly, because C has occasionally bizarre rules for the promotion of the types of operands. Fortunately there are some decently reliable heuristics that let you assume that an unsigned char added to an unsigned long will be promoted to an unsigned long before the addition is done. Now imagine that instead of understanding C's type system, you had to understand the algebraic reductions available to your compiler. The volume of arcana required to determine whether an expression will be correctly optimized or not would be absurd. And heaven forbid you tried to modify the expression: you might add a simple constant somewhere that breaks an optimization, and suddenly the expression fails to evaluate when x is 2^31 - 1 or something.
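To make the promotion example concrete, a sketch of C's usual arithmetic conversions:

    #include <stdio.h>

    /* The unsigned char is promoted to int, then converted to
       unsigned long by the usual arithmetic conversions, so the
       addition happens in unsigned long. */
    int main(void) {
        unsigned char c = 200;
        unsigned long l = 4000000000UL;
        unsigned long sum = l + c;   /* c -> int -> unsigned long */
        printf("%lu\n", sum);        /* 4000000200 */
        return 0;
    }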

When you consider these things, hopefully you will see why it is so much more preferable to just enforce that all intermediate results must be valid, and have the program writer do the algebraic reductions necessary to make (x << 20000) >> 19999 evaluate with no invalid intermediate values.


This should be titled "How Integers Should Work (In CPU architectures)". Programming languages have little to work with currently, and it is very inconsistent across architectures. Having an IEEE standard for integer math like we have for FP numbers could (...) help in the long term.

The only explicit "invalid" value I know of is available for floating point numbers. NULL in C is usually* not an invalid address, even if user-space programmers often seem to think so. Rather there are no pages mapped at that address and the kernel hands this page fault to user-space in some way. There is plenty of hardware where physical memory starts at 0x0. Lots of fun can be had debugging such a thing.

* IIRC there was a CPU architecture that interpreted "9" as an invalid address, but I can't remember what it was..?


I like the idea of making INT_MIN == -INT_MAX, but I already think null pointers are a bad idea, and having an invalid integer value might be even worse. You'd have similar problems as with floating point NaN, where the unexpected NaN != NaN case can cause subtle bugs.

In any case, processor vendors aren't about to change the way they do integer math so this discussion, while quite interesting, is completely academic.


It's funny, I think the opposite: I kind of like the idea of an iNaN, but I've never been bothered by the asymmetry of INT_MIN and INT_MAX. I also don't think overflow should necessarily cause an iNaN right away. "INT_MAX + 10 - 20" is legit, even though an intermediate result overflows.

> In any case, processor vendors aren't about to change the way they do integer math so this discussion, while quite interesting, is completely academic.

I thought so too initially, but really every processor already has a status register that shows you when overflow happened (otherwise big-integer algorithms would be hard), which is all you really need for this proposal. But it does require a bunch of tests around integer arithmetic, which makes it unlikely to be a good candidate for a C language addition (a language whose main impetus is speed).
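A sketch of what those tests look like in software, using the GCC/Clang __builtin_add_overflow intrinsic (which compiles to an add plus a branch on the overflow flag); INT_NAN here is a hypothetical sentinel, not part of any standard:

    #include <limits.h>

    #define INT_NAN INT_MIN   /* hypothetical iNaN encoding */

    /* Checked add with NaN propagation: an operand that is already
       iNaN, or an add that overflows, yields iNaN. */
    int checked_add(int a, int b) {
        int r;
        if (a == INT_NAN || b == INT_NAN || __builtin_add_overflow(a, b, &r))
            return INT_NAN;
        return r;
    }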


'"INT_MAX + 10 - 20" is legit, even though it overflows twice.'

In general, maybe so. But in systems programming? What could that possibly mean that is not a bug? There are answers to that, but since the answers are going to be <1% corner cases it doesn't justify defaulting to the dangerous case while requiring one to jump through hoops to get the safe case. If you desperately need "INT_MAX + 10 - 20", ask for it.


He addresses delaying iNaN generation in Premise 3. Adding explicit overflow tests after every integer operation is clearly a non-starter for a systems programming language.


I agree that silent wraparound is evil. Overflow should trap (and throw an exception if the language has those).

But I don't care for iNaN. I'd rather get the exception at the point the problem was created.

I'm also not sure what I think about the idea that intermediate results should be able to be wider than the integer type being computed with. I'd rather be able to introduce temporaries for subexpressions without changing the meaning of the program -- this is basic referential transparency.

That said, I'm not completely convinced by the argument that it's too dangerous for a language to support arbitrary precision by default. Special types or operators can be used in contexts, like interrupt handlers, where allocation is undesirable. We have an existence proof that this is workable: the Lisp Machine operating system.


This is wrong on so many levels it's hard to even start criticizing. TL;DR: the guy wants more or less the semantics of decimal floating point numbers (with NaNs and all) in integers, and believes that such "integers" would be "better", in his words: "to make it difficult for those specialized integers to flow into allocation functions, array indices, etc. That sort of thing is the root of a lot of serious bugs in C/C++ applications."

In fact, the integers that we have in current CPUs are the product of at least 50 years of engineering, and they are the optimum, even if somebody declares them "the root of a lot of serious bugs in C/C++ applications." I've got news for him: a lot of other and older languages have the same behavior because that's the local optimum for a hardware implementation, and it will remain so even once we have decimal FP numbers of the same speed as integers (even if we ignore that we actually won't). Before two's complement there were other integer variants implemented in hardware, and they turned out to be less functional than two's complement representation.

Then gives the example that if this code

  result = result * 10 + cursor - '0';
were evaluated as

   result * 10 + (cursor - '0')
there won't be overflow, totally ignoring that languages often have an exactly defined order of evaluation, and that if the language defines evaluation from left to right for + and - then the only proper evaluation is

  (result * 10 + cursor) - '0'
It's as simple as that.
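A small illustration of why the grouping only matters under trapping/iNaN semantics (assuming 32-bit unsigned): with C's wraparound the two groupings agree, because modular arithmetic is associative; a trapping implementation would fault on the left grouping's intermediate.

    #include <stdio.h>
    #include <limits.h>

    int main(void) {
        unsigned result = UINT_MAX / 10;   /* 429496729 if unsigned is 32-bit */
        char cursor = '5';
        /* left-to-right grouping: the intermediate exceeds UINT_MAX and wraps */
        unsigned left  = (result * 10 + cursor) - '0';
        /* regrouped: no intermediate ever exceeds UINT_MAX */
        unsigned right = result * 10 + (cursor - '0');
        printf("%u %u\n", left, right);    /* same answer: 4294967295 */
        return 0;
    }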

Then he ends with a loop that wouldn't work with floating point NaNs (and therefore wouldn't work with NaNs for integers either):

  for (x = INT_MIN; x != iNaN; x++)
because every comparison with NaN is unordered (even NaN != NaN is true), so the above loop would never end.

I'd say everything he wrote is not even wrong.


> that's the local optimum for a hardware implementation and it will remain so even once we have decimal fp numbers of the same speed as integers

This doesn't preclude the use of twos complement, and twos complement isn't enforced (or mentioned) in C's standard. It doesn't need to be expensive, either - as said in the article, "These should be simple ALU hacks." I don't see any reason why these operations on processors would be slower than the ones we have already. I wouldn't be surprised if they used a few more transistors, but on the scale of today's processors it won't cost anything at all compared to all of the silicon required for instruction pipelining, branch prediction and out-of-order execution.

> a lot of other and older languages have the same behavior

This isn't true either. If you'd take the time to read the other posts on the site (and probably the LLVM guide to undefined behaviour[1]) you'll find that the C and C++ standards place quite different restrictions on the behaviour of integers than, say, Java (which is by comparison rather strict).

The loop that you say "would never end"? It would end. When `x` overflows it'll be set to `iNaN`, and the loop will exit.

Your "order of operations" complaint? He's talking about simple, unobvious errors with a real-life example. He then proposes semantics for a new language that would solve this problem. What's your complaint here?

On the whole I thought John came up with a bunch of ideas, most of them good, almost all of them better than the status quo. I thought your post was exactly the sort of knee-jerk response I've come to expect from Reddit.

[1]: http://blog.llvm.org/2011/05/what-every-c-programmer-should-...


People also seem to be ignorant of the fact that this is from John Regehr, author of CSmith. He probably knows the C standard and C compiler behavior as well as the primary C compiler developers. He's well aware of the issues of backwards compatibility but is ignoring them for the purposes of this post.


"The loop that you say "would never end"? It would end. When `x` overflows it'll be set to `iNaN`, and the loop will exit."

I see you don't have experience with NaN semantics:

http://en.wikipedia.org/wiki/NaN

"A comparison with a NaN always returns an unordered result even when comparing with itself."

If you say "but iNaNs would be different" you already missed the point.
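The cited behavior is trivial to demonstrate with C99's math.h:

    #include <stdio.h>
    #include <math.h>

    /* IEEE 754 NaN compares unordered to everything, including
       itself, so x != x is true exactly when x is NaN. */
    int main(void) {
        double x = NAN;
        printf("%d %d\n", x != x, x == x);   /* prints "1 0" */
        return 0;
    }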


The author is proposing semantics for instruction sets and languages that don't exist.

He could have said "`!(x is iNaN)`", or "`!isINAN(x)`" instead of "`x != iNaN`", but he didn't. Maybe he should have, I'm sure he didn't give it much thought.

Nowhere does he say that the semantics of his `iNaN` are exactly analogous to those of existing floating-point NaNs, though. Implying that he did, so that you have one more nit to pick, damages the tone of the conversation.


"Nowhere does he say that the semantics of his `iNaN` are exactly analogous to the those of existing floating-point NaNs"

Since the current semantics is used everywhere today, using the semantics you imply for some new cases while keeping the current one for existing cases would add immensely to the confusion, and it would be wrong. It's not a nit to pick; it obviously shows that the author (and you too) is not acquainted with something that should be common knowledge in the profession since at least 1985: http://en.wikipedia.org/wiki/IEEE_754-1985

And that was exactly my point since my first post.


The chances that the author is not aware of what you are talking about are near 0, and what he is writing isn't an actual suggested specification; it is musings about pain points in the current way that things work.

You are complaining about a single line of code that demonstrates a concept, that cannot possibly be executed, and that makes no actual effort to be executable. You are arguing that it is a bad idea for a reason that is completely orthogonal to the spirit of the post. So sure, implicit in that loop is the idea that iNaN would be treated subtly differently than existing NaNs, but you are just responding to the text of an example that is written to be as simple and clear as possible for readers, rather than to the actual content of his message. That is nitpicking at best.


1) regehr isn't saying that he doesn't like two's complement, he's saying that he doesn't like two's complement wraparound. He's saying that it would probably make sense for integers to operate linearly or fail if they get to a point where they will become non-linear.

2) Arguing to keep a local optimum seems silly to me. Why not reach for the global optimum?

3) He's not arguing for a change in the order of evaluation, he's identifying a situation that would work with integers that operate linearly and showing that it doesn't work when integers behave non-linearly (ie: around INT_MAX and INT_MIN).


The biggest problem is that the author seems to be completely ignorant (willfully or otherwise) of the issues involved at the hardware level.

They go so far as to point out that high-level languages shouldn't have these limitations (sure, agreed), but then are baffled that something like C/C++ does (and should).

C maps more-or-less directly onto assembly, which in turn maps more-or-less directly onto command bit vectors for an ALU. There is no magic decision in the industry to use twos-complement, wrapping arithmetic, or fixed-size integers (which map onto machine words).

This person needs to spend some quality time with low-level architectural stuff and gain a better understanding of why things are the way they are.

(not that the points are bad, necessarily, for a DSL or something--but stay out of our systems languages!)

EDIT: Okay, author is clearly somewhat knowledgeable about the field, but still... these are interesting ideas that don't really make sense for low-level stuff. It's not a matter of allocating enough memory (for bignums or whatever)--it's how the mapping occurs to the actual hardware instructions and datatypes.


You and others seem to be missing that he is talking about the default 'int' type. If you want a specific overflow behavior, he implies that would be under a different name. I agree -- 'int' doesn't need to be the type that maps most efficiently to the hardware, even in a "systems language". It can and should be a fairly efficient type that has reasonable semantics.


Modifying a 2s complement ALU so that if the overflow bit is set, then every bit of the result is set to 1 would be trivial. It's just another n OR gates, and those are pretty cheap (OK, you'd probably want an AND against a control bit, too).

The author doesn't ignore that this would require a change at the hardware level - he says "Processor architectures will have to add a family of integer math instructions that properly propagate NaN values..." - and he is right that the change would not be a costly one.
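A software model of that gate-level tweak (a sketch; the all-ones pattern stands in for an iNaN encoding):

    #include <stdint.h>
    #include <stdbool.h>

    /* On signed overflow, force every result bit to 1 (the "n OR
       gates"), gated by a control bit. */
    uint32_t alu_add_inan(uint32_t a, uint32_t b, bool inan_enable) {
        uint32_t sum = a + b;   /* wraps, as the raw adder does */
        /* overflow: both operands' signs differ from the sum's sign */
        bool overflow = ((a ^ sum) & (b ^ sum)) >> 31;
        uint32_t mask = (inan_enable && overflow) ? 0xFFFFFFFFu : 0u;
        return sum | mask;
    }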


> The biggest problem is that the author seems to be completely ignorant (willfully or otherwise) of the issues involved at the hardware level.

There aren't any issues at the hardware level. Integer arithmetic is simplistic mostly because the languages that run on current CPUs have simplistic integer semantics, not because more complicated semantics would be impossible to implement in hardware.

Take a look at some of the complex integer instructions that SSE supports with single-cycle latency.


In Systems Programming Languages? For "writing operating systems"? There are people who argue we should be writing our operating systems in high-level languages anyway. Assuming the premise that we should be using a low-level language, how does the author reconcile that with "but I want the language to hold my hand when it comes to math"?

Choosing "overflow" or "underflow" to mean "I fucked up" is totally arbitrary. Variables usually indicate values that have a domain - a range of valid numbers. Saying "I don't want to think about what that it is, but oh if X hits 2 billion and change then warn me when some math fails" is no better than having it not fail at all. In most cases there's already a problem.

So, simply, writing "OS quality" code means explicitly checking inputs to ensure they are in the permissible range. Once you know what the range is, you know whether your code needs to go up to 64-bit math to handle them.
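In code, that discipline looks something like this (hypothetical function and ranges, for illustration):

    #include <stdint.h>

    /* Validate inputs against their domain first; the range analysis
       then says the product can reach 10^11, so it needs 64-bit math
       before scaling back into 32-bit range. */
    int32_t scaled(int32_t ticks, int32_t rate) {
        if (ticks < 0 || ticks > 1000000) return -1;   /* domain check */
        if (rate  < 0 || rate  > 100000)  return -1;   /* domain check */
        int64_t product = (int64_t)ticks * rate;
        return (int32_t)(product / 1000000);           /* fits: 0..100000 */
    }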

UPDATE: Some explanation for the downvote would be appreciated.


There have been many security holes and crashes caused by undetected integer overflow. The rationale is that detecting this condition would be a useful step toward preventing that category of bug.


The rationale, then, is that the compiler should catch mistakes that lead to security holes. On that basis we'll be adding GC memory management, so we never access freed memory, and also strongly defined types - e.g. bounded integers - and bounded arrays too. Writing the OS in Ada would satisfy this chap?

"catching security holes" is the compiler version of "think of the children".


Writing an entire operating system without touching a low-level language is largely the stuff of fantasy. High-level languages are, after all, built on top of low-level languages.


http://en.wikipedia.org/wiki/Lisp_machine

The C compiler in that machine was written in Lisp. :)


eighteen quintillion four hundred and forty-six quadrillion seven hundred and forty-four trillion seventy-three billion seven hundred and nine million five hundred and fifty-one thousand six hundred and sixteen

  18446744073709551616


Remind me not to hire anyone this guy has taught.

Oh. My. God.

[I'm kidding, of course. I learned a lot from teachers who had obvious axes to grind . . . just not the lessons they wanted me to come away with. Finals were an interesting exercise in remembering to cough up the party line, and even the TAs were snickering in some sessions.]



