2022-05-01 core-typed | Clojure Slack Archive

core-typed

2022-05-01T02:11:31.697689Z

2022-05-01T03:20:19.402439Z

Random thought. How difficult would it be do design a static type checker (or maybe it's a kind of analyzer) which would track the keys and values of maps statically across an application so that: 1. It can assert that a map will have certain keys inside a given function 2. It can assert that a map might have certain keys inside a given function 3. It can be asked to tell you what keys a map at a particular place in the code could contain with a list of options

2022-05-01T03:21:42.643089Z

It wouldn't even need to track the type of keys or values, that be a bonus if it also can.

2022-05-01T03:28:45.469079Z

Without annotations?

2022-05-01T03:29:05.734689Z

I guess it would need to track the movement of the map through the code, and also track all instances of assoc or update on it where the key is constant. And somehow it would need to figure out that possibly unreachable code makes the key optional. I wouldn't expect it to track dynamic keys, like computed keys, though maybe indirect like a constant inside a variable maybe it could as well.

2022-05-01T03:29:52.283439Z

I guess part of my question is if it is possible at all, and how little annotations could it get away with

2022-05-01T03:31:58.181359Z

IIRC ocaml does something like this with row variables. I barely remember the details but you can imagine that it's held together delicately and has many restrictions (IIRC you can't use the same field in two unrelated objects in the same file).

Ben Sless 2022-05-01T04:56:04.425209Z

Seems like a rather arbitrary restriction, no?

2022-05-02T14:48:33.456739Z

Good point, I went back to the papers to stop guessing. You're right, the functions are inferred separately with polymorphic rows. See sections 2.1, 2.2 in https://caml.inria.fr/pub/papers/garrigue-structural_poly-fool02.pdf

2022-05-02T14:49:23.690259Z

I misremembered, the fields on an object are inferred on its definition, not its usages. see first example in https://caml.inria.fr/pub/papers/remy_vouillon-objective_ml-tapos98.pdf

2022-05-02T21:56:35.818409Z

I'm thinking not even doing anything with the types really, so lets assumes keys and values are just of type ANY. The question would be more:

(defn bazz [m] (get m :name))

Will :name exist on m in all cases, in only some cases, or in no cases.

2022-05-02T21:57:14.806909Z

And could you ask the type checker to list out all cases of the set of keys m could have from all the callsite to `bazz?

2022-05-02T21:59:51.152339Z

(bazz {:name 10 :age 10})
(bazz {:age 23})
(bazz (if wtv? {:hello true} {})

For example, you would get something like:

bazz is called with:
a :name key AND an :age key
or
a :age key
or
a :hello key
or
with no keys

2022-05-02T22:00:57.715969Z

So similarly, you could tell the type checker to assert that bazz is called with both a :name and :age key, and it could tell you if that's not the case. And maybe it could help you find where in the code woulds it get called without those keys.

2022-05-02T22:03:06.834209Z

And finally, I know some places would be undecidable, like say:

(bazz (hash-map (keyword (slurp "foo.txt")) "John"))

For those, I'd imagine the type checker would tell you that there are some unknown keys the map can have, and it would help you find where in the code those could be added.

2022-05-02T22:06:10.475429Z

And possibly it could let you add a runtime assertion and maybe the type checker could understand that or be told about it, for example:

(bazz (assert-key (hash-map (keyword (slurp "foo.txt")) "John") :name))

2022-05-02T22:07:10.637189Z

So now even though its dynamic, the checker would know that this should make it so the map contains the :name key as well.

2022-05-02T22:09:40.023729Z

Maybe calling it a type checker would be wrong, it be more like a static key checker.

2022-05-01T15:54:58.409219Z

Global inference is a whole other world, I think if you're trying to nail down the fields of a record and their types using global analysis it might be a necessary restriction. But take it with a grain of salt, haven't thought about this in 5 years.

Ben Sless 2022-05-01T15:59:58.701489Z

The basic model for type inference is sets, right?

2022-05-01T18:42:51.170759Z

Yeah sets of constraints. It's a delicate balance of expressivity to make them solvable.

Ben Sless 2022-05-01T18:53:46.115799Z

Can't maps be inferred then by treating them as a union of constraints regarding the occurrence of their entries?

Ben Sless 2022-05-01T18:54:36.029979Z

Maybe that just breaks everything, I'm no expert

2022-05-01T19:16:07.427079Z

I think you're right in terms of inference, but if you have 2 different records with the same field name, the type system won't know which record you're referring to.

(defn foo [m1] (inc (:a m1))
(defn bar [m2] (name (:a m2))
(foo {:a 1})
(bar {:a 'a})

If you're just trying to find the "type" of :a entries, is it Int or Sym or (U Int Sym)? That's the basic problem IIRC.

2022-05-01T19:17:07.398049Z

You still need (inc (:a m1)) to type check. So you need to choose. And IIRC it's simply forbidden so the choice is made implicitly.

Ben Sless 2022-05-02T03:26:56.899569Z

Why can't they be different sets? Their Union succeeds and their intersection fails. Their inference is tracked just like any set propagating through the calculation. What I'm missing is the need to have a global state associated with it

2022-05-01T03:32:31.766839Z

obviously that's one end of the spectrum (hindley-milner)

2022-05-01T03:34:28.682489Z

I think you end up with the same problem as automatically inferring Typed Clojure annotations. You'd need to track maps thru sequence fn's and transducers etc

2022-05-01T03:38:08.177319Z

looking at my dissertation again, I think this is a kind of Higher-order Control Flow Analysis https://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.36.6128

Clojurians Log v2

core-typed