2022-09-07 datascript | Clojure Slack Archive

datascript

Drew Verlee 2022-09-07T15:52:06.692069Z

Is there a function that i can wrap or modify that all queries somehow flow through? In our case we have some ids which might we want to look up two different ways, either by direct id or lookup ref. (d/pull db '[*] (choose [:server.db/id 1] 1)) and (d/entid db (choose [:server.db/id 1] 1)) or would i have to wrap pull, entid, q, etc...?

Niki 2022-09-07T20:45:18.245889Z

datascript.db/entid

Drew Verlee 2022-09-07T20:24:50.388149Z

And in the interest of exploring every option in fixing our datomic datascript syncing issue (that i mentioned several slack messages above) I wonder if i could actually just take a performance hit and swap out https://github.com/tonsky/persistent-sorted-set/blob/0683019c890de91b375baf4c7c8523d16d586e2a/src-clojure/me/tonsky/persistent_sorted_set.cljs#L2 for clojures sorted set and everything would just work. The only reason i assume this would be remotly possible is because of a liberal and optimistic interpenetration of the documentation on said persisted sorted set https://github.com/tonsky/persistent-sorted-set/blob/0683019c890de91b375baf4c7c8523d16d586e2a/src-clojure/me/tonsky/persistent_sorted_set.cljs#L2 which implies they are very similar; > "A B-tree based persistent sorted set. Supports transients, custom comparators, fast iteration, efficient slices (iterator over a part of the set) and reverse slices. Almost a drop-in replacement for [[clojure.core/sorted-set]], the only difference being this one can’t store nil."

Drew Verlee 2022-09-17T14:07:42.178889Z

Our fe browser app can use the emax set to the javascript max safe int.

Drew Verlee 2022-09-17T14:08:13.804589Z

@danbunea

Niki 2022-09-07T20:46:59.438509Z

Maybe, but it’s also possible you’d have to fix a few things. But why? Clojure’s sorted set is in every way inferior to persistent-sorted-set

Drew Verlee 2022-09-07T20:55:45.786049Z

Because it might be able to handle datomic generated db ids. Our application doesn't encapsulate the syncing process, so the db ids are hard to translate to shorter ones that the persisted sorted set can handle.

Drew Verlee 2022-09-07T20:59:26.647749Z

Trading perf for correctness would be a worthwhile trade off for us until i can setup something better or even just a way to fix problems faster we might discover in prod.

Niki 2022-09-07T21:00:16.998229Z

This is datascript limitation, not pss

Niki 2022-09-07T21:00:40.846289Z

You can easily lift it from datascript, it's completely artificial check

Niki 2022-09-07T21:01:13.657099Z

it's basically just used to distinguish entity ids from transaction ids

👀 1

Drew Verlee 2022-09-07T21:14:58.946469Z

Thanks, this is immensely valuable to me. @tonsky I have some simple questions that i should probably ask to make sure i understand completely. Are "transaction ids" temporary ids? I thought those were distinguished from entity ids by being negative.

Drew Verlee 2022-09-07T21:19:10.934609Z

When you say "lift" it, where "it" is the entid check to see if the number is less then some number, what do you mean? I could imagine forking datascript and changing the entid function to not have that max check, but maybe there is a better way? I can think of other ways: try catch block, you can probably just redefine a symobl, though i have never done that.

Drew Verlee 2022-09-07T21:25:45.518709Z

I thought, from reading this https://github.com/tonsky/datascript/issues/292, the entity ids in cljs had to be less then some max number, because of a bit wise operation that existed in the persisted sorted set library. But i suppose that's what the user claj suggested, but you never directly confirmed.

Drew Verlee 2022-09-07T21:50:08.111369Z

To be clear, this is an example of a db/id from datomic which throws if i try to transact it into datascript:

(def conn (datascript.core/create-conn {}))
(datascript.core/transact! conn [{:entity/name "foo" :db/id 285873023227265}])

Drew Verlee 2022-09-08T04:22:32.221119Z

I removed the (> eid emax) check and in cljs it doesn't throw any more (obviously!) but it also doesn't transact, so i'm confused because i thought you were implying it might be ok to remove that comparison check. 😕

Niki 2022-09-08T13:03:24.743769Z

> entity ids in cljs had to be less then some max number, because of a bit wise operation that existed in the persisted sorted set library Oh, JavaScript! Yes that might be the case

👀 1

Niki 2022-09-08T13:03:39.937939Z

> JavaScript stores numbers as 64 bits floating point numbers, but all bitwise operations are performed on 32 bits binary numbers. Before a bitwise operation is performed, JavaScript converts numbers to 32 bits signed integers. After the bitwise operation is performed, the result is converted back to 64 bits JavaScript numbers.

👀 1

Niki 2022-09-08T13:05:52.965399Z

I don’t think we use bit operations on keys though

Niki 2022-09-08T13:06:28.551279Z

But Datomic ids certainly exceed 32 bits that’s for sure

Niki 2022-09-08T13:09:48.496639Z

Seems to work in JS

Niki 2022-09-08T13:10:51.856029Z

Try increasing these numbers

(def ^:const tx0   0x20000000)
(def ^:const emax  0x7FFFFFFF)
(def ^:const txmax 0x7FFFFFFF)

👀 1

Niki 2022-09-08T13:11:37.965769Z

To the values that make sense for Datomic

Niki 2022-09-08T13:12:35.436709Z

emax/txmax probably something like Number.MAX_SAFE_INTEGER

Niki 2022-09-08T13:13:02.842999Z

tx0 is the boundary between entity ids and transaction ids. Check with Datomic where it draws the line and use the same

Niki 2022-09-08T13:18:46.548319Z

If I remember correctly, I limit those ids to 32-bit ints because they were (rumored) to be represented more optimally in JS engines. From correctness point, you can go as far as Number.MAX_SAFE_INTEGER (which I believe is something like 2^52-1), which should cover Datomic ids I hope

Drew Verlee 2022-09-08T13:44:41.876249Z

Why is there a different max for a transaction id (which i believe is either an entity id or a temp id) and entity ids? To get a sense of the difference, what would happen if tx0 and emax were set to the same value?

Niki 2022-09-08T13:50:32.107719Z

technically emax should be tx0 - 1. Not sure why it’s set here to the same as txmax

Niki 2022-09-08T13:51:09.287439Z

maybe entity ids were overflowing tx0 for somebody and index lookups stopped working

Drew Verlee 2022-09-08T14:04:21.281709Z

My previous question should have been why do we need tx0? To help me clarify, the id space seems to look like:

0 ---------------------------------------- emax
0        tx0----------------------------- txmax
 ^^^^^^^^^ ~ why have a range that tx cannot be but the eid can be? I assume this is because sometimes clients want to explicitly set there eids in a transaction, and that range (0-tx0) is what the library uses when they don't?

Drew Verlee 2022-09-08T14:07:16.984149Z

> technically emax should be tx0 - 1. Not sure why it’s set here to the same as txmax By chance did you mean to say txmax - 1?

Niki 2022-09-08T14:08:27.354149Z

0....eids....emax....txs...txmax

Niki 2022-09-08T14:08:35.302069Z

that was the idea

👀 1

Drew Verlee 2022-09-08T16:32:25.186289Z

In case anyone ever reads this thread, i want to point out that "transaction id" is the tx in a datoms [e a v tx op] and is necessary above and beyond an e/eid because we keep the history of an eid. see https://docs.datomic.com/cloud/whatis/data-model.html#example-entity for details. Some of my questions above are confusing because i misunderstood what the term meant.

Drew Verlee 2022-09-08T16:47:49.739899Z

i'm not sure in our case we need to increase the transaction id max, because were not getting more transactions.

Drew Verlee 2022-09-08T16:48:18.617129Z

ty ty niki. Right now it seems like i have everything i need.

👍 1

2022-09-16T11:22:15.235669Z

This topic of using datomic ids in datascript instead of making an artificial mapping between them, as I do now, is very interesting to me too. So @drewverlee at the end of the day you just increased the txmax, emax, tx0 and now you can use the same ids? If possible can you also tell me to which values?

👀 1

2022-09-19T07:31:22.150059Z

@drewverlee thank you. I'll try that too 🙂

Clojurians Log v2

datascript