#datomic
2017-07-05
matan03:07:43

Hi, I am looking into trying out Datomic.

matan03:07:45

Does Datomic help maintain any kind of schema or relationships among one's data, or does it simply store and retrieve keyed maps to the underlying storage, leaving any data connections to user code?

matan03:07:13

Could you kindly comment and/or point me in the right direction about that?

henrik05:07:32

@val_waeselynck The point would be to make it seem like writes complete faster than they actually do. Like I said, it’s more a thought exercise than anything else. Optimistic updates are sometimes used in UI design to make it seem as if things are faster than they are. You show the update as completed to the user, even as it is being sent to the backend.

henrik05:07:53

Facebook appears to do something akin to this. Your post is more or less immediately visible to whatever users happen to be provisioned to the same server/cluster as you, while it can take some time before the post actually propagates to other servers/clusters.

henrik05:07:35

@potetm So, let’s say that we’re running a forum where a user tries to create a new thread. Checks are made to make sure that the user is posting somewhere where they are allowed to post, and then the data is sent off to the transactor. What could make the transactor reject the data, and how would we normally deal with it?

val_waeselynck06:07:24

> Does Datomic help maintain any kind of schema or relationships @matan yes, Datomic is very good at that. The schema is explicit, although more flexible than in SQL databases. See http://docs.datomic.com/schema.html
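For example, a minimal schema for a couple of attributes might look like this (the attribute names are just illustrative; recent Datomic versions let you omit the :db/id and :db.install/_attribute boilerplate):

[{:db/ident       :user/email
  :db/valueType   :db.type/string
  :db/cardinality :db.cardinality/one
  :db/unique      :db.unique/identity
  :db/doc         "The user's email, used as an external identifier"}
 {:db/ident       :user/friends
  :db/valueType   :db.type/ref
  :db/cardinality :db.cardinality/many
  :db/doc         "References to other user entities"}]

You transact that vector like any other tx-data via d/transact.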

val_waeselynck06:07:02

@matan http://www.learndatalogtoday.org/ is good at showing how expressive Datomic querying is.

val_waeselynck06:07:01

@henrik I see; well this approach definitely has a lot of challenges, both in managing the state location and in orchestrating subsequent writes

matan08:07:48

thanks @gws @val_waeselynck for pointing me in all the right directions!

matan08:07:42

Okay, so with regard to schema: Datomic uses a concept it calls schema to describe the allowed attributes in the database (and this schema may leave dead data behind when altered). Hope I've followed the right lingo so far.

matan08:07:16

But what about relationships within the data? As I understand it, the unit of storage is a datom: > Each datom is an addition or retraction of a relation between an entity, an attribute, a value, and a transaction.

matan08:07:12

And as I understand it, unlike e.g. a (legacy) RDBMS, Datomic incorporates no enforcement of constraints on relations between datoms. Am I correct so far?

asier09:07:18

Hi, quick question. Can I get the :db/txInstant from (d/entity db 17592186046014), or is it only possible with Datalog?

mgrbyte09:07:37

@asier you can't reach the transaction directly through an entity. If you don't want to use a datalog query for some reason, you can use the datoms API to get the tx entity, then get the tx instant from that: (->> (d/datoms db :eavt 17592186046014) first :t ((partial d/entity db)) :db/txInstant)

mgrbyte10:07:07

oops, sorry about the typo (:t should have been :tx in the above)

asier09:07:31

Cheers - I have this error message IllegalArgumentException No matching clause: :t datomic.db.Datum (db.clj:326) - I'll dig into it

isaac10:07:50

@asier use :tx instead of :t

isaac10:07:30

A datom is a 5-element tuple: [:e :a :v :tx :added]
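Putting the correction together, the earlier snippet becomes (same example entity id):

(->> (d/datoms db :eavt 17592186046014)
     first
     :tx                ;; the tx component of the datom
     (d/entity db)
     :db/txInstant)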

hmaurer10:07:53

Hello! Quick question: why is there a distinction between “t values” and transaction IDs in Datomic?

hmaurer10:07:18

Also, is it possible to keep the transaction IDs and/or t values the same after a complete backup restore?

hmaurer10:07:01

e.g. regarding my second question, I would like to know if I can safely store transaction IDs and/or t values outside the system to “point at particular points in time”, and whether those references will be broken in case of disaster recovery

hmaurer10:07:32

Obviously I could also use txInstants, but it seems a txId / t value would be better

andrei11:07:03

@pesterhazy we managed to solve our Datomic query in one go using :with

[:find (max ?revision)
 :with ?id
 :in $ ?id
 :where [?m :appRegistry/id ?id]
        [?m :appRegistry/revisionNumber ?revision]]
This returns what we wanted: the app with the max revision for a given id.
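To run it, something like this (assuming a db value; the id value here is made up):

(d/q '[:find (max ?revision)
       :with ?id
       :in $ ?id
       :where [?m :appRegistry/id ?id]
              [?m :appRegistry/revisionNumber ?revision]]
     db
     "some-app-id")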

pesterhazy11:07:22

Haven't used :with yet

andrei11:07:13

:with allows you to consider additional vars for an aggregation

andrei11:07:23

as far as I understand

potetm12:07:42

@henrik What happens when you get a network blip between the peer and the txor, and your transaction never makes it to the txor?

henrik12:07:25

@potetm That’s a good example! So I guess, silently retrying behind the scenes in that case?

potetm12:07:55

And if the server that is trying the request happens to die while retrying?

henrik12:07:49

@potetm We’d be in trouble I bet.

potetm12:07:02

Long story short, without some intermediary persistence there would be data loss.

henrik12:07:23

You’d have the same amount of data loss without optimistic updates though.

henrik12:07:34

The problem lies in user expectations.

laujensen12:07:35

I'm looking for an intuitive way to update something like a user record. The user can fill out 50 fields which are submitted. If something has changed, it flies straight through without issue. But if something has been emptied, I can't just submit a map with a nil value; I need to make a separate retract transaction. Is there some idiomatic tool for simplifying that workflow?

potetm12:07:05

Not if you wait for the txor to respond that it's processed your tx.

henrik12:07:22

And what if the server waiting for the response dies in the meantime?

potetm12:07:49

You respond to the user that their request failed.

potetm12:07:58

Because you know that.

henrik12:07:48

In both cases we can know that the process failed, but in one case we made it seem like everything was a-OK before we were entirely sure.

henrik12:07:12

Equaling potential confusion for the user.

potetm12:07:47

Right. You would have to establish an understanding that things "aren't quite done until I get the green check mark"

laujensen12:07:02

Re my question above about updating a user record: what I'm after is something akin to a nil value triggering an automatic retraction.

henrik12:07:10

@potetm I think you’re right, and for things that are very essential, you would want to be pessimistic rather than optimistic. Optimistic writes would only make sense in the case where we know that we will be correct 99.99% of the time or better.

henrik12:07:46

If the success rate is very, very high, we could assume that it approaches 100% and declare victory “prematurely”

potetm13:07:32

I mean, I think you hit the nail on the head before. If the user has a good mental model for what's going on, you can do whatever you want.

potetm13:07:25

But you don't get that for free by just asynchronously writing to the db. You have to build lots of things, most importantly you must build user understanding.

hmaurer13:07:06

@henrik quick comment on the earlier discussion on optimistic writes: you mentioned that UI applications do it, but it seems like they have different concerns. A user-facing app has to be very responsive, and network delays can be pretty large. A backend application can usually afford slightly longer delays, but most importantly network delays are very low since your database will usually be running in the same local network

hmaurer13:07:55

@laujensen this was asked earlier in this channel. Hang on, I’ll try to find the message

hmaurer13:07:16

Roughly speaking I think the summary is that you would need to write your own transaction function which retracts the missing attributes.

hmaurer13:07:51

Don’t take my word on it though

hmaurer13:07:47

@laujensen actually, couldn’t you just write a helper function which, given a map of attributes (with potential nil values), generates the right set of assertions and retractions?
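A minimal sketch of such a helper, assuming cardinality-one attributes and an entity that already exists (all names here are made up):

;; assumes (require '[datomic.api :as d])
(defn upsert-tx
  "Given a db value, an entity id and a map of attribute -> value
   (where nil means 'clear this attribute'), returns tx-data with
   the corresponding assertions and retractions."
  [db eid attr-map]
  (let [entity (d/entity db eid)]
    (vec
      (concat
        (for [[attr v] attr-map :when (some? v)]
          [:db/add eid attr v])
        ;; :db/retract needs the current value, so read it off the db
        (for [[attr v] attr-map
              :when (and (nil? v) (some? (get entity attr)))]
          [:db/retract eid attr (get entity attr)])))))

;; usage (user-eid is a hypothetical entity id)
@(d/transact conn (upsert-tx (d/db conn) user-eid
                             {:user/name "Jane", :user/bio nil}))

Reading the current value on the peer leaves a small race window; doing the same thing inside a transaction function (as suggested above) closes it.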

laujensen13:07:22

Thanks buddy! And yeah, I could, but it just feels like fixing Datomic instead of using it. But it looks like it needs some fixing.

hmaurer13:07:22

What exactly would you like Datomic to do? If you would like to “replace” the entity (e.g. only keep the new attributes you are transacting, and retract all others) then you will need to write a transaction function from what I understand.

hmaurer13:07:12

But if you know, at the point where you make your new assertions, exactly which attributes need to be retracted, then I don’t see an issue generating those retractions in your app and transacting the assertions and retractions all at once

hmaurer13:07:02

Does this make any sense?

laujensen13:07:13

In my mind, Datomic should automatically retract anything that's assigned a nil value. That would simplify the interface.

hmaurer13:07:31

Oh, I see. Yeah, I am not sure what the implications of this would be but it could be neat. As I said you can implement that very easily though

hmaurer13:07:55

The “map” syntax of transact is just a convenience for writing a vector of assertions
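i.e. for an existing entity eid, these two forms transact the same thing (attributes are illustrative):

{:db/id eid, :user/name "Jane", :user/email "jane@example.com"}

[[:db/add eid :user/name "Jane"]
 [:db/add eid :user/email "jane@example.com"]]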

laujensen13:07:53

I've modelled a small wrapper after this principle of just joining retracted/edited fields in a generic sense.

hmaurer13:07:34

@laujensen oh I hadn’t read this blog post. Yes, that’s pretty much what I was suggesting

laujensen13:07:13

Good, then I'm on the right trail

laujensen13:07:19

Thanks for weighing in

hmaurer13:07:51

@laujensen Happy to help. Good luck 🙂

matan21:07:32

Thanks for the answers yesterday. Last newb question, I guess: is it fair to say that any constraints between datoms are left to transactions to explicitly maintain, or is there any other mechanism which is more declarative? I am pretty sure it's the former, not the latter.

hmaurer22:07:37

@matan the only constraint Datomic can keep track of is uniqueness, I think

hmaurer22:07:48

All other constraints are indeed left to the transactions to explicitly maintain

hmaurer22:07:46

This makes me wonder however: is it possible to define a transaction function in Datomic that should be executed on every transaction? As a way to maintain an arbitrary invariant

hmaurer22:07:26

@val_waeselynck maybe you would know? ^

val_waeselynck22:07:22

@hmaurer what you can do (at some performance cost of course) is define a transaction function which would wrap a transaction request, db.with() it, check the invariant, then transact it or throw an error.
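A sketch of that pattern, with a made-up function name and a made-up invariant (every user must have an email):

;; install the wrapper as a database function
@(d/transact conn
  [{:db/id    (d/tempid :db.part/user)
    :db/ident :myapp/checked-transact
    :db/fn    (d/function
                '{:lang     :clojure
                  :params   [db tx-data]
                  :requires [[datomic.api :as d]]
                  :code     (let [{db-after :db-after} (d/with db tx-data)
                                  bad (d/q '[:find [?u ...]
                                             :where [?u :user/name]
                                                    (not [?u :user/email])]
                                           db-after)]
                              (if (seq bad)
                                (throw (ex-info "Invariant violated" {:users bad}))
                                tx-data))})}])

;; every write then goes through the wrapper (user-eid is hypothetical)
@(d/transact conn [[:myapp/checked-transact
                    [[:db/add user-eid :user/name "Jane"]]]])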

hmaurer22:07:08

@val_waeselynck Oh I see, but what I meant is: is it possible to execute that function on every transaction, not upon request with the :db/fn attribute

hmaurer22:07:17

as a way to ensure that bad data can never get transacted

hmaurer22:07:33

It was more out of curiosity; I’m not sure I would want to do it in a production system

val_waeselynck22:07:51

@hmaurer no, there is no trigger-like mechanism in Datomic

val_waeselynck22:07:34

@hmaurer @matan I'm curious if there's a particular feature of other database systems you're trying to find here?

hmaurer22:07:03

@val_waeselynck no. I was just curious if you could forcibly maintain an invariant in this way

spieden22:07:07

Will the transactor process multiple transactions in parallel if they’re against different databases within the same storage?

val_waeselynck22:07:36

@hmaurer well just for performance reasons you'd probably want to be explicit about where to look for invariant violations, so having to wrap the tx in a function call does not seem like an additional cost to me

souenzzo22:07:42

@hmaurer you can use https://github.com/MichaelDrogalis/dire to wrap datomic.api/transact

hmaurer22:07:45

@souenzzo oh interesting, I’ll take a look, thank you

souenzzo22:07:37

(not sure if it's a best practice) 😅

val_waeselynck22:07:38

@souenzzo not sure this solves the same problem; error handling is about dealing with bad stuff after it happens, maintaining invariants is about preventing bad stuff from happening :)

val_waeselynck22:07:35

(Having said that, preconditions could do the trick)

hmaurer22:07:49

Off the top of my head, I guess a “soft” option would be to hope your code doesn’t mess up and respects the invariants (test it properly, etc.), but just in case have a service watch the transactions through the Log API and report any infraction
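For instance, a sketch of such a Log-API watcher (the validity predicate is up to you; names are made up):

(defn invariant-violations
  "Scan the transaction log between two t values (nil for open-ended)
   and return the datoms that fail an application-level check."
  [conn start-t end-t valid-datom?]
  (for [{:keys [t data]} (d/tx-range (d/log conn) start-t end-t)
        datom data
        :when (not (valid-datom? datom))]
    {:t t, :datom datom}))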

souenzzo22:07:34

The other day I thought of using dire to transform the tx-data of d/transact by adding my db/fn ... But reviewing it now, I don't know if it is able to do this

hmaurer22:07:34

I am not sure that would be a very judicious thing to do, it’s late, but I’ll throw it out here 😄

val_waeselynck22:07:35

Or a batch job which inspects the whole db periodically

hmaurer22:07:26

Or that, yes. So you get the peace of mind without the extra cost on each write

souenzzo22:07:42

At best, dire could inspect all tx-data and log it if any of it is missing your db/fn ... 😕

val_waeselynck22:07:59

@hmaurer Datomic has opened a world of new possibilities, we need all the crazy ideas we can get to explore it :)

hmaurer22:07:26

@val_waeselynck it’s great. It has a lot of the benefits of event sourcing without the pain of implementing it yourself

hmaurer22:07:43

I just started exploring it but I am going to have a lot of fun over the coming months 🙂

hmaurer22:07:03

Ah by the way, I have another question which you might know about @val_waeselynck :

hmaurer22:07:33

I understand there is no “order” clause with Datomic, but will the order of n-tuples returned by a Datalog query always be the same for a given db value?

hmaurer22:07:55

e.g. can I rely on it to do cursor pagination based on index in the result array, etc

val_waeselynck22:07:27

@hmaurer no, I don't believe so

hmaurer22:07:28

I suspect that the “re-indexing” step run periodically by the transactor might mess this up

val_waeselynck22:07:44

You'll need to sort the whole result yourself, then truncate. But you can pull most of the data downstream of that
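Something along these lines (the query and page size are illustrative):

(defn users-page
  "Deterministically ordered page of query results (page numbering from 0)."
  [db page page-size]
  (->> (d/q '[:find ?e ?name
              :where [?e :user/name ?name]]
            db)
       (sort-by second)            ;; impose an order on the peer
       (drop (* page page-size))
       (take page-size)))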

hmaurer22:07:14

Yeah so long as the data isn’t huge in-memory sorting should be fine

hmaurer22:07:57

Also, do you know if I can rely on tx ids or “t values” to reference points in time in the database, and store those externally?

val_waeselynck06:07:10

hmaurer: as you suspected, relying on Datomic eids remaining stable in the long term is generally discouraged, because that's not robust to log rewriting (having said that, I have yet to see a complete story about log rewriting with Datomic). Same goes for t values IMO. If :db/txInstant is not good enough for you, I suggest you annotate each transaction with a UUID-typed attribute using datomic.api/squuid.
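For example, assuming you've installed a :tx/id attribute of type :db.type/uuid (the attribute name is made up), you can annotate each transaction and store the squuid externally:

@(d/transact conn
  (conj your-tx-data                     ;; your-tx-data: whatever you were going to transact
        {:db/id (d/tempid :db.part/tx)   ;; resolves to the current transaction entity
         :tx/id (d/squuid)}))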

hmaurer22:07:14

e.g. if I ever need to restore the DB from a backup, will I be able to keep the same tx ids or “t values”?

hmaurer22:07:23

so as not to break those references

val_waeselynck22:07:51

These questions will have to wait until tomorrow - good night everyone, have fun!

hmaurer22:07:37

val_waeselynck: Good night!

spieden22:07:25

I’m looking at introducing a new database for handling some high-latency (700k+ average datoms) transactions versus adding them to my current db of low-latency ones (~30 average datoms). If I just do d/create-database using the same DDB table, will the transactor process low-latency transactions and high-latency ones at the same time (so the former aren’t held up by the latter)?