datomic 2016-09-12 | Slack Archive

lvh02:09:49

hm; I’m trying to see if I can keep my stuff in datomic or if I should use core.match instead. I have data describing the general structure of something in terms of data (mostly predicates); I have something that’s maybe a specific instance of one of those things, and want to efficiently query if it is and if so, which one, e.g. kinds of things:

[{:id :xyzzy :path [“a" odd? “b” even?]}
 {:id :iddqd :path [“c” “d"]}]]

example:

{:path [“a” 3 “b” 2]}

… and I get :xyzzy. It seems like using the data there to construct a core.match expr than to query through datalog directly.

lvh02:09:44

(I appreciate that I can call arbitrary fns from datalog, but I guess core.match can probably do that path dispatchy bit faster :))

len08:09:58

Hi all, is there any ref on setting up datomic with postgres, specifically the URI for the peers and how it hangs together ?

danielstockton08:09:09

http://docs.datomic.com/storage.html#sql-database

hans08:09:14

I'm not sure about the ref, but it is simple and not really PG specific: Each peer needs to know the location of the storage and how to read it. When starting, it reads from the storage to determine the address of the current transactor.

len08:09:25

thanks hans that was what was confusing me

len08:09:40

since I only had the the storage in the uri

hans08:09:44

This is how failover works - A normal peer reads the transactor location and fails if it cannot contact it. The transactor tries to contact the transactor that's mentioned in the storage and if it cannot establish contact, registers itself as the new transactor in the storage.

len08:09:06

nice and simple !

yonatanel11:09:58

Given a parameter that is either an entity id (integer) or and identity uuid, I want to use the same query for both. I tried

(or [?e :entity/ex-id ?id]
    [?e :db/id ?id])

But I get :db.error/not-an-entity Unable to resolve entity: :db/id data: {:db/error :db.error/not-an-entity} How can this be done?

luposlip12:09:23

@yonatanel: I usually do something like this:

yonatanel13:09:58

@luposlip thanks. I wanted to use pull inside the query but ended up using if as well. Separating the identity from the pull could be nice though. Thanks.

luposlip13:09:59

You could also do something like this:

luposlip13:09:44

(or [?e :entity/some-ident ?obj-ident]
    [?e :entity/some-other-ident ?obj-ident]
    (and [?e :entity/some-attr-that-is-always-present-for-this-entity-type _]
         [(= ?e ?obj-ident)]))

luposlip13:09:46

Which is best/fastest depend on your style and use case 🙂

yonatanel13:09:51

@luposlip I remember trying that but couldn't make it work. Next time I'll remember to try this pattern again

luposlip13:09:14

Just tried it, it works fine.

luposlip13:09:37

In my case I use it for finding users.

luposlip13:09:51

But for performance reasons I usually check if the user-ident is a number, and then use a different (faster) query if that is the case. Then only if the ident is a String, I use the above, but without the (and …) clause.

robert-stuttaford13:09:07

@yonatanel (or [?id] [?e :entity/ex-id ?id])

robert-stuttaford13:09:45

but, let me introduce you to the wonder of lookup refs

luposlip13:09:20

Yes, I know lookup refs 🙂

luposlip13:09:31

[:user/username “some-user”]

robert-stuttaford13:09:34

(d/q '[:find ?e :in $ ?e :where [?e]] db a-long)
(d/q '[:find ?e :in $ ?e :where [?e]] db [:entity/ex-id a-uuid])

robert-stuttaford13:09:41

those two work the same

luposlip13:09:27

@robert-stuttaford for some reason I cannot get your example to work.

yonatanel13:09:38

If I remember correctly, if the ex-id doesn't belong to any entity, the lookup ref method explodes instead of returning nothing.

luposlip13:09:38

AssertionError Assert failed: All clauses in 'or' must use same set of vars, had [#{?id} #{?id ?e}]

yonatanel13:09:08

@robert-stuttaford Also, you must use the same vars on both sides of or

robert-stuttaford13:09:43

that is indeed a pity

robert-stuttaford13:09:43

didn't realise it threw when LR doesn't resolve

robert-stuttaford13:09:53

doesn't do that for missing ids

robert-stuttaford13:09:25

@luposlip: your query pattern works if the data type is the same for the first two clauses, but i think it may throw if not

robert-stuttaford13:09:53

@luposlip: curious; why return [?e ...] and not ?e .?

luposlip13:09:57

you’re probably right @robert-stuttaford

luposlip13:09:23

Just because in my case I simply want the user :db/id, not anything else.

robert-stuttaford13:09:35

?e . will give you the first result

robert-stuttaford13:09:46

[?e ...] will give you a vector of the results, no matter how many

luposlip13:09:50

ahh, didn’t realize that, cool! 🙂

robert-stuttaford13:09:51

you want ?e . 🙂

robert-stuttaford13:09:02

@luposlip @yonatanel these are pretty handy https://gist.github.com/robert-stuttaford/39d43c011e498542bcf8

luposlip13:09:52

will look at that @robert-stuttaford, seems sane.

robert-stuttaford13:09:53

(def uri "datomic:....") (as-conn uri) (as-db uri) (as-db (as-conn uri)) etc

robert-stuttaford13:09:23

oh, it's out of date, i've got more to add

robert-stuttaford13:09:42

added the log

luposlip13:09:47

:thumbsup::skin-tone-4:

luposlip15:09:40

Love the ?e . notation @robert-stuttaford. wonder why I haven’t seen this before 🙂

yonatanel17:09:01

When an entity has multiple identity attributes, which one identifies the entity if I change some of them in a transaction to values already in other entities?

robert-stuttaford17:09:20

@yonatanel what do you mean by identity attr -- indexed, or unique?

robert-stuttaford17:09:28

curious, and eager for input from the Cognitects amongst us, aside from d/query's timeout, what other protections do we have against queries that OOM the process cc @marshall @jaret?

robert-stuttaford17:09:27

by OOM, i mean the result set can't fit in the available ram. we've had it a couple times now where some query by someone somewhere will consume all the ram and then peg the CPU with a GC moshpit

robert-stuttaford17:09:35

sometimes it recovers, sometimes not

robert-stuttaford17:09:27

i'd like to know, conceptually, what the api offers to protect us from this. with traditional dbs you can issue a COUNT to the box 'over there' and protect the client app, but in this case, the peer will download everything when issuing a count

marshall17:09:43

In Datomic there isn’t an “over there"

robert-stuttaford17:09:53

i know 🙂 which is why i ask the question

jaret17:09:54

The results have to be fully realized in the JVM. So if the result set can’t fit in ram...

robert-stuttaford17:09:49

this is a growing pain, of course; things that were small a while ago have gotten bigger. we're already auditing our code and our queries, and finding ways to further partition and break down the work, but i just thought i'd ask what's in the peer library to help us

robert-stuttaford17:09:59

aside from d/query, which certainly does help somewhat

marshall17:09:42

Yeah, I’d say you might consider breaking up queries if you’re trying to realize large result sets.

marshall17:09:56

Possibly split into “get entites” then populate subsets of entities

marshall17:09:10

alternatively, there’s the direct datoms approach

robert-stuttaford17:09:13

i fully acknowledge that the answer may be 'nothing'

robert-stuttaford18:09:53

we're going to season everything we suspect with d/query and log the timeouts, and see where the fires start

marshall18:09:50

are you having OOM as the only issue or are you also looking for queries with perf issues?

robert-stuttaford18:09:21

we're having OOM on a box that only has Datomic as a ram sink. the rest is http + websockets, and we have way too little active connections to cause an OOM

robert-stuttaford18:09:52

we suspect that some query or queries for certain users (due to large source datasets) are consuming all the ram

robert-stuttaford18:09:06

so the diagnostic vector for us is query

marshall18:09:08

if you have things set up with timeouts and the ability to record them that sounds like a good approach. particularly if you can log the query parameters along with the timeouts

robert-stuttaford18:09:16

yes!

domkm19:09:30

Does sync-index include fulltext indexes?

domkm19:09:34

And if not, is there any way to ensure that fulltext indexes are fully created for a particular t before querying?

Lambda/Sierra20:09:12

Fulltext indexes are eventually-consistent by definition, so I expect there is no way to ensure they are up to date. http://docs.datomic.com/release-notices.html#0-9-4699

domkm20:09:22

Thanks @stuartsierra.

yonatanel22:09:52

@robert-stuttaford By identity attr I mean :db.unique/identity. I have several per entity. Perhaps I should have used only one as identity and the rest as unique, but I wanted to use flexible lookup refs, and even if I didn't have this problem it's still interesting to know which of the identity attributes will determine the entity.

yonatanel23:09:04

The docs say that in case of conflict the transaction will throw IllegalStateException, but in my case it didn't, since my "conflict" is between identities in the same entity, e.g asserting that [[email protected], "new unique name"] while [[email protected], "old name"] already exists will overwrite the existing name. Now I know.

Ben Kamphaus23:09:02

@yonatanel you asserted both of those datoms in one transaction (what that section of docs applies to)? It looks more like you’re running into the difference between unique identities http://docs.datomic.com/identity.html#unique-identities and unique values http://docs.datomic.com/identity.html#unique-values and how they behave when you assert different information.

2016-09-12

Channels