datomic 2021-08-10 | Slack Archive

Jakub Holý (HolyJak)10:08:54

Hi! Has anyone tried to build Gremlin support for Datomic (for Apache TinkerPop v3; I see there was a PoC "blueprint" for an old version)? Thanks

Jakub Holý (HolyJak)14:08:28

Does Datomic (on-prem) support Full-Text Search or does it have any integration with ElasticSearch? Neither scanning the docs nor searching the Internet answered that. Thank you!

favila14:08:44

On-prem supports fulltext: https://docs.datomic.com/on-prem/schema/schema.html#operational-schema-attributes

favila14:08:08

It’s very limited and you cannot remove it later. If you want flexible and powerful fulltext search over large text corpuses I suggest feeding the datomic transaction log into a secondary elasticsearch service and using that for fulltext searches.

favila14:08:49

If you just have tiny text fields and just want some kind of low-headache fuzzy matching, it’s probably fine.

Jakub Holý (HolyJak)17:08:39

Awesome, thanks!

Jakub Holý (HolyJak)08:08:11

@U09R86PA4 perhaps it would be good if https://docs.datomic.com/on-prem/schema/schema-change.html#schema-alteration listed :db/fulltext in the "You can never alter ..." list? Not sure how to propagate that suggestions to the Datomic team...

jaret11:08:29

Hi @U0522TWDA it's already there: You can never alter _:db/valueType_, _:db/fulltext_, _:db/tupleAttrs_, _:db/tupleTypes_, or _:db/tupleType_.

Jakub Holý (HolyJak)11:08:41

Ah, thank you, I'm blind today :)

kenny16:08:40

Do the following two dbs exhibit the same performance?

(def db (d/db conn))
(def db-as-of (d/as-of db (:t db)))

Put another way, as (:t db) grows further from the current :t (e.g., current :t is 1 > (:t db), 10 >, ..., 1e6 >, ...), does db pay the same performance penalty an as-of db pays? (current :t = (:t (d/db conn))).

favila16:08:37

A “current” (unfiltered) db is always fastest, because it only ever has to retrieve the current index. It only merges against that index and the memlog.

favila16:08:05

Potentially an as-of >= last-indexed-t will do the same and be the same complexity, but I donno.

favila16:08:34

anything older is going to have to look at the history or mid-history indexes to reconstruct the value at that moment in time.

favila16:08:02

whether this is slower-enough in practice depends on object cache, valcache, datasize, selectivity of queries, etc

favila16:08:17

but you’re merging across 3-4 indexes instead of just 2

kenny16:08:23

Interesting, thanks for responding. By "anything older is going to ...", do you mean both an as-of and an unfiltered db would go down this path?

favila17:08:29

an unfiltered db is by definition a “now” db, so only looks at the “now” index--the one that only has assertions valid at the moment it was indexed.

favila17:08:02

By “anything older” I mean anything that needs to look at datoms older than the last-indexed T

2021-08-10

Channels