Fork me on GitHub
#clojure-uk
<
2018-03-24
>
dominicm07:03:19

@mccraigmccraig do you use much tinkerpop? I was looking at the various gremlin dsls last night.

dominicm07:03:37

Writing a datalog -> gremlin bytecode compiler is appealing, mostly because I like datalog.

mccraigmccraig08:03:28

we’re not using any tinkerpop atm - i’ve been trying to avoid being committed to DSE, so whatever graph stuff we do is currently done with set ops

dominicm09:03:09

@mccraigmccraig what are set ops? I don't think you would be committed to DSE, as you can use janusgraph, or is that not true?

yogidevbear09:03:56

Morning, and hello @gingerwizard simple_smile

👍 4
mccraigmccraig10:03:56

@dominicm set ops as in joins on edge-sets... i hadn’t come across janusgraph though - that might be useful

dominicm10:03:20

@mccraigmccraig fun history lesson. JanusGraph is forked from TitanGraph. TitanGraph were bought by DSE.

dominicm10:03:42

@mccraigmccraig another question, what are oltp queries like on dse?

mccraigmccraig10:03:02

yeah, i knew about titan->dse... didn’t know there was still a live oss version tho

mccraigmccraig10:03:59

dse is cassandra plus some integrated bits (spark aka analytics, graph aka titan and search aka solr)...

mccraigmccraig10:03:18

we’re only using cassandra + spark

mccraigmccraig10:03:27

so our oltp is just cassandra - it’s fast and scales well, but modelling is very different from an rdb

dominicm11:03:50

Wasn't sure about cassandra with oltp queries at all. Hard to get a feel when most people use it for olap/big-data stuff. I'm wondering about if mixing oltp & olap data in cassandra is fine, especially if you have graph queries to do lovely work on them both.

otfrom11:03:45

C* people generally want to heavy reads on a replicated dB rather than on the one taking heavy writes

jasonbell12:03:42

morning (near enough)

dominicm12:03:35

Oh C* is a Cassandra abbreviation.

dominicm12:03:16

@otfrom what do you mean by replicated db here? A materialised view?

otfrom13:03:20

A separate analytical database cluster

dominicm14:03:41

I see. That explains the Spark+C* of DSE better

mccraigmccraig14:03:36

@dominicm generally you have a separate “datacenter” (c* terminology) for spark, still synced with all the other “datacenters” in your cluster so no need for any ETL

mccraigmccraig14:03:47

DSE has a metadata store impl for spark-sql too, do you can point BI tools like tableau st your c*+spark cluster

dominicm15:03:49

@mccraigmccraig I'm wondering if that plays nice with the graph stuff. Why metadata and not something else?

mccraigmccraig16:03:35

the metadata server is part of the DSE special sauce - spark+c* can be made to work with oss versions