This page is not created by, affiliated with, or supported by Slack Technologies, Inc.
2017-05-16
Channels
- # beginners (176)
- # boot (11)
- # cider (12)
- # cljs-dev (65)
- # cljsrn (54)
- # clojars (18)
- # clojure (195)
- # clojure-austin (1)
- # clojure-dev (2)
- # clojure-italy (8)
- # clojure-quebec (1)
- # clojure-russia (51)
- # clojure-serbia (3)
- # clojure-spec (24)
- # clojure-uk (28)
- # clojurescript (41)
- # cursive (14)
- # data-science (60)
- # datascript (2)
- # datomic (111)
- # emacs (6)
- # figwheel (1)
- # graphql (16)
- # hoplon (26)
- # juxt (2)
- # lein-figwheel (3)
- # lumo (12)
- # off-topic (8)
- # om (14)
- # pedestal (22)
- # perun (2)
- # proton (1)
- # re-frame (29)
- # reagent (27)
- # ring (17)
- # ring-swagger (2)
- # rum (3)
- # spacemacs (3)
- # unrepl (155)
- # untangled (28)
- # vim (4)
Hi @elise_huard - I don't know the reason - but I suspect it is just because the maintainers have moved onto other interests
The Clojure community does need a collection of good libs to support Data Science - either clojure based or good interop with java libs
@gigasquid I agree. I wonder whether Incanter is close enough that it's worth pushing that, or whether other tools need to be pushed
I suspect Incanter is trying to do too much
@elise_huard - there is also an interesting comment by nblumoe (one of the (former?) developer) of Incanter https://clojurians-log.clojureverse.org/rdf/2017-02-14.html starting at 09:55:30
so it seems that some developers moved on and some don’t need Incanter anymore (i.e are using directly matrix.core).
@gigasquid have you already had a look at https://github.com/MastodonC/kixi.stats and does this qualify as a good lib to support data science?
Also what @elise_huard said: Incanter might better be split up into multiple, composable libs.
what kind of contribution are you looking for? and it looks like that for a while now, pull requests haven’t been merged.
kixi.stats was made by someone who collaborates with the company I work for (Mastodon C) - it's very early days
and explicity for 'mid-size' data sets
nblumoe: I'm evaluating, let me get back to you on that 🙂
kixi.stats does some basic stats with transducers, but doesn't go much further than that (cc @henrygarner )
I've used Weka before - it's pretty good. Here are some other ones http://www.datasciencecentral.com/profiles/blogs/25-java-machine-learning-tools-libraries
of course there is a plethora of things for basic data handling, visualisation, etc. then there are basic number crunching libs (linear algebra etc.) and there are also higher level “data science” libs/frameworks for ML etc
maybe before the python era took off, Java was the prime environment for industrial data science with a general purpose language (excluding more specialized envs like Matlab etc.)?!
I see. Well I guess there are multiple but I am not sure how well they all play together, I mean stuff like jblas, nd4j, etc for the matrices/array processing. For plotting there are GRAL, JFreeChart, JavaFx, etc. etc.
So things definitely exist but I would be very skeptical about getting the same convenience as you get in the Python or R ecosystem
core.matrix
also has multiple, pluggable backends which might be worth having a look at to get a feel for the underlying (Java) numerical libs
@miikka core.matrix covers the numpy/pandas stuff (it may or may not at the moment have everything you may need). That, of course, is Clojure (but sits on various impls, including Java libs like vectorz). For matplotlib (ggplot2 is better imo), gyptis https://github.com/dvdt/gyptis may be sufficient, and it is browser vega/d3 based making it very dynamic in principal.
@elise_huard congrats on your EuroClojure talk http://2017.euroclojure.org/the-return-of-clojure/
@gigasquid Thank you 🙂
our company really does want to do everything data in clojure, so it's a practical objective 🙂
data exploration happens mostly in R atm, and we're looking to change that
@elise_huard also just saw you on the speaker list too. and even mentioning incanter in your abstract 😄
let’s definitely meet at the conference then. I am up to helping with DS tools for Clojure
cool, sounds good!
Bath, actually
as a first effort I think I'll try to reproduce some R work in clojure (possibly with incanter develop branch) and see how far that goes
(remote company)
we have quite a good share of AI/DS talks on the EuroClojure program. Will try to contact people to meet and discuss 🙂
cool 🙂
definitely up for meeting and discussing!
great. also don’t hesitate getting in touch regarding your thoughts and issues with Incanter.
will do. I'd like to contribute to the open source ecosystem around this.
and I AM looking forward to your talk as I really went the lazy route in the last year or so and just did stuff with R and Python
no pressure and all that 😉
same here though, so far
now that I got your attention already. any experience on your side or mastodonC with probabilistic programming? maybe even Anglican?
looks interesting, haven't used anglican
we have done some markov chain work, but not me personally
(probably completely unrelated)
I do wonder about using core.matrix only when you want to do something very matrix-oriented. I wonder if taking a "everything is a dataframe/dataset" approach like in R doesn't work for clojure where the powertools are all related to seqs and maps (partly why I find kixi.stats interesting)
there is powderkeg for mixing transducers with spark which I find interesting too https://github.com/HCADatalab/powderkeg