https://scicloj.github.io/tableplot/ is getting close to Beta stage. While it still needs better documentation and a few missing features, we are close to deciding that the API is relatively stable. If anybody wishes to try it out and provide some feedback, that would be of great help. The main current question is: Does the API make sense, or should we change anything about it, such as function or parameter names, or certain behaviors?
> Is there a reference page that lists the main functions and supported keywords? @smith.adriane here is an initial attempt to do that: https://scicloj.github.io/tableplot/tableplot_book.plotly_reference
> Keep developing mainly the Plotly.js-based API. Does that imply that the plotly API is preferred over the other API?
Is there a reference page that lists the main functions and supported keywords?
> Does that imply that the plotly API is preferred over the other API? Yes. Maybe I should state that. It seems easier to extend for our needs.
> Is there a reference page that lists the main functions and supported keywords? No 😳 Soon!
Thanks for looking into it.
If the plotly API is preferred, then I would rearrange the docs to focus on that API. I might even move most of the docs for the other API to its own section at the end.
that is a big shift in the library for me (moving away from vega-lite/hanami) I hadn't quite realised that from the outside
Thanks. We are not moving away from Vega and classical Hanami, but might not extend it much in the near future. The hopes for additional functionality (coordinates, facets, etc.) will be easier to implement in Plotly. In any case, I'd be glad to discuss if you think we should and could extend the Vega-Lite/Hanami part further. Learning from your perspective will help a lot in prioritizing.
@otfrom does it seem to make sense? Both are supported and will be supported, but one of them seems easier to extend.
For someone that hasn't tried the API of either tech.ml.dataset or tablecloth, does someone know of an overview, or ideally a side-by-side comparison, of how they differ? (Basically something that 'sells' the added value of tablecloth)
tablecloth is a wrapper on the top of the tech.ml.dataset The main goal was to tidy up the API (which was a case a the very beginning of TMD) and add some missing stuff like pivoting and operations on grouped datasets. Generally the main idea was taken from R dplyr and tidyr packages.
tablecloth should work well for midsize datasets (say: millions of rows), while tech.ml.dataset allows to work on bigger stuff, many more sources and has some paths highly optimized, for example: grouping and aggregations in one step.
So, no simple answer for your question 🙂 You can use both libraries, entities are the same.
Thanks. I guess what I wanted to get at was: for those midsize datasets, how much more convenient is the tablecloth API... if it's just "once in a while it will spare you one line of code or one additional transformation", to me it wouldn't be worth it for one additional place to look at documentation. (and possibly to trace bugs of the abstraction on top of TMD)
I guess I will start experimenting with TMD directly and reconsider if I find anything cumbersome.
I'm just getting into this myself, but for what it's worth, I just picked up tablecloth for my problem, under the assumption it's easier/higher-level. I've found it to be nice to work with.
So, if you get going with TMD, and find it also-nice, then I guess you can just keep going. If you find it ponderous though (and I really have no idea), you could take a look at tablecloth.