data-science

Zach 2024-10-28T01:35:00.853159Z

Any cider users? I’m curious how people view tablecloth or dataset tables in emacs

2024-10-28T06:41:59.949579Z

C-c C-p

Daniel Slutsky 2024-10-28T07:27:30.514489Z

I also really like cider-pprint-eval-last-sexp as otform proposed. Sometimes I use clay-make-last-sexp or clay-make-defun-at-point with https://github.com/scicloj/clay.el to send datasets to the browser. Then they can be styled with zebra colours etc., Then they can also be turned into interactive tables and hold some visualizations inside. https://scicloj.github.io/clay/clay_book.examples.html#datasets https://scicloj.github.io/clay/clay_book.examples.html#tables

2024-10-28T11:33:55.778599Z

I have sometimes used clerk for this as well (tho not usually via tap>) but cider-pprint-eval-last-sexp is probably what I use most often

Hasan Ahmed 2024-10-28T11:30:17.583389Z

hello, couple of stupid questions about tech.ml.dataset: 1- how to append new rows into a dataset? 2- is there a way to return json str instead of writing directly into a file? I'd like to use this to preview data in my web app, so datasets are really small.

2024-10-28T11:36:05.223899Z

re: 1 - I create a new dataset with the data and then concat the two usually. This from the Tablecloth documentation might help: https://scicloj.github.io/tablecloth/#joinconcat-datasets

šŸ‘ 1
šŸ™Œ 1
2024-10-28T11:36:58.397119Z

tablecloth will also let you get the rows back as maps using (tc/rows ds :as-maps) IIRC which can then easily be turned into json using whatever json tooling you have

šŸ‘ 1
šŸ™Œ 1
Hasan Ahmed 2024-10-28T11:54:22.501459Z

I am not using tablecloth though but I'll consider it further.

Hasan Ahmed 2024-10-28T11:54:34.321549Z

Thanks man

2024-10-28T11:56:01.840179Z

there will be tmd things that are being called underneath, I just don't know what they are off the top of my head. Tablecloth is very helpful though

2024-10-28T11:56:27.887639Z

I drop into tmd for things like group-by-column-agg and even into hamfisted for some of the parallel reducing there

Hasan Ahmed 2024-10-28T11:59:17.915569Z

yea, I like tc api better but i feel like using tmd will force me (in a away) into writing things that are more performant

Hasan Ahmed 2024-10-28T11:59:32.745169Z

just a hunch, I'll most likely end up using tc

2024-10-28T12:02:41.806279Z

tc sugar helps me go faster and then I can drop down to the stuff underneath when I need to

2024-10-28T12:02:48.217509Z

but YMMV

genmeblog 2024-10-28T15:24:03.586569Z

I think this is exactly the use case. If you need a performance - go for TMD and it's reducing pathways, if you work on small/mid-sized dataset TC should be enough.