Fork me on GitHub
#data-science
<
2023-08-18
>
Ben Lieberman20:08:03

Is there a tech.ml.dataset function like pandas.DataFrame.dropna (specifically I am after the subset parameter) or is it okay to use Clojure core functions on the underlying maps of a dataset?

chrisn21:08:48

There is a remove-rows command that you can pass the missing sets into. The missing sets themselves are roaringbitmaps and they have an .or member or you can use ham-fisted's set namespace to union the sets together. As far as clojure core on the rows - yes for sure and if they don't work exactly right that is an issue with TMD - you can and note there are functions such as row-map if you want to work row by row.

👀 2
genmeblog21:08:28

drop-missing in tablecloth does this job. https://scicloj.github.io/tablecloth/#Drop39

genmeblog21:08:23

If you pass a column selector it will act as subset