beginners 2021-10-19 | Slack Archive

Abhinav09:10:21

hi, I had a question about parallelism. suppose I have a bunch of functions, and I want to run all of them on separate threads (1 thread per fn), • I don't care about the return value, I'm running them for their side effects. • There's no shared state between the fns • I want the thread to clean up after itself. • I need it to run immediately when called. what do I need? thread, future, promise, or something else?

jaihindhreddy11:10:49

Clojure fns implement Runnable, which means you can construct a Thread with one and call .start on it.

(defn run-in-thread [f]
  (.start (Thread. f)))

(defn print-after [n]
  #(do (Thread/sleep (* n 1000))
       (println n)))

(run! run-in-thread (map print-after (range 8)))

run! is like map, but eager and purely for side effects, discarding return values. Edit: (map print-after (range 8)) is an example bunch of functions. > I want the thread to clean up after itself. What do you mean by this? Another thing to note is that you'll of course be creating n threads, where n is the number of functions you have. That may or may not be a good idea.

Abhinav15:10:32

> I want the thread to clean up after itself. I meant that it shouldn't just stay there consuming resources (I'm not fully aware about how threads work) I didn't know about run! also, wouldn't futures do the same thing? I did a little reading and I think futures might be what I need. Is there a more elegant solution than using n threads? I want all n functions to run concurrently irrespective of how long they take to run, i.e I would want them to start running at a given time. That's why n threads

indy17:10:51

You can very well use futures. Futures are tasks (Callables, like functions) submitted to a thread pool (Executors.newCachedThreadPool, to be specific). This thread pool will reuse previously spawned threads if they haven’t already been cleaned up, but it won’t limit the number of threads, so none of the tasks will be waiting in the pool. Futures are slightly more “managed” than raw threads. If you don’t care about their return values (fire and forget) and the number of functions is large, then using a capped thread pool is better. Otherwise you’ll get OOM errors if you’re spawning a lot of threads. Take a look at https://github.com/TheClimateCorporation/claypoole or simply the Java ExecutorService, which the former library is built on.
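The capped pool mentioned above can be sketched with plain Java interop. This is only a minimal sketch: `run-all-capped`, `counter`, and the pool size of 4 are hypothetical names and values picked for illustration.

```clojure
(import '(java.util.concurrent Executors ExecutorService TimeUnit))

(defn run-all-capped
  "Submits every zero-argument fn in fs to a fixed pool of at most n threads.
  Hypothetical helper for illustration, not part of any library."
  [n fs]
  (let [^ExecutorService pool (Executors/newFixedThreadPool (int n))]
    (doseq [f fs]
      ;; Clojure fns are both Runnable and Callable; the hint picks one overload.
      (.submit pool ^Runnable f))
    ;; shutdown lets already-submitted tasks finish, then releases the threads.
    (.shutdown pool)
    pool))

(def counter (atom 0))

(let [^ExecutorService pool (run-all-capped 4 (repeat 10 #(swap! counter inc)))]
  ;; Waiting (up to 5 s) is only here so the result can be inspected;
  ;; fire-and-forget callers can skip it.
  (.awaitTermination pool 5 TimeUnit/SECONDS))
```

Note the trade-off: with a fixed pool, tasks beyond the cap wait in a queue, so they do not all start immediately. If every function must start at once, one thread per function (or `future`) is still the fit.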

💯 4

Abhinav04:10:58

@UMPJRJU9E thanks for the explanation, I appreciate it.

Abhinav04:10:35

what would be a good estimate of the number of threads that would most likely cause me an error, I know that it is dependent on the CPU, but a rough estimate would be nice

indy05:10:56

That would depend on multiple factors like the configured max heap size, the size of the objects the threads are dealing with and how quickly the objects can be garbage collected. Better if you experiment with the numbers yourself by looking at the heap usage, garbage collection logs/graph, the state of the threads (waiting, parked or running) using tools like VisualVM (free) or YourKit (rave reviews, expensive, haven’t tried it myself).

💯 2

Abhinav03:10:28

thanks

Andrei Stan10:10:06

hello everyone, i have two vectors of maps, and i want to return one of the vectors, but with the values of the second for IP, or whatever i need...

(let [vecA [{:name "Home Desktop" :IP "192.168.0.1" :id "hhhhhhhhh"} {:name "Work Station" :IP "192.0.0.1" :id "ggg"}]
      vecB [{:name "Home Desktop" :IP ""} {:name "Work Station" :IP "178.138.35.22" }]]
<function_here>)

The expected result:

[{:name "Home Desktop" :IP "" :id "hhhhhhhhh"} {:name "Work Station" :IP "178.138.35.22" :id "ggg"}]

I tried to use map function in some ways without success.

Abhinav10:10:10

you can use update-in: (update-in vecA [1 :IP] (fn [_] "178.138.35.22")). 1 refers to the index of the map you want to change, :IP refers to the key of the map you want to change. update-in takes a function, which it calls with the old value to get the new value. Replace "178.138.35.22" with whichever value you need.
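When the new value does not depend on the old one, `assoc-in` expresses the same update without the throwaway function. A small sketch using the data from the question (`via-update-in` and `via-assoc-in` are illustrative names):

```clojure
(def vecA [{:name "Home Desktop" :IP "192.168.0.1" :id "hhhhhhhhh"}
           {:name "Work Station" :IP "192.0.0.1" :id "ggg"}])

;; update-in calls the fn with the old value; here the old value is ignored.
(def via-update-in (update-in vecA [1 :IP] (fn [_] "178.138.35.22")))

;; assoc-in sets the value at the same path directly.
(def via-assoc-in (assoc-in vecA [1 :IP] "178.138.35.22"))
```

Both forms return a new vector and leave `vecA` unchanged.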

emccue13:10:46

map with multiple arguments will do the trick

emccue13:10:17

(map 
  (fn [a b]
    (assoc a :IP (:IP b)))
  vecA
  vecB)

emccue13:10:15

(let [vecA [{:name "Home Desktop" :IP "192.168.0.1" :id "hhhhhhhhh"} {:name "Work Station" :IP "192.0.0.1" :id "ggg"}]
      vecB [{:name "Home Desktop" :IP ""} {:name "Work Station" :IP "178.138.35.22"}]]
  (map
   (fn [a b]
     (assoc a :IP (:IP b)))
   vecA
   vecB))
=> ({:name "Home Desktop", :IP "", :id "hhhhhhhhh"} {:name "Work Station", :IP "178.138.35.22", :id "ggg"})

emccue13:10:33

mapv if you want to preserve the vector-ness

Apple13:10:10

if they are ordered

(let [vecA [{:name "Home Desktop" :IP "192.168.0.1" :id "hhhhhhhhh"} {:name "Work Station" :IP "192.0.0.1" :id "ggg"}]
      vecB [{:name "Home Desktop" :IP ""} {:name "Work Station" :IP "178.138.35.22" }]]
  (map into vecA vecB ))

Marius06:10:48

@U3JH98J4R This only works if both vectors are complete (contain all items in the same order), right?

emccue12:10:15

yes
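If the two vectors are not guaranteed to be complete or in the same order, one option is to look entries up by :name instead of by position. A rough sketch, assuming :name is unique within each vector; `merge-by-name` is a hypothetical helper:

```clojure
(defn merge-by-name
  "Returns vec-a with each map's :IP replaced by the :IP of the map in vec-b
  that has the same :name. Maps without a match are returned unchanged."
  [vec-a vec-b]
  (let [by-name (into {} (map (juxt :name identity)) vec-b)]
    (mapv (fn [a]
            ;; select-keys on a missing entry (nil) yields {}, so merge is a no-op.
            (merge a (select-keys (get by-name (:name a)) [:IP])))
          vec-a)))

(def vecA [{:name "Home Desktop" :IP "192.168.0.1" :id "hhhhhhhhh"}
           {:name "Work Station" :IP "192.0.0.1" :id "ggg"}])

;; Same entries as in the question, but in the opposite order.
(def vecB [{:name "Work Station" :IP "178.138.35.22"}
           {:name "Home Desktop" :IP ""}])

(def merged (merge-by-name vecA vecB))
```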

Benjamin11:10:15

I'm trying to use httpkit to post to an Amazon Lambda URL and it throws this:

Unrecognized record version (D)TLS-0.0 , plaintext connection?

do you know?

Benjamin11:10:45

doing the same with curl works for some reason

Ben Sless11:10:30

which version of httpkit?

Benjamin11:10:37

2.6.0-alpha1

Ben Sless11:10:08

are you using the sni client or regular client?

Benjamin11:10:38

hm I require like this [org.httpkit.client :as client]

Benjamin11:10:49

probably regular if I don't know

Ben Sless11:10:18

try to require org.httpkit.sni-client as sni

Ben Sless11:10:50

then assoc to the request in :client @sni/default-client

Benjamin11:10:59

yeee bois that works thanks 🎉 😄

🎉 1

Ben Sless11:10:22

which JVM version were you using?

Benjamin11:10:02

"16.0.2+7"

Ben Sless11:10:54

Also, a brief note on how I recognized the issue, for other readers and our future selves: TLS issues can occur if the client has no SSL configurer. Docstring from the namespace:

Provides an SNI-capable SSL configurer and client, Ref. #335.
  In a separate namespace from `org.httpkit.client` so that
  http-kit can retain backwards-compatibility with JVM < 8.

The sni client:

Like `org.httpkit.client/default-client`, but provides SNI support using `ssl-configurer`. NB Hostname verification currently requires Java version >= 11.

👍 1

Ben Sless11:10:11

so the first thing to check with TLS is the SNI client. If you have more issues with TLS it might be the list of ciphers you're using

Ben Sless11:10:32

Also, you should probably switch to Java 17, it's LTS

👀 1

Ben Sless11:10:42

but that's beside the point

Lycheese12:10:40

Hi, I'm currently working on transforming deeply nested maps into hiccup-html. Input:

{:val "Tree 1", :children [{:val "var1", :children
                             [{:val "nested1", :children []}
                              {:val "nested2" :children []}]}
                           {:val "var2", :children
                             [{:val "child of var2", :children []}]}]}

Output:

[:ul
 [:li "Tree 1"]
 [:ul
  [:li "var1"]
  [:ul
   [:li "nested1"]
   [:li "nested2"]]
  [:li "var2"]
  [:ul
   [:li "child of var2"]]]]

Current solution:

(defn render-tree [{:keys [val children]}]
  [:ul
   [:li val]
   (for [child children]
     (render-tree child))])

The above solution does exactly what I want it to, but -- if I understand recursion in Clojure correctly -- will cause a stack overflow on deeply nested lists. What should I read up on to be able to design a better solution than this?

emccue13:10:45

i’m not certain there is a tail recursive version of this to be made without getting creative (in any language)

emccue13:10:12

buuuut, one idea could be to “explode” your nested data

Apple13:10:50

lazy comes to mind...

emccue13:10:19

{ [0]     "Tree 1"
  [0 0]   "var1"
  [0 0 0] "nested1"
  [0 0 1] "nested2"
  [0 1]   "var2"
  [0 1 0] "child of var2" }

emccue13:10:18

like if you could get it to this (which should be possible) you could be more clever about building the data structure - maybe building “inside out”

emccue13:10:29

but for reasonable data sizes idk if its gonna be worth it (or possible, i’m just guessing here). Even clojure.walk doesn’t do anything like this https://github.com/clojure/clojure/blob/master/src/clj/clojure/walk.clj

emccue13:10:35

(defn explode-nested-data [data]
  (let [data [data]]
    (loop [[position & todo] [[0]]
           result         {}]
      (if (nil? position)
        result
        (let [node (get-in data (interpose :children position))]
          (recur (concat (map (fn [n] (conj position n))
                              (range (count (:children node))))
                         todo)
                 (assoc result position {:val (:val node)})))))))
=> #'dev.mccue.mtgbot/explode-nested-data
(explode-nested-data
 data)
=>
{[0] {:val "Tree 1"},
 [0 0] {:val "var1"},
 [0 0 0] {:val "nested1"},
 [0 0 1] {:val "nested2"},
 [0 1] {:val "var2"},
 [0 1 0] {:val "child of var2"}}

emccue13:10:03

here is a crack at the first one - i get nervous with concat but I think this is a valid use of it

Lycheese13:10:45

I think you're right and exploding the data is probably the most feasible way to do this (with the added benefit of making it simpler to update the structure). Thank you for your input 🙂

emccue13:10:59

(def exploded {[0] {:val "Tree 1"},
               [0 0] {:val "var1"},
               [0 0 0] {:val "nested1"},
               [0 0 1] {:val "nested2"},
               [0 1] {:val "var2"},
               [0 1 0] {:val "child of var2"}})
(defn implode-data-back
  [data]
  (let [sorted-by-nesting (sort-by (fn [[k _]] (count k))
                                   data)]
    (loop [[[nest {:keys [val]}] & remaining] sorted-by-nesting
           built-up-data                 [[]]]
      (if (nil? nest)
        (first built-up-data)
        (recur remaining
               (update-in built-up-data
                          (cons (first nest) (map #(+ 2 %) (rest nest)))
                          (fn [data]
                            (conj (or data []) :ul [:li val]))))))))

(implode-data-back exploded)
=>
[:ul
 [:li "Tree 1"]
 [:ul [:li "var1"] [:ul [:li "nested1"]] [:ul [:li "nested2"]]]
 [:ul [:li "var2"] [:ul [:li "child of var2"]]]]

emccue13:10:06

this….might do it

Lycheese13:10:03

That is a lot more sophisticated than what I was doing. And more necessary than I thought. (I tried just making [:ul>ul>ul>li val] constructs with the times ul appears being the length of the vector, but I need to connect the list items with lines so…) Thank you so much for this solution.

emccue13:10:06

definitely ask a smarter person - i’m sure there is a way to do this that isn’t as horrible

Lycheese13:10:01

I guarantee you that it is of higher quality than the rest of the codebase XD

emccue14:10:38

more clever != higher quality

Lycheese14:10:06

also true I'm still struggling to understand how implode-data-back guarantees the correct order of the nested items 😅

Lycheese14:10:31

And why map + 2? Is that relative to the maximal nesting?

emccue14:10:20

because [:ul [:li "…"] <HERE>]

emccue14:10:34

so if its 0 0, you need to put the data in slot 2 there

Lycheese14:10:52

ah ok

Lycheese14:10:06

making nested1 and nested2 be in the same ul is going to be fun.

Ed15:10:37

I'm not totally convinced that you'll get a stack overflow from your original code @U02FM0NNZAB... for is lazy (in fact what you've written there is the same as (map render-tree children)) and you're returning a lazy data structure that will call render tree when realised. I would think that means that you're only calling into render-tree from outside render-tree ... if that makes sense? ..

Ed15:10:59

(defn render-tree [{:keys [val children]}]
  [:ul
   [:li val]
   (for [child children]
     (render-tree child))])

(defn build-tree [acc n]
  (if (pos? n)
    (recur {:val "val" :children [acc]} (dec n))
    acc))

(count (render-tree (build-tree {:val "end"} 1000000)))

that takes a few seconds to run on my machine, but it succeeds

Ed15:10:32

I think that's what @UP82LQR9N was alluding to earlier in the thread

Lycheese15:10:39

@U0P0TMEFJ Thank you for testing that. I'm still struggling to understand the practical implications of laziness…

Ed15:10:38

👍 ... it's easy enough to test at the repl 😉

Lycheese16:10:21

yeah, but I was too caught up in thinking that it wouldn't work anyway that I never actually tested whether it fails so, thank you for that

Lycheese16:10:19

I tested it now and since I need to render something from the output it causes a stackoverflow. The maximum n that I can pass to build-tree is ~~192~~ 193. Not sure if I can somehow split it up into chunks to prevent that. (or if that is even relevant for my use case)

Lycheese18:10:02

I was able to increase n to 217 by wrapping it in lazy-seq:

(defn render-tree [{:keys [val children]}]
  (into [] (lazy-seq
            [:ul
             [:li val]
             (map render-tree children)])))

Ed09:10:16

I would think that that call to lazy-seq isn't doing anything, because you're calling into, which will eagerly consume the lazy seq. The laziness in there is due to map returning a lazy seq. So that code should be equivalent to

(defn render-tree [{:keys [val children]}]
    [:ul
     [:li val]
     (map render-tree children)])

Lycheese09:10:35

You're right. Without the into it goes up to 223. (but it won't render correctly without the into since hiccup can't render lazy-seqs)

Ed10:10:48

I don't think that the into + lazy-seq is doing anything different to just returning a vector with [:ul ,,,] ... lazy-seq will just turn the vector it's given into a lazy seq and the (into [] ,,,) will turn that lazy-seq back into a vector. These are not recursive things, and will not change the individual elements, so the third element will still be a lazy seq (returned by map) either way. If it needs to be a vector, then you can use mapv which will eagerly generate a vector instead of returning a lazy-seq and will probably introduce the recursion problem you were worried about 😉
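The point about `into` and `lazy-seq` only touching the top level can be checked at a REPL. A minimal sketch, assuming JVM Clojure (the class is named differently in cljs); `tree`, `plain`, `round-trip` and `render-tree-eager` are illustrative names:

```clojure
(defn render-tree [{:keys [val children]}]
  [:ul
   [:li val]
   (map render-tree children)])

;; Variant using mapv: the third element is built eagerly as a vector,
;; which means render-tree-eager recurses on the call stack.
(defn render-tree-eager [{:keys [val children]}]
  [:ul
   [:li val]
   (mapv render-tree-eager children)])

(def tree {:val "a" :children [{:val "b" :children []}]})

(def plain      (render-tree tree))
(def round-trip (into [] (lazy-seq (render-tree tree))))

;; The round trip yields a vector again, and its third element is still
;; the lazy seq produced by map.
(vector? round-trip)                                ;; => true
(instance? clojure.lang.LazySeq (nth round-trip 2)) ;; => true
(vector? (nth (render-tree-eager tree) 2))          ;; => true
```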

Lycheese10:10:08

I really don't know what conclusion I should draw from this. Pray no user ever generates a tree larger than 217 nodes?

Lycheese11:10:45

I think I'll just use the recursive solution for now since it at least works and lets me test the rest of my structure. I suspect that it's possible to implement a more robust solution with clojure.walk/postwalk so I'll put that in my todos.

Ed12:10:05

I'm not sure where your 217 limit is coming from ... The example code I posted above has a nesting of 1000000, so if you're getting a stack overflow I'd suggest it's from something else?

Ed12:10:19

Or you've simplified the code posted here too much?

Lycheese12:10:40

I haven't simplified it but count does not realize the entire seq. <-- at least it doesn't in cljs

Lycheese12:10:46

Huh. It's cljs specific

Lycheese12:10:26

So js allows less linear recursion than the jvm. I don't think I'll be able to outsource generating the html to the backend, since it needs to incorporate re-frame functions in the final form.

Ed12:10:57

I'd be surprised if count didn't realise the whole seq ... either it knows how big the seq is cos it's already realised, like a vector, or it needs to find out how many elements are in the seq by realising ... right?

Lycheese12:10:26

But it's nested, right? And count doesn't count down to the last level.

(count [1 [2 3 [4]]])
;; => 2

Lycheese12:10:52

So it's inefficient to realize the entire seq just to count its elements

Lycheese12:10:30

It only needs to know how many are in the top level.

Ed12:10:25

yeah .. count realises the whole seq ... but doesn't touch any of its elements
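This can be observed with a side-effecting log. A small sketch in which `touched` and `outer` are illustrative names: counting the outer lazy seq runs the outer mapping fn for every element, but the nested lazy seqs are never realised.

```clojure
(def touched (atom []))

(def outer
  (map (fn [i]
         (swap! touched conj [:outer i])
         ;; The inner seq is created here but not realised.
         (map (fn [j]
                (swap! touched conj [:inner i j])
                j)
              (range 2)))
       (range 3)))

(def n (count outer))
;; n        => 3
;; @touched => [[:outer 0] [:outer 1] [:outer 2]]
```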

Ed12:10:31

I don't think we've got to the bottom of where the limit is coming from

Ed12:10:40

what's the actual error you're getting?

Lycheese12:10:54

too much recursion in the js/console

Ed12:10:11

is there a stack trace to go along with that?

Lycheese12:10:20

sec

Ed12:10:40

... sorry I don't do a lot of cljs, so I may well be asking a dumb question ...

Lycheese12:10:24

action-request-action failed 
Object { meta: null, cnt: 3, shift: 5, root: {…}, tail: (3) […], __hash: null, "cljs$lang$protocol_mask$partition0$": 167666463, "cljs$lang$protocol_mask$partition1$": 139268 }
 InternalError: too much recursion
    cljs$core$array_map_index_of core.cljs:6631
    cljs$core$ILookup$_lookup$arity$3 core.cljs:6916
    cljs$core$ILookup$_lookup$arity$2 core.cljs:6913
    cljs$core$IFn$_invoke$arity$2 core.cljs:1955
    cljs$core$IFn$_invoke$arity$1 core.cljs:3331
    cljs$core$pr_sequential_writer core.cljs:10053
    cljs$core$IPrintWithWriter$_pr_writer$arity$3 core.cljs:10378
    cljs$core$_pr_writer core.cljs:776
    cljs$core$pr_writer_impl core.cljs:10124

Lycheese12:10:31

I hate cljs errors

Ed12:10:42

😉 ... people say that about the clj errors too ... but tbh I used to work with jboss, so I kinda find them an improvement over that

Lycheese12:10:11

I'm okay with Java Stacktraces since I learned programming with Java but js is something else

Ed12:10:13

so the error is coming from the printer?

Lycheese12:10:35

because I generated it from the repl probably

Ed12:10:42

can you print the incoming data structure?

Ed12:10:54

I mean the thing you pass to render-tree

Ed12:10:27

it should have the same depth of data structure ... right?

Lycheese12:10:55

no I can't same error

Lycheese12:10:27

but build-tree isn't mundane recursion right?

Lycheese12:10:44

does cljs have tco?

Ed13:10:59

the build-tree fn I posted above uses recur to do tail call recursion which basically transforms the recursive form into a loop so you don't consume the stack when you use it ... I think that's supported in cljs

Ed13:10:55

I think that's the only way to do tco in both clj and cljs

Lycheese13:10:41

not sure if mutual recursion via trampoline counts as well, but that's the only one I know of as well
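For reference, a minimal sketch of mutual recursion via `trampoline`: each function returns a zero-argument fn instead of calling the other directly, and `trampoline` keeps invoking the result until it gets a non-fn value, so the stack stays flat. `my-even?` and `my-odd?` are illustrative names.

```clojure
(declare my-odd?)

(defn my-even? [n]
  (if (zero? n)
    true
    ;; Return a thunk instead of calling my-odd? directly.
    #(my-odd? (dec n))))

(defn my-odd? [n]
  (if (zero? n)
    false
    #(my-even? (dec n))))

(trampoline my-even? 1000000)
;; => true
```

Calling `(my-even? 1000000)` without `trampoline` would just return the first thunk.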

Lycheese13:10:54

but build-tree failing seems to suggest that even if I managed to get an iterative implementation of render-tree it would still cause the same error…

Ed13:10:13

I wonder if it's the printer that's the problem

Ed13:10:01

the data returned by build-tree is eagerly generated - there's no laziness there

Ed13:10:08

if instead of trying to print the result, you render it in the browser or something does the error go away?

Lycheese13:10:57

error when calling lifecycle function teapot.core/mount-components InternalError: too much recursion
    cljs$core$array_index_of_keyword_QMARK_ core.cljs:6581
    cljs$core$array_index_of core.cljs:6618
    cljs$core$array_map_index_of core.cljs:6631
    cljs$core$ILookup$_lookup$arity$3 core.cljs:6916
    cljs$core$ILookup$_lookup$arity$2 core.cljs:6913
    cljs$core$IFn$_invoke$arity$2 core.cljs:1955
    cljs$core$IFn$_invoke$arity$1 core.cljs:3331
    cljs$core$pr_writer core.cljs:10204
    print_prefix_map core.cljs:10327
env.cljs:198:78

Lycheese13:10:01

I'll try to reproduce this with less dependencies in a new project. Maybe it's a library in between.

Lycheese13:10:21

although if it fails in the repl

Lycheese13:10:24

hmm

Ed13:10:07

(defn build-tree [acc n]
  (if (pos? n)
    (recur {:val "val" :children [acc]} (dec n))
    acc))

(defn consume-tree
  ([t]
   (consume-tree t 0))
  ([t c]
   (if (:children t)
     (recur (first (:children t)) (inc c))
     c)))

(consume-tree (build-tree {:val "end"} 1000000))

If I run that at a cljs repl, it prints the number 1000000 ... which suggests to me that there's not too much recursion in generating and consuming the data structure. However,

cljs.user=> (build-tree {:val "end"} 1000000)
Execution error (RangeError) at (<cljs repl>:1).
Maximum call stack size exceeded

that suggests to me that the cljs printer is using recursion to print the nested data structure at the repl ... so I'd suggest the problem isn't in the code we've been talking about 😉

Lycheese13:10:23

And something inside my frontend stack uses the printer to produce html leading to the error.

Lycheese13:10:19

But someone else must have had this problem before, right? I can't be the only one working with deeply nested trees in cljs.

Ed13:10:28

I would think so ... maybe ask in #clojurescript???

Lycheese13:10:31

I'll ask in #clojurescript tomorrow how I should go about debugging this. I think I'm done for today, my head hurts. Thank you for your help!

Ed13:10:12

👍 ... I think you have a pretty minimal case ... good luck 😉

Max Deineko22:10:48

You got me interested 🙂. Please keep in mind that I'm a perpetual Clojure beginner, so everything I write here should be taken with caution. First, I cannot currently see how laziness would possibly avoid potential stack overflows here -- afaics as soon as you need to realize/process the tree structure the stack will be used just as without laziness, since there are no parts of lazy sequences one can possibly discard. So I started out with

(ns my.scratchpad
  (:require
   [clojure.pprint :as pp]
   [clojure.walk :as walk]))

(def small-tree
  {:val "Tree 1", :children [{:val "var1", :children
                              [{:val "nested1", :children []}
                               {:val "nested2" :children []}]}
                             {:val "var2", :children
                              [{:val "child of var2", :children []}]}]})

(defn build-tree [acc n]
  (if (pos? n)
    (recur {:val "val" :children [acc]} (dec n))
    acc))

(def big-tree (build-tree {:val "end"}  100000))

and a barebones tail recursive depth first search -- which would only visit and output the nodes without care for nested structure:

(defn dfs
  "Tail recursive depth first search.
  Value of `trees` represents stack of not yet visited (sub)trees."
  [out trees]
  (if (empty? trees) out
      (let [t    (peek trees)
            ts   (pop trees)
            v    (:val t)
            cs   (:children t)
            ts'  (into ts (reverse cs))
            out' (conj out v)]
        (recur out' ts'))))

(dfs '() [small-tree])
;; => ("child of var2" "var2" "nested2" "nested1" "var1" "Tree 1")

Now we only need to add the nesting info to above traversal and build the output accordingly:

(defn transform-tree
  "Depth first search with structure-preserving output construction."
  [tree]
  (letfn [(annotate [cs] (conj (into [:start] (map #(do {:tree %}) cs)) :end))
          (dfs [result stack]
            (if (empty? stack) result
                (let [e (peek stack)
                      stack' (pop stack)]
                  (cond (= e :start)
                        (recur (conj result [:ul]) stack')

                        (= e :end)
                        (let [[v2 v1] (into [] (take 2 result))]
                          (recur (conj (pop (pop result)) (conj v1 v2)) stack'))

                        :else
                        (let [t (:tree e)
                              cs (:children t)
                              result' (conj (pop result) (conj (peek result) [:li (:val t)]))
                              stack'' (if (empty? cs) stack' (into stack' (reverse (annotate cs))))]
                          (recur result' stack''))))))]
    (first
     (dfs '() (list :start {:tree tree})))))

(pp/pprint small-tree)
;; {:val "Tree 1",
;;  :children
;;  [{:val "var1",
;;    :children
;;    [{:val "nested1", :children []} {:val "nested2", :children []}]}
;;   {:val "var2", :children [{:val "child of var2", :children []}]}]}

(pp/pprint
 (transform-tree small-tree))
;; [:ul
;;  [:li "Tree 1"]
;;  [:ul
;;   [:li "var1"]
;;   [:ul [:li "nested1"] [:li "nested2"]]
;;   [:li "var2"]
;;   [:ul [:li "child of var2"]]]]

(time (let [_ (transform-tree big-tree)] nil))
;; "Elapsed time: 220.219634 msecs"

Afaics this builds the tree as needed. The problem now is that neither clojure's print nor pprint can process the big tree:

;; (pp/pprint (transform-tree big-tree))
;; StackOverflowError

so you'd probably need to roll your own as well, as in e.g.

(defn tree->string
  "Don't build it, print it."
  [tree]
  (letfn [(annotate [cs] (conj (into [:start] (map #(do {:tree %}) cs)) :end))
          (dfs [result stack]
            (if (empty? stack) result
                (let [e (peek stack)
                      stack' (pop stack)]
                  (cond (= e :start)
                        (recur (str result "[:ul") stack')

                        (= e :end)
                        (recur (str result "]") stack')

                        :else
                        (let [t (:tree e)
                              cs (:children t)
                              result' (str result "[:li " (:val t) "]")
                              stack'' (if (empty? cs) stack' (into stack' (reverse (annotate cs))))]
                          (recur result' stack''))))))]
    (dfs ""  (list :start {:tree tree} :end))))

(println
 (tree->string small-tree))
;; [:ul[:li Tree 1][:ul[:li var1][:ul[:li nested1][:li nested2]][:li var2][:ul[:li child of var2]]]]

(comment
  (time
   (count (tree->string big-tree))))
;; => 1400014
;; Elapsed time: 75058.648122 msecs

As far as I can see clojure.walk also uses call stack for recursion

(walk/postwalk (fn [_] {}) small-tree)
;; => {}

;; (walk/postwalk (fn [_] {}) big-tree)
;; StackOverflowError

This was at one point also my (limited) experience with most popular clojure json libraries, which choked on deeply nested structures. While I have no experience with clojurescript, my takeaway for now would generally be that if you really care about possible breakage on deeply recursive data in clojure & friends, you'll need to test for it and expect that not only your code will fail but possibly common libraries as well. Now I'm in no way an expert on parsing or tree manipulation, so maybe there are very elegant and/or powerful solutions to the problem which are not subject to clojure's call stack limitation; it's also possible that I made some horrible mistakes above. In any case I'd be delighted to learn more on the topic.
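One note on the tree->string timing above: the long run time is probably due to `str` copying the whole accumulated string on every step, rather than to the traversal itself. A possible variant, assuming JVM Clojure, appends to a single `java.lang.StringBuilder` instead. `tree->string-sb` is a hypothetical name, and this is a sketch rather than a measured improvement.

```clojure
(def small-tree
  {:val "Tree 1", :children [{:val "var1", :children
                              [{:val "nested1", :children []}
                               {:val "nested2" :children []}]}
                             {:val "var2", :children
                              [{:val "child of var2", :children []}]}]})

(defn tree->string-sb
  "Same explicit-stack traversal as tree->string, but output goes to one
  mutable StringBuilder. Hypothetical helper, JVM only."
  [tree]
  (let [sb (StringBuilder.)
        annotate (fn [cs] (conj (into [:start] (map #(do {:tree %}) cs)) :end))]
    (loop [stack (list :start {:tree tree} :end)]
      (if (empty? stack)
        (str sb)
        (let [e (peek stack)
              stack' (pop stack)]
          (cond
            (= e :start) (do (.append sb "[:ul") (recur stack'))
            (= e :end)   (do (.append sb "]") (recur stack'))
            :else
            (let [t (:tree e)
                  cs (:children t)]
              (.append sb "[:li ")
              (.append sb (str (:val t)))
              (.append sb "]")
              ;; Children are pushed wrapped in :start/:end markers.
              (recur (if (empty? cs)
                       stack'
                       (into stack' (reverse (annotate cs))))))))))))

(tree->string-sb small-tree)
;; => "[:ul[:li Tree 1][:ul[:li var1][:ul[:li nested1][:li nested2]][:li var2][:ul[:li child of var2]]]]"
```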

Ed10:10:16

> afaics as soon as you need to realize/process the tree structure the stack will be used just as without lazyness, since there are no parts of lazy sequences one can possibly discard. When we call render-tree, we get a vector back containing [:ul [:li "something"] *lazy-seq*] where the third element is a lazily realised thing. render-tree is no longer on the stack, so when we try to realise the third element, it doesn't matter that it calls render-tree again because we've already returned from the previous call. You could prove that by printing the stack trace every time you call render-tree.

(defn build-tree [acc n]
  (if (pos? n)
    (recur {:val "val" :children [acc]} (dec n))
    acc))

(defn render-tree [{:keys [val children]}]
  (.printStackTrace (Exception. ""))
  [:ul
   [:li val]
   (map render-tree children)])

(defn consume-rendered-tree
  ([t]
   (consume-rendered-tree t 0))
  ([t c]
   (if (nth t 2)
     (recur (first (nth t 2)) (inc c))
     c)))

(consume-rendered-tree (render-tree (build-tree {:val "end"} 3)))

The relevant bit of the stack trace is

java.lang.Exception: 
	at user$render_tree.invokeStatic(NO_SOURCE_FILE:79)
	at user$render_tree.invoke(NO_SOURCE_FILE:79)
	at clojure.core$map$fn__5866.invoke(core.clj:2753)
	at clojure.lang.LazySeq.sval(LazySeq.java:42)
	at clojure.lang.LazySeq.seq(LazySeq.java:51)
	at clojure.lang.LazySeq.first(LazySeq.java:73)
	at clojure.lang.RT.first(RT.java:692)
	at clojure.core$first__5384.invokeStatic(core.clj:55)
	at clojure.core$first__5384.invoke(core.clj:55)
	at user$consume_rendered_tree.invokeStatic(NO_SOURCE_FILE:90)
	at user$consume_rendered_tree.invoke(NO_SOURCE_FILE:85)
	at user$consume_rendered_tree.invokeStatic(NO_SOURCE_FILE:87)
	at user$consume_rendered_tree.invoke(NO_SOURCE_FILE:85)

where you see only one call to render-tree as it's realising the lazy seq. Does that make sense?

Max Deineko14:10:25

@U0P0TMEFJ I should have said «there are no parts of lazy sequences one can possibly discard in general» -- consume-rendered-tree above is very specific in that it goes down one tree branch without caring about the rest. If we need a more general consumer -- say, if we wanted to print the tree (as described in original post or as xml structure), concatenate all contained values etc -- then laziness does not give us any benefit, the evaluation will still overflow the stack if applied to the nested lazy structure naively. But maybe we're solving different problems: @U02FM0NNZAB note that

[:ul
 [:li "Tree 1"]
 [:ul
  [:li "var1"]
  [:ul
   [:li "nested1"]
   [:li "nested2"]]
  [:li "var2"]
  [:ul
   [:li "child of var2"]]]]

and what

(defn render-tree [{:keys [val children]}]
  [:ul
   [:li val]
   (for [child children]
     (render-tree child))])

(or the lazy map variant) yields are not the same. Now, just computing render-tree will not result in a stack overflow. But if you want to process the tree structure further, generally laziness will not help with the particular problem of stack exhaustion afaics -- for example, if you wanted to print the structure as in the former snippet -- you'll need to move the nesting from the stack to memory somehow, e.g. like in tail recursive tree traversal.
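To make the difference concrete, here is a sketch of a plainly recursive renderer that does produce the "Output" shape from the original post, with sibling items sharing one :ul. It uses the call stack, so it has the same depth limit discussed in this thread; `render-nodes` is a hypothetical name.

```clojure
(defn render-nodes
  "Renders a seq of sibling nodes into a single [:ul ...] vector. Each node
  contributes its [:li ...] and, if it has children, one nested [:ul ...]."
  [nodes]
  (into [:ul]
        (mapcat (fn [{:keys [val children]}]
                  (cond-> [[:li val]]
                    (seq children) (conj (render-nodes children)))))
        nodes))

(def small-tree
  {:val "Tree 1", :children [{:val "var1", :children
                              [{:val "nested1", :children []}
                               {:val "nested2" :children []}]}
                             {:val "var2", :children
                              [{:val "child of var2", :children []}]}]})

(render-nodes [small-tree])
;; => [:ul
;;     [:li "Tree 1"]
;;     [:ul
;;      [:li "var1"]
;;      [:ul [:li "nested1"] [:li "nested2"]]
;;      [:li "var2"]
;;      [:ul [:li "child of var2"]]]]
```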

Ed15:10:18

ah yes ... right ... getcha ... 😉 ... I think I failed to understand that you were talking purely about the rendering part of the problem ... apologies ... I think I read the original question as "generating this data structure gets me a stack overflow" and I think that's not true, it's consuming that data structure that creates the stack overflow, and I think your point is that the only sensible way to consume such a data structure is recursive and that seems fair enough 😉

Max Deineko17:10:42

@U0P0TMEFJ yes, above I compute "Output" from "Input" of the original post avoiding possible stack overflow caused by recursion -- the potential overflow problem stays the same whether we compute "Output" from "Input" or from "current solution". Hope it helped @U02FM0NNZAB 🙂

👍 1

Lycheese09:10:34

I sadly don't have time right now to continue working on this particular problem, but thank you for your input. I'll save it for later when I have time for it. (For now I just imposed a limit on the depth of the nesting to keep the site from crashing)

👍 1

🤞 1

Benjamin13:10:55

how do I escape newlines in format like this? use case is to have small lines in code but the format string should be 1 line

(format "lorem lorem lorem lorem lorem lorem
lorem lorem lorem lorem lorem ")

Martin Půda14:10:29

(clojure.pprint/cl-format nil "lorem lorem ~
lorem lorem ~
lorem lorem")

Mno14:10:55

or uuhh

(format (str "lorem lorem lorem lorem lorem " 
             "lorem "
             "lorem lorem lorem lorem"))

☝️ 1

Mno14:10:12

as long as you don't mind the str conversion.
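
A quick sketch comparing the two suggestions above. Both produce a single-line string; the `%s` argument and the name `who` are added purely for illustration.

```clojure
(require '[clojure.pprint :refer [cl-format]])

;; str joins the pieces at runtime, then format fills in the arguments.
(def who "world") ; hypothetical argument, not from the thread
(def via-str
  (format (str "lorem lorem lorem "
               "hello %s "
               "lorem lorem")
          who))
;; => "lorem lorem lorem hello world lorem lorem"

;; In cl-format, a tilde directly before a newline drops the newline
;; and any spaces or tabs that start the next line.
(def via-cl-format
  (cl-format nil "lorem lorem ~
                  lorem lorem ~
                  lorem lorem"))
```

Note that cl-format uses Common Lisp style directives such as `~a`, so `%s` placeholders from `format` do not carry over unchanged.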

Benjamin15:10:15

ok cool

agigao16:10:35

trying to read .edn with #time/instant data in it, receiving No reader function for tag time/instant , any ideas fellow Clojurians?

Apple16:10:48

Do both clojure.core/read and clojure.edn/read fail?

Darin Douglass16:10:32

the built-in tagged literal for dates is #inst, which maps to java.util.Date. #time/instant must be provided by another library that’s not on your classpath

agigao16:10:37

Yes, it’s converted using tick.alpha.api/instant

agigao16:10:45

core/read fails as well

hiredman16:10:01

read and edn/read both provide a way to handle unknown tagged readers, I forget the details, check their doc strings

Apple16:10:26

https://clojure.org/reference/reader#tagged_literals

Apple16:10:33

Reader tags without namespace qualifiers are reserved for Clojure. Default reader tags are defined in https://clojure.github.io/clojure/clojure.core-api.html#clojure.core/default-data-readers but may be overridden in `data_readers.clj` or by rebinding https://clojure.github.io/clojure/clojure.core-api.html#clojure.core/%2Adata-readers%2A. If no data reader is found for a tag, the function bound in https://clojure.github.io/clojure/clojure.core-api.html#clojure.core/%2Adefault-data-reader-fn%2A will be invoked with the tag and value to produce a value. If `*default-data-reader-fn*` is nil (the default), a RuntimeException will be thrown.
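
A minimal sketch of the fallback described in the quoted docs: binding `*default-data-reader-fn*` to `tagged-literal` (a `clojure.core` function) lets the reader keep an unknown tag as data instead of throwing. The sample string is an assumption about what such a file might contain.

```clojure
;; With no reader registered for time/instant, the default fn receives
;; the tag and the value and wraps them in a TaggedLiteral.
(def unknown
  (binding [*default-data-reader-fn* tagged-literal]
    (read-string "#time/instant \"2021-10-19T16:00:00Z\"")))

(:tag unknown)  ;; => time/instant
(:form unknown) ;; => "2021-10-19T16:00:00Z"
```

`clojure.edn/read-string` accepts the same kind of two-argument function under the `:default` key of its options map.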

Apple16:10:41

data_readers.clj

agigao16:10:20

Solved with a custom reader and read-string instead of edn/read-string

(binding [*data-readers* {'time/instant t/instant}]
    (read-string ...))
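
For comparison, `clojure.edn/read-string` can also take custom readers, passed in an options map rather than through `*data-readers*`. A sketch, assuming the value is serialized as an ISO-8601 string; `java.time.Instant/parse` stands in for `t/instant` here so the snippet runs without tick on the classpath, and the `:created` key is made up.

```clojure
(require '[clojure.edn :as edn])

;; Map from tag symbol to a one-argument function of the tagged value.
(def readers
  {'time/instant #(java.time.Instant/parse %)})

(def parsed
  (edn/read-string {:readers readers}
                   "{:created #time/instant \"2021-10-19T16:00:00Z\"}"))

(class (:created parsed))
;; => java.time.Instant
```

Staying with the EDN reader is usually the safer choice for untrusted input, since `clojure.core/read-string` can execute code while `*read-eval*` is true (its default).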

agigao16:10:47

Thanks guys!

hiredman17:10:03

https://github.com/henryw374/time-literals gross

henryw37420:11:08

What's gross?

hiredman20:11:12

Using tag literals for type specific stuff, the way it just sort of exposes the java.time types as tag literals

hiredman20:11:49

Instant should absolutely just use the inst tag

hiredman20:11:51

I think the case could be made that some of the zoned stuff should as well, but it is admittedly murky there

hiredman20:11:06

The reader functions appear to be bad as well, they return forms that eval to the data structure instead of the data structure, so don't work entirely in the reader and need to pass through eval to work
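
To make the distinction concrete, a hedged sketch (not the library's actual code): one reader function returns the value itself, the other returns a form that only becomes the value after `eval`. Both function names are hypothetical.

```clojure
(require '[clojure.edn :as edn])

;; Returns the data structure directly.
(defn value-reader [s] (java.time.Instant/parse s))

;; Returns a form (a list) that would evaluate to the data structure.
(defn form-reader [s] (list 'java.time.Instant/parse s))

(def input "#time/instant \"2021-10-19T16:00:00Z\"")

(def as-value (edn/read-string {:readers {'time/instant value-reader}} input))
(def as-form  (edn/read-string {:readers {'time/instant form-reader}} input))

(instance? java.time.Instant as-value)       ;; => true
(seq? as-form)                               ;; => true: a list, not an Instant
(instance? java.time.Instant (eval as-form)) ;; => true, but only after eval
```

When such a literal appears in source code the compiler evaluates the returned form, so the difference is hidden; with plain reading of data, nothing calls `eval` and the caller is left holding the list.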

hiredman20:11:01

Which is the root cause of this bug https://github.com/henryw374/time-literals/issues/3

henryw37420:11:54

Agree there is some overlap, but just for instant vs date. And even then one might want to distinguish.

hiredman20:11:25

Outside of being broken for pure serialization, you'll get weird results in things like macros (which are a thorny issue with non-EDN literals anyway)

henryw37420:11:36

The readme talks about the problem... A bug in cljs.

henryw37420:11:05

Which I have a patch for.. but haven't managed to get much interest in

hiredman20:11:38

Does that answer the question of why I called it gross?

henryw37420:11:57

The alternative is to have 2 artifacts for this lib. One for jvm and one for cljs

hiredman20:11:52

It seems like you are trying to convince me it isn't gross

henryw37420:11:58

So your point re serialisation is valid

henryw37420:11:54

Meaning, there is an issue that should be fixed

hiredman20:11:28

At the time I said gross I hadn't even really seen any of that

henryw37420:11:53

Ok so just having some overlap between instant and date tag renders whole library gross

hiredman20:11:32

I called it gross pretty much entirely based on the fact it is basically exposing type specific tags, instead of creating / reusing tags based on the meaning of the information (using inst)

henryw37420:11:22

Haha ok!

agigao17:10:09

@eulered Greetings!

edx8617:10:24

Hello Giga 😀

Sebastian Allard20:10:32

Are there any web-based data browsers? Like REBL but starting an HTTP server instead. I found a project called https://github.com/lilactown/punk which seems to be what I'm looking for, but it is not actively maintained. Do you know any alternatives?

R.A. Porter20:10:38

Portal might fit your needs - https://github.com/djblue/portal

👍 3

seancorfield21:10:52

@U02HNEY0CPN I used to use REBL all the time, then switched to Reveal, but now I use Portal all the time for this. And if you're using VS Code, there's a Portal extension for it so you can run it inside VS Code in a webview so you don't need an external browser window.

👍 2
